The invention relates to optical readers in general and specifically to a method for processing images including recognizable characters.
In spite of perpetually increasing processing speeds, the need for optical character recognition (OCR) readers that are programmed to efficiently search captured image data for recognizable indicia and to recognize such indicia has not diminished.
The increase in processing speeds has been accompanied by multiplication of the resolution of imaging arrays, and a corresponding multiplication of the number of processing operations required to process images captured by optical readers. The availability of higher processing speeds and higher resolution imaging systems, in turn, has encouraged users of OCR readers to develop OCR applications requiring an ever increasing number of processing operations.
In addition to being too slow for their designated applications, OCR readers have been observed to be poorly equipped to recognize characters obliquely oriented in a captured image. Existing optical readers generally require a specific orientation of a reader relative to a character during image capture for efficient recognition of a character.
There is a need for a faster OCR reader which provides omnidirectional character recognition.
According to its major aspects and broadly stated, the invention is a method for omnidirectional recognition of recognizable characters in a captured two-dimensional image.
An optical reader configured in accordance with the invention searches for pixel groupings along paths in a starburst pattern, and subjects each located pixel grouping to a preliminary edge crawling process which records the count of edge pixels and records the pixel positions of the grouping's edge so that the size and center of the grouping can be estimated. If two similar-sized pixel groupings are located that are of sizes sufficient to potentially represent recognizable characters, then the reader launches “alignment rails” at pixel positions substantially parallel to a center line connecting the center points of the two similarly sized groupings. The alignment rails define an area within the image likely to include a plurality of recognizable characters of a linear string of characters. The presence of clear areas above and below a pair of similarly and sufficiently sized pixel groupings extending at least the length of a minimal sequence of OCR characters indicates a likelihood that the pair of pixel groupings belong to a linear string of OCR characters.
A reader according to the invention, therefore, searches for recognizable characters along the rail area centerline, and subjects pixel groupings within the rail area to a shape-characterizing edge crawling process for developing data that characterizes the shape of a pixel grouping's edge. Prior to comparison of the shape-characterizing data with stored reference data, the reader adjusts the orientation representation of the developed data by an offset orientation value determined by the orientation of the rails. For pixel groupings within the rail area, the reader compares the developed shape-characterizing data to previously stored reference shape-characterizing data to determine the character represented by the grouping on the basis of the best fit data. These and other details, advantages and benefits of the present invention will become apparent from the detailed description of the preferred embodiment herein below.
a is a flow diagram representing the ordering of operations carried out by an OCR optical reader according to the invention;
b is partial image map illustrating a starburst search pattern;
c is an image map shown as being divided into tiles;
d-1e are diagrams for illustrating a possible binarization process which may be utilized with the invention;
f-1g are diagrams for illustrating a possible threshold determining process which may be utilized with the invention;
a is a block diagram of an exemplary optical reader that may be configured according to the invention;
b-2h illustrate various types of optical readers in which the invention may be incorporated;
a-3g are partial image maps illustrating an undecodable pixel grouping and a decodable pixel grouping as may be subjected to image data processing according to the invention;
a-4d illustrate an example of an image including recognizable characters which may be processed according to the invention;
a illustrates an example of a data format for pixel grouping shape-characterizing data, which may be developed according to the invention;
b is an example of a matrix of stored reference data;
c illustrates a reference character from which stored reference data may be developed.
A flow diagram of the main character recognition algorithm of the present invention for locating and recognizing recognizable characters is shown in
If a prior similar-sized globule is located, then the reader at block 24 launches “alignment rails” substantially parallel to a center line connecting the center points of the similar-sized globules. The term “substantially parallel” herein encompasses relative positions that are in fact parallel. The launched alignment rails are useful for identifying additional characters of a linear string of characters in a captured image and for establishing an orientation of the character string. At block 26, the reader polls pixels along the center line starting at the left bound edge of the area defined by the alignment rails until a pixel grouping in the rail area is located at block 28.
When a pixel grouping within the rail area is located, the reader at block 30 subjects the pixel grouping to a shape determining edge crawling process. In a shape-characterizing edge crawling process, the reader develops data that characterizes the shape of a pixel grouping's edge and processes the shape-characterizing data into a form so that the data can be compared to stored reference data. Importantly, if the rails are obliquely oriented, the reader adjusts the orientation representation of the shape-characterizing data by an offset value determined by the orientation of the rails.
The reader at block 32 then compares the developed shape-characterizing data for the present globule to stored shape-characterizing data for several characters to determine the character represented by the globule by selecting the best fit data of the stored shape-characterizing data database. The reader continues to poll pixels along the rail area center line and continues to attempt to recognize characters represented by located pixel groupings until at block 34 the reader determines that the end of the rail area has been reached.
A block diagram of an exemplary optical reader which may be employed to carry out the invention is shown in
Optical reader 110 includes an illumination assembly 120 for illuminating a target object T, such as a 1D or 2D bar code symbol, and an imaging assembly 130 for receiving an image of object T and generating an electrical output signal indicative of the data optically encoded therein. Illumination assembly 120 may, for example, include an illumination source assembly 122, such as one or more LEDs, together with an illuminating optics assembly 124, such as one or more lenses, diffusers, wedges, and reflectors or a combination of such elements for directing light from light source 122 in the direction of target object T. Illumination assembly 120 may include target illumination and optics for projecting an aiming pattern 127 on target T. Illumination assembly 20 may comprise, for example, laser or light emitting diodes (LEDs) such as white LEDs or red LEDs. Illumination assembly 120 may be eliminated if ambient light levels are certain to be high enough to allow high quality images of object T to be taken. Imaging assembly 130 may include an image sensor 132, such as a 1D or 2D CCD, CMOS, NMOS, PMOS, CID OR CMD solid state image sensor, together with an imaging optics assembly 134 for receiving and focusing an image of object T onto image sensor 132. The array-based imaging assembly shown in
Optical reader 110 of
More particularly, processor 142 is preferably a general purpose, off-the-shelf VLSI integrated circuit microprocessor which has overall control of the circuitry of
The actual division of labor between processors 142 and 144 will naturally depend on the type of off-the-shelf microprocessors that are available, the type of image sensor which is used, the rate at which image data is output by imaging assembly 130, etc. There is nothing in principle, however, that requires that any particular division of labor be made between processors 142 and 144, or even that such a division be made at all. This is because special purpose processor 144 may be eliminated entirely if general purpose processor 142 is fast enough and powerful enough to perform all of the functions contemplated by the present invention. It will, therefore, be understood that neither the number of processors used, nor the division of labor there between, is of any fundamental significance for purposes of the present invention.
With processor architectures of the type shown in
Processor 144 is preferably devoted primarily to controlling the image acquisition process, the A/D conversion process and the storage of image data, including the ability to access memories 146 and 147 via a DMA channel Processor 144 may also perform many timing and communication operations. Processor 144 may, for example, control the illumination of LEDs 122, the timing of image sensor 132 and an analog-to-digital (A/D) converter 136, the transmission and reception of data to and from a processor external to reader 110, through an RS-232, a network such as an Ethernet, a serial bus such as USB, a wireless communication link (or other) compatible I/O interface 137. Processor 144 may also control the outputting of user perceptible data via an output device 138, such as a beeper, a good read LED and/or a display monitor which may be provided by a liquid crystal display such as display 182. Control of output, display and I/O functions may also be shared between processors 142 and 144, as suggested by bus driver I/O and output/display devices 137′ and 138′ or may be duplicated, as suggested by microprocessor serial I/O ports 142A and 142B and I/O and display devices 137″ and 138′. As explained earlier, the specifics of this division of labor is of no significance to the present invention.
b through 2g show examples of types of housings in which the present invention may be incorporated.
In addition to the above elements, readers 110-2 and 110-3 each include a display 182 for displaying information to a user and a keyboard 178 for enabling a user to input commands and data into the reader.
Any one of the readers described with reference to
As will become clear from the ensuing description, the invention need not be incorporated in a portable optical reader. The invention may also be incorporated, for example, in association with a control circuit for controlling a non-portable fixed mount imaging assembly that captures image data representing image information formed on articles transported by an assembly line, or manually transported across a checkout counter at a retail point of sale location.
Referring now to aspects of the recognition algorithm of the invention in greater detail, a starburst search process is more fully explained with reference to
As illustrated in
Searching for pixel groupings in a starburst pattern terminates temporarily when a first pixel grouping is found. The term “pixel grouping” herein refers to a grouping of one or more adjacent (including diagonally adjacent) like valued pixels, typically dark pixels. In order for a “dark” pixel to be recognized in an image, there must be a prior determination of what constitutes a dark pixel. Prior determination of what constitutes a “dark” pixel is normally carried out by binarization of at least a part of an image; that is, representation of a “dark” pixel by a one bit logic “0” value, and representation of a “light” pixel by a one bit logic “1” value (although the opposite convention can be adopted). It will be understood that image binarization referred to herein may be carried by a variety of different methods. For example, an image may be binarized by capturing a binary image comprising one bit “1” or “0” pixel values directly into memory 145 when an initial image map is captured. Alternatively, a grey scale image map may be captured into memory 145 and the pixel values therein may subsequently be converted into one bit binary values. An entire frame of grey scale values may be subjected to binarization prior to searching for indicia represented in the captured image, or else individual pixels of the image map may be binarized “on demand” when they are analyzed.
In addition, pixels of a grey scale image map may be binarized according to a “tile-binarization” process that is described with reference to
According to the tile binarization process, control circuit 140 does not binarize an entire image map prior to searching for recognizable indicia. Instead, in a tile binarization process, control circuit 140 binarizes a tile of an image map only when at least one pixel of the tile is needed for image data analysis. For example, at the time starburst searching commences, control circuit 140 may binarize only center tiles T150, and T151, as indicated by
In addition to binarizing tiles during starburst searching, control circuit 140 may binarize new tiles during other image data processing steps such as during an edge crawling step, to be described more fully hereinbelow, or during an alignment rail area search for recognizable indicia to be described more fully hereinbelow. Control circuit 140 may binarize a new tile when control circuit 140 is required to read a first pixel of the new tile or when control circuit 140 reads a previously binarized pixel value in positional proximity with the new tile. For example, control circuit 140 may binarize a new tile when control circuit 140 reads a pixel value corresponding to a pixel that borders the new tile.
In another aspect of a tile binarization process, the particular type of binarization process executed by control circuit 140 in binarizing a tile of pixel values may be made to vary depending on the type of image data processing presently being carried out by control circuit 140. In one embodiment of the invention, control circuit 140 is made to binarize new tiles as is necessary according to a low resolution binarization process when searching for pixel groupings, or when subjecting a pixel grouping to an edge-length determining edge crawling process to be described more fully herein and to binarize tiles of pixels as is necessary according to a high resolution binarization process when subjecting a pixel grouping to a shape-characterizing edge crawling process as will be described herein.
In a low resolution binarization process, control circuit 140 converts each grey scale value of a tile into a binary “1” or “0” value. In a high resolution binarization process described with reference to
In the above-described embodiment it will be recognized that control circuit 140 may binarize certain tiles according to a high resolution binarization process during execution of shape-characterizing edge crawl which have previously been binarized according to a low resolution binarization process (during searching or length determining edge crawling).
If control circuit 140 may construct both low resolution and high resolution binarized image maps corresponding to the same position of a grey scale image map, then control circuit 140 may store both of these binary representations into memory 145 in a manner such that certain cells of the memory store bits corresponding to both of the low and high resolution binary representations. It is seen with reference again to
Threshold values for use in binarizing grey scale image data may be developed utilizing a variety of different methodologies. Threshold values may be predetermined based on known illumination or exposure conditions. Threshold values may also be based on grey scale values of a threshold-determining frame of image data, which is typically the frame of image data being processed when the reader is of a type adapted for used in variable illumination conditions.
In calculating threshold values based on present or recently captured image data, control circuit 140 may consider every pixel of an image map. However, for increased processing speed, control circuit 140 may be adapted to sample a limited number of threshold-determining pixel values (such as 1/256 of pixels of the entire image map) at substantially evenly spaced apart pixel position for use in determining a similar number of threshold values for an image map (in a tile-binarization scheme implemented with a typically sized image map, this number of thresholds would result in a limited number, for example 4, threshold values being calculated for each tile). This set of grey scale values may be referred to as a sample array of threshold-determining values.
Preferably, the threshold value for use in binarizing a grey scale value at a given pixel position takes into consideration grey scale values of pixels of the threshold-determining frame in positional proximity to the given pixel position preferentially to grey scale values to pixel positions not in positional proximity with the given pixel.
Skilled artisans will recognize that numerous alternative methods are possible for ensuring that a threshold value at a given pixel position depends preferentially on pixel values of neighboring pixels. According to one method for developing threshold values that depend preferentially on the value of neighboring pixels, control circuit 140 may develop the threshold value at each pixel position of a threshold determining image map by calculating the average of the grey scale value at that pixel and of a predetermined arrangement of surrounding pixels. Likewise, control circuit 140 may develop a threshold value for a group of pixels corresponding to a given position in a sample array of threshold determining values by averaging the threshold determining value at the given position and threshold-determining values at positions surrounding the given position.
Another method for determining threshold values that depend preferentially on grey scale values of neighboring pixels is described with reference to
If p0 in the image map of
In executing a length determining edge crawling process (block 16), control circuit 140 counts the number of the pixel grouping's edge pixels and records the position of each edge pixel. In the specific example described, edge pixels are considered the light pixels that border the groupings dark pixels.
In order to count a sequence of edge pixels, control circuit 140 must first establish an initial traveling direction, and then follow a set of edge crawling rules. A possible set of rules for establishing an initial traveling direction is as follows:
(E) else edge crawling fails.
Edge crawling fails under condition (E) if there is no dark pixel neighboring the first edge pixel (violating the starting condition) or if the first edge pixel is completely surrounded by dark pixels.
In the example of
As control circuit 140 determines the position of each new edge pixel, control circuit 140 records the pixel position of the edge pixel into memory 145 and increments an edge pixel counter.
It should be highlighted that the above moves are made relative to the current traveling direction. If having reached the present position by traveling South for example, then the order of checking is (1) is West light, then turn and move West, else (2) is South light, then move South, else (3) is East light, then turn and move East, else (4) turn and move North. Because the above edge crawling rules result in the direction of crawling advancing rightward of the present direction whenever there is a light pixel neighboring the present edge pixel, they may be referred to herein as “right” edge crawling rules. “Left” edge crawling rules can also readily be developed, as will be explained herein.
Following the above edge crawling rules, control circuit 140 determines the sequence of edge pixels for pixel grouping G1 to be: eG10, eG11, eG12 . . . eG125 as is illustrated in
Referring again to the main recognition algorithm flow diagram of
Continuing to search for pixel groupings in a starburst pattern, the next dark pixel encountered by control circuit 140 following the starburst pattern of
The next pixel grouping that is found pursuant to starburst searching is pixel grouping G3. Following the above traveling direction initialization and edge crawling rules, control circuit 140 at block 16 determines the edge pixels of grouping G3 to be edge pixels eG30, eG31, eG32 . . . eG3171 as is indicated in
Control circuit 140 may determine at block 20 whether the sizes of globules G3 and G2 are similar by monitoring the highest and lowest x-axis positions and the highest and lowest y-axis positions of the globule. For example, control circuit 140 may estimate the size of a globule according to the formula:
ESIZEGLOBULE=(X(hi)−X(lo))+(Y(hi)−Y(lo)) eq. 1
Further, control circuit 140 may determine whether the sizes of two globules are similar on the basis of whether the size estimations of the two globules as estimated by eq. 1 are within a predetermined percent age, such as +−12.5 percent. Employing eq. 1 to estimate the size of globule G2, and globule G3, respectively, then
ESIZEG2=(eG269−eG222)+(eG262−eG220)=19+15=34
ESIZEG3=(eG370−eG35)+(eG362−eG3121)=36+43=77
Because the sizes of globules G2 and G3 do not correlate with one another according to the described criteria, control circuit 140, after executing block 20 (size comparison) proceeds again to block 12 to continue polling pixels according to a starburst search pattern.
The next pixel grouping found by control circuit 140 by searching for dark pixels in a starburst pattern is pixel grouping G4 corresponding to the recognizable character “E”. Applying the above traveling direction initialization and edge crawling rules, control circuit 140 determines that the edge pixels of grouping G4 are edge pixels eG40, eG41, eG42 . . . eG4251 as is indicated in
Applying eq. 1 to estimate the size of globule G4, then
ESIZEG4=(eG4112−eG428)+(eG481−eG4248)=39+47=86
Comparing the estimate size of globule G4 to that of globule G3 control circuit 140 confirms at block 20 that the sizes of the two globules G4 and G3 (ESIZEG3=77 and ESIZEG4=86) correlate with one another within the described predetermined tolerance (selected in the exemplary embodiment to be ±12.5%). Having determined that the sizes of two globules correlate, control circuit 140 proceeds to block 24 at which control circuit 140 launches alignment rails at positions determined by characteristics of the similar-sized detected rails.
Alignment rails 50 and 52 as described with reference to
As illustrated with reference to
A scene imaged by an OCR reader will often contain several rows of printed character strings. It can be seen that if control circuit 140 launches rails 50 and 52 pursuant to locating similarly-sized pixel groupings corresponding to characters from different rows, rails 50 and 52 will likely encroach upon pixel groupings corresponding to characters of the different rows.
In one embodiment control circuit 140 is configured so that if a rail 50 encroaches upon a pixel grouping, control circuit 140 makes no attempt to recognize characters within the present rail area. In this embodiment, control circuit proceeds to block 12 to poll pixels in a starburst pattern if during execution of block 24 (rail launch) a rail encroaches upon a pixel grouping, e.g. grouping G7. In an alternative embodiment, control circuit 140 is configured so that the encroachment of a rail upon a pixel grouping results in a truncation of the rail area, but does not preclude the control circuit's searching for recognizable data within the rail area. That is, control circuit may be configured to establish a rail border B perpendicular to rail 52 at the point T where rail 52 encroaches upon pixel grouping.
In yet another embodiment, control circuit 140 is configured so that if the length of a rail, 50 and 52, is below a minimally sufficient length which is preferably determined relative to the edge lengths of located pixel groupings, then control circuit 140 makes no attempt to recognize characters in the present rail area and instead proceeds to block 12 to poll pixels in a starburst pattern. For example, control circuit 12 can be made to proceed to block 12 after a rail launch if the length of rail 50 or 52 is not greater than the average edge length of similarly sized globules located at block 20. In the case that both rails are above a minimally sufficient length but one of the rails nevertheless encroaches on a pixel grouping such as pixel grouping G7, control circuit 140 preferably truncates the rail area by defining a new border B for the rail area as described previously in connection with
Rails 50 and 52 may be launched in accordance with a simple mathematical formula. In one embodiment, rails 50 and 52 are a series of pixel positions at a perpendicular oriented spacing distance, S, from center 51. S may be calculated according to the formula:
S=(ESIZEA,+ESIZEB)/5 eq. 2
Launching rails at distances from center line 51 according to eq. 2 assures that rails 50 and 52 satisfy the requirements of extending through pixel positions which are proximate recognizable characters, but if well aligned relative to a character string are not likely to encroach upon recognizable characters of a linearly arranged string of recognizable characters. Preferably, control circuit 140 grows rails in first and second directions, d1 and d2, starting from a position r0 along a globule's vertical centerline. The center points, G3 and G4 illustrated in the image representation of
C[x,y]=[(AVG(x(hi)+x(lo)),AVG((x(hi)+x(lo))] eq. 3
After rails 50 and 52 are launched, control circuit 140 attempts to recognize characters defined within the boundaries of rails 51, 52 and border b or border B. More particularly, with rails established, control circuit 140 begins searching for additional dark pixel globules at block 26 by polling pixels along center line 52 beginning at the left edge L of the area defined by rails 50, 52 and border b, or border B.
In the example of
Once an initial traveling direction value for shape-characterizing edge crawling is established, the direction of travel about an edge, and the determination of a traveling direction value for each new edge pixel, may be governed by the following shape determining edge crawling rules:
Certain features and advantages of the shape-characterizing edge crawling process as well as methods by which the process can be varied can be easily appreciated with reference to
A first observation, as best seen by the plotted traveling direction values of
Any edge crawl that is not terminated by a termination condition (e.g. the border of an image being reached) will eventually result in a traveling direction value being determined for initial edge pixel, e0, a second time. In the example of
While the traveling direction value, D, discussed thus far characterizes the direction of travel between two pixels along a prescribed path and therefore the relative positioning between two pixel positions, the shape of an edge of a pixel grouping comprising a plurality of pixels can be characterized by high resolution traveling direction values, DH. Control circuit 140 calculates a high resolution traveling direction value DH, by summing N successive traveling direction values, D.
If N is selected to be 4, for example, then control circuit 140 may commence calculation of a running sum of pixel values when a travel direction value, D, is calculated for the 4th edge pixel, E3, as is indicated by the Table 1. The running sum of the directional values may be referred to as a high resolution traveling directional value, DH, since the value represents the general direction of an edge over a succession of edge pixels. When a traveling direction value, D is calculated for each new edge pixel starting with the N=4 edge pixel, the high resolution traveling directional value, DH, is updated. Because DH at any edge pixel, Ex, can be calculated according to the formula DH(Ex)=DH(Ex−1)+D(Ex)−D(Ex−N) then maintaining a running sum of traveling direction values requires minimal processing time.
More particular benefits of maintaining a running sum of traveling direction values are discussed in greater detail with reference to
Increasing the number, N, of traveling direction values summed in a running sum of traveling directional values operates to improve the hiding of irregularities or “bumps” along edges that are generally straight, but results in the recorded DH values characterizing corners of a pixel grouping as rounded edges.
An example illustrating one method by which control circuit 140 at block 30 can apply the above edge crawling rules to develop data characterizing the shape of pixel grouping located along rail area centerline 51 such that the developed data can be compared in a useful manner to stored reference data is described with reference to
To develop traveling direction values which generally descend for succeeding edge pixels about an exterior of a pixel grouping control circuit 140 may apply “left” shape-characterizing edge crawling rules, derivable from the above right edge. Left edge crawling rules for establishing an initial traveling direction, and traveling direction value, D0, for southeastern edge pixel E0 are:
In the example described with reference to
A partial set of traveling direction values for pixel grouping G5 developed according to the above left shape-characterizing edge crawling rules are summarized in Table 1 herein:
After developing shape-characterizing data comprising traveling directions and high resolution traveling direction values as indicated in Table 1, control circuit 140 continuing with the execution of block 30, (unless it determines that an interior of a character has been crawled as explained with reference to
Because the count of edge pixels that comprise an edge of an imaged character are highly unpredictable and depend upon a variety of factors including the reader-to-target distance, font size, and the resolution of the reader, it is normally necessary to scale the present pixel groupings shape-characterizing data so that the data can be productively compared to stored shape-characterizing data. In the embodiment explained with reference to
Similarly, in the generation of developed data 62, the edge pixels of pixel groupings, e.g. grouping G5 are divided into M equal lengthened segments, and for each of the M segments, control circuit 140 calculates an average high resolution traveling direction value, DHAsg0, DHAsg1, . . . DHAsgM−1. Each average high resolution traveling direction value, DHAsgi, is calculated by averaging a string of high resolution traveling direction values, DH, corresponding to a segment of the pixel grouping. When segmenting a pixel grouping, e.g. grouping G5 into a plurality of substantially equal lengthened segments, it is preferred that control circuit 140 accounts for the fact that a linear pair grouping of edge pixels, e.g. edge pixels E9, E10, represent a greater distance along the perimeter of a character than a diagonal grouping of edge pixels, e.g. edge pixels E50 and E51. In the particular example described with reference to
In addition to adjusting for scale discrepancies represented between the shape-characterizing data expressed in Table 1 and the stored reference data 62, control circuit 140 preferably adjusts for orientation discrepancies between the shape-characterizing data and the stored data 62. While by convention stored reference data 62 corresponds to a reference character in a northward aligned orientation, the developed shape characterizing data of Table 1 corresponds to a character pixel grouping obliquely oriented in an image.
A highly significant advantage of the present invention is that the orientation representation of shape-characterizing data developed by the processes described herein can readily be adjusted. As seen with reference to the example of
OFFSET=((CMPSTS)×(rail angle))/360 deg. Eq. 4
Actual stored reference data such as reference data 62 is preferably scaled and offset to values in the 0 to 255 range so as to be readily expressed in binary 8 bit numbers of substantial resolution. The particular stored reference data 62 are generated by scaling high resolution traveling values, DH, by a factor of 8 to generate 8×CMPST=128 compass positions, offsetting DHAsg, values by multiples of the number of compass positions, and adding 40 to avoid final DHAsg, values in the negative range.
Scaling and offsetting the shape-characterizing data of Table 1 on accordance with reference data 62, and adding the orientation OFFSET value to the result, confirms the accuracy of the shape characterization. As seen in Table 1, the string of edge pixels in the northward stretch region NSR of pixel grouping G5 have traveling directions such as {−4, −4, −3, −4, −3, −4} and high resolution traveling direction values such as {−15, −15, −14, −14, −15}. Scaling and offsetting data corresponding to this segment of edge pixels yields DHA value for the segment of DHA=((−88×8)/6)+128+40=INT[50.66]=51 prior to orientation offsetting. Adding the scaled orientation offset of OFFSET=INT[−13.51}=−14 to the DHA value yields a DHA value for the segment of DHA=51−14=37, which corresponds well to DHA=40, modulo 128, values in the stored reference data 60 corresponding to northward stretch reference character segments, e.g. DHASG25, DHASG4 (reference character “T”) of the reference data 62.
In spite of the provision for orientation adjustment of the developed data, the reorientation of data, under certain circumstances, may not be sufficient to result in selection of a best fit set of data from the stored shape-characterizing database. Note, that in the example of
When a pixel grouping's shape-characterizing data is scaled and offset for scale and orientation adjustment and for compatibility with stored reference data, 62, control circuit 140 at block 32 selects the character represented by the pixel grouping by identifying the stored data that best fits the developed data. A variety of known mathematical methods may be utilized for the identification of the best fit data. In one embodiment, control circuit 140 may calculate the absolute value of the difference between each average high resolution traveling direction value, DHA, of the developed data 60 and the corresponding stored data (DHASGi−DHAsgi), sum these differences and select the best fit data as the data yielding the lowest sum. That is, an error sum assigned to a particular reference character may be calculated according to the formula:
ERSUM=|DHASG0−DHAsgo|+|DHASG1−DHAsg1| . . . |DHASGM−1−DHAsgM−1| Eq. 5
Should an initial comparison between developed data and stored data fail to yield a candidate best fit set of data within a predetermined tolerance, it may be beneficial to calculate a difference sum between the developed and stored data by taking into consideration differences between DHA values for segments of the developed data, e.g. DHAsgi and those of either immediately preceding (DHASGi−1) or immediately succeeding (DHASGi+1) segments of the stored data. For example, the error sum assign a particular reference character may be calculated according to the formula:
ERSUM=|DHASG1−DHAsgo|+|DHASG2−DHAsg1| . . . +|DHASGM−1−DHAsgM−2| eq. 6
As is indicated by the loop defined by blocks 24, 26, 28, 30, and 32, control circuit 140, if recognition is successful at block 32, continues to poll pixels along center line 51 of the rail area to locate and recognize character pixel groupings until the end of the rail area is reached (block 34). The operation of control circuit 140 in the case recognition fails at block 32 depends on the particular configuration of control circuit 140.
Control circuit 140 can be adapted so that if control circuit 140 at block 32 fails to select a best fit set of character data from reference data 32 control circuit 140 nevertheless proceeds to block 26 to continue to search for dark pixels along center line 51. Control circuit 140 also may be configured, as explained previously with reference to
Further still, control circuit 32 can be configured to attempt to recognize an upside down imaged character string in the case recognition fails at block 32. The orientation adjustment processing described herein above is useful for correcting for orientation discrepancies between developed (e.g. Table 1) and stored reference data 62 for a recognizable character oriented at any orientation in a captured image within a 180 degree radius (+−90 degrees from horizontal). To facilitate 360 degree recognition of recognizable imaged characters, stored reference data 62 is made to include shape-characterizing data corresponding to upside down oriented reference characters and control circuit 140 is configured so that control circuit modifies its polling of pixels along center line 51 in the case recognition fails at block 32. Specifically, control circuit 140 can be configured to commence polling of pixels in a leftward direction starting from the right edge R of the rail area in the case recognition fails at block 32 in search of pixel groupings representative of upside-down imaged characters. If control circuit 140 locates a pixel grouping during leftward center line pixel polling, control circuit 140 may develop shape-characterizing data beginning at the most northwest edge pixel of the pixel grouping as determined based on the greatest distance to mathematically interpolated point C′, and may proceed to attempt recognize the upside down imaged character as explained previously in connection with block 32 by comparison of the developed data to the stored upside down reference character data of reference data 62.
Control circuit 140 can also be configured to attempt to recognize right side up and upside down imaged characters concurrently. For example, at the time control circuit 140 develops shape-characterizing data as shown in Table 1 starting from the most southeast edge pixel E0, control circuit 140 can also develop shape-characterizing data starting from the most northwest edge pixel location. Control circuit 140 at block 32 can then compare the two sets of developed data, the rightside up developed shape-characterizing data, and the upside down shape-characterizing data to stored reference data 62 that includes reference shape-characterizing data for both rightside up and upside down reference characters. When configured to recognize rightside up and upside down imaged characters concurrently, control circuit 140 may select both a best fit set rightside up reference characters and a best fit set of upside down reference characters and may select between the sets of characters based on which string yields the lowest cumulative error sum.
There is set forth herein:
A1. An optical character recognition optical reader for recognizing recognizable characters in a captured image, said reader comprising:
While this invention has been described in detail with reference to a preferred embodiment, it should be appreciated that the present invention is not limited to that precise embodiment. Rather, in view of the present disclosure which describes the best mode for practicing the invention, many modifications and variations would present themselves to those skilled in the art without departing from the scope and spirit of this invention as defined in the following claims.
The present application is a continuation of U.S. patent application Ser. No. 12/315,858 filed Dec. 5, 2008 entitled “Method For Omnidirectional Processing of 2D Images Including Recognizable Characters” which is a continuation of U.S. patent application Ser. No. 12/069,438 filed Feb. 7, 2008 entitled “Method For Omnidirectional Processing Of 2D Images Including Recognizable Characters” which is a continuation of U.S. patent application Ser. No. 10/774,218 filed Feb. 6, 2004 entitled “Method For Omnidirectional Processing Of 2D Images Including Recognizable Characters” which is a continuation of U.S. patent application Ser. No. 09/724,367 filed Nov. 28, 2000 entitled “Method For Omnidirectional Processing Of 2D Images Including Recognizable Characters.” Priority of each of the above applications is claimed and each of the above applications is incorporated herein by reference in its entirety.
Number | Name | Date | Kind |
---|---|---|---|
3238501 | Mak et al. | Mar 1966 | A |
3271738 | Kamentsky | Sep 1966 | A |
3665164 | Beveridge et al. | May 1972 | A |
3735350 | Lemelson | May 1973 | A |
3767923 | Bardwell | Oct 1973 | A |
3811033 | Herrin et al. | May 1974 | A |
3832682 | Brok et al. | Aug 1974 | A |
3835453 | Narayanan | Sep 1974 | A |
3903517 | Hafner | Sep 1975 | A |
3916154 | Hare et al. | Oct 1975 | A |
3958235 | Duffy | May 1976 | A |
3978319 | Vinal | Aug 1976 | A |
4012716 | Herrin | Mar 1977 | A |
4088981 | Gott | May 1978 | A |
4095096 | Harada et al. | Jun 1978 | A |
4103287 | Frank | Jul 1978 | A |
4107648 | Frank | Aug 1978 | A |
4147295 | Nojiri et al. | Apr 1979 | A |
4272675 | Blanford et al. | Jun 1981 | A |
4379224 | Engstrom | Apr 1983 | A |
4411016 | Wakeland | Oct 1983 | A |
4414468 | Laurer et al. | Nov 1983 | A |
4434548 | Beswick et al. | Mar 1984 | A |
4453084 | Brouwer | Jun 1984 | A |
4488678 | Hara et al. | Dec 1984 | A |
4567610 | McConnell | Jan 1986 | A |
4575625 | Knowles | Mar 1986 | A |
4610025 | Blum et al. | Sep 1986 | A |
4654873 | Fujisawa et al. | Mar 1987 | A |
4685143 | Choate | Aug 1987 | A |
4685180 | Kitaya et al. | Aug 1987 | A |
4688179 | Yamazaki | Aug 1987 | A |
4746789 | Gieles et al. | May 1988 | A |
4757551 | Kobayashi et al. | Jul 1988 | A |
4762063 | Yeagle | Aug 1988 | A |
4805224 | Koezuka et al. | Feb 1989 | A |
4817166 | Gonzalez et al. | Mar 1989 | A |
4850025 | Abe | Jul 1989 | A |
4874936 | Chandler et al. | Oct 1989 | A |
4879456 | Cherry et al. | Nov 1989 | A |
4896029 | Chandler et al. | Jan 1990 | A |
4907283 | Tanaka et al. | Mar 1990 | A |
4924078 | Sant'Anselmo et al. | May 1990 | A |
4933538 | Heiman et al. | Jun 1990 | A |
4939354 | Priddy et al. | Jul 1990 | A |
4948955 | Lee et al. | Aug 1990 | A |
4961231 | Nakayama et al. | Oct 1990 | A |
4962423 | Yamada et al. | Oct 1990 | A |
4963719 | Brooks et al. | Oct 1990 | A |
4972499 | Kurosawa | Nov 1990 | A |
4988852 | Krishnan | Jan 1991 | A |
4998010 | Chandler et al. | Mar 1991 | A |
5036182 | Ouchi et al. | Jul 1991 | A |
5048113 | Yamagata et al. | Sep 1991 | A |
5050224 | Mori | Sep 1991 | A |
5053609 | Priddy et al. | Oct 1991 | A |
5059779 | Krichever et al. | Oct 1991 | A |
5073954 | Van Tyne et al. | Dec 1991 | A |
5077463 | Sato | Dec 1991 | A |
5081685 | Jones et al. | Jan 1992 | A |
5093868 | Tanaka et al. | Mar 1992 | A |
5124536 | Priddy et al. | Jun 1992 | A |
5124537 | Chandler et al. | Jun 1992 | A |
5126542 | Priddy et al. | Jun 1992 | A |
5131053 | Bernzott et al. | Jul 1992 | A |
5134272 | Tsuchiya et al. | Jul 1992 | A |
5161245 | Fenwick | Nov 1992 | A |
5177793 | Murai et al. | Jan 1993 | A |
5179271 | Lindacher et al. | Jan 1993 | A |
5182777 | Nakayama et al. | Jan 1993 | A |
5189292 | Batterman et al. | Feb 1993 | A |
5195147 | Ohta | Mar 1993 | A |
5222158 | Takasaki et al. | Jun 1993 | A |
5227617 | Christopher et al. | Jul 1993 | A |
5235167 | Dvorkis et al. | Aug 1993 | A |
5243655 | Wang | Sep 1993 | A |
5250791 | Heiman et al. | Oct 1993 | A |
5262623 | Batterman et al. | Nov 1993 | A |
5270525 | Ukai et al. | Dec 1993 | A |
5276315 | Surka | Jan 1994 | A |
5286960 | Longacre, Jr. et al. | Feb 1994 | A |
5294783 | Hammond, Jr. et al. | Mar 1994 | A |
5296691 | Waldron et al. | Mar 1994 | A |
5302814 | Kawabata et al. | Apr 1994 | A |
5304787 | Wang | Apr 1994 | A |
5319181 | Shellhammer et al. | Jun 1994 | A |
5325444 | Cass et al. | Jun 1994 | A |
5329105 | Klancnik et al. | Jul 1994 | A |
5329107 | Priddy et al. | Jul 1994 | A |
5335289 | Abdelazim | Aug 1994 | A |
5335290 | Cullen et al. | Aug 1994 | A |
5341438 | Clifford | Aug 1994 | A |
5343028 | Figarella et al. | Aug 1994 | A |
5352878 | Smith et al. | Oct 1994 | A |
5354977 | Rousteai | Oct 1994 | A |
5357093 | Netter et al. | Oct 1994 | A |
5357602 | Ohta | Oct 1994 | A |
5365048 | Komiya et al. | Nov 1994 | A |
5373147 | Noda et al. | Dec 1994 | A |
5378881 | Adachi | Jan 1995 | A |
5378883 | Batterman et al. | Jan 1995 | A |
5384864 | Spitz | Jan 1995 | A |
5410611 | Huttenlocher et al. | Apr 1995 | A |
5412196 | Surka | May 1995 | A |
5412197 | Smith | May 1995 | A |
5414252 | Shinoda et al. | May 1995 | A |
5418862 | Zheng et al. | May 1995 | A |
5428211 | Zheng et al. | Jun 1995 | A |
5438636 | Surka | Aug 1995 | A |
5440110 | Brooks | Aug 1995 | A |
5446271 | Cherry et al. | Aug 1995 | A |
5449893 | Bridgelall et al. | Sep 1995 | A |
5454054 | Iizuka | Sep 1995 | A |
5463214 | Longacre, Jr. et al. | Oct 1995 | A |
5464974 | Priddy et al. | Nov 1995 | A |
5468953 | Priddy et al. | Nov 1995 | A |
5471041 | Inoue et al. | Nov 1995 | A |
5473151 | Priddy et al. | Dec 1995 | A |
5475768 | Diep et al. | Dec 1995 | A |
5477045 | Priddy et al. | Dec 1995 | A |
5478999 | Figarella et al. | Dec 1995 | A |
5479004 | Priddy et al. | Dec 1995 | A |
5479515 | Longacre, Jr. | Dec 1995 | A |
5481098 | Davis et al. | Jan 1996 | A |
5484999 | Priddy et al. | Jan 1996 | A |
5486689 | Ackley | Jan 1996 | A |
5486946 | Jachimowicz et al. | Jan 1996 | A |
5487115 | Surka | Jan 1996 | A |
5489769 | Kubo | Feb 1996 | A |
5504319 | Li et al. | Apr 1996 | A |
5510603 | Hess et al. | Apr 1996 | A |
5510604 | England | Apr 1996 | A |
5510605 | Miyazaki et al. | Apr 1996 | A |
5514858 | Ackley | May 1996 | A |
5515447 | Zheng et al. | May 1996 | A |
5523552 | Shellhammer et al. | Jun 1996 | A |
5524065 | Yagasaki | Jun 1996 | A |
5539191 | Ackley | Jul 1996 | A |
5545888 | Barkan et al. | Aug 1996 | A |
5548346 | Mimura et al. | Aug 1996 | A |
5550363 | Obata et al. | Aug 1996 | A |
5557689 | Huttenlocher et al. | Sep 1996 | A |
5561720 | Lellmann et al. | Oct 1996 | A |
5565669 | Liu | Oct 1996 | A |
5583949 | Smith et al. | Dec 1996 | A |
5588072 | Wang | Dec 1996 | A |
5591952 | Krichever et al. | Jan 1997 | A |
5591956 | Longacre, Jr. et al. | Jan 1997 | A |
5610025 | White et al. | Mar 1997 | A |
5610995 | Zheng et al. | Mar 1997 | A |
5616905 | Sugiyama et al. | Apr 1997 | A |
5637856 | Bridgelall et al. | Jun 1997 | A |
5640466 | Huttenlocher et al. | Jun 1997 | A |
5644765 | Shimura et al. | Jul 1997 | A |
5649027 | Mahajan et al. | Jul 1997 | A |
5670771 | Watanabe et al. | Sep 1997 | A |
5680478 | Wang et al. | Oct 1997 | A |
5719385 | Wike, Jr. et al. | Feb 1998 | A |
5723853 | Longacre, Jr. et al. | Mar 1998 | A |
5727094 | Kitagaki et al. | Mar 1998 | A |
5739518 | Wang | Apr 1998 | A |
5742041 | Liu | Apr 1998 | A |
5756981 | Rousteai | May 1998 | A |
5767978 | Revankar et al. | Jun 1998 | A |
5773806 | Longacre, Jr. | Jun 1998 | A |
5773810 | Hussey et al. | Jun 1998 | A |
5777309 | Maltsev et al. | Jul 1998 | A |
5777314 | Rousteai | Jul 1998 | A |
5778133 | Plesko | Jul 1998 | A |
5786582 | Rousteai et al. | Jul 1998 | A |
5791271 | Futamura | Aug 1998 | A |
5793899 | Wolff et al. | Aug 1998 | A |
5796868 | Dutta-Choudhury | Aug 1998 | A |
5805728 | Munesada et al. | Sep 1998 | A |
5811776 | Liu | Sep 1998 | A |
5811785 | Heiman et al. | Sep 1998 | A |
5818023 | Meyerson et al. | Oct 1998 | A |
5818970 | Ishikawa et al. | Oct 1998 | A |
5825006 | Longacre, Jr. et al. | Oct 1998 | A |
5844219 | Wallner | Dec 1998 | A |
5845007 | Ohashi et al. | Dec 1998 | A |
5854478 | Liu et al. | Dec 1998 | A |
5854853 | Wang | Dec 1998 | A |
5859929 | Zhou et al. | Jan 1999 | A |
5862267 | Liu | Jan 1999 | A |
5867277 | Melen et al. | Feb 1999 | A |
5889270 | van Haagen et al. | Mar 1999 | A |
5902987 | Coffman et al. | May 1999 | A |
5914476 | Gerst, III et al. | Jun 1999 | A |
5929421 | Cherry et al. | Jul 1999 | A |
5942741 | Longacre, Jr. et al. | Aug 1999 | A |
5943441 | Michael | Aug 1999 | A |
5949052 | Longacre, Jr. et al. | Sep 1999 | A |
5953130 | Benedict et al. | Sep 1999 | A |
5965863 | Parker et al. | Oct 1999 | A |
5969325 | Hecht et al. | Oct 1999 | A |
5979763 | Wang et al. | Nov 1999 | A |
5982927 | Koljonen | Nov 1999 | A |
5984366 | Priddy | Nov 1999 | A |
5987172 | Michael | Nov 1999 | A |
5992753 | Xu | Nov 1999 | A |
5996895 | Heiman et al. | Dec 1999 | A |
5999647 | Nakao et al. | Dec 1999 | A |
6000612 | Xu | Dec 1999 | A |
6002793 | Silver et al. | Dec 1999 | A |
6005978 | Garakani | Dec 1999 | A |
6035066 | Michael | Mar 2000 | A |
6062475 | Feng | May 2000 | A |
6064763 | Maltsev | May 2000 | A |
6082621 | Chan et al. | Jul 2000 | A |
6097839 | Liu | Aug 2000 | A |
6115497 | Vaezi et al. | Sep 2000 | A |
6119943 | Christy | Sep 2000 | A |
6129278 | Wang et al. | Oct 2000 | A |
6137907 | Clark et al. | Oct 2000 | A |
6152371 | Schwartz et al. | Nov 2000 | A |
6155488 | Olmstead et al. | Dec 2000 | A |
6157749 | Miyake et al. | Dec 2000 | A |
6175663 | Huang | Jan 2001 | B1 |
6181839 | Kannon et al. | Jan 2001 | B1 |
6212299 | Yuge | Apr 2001 | B1 |
6230975 | Colley et al. | May 2001 | B1 |
6233353 | Danisewicz | May 2001 | B1 |
6250551 | He et al. | Jun 2001 | B1 |
6264105 | Longacre, Jr. et al. | Jul 2001 | B1 |
6298175 | Longacre, Jr. et al. | Oct 2001 | B1 |
6298176 | Longacre, Jr. et al. | Oct 2001 | B2 |
6299064 | Watanabe et al. | Oct 2001 | B2 |
6328213 | He et al. | Dec 2001 | B1 |
6347163 | Roustaei | Feb 2002 | B2 |
6371373 | Ma et al. | Apr 2002 | B1 |
6381364 | Gardos | Apr 2002 | B1 |
6385352 | Roustaei | May 2002 | B1 |
6386454 | Hecht et al. | May 2002 | B2 |
6516096 | Yokose et al. | Feb 2003 | B2 |
6549660 | Lipson et al. | Apr 2003 | B1 |
6549681 | Takiguchi et al. | Apr 2003 | B1 |
6565003 | Ma | May 2003 | B1 |
6575367 | Longacre, Jr. | Jun 2003 | B1 |
6601772 | Rubin et al. | Aug 2003 | B1 |
6631842 | Tsikos et al. | Oct 2003 | B1 |
6647131 | Bradski | Nov 2003 | B1 |
6655595 | Longacre, Jr. et al. | Dec 2003 | B1 |
6685095 | Roustaei et al. | Feb 2004 | B2 |
6695209 | La | Feb 2004 | B1 |
6703633 | Tullis | Mar 2004 | B2 |
6981644 | Cheong et al. | Jan 2006 | B2 |
7006694 | Melikian et al. | Feb 2006 | B1 |
7007852 | Silverbrook et al. | Mar 2006 | B2 |
7024027 | Suri et al. | Apr 2006 | B1 |
7030881 | Perry et al. | Apr 2006 | B2 |
7059524 | Knowles et al. | Jun 2006 | B2 |
7068821 | Matsutani et al. | Jun 2006 | B2 |
7070107 | Tsikos et al. | Jul 2006 | B2 |
7086595 | Zhu et al. | Aug 2006 | B2 |
7219841 | Biss et al. | May 2007 | B2 |
7239346 | Priddy | Jul 2007 | B1 |
7261238 | Carlson et al. | Aug 2007 | B1 |
7331523 | Meier et al. | Feb 2008 | B2 |
7347376 | Biss et al. | Mar 2008 | B1 |
7364081 | Havens et al. | Apr 2008 | B2 |
7387253 | Parker et al. | Jun 2008 | B1 |
7398930 | Longacre, Jr. | Jul 2008 | B2 |
7413127 | Ehrhart et al. | Aug 2008 | B2 |
7416125 | Wang et al. | Aug 2008 | B2 |
20020044689 | Roustaei et al. | Apr 2002 | A1 |
20040101191 | Seul et al. | May 2004 | A1 |
20050047655 | Luo et al. | Mar 2005 | A1 |
20060029183 | Borghese et al. | Feb 2006 | A1 |
20060076423 | Silverbrook et al. | Apr 2006 | A1 |
20060113387 | Baker et al. | Jun 2006 | A1 |
20060211071 | Andre et al. | Sep 2006 | A1 |
20080112613 | Luo et al. | May 2008 | A1 |
Number | Date | Country |
---|---|---|
55-115166 | Sep 1980 | JP |
63-246975 | Oct 1988 | JP |
04-021268 | Jan 1992 | JP |
04-021270 | Jan 1992 | JP |
04-021272 | Jan 1992 | JP |
05-176223 | Jul 1993 | JP |
11-284862 | Oct 1999 | JP |
Number | Date | Country | |
---|---|---|---|
Parent | 12315858 | Dec 2008 | US |
Child | 12814019 | US | |
Parent | 12069438 | Feb 2008 | US |
Child | 12315858 | US | |
Parent | 10774218 | Feb 2004 | US |
Child | 12069438 | US | |
Parent | 09724367 | Nov 2000 | US |
Child | 10774218 | US |