This invention relates to image processing and, more particularly, to computing an Euler number for an image.
Image processing refers to the analysis and manipulation of images using a computer. The discipline encompasses a wide array of techniques, including pattern recognition and other image analysis, compression and encoding, such as for image transmission, and image construction, to name but a few examples. Image processing is used in diverse fields such as astronomy, medicine, military operations, communications, geology, meteorology and so on.
A binary or digital image is typically represented by one or more topological properties. Topological properties represent the geometric shape of the image. Where changes are made to the image itself, such as stretching, deformation, rotation, scaling, translation, or other rubber-sheet transformations, the topological properties do not change. These properties, therefore, are quite useful in image characterization. Topological properties, for example, may be helpful in performing shape matching, object and pattern recognition, image data retrieval, and so on.
One topological property of a digital image is known as an Euler number. An Euler number is defined as the difference between the number of connected components of the binary image and the number of holes in the image. Like the other topological properties, the Euler number remains invariant despite the above-mentioned transformation of the binary image.
For some classes of digital images, Euler numbers have strong discriminatory power. In other words, once the Euler number for a particular digital image is known, the digital image may readily be distinguished from other digital images in its class. This implies that the Euler number may be used for more efficient searching or matching of digital images.
For example, the Euler number may be used in medical diagnosis such as the detection of malaria infected cells. By calculating the Euler number of each cell image, the malaria infected cells may be identified, as the Euler number of an infected cell is often different from that of an uninfected cell. The Euler number may also be used for image searching, such as in a database of logo images. Critical image processing applications such as these involve large amounts of data and, at the same time, expect quick real-time response. Fast computation of the Euler number of an image, for some applications, is, therefore, an indispensable task.
Obtaining the Euler number for a digital image, however, may be computationally expensive. For a pixel matrix including 150,000 pixels, for example, over 300,000 pixel accesses and 50,000 or more processor cycles may be expected in computing the Euler number.
Thus, there is a continuing need for better ways to compute the Euler number for a binary image.
In accordance with the embodiments described herein, an Euler number representation of a binary image is calculated. The calculations may be performed using discrete logic components to analyze the binary image.
In
The system 300 includes circuitry for calculating the Euler number 90 of the binary image. According to the embodiments described below, a portion of the binary image 16 is analyzed such that Euler number 90 for the portion, or partial binary image, may be computed. Additional portions of the binary image 16 are likewise analyzed, in an iterative manner, until an Euler number 90 of the entire binary image 16 may be obtained.
The Euler number 90 may be very useful in characterizing a binary, or two-tone, image. Such characterization, for example, may be useful in performing database retrieval of binary images. Further, the principles embodied herein may be applied to gray-tone or color images which are divided into two-tone portions. Before describing the features of the system 300 further, an explanation of Euler numbers, as well as some properties of Euler numbers, follows.
In
In arriving at the Euler number 90, an analysis of the binary image 16 within the pixel matrix 20 is made. The Euler number is the difference between the number of connected components, or objects, and the number of holes in the binary image 16. A simple, geometric binary image 16 is depicted in
To determine the number of connected components and holes in the binary image 16, a pixel 12 is analyzed in terms of adjacent pixels, or its neighbors, of variable granularity. For a pixel 12 at location (x,y) in the pixel matrix 20, a 4-neighborhood 14 includes the set of pixels at locations [(x−1,y), (x+1,y), (x,y−1), and (x,y+1)]. For example, in
Similarly, an 8-neighborhood 18 of pixel locations [(8,5), (8,6), (8,7), (9,5), (9,7), (10,5), (10,6), (10,7)] is shown in
In
Assume that the shaded pixels represent a pixel value of “1.” All of the shaded pixels shown in the binary image 16 of
Another feature of the binary image 16 used to calculate the Euler number is a hole. A hole is set of background pixels, e.g., pixels of value “0” such that any background pixel in the set is in the neighborhood of at least one background pixel of the same set. Further, the entire set of background pixels is enclosed by a connected component.
Thus, in
Still another feature used to calculate the Euler number, according to one embodiment, is a run. A run, R(i), of the ith column (or row) of the pixel matrix 20 is defined to be a maximal sequence of consecutive 1's in the ith column (or row).
If the binary image 16, known as I, consists of a single row or a single column i, the Euler number of the image 16 is the same as the number of runs, R(i), of the image 16, as shown by the following formula:
Property A: E(I)=R(i)
For example, for the 0th row of the pixel matrix 20 of
Thus, from Property A, the Euler number, E(I), of image I may be determined, where I includes a single row (or column). Additionally, the Euler number satisfies the additive set property. Given two images, I1 and I2, the union (∪) of the two images is defined as a simple juxtaposition of I1 and I2, either vertically or horizontally, without any overlap.
For example, an image, I1, in
In contrast, the intersection (∩) of I1 and I2 is the image formed by the last row (or column) of I1, and the first row (or column) of I2, if the images I1 and I2 are joined horizontally (or vertically). The intersection image I1∩I2 is illustrated in
Suppose binary images I1 and I2 are joined horizontally, i.e., I1 lies entirely above I2, as shown in
From these observations, the Euler number for an image, I, which is the union of partial images, I1 and I2, with Euler numbers E(I1) and E(I2), respectively, may be obtained:
Property B: E(I)=E(I1 ∪I2)=E(I1)+E(I2)−E(I1∩I2)
As shown in
Property C: E(Ii-1∩Ii)=Oi
So, where Ii-1 is the partial image consisting of rows 1, 2, . . . , (i−1) rows of the pixel matrix 20, E(Ii-1) is the Euler number of Ii-1. The Euler number of the image, I, consisting of only a single row, i, is R(i), from Property A. If the row, i, is added to Ii-1 to form the ∪ image, I1, the intersection of these two images is formed by the (i−1)th and the ith rows.
From the above principles, an Euler number, E(I), for an image, I, may be computed incrementally from the partial images, I1, I2, . . . IN-1, IN:
and, to generalize this, the formula:
where IN=I denotes the entire image.
Restating the formula, the Euler number of the binary image 16 is the difference between the sum of the number of runs for all rows (or columns) and the sum of the number of neighboring runs between all consecutive pairs of rows (or columns) of the pixel matrix 20.
Looking back to
The system 300 further includes a neighboring run processors 80, in one embodiment, for calculating the number of neighboring runs for consecutive rows (or columns) of the binary image 16. The neighboring run processor 80 is described in more detail in
A math processor 66 computes the sum of the number of runs and the sum of the number of neighboring runs, according to one embodiment. The math processor 66 additionally may subtract the two sums, as specified above, in computing E(IN). The run processor 70, neighboring run processor 80, and math processor 66 may be implemented using discrete logic, specialized processing hardware such as digital signal processors (DSPs), or software, to name a few possible implementations of the system 300.
Using the formula for calculating the Euler number, E(IN), above, the Euler number of the binary image 16 may be iteratively computed. The Euler number of the first row (or column), comprising a first or partial image, is calculated. The next row (or column) is then added to the partial image and then the Euler number of the union image is calculated. Thereafter, successive rows (or columns) are added together and, accordingly, the Euler number of each partial image is calculated. The process is repeated until all rows (or columns) are exhausted.
In
Succeeding operations are performed iteratively for all i rows (or columns) of the binary image 16. The number of runs, R(i), in the ith row (or column) is calculated (block 108). Likewise, the number of neighboring runs, Oi, between the (i−1)th and the ith rows (or columns) is calculated (block 110).
Once the number of runs, R(i) and the number of neighboring runs, Oi, is known, a new Euler number, E(I), of a partial image consisting of the first and subsequent rows (or columns) may be calculated, according to the formula derived above (block 112). This process is repeated iteratively until all the rows (or columns) of the pixel matrix 20 have been analyzed.
In a second embodiment, multiple run and neighboring run number calculations are performed in parallel, as shown in
While the number of runs in the 1st row is calculated, the numbers of runs in the 2nd row may be calculated, the number of runs in the 3rd row may be calculated, and so on up to the Nth row, where N parallel processors are available (block 204).
Likewise, the number of neighboring runs between the 1st and the 2nd rows may be calculated, while, simultaneously with the operations of block 204, the number of neighboring runs between the 2nd and 3rd rows, between the (N−1)th and Nth rows, and so on are calculated, where (N−1) neighboring run processors are available (block 206). Where N−1 processors are not available, a smaller number of parallel operations may be performed. The variable, i (shown in block 202), could be incremented based upon the number of parallel operations being performed. The operations of blocks 204 and 206 may be repeated, as needed, until both the number of runs and the number of neighboring runs for all rows have been calculated.
In one embodiment, the runs for all the rows are added together (block 208). Likewise, the number of neighboring runs for all pair of consecutive rows are added together (block 210). The Euler number 90 for the complete binary image 16 is calculated by subtracting the sum of the neighboring runs from the sum of the neighboring runs (block 212).
In one embodiment, a very large scale integration (VLSI) architecture is implemented to compute the Euler number 90 for the binary image 16. A special purpose VLSI chip may serve as a coprocessor to a host computer, for example, to expedite computation of the Euler number 90. See, for example,
In one embodiment, the VLSI architecture includes the two processing elements shown in
In
In one embodiment, the run processor 70 calculates the number of transitions from ‘0’to ‘1’ in the N×M pixel matrix 20. The number of runs is equal to the number of such transitions. An input line 30 is fed into the D flip-flop 34, for receiving each pixel in a row (or column) of the pixel matrix 20.
The D flip-flop 34 is set to 0 at the beginning of processing each row (or column). The D flip-flop 34 holds the value of the previous pixel to check for a transition. The pixels in a row (or column) are pipelined into the run processor 70, at the input line 30. At any instant of time, ti, the ith and (i−1)th pixel of a particular row (or column) are checked for a 0-to-1 transition.
In this manner, the run processor 70 identifies runs for each row (or column) of the pixel matrix 20. The D flip-flop 34 further receives a clock 32 to synchronize with each input pixel. When the clock 32 is high, a Q output 36 follows the input line 30. When the clock 32 is low, there is no change to the output. The {overscore (Q)} output 38 AND-ed with the input line 30 produces an output 40, shown as X, which, if high, identifies the start of a run for the row (or column) being analyzed. As shown below, the Q output 36 from the D flip-flop 34, and the output X from the AND gate 44, may be fed into a neighboring run processor 80. The X output 40 is fed into the modulo counter 42, which keeps a count of the number of times X goes high, i.e., the number of runs. The X output 40 may also be received by the neighboring run processor 80, as shown below.
For a binary image 16 in an N×M pixel matrix 20, if the pixels are processed column-by-column (or row-by-row), the maximum number of runs possible in a row (or column) is ┌M/2┐ (or ┌N/2┐). If the processing is done column-by-column, the counter is a modulo ┌M/2┐ counter 42. The modulo ┌M/2┐ counter 42 also receives the clock 32, and, each time a run for the column is identified, the counter 42 is incremented. An output signal 46 indicates the number of runs counted.
Thus, in one embodiment, after all pixels have been processed and all runs have been counted, the output signals 46 are fed to a run adder 72, as shown in
The neighboring run processor 80 identifies the neighboring runs between consecutive rows (or columns) of the binary image 16. The neighboring run processor 80 checks for the condition when a particular run in a row (or column) starts and also determines whether the run is in the neighborhood of another run in its adjacent row (or column). In one embodiment, the pixels 12 of two adjacent rows (or columns) are fed to the neighboring run processor 80 in pipeline.
Additionally, the neighboring run processor 80 receives data from the output of the D flip-flops 34 (Q outputs 36a and 36b) and AND gate 44 (X outputs 40a and 40b) of two consecutive rows (or columns). Accordingly, the neighboring run processor 80 is flanked on either side by a run processor 70, in one embodiment. At any instant of time, ti the ith and (i−1)th pixels 12 of consecutive rows (or columns) are checked for a neighboring run.
As with the run processor 70, the neighboring run processor 80, in one embodiment, uses discrete logic to perform counting operations. AND gates 52, 54, 56, and 58 receive the X signals 40a and 40b, the Q signals 36a and 36b, and the input signals 30a and 30b, as depicted in
The neighboring run processor 80 also includes a modulo counter 60, for counting the occurrences where the output signal 48 goes high. The modulo counter 60 receives the clock signal 32.
For an N×M pixel matrix 20, the maximum number of neighboring runs is 2×┌M/2┐, or M+1. Accordingly, in one embodiment, the modulo counter 60 is a modulo (M+1) counter. The counter produces output signals 62, which comprise a sum of the neighboring runs in a pair of consecutive rows (or columns).
To process an N×M image in parallel, according to one embodiment, N copies of the run processor 70 and N−1 copies of the neighboring run processor 80 are used. For example, in
Each neighboring run processor 80 receives three sets of inputs, according to one embodiment. Two input signals 30 correspond to each pixel in adjacent rows (or column) of the pixel matrix 20. Two X outputs 40 and two Q outputs 36 correspond to the neighboring relation of a run for the adjacent rows (or columns).
In one embodiment, the outputs 46 from the run processors 70 are fed into a run adder 72. Likewise, the outputs 62 from the neighboring run processors 80 are fed into a neighboring run adder 74. The run adder 72 is an N-input adder and the neighboring run adder 72 is a (N−1)-input adder. In one embodiment, the adders 72 and 74 may perform additions up to [N×┌M/2┐] and [2×(N−2)×┌M/2┐], respectively.
The run adder 72 produces output signal 82, shown as X, while the neighboring runs adder 74 produces output signal 84, shown as Y. The circuit of
The system 300 of
The processor-based system 400 further includes a storage 22, such as a memory or other volatile medium. As examples, the storage 22 may be a random access memory (RAM) or a flash/read-only memory (ROM), although this represents but two of many possible types of storage. Additionally, the storage 22 may store a software program 100, such as for implementing the operations of
In one embodiment, the storage 22 may be N×M bits to store all the pixels 12 of the binary image 16 in a N×M pixel matrix 20 (since each pixel 12 in a binary image is either 0 or 1). To calculate the number of neighboring runs between two consecutive rows (or columns), the column (or row) number identifying the starting point and the continuity of a particular run may be ascertained from output values Q 36 and X 40 (
A worst case scenario for calculating the Euler number 90 arises when every alternating pixel 12 is an object pixel. There, the maximum number of runs in a row (column) is ┌M/2┐. Where the rows (columns) are analyzed sequentially, such as depicted in
The time complexity of calculating the Euler number 90 according to the embodiments described herein may be measured in terms of the number of accesses of the pixels 12 of the pixel matrix 20. In one embodiment, the number of pixel accesses for computing the Euler number 90 for a binary image 16 is:
The number of pixel accesses depends on the run distribution of the binary image 16 and, for a substantial number of binary images, R(i)<<M and Oi<<M.
While the present invention has been described with respect to a limited number of embodiments, those skilled in the art will appreciate numerous modifications and variations therefrom. It is intended that the appended claims cover all such modifications and variations as fall within the true spirit and scope of this present invention.
Number | Name | Date | Kind |
---|---|---|---|
3905018 | Gray | Sep 1975 | A |
5553196 | Takatori et al. | Sep 1996 | A |
6335986 | Naoi et al. | Jan 2002 | B1 |