1. Field of the Invention
The present invention relates generally to personal identification using biometric features derived from an image of a human iris, and more particularly, to segmentation of iris images using active contour processing.
2. Related Art
Iris recognition techniques are one of several biometric technologies used commercially for access control and identity verification. Generally, iris recognition includes three main components: iris segmentation, iris encoding and iris matching. During iris segmentation, the iris region is localized in an eye image by a computing system to select the image area occupied by the iris. This process includes identifying the boundary between the pupil and the innermost iris tissue, known as the pupillary boundary, and the boundary between the outermost iris tissue and the sclera, known as the limbic boundary.
Conventional iris recognition techniques are designed based on the assumption that the pupillary and limbic boundaries are well approximated by, for example, circles, ellipses, etc. Referring specifically to circular approximations, the assumption of boundary circularity is satisfied only if the iris is presented frontally to the camera, the eye in question has substantially circular iris boundaries, and the iris is not occluded by eyelashes or eyelids. In practice, however, these constraints are not always satisfied such as when an iris is not frontally presented to the camera. Moreover, in practice it is common for eyelids and eyelashes to occlude significant portions of the pupillary and limbic boundaries, thereby violating the circularity assumption. These deficiencies in conforming to the circularity assumption coupled with other limitations of existing iris segmentation techniques contribute to inaccuracies in iris recognition processes. Thus, while existing methods of iris segmentation have proven effective, the levels of accuracy may be improved upon.
In one aspect of the invention, a method for determining a contour representation of non-occluded regions in an iris image is provided. The method comprises: receiving an initial contour estimate of pupillary and limbic boundaries in the iris image which define an iris image area; determining an initial estimate of a noise boundary contour defining an area containing occluding data points within the iris image; executing an active contour function on the initial estimate of the noise boundary contour in an unwrapped representation of the iris image area to generate a revised noise boundary contour containing a revised set of occluding data points; and excluding from the initial contour estimate the revised set of occluding data points to generate a contour estimate of the non-occluded regions of the iris.
In another aspect of the invention, a method for refining an iris image is provided. The method comprises: receiving an initial contour estimate of pupillary and limbic boundaries in the iris image which define an iris image area; generating a polynomial representation of a selected one of the pupillary and limbic boundaries using the initial contour estimate of the selected boundary; executing an active contour method on the polynomial representation based on intensity data at the boundary of the representation; and generating a revised contour estimate of the selected boundary based on the execution of the active contour method thereby causing the revised estimate to more accurately represent the selected boundary.
In other embodiments a method for segmentation of an obtained iris image having at least one occluded region therein is provided. A Canny transform is performed on the obtained iris image to identify intensity gradients representing edge points within the iris image; performing a circular Hough transform on a plurality of the edge points to identify the pupillary and limbic boundaries within the iris image; performing at least one Radon transform to define two straight line segments each representing a boundary of the occluded region, wherein the occluded region is further bounded by one or more borders of the image; and removing the region bounded by the two straight line segments and the one or more borders from the iris image.
a) through 6(e) depict the results of the application of embodiments of the present invention to the limbic boundary in an image of a partially closed eye.
Aspects of the present invention are generally directed to processing an obtained iris image. Specifically, the obtained iris image is segmented for use in a biometric recognition scheme.
Embodiments of the invention use active contour processing to generate a refined iris image that image takes into account local image content and excludes, for example, occluded areas in an initial iris image from iris matching computations. For example, in one such embodiment, an initial noise boundary contour based on an evaluation of the intensity data in the iris image area is obtained. An active contour method is applied to the initial noise boundary contour to revise the initial noise boundary contour estimate. The revised estimate of the noise boundary contour is used to determine a refined iris image for use in iris matching operations. In certain embodiments, the iris image area defined by initial circular contour estimates of the limbic and pupillary boundaries is unwrapped into a rectangle prior to use of the active contour method to revise the initial noise boundary contour estimate. In other embodiments, the noise boundary contour is revised in the original image and the iris is not unwrapped to a rectangle.
Other embodiments using active contour processing generate a refined iris image by revising a representation of the pupillary and/or limbic boundary. In such embodiments, an initial estimate of either the pupillary or limbic boundary is obtained and a polynomial representation of the selected boundary is generated. This representation is revised using an active contour method based on intensity data at the boundary of the representation to more accurately represent the actual boundary in the iris image. The revised representation is used to generate a refined iris image for use in iris matching operations.
In alternative embodiments, the obtained iris is segmented to remove regions of the image occluded by, for example, eyelashes and/or eyelids. In one such embodiment, a Canny transform is performed on the obtained iris image to identify intensity gradients representing edge points within the iris image. A circular Hough transform is performed on a plurality of the edge points to identify the pupillary and limbic boundaries within the iris image. At least one Radon transform is performed to define two straight line segments. The straight line segments, along with the borders of the iris image, bound the occluded region. The region bounded by the two straight line segments and the iris image borders are removed from the iris image. This revised iris image having the occluded region removed is used in iris matching operations.
In one embodiment, the methods of the present invention are embodied in one or more computer software programs written in a structured computing language such as C for example, and adapted for use in conjunction with currently available iris imaging devices.
As shown in
In certain embodiments described in greater detail below with reference to
In alternative embodiments described in greater detail below with reference to
As shown at block 214 of
In one embodiment, initial circular contour estimates of the pupillary and limbic boundaries can be obtained by conventional means such as a circular Hough transform for example, and are typically specified as three scalars per boundary, namely the x and y positions of the circle center and the radius of the circle.
The active contour model (A. Blake and M. Isard, “Active Contours”, 1998, Springer-Verlag) is an integral form intended to characterize a balanced combination of the stiffness, elasticity and interpolation ability of the contour {right arrow over (ν)}(s) so that changes may be made to {right arrow over (ν)}(s) to optimize the integral form. Specifically, the active contour energy is given by:
Equation (1), which is an integral over an arc length, is a function of the contour {right arrow over (ν)}(s) and may be evaluated multiple times using modified versions of {right arrow over (ν)}(s) to find a contour {right arrow over (ν)}(s) that achieves a minimum value while balancing the results of equations (2), (3) and (4). Einternal (Equation (2)) is a component that weights the amount of elasticity and stiffness in the boundary. The elasticity is measured by the tangent vector magnitude (larger values mean that a point on the curve moves a larger amount, given the same change in the arc length of the parameter (s)). The stiffness is measured by the second derivative vector magnitude and achieves larger values when the boundary {right arrow over (ν)}(s) has more curvature given the same change in the arc length parameter(s).
These two contributions are mixed by the scalar weights α and β. Eimage (Equation (3)) is the negative of the gradient magnitude of the image data at the point {right arrow over (ν)}(s); i.e., it is a measure of how much gray scale variation there is in the neighborhood of {right arrow over (ν)}(s). This term is minimized when the gradient magnitude is large and in isolation, causes the vertices in {right arrow over (ν)}(s) to migrate toward edges in the image. Econ (Equation (4)) is the square of a smoothed version of the gradient magnitude and acts in a manner similar to that of Eimage; in isolation, minimizing it will tend to make the vertices in {right arrow over (ν)}(s) approach the edges in the image. The operator ∇(.) is the gradient of its operand, the asterisk * represents the convolution operator, and Gσ is a two-dimensional Gaussian smoothing operator with standard deviation σ. Modifying the contour {right arrow over (ν)}(s) so as to optimize the energy function with respect to a particular choice of weighting functions α(s) and β(s) and smoothing parameter σ yields a contour that strikes the designed balance between stiffness, elasticity, and ability to interpolate desired positions such as those on the initial contour.
As noted above, the initial circular contour estimates of the limbic and pupillary boundaries may be established by a Hough transform. In one embodiment of the present invention, once the limbic and pupillary boundaries have been established, an initial noise boundary contour is determined by conventional means based on pixel intensity data in the iris image. The noise boundary contour is reflective of local image content such as occluding eyelids and eyelashes for example. This initial estimate of the noise boundary contour is a piece-wise linear approximation of the edge of the eyelid or other occluding object. This approximation is expressed as a contour (s). After the initial noise boundary contour estimate is established, the located iris area may be unwrapped into a rectangular image. In one embodiment, the unwrapped image is in a rectangular arrangement of 20×240 pixels. The unwrapping process is well known and described by Daugman, J. G. “High Confidence visual recognition of persons by a test of statistical independence,” IEEE Trans. On Pattern Analysis and Machine Intelligence, vol. 15, pp. 1148-1161, 1993, which is hereby incorporated by reference herein.
After the initial circular contour estimates are unwrapped, the data representing the identified noise boundary contour within the unwrapped iris image is used as input to Equation (1). Specifically, the initial estimate of the noise boundary contour in the iris image is an initial estimate of (s) and is used as input to Equation (1). The functions α(s) and β(s) in Equation (2) are chosen empirically to balance the quality of the boundary against its smoothness and ability to shrink or stretch to fit the true boundary in the image. Derivatives in Equation (2) are approximated by central differences or they can be obtained analytically from a functional form of {right arrow over (ν)}(s). The image gradient ∇(.) is estimated by finite differences and the smoothing convolution (∇(Gσ(x,y)*I(x,y)))2 is implemented using a loop abstraction. Thus, in one embodiment of the invention, the initial estimate of the noise boundary contour permits an accurate implementation of Equation (1). The application of Equation (1) is performed iteratively until the energy value is optimized, at which point the iterations are terminated and the resulting revised noise boundary contour is reported as the final result, i.e. the area to be excluded from matching computations. Although embodiments of the present invention are described herein with reference to revision of the initial noise boundary within an unwrapped iris image, it should be appreciated that the initial noise boundary may be revised within an image that has not been unwrapped.
The interpolating spline representation is a polynomial (or rational polynomial) having x and y coordinates for points on the boundary. When graphed, such a spline form can interpolate any desired number of boundary points. As noted, the initial points of the interpolating spline are drawn from the initial estimates described above.
At block 220, an active contour method is used to revise the spline interpolation representation based on intensity data at the boundary of the representation. In other words, this process generates a revised contour estimate of the boundary used to create the spline representation. Referring to embodiments in which a limbic boundary is revised, the x and y coordinates of the spline interpolation are adjusted based on intensity data so that the spline interpolation more closely approximates the limbic boundary of the iris. The active contour method used in these embodiments of the present invention may be substantially similar to the active contour method described above with reference to
For example, in one such embodiment, the position space of a point (x,y) on the boundary of the representation is used as an initial estimate of (s) and is used as an input to Equation (1). The functions α(s) and β(s) in Equation (2) are chosen empirically to trade the quality of the boundary against its smoothness and ability to shrink or stretch to fit the true boundary in the image. Derivatives in Equation (2) can be approximated by central differences or obtained analytically from the functional form of {right arrow over (ν)}(s). The image gradient ∇(.) is estimated by finite differences and the smoothing convolution (∇(Gσ(x,y)*I(x,y)))2 is implemented using a loop abstraction. Thus, in one illustrative embodiment, the discretized implementation of the continuous-domain theory for the active contour is faithful. Operationally, the control points that define {right arrow over (ν)}(s) are moved under the control of a gradient descent procedure to cause the discrete approximation to Equation (1) to be minimized.
As shown, the active contour method may be executed iteratively on the representation of the selected boundary. When an iteration is performed, a determination of whether an optimum refinement of the selected boundary has been reached is made at step 222. If an optimum refinement is not reached, the method returns to block 220 for further adjustment of the representation, and steps 220 and 222 are repeated. More specifically, this iterative procedure continues until the energy value in the active contour processing plateaus. At this point, the iterations terminate and the resulting interpolant function is reported as the optimum refinement. The resulting revised contour estimate of the selected boundary is then used to generate a refined iris image used for iris matching operations. In particular, the revised contour estimate is used to delineate iris image areas to be included/excluded from consideration when the scanned iris data is compared to stored iris data in a biometric database.
In certain embodiments either the pupillary or the limbic boundary may be revised in the above manner. In such embodiments, the refined iris image would be defined by the revised pupillary or limbic boundary, and the initial estimate of the other boundary. In other embodiments, both the pupillary and the limbic boundary may be revised in the above manner. In these embodiments, the refined iris image would be defined by the revised boundaries.
Once the iterative process begins however, it can be clearly seen that the limbic boundary curve improves with each successive iteration. These representations of the limbic boundary are contour representations of the limbic boundary of the scanned iris that take into account and exclude local image data, such as noise due to the occluding upper and lower eyelids and eyelashes shown, or image data that falls outside the actual boundary. Thus, as can be seen in
The methods above, while specifically describing the determination of the limbic boundary, are equally applicable to the accurate determination of the pupillary boundary. In this case the pupillary boundary is first approximated by a circle generated by the execution of a Hough transform, or any other known operation, on the scanned iris image. This representation corresponds to the initial scalar estimate of the pupillary boundary which does not adequately approximate the true boundary due to occlusion from the partially closed upper eyelid as well as the non-circularity of the pupil. A more accurate representation of the pupillary boundary takes into account and excludes the local image data (noise), which in this case may comprise the occluding upper and lower eyelids and eyelashes to the extent that they extend into the pupil. Through the use of one or more of the active contours methods described above, the starting point is a circular region within the pupil known to be free of eyelid and eyelash intrusion. The starting circle is iteratively refined until it closely approximates the true pupillary boundary.
As noted above, in certain embodiments of the present invention an obtained iris is segmented to remove regions of the image occluded by, for example, eyelashes and/or eyelids. Such an occluded region is sometimes referred to herein as the eyelid-eyelash noise region.
In one such embodiment, a Canny transform (sometimes referred to herein as a Canny edge detection algorithm or Canny edge detector) is used to detect points within the iris image that correspond to the pupillary and limbic boundaries of the obtained iris image. In operation, the Canny transform uses a Gaussian filter to smooth the iris image. The Canny transform uses a first derivative operator to identify intensity gradients in the smoothed image. The transform evaluates the gradients to determine if a gradient is a local maxima gradient. Non-maxima suppression is used to eliminate intensity gradients which do not correspond to local maxima.
Following the above non-maxima suppression, a threshold comparison is conducted to identify gradients that correspond to edge points. It is generally accepted that intensity gradients having the largest intensity are likely to correspond to edge points. However, it is not possible to specify a threshold intensity at which a given intensity gradient corresponds to an edge point. Thus, the Canny transform uses thresholding with hysteresis to determine which gradients correspond to edges.
Thresholding with hysteresis uses two reference thresholds, a high threshold (T1) and a low threshold (T2), to identify edge points. Specifically, all the gradients that have an intensity which is higher than T1 are marked as edge points. Any other intensity gradients adjacent to one of these identified edge points which have an intensity higher than T2 are also marked as edge points.
After the edges points are identified as described above using the Canny transform, estimates of the pupillary and limbic boundaries can be obtained through the use of, for example, a circular Hough transform. In a circular Hough transform, each of the limbic and pupillary boundaries are specified as three scalars per boundary. These scalars are the position (x, y) of the circle center, and the radius (r) of the circle. During the Hough transform, each edge point generates an estimate of the position (x, y) of the center of boundary to which the edge point corresponds. Each edge point also generates an estimate of the radius (r) of the boundary. These estimates of x, y and r are then used to identify the pupillary and limbic boundaries.
In specific embodiments, an active contour method may be used to refine the limbic and/or pupillary boundaries identified using the Hough transform. This active contour method is optional.
After the pupillary and limbic boundaries are identified, a linear Radon transform is used to define two straight line segments which delineate or model boundaries of the eyelid-eyelash noise region. Because the bounded portions of the iris image are deemed occluded, these regions are removed from the image and thus excluded from iris matching operations.
In certain embodiments of the present invention, a linear Radon transform is used to generate estimates of the boundaries of both the upper and lower eyelids in the obtained iris image. In such embodiments, the obtained iris image is split into four sections of equal size, referred to as a top left section, a top right section, a bottom left section and a bottom right section. The image is split such that there is an overlap of half of the pupil radius between each section. Following division of the iris image, each section has one of the eyelid estimating lines therein. Each such line represent a boundary of the eyelid-eyelash noise region. The eyelid-eyelash noise region is detected in each of these four sections, and the results are joined together to form an iris image from which eyelid-eyelash noise has been removed.
Although embodiments of the present invention have been described herein as refining an iris image using one of the methods described above with reference to
Furthermore, although embodiments of the present invention have been primarily discussed herein with reference to the use of circular estimates of the iris boundaries, it should be appreciated that other estimates may also be used. For example, in certain embodiments elliptical estimates of either of the limbic or pupillary boundaries may be used. These estimates may be generated using any operation or method now know or later developed.
While various embodiments of the present invention have been described above, it should be understood that they have been presented by way of example only, and not limitation. It will be apparent to persons skilled in the relevant art that various changes in form and detail can be made therein without departing from the spirit and scope of the invention. Thus, the breadth and scope of the present invention should not be limited by any of the above-described exemplary embodiments, but should be defined only in accordance with the following claims and their equivalents. All patents and publications discussed herein are incorporated in their entirety by reference thereto.
This application claims priority from U.S. Provisional Patent Application No. 60/992,799 entitled, “SEGMENTATION OF IRIS IMAGES USING ACTIVE CONTOUR PROCESSING,” filed Dec. 6, 2007, which is hereby incorporated by reference herein.
This invention is made with U.S. government support under Grant ID No. CNS-0130839 awarded by the National Science Foundation and Grant ID 2004-DD-BX-1224 awarded by the Department of Justice. The government has certain rights in this invention.
Number | Date | Country | |
---|---|---|---|
60992799 | Dec 2007 | US |