Not applicable.
The present invention relates generally to an edge detection technique having improved feature visibility.
Different image processing techniques are used for different purposes. In many cases, an image processing technique is used to identify edges (or otherwise gradients) within images (or images of a video). Identified edges within an image may be subsequently used for many different applications. For example, the edge identification may be used for motion estimation, image segmentation, target tracking, etc.
Edge identification may be implemented using the differences between the color values (e.g., the intensity) of the pixels included within a region of the image. The pixel differences may include detecting the positions of abrupt changes in the pixel values, and recognizing such positions as corresponding to the outlines of objects. In many cases a low pass filter is first applied to the image, and then the changes are determined for each of the pixels for which a first derivative of the intensity variation gradient within the image reaches a local maximum and exceeds a predetermined threshold value. Each such determination is assumed to be located on an edge of an object in the image.
Another edge detection technique includes a “zero-crossing”, whereby the zero crossings of the second derivative of the gradient are detected to identify the edge locations.
Another edge detection technique includes the use of predetermined shape templates that are compared with the image to identify the approximate positions of objects, with the edge identification technique being applied to the identified objects.
What is desired is a signal processing technique for enhancing the identification of gradients.
The foregoing and other objectives, features, and advantages of the invention will be more readily understood upon consideration of the following detailed description of the invention, taken in conjunction with the accompanying drawings.
Image gradient (or image derivative) is a fundamental technique used in digital image processing. The image gradient identifies information about discontinuities within the image, and is frequently a prominent feature that immediately grasps the viewer's attention. The image gradient is used in conjunction with many image processing and computer vision techniques including, for example, image segmentation, object recognition, object contour extraction, edge detection, 3D, image up-scaling and interpolation, image deposing, image enhancement, motion estimation, visual target tracking, etc. The image gradient may be built upon an image derivative (or a similar such function). A basic definition of the first-order derivative of a one-dimensional function ƒ(x) may be defined as:
The notation uses a partial derivative in order to keep the notation the same as when considering an image function of two (or more) variables, ƒ(x, y), which describes partial derivatives along the two (or more) spatial axes.
Similarly, a basic definition of the second order derivative of the one-dimensional function ƒ(x) may be defined as:
Several exemplary gradient estimation operators include the Prewitt operator, Sobel operator, Roberts operator, and Laplacian operator. The first three operators are capable of detecting the first-order derivative, and the Laplacian operator is a second-order derivative estimator. The corresponding image processing masks for these operators are shown in
It was determined that one reason for this structure identification limitation is that these operators employ a small spatial neighborhood for computing the gradients, therefore making it less aware of the existence of pixels in the local neighborhood that have a larger gradient. To reduce this limitation, a modified gradient estimation technique should include a larger spatial support together with determining a local maximum gradient.
A modified gradient technique may include initially low-pass filtering 100 and/or down-sampling 102 of an input image 104. The low-pass filtering 100 and down-sampling 102 reduces power consumption and also suppresses noise in the input image 104, which, if otherwise not modified, would result in erroneous results in subsequent gradient estimation.
For each input pixel p(x, y), along the horizontal axis, the modified image gradient technique performs a 1-dimensional search in two directions 106, towards the left and towards the right of the current point, to determine the local maximum gradient. This may be expressed as:
gleft(p)=max{g(p(x,y),p(x+i,y))|i=−1:−W}
gright(p)=max{g(p(x,y),p(x+i,y))|i=1:W} (2)
W represents the size of search window, which is preferably set to 3.
Along the vertical axis, similarly, the modified image gradient technique performs a 1-dimensional search in two directions 108, above and below the current point, to determine the local maximum gradient. This may be expressed as:
gup(p)=max{g(p(x,y),p(x,y+j))|j=−1:−W}
gdown(p)=max{g(p(x,y),p(x,y+j))|j=1:W} (3)
The gradient at pixel p(x,y) with respect to p(x+i,y) 110 may be defined as:
The gradient at pixel p(x,y) with respect to p(x,y+j) 112 may be defined as:
Where function ƒ(x,y) is the image intensity at pixel p(x,y). The gradient is generally the intensity difference between point p(x,y) and point p(x+i,y) weighted by their 1-dimensional spatial distance.
On both horizontal axis and vertical axis, the gradients from the two search directions are added 114 to obtain one horizontal and one vertical gradient respectively, as follows:
gx(p)=gleft(p)+gright(p)
gy(p)=gup(p)+gdown(p) (6)
The horizontal or vertical gradient, gx(p) or gy(p) is generally a second order derivative at pixel p(x,y). The resulting gradient (or the parts thereof) are preferably thresholded 116 in any suitable manner.
The proof for the gradients being a second order derivative is provided as follows (for the simplicity of notation, they coordinates are omitted). Suppose the local maximum gradient to the left of p(x) is gleft(p)=g(p(x), p(x−i)), i>0, and the local maximum gradient to the right of p(x) is gright(p)=g(p(x), p(x+j)), j>0, then
Based on the definition of second-order derivative, it may be observed that the technique estimates the second order derivative as the gradient.
The final gradient at pixel p(x, y) is given by the largest of the two gradient:
After the gradient at each pixel is obtained, it is typically thresholded to reduce the low-amplitude gradients from the image. By doing this, the excessive noisy gradients can be removed.
Referring to
Referring to
glap(x0)=ƒ(x1)+ƒ(x−1)−2ƒ(x0)≈10
However, with the described technique, a 1×7 window centered at x0 may be used to compute the local maximum gradient at x0, given by:
gnew(x0)=gleft(x0)+gright(x0)≈(−5)+30=25
Therefore with local maximum gradient search, the described technique is able to reduce those pixels with a small gradient from being detected as edges, and it can detect strong gradient in a local neighborhood which may not be detectable by other techniques. This results in a gradient map that is cleaner and includes more well defined object contours. Also, the larger spatial support in computing the gradients reduces the effects of noise.
The terms and expressions which have been employed in the foregoing specification are used therein as terms of description and not of limitation, and there is no intention, in the use of such terms and expressions, of excluding equivalents of the features shown and described or portions thereof, it being recognized that the scope of the invention is defined and limited only by the claims which follow.
Number | Name | Date | Kind |
---|---|---|---|
5212740 | Paek et al. | May 1993 | A |
5974197 | Lee et al. | Oct 1999 | A |
6167164 | Lee | Dec 2000 | A |
6226015 | Danneels | May 2001 | B1 |
6226050 | Lee | May 2001 | B1 |
6363168 | Kakuma | Mar 2002 | B1 |
6539060 | Lee et al. | Mar 2003 | B1 |
6862368 | He et al. | Mar 2005 | B1 |
6885771 | Takahashi | Apr 2005 | B2 |
6941016 | Wagman et al. | Sep 2005 | B1 |
6992798 | Geurts et al. | Jan 2006 | B2 |
7003161 | Tessadro | Feb 2006 | B2 |
7013047 | Schmidt | Mar 2006 | B2 |
7031525 | Beardsley | Apr 2006 | B2 |
7596243 | Paniconi | Sep 2009 | B2 |
8098334 | Huang et al. | Jan 2012 | B2 |
20030169375 | Lin et al. | Sep 2003 | A1 |
20100135588 | Au et al. | Jun 2010 | A1 |
Number | Date | Country |
---|---|---|
2005142171 | Jun 1993 | JP |
Number | Date | Country | |
---|---|---|---|
20110286671 A1 | Nov 2011 | US |