This invention relates to a method to identify and isolate a component or feature of a digital image (automated segmentation). More particularly, the invention relates to a method for isolating a feature of an image, such as the teeth, and modifying the isolated feature to show the anticipated effect of a treatment such as whitening or isolating a feature such as hair and modifying it to show the effect of a treatment such as coloring.
An image may be captured using any of a variety of methods, but most typically using a standard image capture device (e.g., a digital or web camera or a scanned photographic image might be used), and displayed “live” on a screen. In one embodiment of the invention a “targeting” area may be displayed on the screen, which helps standardize the size (distance from camera) and placement of the image. Once the image is captured, the software analyzes the image, placing each pixel into a color category. All pixels in a category will be part of a particular component or feature of the image, thus isolating and identifying that element.
In one embodiment a digital image of a human face is analyzed to identify pixels that represent the teeth. It identifies the teeth in the image, and then determines their current color and their likely color after a teeth-whitening treatment, which may be determined by a look-up-table or a simple conversion equation.
In another embodiment a digital image of a human head is analyzed to identify pixels that represent hair. It identifies the hair in the image, and then determines its current color. Additional software then uses that information to recommend what coloring products & processes to use to achieve a target color, or to simulate the result when a particular product & process are applied to the existing hair.
One manifestation of the invention is a device for capturing an image and locating a feature of the image using a segmentation program.
Another manifestation of the invention is a device as described above wherein the located feature is modified and redisplayed as part of the original image.
Another manifestation of the invention is a segmentation program for locating a feature of a photographic image.
A more specific manifestation of the invention is a device for capturing an image of a facial feature such as the teeth or hair, locating the facial feature using a segmentation program, modifying the facial feature to display the effect of a change in tooth whiteness or a change in hair color, and displaying the modified image. This device is particularly useful in selling cosmetics.
Another more specific manifestation of the invention is a segmentation program in which a K-means algorithm is used to classify the pixels in an image into color groups.
Another more specific manifestation of the invention is a segmentation program as described immediately above where the original image is blurred to merge smaller segments of the same feature in the image and a pixel representative of the desired feature is located in the blurred image.
Still another manifestation of the invention is a segmentation program in which the pixel identified as described above is grown into a mask using a connection definition.
As illustrated in
Once, the image is in the CIELAB color space, each pixel in the image is classified (Box B). The method for performing this operation is the K-Means algorithm. For a discussion of this algorithm see C. Therrien, Decision, Estimation, and Classification, John Wiley & Sons, N.Y., 1989, pp 217-218. K-Means is a classic iterative pattern recognition algorithm in which a set of data (i.e. the individual pixels) is optimally divided into a set of K classes (i.e. color groups). In this method, an optimization problem is solved to determine the class in which a pixel is contained (i.e. into which color group does the pixel best fit). The selection of the number of categories into which the pixels are classified depends upon the number of distinguishable features in the image or the portion of the image that is the object of the segmentation routine. For the case of teeth, the features may be skin, lips, and teeth (in this case K=3). Additional facial features that are spatially disjointed from the teeth (e.g. moustaches) can be either classified as teeth or non-teeth (e.g., skin or lips) groups. If the feature is incorrectly classified as teeth, the algorithm will keep the teeth and the additional facial feature separate due to their unconnectedness or separation in the image.
After executing the K-means algorithm, there may be several discontinuous or disjointed segments that are contained in the same class or color group. For example teeth and the specular reflection from glossy lips may be within the same class due to the fact that both will appear close to white. If one is only interested in modifying or adjusting the image of the teeth, then it is necessary to separate the teeth pixels from the lip-gloss pixels. If these two sections are not connected, then this separation can be achieved by first identifying a pixel that is within the desired section such as the teeth area. In one embodiment, assuming the desired feature represents the largest section of the image or the targeted area of the image areawise, this identification is achieved by performing a blurring operation on the original RGB image (Box C). The blurred image represents an averaging of the local pixels. Blurring insures that the pixel selected in Box D that is closest in color to the expected color of the desired feature will be in an area of the image that corresponds to the segmented or isolated feature and that the pixel selected is not an aberrant pixel in an area of the image that is not the element to be isolated. The blurring operation has the effect of smearing out the segments that are smaller than the blurring kernel, to the point where no feature including the segment's color is visible. For segments that are larger then the size of the blurring kernel, the color of the segment will remain enabling the determination of the location of one pixel in that segment. The size of the blurring kernel is selected to be smaller than the desired feature. To identify teeth, the expected value might be the whitest pixel. To identify the hair, the expected value might be determined empirically by collecting a set of hair images, blurring them, and computing the average pixel values across the set of images.
The picture shown in
Having identified the location of one pixel that is within the segment of the class to be separated from the image, to identify the rest of the desired feature, a mask is grown by adding to the identified pixel all pixels that are in this category and connected using a connection definition such as an 8-point connection definition which is well known in the art (Box E). The implementation of the growth algorithm is such that it is relatively insensitive to the connection definition. The details of this growth algorithm are given in the block diagram discussion. This mask is the output of Box E and is illustrated by the image shown in
The mask defines the area of interest. If the mask reaches any of the edges of the image or is outside of some defined region, then the area of interest was either not found or not entirely within the image. In this case, the user may be instructed to relocate the desired target (e.g., teeth) in the target box 18 as shown in
Below are provided the mathematical details of the process.
BOX A
via a modified Newton method where the CIELAB values are given by the vector sequence {ui}i=1N, the measured values from the camera (determined by averaging across a set of pixels) are given by {ci}i=1N, and the mapping from CIEXYZ to CIELAB is given by the function F. See G. Wyszecki, W. S. Stiles, Color Science: Concepts and Methods, Quantitative Data and Formaulae, John Wiley & Sons, N.Y., 1982, pp. 166-169. For example, for a Nikon CoolPix 990 camera, G is:
The output CIELAB image {circumflex over (F)}(I(x,y)) is provided as input to an iterative K-Means algorithm in Box B. Specifically the algorithm is as follows:
where kl is the value of the lth class, and kc is the class to which pixel d is assigned. In other words, pixel d is assigned to the class closest to it in terms of Euclidean distance.
where H(a,b) is the blur kernel. The size of the blur kernel should be smaller than the feature that is being detected, and the shape should be symmetric. An example of a symmetric H is given by the equation below, where N is the radius of the blur kernal:
H(a,b)=1/(2N+1)2 ∀−N≦a≦N & −N≦b≦N
BOX D
BOX H
The invention is illustrated in more detail by the following non-limiting example.
An image is shown in
The segmentation method can be used in a device in which various components are integrated into a cabinet 10 that is designed to hang on a wall or sit on a shelf or counter. The device includes a means for capturing a digital image—typically either a conventional web cam or digital camera 14, a display such as a conventional LCD-type color display 12, a source of light (not shown), and an internal processor that runs the subject software. In its idle state, the display screen can display introductory information (marketing-oriented messages, instructions, or continuous video of images as customers or individuals may see as they pass the camera), and invites the customer to activate the unit by pressing the “go” button 16 on the front of the unit. The go button activates the software. A live image of the subject—as captured by the camera—appears on the screen. The subject is instructed (using on screen prompts) to position himself or herself such that the area of interest (teeth or hair, for example) appears in the on-screen box 18, and to press the “go” button 16 again. Pressing the go button the second time freezes the image on the screen, and begins the segmentation and color analysis. The device segments the image, identifying the pixels of interest, measures the color at multiple points, and calculates an average color value. The device performs calculations based on the initial coloring and displays the result, in this case an image of the customer having whiter teeth or an alternative hair color. After an appropriate delay, the device returns to the idle state.
Having described the invention in detail and by reference to specific embodiments thereof, it will be apparent the numerous modifications and variations are possible without separating from the spirit and scope of the invention.
This application claims priority to Provisional Patent Application Ser. No. 60/368,472 filed Mar. 28, 2002.
Number | Name | Date | Kind |
---|---|---|---|
4032627 | Suchan et al. | Jun 1977 | A |
6210159 | Lehmann et al. | Apr 2001 | B1 |
6305933 | Lehmann | Oct 2001 | B1 |
6328567 | Morris et al. | Dec 2001 | B1 |
20030059090 | Zhang et al. | Mar 2003 | A1 |
20030081852 | Pohjola | May 2003 | A1 |
Number | Date | Country | |
---|---|---|---|
20030198384 A1 | Oct 2003 | US |
Number | Date | Country | |
---|---|---|---|
60368472 | Mar 2002 | US |