This disclosure relates generally to the field of image manipulation. More particularly, but not by way of limitation, it relates to techniques for enhancing and repairing images using data from other images.
Photography has been an innovative field since the earliest crude photographs were produced, developing from camera obscura and pinhole cameras to chemically-developed film cameras in the 19the century to digital cameras in the late 20the century. With digital photography has come an ability to manipulate images, providing capabilities not practical or possible with film. Individuals may easily create or collect libraries of thousands of digital images, using software to organize and manipulate images in those libraries. In addition to standalone imaging devices such as traditional cameras, imaging devices are now ubiquitous in a wide variety of other devices, including smartphones and tablet computers.
However, as any photographer knows, not every photograph is a good one. Sometimes the photograph includes an image of a person whose face is blocked, blurry, or whose eyes were closed when the photograph was taken. Sometimes the image may show a spot or temporary blemish on the person's nose or somewhere else visible in the image.
Other times, a photographer may take a picture at some location and for some reason the photograph was taken with an incorrect exposure setting or out of focus, or the photographer inadvertently moved the camera while taking the photograph. Sometimes a photograph may contain some blurry object, such as a random person walking the camera and the subject of the photograph just as the photograph was taken.
While image manipulation techniques exist that may be used to manipulate areas of a photograph, such as the ability to remove the “red-eye” effect when using a photographic flash unit caused by reflection of light by the person's eyes. Other techniques have allowed cloning tools or healing brush tools that use sample data from other points on the photograph to correct imperfections, causing them to blend with the surrounding area of the image. While useful, these existing tools are not always capable of repairing or correcting a photograph as much as would be desired.
A image manipulation technique allows a user to correct an image using samples obtained from other images. These samples may be obtained from one or more other images in a library of images. Matching techniques may identify an image that best matches the image to be corrected, or may aggregate or average multiple images that are identified as containing an area corresponding to the area to be corrected. Identification of the image or images to use as the source of the samples may be automatic or manual. The images may be from a library of images under the control of the user or from a library of images maintained by another person or service provider. Application of the samples to correct the image may be manually or automatically directed.
An image modification method is disclosed. The method includes selecting a first region of a first image; identifying automatically, from a database of images, one or more second images, wherein each of the one or more second images has a region corresponding to the first region; and modifying automatically the first region based, at least in part, on the corresponding region from at least one of the one or more second images.
A non-transitory program storage device is disclosed. The storage device is readable by a programmable control device and has instructions stored thereon to cause the programmable control device to receive an indication that identifies a first region of a first image; identify, from a database of images, one or more second images, wherein each of the one or more second images has a region corresponding to the first region; and modify the first region based, at least in part, on the corresponding region from at least one of the one or more second images.
An programmable device is disclosed. The device includes a programmable control device; a memory coupled to the programmable control device; and software stored in the memory. The software includes instructions causing the programmable control device to receive an indication that identifies a first region of a first image; identify automatically one or more second images from a collection of images, wherein each of the one or more second images has a region corresponding to the first region; and modify the first region automatically based, at least in part, on the corresponding region from at least one of the one or more second images.
In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the invention. It will be apparent, however, to one skilled in the art that the invention may be practiced without these specific details. In other instances, structure and devices are shown in block diagram form in order to avoid obscuring the invention. References to numbers without subscripts or suffixes are understood to reference all instance of subscripts and suffixes corresponding to the referenced number. Moreover, the language used in this disclosure has been principally selected for readability and instructional purposes, and may not have been selected to delineate or circumscribe the inventive subject matter, resort to the claims being necessary to determine such inventive subject matter. Reference in the specification to “one embodiment” or to “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiments is included in at least one embodiment of the invention, and multiple references to “one embodiment” or “an embodiment” should not be understood as necessarily all referring to the same embodiment.
By using samples from libraries of other images to correct or enhance an image automatically, additional capability becomes available to the photographer. Other images of a person may be used to correct or enhance an image of that person. Other images of a place may be used to enhance or correct an image of that place. The size of the sample data relative to the image to be corrected or enhanced is not limited. For example, a photograph of a landmark and the photographer's family might be manipulated by replacing the entirety of the photograph with another photograph, overlaying the photographer's family on the other photograph. Sample areas obtained from a source image may be included into a destination image at a resolution different from that of the destination image.
In block 140, if more than one other source image is included in the search results, a resulting source image may be selected using any desired technique, including techniques such as are described below regarding
Once one or more source images are selected, the selected source image or images may be processed to find corresponding facial features, and a source facial region corresponding to the destination facial region may be extracted as a sample data in block 150. The destination facial region may be replaced, at least in part, with the sample data from the one or more source images. Any desired techniques may be used to fit the sample data over the existing facial region, for smoothly stitching the sample data into the original image, such as morphing or healing brush techniques known to the art, which are not further described herein.
Where the source image and the destination image were exposed with different contrast ratios, the sample data may be adjusted to the contrast ratio of the destination image. Alternately, the contrast ratio of the sample data may not be adjusted to match that of the destination image, allowing a destination image with a less than optimum contrast ration to be enhanced by a sample data from a source image with a better contrast ratio. Other image modification techniques, such as gamma correction may be used to match the sample data to the destination image if desired.
The technique 100 may be performed automatically, identifying the facial region, selecting the source image or images, and performing the modification of the source facial region without user input. A user interface may provide an auto-enhance button or other similar user interaction element to trigger the performance of the technique 100.
Alternately, user interaction may be provided for at any stage, allowing the user to approve or disapprove any or all of the identification of the facial region, the selection of the source image or images, the extraction of the sample data, and the modification of the source image based on the sample data. User-configurable setting may provide additional constraints (e.g., allowable age ranges) as well as specifying what library of images to use.
Always described above, the entire facial region may be modified, more limited facial regions, such as a distinct facial feature (e.g., a nose), as well as other less distinct regions, such as simple skin regions.
Additional predetermined criteria may further constrain facial modification using any other desired information. For example, the technique may limit the available images in the library to those within a predefined age range, to avoid making inappropriate modifications. For example, the technique may be configured to avoid modifying an image of an 80-year-old woman with facial data from an image of that woman in her teens. Other criteria may be used, such as lighting information (e.g., do not use an image taken in fluorescent light to modify an image taken an incandescent light) and resolution (e.g., use only images taken with approximately similar resolution).
Other user-specifiable criteria may include shadowing, contrast, and exposure information, and information about the imaging device used to capture the images (e.g., the destination image was taken on an IPHONE® smartphone; only use source images taken by an IPHONE smartphone or an IPAD® tablet) (IPHONE and IPAD are registered trademarks of Apple Inc.). Temporal windows may provide constraints such as a constraint that limits source images to those taken at roughly the same age as the person represented in the destination image. Temporal windows may also be used to limit the source images to those taken in a narrow time range, such as a single photography session.
As described above, the technique 100 is used for correcting or enhancing a destination image. However, the same technique may be used for other purposes to modify facial regions of an image. The source images may be images of the person represented in the source image, but may also be images of other people. For example, a plastic surgeon may use the technique to provide a patient an idea of how the patient may look after surgery., and the selection process may either use a library of images of the patient (e.g., when doing reconstructive surgery, using images of the patient before the injury) or of other people (e.g., modifying the source image to give a cosmetic surgery patient an idea of how they may look after the surgery).
The number of images in the library may affect the usefulness of the technique. For example, a collection of images of a person with only a handful of images is less likely to produce as good a result as a library of hundreds or thousands of images. The library of images may be a personal library, or may be a library to which multiple people have submitted multiple images of multiple people.
The techniques described herein are not limited to modification of facial regions, but may be used for modifying any object represented in the source image, including other bodily features and inanimate objects.
The technique 200 would be useful for enhancing or correcting images of locations. For example, an image may be captured at a popular location. The image has bad exposure and not enough data to correct well using conventional techniques. Or there may be a blurry object in the image that the user may wish to remove, although the user may wish to remove non-blurry objects, as well. (e.g., the user may wish to remove an image of a bystander from a picture.) In this example, many other people have taken a photo in that same location at various times, and uploaded those photos to a shared library of images coupled to a server. The destination image may be automatically or manually enhanced or corrected based on sample data from the other photos that have been uploaded to the server. In this example, source images may be selected that were taken at approximately the same location. A location identifier, such as geodata information associated with the destination image, may be ascertained, then the search of the library of images may identify the source images by matching their location identifiers with the location identifier of the destination image. The sample data extracted from the source images may allow the various kinds of enhancements, including removal of blurry obstructions (e.g., walking passersby) and exposure correction (e.g., the destination image was underexposed, but the library has sample data on hundreds or thousands of correctly exposed photos of the same location at the same time of day, and may use sample data that to repair the photo). The modifications may include replacing data in the photo with better data from other one or more source images in the library. In a variant, the techniques may be used to inform an imaging device as to the best possible imaging attributes, such as exposure or level adjustments, to apply to the imaging, before the image is captured. (For example, a person standing at a very popular place on the edge of the Grand Canyon may be able to take a better image by obtaining sample data from thousands of other images captured at that time of day at that spot.)
The amount of the destination image that can be modified using sample data from the source images is not limited. Any portion or the complete image may be modified. For example, a simple snapshot taken at a popular tourist site may be enhanced, corrected, or even potentially completely replaced by sample data extracted from one or more images captured at the same popular tourist site.
Implementation in a Programmable Device
Storage device 414 may store media (e.g., image and video files), software (e.g., for implementing various functions on device 400), preference information, device profile information, and any other suitable data. Storage device 414 may include one more storage mediums for tangibly recording image data and program instructions, including for example, a hard-drive, permanent memory such as ROM, semi-permanent memory such as RAM, or cache. Program instructions may comprise a software implementation encoded in any desired language (e.g., C or C++).
Memory 412 may include one or more different types of memory which may be used for performing device functions. For example, memory 412 may include cache, ROM, and/or RAM. Communications bus 422 may provide a data transfer path for transferring data to, from, or between at least storage device 414, memory 412, and processor 416. Although referred to as a bus, communications bus. 422 is not limited to any specific data transfer technology. User interface 418 may allow a user to interact with the programmable device 400. For example, the user interface 418 can take a variety of forms, such as a button, keypad, dial, a click wheel, or a touch screen.
In one embodiment, the programmable device 400 may be an programmable device capable of processing and displaying media, such as image and video files. For example, the programmable device 400 may be a device such as such a mobile phone, personal data assistant (PDA), portable music player, monitor, television, laptop, desktop, and tablet computer, or other suitable personal device.
The storage device 414 may provide storage for the library of images described above. Alternately, an external library of images may be communicatively coupled to the programmable device 400, as illustrated below in
Implementation in a Networked System
It is to be understood that the above description is intended to be illustrative, and not restrictive. For example, the above-described embodiments may be used in combination with each other. Many other embodiments will be apparent to those of skill in the art upon reviewing the above description. The scope of the invention therefore should be determined with reference to the appended claims, along with the full scope of equivalents to which such claims are entitled.
Number | Name | Date | Kind |
---|---|---|---|
6574375 | Cullen | Jun 2003 | B1 |
6788809 | Grzeszczuk | Sep 2004 | B1 |
7804982 | Howard | Sep 2010 | B2 |
7970181 | Brandt | Jun 2011 | B1 |
8131116 | Motomura | Mar 2012 | B2 |
8208764 | Guckenberger | Jun 2012 | B2 |
8488893 | Itoh | Jul 2013 | B2 |
8665340 | Prentice | Mar 2014 | B2 |
8971612 | Shotton | Mar 2015 | B2 |
9172938 | Yoneda | Oct 2015 | B2 |
9386230 | Duran | Jul 2016 | B1 |
9413965 | Ryu | Aug 2016 | B2 |
9626747 | Dellinger | Apr 2017 | B2 |
20080196076 | Shatz | Aug 2008 | A1 |
20090238419 | Steinberg | Sep 2009 | A1 |
20090304239 | Itou | Dec 2009 | A1 |
20100158410 | Kusakabe | Jun 2010 | A1 |
20110102630 | Rukes | May 2011 | A1 |
20110292221 | Gu | Dec 2011 | A1 |
20120250937 | Corcoran | Oct 2012 | A1 |
20130246409 | Polansky | Sep 2013 | A1 |
20140126881 | Yoneda | May 2014 | A1 |
20170064205 | Choi | Mar 2017 | A1 |
Number | Date | Country | |
---|---|---|---|
20170195557 A1 | Jul 2017 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 13454782 | Apr 2012 | US |
Child | 15449286 | US |