The present disclosure generally relates to the field of digital image processing, and more specifically, to methods of estimating illuminations in images.
Many current information systems, such as map applications, store large quantities of digital images that provide views of the surrounding area for a given map location. Many of these images include undesirable characteristics such as uneven ambient illumination that degrade the content of the images. For example, some map systems contain aerial images which often contain undesirable shadows caused by clouds or other objects. Thus, such images may include an uneven illumination caused by objects which occlude the light source of the image. The presence of the uneven illumination in the images thereby results in undesirable characteristics due to the lack of uniformity of color intensity in the images. As a result, these systems provide users with images of less than desirable quality.
The problems described above are met by a computer-implemented method, a non-transitory computer-readable storage medium and a computer system for removing illumination variation from an image. One embodiment of the computer-implemented method comprises selecting an image containing varying illumination across a plurality of pixels within the image. Patches of pixels from among the plurality of pixels within the image are identified. Similarities between pairs of patches of pixels are calculated based on pixel intensities associated with the pairs of patches of pixels. Illumination values for the plurality of pixels within the image are calculated based on the calculated similarities between the pairs of patches of pixels. The illumination variation from the image is removed based on the calculated illumination values for the plurality of pixels within the image.
Embodiments of the non-transitory computer-readable storage medium store computer-executable code for removing illumination variation from an image. The code is executable to perform steps comprising selecting an image containing varying illumination across a plurality of pixels within the image and identifying patches of pixels from among the plurality of pixels within the image. The steps further include calculating similarities between pairs of patches of pixels based on pixel intensities associated with the pairs of patches of pixels and calculating illumination values for the plurality of pixels within the image based on the calculated similarities between the pairs of patches of pixels. Furthermore, the steps comprise removing the illumination variation from the image based on the calculated illumination values for the plurality of pixels within the image.
Embodiments of the computer system for removing an illumination variation from an image comprise a computer processor and a non-transitory computer-readable storage medium storing executable instructions configured to execute on the computer processor. The instructions when executed by the computer processor are configured to perform steps comprising selecting an image containing varying illumination across a plurality of pixels within the image and identifying patches of pixels from among the plurality of pixels within the image. The steps further include calculating similarities between pairs of patches of pixels based on pixel intensities associated with the pairs of patches of pixels and calculating illumination values for the plurality of pixels within the image based on the calculated similarities between the pairs of patches of pixels. Furthermore, the steps comprise removing the illumination variation from the image based on the calculated illumination values for the plurality of pixels within the image.
The features and advantages described in the specification are not all inclusive and, in particular, many additional features and advantages will be apparent to one of ordinary skill in the art in view of the drawings, specification, and claims. Moreover, it should be noted that the language used in the specification has been principally selected for readability and instructional purposes, and may not have been selected to delineate or circumscribe the inventive subject matter.
The figures depict, and the detailed description describes, various non-limiting embodiments for purposes of illustration only. One skilled in the art will readily recognize from the following discussion that alternative embodiments of the structures and methods illustrated herein may be employed without departing from the principles described herein.
System Architecture
An illuminated image can be modeled as an underlying non-illuminated image combined with an illumination map that describes the illumination at each pixel location. Note that the illumination described by the illumination map may vary across the illuminated image. For example, part of the illuminated image can be in shadow and therefore have decreased illumination relative to the remainder of the image. Under this formulation, moreover, an “illumination” can describe both a shadow, where light is occluded from an object, and a bright area. Shadows and bright areas are simply variations in the illumination map.
Thus, this description uses the term “illumination” to encompass variations in pixel intensity caused by occlusions. An occlusion-type illumination (i.e., a shadow) can be present in an image, for example, when sunlight incident upon an object (e.g., the ground) shown in the image is partially occluded by a cloud or other object. Thus, the occlusion results in a portion of the image being darker than other portions.
In one embodiment, the observed pixel intensity Ĩ of a given pixel in an illuminated image at location (x,y) is based on the following factors: the luminance (illumination) L̃ at that pixel and the reflectance R̃ of the surface shown at that pixel.
The relationship of these factors to each other is represented by the following equations:
$\tilde{I}_p = \tilde{L}_p \cdot \tilde{R}_p \qquad (1)$

$I_p = \log \tilde{I}_p = \log \tilde{L}_p + \log \tilde{R}_p = L_p + R_p \qquad (2)$
In equation 1, the observed intensity Ĩp for pixel p is expressed in the intensity domain. As shown in equation 1, the observed intensity Ĩp for pixel p in the intensity domain is the product of the luminance L̃p of the pixel p and the reflectance R̃p of pixel p. In contrast, the observed intensity Ip for pixel p is expressed in the log domain in equation 2. As shown in equation 2, the observed intensity Ip for pixel p in the log domain is the sum of the log of the luminance L̃p and the log of the reflectance R̃p of pixel p. In the following discussion, the observed intensities for pixels will be discussed with respect to the log domain.
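For illustration only, the following sketch shows the log-domain conversion of equation 2 in Python (this is not part of the disclosed embodiments; the function names, the epsilon guard against log(0), and the use of NumPy are assumptions):

import numpy as np

def to_log_domain(intensity_image, eps=1e-6):
    """Convert observed intensities (equation 1) to the log domain (equation 2).

    In the log domain the observed intensity I_p is the sum of the
    illumination L_p and the reflectance R_p, so illumination can later be
    removed by subtraction rather than division.
    """
    return np.log(intensity_image.astype(np.float64) + eps)

def recover_reflectance(log_intensity, log_illumination):
    """Given an estimated illumination map L (log domain), recover R = I - L."""
    return log_intensity - log_illumination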
Generally, the image processing server 100 estimates values of illumination (i.e., illumination intensities) present in an illuminated image using an information theoretic approach that recognizes that any illumination change in an image tends to increase the diversity of observed texture intensities of the reflectance of the image. In one embodiment, the texture at a given pixel location is described as the appearance of the local neighborhood around that location. The information theoretic interpretation of this effect is that the entropy of the texture is increased. Similarly, the presence of texture in the image increases the entropy of the illumination function, which is otherwise mostly smooth except at illumination boundaries.
In one embodiment, the image processing server 100 estimates an illumination value L of a pixel p from an illuminated image based on the illumination values of randomly selected pixels from the image and the illumination values of neighboring pixels of pixel p. In one embodiment, the image processing server 100 minimizes an energy function using the illumination values of the randomly selected pixels and the illumination values of the neighboring pixels to determine the illumination value L of pixel p. Thus, the image processing server 100 can estimate the illumination values L that contribute to the pixel intensities Ip in an illuminated image. Alternatively, the illumination value L of a pixel p may be estimated based on all pixels in the illuminated image.
In one embodiment, the image processing server 100 enhances an illuminated image by removing illumination variation from the image based on the estimated illumination values of the pixels in the image. That is, the image processing server 100 corrects the illumination variation present in the image so that the intensity of the illumination in the image is uniform. By correcting the illumination variation in the image, the image processing server 100 removes any shadows that were previously present in the image. Thus, the values of illumination in the illumination map are zero or as close to zero as possible. In other words, the variation in the illumination map is minimized as much as possible. Values of the illumination map may be equal to zero (in the log domain) if the illumination of the brightest part of an illuminated image is set to zero.
The image processing server 100 comprises an initialization module 101, a training module 103, a similarity module 105, an image enhancement module 107, an optimization module 109, and an image database 111. These modules are described in turn below.
The image database 111 stores a set of illuminated images. The term “image” as employed herein includes image data in general, such as individual still image files or frames of images from a video file. In one embodiment, the images in the image database 111 may be obtained from a variety of sources, such as from an organization producing aerial or street view images of a portion of a city for use with a map system, such as GOOGLE MAPS, STREET VIEW or GOOGLE EARTH. Likewise, images (including videos) may be obtained from users, e.g. as submitted by a user uploading images to a photo repository such as PICASA WEB, or a video repository such as YOUTUBE or GOOGLE VIDEO.
In one embodiment, at least some of the illuminated images stored in the image database 111 are aerial images. An aerial image is an image taken from an elevated position above the ground, such as from an aircraft. These aerial images may include shadows caused by clouds or other objects. In one embodiment, at least some of the illuminated images stored in the image database 111 exhibit a repetitive pattern. For example, an image may show a façade of a building with a repeating pattern of windows, or an image may show a rug that exhibits a repetitive pattern in its design.
Additionally, the image database 111 stores enhanced versions of the illuminated images. In one embodiment, an enhanced version of an illuminated image comprises only the underlying non-illuminated image. That is, the image database 111 stores versions of the images with the variations in the illumination map removed, so that the illumination across the image is uniform and any shadows or other illumination-related artifacts are absent.
The initialization module 101 initializes the image processing server 100 with initial illumination values that can be used to remove illuminations from illuminated images. For a given image, the initialization module 101 determines an initial illumination value that can be used to remove the illuminations from the image. For a given pixel in the image, the image processing server 100 attempts to improve upon the initial illumination value determined by the initialization module 101 until an illumination value L is determined that removes or reduces the illumination from the pixel.
Generally, the initialization module 101 automatically specifies, for an illuminated image, a mask comprising only non-illuminated (e.g., non-shadowed) pixels. The initialization module 101 constrains the illumination for the pixels in the mask such that the luminance L is zero.
To identify the mask, the initialization module 101 applies k-means clustering (e.g., k is equal to 4) to the illuminated image. As a result, the illuminated image is divided into a plurality of pixel clusters where each cluster has an associated scalar illumination value corresponding to the average scalar illumination value of the pixels in the cluster. The initialization module 101 identifies a pixel cluster from the plurality of pixel clusters that represents non-illuminated pixels of the image based on the scalar illumination values of the clusters.
In one embodiment, for façade images with periodic patterns, the initialization module 101 applies a box filter to the image to divide the image into a plurality of pixel clusters. For an illuminated façade image, the initialization module applies the box filter to the image and determines the average scalar illumination value of the pixels within the box defined by the filter. The initialization module 101 identifies the plurality of pixel clusters based on the average scalar illumination values of the illuminated façade image. According to one embodiment, the size of the box filter in the x and y dimensions (e.g., 15 pixels by 17 pixels) is equivalent to the period of the repetitive pattern in the x and y dimensions for the illuminated image.
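As a rough sketch of this clustering step (the use of scikit-learn's KMeans, the helper name, and the random seed are assumptions; the text does not prescribe an implementation), the per-pixel log intensities can be clustered and each cluster's mean retained as its scalar illumination value:

import numpy as np
from sklearn.cluster import KMeans

def cluster_pixels(log_intensity, k=4, seed=0):
    """Cluster the pixels of an (H, W) log-intensity image into k clusters.

    Returns the (H, W) cluster label map and the mean log intensity of each
    cluster, which serves as that cluster's scalar illumination value.
    """
    h, w = log_intensity.shape
    km = KMeans(n_clusters=k, n_init=10, random_state=seed)
    labels = km.fit_predict(log_intensity.reshape(-1, 1)).reshape(h, w)
    cluster_means = np.array([log_intensity[labels == c].mean() for c in range(k)])
    return labels, cluster_means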
To identify the pixel cluster representing non-illuminated pixels of the image, the initialization module 101 extracts features from the plurality of clusters to form feature vectors that describe the clusters. In one embodiment, the extracted features comprise the mean scalar illumination value Li of each cluster i considered together with the illumination values Lj of the other clusters j, where cluster i ≠ cluster j. The intuition is that if the selected pixel cluster i represents the non-illuminated cluster, the illumination value Li and the illumination value Lj together would normalize the pixels from the other clusters, including illuminated clusters, to the right intensity.
When applied to an image—or more specifically, to the feature vectors of the image—the initialization module 101 generates information that describes a measure of how strongly each cluster represents a cluster of non-illuminated pixels. To generate the information, the initialization module 101 applies a set of weights that are associated with features of non-illuminated pixels to a feature vector. Based on the applied weights and the feature vector, the initialization module 101 calculates a score, which may be a Boolean score representing whether or not a given cluster represents a non-illuminated pixel cluster. Alternatively, the score is a real number (e.g., ranging from 0.0 to 1.0), integer, or other scalar value representing a measure of likelihood or probability that the cluster represents a non-illuminated pixel cluster. In one embodiment, scores exceeding some threshold indicate that the cluster represents a non-illuminated pixel cluster. In one embodiment, the cluster with the highest score that exceeds the threshold is considered the non-illuminated pixel cluster, and its scalar illumination value is used as the initial illumination value determined by the initialization module 101.
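A minimal sketch of this scoring step follows (the linear scoring form, the threshold value, and all names are assumptions; the learned weights are produced by the training described below):

import numpy as np

def score_clusters(feature_vectors, weights, threshold=0.5):
    """Apply learned weights to each cluster's feature vector and return the
    index of the highest-scoring cluster whose score exceeds the threshold,
    or None if no cluster qualifies as the non-illuminated cluster."""
    scores = feature_vectors @ weights   # one scalar score per cluster
    best = int(np.argmax(scores))
    return best if scores[best] > threshold else None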
In one embodiment, the training module 103 trains the initialization module 101 to identify pixel clusters that represent non-illuminated pixels using a training set of illuminated images (e.g., 90 images). Each image from the training set includes an indication (e.g., a label) of non-illuminated (e.g., non-shadowed) portions of the image. The training module 103 identifies pixel clusters of the image and extracts the features of the image as previously described above. In one embodiment, the training module 103 assigns a positive label (e.g., +1) to the dominant pixel cluster (i.e., the largest pixel cluster) representing the non-illuminated portion of the image which was designated in the image and assigns a negative label (e.g., −1) to all other clusters. The pixel clusters assigned the negative label include other pixel clusters representing non-illuminated portions of the image and other illuminated portions of the image which are not dominant.
The training module 103 applies a training algorithm to the initialization module 101 to learn the set of weights on the features of the training set that are associated with the dominant pixel cluster as previously described above so that the initialization module 101 can recognize non-illuminated portions of illuminated images. In one embodiment, the training algorithm is AdaBoost, the details of which are known to one having ordinary skill in the art.
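Purely as an illustrative sketch (the text names AdaBoost but does not prescribe a library; the estimator count and the use of scikit-learn are assumptions), the labeled cluster feature vectors could be used to train an off-the-shelf boosted classifier:

from sklearn.ensemble import AdaBoostClassifier

def train_cluster_classifier(feature_vectors, labels):
    """Train a boosted classifier on cluster feature vectors.

    feature_vectors: (n_clusters_total, n_features) array gathered from all
        training images.
    labels: +1 for the dominant non-illuminated cluster of each training
        image, -1 for every other cluster.
    """
    clf = AdaBoostClassifier(n_estimators=50, random_state=0)
    clf.fit(feature_vectors, labels)
    return clf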
The similarity module 105 determines the similarity of pixels in illuminated images. The similarity of pixels in an illuminated image is used to determine the illumination values that cause the illumination in the image. The similarity of pixels can be used to determine the illumination values because it is assumed that, for pixels with a similar underlying reflectance (i.e., similar pixels), the difference in pixel intensity between an illuminated pixel and a non-illuminated pixel in the log domain (or their ratio in the intensity domain) is due to the difference in the illumination values of the pixels.
To determine the similarity, the similarity module 105 identifies local pixel neighbors for each pixel i in an illuminated image. In one embodiment, the local pixel neighbors of a given pixel i comprise pixels that are spatially located directly adjacent to pixel i. In one embodiment, pixel i and its local pixel neighbors form a patch Pi with pixel i located at the center of patch Pi.
Using these definitions, the similarity module 105 calculates the reflectance Ri for the pixel i located at the center of patch Pi according to the following equation:

$R_i = I_i - L_{P_i} \qquad (3)$
In equation 3, the reflectance Ri for the pixel i located at the center of the patch Pi is a function of the intensity Ii of the pixel (which is a known value) and the illumination value LPi of the patch. Note that equation 3 assumes that the illumination value LPi is constant for patch Pi except at the boundaries of the patch. That is, equation 3 assumes that the illumination value LPi is constant across all the pixels included in patch Pi.
In one embodiment, the similarity module 105 identifies a plurality of random pixels j and their local pixel neighbors to compare against pixel i. In one embodiment, the random pixels j represent non-illuminated pixels (i.e., non-shadowed). Each random pixel j and its local pixel neighbors collectively form a patch Pj as previously described above. The similarity module 105 calculates the reflectance Rj for the pixel j located at the center of each patch Pj as described with respect to equation 3.
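The patch and reflectance computations of equation 3 might be sketched as follows (the patch radius, helper names, and the random sampling strategy are assumptions; a radius of 1 corresponds to the directly adjacent neighbors described above):

import numpy as np

def extract_patch(log_intensity, y, x, radius=1):
    """Return the patch formed by pixel (y, x) and its local pixel neighbors."""
    return log_intensity[y - radius:y + radius + 1, x - radius:x + radius + 1]

def center_reflectance(log_intensity, log_illumination, y, x):
    """Reflectance of the patch's center pixel, R_i = I_i - L_Pi (equation 3),
    assuming the illumination value is constant over the patch."""
    return log_intensity[y, x] - log_illumination[y, x]

def sample_random_pixels(mask, n, rng):
    """Sample n random pixel coordinates, e.g. from the non-illuminated mask."""
    ys, xs = np.nonzero(mask)
    idx = rng.choice(len(ys), size=min(n, len(ys)), replace=False)
    return list(zip(ys[idx], xs[idx]))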
The similarity module 105 compares pairs of patches to determine the similarity of the patches with respect to reflectance. That is, the similarity module compares patch Pi with each of the plurality of patches Pj to determine the similarity C(Pi,Pj) between patch Pi and patch Pj. In one embodiment, the similarity C(Pi,Pj) between patch Pi and patch Pj describes the difference in reflectance of the patches Pi and Pj. In one embodiment, the similarity C(Pi,Pj) between patch Pi and patch Pj is represented by the following equation:
$C(P_i, P_j) = |R_i - R_j|^2 \qquad (4)$
As shown in equation 4, the similarity module 105 may determine the similarity C(Pi,Pj) between patch Pi and patch Pj based on the difference of the reflectance Ri of pixel i from patch Pi and the reflectance Rj of pixel j from patch Pj.
Alternatively, the similarity C(Pi,Pj) between patch Pi and patch Pj is represented by the following equation:
$C(P_i, P_j) = \sum_k \left| R_{i,k} - R_{j,k} \right|^2 \qquad (5)$

where k represents each corresponding pixel from patch Pi and patch Pj, and Ri,k and Rj,k denote the reflectance values of pixel k in patch Pi and patch Pj, respectively.
In equation 5, the similarity module 105 determines the similarity C(Pi,Pj) between patch Pi and patch Pj based on the reflectance value of all pixels in patch Pi and patch Pj. Equation 5 illustrates that the similarity C(Pi,Pj) between patch Pi and patch Pj is based on the summation of the difference of reflectance values across corresponding pairs of pixels from patch Pi and patch Pj. For example, for each pixel k in patch Pi and its corresponding pixel k in patch Pj, the similarity module 105 determines the difference of the reflectance values of the pair of pixels. The similarity module 105 then sums the square of the differences.
In another embodiment, the similarity C(Pi,Pj) between patch Pi and patch Pj is represented by the following equation:
$C(P_i, P_j) = |\mathrm{Hist}(P_i) - \mathrm{Hist}(P_j)|^2 \qquad (6)$
The similarity module 105 may determine a histogram of the reflectance values of pixels within patch Pi which is represented by Hist(Pi) in equation 6. The histogram describes the distribution of the reflectance values of pixels within patch Pi. The similarity module 105 also determines a histogram of the reflectance values of pixels within patch Pj which is represented by Hist(Pj). In equation 6, the similarity C(Pi,Pj) between patch Pi and patch Pj is based on the difference between the distribution of the reflectance values of pixels within patch Pi and patch Pj.
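The three similarity variants of equations 4 through 6 could be sketched compactly as below (the histogram bin count and value range are assumptions; the inputs are the per-pixel reflectance values of the two patches):

import numpy as np

def similarity_center(r_i, r_j):
    """Equation 4: squared difference of the center-pixel reflectances."""
    return float((r_i - r_j) ** 2)

def similarity_pixelwise(patch_r_i, patch_r_j):
    """Equation 5: sum of squared differences over corresponding pixels k."""
    return float(np.sum((patch_r_i - patch_r_j) ** 2))

def similarity_histogram(patch_r_i, patch_r_j, bins=16, value_range=(-6.0, 0.0)):
    """Equation 6: squared difference between the reflectance histograms."""
    h_i, _ = np.histogram(patch_r_i, bins=bins, range=value_range)
    h_j, _ = np.histogram(patch_r_j, bins=bins, range=value_range)
    return float(np.sum((h_i - h_j) ** 2))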
In one embodiment, the similarity module 105 determines a weight Wij for the similarity C(Pi,Pj) between patch Pi and patch Pj. The weight Wij describes the influence that the similarity C(Pi,Pj) between patch Pi and patch Pj has on the determination of the luminance value used to remove the illumination from pixel i. In one embodiment, the weight Wij is represented by the following equation:
$W_{ij} = e^{-C(P_i, P_j)} \qquad (7)$
Because the weight is based on a decaying exponential function, the similarity module 105 assigns a higher weight to the similarity C(Pi,Pj) of patch Pi and patch Pj if the patches are similar (i.e., a smaller value of C(Pi,Pj)) and the similarity module 105 assigns a lower weight to the similarity C(Pi,Pj) of patch Pi and patch Pj if the patches are not similar (i.e., a larger value of C(Pi,Pj)).
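Equation 7 reduces to a one-line helper; the scale parameter sigma below is included only as an assumption, since the extracted text shows the exponent simply as -C(Pi,Pj):

import numpy as np

def patch_weight(similarity_cost, sigma=1.0):
    """Decaying-exponential weight of equation 7: a small similarity cost
    (similar patches) yields a weight near 1, a large cost a weight near 0."""
    return float(np.exp(-similarity_cost / sigma))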
Referring now to an example, consider a pixel p1753 in an illuminated image. The similarity module 105 identifies the local pixel neighbors of pixel p1753, which together with pixel p1753 form patch P1753.
As described previously, the similarity module 105 also identifies random pixels and their associated patches for comparison against a given pixel. In this example, the similarity module 105 randomly selects pixels p1352 and p2233 for comparison against pixel p1753.
The similarity module 105 respectively identifies patch P1352 and patch P2233 as the patches formed by pixel p1352 and pixel p2233 together with their local pixel neighbors. The similarity module 105 calculates the reflectance values R1753, R1352, and R2233 for pixels p1753, p1352, and p2233 according to equation 3.
Based on these reflectance values, the similarity module 105 determines the respective similarities C(P1753,P1352) and C(P1753,P2233) between patch P1753 and patches P1352 and P2233 according to equation 4, 5, or 6, and their associated weights according to equation 7.
Referring back to the optimization module 109, the optimization module determines the illumination values L for the pixels of the illuminated image by minimizing an energy function E. In one embodiment, the energy function E is of the form:

$E(L) = \sum_i \sum_j W_{ij} \, (L_i - L_j - I_{ij})^2 \; + \; \sum_i \sum_{k \in N(i)} V_{ik} \, (L_i - L_k)^2$

where:
Li and Lj are the illumination values of pixel i and of a randomly selected pixel j;
Iij is the difference between the observed (log-domain) intensities of pixel i and pixel j;
Wij is the similarity weight of equation 7;
N(i) denotes the local pixel neighbors of pixel i; and
Vik is a smoothness weight between pixel i and its neighboring pixel k.
As shown above, the energy E is a sum of two terms. The first term measures the difference between the luminance values of a given pixel i of patch Pi and a random pixel j from patch Pj, summed across the plurality of randomly selected pixels j for pixel i. The weight Wij controls the influence that the similarity of the patches containing pixel i and pixel j has on the determination of the value of Li. Typically, the intensity difference Iij between pixel i and pixel j is known and is subtracted from the difference between the luminance values of pixel i and pixel j.
In one embodiment, the second term of the energy function smoothes the illumination values within the region of the image corresponding to patch Pi (i.e., the local pixel neighbors of pixel i). The optimization module 109 operates under the assumption that the illumination values of pixels within patch Pi should be of equivalent or similar intensity. In the second term of the energy function, the optimization module 109 compares the illumination value of pixel i with the illumination values of its local pixel neighbors k. The second term of the energy function is based on the summation of the differences between the illumination value Li of pixel i and the illumination values Lk of the neighboring pixels.
Furthermore, the second term of the energy function E is multiplied by a weight Vik. In one embodiment, weight Vik controls the smoothing factor of the illumination values represented by the second term of the energy function E and is based on a decaying exponential function. Because the weight Vik is based on a decaying exponential function, the smaller the variance in the luminance values between pixel i and neighboring pixels k, the higher the weight Vik.
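A corresponding sketch of the smoothness weight follows (its exact functional form and the sigma parameter are assumptions consistent with the decaying-exponential description above):

import numpy as np

def smoothness_weight(intensity_i, intensity_k, sigma=0.1):
    """Decaying-exponential weight V_ik: the closer the observed intensities of
    pixel i and its neighbor k, the stronger the smoothing between them."""
    return float(np.exp(-((intensity_i - intensity_k) ** 2) / sigma))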
In one embodiment, to solve for the illumination values Li, the optimization module 109 initializes the solution for the energy function E using the initial luminance values determined by the initialization module 101. The optimization module 109 solves for the illumination values Li using the initial luminance values and iteratively optimizes the energy function E for each pixel location to solve for the illumination value that minimizes the energy function for all locations in order to improve upon the values provided by the initialization module 101.
In one embodiment, the optimization module 109 applies the technique of iteratively re-weighted least squares to solve for the illumination values thereby producing sharper changes in the illumination map. The optimization module 109 creates a matrix based on the weights Wij and Vik that represents the illumination map responsive to solving the energy function. The matrix comprises illumination values for every pixel location in the illuminated image. The illumination values included in the matrix are associated with the weights indicative of the most similar pixels (e.g., the weights with the highest value). Each illumination value in the matrix describes the intensity of the illumination at a particular location in the image.
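The following is a much-simplified sketch of such an optimization (a casting of the energy above as a sparse, iteratively re-weighted least-squares problem; the solver choice, the re-weighting rule, the constraint weight, and all names are assumptions rather than the disclosed implementation):

import numpy as np
from scipy.sparse import coo_matrix
from scipy.sparse.linalg import lsqr

def solve_illumination(n_pixels, pair_terms, smooth_terms, mask_idx,
                       l_init, n_irls_iters=3, eps=1e-3):
    """Estimate per-pixel log illumination L by minimizing
        sum_ij W_ij (L_i - L_j - I_ij)^2 + sum_ik V_ik (L_i - L_k)^2
    with L held near 0 on the non-illuminated mask pixels.

    pair_terms:   list of (i, j, I_ij, W_ij) similarity terms
    smooth_terms: list of (i, k, V_ik) smoothness terms
    mask_idx:     indices of pixels constrained to zero illumination
    """
    L = np.asarray(l_init, dtype=float).copy()
    for _ in range(n_irls_iters):
        rows, cols, vals, rhs = [], [], [], []
        row = 0

        def add_row(entries, target, weight):
            nonlocal row
            w = np.sqrt(weight)
            for col, coeff in entries:
                rows.append(row)
                cols.append(col)
                vals.append(w * coeff)
            rhs.append(w * target)
            row += 1

        # Similarity terms: L_i - L_j should match the observed intensity difference I_ij.
        for i, j, I_ij, W_ij in pair_terms:
            residual = abs(L[i] - L[j] - I_ij)
            add_row([(i, 1.0), (j, -1.0)], I_ij, W_ij / (residual + eps))  # IRLS re-weighting
        # Smoothness terms: neighboring illumination values should agree.
        for i, k, V_ik in smooth_terms:
            residual = abs(L[i] - L[k])
            add_row([(i, 1.0), (k, -1.0)], 0.0, V_ik / (residual + eps))
        # Soft constraint: mask pixels have zero (log-domain) illumination.
        for m in mask_idx:
            add_row([(m, 1.0)], 0.0, 1e3)

        A = coo_matrix((vals, (rows, cols)), shape=(row, n_pixels)).tocsr()
        L = lsqr(A, np.array(rhs))[0]
    return L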
The image enhancement module 107 enhances illuminated images based on the optimized illumination values described in the matrix. In one embodiment, the image enhancement module 107 enhances an illuminated image by removing illumination variations from the image. To remove the illumination variation (i.e., correct the varying illumination in the image), the image enhancement module 107 multiplies the pixel intensity of each pixel in the illuminated image in the intensity domain by the inverse of the optimized illumination value for that pixel as determined by the optimization module 109, thereby removing the value of the illumination component from the pixel. By multiplying the pixel intensities by the inverses of the optimized illumination values, the image enhancement module 107 removes the illumination map component of the illuminated image. The enhanced image has uniform illumination such that any shadows or other illumination-related artifacts are absent from the image. The image enhancement module 107 stores the enhanced image in the image database 111.
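A brief sketch of this correction step (assuming the estimated illumination map is in the log domain, matching equation 2, so that its exponential is the multiplicative illumination of equation 1):

import numpy as np

def remove_illumination(intensity_image, log_illumination):
    """Multiply each observed intensity by the inverse of its estimated
    illumination (equivalently, divide by exp(L)), leaving the reflectance."""
    return intensity_image / np.exp(log_illumination)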
The following describes one embodiment of a method performed by the image processing server 100 to remove illumination variation from an image.
In one embodiment, the image processing server 100 selects 401 from the image database 111 an illuminated image comprising varying illumination. The selected image may represent an aerial image, an image exhibiting a repetitive pattern, or any other image that includes varying illumination. The image processing server 100 calculates 403 an initial illumination value that is used as a starting point for determining the illumination values of the selected image.
For each pixel in the image, the image processing server 100 identifies 405 patches of pixels for the given pixel. The patches may include a patch associated with the pixel's local neighbor pixels and patches of randomly selected pixels. The image processing server 100 calculates 407 the similarity between the patches. That is, the image processing server 100 calculates the similarity of pairs of patches, where each pair includes a patch comprising the given pixel and a patch associated with a randomly selected pixel. The image processing server 100 then optimizes 409 an energy function based on the calculated similarities to produce an illumination value for each pixel in the image. The image processing server 100 removes 411 the illumination variation from the image by multiplying the intensity of each pixel by the inverse of its corresponding illumination value determined in step 409.
The storage device 508 is any non-transitory computer-readable storage medium, such as a hard drive, compact disk read-only memory (CD-ROM), DVD, or a solid-state memory device. The memory 506 holds instructions and data used by the processor 502. The pointing device 514 may be a mouse, track ball, or other type of pointing device, and is used in combination with the keyboard 510 to input data into the computer system 500. The graphics adapter 512 displays images and other information on the display 518. The network adapter 516 couples the computer system 500 to a local or wide area network.
As is known in the art, a computer 500 can have different and/or other components than those described above.
The disclosure herein has been described in particular detail with respect to one possible embodiment. Those of skill in the art will appreciate that other embodiments may be practiced. First, the particular naming of the components and variables, capitalization of terms, the attributes, data structures, or any other programming or structural aspect is not mandatory or significant, and the mechanisms that implement the invention or its features may have different names, formats, or protocols. Also, the particular division of functionality between the various system components described herein is merely exemplary, and not mandatory; functions performed by a single system component may instead be performed by multiple components, and functions performed by multiple components may instead be performed by a single component.
Some portions of the above description present features in terms of algorithms and symbolic representations of operations on information. These algorithmic descriptions and representations are the means used by those skilled in the data processing arts to most effectively convey the substance of their work to others skilled in the art. These operations, while described functionally or logically, are understood to be implemented by computer programs. Furthermore, it has also proven convenient at times to refer to these arrangements of operations as modules or by functional names, without loss of generality.
Unless specifically stated otherwise as apparent from the above discussion, it is appreciated that throughout the description, discussions utilizing terms such as “determining” or “displaying” or the like, refer to the action and processes of a computer system, or similar electronic computing device, that manipulates and transforms data represented as physical (electronic) quantities within the computer system memories or registers or other such information storage, transmission or display devices.
Certain aspects of the embodiments disclosed herein include process steps and instructions described herein in the form of an algorithm. It should be noted that the process steps and instructions could be embodied in software, firmware or hardware, and when embodied in software, could be downloaded to reside on and be operated from different platforms used by real time network operating systems.
The algorithms and operations presented herein are not inherently related to any particular computer or other apparatus. Various general-purpose systems may also be used with programs in accordance with the teachings herein, or it may prove convenient to construct more specialized apparatus to perform the required method steps. The required structure for a variety of these systems will be apparent to those of skill in the art, along with equivalent variations. In addition, the present invention is not described with reference to any particular programming language. It is appreciated that a variety of programming languages may be used to implement the teachings of the present invention as described herein, and any references to specific languages are provided for purposes of enablement and disclosure of the best mode of the present invention.
The embodiments disclosed herein are well suited to a wide variety of computer network systems over numerous topologies. Within this field, the configuration and management of large networks comprise storage devices and computers that are communicatively coupled to dissimilar computers and storage devices over a network, such as the Internet.
Finally, it should be noted that the language used in the specification has been principally selected for readability and instructional purposes, and may not have been selected to delineate or circumscribe the inventive subject matter. Accordingly, the disclosure herein is intended to be illustrative, but not limiting, of the scope of the invention, which is set forth in the following claims.
This application claims the benefit of U.S. Provisional Application No. 61/418,853, filed Dec. 1, 2010, which is incorporated by reference in its entirety.