Systems and methods for estimating depth and visibility from a reference viewpoint for pixels in a set of images captured from different viewpoints

Description

FIELD OF THE INVENTION

The present invention generally relates to digital cameras and more specifically to the detection and correction of parallax in images captured using array cameras.

BACKGROUND

Binocular viewing of a scene creates two slightly different images of the scene due to the different fields of view of each eye. These differences, referred to as binocular disparity (or parallax), provide information that can be used to calculate depth in the visual scene, providing a major means of depth perception. The impression of depth associated with stereoscopic depth perception can also be obtained under other conditions, such as when an observer views a scene with only one eye while moving. The observed parallax can be utilized to obtain depth information for objects in the scene. Similar principles in machine vision can be used to gather depth information.

Two or more cameras separated by a distance can take pictures of the same scene and the captured images can be compared by shifting the pixels of two or more images to find parts of the images that match. The amount an object shifts between different camera views is called the disparity, which is inversely proportional to the distance to the object. A disparity search that detects the shift of an object in multiple images can be used to calculate the distance to the object based upon the baseline distance between the cameras and the focal length of the cameras involved. The approach of using two or more cameras to generate stereoscopic three-dimensional images is commonly referred to as multi-view stereo.

Multi-view stereo can generally be described in terms of the following components: matching criterion, aggregation method, and winner selection. The matching criterion is used as a means of measuring the similarity of pixels or regions across different images. A typical error measure is the RGB or intensity difference between images (these differences can be squared, or robust measures can be used). Some methods compute subpixel disparities by computing the analytic minimum of the local error surface or use gradient-based techniques. One method involves taking the minimum difference between a pixel in one image and the interpolated intensity function in the other image. The aggregation method refers to the manner in which the error function over the search space is computed or accumulated. The most direct way is to apply search windows of a fixed size over a prescribed disparity space for multiple cameras. Others use adaptive windows, shiftable windows, or multiple masks. Another set of methods accumulates votes in 3D space, e.g., a space sweep approach and voxel coloring and its variants. Once the initial or aggregated matching costs have been computed, a decision is made as to the correct disparity assignment for each pixel. Local methods do this at each pixel independently, typically by picking the disparity with the minimum aggregated value. Cooperative/competitive algorithms can be used to iteratively decide on the best assignments. Dynamic programming can be used for computing depths associated with edge features or general intensity similarity matches. These approaches can take advantage of one-dimensional ordering constraints along the epipolar line to handle depth discontinuities and unmatched regions. Yet another class of methods formulate stereo matching as a global optimization problem, which can be solved by global methods such as simulated annealing and graph cuts.

More recently, researches have used multiple cameras spanning a wider synthetic aperture to capture light field images (e.g. the Stanford Multi-Camera Array). A light field, which is often defined as a 4D function characterizing the light from all direction at all points in a scene, can be interpreted as a two-dimensional (2D) collection of 2D images of a scene. Due to practical constraints, it is typically difficult to simultaneously capture the collection of 2D images of a scene that form a light field. However, the closer in time at which the image data is captured by each of the cameras, the less likely that variations in light intensity (e.g. the otherwise imperceptible flicker of fluorescent lights) or object motion will result in time dependent variations between the captured images. Processes involving capturing and resampling a light field can be utilized to simulate cameras with large apertures. For example, an array of M×N cameras pointing at a scene can simulate the focusing effects of a lens as large as the array. Use of camera arrays in this way can be referred to as synthetic aperture photography.

While stereo matching was originally formulated as the recovery of 3D shape from a pair of images, a light field captured using a camera array can also be used to reconstruct a 3D shape using similar algorithms to those used in stereo matching. The challenge, as more images are added, is that the prevalence of partially occluded regions (pixels visible in some but not all images) also increases.

SUMMARY OF THE INVENTION

Systems and methods in accordance with embodiments of the invention can perform parallax detection and correction in images captured using array cameras. An embodiment of the method of the invention for estimating distances to objects within a scene from a light field comprising a set of images captured from different viewpoints using a processor configured by an image processing application includes: selecting a reference viewpoint relative to the viewpoints of the set of images captured from different viewpoints; normalizing the set of images to increase the similarity of corresponding pixels within the set of images; and determining initial depth estimates for pixel locations in an image from the reference viewpoint using at least a subset of the set of images, where an initial depth estimate for a given pixel location in the image from the reference viewpoint is determined by: identifying pixels in the at least a subset of the set of images that correspond to the given pixel location in the image from the reference viewpoint based upon expected disparity at a plurality of depths; comparing the similarity of the corresponding pixels identified at each of the plurality of depths; and selecting the depth from the plurality of depths at which the identified corresponding pixels have the highest degree of similarity as an initial depth estimate for the given pixel location in the image from the reference viewpoint. In addition, the method includes identifying corresponding pixels in the set of images using the initial depth estimates; comparing the similarity of the corresponding pixels in the set of images to detect mismatched pixels. When an initial depth estimate does not result in the detection of a mismatch between corresponding pixels in the set of images, selecting the initial depth estimate as the current depth estimate for the pixel location in the image from the reference viewpoint. When an initial depth estimate results in the detection of a mismatch between corresponding pixels in the set of images, selecting the current depth estimate for the pixel location in the image from the reference viewpoint by: determining a set of candidate depth estimates using a plurality of different subsets of the set of images; identifying corresponding pixels in each of the plurality of subsets of the set of images based upon the candidate depth estimates; and selecting the candidate depth of the subset having the most similar corresponding pixels as the current depth estimate for the pixel location in the image from the reference viewpoint.

In a further embodiment, selecting a reference viewpoint relative to the viewpoints of the set of images captured from different viewpoints includes selecting a viewpoint from the set consisting of: the viewpoint of one of the images; and a virtual viewpoint.

In another embodiment, a pixel in a given image from the set of images that corresponds to a pixel location in the image from the reference viewpoint is determined by applying a scene dependent shift to the pixel location in the image from the reference viewpoint that is determined based upon: the depth estimate of the pixel location in the image from the reference viewpoint; and the baseline between the viewpoint of the given image and the reference viewpoint.

In a still further embodiment, the subsets of the set of images used to determine the set of candidate depth estimates are selected based upon the viewpoints of the images in the sets of images to exploit patterns of visibility characteristic of natural scenes that are likely to result in at least one subset in which a given pixel location in the image from the reference viewpoint is visible in each image in the subset.

In still another embodiment, the set of images are captured within multiple color channels; selecting a reference viewpoint relative to the viewpoints of the set of images captured from different viewpoints includes selecting one of the images as a reference image and selecting the viewpoint of the reference image as the reference viewpoint; and the subsets of the set of images used to determine the set of candidate depth estimates are selected so that the same number of images in the color channel containing the reference image appears in each subset.

In a yet further embodiment, the subsets of the set of images used to determine the set of candidate depth estimates are also selected so that there are at least two images in the color channels that do not contain the reference image in each subset.

Yet another embodiment also includes determining the visibility of the pixels in the set of images from the reference viewpoint by: identifying corresponding pixels in the set of images using the current depth estimates; and determining that a pixel in a given image is not visible in the image from the reference viewpoint when the pixel fails a photometric similarity criterion determined based upon a comparison of corresponding pixels.

In a further embodiment again, selecting a reference viewpoint relative to the viewpoints of the set of images captured from different viewpoints includes selecting one of the images in the set of images as a reference image and selecting the viewpoint of the reference image as the reference viewpoint; and determining that a pixel in a given image is not visible in the image from the reference viewpoint when the pixel fails a photometric similarity criterion determined based upon a comparison of corresponding pixels further includes comparing the pixel in the given image to the corresponding pixel in the reference image.

In another embodiment again, the photometric similarity criterion includes a similarity threshold that adapts based upon at least the intensity of at least one of the pixel in the given image and the pixel in the reference image.

In a further additional embodiment, the photometric similarity criterion includes a similarity threshold that adapts as a function of the photometric distance between the corresponding pixel from the reference image and the corresponding pixel that is most similar to the pixel from the reference image.

In another additional embodiment, the photometric similarity criterion includes a similarity threshold that adapts based upon the signal to noise ratio of the pixel in the reference image.

In a still yet further embodiment, adapting the similarity threshold based upon the signal to noise ratio is approximated by scaling the photometric distance of the corresponding pixel from the reference image and the corresponding pixel that is most similar to the pixel from the reference image is and applying an offset to obtain an appropriate threshold.

In still yet another embodiment, the set of images includes images captured in a plurality of color channels and the reference image is an image captured in a first color channel and the given image is in the second color channel; determining that a pixel in a given image is not visible in the reference viewpoint when the pixel fails a photometric similarity criterion determined based upon a comparison of corresponding pixels further includes: selecting an image in the second color channel in which the corresponding pixel in the image from the reference viewpoint is visible as a reference image for the second color channel; and comparing the pixel in the given image to the corresponding pixel in the reference image for the second color channel.

In a still further embodiment again, selecting a reference viewpoint relative to the viewpoints of the set of images captured from different viewpoints includes selecting a virtual viewpoint as the reference viewpoint; and determining that a pixel in a given image is not visible in the image from the reference viewpoint when the pixel fails a photometric similarity criterion determined based upon a comparison of corresponding pixels further includes: selecting an image adjacent the virtual viewpoint as a reference image; and comparing the pixel in the given image to the corresponding pixel in the reference image.

In still another embodiment again, the image adjacent the virtual viewpoint is selected based upon the corresponding pixel in the selected image to the pixel from the given image being visible in an image from the reference viewpoint.

A yet further embodiment again also includes updating the depth estimate for a given pixel location in the image from the reference viewpoint based upon the visibility of the pixels in the set of images from the reference viewpoint by: generating an updated subset of the set of images using images in which the given pixel location in the image from the reference viewpoint is determined to be visible based upon the current depth estimate for the given pixel; identifying pixels in the updated subset of the set of images that correspond to the given pixel location in the image from the reference viewpoint based upon expected disparity at a plurality of depths; comparing the similarity of the corresponding pixels in the updated subset of images identified at each of the plurality of depths; and selecting the depth from the plurality of depths at which the identified corresponding pixels in the updated subset of the set of images have the highest degree of similarity as an updated depth estimate for the given pixel location in the image from the reference viewpoint.

In yet another embodiment again, the subsets of the set of images are pairs of images; and the updated subset of the set of images includes at least three images.

In a still further additional embodiment, normalizing the set of images to increase the similarity of corresponding pixels within the set of images further includes utilizing calibration information to correct for photometric variations and scene-independent geometric distortions in the images in the set of images, and rectification of the images in the set of images

In still another additional embodiment, normalizing the set of images to increase the similarity of corresponding pixels within the set of images further includes resampling the images to increase the similarity of corresponding pixels in the set of images; and the scene-independent geometric corrections applied to the images are determined at a sub-pixel resolution.

In a yet further additional embodiment, utilizing calibration information to correct for photometric variations further includes performing any one of the normalization processes selected from the group consisting of: Black Level calculation and adjustment; vignetting correction; lateral color correction; and temperature normalization.

In yet another additional embodiment, the scene-independent geometric corrections also include rectification to account for distortion and rotation of lenses in an array of cameras that captured the set of images.

In a further additional embodiment again, a cost function is utilized to determine the similarity of corresponding pixels.

In another additional embodiment again, determining the similarity of corresponding pixels further includes spatially filtering the calculated costs.

In another further embodiment, selecting the depth from the plurality of depths at which the identified corresponding pixels have the highest degree of similarity as an initial depth estimate for the given pixel location in the image from the reference viewpoint further includes selecting the depth from the plurality of depths at which the filtered cost function for the identified corresponding pixels indicates the highest level of similarity.

In still another further embodiment, the cost function utilizes at least one similarity measure selected from the group consisting of: the L1 norm of a pair of corresponding pixels; the L2 norm of a pair of corresponding pixels; and the variance of a set of corresponding pixels.

In yet another further embodiment, the set of images are captured within multiple color channels and the cost function determines the similarity of pixels in each of the multiple color channels.

Another further embodiment again also includes generating confidence metrics for the current depth estimates for pixel locations in the image from the reference viewpoint.

In another further additional embodiment, the confidence metric encodes a plurality of confidence factors.

Still yet another further embodiment also includes filtering the depth map based upon the confidence map.

Still another further embodiment again also includes detecting occlusion of pixels in images within the set of images that correspond to specific pixel locations in the image from the reference viewpoint based upon the initial depth estimates by searching along lines parallel to the baselines between the reference viewpoint and the viewpoints of the images in the set of images to locate occluding pixels; when an initial depth estimate results in the detection of a corresponding pixel in at least one image being occluded, selecting the current depth estimate for the pixel location in the image from the reference viewpoint by: determining a set of candidate depth estimates using a plurality of different subsets of the set of images that exclude the at least one image in which the given pixel is occluded; identifying corresponding pixels in each of the plurality of subsets of the set of images based upon the candidate depth estimates; and selecting the candidate depth of the subset having the most similar corresponding pixels as the current depth estimate for the pixel location in the image from the reference viewpoint.

In still another further additional embodiment, searching along lines parallel to the baselines between the reference viewpoint and the viewpoints of the images in the set of images to locate occluding pixels further includes determining that a pixel corresponding to a pixel location (x₁, y₁) in an image from the reference viewpoint is occluded in an alternate view image by a pixel location (x₂, y₂) in the image from the reference viewpoint when

|s₂−s₁−√{square root over ((x₂−x₁)²+(y₂−y₁)²)}|≤threshold

where s₁and s₂are scene dependent geometric shifts applied to pixel locations (x₁, y₁) and pixel (x₂, y₂) to shift the pixels along a line parallel to the baseline between the reference viewpoint and the viewpoint of the alternate view image to shift the pixels into the viewpoint of the alternate view image based upon the initial depth estimates for each pixel.

In yet another further embodiment again, the decision to designate a pixel as being occluded considers at least one of the similarity of the pixels and the confidence of the estimated depths of the pixels (x₁, y₁) and (x₂, y₂).

In a specific embodiment, a cost function is utilized to determine the similarity of corresponding pixels.

In another specific embodiment, determining the similarity of corresponding pixels further comprises spatially filtering the calculated costs.

In a further specific embodiment, the spatial filtering of the calculated costs utilizes a filter selected from the group consisting of: a fixed-coefficient filter; and an edge-preserving filter.

In a still further specific embodiment, selecting the depth from the plurality of depths at which the identified corresponding pixels have the highest degree of similarity as an initial depth estimate for the given pixel location in the image from the reference viewpoint further includes selecting the depth from the plurality of depths at which the filtered cost function for the identified corresponding pixels indicates the highest level of similarity.

In still another specific embodiment, the set of images are captured within a single color channel and the cost function is a function of the variance of the corresponding pixel.

In a yet further specific embodiment, the cost function is an aggregated cost function CV(x, y, d) over each image i in the set of images that includes the following term

$CV (x, y, d) = \sum_{i} \frac{{Cost}^{i, Ref} (x, y, d) \times V^{i, Ref} (x, y)}{number of visible cameras at (x, y)}$

where Cost^i,Ref(x, y, d) is a similarity measure (i.e. the cost function),

- d is depth of pixel (x, y), and
- V^i,Ref(x, y) is the visibility of pixel (x, y) and initially V^i,Ref(x, y)=1 for all cameras.

In a further specific embodiment again, the individual costs Cost^i,Ref(x, y, d) are computed based on each disparity hypothesis d for each pixel (x, y) for cameras i, Ref as follows:

Cost^i,Ref(x,y,d)=S{Iⁱ(x,y,d),I^Ref(x,y,d)}

where S is the similarity measure (for example), and

- Iⁱis the calibrated image i after geometric calibration.

In yet another specific embodiment, the aggregated cost considers the similarity of the shifted images at the candidate depth as follows:

$CV (x, y, d) = \sum_{k \in K} \frac{(x, y) {Cost}^{k, Ref} (x, y, d) \times V^{k, Ref}}{number of visible cameras in K} + \sum_{i, j \in L} \frac{{Cost}^{i, j} (x, y, d) \times V^{i, Ref} (x, y) \times V^{j, Ref} (x, y)}{number of pairs of cameras in L}$

where K is a set of cameras in the same spectral channel as the reference camera,

- L is a set of pairs of cameras, where both cameras in each pair are in the same spectral channel (which can be a different spectral channel to the reference camera where the light field includes image data in multiple spectral channels),
  
  Cost^k,Ref(x,y,d)=S{ImageRef(x,y),ShiftedImage^k(x,y,d)}, and
  Cost^i,j(x,y,d)=S{ShiftedImageⁱ(x,y,d),ShiftedImage^j(x,y,d)}

In a further specific embodiment again, the aggregated cost function is spatially filtered using a filter so that the weighted aggregated cost function is as follows:

$FilteredCV (x, y, d) = \frac{1}{Norm} \sum_{(x_{1}, y_{1}) \in N (x, y)} CV (x_{1}, y_{1}, d) \times wd (x, y, x_{1}, y_{1}) \times wr (I_{Ref} (x, y) - I_{Ref} (x_{1}, y_{1}))$

where N(x, y) is the immediate neighborhood of the pixel (x, y), which can be square, circular, rectangular, or any other shape appropriate to the requirements of a specific application,

- Norm is a normalization term,
- I_Ref(x, y) is the image data from the reference camera,
- wd is a weighting function based on pixel distance, and
- wr is a weighting function based on intensity difference.

In a further embodiment, the filter is a box filter and wd and wr are constant coefficients.

In another embodiment, the filter is a bilateral filter and wd and wr are both Gaussian weighting functions.

In a still further embodiment, a depth estimate for a pixel location (x, y) in the image from the reference viewpoint is determined by selecting the depth that minimizes the filtered cost at each pixel location in the depth map as follows:

D(x,y)=argmin{FilteredCV(x,y,d)}

In still another embodiment, the set of images are captured within multiple color channels and the cost function incorporates the L1 norm of image data from the multiple color channels.

In a yet further embodiment, the set of images are captured within multiple color channels and the cost function incorporates the L2 norm of image data from the multiple color channels.

In yet another embodiment, the set of images are captured within multiple color channels including at least Red, Green and Blue color channels; selecting a reference viewpoint relative to the viewpoints of the set of images captured from different viewpoints comprises selecting one of the images in the Green color channel as a Green reference image and selecting the viewpoint of the Green reference image as the reference viewpoint; and the cost function Cost(x, y, d) for a pixel location (x, y) in the image from the reference viewpoint at a depth d is:

Cost(x,y,d)=γ_G(x,y).Cost_G(x,y,d)+γ_R(x,y).Cost_R(x,y,d)+γ_B(x,y).Cost_B(x,y,d)

where Cost_G(x, y, d) is the measure of the similarity of a pixel location (x, y) in the image from the reference viewpoint to corresponding pixels in locations within a set of Green images based upon the depth d,

Cost_R(x, y, d) is the measure of the similarity of corresponding pixels in locations within a set of Red images determined based upon the depth d and the pixel location (x, y) in the image from the reference viewpoint,

Cost_B(x, y, d) is the measure of the similarity of corresponding pixels in locations within a set of Blue images determined based upon the depth d and the pixel location (x, y) in the image from the reference viewpoint, and

γ_G, γ_R, and γ_Bare weighting factors for the Green, Red and Blue cost functions respectively.

In a further embodiment again, the Cost_G(x, y, d) uses a similarity measure selected from the group consisting of an L1 norm, an L2 norm, and variance across the pixels in the images in the set of images that are within the Green color channel.

In another embodiment again, the cost measures for the Red (Cost_R(x, y, d)) and Blue color channels (Cost_B(x, y, d)) are determined by calculating the aggregated difference between unique pairs of corresponding pixels in images within the color channel.

In a further additional embodiment, calculating the aggregated difference between each unique pair of corresponding pixels in images within a color channel comprises determining a combination cost metric for unique pairs of corresponding pixels in images within the color channel.

In another additional embodiment, the combination cost metric (Cost_C(x, y, d)) for a Red color channel including four images (C_A, C_B, C_C, and C_D) can be determined as follows:

Cost_C(x,y,d)=|C_A(x_A,y_A)−C^B(x_B,y_B)|+|C_A(x_A,y_A)−C^C(x_C,y_D)|+|C_A(x_A,y_A)−C^D(x_D,y_D)|+|C_B(x_B,y_B)−C^C(x_C,y_D)|+|C_B(x_B,y_B)−C^D(x_D,y_D)|+|C_C(x_C,y_C)−C^C(x_D,y_D)|

where (x_A, y_A), (x_B, y_B), (x_C, y_C), and (x_D, y_D) are corresponding pixel locations determined based upon the disparity in each of the images C_A, C_B, C_C, and C_Drespectively at depth d.

In a still yet further embodiment, the combination cost metric is determined utilizing at least one selected from the group consisting of: the L1 norm of the pixel brightness values; the L2 norm of the pixel brightness values; and the variance in the pixel brightness values.

In still yet another embodiment, the weighting factors γ_G, γ_R, and γ_Bare fixed.

In a still further embodiment again, the weighting factors γ_G, γ_R, and γ_Bvary spatially with the pixel location (x, y) in the image from the reference viewpoint.

In still another embodiment again, the weighting factors γ_G, γ_R, and γ_Bvary based upon the estimated SNR at the pixel location (x, y) in the image from the reference viewpoint; and strong SNR at the pixel location (x, y) in the image from the reference viewpoint is used to reduce the weighting applied to the Red and Blue color channels.

In a further embodiment, the confidence metric encodes a plurality of confidence factors.

In another embodiment, the confidence metric for the depth estimate for a given pixel location in the image from the reference viewpoint comprises at least one confidence factor selected from the group consisting of: an indication that the given pixel is within a textureless region within an image; a measure of the signal to noise ration (SNR) in a region surrounding a given pixel; the number of corresponding pixels used to generate the depth estimate; an indication of the number of depths searched to generate the depth estimate; an indication that the given pixel is adjacent a high contrast edge; and an indication that the given pixel is adjacent a high contrast boundary.

In a still further embodiment, the confidence metric for the depth estimate for a given pixel location in the image from the reference viewpoint comprises at least one confidence factor selected from the group consisting of: an indication that the given pixel lies on a gradient edge; an indication that the corresponding pixels to the given pixel are mismatched; an indication that corresponding pixels to the given pixel are occluded; an indication that depth estimates generated using different reference cameras exceed a threshold for the given pixel; an indication that the depth estimates generated using different subsets of cameras exceed a threshold for the given pixel; an indication as to whether the depth of the given threshold exceeds a threshold; an indication that the given pixel is defective; and an indication that corresponding pixels to the given pixel are defective.

In still another embodiment, the confidence metric for the depth estimate for a given pixel location in the image from the reference viewpoint comprises at least: a measure of the SNR in a region surrounding a given pixel; and the number of corresponding pixels used to generate the depth estimate.

In a yet further embodiment, the confidence metric encodes at least one binary confidence factor.

In yet another embodiment, the confidence metric encodes at least one confidence factor represented as a range of degrees of confidence.

In a further embodiment again, the confidence metric encodes at least one confidence factor determined by comparing the similarity of the pixels in the set of images that were used to generate the finalized depth estimate for a given pixel location in the image from the reference viewpoint.

In another embodiment again, a cost function is utilized to generate a cost metric indicating the similarity of corresponding pixels; and comparing the similarity of the pixels in the set of images that were used to generate the depth estimate for a given pixel location in the image from the reference viewpoint further comprises: applying a threshold to a cost metric of the pixels in the set of images that were used to generate the finalized depth estimate for a given pixel location in the image from the reference viewpoint; and when the cost metric exceeds the threshold, assigning a confidence metric that indicates that the finalized depth estimate for the given pixel location in the image from the reference viewpoint was generated using at least one pixel in the set of images that is a problem pixel.

In a further additional embodiment, the threshold is modified based upon at least one of: a mean intensity of a region surrounding the given pixel location in the image from the reference viewpoint; and noise statistics for at least one sensor used to capture the set of images.

In a still yet further embodiment, the mean intensity of a region surrounding the given pixel location in the image from the reference viewpoint is calculated using a spatial box N×N averaging filter centered around the given pixel.

In still yet another embodiment, the set of images are captured within multiple color channels including at least Red, Green and Blue color channels; selecting a reference viewpoint relative to the viewpoints of the set of images captured from different viewpoints comprises selecting one of the images in the Green color channel as a Green reference image and selecting the viewpoint of the Green reference image as the reference viewpoint; and the mean intensity is used to determine the noise statistics for the Green channel using a table that relates a particular mean at a particular exposure and gain to a desired threshold.

In a still further embodiment again, selecting a reference viewpoint relative to the viewpoints of the set of images captured from different viewpoints comprises selecting one of the images as a reference image and selecting the viewpoint of the reference image as the reference viewpoint; and a cost function is utilized to generate a cost metric indicating the similarity of corresponding pixels; a confidence metric based upon general mismatch is obtained using the following formula:

Confidence(x,y)=F(Cost_min(x,y),Cost^d(x,y),I(x,y)^cam,Sensor)

where Cost_min(x, y) is the minimum cost of a disparity search over the desired depth range,

- Cost^d(x, y) denotes that cost data from any depth or depths (beside the minimum depth),
- I(x, y)cam image data captured by any camera can be utilized to augment the confidence;
- Sensor is the sensor prior, which can include known properties of the sensor, such as (but not limited to) noise statistics or characterization, defective pixels, properties of the sensor affecting any captured images (such as gain or exposure),
- Camera intrinsics is the camera intrinsic, which specifies elements intrinsic to the camera and camera array that can impact confidence including (but not limited to) the baseline separation between cameras in the array (affects precision of depth measurements), and the arrangement of the color filters (affects performance in the occlusion zones in certain scenarios).

In still another embodiment again, selecting a reference viewpoint relative to the viewpoints of the set of images captured from different viewpoints comprises selecting one of the images as a reference image and selecting the viewpoint of the reference image as the reference viewpoint; and a cost function is utilized to generate a cost metric indicating the similarity of corresponding pixels; and a confidence metric based upon general mismatch is obtained using the following formula:

$Confidence (x, y) = a \times \frac{{Cost}_{\min} (x, y)}{Avg (x, y)} + offset$

where Avg(x, y) is the mean intensity of the reference image in a spatial neighborhood surrounding (x, y), or an estimate of the mean intensity in the neighborhood, that is used to adjust the confidence based upon the intensity of the reference image in the region of (x, y),

- a and offset are empirically chosen scale and offset factors used to adjust the confidence with prior information about the gain and noise statistics of the sensor.
- a and offset are empirically chosen scale and offset factors used to adjust the confidence with prior information about the gain and noise statistics of at least one sensor used to capture images in the set of images.

In a yet further embodiment again, generating confidence metrics for the depth estimates for pixel locations in the image from the reference viewpoint includes determining at least one sensor gain used to capture at least one of the set of images and adjusting the confidence metrics based upon the sensor gain.

In yet another embodiment again, generating confidence metrics for the depth estimates for pixel locations in the image from the reference viewpoint comprises determining at least one exposure time used to capture at least one of the set of images and adjusting the confidence metrics based upon the sensor gain.

A still further additional embodiment also includes outputting a depth map containing the finalized depth estimates for pixel locations in the image from the reference viewpoint, and outputting a confidence map containing confidence metrics for the finalized depth estimates contained within the depth map.

Still another additional embodiment also includes filtering the depth map based upon the confidence map.

Yet another further additional embodiment includes estimating distances to objects within a scene from the light field comprising a set of images captured from different viewpoints using a processor configured by an image processing application by: selecting a reference viewpoint relative to the viewpoints of the set of images captured from different viewpoints; normalizing the set of images to increase the similarity of corresponding pixels within the set of images; determining initial depth estimates for pixel locations in an image from the reference viewpoint using at least a subset of the set of images, where an initial depth estimate for a given pixel location in the image from the reference viewpoint is determined by: identifying pixels in the at least a subset of the set of images that correspond to the given pixel location in the image from the reference viewpoint based upon expected disparity at a plurality of depths; comparing the similarity of the corresponding pixels identified at each of the plurality of depths; and selecting the depth from the plurality of depths at which the identified corresponding pixels have the highest degree of similarity as an initial depth estimate for the given pixel location in the image from the reference viewpoint. In addition, the process of estimating distances further includes identifying corresponding pixels in the set of images using the initial depth estimates; comparing the similarity of the corresponding pixels in the set of images to detect mismatched pixels; when an initial depth estimate does not result in the detection of a mismatch between corresponding pixels in the set of images, selecting the initial depth estimate as the current depth estimate for the pixel location in the image from the reference viewpoint; and when an initial depth estimate results in the detection of a mismatch between corresponding pixels in the set of images, selecting the current depth estimate for the pixel location in the image from the reference viewpoint by: determining a set of candidate depth estimates using a plurality of different subsets of the set of images; identifying corresponding pixels in each of the plurality of subsets of the set of images based upon the candidate depth estimates; and selecting the candidate depth of the subset having the most similar corresponding pixels as the current depth estimate for the pixel location in the image from the reference viewpoint. The process further including determining the visibility of the pixels in the set of images from the reference viewpoint by: identifying corresponding pixels in the set of images using the current depth estimates; and determining that a pixel in a given image is not visible in the reference viewpoint when the pixel fails a photometric similarity criterion determined based upon a comparison of corresponding pixels; and fusing pixels from the set of images using the processor configured by the image processing application based upon the depth estimates to create a fused image having a resolution that is greater than the resolutions of the images in the set of images by: identifying the pixels from the set of images that are visible in an image from the reference viewpoint using the visibility information; and applying scene dependent geometric shifts to the pixels from the set of images that are visible in an image from the reference viewpoint to shift the pixels into the reference viewpoint, where the scene dependent geometric shifts are determined using the current depth estimates; and fusing the shifted pixels from the set of images to create a fused image from the reference viewpoint having a resolution that is greater than the resolutions of the images in the set of images.

Another further embodiment also includes synthesizing an image from the reference viewpoint using the processor configured by the image processing application to perform a super resolution process based upon the fused image from the reference viewpoint, the set of images captured from different viewpoints, the current depth estimates, and the visibility information.

A further embodiment of the invention includes a processor, and memory containing a set of images captured from different viewpoints and an image processing application. In addition, the image processing application configures the processor to: select a reference viewpoint relative to the viewpoints of the set of images captured from different viewpoints; normalize the set of images to increase the similarity of corresponding pixels within the set of images; determine initial depth estimates for pixel locations in an image from the reference viewpoint using at least a subset of the set of images, where an initial depth estimate for a given pixel location in the image from the reference viewpoint is determined by: identifying pixels in the at least a subset of the set of images that correspond to the given pixel location in the image from the reference viewpoint based upon expected disparity at a plurality of depths; comparing the similarity of the corresponding pixels identified at each of the plurality of depths; and selecting the depth from the plurality of depths at which the identified corresponding pixels have the highest degree of similarity as an initial depth estimate for the given pixel location in the image from the reference viewpoint. The application further configures the processor to identify corresponding pixels in the set of images using the initial depth estimates; compare the similarity of the corresponding pixels in the set of images to detect mismatched pixels. When an initial depth estimate does not result in the detection of a mismatch between corresponding pixels in the set of images, the application configures the processor to select the initial depth estimate as the current depth estimate for the pixel location in the image from the reference viewpoint. When an initial depth estimate results in the detection of a mismatch between corresponding pixels in the set of images, the application configures the processor to select the current depth estimate for the pixel location in the image from the reference viewpoint by: determining a set of candidate depth estimates using a plurality of different subsets of the set of images; identifying corresponding pixels in each of the plurality of subsets of the set of images based upon the candidate depth estimates; and selecting the candidate depth of the subset having the most similar corresponding pixels as the current depth estimate for the pixel location in the image from the reference viewpoint.

In another embodiment, the image processing application further configures the processor to: determine the visibility of the pixels in the set of images from the reference viewpoint by: identifying corresponding pixels in the set of images using the current depth estimates; and determining that a pixel in a given image is not visible in the reference viewpoint when the pixel fails a photometric similarity criterion determined based upon a comparison of corresponding pixels; and fuse pixels from the set of images using the depth estimates to create a fused image having a resolution that is greater than the resolutions of the images in the set of images by: identifying the pixels from the set of images that are visible in an image from the reference viewpoint using the visibility information; and applying scene dependent geometric shifts to the pixels from the set of images that are visible in an image from the reference viewpoint to shift the pixels into the reference viewpoint, where the scene dependent geometric shifts are determined using the current depth estimates; and fusing the shifted pixels from the set of images to create a fused image from the reference viewpoint having a resolution that is greater than the resolutions of the images in the set of images.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 conceptual illustrates of an array camera in accordance with an embodiment of the invention.

FIG. 1A conceptually illustrates an array camera module in accordance with an embodiment of the invention.

FIG. 1C conceptually illustrates a color filter pattern for a 4×4 array camera module in accordance with an embodiment of the invention.

FIG. 2 conceptually illustrates capturing image data using a reference camera and an alternate view camera.

FIGS. 3A and 3B conceptually illustrate the effect of parallax in images of a scene captured by a reference camera and an alternate view camera.

FIG. 4 is a flowchart illustrating a process for generating a depth map from a captured light field including a plurality of images captured from different viewpoints in accordance with an embodiment of the invention.

FIG. 5 is a flowchart of a process for normalizing captured image data in accordance with an embodiment of the invention.

FIG. 6 is a flowchart of a process for iteratively refining a depth map based upon visibility information in accordance with embodiments of the invention.

FIG. 7 conceptually illustrates a subset of cameras within an array camera that can be utilized to generate estimates of distances to objects within a scene in accordance with an embodiment of the invention.

FIG. 8 is a flowchart illustrating a process for performing a disparity search using visibility information in accordance with an embodiment of the invention.

FIG. 8A is a flowchart illustrating a process for estimating depth using images captured by subsets of cameras in a camera array in accordance with an embodiment of the invention.

FIGS. 8B-8I conceptually illustrate subsets of cameras in a 5×5 array camera that can be utilized to obtain depth estimates in accordance with embodiments of the invention.

FIGS. 8J-8M conceptually illustrate subsets of cameras in a 4×4 array camera that can be utilized to obtain depth estimates in accordance with embodiments of the invention.

FIG. 9 conceptually illustrates a process for searching an epipolar line for pixels that occlude a given pixel in accordance with an embodiment of the invention.

FIG. 10 conceptually illustrates a 5×5 array camera that can be utilized to construct a depth map in accordance with an embodiment of the invention.

FIG. 11 is a flowchart illustrating a process for determining visibility based upon the photometric similarity of corresponding pixels in accordance with an embodiment of the invention.

FIG. 12 conceptually illustrates one of many virtual viewpoints that can be defined with respect to a 4×4 array camera in accordance with an embodiment of the invention.

FIG. 13 is a flowchart illustrating a process for generating a sparse depth map in accordance with an embodiment of the invention.

FIG. 14 conceptually illustrates a set of pixels that can be utilized as indicator pixels when generating a sparse depth map in accordance with an embodiment of the invention.

FIG. 15 is a flowchart illustrating a process for detecting textureless regions using the SNR surrounding a pixel in accordance with an embodiment of the invention.

FIG. 16 is a system for generating a depth map and visibility information in accordance with an embodiment of the invention.

FIG. 17 is a flowchart illustrating a process for synthesizing a higher resolution image from a plurality of lower resolution images captured from different viewpoints using super-resolution processing in accordance with an embodiment of the invention.

FIGS. 18A and 18B conceptually illustrate sources of noise in depth estimates.

FIGS. 18C-18H conceptually illustrate the generation of a depth map and a confidence map from captured image data and the use of the confidence map to filter the depth map in accordance with an embodiment of the invention.

FIGS. 18I-18N similarly conceptually illustrate the generation of a depth map and a confidence map from captured image data and the use of the confidence map to filter the depth map using close up images in accordance with an embodiment of the invention.

DETAILED DESCRIPTION

Turning now to the drawings, systems and methods for parallax detection and correction in images captured using array cameras are illustrated. Array cameras, such as those described in U.S. patent application Ser. No. 12/935,504 entitled “Capturing and Processing of Images using Monolithic Camera Array with Heterogeneous Imagers” to Venkataraman et al., can be utilized to capture light field images. In a number of embodiments, super-resolution processes such as those described in U.S. patent application Ser. No. 12/967,807 entitled “Systems and Methods for Synthesizing High Resolution Images Using Super-Resolution Processes” to Lelescu et al., are utilized to synthesize a higher resolution 2D image or a stereo pair of higher resolution 2D images from the lower resolution images in the light field captured by an array camera. The terms high or higher resolution and low or lower resolution are used here in a relative sense and not to indicate the specific resolutions of the images captured by the array camera. The disclosures of U.S. patent application Ser. No. 12/935,504 and U.S. patent application Ser. No. 12/967,807 are hereby incorporated by reference in their entirety.

Each two-dimensional (2D) image in a captured light field is from the viewpoint of one of the cameras in the array camera. Due to the different viewpoint of each of the cameras, parallax results in variations in the position of objects within the different images of the scene. Systems and methods in accordance with embodiments of the invention provide an accurate account of the pixel disparity as a result of parallax between the different cameras in the array, so that appropriate scene-dependent geometric shifts can be applied to the pixels of the captured images when performing super-resolution processing.

A high resolution image synthesized using super-resolution processing is synthesized from a specific viewpoint that can be referred to as a reference viewpoint. The reference viewpoint can be from the viewpoint of one of the cameras in a camera array. Alternatively, the reference viewpoint can be an arbitrary virtual viewpoint where there is no physical camera. A benefit of synthesizing a high resolution image from the viewpoint of one of the cameras (as opposed to a virtual viewpoint) is that the disparity of the pixels in the light field can be determined with respect to the image in the light field captured from the reference viewpoint. When a virtual viewpoint is utilized, none of the captured image data is from the reference viewpoint and so the process instead relies solely on cameras away from the reference position to determine the best match.

Array cameras in accordance with many embodiments of the invention use the disparity between the pixels in the images within a light field to generate a depth map from the reference viewpoint. A depth map indicates the distance of scene objects from a reference viewpoint and can be utilized to determine scene dependent geometric corrections to apply to the pixels from each of the images within a captured light field to correct for disparity when performing super-resolution processing. In several embodiments, an initial depth map of the reference viewpoint is generated and as part of that process or as a subsequent process occluded pixels and/or other types of mismatched pixels are detected. The process of detecting pixels that are occluded can also be thought of as determining whether a pixel in an image captured from the reference viewpoint is visible in the image from a non-reference viewpoint. When a pixel in the image captured from the reference viewpoint is not visible in a second image, utilizing image data from the second image when determining the depth of the pixel in the reference image introduces error into the depth determination. Therefore, by detecting the pixels in the reference image that are occluded in one or more images in the light field, the accuracy of the depth map can be improved. In several embodiments, the initial depth map is updated by determining the depths of occluded pixels using image data captured from cameras in which the pixels are visible (i.e. not occluded). In a number of embodiments, the likely presence of occlusions and/or other sources of mismatched pixels can be detected during the process of generating an initial depth estimate and subsets of a set of images that correspond to different patterns of visibility within a scene can be used to determine a set of candidate depth estimates. The candidate depth of the subset of images having the most similar corresponding pixels can be used as the new depth estimate and the new depth estimate used to determine the visibility of the corresponding pixels in some or all of the remaining set of images.

A depth map from a reference viewpoint can be utilized to determine the scene dependent geometric shifts that are likely to have occurred in images captured from other viewpoints. These scene dependent geometric shifts can be utilized in super-resolution processing. In addition, the scene dependent geometric shifts can be utilized to refine the determinations of the visibility of pixels within the light field from the reference viewpoint. In a number of embodiments, the scene dependent geometric shifts are utilized to compare the similarity of pixels. Assuming the depth of a pixel from the reference viewpoint is correctly determined, then the similarity of the pixels is indicative of whether the pixel is visible. A similar pixel is likely to be the pixel observed from the reference viewpoint shifted due to disparity. If the pixels are dissimilar, then the pixel observed from the reference viewpoint is likely occluded in the second image. In many embodiments, visibility information is utilized in further updating depth maps. In several embodiments, visibility information is generated and provided along with the depth map for use in super-resolution processing.

In a number of embodiments, the computational complexity of generating depth maps is reduced by generating a sparse depth map that includes additional depth estimates in regions where additional depth information is desirable such as (but not limited to) regions involving depth transitions and/or regions containing pixels that are occluded in one or more images within the light field.

Many array cameras capture color information using different cameras (see for example the array cameras disclosed in U.S. patent application Ser. No. 12/935,504). In many embodiments, the viewpoint of a Green camera is utilized as the reference viewpoint. An initial depth map can be generated using the images captured by other Green cameras in the array camera and the depth map used to determine the visibility of Red, Green, and Blue pixels within the light field. In other embodiments, image data in multiple color channels can be utilized to perform depth estimation. In several embodiments, the similarity of corresponding pixels in each color channel is considered when estimating depth. In a number of embodiments, the similarity of sets of corresponding pixels in different color channels is also considered when estimating depth. Depth estimation using various cost functions that consider the similarity of corresponding pixels at specific depths in a single spectral channel, in multiple spectral channels, and/or across spectral channels in accordance with embodiments of the invention are discussed further below.

In several embodiments, the array camera can include one or more cameras that capture image data in multiple color channels. For example, an array camera may include one or more cameras that have a Bayer color filter pattern, in addition to or as an alternative to monochrome cameras. When the viewpoint of a camera that captures multiple color channels is used as the reference viewpoint for the purpose of generating a depth map, a depth map and visibility information can be determined for each color channel captured from the reference viewpoint. When a reference image contains information concerning multiple color channels, depth and visibility information can be more reliably created based upon the disparity of the pixels in the light field with respect to the reference image than by registering the pixels in one channel with respect to the depth and visibility of pixels in another color channel. A disadvantage of utilizing the viewpoint of a camera that captures image data in multiple color channels as a reference viewpoint is that the resolution of the depth information in each of the captured color channels is reduced relative to a camera that captures image data using the same number of pixels in a single channel. Accordingly, the configuration of the array camera and the selection of the viewpoint to utilize as the reference viewpoint typically depend upon the requirements of a specific application.

Once a depth map and visibility information are generated for the pixels in the light field, the depth map and visibility information can be provided to a super-resolution processing pipeline in accordance with embodiments of the invention to synthesize a higher resolution 2D image of the scene. The depth map can be utilized to correct for parallax between the different low resolution images and visibility information can be utilized during fusion to prevent the fusion of occluded pixels (i.e. pixels in an alternate view image that are not visible from the reference viewpoint). In several embodiments, the process of generating a depth map also includes generating a confidence map that includes confidence metrics for the depth estimates in the depth map. In several embodiments, the depth metrics encode at least one confidence factor indicative of the reliability of the corresponding depth estimate. In a number of embodiments, the confidence metric includes at least a confidence factor based on the signal to noise ratio (SNR) in the region of the pixel location with which the depth estimate is associated, and a confidence factor based upon the number of pixels in a set of images that correspond to the pixel location with which the depth map is associated that were utilized to generate the depth estimate and/or are occluded. Systems and methods for detecting and correcting disparity in images captured by array cameras in accordance with embodiments of the invention are described below. Before discussing the detection and correction of parallax, however, various array cameras in accordance with embodiments of the invention are discussed.

Array Camera Architecture

Array cameras in accordance with embodiments of the invention can include a camera module including an array of cameras and a processor configured to read out and process image data from the camera module to synthesize images. An array camera in accordance with an embodiment of the invention is illustrated in FIG. 1. The array camera 100 includes a camera module 102 with an array of individual cameras 104 where an array of individual cameras refers to a plurality of cameras in a particular arrangement, such as (but not limited to) the square arrangement utilized in the illustrated embodiment. The camera module 102 is connected to the processor 108. The processor is also configured to communicate with one or more different types of memory 110 that can be utilized to store image data and/or contain machine readable instructions utilized to configure the processor to perform processes including (but not limited to) the various processes described below. In many embodiments, the memory contains an image processing application that is configured to process a light field comprising a plurality of images to generate a depth map(s), a visibility map(s), a confidence map(s), and/or a higher resolution image(s) using any of the processes outlined in detail below. As is discussed further below, a depth map typically provides depth estimates for pixels in an image from a reference viewpoint (e.g. a higher resolution image synthesized from a reference viewpoint). A variety of visibility maps can be generated as appropriate to the requirements of specific applications including (but not limited to) visibility maps indicating whether pixel locations in a reference image are visible in specific images within a light field, visibility maps indicating whether specific pixels in an image within the light field are visible from the reference viewpoint, and visibility maps indicating whether a pixel visible in one alternate view image is visible in another alternate view image. In other embodiments, any of a variety of applications can be stored in memory and utilized to process image data using the processes described herein. In several embodiments, processes in accordance with embodiments of the invention can be implemented in hardware using an application specific integration circuit, and/or a field programmable gate array, or implemented partially in hardware and software.

Processors 108 in accordance with many embodiments of the invention are configured using appropriate software to take the image data within the light field and synthesize one or more high resolution images. In several embodiments, the high resolution image is synthesized from a reference viewpoint, typically that of a reference focal plane 104 within the sensor 102. In many embodiments, the processor is able to synthesize an image from a virtual viewpoint, which does not correspond to the viewpoints of any of the focal planes 104 in the sensor 102. The images in the light field will include a scene-dependent disparity due to the different fields of view of the focal planes used to capture the images. Processes for detecting and correcting for disparity are discussed further below. Although a specific array camera architecture is illustrated in FIG. 1, alternative architectures can also be utilized in accordance with embodiments of the invention.

Array Camera Modules

Array camera modules in accordance with embodiments of the invention can be constructed from an imager array or sensor including an array of focal planes and an optic array including a lens stack for each focal plane in the imager array. Sensors including multiple focal planes are discussed in U.S. patent application Ser. No. 13/106,797 entitled “Architectures for System on Chip Array Cameras”, to Pain et al., the disclosure of which is incorporated herein by reference in its entirety. Light filters can be used within each optical channel formed by the lens stacks in the optic array to enable different cameras within an array camera module to capture image data with respect to different portions of the electromagnetic spectrum (i.e. within different spectral channels).

An array camera module in accordance with an embodiment of the invention is illustrated in FIG. 1A. The array camera module 150 includes an imager array 152 including an array of focal planes 154 along with a corresponding optic array 156 including an array of lens stacks 158. Within the array of lens stacks, each lens stack 158 creates an optical channel that forms an image of the scene on an array of light sensitive pixels within a corresponding focal plane 154. Each pairing of a lens stack 158 and focal plane 154 forms a single camera 104 within the camera module. Each pixel within a focal plane 154 of a camera 104 generates image data that can be sent from the camera 104 to the processor 108. In many embodiments, the lens stack within each optical channel is configured so that pixels of each focal plane 158 sample the same object space or region within the scene. In several embodiments, the lens stacks are configured so that the pixels that sample the same object space do so with sub-pixel offsets to provide sampling diversity that can be utilized to recover increased resolution through the use of super-resolution processes. The term sampling diversity refers to the fact that the images from different viewpoints sample the same object in the scene but with slight sub-pixel offsets. By processing the images with sub-pixel precision, additional information encoded due to the sub-pixel offsets can be recovered when compared to simply sampling the object space with a single image.

In the illustrated embodiment, the focal planes are configured in a 5×5 array. Each focal plane 154 on the sensor is capable of capturing an image of the scene. Typically, each focal plane includes a plurality of rows of pixels that also forms a plurality of columns of pixels, and each focal plane is contained within a region of the imager that does not contain pixels from another focal plane. In many embodiments, image data capture and readout of each focal plane can be independently controlled. In this way, image capture settings including (but not limited to) the exposure times and analog gains of pixels within a focal plane can be determined independently to enable image capture settings to be tailored based upon factors including (but not limited to) a specific color channel and/or a specific portion of the scene dynamic range. The sensor elements utilized in the focal planes can be individual light sensing elements such as, but not limited to, traditional CIS (CMOS Image Sensor) pixels, CCD (charge-coupled device) pixels, high dynamic range sensor elements, multispectral sensor elements and/or any other structure configured to generate an electrical signal indicative of light incident on the structure. In many embodiments, the sensor elements of each focal plane have similar physical properties and receive light via the same optical channel and color filter (where present). In other embodiments, the sensor elements have different characteristics and, in many instances, the characteristics of the sensor elements are related to the color filter applied to each sensor element.

In several embodiments, color filters in individual cameras can be used to pattern the camera module with π filter groups as further discussed in U.S. Provisional Patent Application No. 61/641,165 entitled “Camera Modules Patterned with pi Filter Groups” filed May 1, 2012, the disclosure of which is incorporated by reference herein in its entirety. These cameras can be used to capture data with respect to different colors, or a specific portion of the spectrum. In contrast to applying color filters to the pixels of the camera, color filters in many embodiments of the invention are included in the lens stack. Any of a variety of color filter configurations can be utilized including the configuration in FIG. 1C including eight Green cameras, four Blue cameras, and four Red cameras, where the cameras are more evenly distributed around the center of the camera. For example, a Green color camera can include a lens stack with a Green light filter that allows Green light to pass through the optical channel. In many embodiments, the pixels in each focal plane are the same and the light information captured by the pixels is differentiated by the color filters in the corresponding lens stack for each filter plane. Although a specific construction of a camera module with an optic array including color filters in the lens stacks is described above, camera modules including π filter groups can be implemented in a variety of ways including (but not limited to) by applying color filters to the pixels of the focal planes of the camera module similar to the manner in which color filters are applied to the pixels of a conventional color camera. In several embodiments, at least one of the cameras in the camera module can include uniform color filters applied to the pixels in its focal plane. In many embodiments, a Bayer filter pattern is applied to the pixels of one of the cameras in a camera module. In a number of embodiments, camera modules are constructed in which color filters are utilized in both the lens stacks and on the pixels of the imager array.

Although specific array cameras and imager arrays are discussed above, many different array cameras can be utilized to capture image data and synthesize images in accordance with embodiments of the invention. Systems and methods for detecting and correcting parallax in image data captured by an array camera in accordance with embodiments of the invention are discussed below.

Determining Parallax/Disparity

In a number of embodiments, the individual cameras in the array camera used to capture the light field have similar fields of view, fixed apertures, and focal lengths. As a result, the cameras tend to have very similar depth of field. Parallax in a two camera system is illustrated in FIG. 2. The two cameras 200, 202, include a lens stack 204 and a focal plane 206. Each camera has a back focal length f, and the two cameras are separated by the baseline distance of 2h. The field of view of both cameras encompasses a scene including a foreground object 208 and a background object 210. The scene from the viewpoint of the first camera 200 is illustrated in FIG. 3A. In the image 300 captured by the first camera, the foreground object 208 appears located slightly to the right of the background object 210. The scene from the viewpoint of the second camera 202 is illustrated in FIG. 3B. In the image 302 captured by the second camera, the foreground object 208 appears shifted to the left hand side of the background object 210. The disparity introduced by the different fields of view of the two cameras 200, 202, is equal to the difference between the location of the foreground object 208 in the image captured by the first camera (indicated in the image captured by the second camera by ghost lines 304) and its location in the image captured by the second camera. As is discussed further below, the distance from the two cameras to the foreground object can be obtained by determining the disparity of the foreground object in the two captured images.

Referring again to FIG. 2, the point (x_o, y_o, z_o) on the foreground object will appear on the focal plane of each camera at an offset from the camera's optical axis. The offset of the point on the focal plane of the first camera 200 relative to its optical axis 212 is shown as −u_L. The offset of the point on the focal plane of the second camera 202 relative to its optical axis 214 is shown as u_R. Using similar triangles, the offset between the images captured by the two cameras can be observed as follows:

$\frac{h - x_{o}}{z_{o}} = \frac{- u_{L}}{f}$

$\frac{h + x_{o}}{z_{o}} = \frac{u_{R}}{f}$

Combining the two equations yields the disparity (or parallax) between the two cameras as:

$Δ_{parallax} = u_{R} - u_{L} = \frac{2 h f}{z_{o}}$

From the above equation, it can be seen that disparity between images captured by the cameras is along a vector in the direction of the baseline of the two cameras, which can be referred to as the epipolar line between the two cameras. Furthermore, the magnitude of the disparity is directly proportional to the baseline separation of the two cameras and the back focal length of the cameras and is inversely proportional to the distance from the camera to an object appearing in the scene.

Occlusions in Array Cameras

When multiple images of a scene are captured from different perspectives and the scene includes foreground objects, the disparity in the location of the foreground object in each of the images results in portions of the scene behind the foreground object being visible in some but not all of the images. A pixel that captures image data concerning a portion of a scene, which is not visible in images captured of the scene from other viewpoints, can be referred to as an occluded pixel. Referring again to FIGS. 3A and 3B, when the viewpoint of the second camera is selected as a reference viewpoint the pixels contained within the ghost lines 304 in the image 302 can be considered to be occluded pixels (i.e. the pixels capture image data from a portion of the scene that is visible in the image 302 captured by the second camera 202 and is not visible in the image 300 captured by the first camera 200). The pixels contained in the ghost line 306 in the first image can be considered to be revealed pixels (i.e. pixels that are not visible in the reference viewpoint, but that are revealed by shifting to an alternate viewpoint). In the second image, the pixels of the foreground object 208 can be referred to as occluding pixels as they capture portions of the scene that occlude the pixels contained within the ghost lines 304 in the image 302. Due to the occlusion of the pixels contained within the ghost lines 304 in the second image 302, the distance from the camera to portions of the scene visible within the ghost lines 304 cannot be determined from the two images as there are no corresponding pixels in the image 300 shown in FIG. 3A.

As is discussed further below, increasing the number of cameras capturing images of a scene from different viewpoints in complementary occlusion zones around the reference viewpoint increases the likelihood that every portion of the scene visible from the reference viewpoint is also visible from the viewpoint of at least one of the other cameras. When the array camera uses different cameras to capture different wavelengths of light (e.g. RGB), distributing at least one camera that captures each wavelength of light in the quadrants surrounding a reference viewpoint can significantly decrease the likelihood that a portion of the scene visible from the reference viewpoint will be occluded in every other image captured within a specific color channel. The distribution of color filters in array cameras to reduce the likelihood of occlusions in accordance with embodiments of the invention is discussed further in U.S. Provisional Patent Application Ser. No. 61/641,164 entitled “Camera Modules Patterned with π Filter Groups”, to Nisenzon et al., filed May 1, 2012, the disclosure of which is incorporated herein by reference in its entirety.

Using Disparity to Generate Depth Maps in Array Cameras

Array cameras in accordance with many embodiments of the invention use disparity observed in images captured by the array cameras to generate a depth map. A depth map is typically regarded as being a layer of metadata concerning an image that describes the distance from the camera to specific pixels or groups of pixels within the image (depending upon the resolution of the depth map relative to the resolution of the original input images). Array cameras in accordance with a number of embodiments of the invention use depth maps for a variety of purposes including (but not limited to) generating scene dependent geometric shifts during the synthesis of a high resolution image and/or performing dynamic refocusing of a synthesized image.

Based upon the discussion of disparity above, the process of determining the depth of a portion of a scene based upon pixel disparity is theoretically straightforward. When the viewpoint of a specific camera in the array camera is chosen as a reference viewpoint, the distance to a portion of the scene visible from the reference viewpoint can be determined using the disparity between the corresponding pixels in some or all of the images captured by the camera array. In the absence of occlusions, a pixel corresponding to a pixel in the image captured from the reference viewpoint will be located in each non-reference or alternate view image along an epipolar line (i.e. a line parallel to the baseline vector between the two cameras). The distance along the epipolar line of the disparity corresponds to the distance between the camera and the portion of the scene captured by the pixels. Therefore, by comparing the pixels in the captured images that are expected to correspond at a specific depth, a search can be conducted for the depth that yields the pixels having the highest degree of similarity. The depth at which the corresponding pixels in the captured images have the highest degree of similarity can be selected as the most likely distance between the camera and the portion of the scene captured by the pixel. As is discussed below, similarity can be determined with respect to corresponding pixels within a single spectral channel, within multiple spectral channels, and/or across spectral channels as appropriate to the requirements of specific applications in accordance with embodiments of the invention.

Many challenges exist, however, in determining an accurate depth map using the method outlined above. In several embodiments, the cameras in an array camera are similar but not the same. Therefore, image characteristics including (but not limited to) optical characteristics, different sensor characteristics (such as variations in sensor response due to offsets, different transmission or gain responses, non-linear characteristics of pixel response), noise in the captured images, and/or warps or distortions related to manufacturing tolerances related to the assembly process can vary between the images reducing the similarity of corresponding pixels in different images. In addition, super-resolution processes rely on sampling diversity in the images captured by an imager array in order to synthesize higher resolution images. However, increasing sampling diversity can also involve decreasing similarity between corresponding pixels in captured images in a light field. Given that the process for determining depth outlined above relies upon the similarity of pixels, the presence of photometric differences and sampling diversity between the captured images can reduce the accuracy with which a depth map can be determined.

The generation of a depth map is further complicated by occlusions. As discussed above, an occlusion occurs when a pixel that is visible from the reference viewpoint is not visible in one or more of the captured images. The effect of an occlusion is that at the correct depth, the pixel location that would otherwise be occupied by a corresponding pixel is occupied by a pixel sampling another portion of the scene (typically an object closer to the camera). The occluding pixel is often very different to the occluded pixel. Therefore, a comparison of the similarity of the pixels at the correct depth is less likely to result in a significantly higher degree of similarity than at other depths. Effectively, the occluding pixel acts as a strong outlier masking the similarity of those pixels, which in fact correspond at the correct depth. Accordingly, the presence of occlusions can introduce a strong source of error into a depth map.

Processes for generating depth maps in accordance with many embodiments of the invention attempt to minimize sources of error that can be introduced into a depth map by sources including (but not limited to) those outlined above. A general process for generating a depth map in accordance with an embodiment of the invention is illustrated in FIG. 4. The process 400 involves capturing (402) a light field using an array camera. In a number of embodiments, a reference viewpoint is selected (404). In many embodiments, the reference viewpoint is predetermined. In several embodiments, the reference viewpoint can be determined based upon the captured light field or a specific operation requested by a user of the array camera (e.g. generation of a stereoscopic 3D image pair). Prior to determining a depth map, the raw image data is normalized (406) to increase the similarity of corresponding pixels in the captured images. In many embodiments, normalization involves utilizing calibration information to correct for variations in the images captured by the cameras including (but not limited to) photometric variations and scene-independent geometric distortions introduced by each camera's lens stack. In several embodiments, the normalization of the raw image data also involves pre-filtering to reduce the effects of aliasing and noise on the similarity of corresponding pixels in the images, and/or rectification of the image data to simplify the geometry of the parallax search. The filter can be a Gaussian filter or an edge-preserving filter, a fixed-coefficient filter (box) and/or any other appropriate filter. In a number of embodiments, normalization also includes resampling the captured images to increase the similarity of corresponding pixels in the captured images by correcting for geometric lens distortion, for example. Processes performed during the normalization or raw image data in accordance with embodiments of the invention are discussed further below.

An initial depth map is determined (408) for the pixels of an image captured from the reference viewpoint. The initial depth map is used to determine (410) likely occlusion zones and the depths of pixels in the occlusion zones are updated (412) by determining the depths of the pixels in occlusion zones using images in which a corresponding pixel is visible. As is discussed further below, depth estimates can be updated using competing subsets of images corresponding to different visibility patterns encountered in real world scenes. Although a specific sequence is shown in FIG. 4, in many embodiments occlusion zones are detected at the same time the initial depth map is generated.

A normalization process involving resampling the raw image data to reduce scene-independent geometric differences can reduce errors by correcting linear and/or non-linear lens distortion which might otherwise compromise the ability to match corresponding pixels in each of the captured images. In addition, updating the depth map in occlusion zones with depth measurements that exclude occluded pixels further reduces sources of error in the resulting depth map. Although a general process for generating a depth map is illustrated in FIG. 4, variations and alternatives to the illustrated processes for generating depth maps can be utilized in accordance with embodiments of the invention. Processes for calibrating raw image data, determining initial depth maps, and for updating depth maps to account for occlusions in accordance with embodiments of the invention are discussed further below.

Increasing Similarity of Corresponding Pixels in Captured Image Data

The greater the similarity between the images captured by each of the cameras in an array camera, the higher the likelihood that a measurement of corresponding pixels in the images at different hypothesized depths will result in highest similarity being detected at the correct depth. As is disclosed in U.S. patent application Ser. No. 12/935,504 (incorporated by reference above) the images captured by cameras in an array camera typically differ in a number of ways including (but not limited to) variations in the optics from one camera to another can introduce photometric differences, aliasing, noise, and scene-independent geometric distortions. Photometric differences and scene-independent geometric distortions can be corrected through filtering and calibration. Photometric calibration data used to perform photometric normalization and scene-independent geometric corrections that compensate for scene-independent geometric distortions can be generated using an off line calibration process and/or a subsequent recalibration process. The photometric calibration data can be provided to a photometric normalization module or process that can perform any of a variety of photometric adjustments to the images captured by an array camera including (but not limited to) pre-filtering to reduce the effects of aliasing and noise, Black Level calculation and adjustments, vignetting correction, and lateral color correction. In several embodiments, the photometric normalization module also performs temperature normalization. The scene-independent geometric corrections determined using a calibration process can also be applied to the captured images to increase the correspondence between the images. When the captured images are used to synthesize a higher resolution image using super-resolution processing, the scene-independent geometric corrections applied to the images are typically determined at a sub-pixel resolution. Accordingly, the scene-independent geometric corrections are typically determined with a higher degree of precision than the corrections utilized during registration in conventional stereoscopic 3D imaging. In many embodiments, the scene-independent geometric corrections also involve rectification to account for distortion and rotation of the lenses of the array camera relative to the focal planes so that the epipolar lines of the non-reference images are easily aligned with those of the image captured from the reference viewpoint. By normalizing geometrically in this way, the searches performed to determine the depths of corresponding pixels can be simplified to be searches along straight lines in various cameras, and the precision of depth measurements can be improved.

Systems and methods for calibrating array cameras to generate a set of scene-independent geometric corrections and photometric corrections that can be applied to images captured by an array camera in accordance with embodiments of the invention are described in U.S. patent application Ser. No. 61/780,748, entitled “Systems and Methods for Calibration of an Array Camera” to Mullis, Jr., filed Mar. 13, 2013, the disclosure of which is incorporated by reference in its entirety.

In a number of embodiments, the correspondence of the pixels in the captured images is increased by resampling the images to detect objects to sub-pixel precision shifts in the fields of view of the cameras in the array camera.

A process for applying corrections to images captured by an array camera to increase the correspondence between the captured images in accordance with embodiments of the invention is illustrated in FIG. 5. The process 500 includes photometrically normalizing the captured images (502), applying scene-independent geometric corrections (504) to the normalized images. In some embodiments, an additional rectification process (505) is needed to ensure that all cameras are co-planar and parallax search can be reduced to epipolar lines only. The processes shown in FIG. 5 increase the correspondence between the resulting images. Therefore, searches for pixel correspondence between the images are more likely to result in accurate depth measurements.

Although specific processes for increasing the correspondence between images captured by an array camera(s) in accordance with embodiments of the invention are discussed above with respect to FIG. 5, any of a variety of processes that increase the correspondence between the captured images can be utilized prior to generating a depth map in accordance with embodiments of the invention. Processes for generating depth maps in accordance with embodiments of the invention are discussed further below.

Generating a Depth Map

The process of generating a depth map involves utilizing disparity between images to estimate the depth of objects within a scene. As noted above, occlusions can impact the reliability of depth measurements obtained using cost functions in the manner outlined above. Typically such occlusions will manifest themselves as significant mismatches according to the similarity metric used to compare corresponding pixels (potentially masking the similarity of the visible pixels). However, many embodiments of the invention generate an initial depth map and then address any errors that may have been introduced into the creation of the initial depth map by occlusions. In several embodiments, the initial depth map is utilized to identify pixels in the image captured from a reference viewpoint that may be occluded in images captured by the array camera from other viewpoints. When an occlusion is detected, the depth information for the pixel in the image captured from the reference viewpoint can be updated by excluding pixels from the image in which the pixel is occluded from the similarity comparisons. In several embodiments, depth estimates impacted by occlusions can be updated using competing subsets of images corresponding to different visibility patterns encountered in real world scenes. In certain embodiments, the updated depth estimates can be utilized to identify corresponding pixels that are occluded and the depth estimation process iterated using the visibility information so that the impact of occlusions on the precision of the depth map can be reduced. In several embodiments, the process of generating updated depth estimates using subsets of images is sufficiently robust that the need to iteratively refine the depth map and visibility estimates can be reduced or eliminated.

A process for determining depth of pixels in an image captured from a reference viewpoint in accordance with an embodiment of the invention is illustrated in FIG. 6. The process 600 includes determining (602) an initial depth map using some or all of the images captured by an array camera. The visibility of each pixel in the image captured from the reference viewpoint in each of the captured images is then determined (604). Where a corresponding pixel location is occluded, the depth of the pixel in the image captured from the reference viewpoint can be recalculated (606) excluding the image in which the corresponding pixel location is occluded from the cost function. A decision (608) is made concerning whether to continue to iterate. As depth measurements in occlusion zones are refined, additional information is obtained concerning the visibility of pixels within the occlusion zones in each of the captured images. Therefore, repeating the recalculation of the depths of pixels in the occlusion zones as the visibility information is refined can iteratively improve the precision of the depth map. Any of a variety of termination conditions appropriate to a specific application can be utilized to determine when to terminate the iterative loop including (but not limited to) the completion of a predetermined number of iterations and/or the number of pixels for which updated depth information is determined in a specific pass through the iterative loop falling below a predetermined number. In several embodiments, a single iteration only is performed due to the exploitation of subsets of the set of images corresponding to real world visibility patterns to update depth estimates generated using mismatched pixels.

Once a finalized depth map is obtained, the visibility of each of the pixels in the captured images is determined (610) and the depth map and/or visibility information can be utilized for a variety of purposes including but not limited to the synthesis of a high resolution image using super-resolution processing.

The computational complexity of a process similar to the process illustrated in FIG. 7 depends on the number of images compared when performing depth determinations. The further a camera is from the reference viewpoint the larger the disparity that will be observed. In addition, the furthest cameras in the array encompass all the other cameras within their envelope. Typically larger magnitude shifts enable depth to be determined with greater precision. Therefore, using a camera that captures an image from a reference viewpoint and the cameras that are furthest from that camera to determine depth information can improve precision of the detected depth. In addition, using an aggregated cost originating from cameras with various baselines and directions can significantly improve reliability of depth estimates due to increased likelihood of reducing periodicity in matches. In the case of a 5×5 array (see FIG. 7), the central Green camera (700) can be utilized to capture an image from a reference viewpoint and the image data captured by the central camera can be compared to image data captured by the Green cameras (702) located in the four corners of the array to determine depth. In other arrays, images captured by any of a variety of combinations of cameras can be utilized to determine depth in accordance with embodiments of the invention. As is discussed further below, selection of specific subsets of cameras can decrease the likelihood that a pixel in a reference image will be occluded in image data captured by other cameras in the subset.

Although a specific process for generating a depth map and/or visibility information in accordance with an embodiment of the invention is illustrated in FIG. 6, any of a variety of processes can be utilized that involve determining an initial depth map and then refining the depth map by detecting occluded pixels and updating the depth measurements to exclude occluded pixels. Specific processes for determining depth and visibility of pixels in accordance with embodiments of the invention are discussed further below.

Determining an Initial Depth Map

Processes for determining the distance from an array camera to an object in a scene involve locating the depth at which corresponding pixels in images captured by the array camera have the highest degree of similarity. As discussed above, at a specific depth a pixel in an image captured from a reference viewpoint will shift a known distance along an epipolar line between the reference viewpoint and each of the cameras in the camera array. The pixel in the image captured from the reference viewpoint and the “shifted” pixels in the other images (i.e. the pixels in the images located in locations determined based upon the anticipated shift for a specific distance) are the corresponding pixels. When a hypothesized depth is incorrect, the corresponding pixels may exhibit very little similarity (although in some scenes incorrect depths have high degrees of similarity due to features such as periodic texture). When the hypothesized depth is correct, the corresponding pixels will ideally exhibit the highest degree of similarity of any of the hypothesized depths. When a depth map is used in super-resolution processing of a captured light field, a depth map can be determined with sufficient precision to enable detection of sub-pixel shifts. In super-resolution processing, it is the scene-dependent shifts that are utilized and not the depth directly. Therefore, the ability to detect depth corresponding to sub-pixel shift precision can significantly improve the performance of the super-resolution processing. The manner in which resampling of the pixels of the captured images can be utilized to determine depth with sub-pixel shift precision is discussed further below.

In many embodiments, a parallax search of a number of depths within a range in physical distance (e.g. 20 cm to infinity) is utilized to inform the disparities searched when performing depth estimation. The search range can be divided into a number of depth indices such that the parallax shifts between consecutive depth indices is constant in pixels for a particular image and is set based upon a minimum sub-pixel precisions as measured for the images captured by cameras in the array corresponding to the largest baselines with respect to the reference viewpoint (see for example FIG. 7). This increases the likelihood of sufficient accuracy in the depth estimates for use as inputs to a super-resolution process. In other embodiments, consecutive depth indices need not correspond to constant pixel shifts and the depth search can adapt based upon the characteristics of the scene.

In several embodiments, a cost function is utilized to determine the similarity of corresponding pixels. The specific cost function that is used typically depends upon the configuration of the array camera, the number of images captured by the array camera, and the number of color channels utilized by the array camera. In a number of embodiments, the array camera includes a single color channel and/or a depth map is generated using cameras within a single color channel. Where image data from within a single color channel is utilized to generate the depth map, a cost function can be utilized that measures the variance of the corresponding pixels. In several embodiments, sums of L1 norms, L2 norms, or some other metrics can be used. For example, the aggregation of similarity metrics with respect to a target (typically reference but non-reference may also be used). The smaller the variance, the greater the similarity between the pixels.

Image data from multiple spectral channels can also be utilized to generate a depth map. In several embodiments, the depth at a given pixel location is estimated by looking at the similarity of corresponding pixels from images within each of the spectral channels. In a number of embodiments, the process of determining the depth at a given pixel location also involves using information concerning the similarity of corresponding pixels from images across different spectral channels. Cost functions that can be utilized when generating a depth map using image data captured using multiple color channels include (but are not limited to) L1 norms, L2 norms, or a combination of L1 and L2 norms, of the combinations of image data from the different color channels and/or the variance/standard deviation of corresponding pixels within multiple individual color channels. In other embodiments, truncated versions of the L1 and L2 norms and/or any block-based similarity measure based on rank, census, correlation, and/or any other appropriate metric such as those practiced in multiview stereo disparity detection techniques can be utilized.

As is discussed further below, many embodiments of the invention utilize subsets of cameras including cameras from multiple color channels grouped based upon characteristics of natural scenes when determining the depth of a pixel location in an image form a reference viewpoint to decrease the likelihood that a given pixel location is occluded in the alternate view images captured by the other cameras in the subset of cameras. Where an array camera utilizes a Bayer filter in the camera that captures an image from the reference viewpoint, then a variety of cost functions can be utilized to determine pixel similarity including (but not limited to) cost functions that measure the combination of Red variance, Green variance, and Blue variance. In addition, different cost functions can be applied to the pixels in different regions of an image. In several embodiments, a depth map is generated from image data captured by a central Green camera and a cluster of Red, Blue and Green cameras in each of the four corners of a camera array using this technique (see for example FIG. 7).

A process for determining the depth of a pixel using images captured by an array camera in accordance with an embodiment of the invention is illustrated in FIG. 8. The process 800 includes selecting (802) an initial hypothesized depth or distance d for a selected pixel from an image captured from a reference viewpoint. Based upon the location of the pixel within the reference image and information concerning the baseline between the reference viewpoint and the viewpoints of the other cameras used to perform the depth measurement, the corresponding pixel locations in each of the captured images at the hypothesized depth d are determined (804). In many embodiments, the input images to the parallax detection process are not geometrically corrected, and the geometric correction is applied on-the-fly by adding a vector offset to the parallax shift during the search to identify corresponding pixels at a given depth d. In other embodiments, the geometric correction is applied to the images before the search commences during a normalization process and no geometric correction vector must be added during the search when calculating pixel correspondences (i.e. the geometric corrections are pre-calculated). In the latter case, the pre-correction of geometric distortion can make the algorithm significantly more efficient on parallel processors such as SIMD and GPUs.

As noted above, occlusions can introduce errors into depth estimates. When occlusion/visibility information is available, occluded pixels can be disregarded (806) as part of the depth measurement. When information concerning the visibility of pixels is not available (e.g. during the generation of an initial depth map and/or during the generation of a depth estimate using a subset of images), the similarity of all of the pixels in the corresponding pixel locations is used to determine depth. As is discussed below with reference to FIGS. 8A-8I, initial depth searches can be performed with respect to image data captured from subsets of images captured by the array camera to identify a specific subset of cameras in which a given pixel in the reference image is visible.

When the corresponding pixels have been identified, the similarity of the corresponding pixels can be measured (808). In many embodiments, the similarity of the pixels is determined using a cost function. The specific cost function utilized depends upon the pixel information that is compared. As noted above, in one embodiment, when pixels from a single color channel are compared the cost function can consider L1 norms, L2 norms, and/or the variance of corresponding pixels. When pixels from multiple color channels are compared, more complex cost functions can be utilized including (but not limited to) cost functions that incorporate the L1 and/or L2 norms of the image data from multiple color channels and/or the variance/standard deviation of corresponding pixels within multiple individual color channels. In other embodiments, truncated versions of the L1 and L2 norms and/or any block-based similarity measure based on rank, census, correlation, and/or any other appropriate metric such as those practiced in multiview stereo disparity detection techniques can be utilized. In several embodiments, the process of determining similarity utilizing a cost function involves spatially filtering the calculated costs using a filter such as (but not limited to) a fixed-coefficient filter (such as a Gaussian filter), or in an alternative embodiment, an edge-preserving filter. In the latter embodiment, filtering with an edge-preserving filter in this way is a form of adaptive support that utilizes information from photometrically similar neighboring pixels to improve the depth estimates. Without the filtering the depth measurements are pixel-wise and are noisier than if they are filtered. Smoothing the cost function using adaptive support can prevent the generation of incorrect depths. In a number of embodiments, the calculated costs are spatially filtered using a bilateral filter, where the bilateral filter weights are determined from the reference image but, in contrast to a normal bilateral filter, the resulting filter weights applied to the calculated costs and not the reference image. In this way the reference image data can be used as a guide to improve the denoising of the cost estimates. In a number of embodiments, a box filter and/or any other filter appropriate to the requirements of a specific application can be utilized.

The calculation of the cost function for corresponding pixels at different depths is repeated sweeping across a range of hypothesized depths (812) until the depth search is complete (810). The most likely depth can then be determined (814) as the hypothesized depth at which the (filtered) cost function indicates that the corresponding pixels have the highest level of similarity. In several embodiments, for a given depth computation early termination can occur if a single camera shows a very high mismatch. In this condition, the process can skip onto the next hypothesized depth since match at the current depth would be unacceptable. In many embodiments, the process of performing depth sampling (i.e. comparing pixels in alternate view images based upon the disparity at a specific depth) involves sampling depth uniformly in disparity space. Stated another way, depth samples can be taken at uniform pixel shifts along an epipolar line. In a number of embodiments, the search does not involve uniform sampling in disparity space. In several embodiments, the search exploits image characteristics to increase the efficiency of the search. In several embodiments, the search uses prior information about where objects are in the scene, such as from a coarser or lower spatial resolution depth map or reduced search resolution in disparity (e.g. from an image preview), to determine or restrict which depths are sampled in trying to form a higher resolution depth map. For example, a preview depth map may be used to determine that there are no objects beyond a particular distance, in which case for the depth search, no depth samples would be allocated beyond that distance.

Many images exhibit regions of similar color, therefore, the search for the most likely hypothesized depth can be performed intelligently by selecting a first set of hypothesized depths that are more coarsely distributed across the range of possible hypothesized depths and then locating the depth among these that exhibits the highest degree of similarity. A second search can then be performed to refine within a more granular range of depths around the depth that exhibited the highest degree of similarity in the first set of depths. In the event that the more granular search fails and the best pixel found is not from a region exhibiting similar color, a full search can be performed across the entire range of depths at more precise intervals than in the original first coarse search. However, if a satisfactory match is found in the second search, the depth that exhibits the highest level of similarity within the second search can be used as the most likely hypothesized depth.

In many embodiments, searches for the most likely depth of a pixel are performed utilizing depth information determined for adjacent pixels. In several embodiments, the search is performed by searching around the depth of one or more adjacent pixels, by searching around a depth determined based on the depths of adjacent pixels (e.g. based on the average depth of adjacent pixels or based on linear interpolations of pairs adjacent pixels) and/or by searching around a previously identified depth (e.g. a depth determined with respect to a preview image and/or a previous frame in a video sequence). Searching in this way can also simplify the application of spatial filters when determining depth (see discussion below). In other embodiments, any of a variety of techniques can be utilized to reduce the computational complexity of locating the most likely depth of the pixels in an image.

Although specific processes for determining the depth of a pixel in an image captured from a reference viewpoint are discussed above with respect to FIG. 8, any of a variety of processes can be utilized to determine the depth of a pixel including process that determine the depth of a pixel from a virtual viewpoint based upon a plurality of images captured by an array camera. Processes similar to the process illustrated in FIG. 8 can be utilized to generate an initial depth map and then to refine the depth map by ignoring images in which a corresponding pixel to a pixel location in an image from the reference viewpoint is occluded. Processes for determining pixel correspondence using adaptive support in accordance with embodiments of the invention are discussed further below.

Determining Pixel Correspondence in the Presence of Occlusions

Wherever there is a depth transition or discontinuity in the reference viewpoint, pixels adjacent the depth transition are likely to be occluded in at least one of the images captured by the array camera. Specifically, the pixels adjacent to the transition that are further in distance from the camera are likely to be occluded by the pixels adjacent the camera that are closer to the camera. Ideally, a depth map is determined using an aggregated cost function CV(x, y, d) for each visible camera i in the array that excludes occluded pixels as follows:

$CV (x, y, d) = \sum_{i} \frac{{Cost}^{i, Ref} (x, y, d) \times V^{i, Ref} (x, y)}{number of visible cameras at (x, y)}$

where Cost^i,Ref(x, y, d) is a similarity measure (i.e. the cost function),

- d is depth of pixel (x, y), and
- V^i,Ref(x, y) is the visibility of pixel (x, y) and initially V^i,Ref(x, y)=1 for all cameras.

where S is the similarity measure (for example), and

- Iⁱis the calibrated image i after geometric calibration.

In several embodiments, the process of generating an aggregated cost can involve use of images to which the scene-dependent geometric shifts corresponding to a specific hypothesized or candidate depth are applied to all pixels in the image. In this way, a shifted image can be generated for each candidate depth searched. Using the shifted images, an aggregated cost at each depth for a specific pixel location (x, y) in an image from the reference viewpoint can be generated in the manner outlined above utilizing the similarity between the shifted images and the reference image. In addition, the aggregated cost can consider the similarity of the shifted images at the candidate depth as follows:

$CV (x, y, d) = \sum_{k \in K} \frac{(x, y) {Cost}^{k, Ref} (x, y, d) \times V^{k, Ref}}{number of cameras in K} + \sum_{i, j \in L} \frac{{Cost}^{i, j} (x, y, d) \times V^{i, Ref} (x, y) \times V^{j, Ref} (x, y)}{number of pairs of cameras in L}$

Where K is a set of cameras in the same spectral channel as the reference camera,

- L is a set of pairs of cameras, where both cameras in each pair are in the same spectral channel (which can be a different spectral channel to the reference camera where the light field includes image data in multiple spectral channels),
  
  Cost^k,Ref(x,y,d)=S{ImageRef(x,y),ShiftedImage^k(x,y,d)}, and
  Cost^i,j(x,y,d)=S{ShiftedImageⁱ(x,y,d),ShiftedImage^j(x,y,d)}

In a number of embodiments, the sets K and L do not necessarily contain all cameras or pairs of cameras that satisfy the requirements in K and L. Furthermore, the cumulative cost function can also be constructed using a cost term in which the set of L includes arbitrarily large groups of cameras for which the cost of corresponding pixels is determined. In many embodiments, the similarity metric S is the L1 norm. In several embodiments, the similarity metric can be any of a number of well known similarity metrics including (but not limited to) the L2 norm, the variance or standard deviation of the corresponding pixels (particularly where L includes larger groups of cameras) window-based similarity metrics incorporating correlation, rank, census and/or any other measure appropriate to the requirements of a specific application. Although comparisons are discussed above in the context of shifted images, as can be readily appreciated comparisons can be performed by applying shifts to individual pixel locations and comparing corresponding pixels at a hypothesized depth (as opposed to applying shifts to all pixels in an image and then comparing the shifted images).

In a number of embodiments, the cost function can also consider similarity between corresponding pixels across different spectral channels. In several embodiments, the similarity of neighborhoods of pixels in pixels from different spectral channels can be evaluated using any of a variety of metrics including (but not limited to) the cross-correlation of the pixels in the neighborhoods, the normalized cross-correlation between the pixels in the neighborhoods and/or any other metric for measuring the similarity of the relative values of two sets of pixels such as (but not limited to) entropy measures including measuring mutual information.

In several embodiments, different weightings can be applied to the similarity of corresponding pixels within a spectral channel containing a reference image and the reference image, the similarity of corresponding pixels within alternate view images in the same spectral channel, and/or the similarity of corresponding pixels within images in different spectral channels.

As discussed above, the aggregated cost function can be spatially filtered as follows:

FilteredCV(x,y,d)=Filter_x_n_y_n_∈N(x,y){Cost(x_n,y_n,d)}

where the Filter is applied in a neighborhood N(x, y) surrounding pixel location (x, y).

The filter can be a simple 3×3 or N×N box filter or some other filter including (but not limited to) a joint bilateral filter that uses the reference image as guidance, a fixed coefficient filter (such as a Gaussian filter, or a box filter), or any appropriate edge preserving filter. In several embodiments, the weighted aggregated cost function is as follows:

$FilteredCV (x, y, d) = \frac{1}{Norm} \sum_{(x_{1}, y_{1}) \in N (x, y)} CV (x_{1}, y_{1}, d) \times wd (x, y, x_{1}, y_{1}) \times wr (I_{Ref} (x, y) - I_{Ref} (x_{1}, y_{1}))$

where N(x, y) is the immediate neighborhood of the pixel (x, y), which can be square, circular, rectangular, or any other shape appropriate to the requirements of a specific application,

- Norm is a normalization term,
- I_Ref(x, y) is the image data from the reference camera,
- wd is a weighting function based on pixel distance, and
- wr is a weighting function based on intensity difference.

In many embodiments, the filter is a bilateral filter and wd and wr are both Gaussian weighting functions.

Based upon the filtered aggregated cost function, a depth map can be computed by selecting the depth that minimizes the filtered cost at each pixel location in the depth map as follows:

D(x,y)=argmin_d{FilteredCV(x,y,d)}

When the aggregated cost function is filtered using an edge preserving filter in the manner outlined above, the likelihood that noise will result in the incorrect detection of occluded pixels is reduced. Instead of computing depths for individual pixels, an adaptive support window is used around each pixel to filter noise in a manner that preserves depth transitions. Utilizing a filter such as (but not limited to) a bilateral filter provides an adaptive window of support that adapts based upon the content. In many embodiments, a bilateral filter is used in which the reference image is used to define the spatial and range support for the bilateral filter (i.e. the parameters that define the size of the window of pixels that contribute to the aggregated cost function for a specific pixel). As a result, smoothing of the cost function of a pixel can be achieved using the calculated cost function of pixels that are part of the same surface. In other embodiments, filters such as (but not limited to) box filters are less computationally complex and provide sufficient filtering for the requirements of specific applications.

Determining Pixel Correspondence for Pixels in Multiple Spectral Channels

Array cameras in accordance with many embodiments of the invention include cameras in multiple spectral channels such as, but not limited to, Red, Green and Blue cameras. The cost metric CV(x, y, d) is described above in the context of a single spectral channel and multiple spectral channels. In the case of an array camera including Red, Green, and Blue cameras, the cost function can consider the similarity of pixels in the Green cameras, the similarity in pixels in the Red cameras, and the similarity of pixels in the Blue cameras at a particular depth. Where a camera in a specific color channel is chosen as the reference camera (e.g. a Green camera), pixels in the other channels (e.g. Red and Blue cameras) are difficult to directly compare to pixels in the reference image. However, the disparity at a particular depth can be determined and the intensity values of corresponding pixels in other color channels can be compared. Incorporating these additional comparisons into the depth estimate can improve depth estimates by utilizing information across all color channels. Various cost functions that can be utilized to perform depth estimation in array cameras that include Red, Green, and Blue cameras are discussed further below. As can be readily appreciated, however, the same cost functions can be utilized with respect to any set of spectral channels in accordance with embodiments of the invention.

In several embodiments, image data is captured using an array camera including Red, Green and Blue cameras and a Green camera is selected as a reference camera. A cost function can be utilized that considers pixel correspondence between pixels in a set of Green cameras, between pixels in a set of Red cameras, and between pixels in a set of Blue cameras when determining depth estimates. In several embodiments, the following cost function can be utilized:

Cost(x,y,d)=γ_G(x,y).Cost_G(x,y,d)+γ_R(x,y).Cost_R(x,y,d)+γ_B(x,y).Cost_B(x,y,d)

where Cost_G(x, y, d) is the measure the similarity of pixels in locations within a set of Green cameras determined based upon the depth d and the location of the pixel (x, y) in the reference Green camera,

- Cost_R(x, y, d) is the measure of the similarity of corresponding pixels in locations within a set of Red cameras determined based upon the depth d and the location of the pixel (x, y) in the reference Green camera,
- Cost_B(x, y, d) is the measure of the similarity of corresponding pixels in locations within a set of Blue cameras determined based upon the depth d and the location of the pixel (x, y) in the reference Green camera, and
- γ_G, γ_R, and γ_Bare weighting factors for the Green, Red and Blue cost functions respectively which may be constants for the entire reference viewpoint, or may vary spatially.

The spatial weighting may depend on the captured image data (for example using edge gradients), may correct or use known properties of the sensor (for example using a noise model prior for a given sensor to calculate SNR), as well as properties of the cost function (which is another case where the spatial weighting depends on the image data). Additionally, imaging parameters utilized during the capture of image data can also be considered in determining the weightings, such as (but not limited to) the gain or detected light level at which the image is captured, can be used to modulate the weighting factors.

The cost function Cost_G(x, y, d) can be one of the metrics described above. In many embodiments, Cost_G(x, y, d) uses a similarity measure based upon an L1 norm comparing a pixel in an alternate view image with a pixel in the reference image, an L2 norm comparing a pixel in an alternate view image with a pixel in the reference image, and/or variance across the pixels in the set of images captured by the Green cameras. In other embodiments, truncated versions of the L1 and L2 norms and/or any block-based similarity measure based on rank, census, correlation, and/or any other appropriate metric such as those practiced in multiview stereo disparity detection techniques can be utilized.

In a number of embodiments, the cost functions for the other color channels (i.e. Cost_R(x, y, d) and Cost_B(x, y, d)) do not utilize a comparison that includes a pixel from the reference image as the basis of determining pixel correspondence. In several embodiments, the similarity of corresponding pixels are performed by calculating the aggregated difference between each unique pair of corresponding pixels in the set of cameras within the color channel. In the example of an array camera in which depth is determined using four Red cameras, R_A, R_B, R_C, and R_D, the cost can be determined as follows:

Cost_R(x,y,d)=|R_A(x_A,y_A)−R^B(x_B,y_B)|+|R_A(x_A,y_A)−R^C(x_C,y_D)|+|R_A(x_A,y_A)−R^D(x_D,y_D)|+|R_B(x_B,y_B)−R^C(x_C,y_D)|+|R_B(x_B,y_B)−R^D(x_D,y_D)|+|R_C(x_C,y_C)−R^C(x_D,y_D)|

where (x_A, y_A), (x_B, y_B), (x_C, y_C), and (x_D, y_D) are corresponding pixel locations determined based upon the disparity in each of the images R_A, R_B, R_C, and R_Drespectively at depth d.

The above metric can be referred to as the combination cost metric and can be applied within any color channel that does not contain the reference camera. In several embodiments, a combination metric can be utilized that does not include all combinations of unique pairs of corresponding pixels in the set of cameras within the color channel. In several embodiments, unique pairs of corresponding pixels from a subset of the images captured by an array camera can be utilized. When depths are determined for a virtual viewpoint, none of the spectral channels contain the “reference camera” and the combination cost metric can be applied in each of the spectral channels. Although the combination cost metric is shown above utilizing the L1 norm to determine the similarity between pixel intensity values, in other embodiments, the L2 norm, the pixel variance, truncated versions of the L1 and L2 norms and/or any block-based similarity measure based on rank, census, correlation, and/or any other appropriate metric such as those practiced in multiview stereo disparity detection techniques can be utilized.

Weighting factors (e.g. γ_G, γ_R, and γ_B) can be used to determine the contribution of each of the spectral channels to a depth estimate. The weights can be fixed or vary from pixel to pixel (i.e. spatially-varying) in the reference image. In many embodiments, a map of signal-to-noise ratio (SNR) can be generated with respect to the reference image using an SNR estimator. In several embodiments, the SNR estimator can determine SNR based upon a prior characterization of signal noise. Areas where the SNR response is high can indicate the presence of texture or high signal. Areas where the SNR estimate is low can indicate a textureless region, consisting almost entirely of noise. In certain situations, the data from the images might be noisy in certain spectral channels, but not in others. For example, an area may appear textureless in Green images, but have signal content in images captured by Red cameras. Therefore, the Green cameras will contribute little useful information to the cost function and may actually introduce noise into the depth estimation process, resulting in a less reliable depth estimate than if only Red or Blue cameras were included in the cost function. Therefore, an SNR map can be utilized to determine weightings to apply to each of the color channels. In several embodiments, if the SNR estimate in the reference image for a pixel (x, y) is low, meaning that the immediate region around pixel (x, y) is likely textureless and does not contain significant signal, then the weighting for the color channel containing the reference image should be reduced at the pixel (x, y).

In many embodiments, a stricter condition can also be used and/or used as an alternative in which the weighting for the spectral channel containing the reference image should be reduced at the pixel (x, y), when the SNR estimate in the reference image at a pixel (x, y) and for the radius of maximum parallax (along epipolar lines) in the reference image for all of the cameras show low SNR, then the weighting for the spectral channel containing the reference image should be reduced at the pixel (x, y). The radius of maximum parallax can be determined only with respect to pixels located along epipolar lines determined with respect to the other cameras in the camera array within the spectral channel. The stricter criterion acknowledges that though the SNR may be low at the pixel location (x, y) in the reference image, there may be content some distance away (less than a maximum parallax shift) from pixel (x, y) in another camera within the color channel containing the reference camera which could disqualify a candidate depth from being a likely match. Therefore, though the pixel location (x, y) may have low SNR, nearby content may still provide useful information to disqualifying certain depths as possibilities.

In a number of embodiments, strong SNR in the reference image may be used to reduce the weighting applied to the other color channels to save computation (i.e., fewer cameras must be searched). In addition, the SNR may be estimated for a camera in one of the other color channels to determine the weighting that should be applied to the color channel in the cost function. In many embodiments, the process of determining the SNR involves estimating SNR along the epipolar line which connects the pixel location (x, y) in the alternate view camera to the reference camera. Then, the epipolar line or line(s) may be searched for regions of high SNR. If high SNR contributions are found in the alternate view camera along the epipolar line, the weighting of the color channel to which the alternate view camera belongs can be set so that the color channel contributes to the cost metric for the pixel location (x, y) in the reference image. If, instead, along the epipolar line beginning at the pixel location (x, y), the alternate view image shows only low SNR, then the contribution of the color channel containing the alternate view image can be correspondingly reduced. In many embodiments, multiple cameras in each color channel are considered when determining the weighting to apply to the color channel when determining depth estimates. Although specific processes for estimating depth using information contained within color channels that do not include the reference camera are described above, any of a variety of processes can be utilized to determine depth estimates based upon information contained in multiple color channels as appropriate to the requirements of specific applications in accordance with embodiments of the invention.

Based upon the initial depth map, visibility V^i,Ref(x, y) can be updated based upon the computed depth map D(x, y) or based upon a filtered depth map F(D(x, y)) that is filtered using either a fixed coefficient filter(such as a Gaussian or a box filter), or adaptive or edge preserving filter such as (but not limited to) a joint bilateral filter that uses the reference image as guidance. A variety of techniques can be utilized for determining whether a pixel in an image captured from a reference viewpoint is occluded in another image. In a number of embodiments, an initial depth map is utilized to identify pixels that may be occluded. Information concerning foreground objects can be utilized to identify zones in which occlusions are likely to occur. In addition, any of a variety of additional characteristics of an initial depth map can be utilized to detect occlusions including (but not limited to) unusual depth transitions within the depth map and/or pixel depth values that are inconsistent with local pixel depth values.

Although much of the discussion above with respect to FIG. 8 relates to generation of depth estimates using image data in a single spectral channel or in the Red, Green, and Blue color channels, the techniques described above are equally appropriate with respect to any of a variety of spectral channels (the terms color channel and spectral channels being used hererin interchangeably). An aggregated cost function similar to those described above can be utilized including a cost term determined with respect to each spectral channel using any of the techniques described above. In addition, the cost function can include cost terms that weight the similarity of pixels across spectral channels using techniques similar to the those described above. The accuracy of resulting depth estimates can depend upon the extent to which the depth estimate utilizes pixels from an image in which a pixel location (x, y) in a reference image is occluded. Techniques for improving the accuracy of depth estimates when a pixel location (x, y) in an image from a reference viewpoint are occluded within one or more images in the set of images are discussed further below.

Generating a Depth Map Accounting for Occlusions Using Subsets of Images

Patterns of visibility occur in natural scenes. Therefore, the pattern V^i,Ref(x, y) is typically not arbitrary. A strong prior exists concerning the cameras in which a pixel in the reference image is visible. In many embodiments, a depth map can be determined in a manner that also provides an estimate for V^i,Ref(x, y) in which there is a low likelihood that cameras in which pixel (x, y) is occluded are incorrectly identified. Based upon the estimate for V^i,Ref(x, y), a depth estimate can be determined without the need to iterate to refine the visibility of V^i,Ref(x, y). Alternatively, additional iterations can be performed to refine the depth map by including additional cameras based upon visibility information obtained using a reliable depth estimate. By obtaining a better initial depth map, however, the iterative process is likely to converge more rapidly.

In many embodiments, the process of generating a depth map involves determining depth using multiple clusters or subsets of cameras that each correspond to a different pattern of visibility within the scene and selecting the depth estimate as the depth determined using the subset of images captured by the cluster of cameras in which corresponding pixels have the highest similarity. A process for determining a depth for a pixel (x, y) using images captured by groups of cameras representing subsets of a camera array in accordance with an embodiment of the invention is illustrated in FIG. 8A. The process 850 includes selecting (852) an initial group of cameras corresponding to a specific pattern of visibility within the scene and determining (854) a candidate depth estimate using the subset of image data captured by the group of cameras is generated based upon the depth at which corresponding pixels within the group of cameras have the highest similarity. In several embodiments, the process is similar to that outlined above with respect to FIG. 8 with the exception that the costs are determined for each subset of images and the lowest cost depth estimate generated using the subsets is selected as a candidate depth estimate for a relevant pixel location. As is discussed further below, the similarity of corresponding pixels within the subset of images at the candidate depth estimate is then compared against the similarity of corresponding pixels within other subsets of images at other candidate depth estimates to determine the candidate depth estimate that is most reliable within the context of a given application. In many embodiments, the candidate depth estimate that is selected as the depth estimate for the pixel location is determined based upon the subset of images having the most similar corresponding pixels at the candidate depth estimate.

In many embodiments, the group of cameras can include cameras from multiple color channels and a cost metric weighting similarity in each color channel is utilized to estimate the depth of pixel (x, y) in the reference image using the group of cameras. The process then iterates (856, 858, 854) through a plurality of different groups of cameras that each correspond to different patterns of visibility within the scene until a depth estimate is determined for each of the groups of cameras. The depth estimate for pixel location (x, y) in an image from the reference viewpoint can be obtained by selecting the depth estimate from the group of cameras in which the corresponding pixels at the estimated depth have the highest similarity. As noted above, similarity of pixels can be determined using a cost function that weights similarity of pixels in multiple spectral channels and/or across spectral channels. The subset of images in which the corresponding pixels at the estimated depth have the highest similarity corresponds to a specific pattern of visibility and provides an initial estimate of V^i,Ref(x, y) that has a low likelihood of incorrectly identifying that the pixel location (x, y) in an image from the reference viewpoint is visible in a camera in which it is occluded.

Although the discussion provided above is presented in the context of performing searches with respect to each group of cameras with respect to pixel locations (x, y) in the reference image, depth maps can be separately estimated for the pixels in the reference image using each group of cameras corresponding to a specific visibility pattern. In this way, the cost metrics determined for pixel location (x, y) using a particular group of cameras can be filtered to smooth out noise in the cost function. Therefore, the depth estimate for the pixel location (x, y) can be selected using the depth estimate for the group of cameras having the smallest filtered cost metric. The filters applied to the cost metrics determined using a specific group of cameras can be fixed, or can be spatially adaptive. The specific filters utilized can be determined based upon the requirements of specific applications in accordance with embodiments of the invention. Following selection of the depth estimates for the pixels in the reference image, additional filtering can be performed to further smooth noise in the initial depth map.

The clusters or groupings of cameras utilized to detect particular patterns of visibility within a scene can depend upon the numbers of cameras in an array camera, the camera that is selected as the reference camera, and/or the distribution of cameras from different color channels within the array. Eight groups of cameras in a 5×5 array corresponding to different patterns of visibility that are likely to be present within a scene with respect to pixels in a reference camera located at the center of the array are shown in FIGS. 8B-8I. The eight groups are generated by rotating and flipping the same group template, which includes 12 cameras. Depending upon the orientation of the group template, this includes seven Green cameras, and either three Red cameras and 2 Blue cameras, or 3 Blue cameras and 2 Red cameras. As noted above, the group template can be utilized to select groups of cameras when estimating depth for a pixel (x, y) in a reference Green camera located at the center of the 5×5 array. The depth of the pixel location (x, y) can be estimated by selecting the depth estimate from the group of cameras in which the corresponding pixels in the three color channels at the estimated depth have the highest similarity.

Although specific groups are shown in FIGS. 8B-8I for selecting groups of cameras, any of a variety of templates corresponding to common visibility patterns within a scene can be utilized. For example, groups of cameras along a single epipolar line can be selected as described below with reference to FIG. 10. In many embodiments the groups are selected so that the same number of cameras in the color channel containing the reference camera appears in each group of cameras. In addition, the groups can be determined so that there are at least two cameras in the other color channels in each group of cameras. If the groups include an uneven number of cameras, then the cost metric with respect to different sized groups may be biased and the bias can be accounted for through normalization. In many embodiments, the groups of cameras are selected to provide baseline diversity (contrast with the groups illustrated in FIG. 10 that are selected based upon sharing a common baseline). The greater the number of different radial epipolar lines on which depth searches are performed, the more likely one of the images captured by a group of cameras will contain information that can assist in identifying incorrect depths. In several embodiments, the group of cameras are selected so that the central angle of the sector defined by the epipolar lines of each group is the same.

In smaller array cameras, such as (but not limited to) 4×4 array cameras, and depending upon the pattern of color filters utilized within the array it may not be possible to select groups of cameras that contain the same number of cameras in each color channel. In several embodiments, a color filter pattern is utilized so that groups of cameras corresponding to common visibility patterns contain the same number of cameras in a single color channel. In this way, image data captured within the color channel can be utilized to estimate depths for occluded or otherwise mismatched pixels by comparing the filtered costs of depth estimates obtained using the different subgroups. Four groups of cameras in a 4×4 array corresponding to different patterns of visibility that are likely to be present within a scene with respect to pixels in a reference camera located at the center of the array are shown in FIGS. 8J-8M. The four groups are generated by rotating and flipping the same group template, which includes 4 cameras. In the illustrated embodiment, there are three color channels: Red, Green, and Blue. Each group of cameras includes three Green cameras and one Blue or Red camera. Due to the presence of a single Red or Blue camera, in several embodiments depth estimates are determined using the image data captured in the Green color channel. For a given pixel location (x, y), the image data in the Red or Blue camera of the group that yielded the most reliable depth estimate (i.e. the lowest cost) is assumed visible in an image from the reference viewpoint. Accordingly, the pixel value in the pixel location in the Red or Blue image corresponding to the pixel location (x, y) in the image from the reference viewpoint can be utilized as a reference pixel for the purpose of calculating the visibility of corresponding pixels in other images within the Red or Blue color channels. For each of the groups shown in FIGS. 8J-8M, one of the spectral channels is excluded from the group. The use of a π filter group can, however, be utilized to identify which of the images in the excluded color channel should be used as a reference image for the purpose of determining the visibility of pixels in the excluded color channel. When the viewpoint of a central camera of a π camera group is utilized as a reference viewpoint, two images in the excluded color channel will have been captured from viewpoints on opposite sides of the reference viewpoint. In typical natural scenes, a pixel location within an image from the reference viewpoint is likely to be visible in at least one of images captured by the adjacent cameras in the excluded color channel. In order to determine which (if any) of the images is most likely to contain a corresponding pixel to a pixel location in an image from the reference viewpoint that is visible, the similarity of the corresponding pixels within the two subgroups that contain the two images. Assuming that the corresponding pixels in at least one of the subgroups achieves a threshold level of similarity, then the image in the subgroup in which the corresponding pixels have the highest level of similarity can be selected as a reference image for the excluded color channel. In this way, the visibility of corresponding pixels in any image within the excluded color channel can be determined based upon its similarity with the corresponding pixel from the reference image for the excluded color channel. Where neither image captured from the adjacent viewpoints to the reference viewpoint reliably contain a visible pixel corresponding to a pixel location within an image from the reference viewpoint, then alternative techniques can be utilized to identify an image within the excluded color channel that contains a corresponding pixel that is visible and/or to determine the visibility of pixels within individual images from the excluded color channel. In several embodiments, visibility can be determined by performing epipolar line searches in the manner described herein. In a number of embodiments, cross-channel similarity measures can be used to determine a corresponding pixel within the images in an excluded color channel that can be utilized as a reference pixel. In several embodiments, the image in which the neighborhood surrounding the corresponding pixel exhibits the highest cross-correlation (or any other appropriate cross-channel similarity measure) with the reference image can be utilized as a reference pixel for the purpose of determining the visibility of the other corresponding pixels in the images in the excluded color channel. A similar approach can be taken with array cameras including different sized camera arrays.

In many embodiments, the groups of cameras used to estimate depth for a pixel in a reference camera correspond to pairs of cameras within the array. Through use of thresholds, cameras in which a pixel (x, y) is likely to be visible can be identified and an initial visibility map V^i,Ref(x, y) constructed. The threshold can be a hard threshold, and/or a threshold based upon the SNR in the reference image. The same is also true of larger groups of cameras such as those illustrated in FIGS. 8B-8I. Thresholds can be used to detect the presence of one or more outlier pixels within a set of corresponding pixels. Groups of cameras that are found to not contain outliers can then be combined, and the depth recalculated with this new combined set, to improve the precision of depth estimates. In a similar manner, an initial depth map can be constructed by initially assuming that all cameras are visible in the visibility map V^i,Ref(x, y). Any of a variety of techniques can be utilized to determine whether a pixel (x, y) is occluded in at least one of the cameras in the camera array including (but not limited to) use of thresholds in the manner outlined above, and/or performing occlusion searches along epipolar lines. The depth of pixels that are likely to be occluded in at least one of the cameras in the array can then be estimated again using a process similar to the process outlined above with respect to FIG. 8A. In this way, the cameras in which the pixel is occluded can be rapidly identified.

Although a variety of processes for determining depth maps and visibility maps when estimating depth for pixels within a reference image are described above with reference to FIGS. 8A-8I, any of a variety of processes can be utilized to determine an initial depth map and/or visibility map in accordance with the requirements of specific applications in accordance with embodiments of the invention. Processes for identifying occluded pixels including processes that involve performing searches for occluded pixels along epipolar lines in accordance with embodiments of the invention are discussed further below.

Identifying Occluded Pixels

A challenge associated with identifying occluded pixels from an initial depth map is that depths within the depth map that are determined using occluded pixels may be incorrect. The most reliable depth estimates are those of the objects in the scene that are closest to the camera. These are the objects that also give rise to the greatest disparity and can potentially result in the largest number of pixel occlusions (depending upon the distribution of objects within the scene. Therefore, a determination can be made concerning whether a pixel visible in the reference image is occluded in a second image by searching for an occluding pixel in the reference image along the baseline vector. An occluding pixel is a pixel that is sufficiently close to the camera that the disparity observed from the perspective of the second image would be sufficiently large as to occlude the pixel visible in the reference image. The search for occluding pixels can be understood with reference to FIG. 9. An image captured from a reference viewpoint 900 is shown in FIG. 9. In order to determine the visibility of pixel (x₁, y₁) with a depth d₁in a second image captured by the array camera, a search is conducted along a line 902 parallel to the baseline between the camera that captured the reference image and the camera that captured the second image. The pixel (x₁, y₁) will be occluded, when a pixel (x₂, y₂) is closer to the camera (i.e. located at a depth d2<d₁). Therefore, all pixels (x₂, y₂) where d2≥d₁can be disregarded. Where the scene-dependent geometric shifts of each pixel (s2 and S₁respectively) are greater than the distance between the two pixels along the line 902 parallel to the baseline vector, then pixel (x₂, y₂) will also occlude the pixel (x₁, y₁). Stated another way, pixel (x₂, y₂) occludes pixel (x₁, y₁) when

|s₂−s₁−√{square root over ((x₂−x₁)²+(y₂−y₁)²)}|≤threshold

In several embodiments, the threshold in the above expression can be determined as the inverse of the super-resolution factor used during subsequent super-resolution processing (e.g. when the super-resolution process attempts to achieve an increase in resolution of a factor of 3, then a threshold of ⅓ of a pixel is appropriate). When no pixel can be found satisfying the above expression, then the pixel (x₁, y₁) can be concluded to be visible in the second image. Where the above expression is satisfied for some pixel (x₂, y₂), then the pixel (x₁, y₁) can be considered to be occluded. Therefore, the process illustrated in FIG. 8 can be repeated to create an updated depth estimate for the pixel (x, y) disregarding the second image (and any other images in which the pixel is occluded). As can readily be appreciated, incorrect depths in the initial depth estimate can result in visible pixels being disregarded in future iterations of the process for determining a depth map. Using adaptive support to provide depths that are photometrically consistent decreases the likelihood that noise will result in the detection of false pixel occlusions, which eliminate useful information from subsequent process iterations. In many embodiments, the decision to designate a pixel as being occluded considers the similarity of the pixels and the confidence of the estimated depths of the pixels (x₁, y₁) and (x₂, y₂). As is discussed further below, a confidence map can be generated with respect to the depth map of the reference image and the confidence map indicates the reliability of a specific depth map. Therefore, a possible occlusion identified using the expression provided above in which scene-dependent geometric shifts of each pixel (s2 and S₁respectively) are based upon unreliable depth estimates can be disregarded. Similarly, a possible occlusion involving pixels where the difference in the intensities of the pixels is below a predetermined threshold can be disregarded. Where the pixels values are sufficiently similar, a depth estimate generated in reliance on the pixel will largely be unaffected even if the pixel is occluded. In other embodiments, a variety of other considerations can be taken into account when determining whether to indicate a pixel as occluded in a visibility map as appropriate to the requirements of specific applications.

The search discussed above with respect to FIG. 9 can be conducted along the epipolar line corresponding to every camera utilized to perform the depth estimation. When the captured images are rectified correctly, the search can be simplified by aligning the baselines of the cameras relative to the rows and columns of the pixels captured from the reference viewpoint (see discussion of rectification above). The search for occluding pixels need not be performed with respect to every pixel in the reference image. Instead, an initial search can be conducted for pixels that are likely to be occluded in one or more images captured by the array camera including (but not limited to) pixels proximate depth transitions. Searches can then be performed for occluding pixels with respect to pixels that are considered likely to be occluded. In addition, the search for occluding pixels can be performed more efficiently by computing the projections of pixels based upon distance in advance (the projections indicate the depth at which adjacent pixels along the baseline will be occluded). In addition, once a pixel is determined to be occluded the process for detecting occlusion of adjacent pixels can be simplified by utilizing the projection of the occluding pixel. In other embodiments, any of a variety of techniques can be utilized to more efficiently locate occluding and occluded pixels including (but not limited to) rendering the image based on depth in accordance with embodiments of the invention.

As noted above, including occluded pixels in the determination of the initial depth map can introduce errors in the resulting pixel depths. When occlusions are detected using a process similar to any of the processes outlined above and the depth map updated, errors in the depth map are removed. As errors in the depth map are removed, more accurate predictions can be made as to the pixels that are occluded. Accordingly, the process of detecting occlusions can be performed iteratively until a stopping criterion is reached. In a number of embodiments, the stopping criterion can be (but is not limited to) the number of occluded pixels detected in a specific iteration (that were not previously detected) falling below a predetermined number and/or the completion of a predetermined number of iterations.

Referring back to FIG. 6, a process for generating and refining a depth map in accordance with an embodiment of the invention is illustrated. In many instances, the processes for determining (602) the initial depth map will have a tendency to overestimate the disparity of occluded pixels. This has the effect of pushing occluded pixels into the foreground. Therefore, in a number of embodiments, a pixel that occludes another pixel can also be treated like an occluded pixel for the purposes of updating (606) the depth map. In this way, the depth of background pixels that have incorrect initial depth measurements can be detected and updated. As is discussed further below, the visibility of pixels that are ignored can be separately determined (610) once the depth map is finalized. In a number of embodiments, processes such as the process 600 illustrated in FIG. 6 also involve application of a bilateral filter to the depth map to help stabilize the depth map as the process iterates.

The accuracy of a depth estimate typically increases with the number of images in the light field captured by the array camera utilized in generating the depth estimate. Considering a smaller number of images can, however, reduce the computational complexity of obtaining a depth estimate. When a depth transition occurs, occlusions will typically occur in images captured on one side of the reference viewpoint. Therefore, a search for occluding pixels similar to the search described above can be utilized to determine whether a pixel is occluded in a group of images captured to one side of the reference camera. If the search indicates that no occlusions occurred, then the depth of the pixel can be determined using that group of images and the depth map updated.

A 5×5 array camera that can be utilized to construct a depth map using the Green cameras in the array is illustrated in FIG. 10. The array camera 1010 includes a central reference Green camera (1012). The remaining Green cameras in the array can be utilized to form eight radial groups of three Green cameras for the purpose of determining depth of pixels that are occluded in at least one of the images captured by the Green cameras. Although radial groups are illustrated in FIG. 10, groups of cameras in each of four quadrants surrounding the reference viewpoint can also be utilized. A group may be as small as a pair of cameras, one of which is the camera that captures an image from the reference viewpoint. In many embodiments, groups such as those discussed above with reference to FIGS. 8A-8I can also be utilized.

Although specific processes for detecting pixel occlusions are discussed above with respect to FIGS. 6, 8A-8I, 9, and 10, any of a variety of processes can be utilized to generate a depth map including (but not limited to) processes that reduce the computational complexity of detecting occlusions in accordance with embodiments of the invention. In several embodiments, the process of determining the depth of each pixel can involve searching based upon both hypothesized depths and hypothesized visibility and the combination of depth and visibility that yields the highest pixel correspondence selected as the most likely depth and set of occlusions. Visibility determined in this way can be confirmed by using the approach described above for detecting occluding pixels.

In many embodiments, information concerning the visibility of pixels in the captured images from the reference viewpoint is utilized in processes including (but not limited to) super-resolution processing. Processes for determining the visibility of pixels in images captured by an array camera from a reference viewpoint using a depth map in accordance with embodiments of the invention are discussed further below.

Determining Visibility of Pixels

Pixel visibility can be utilized in determining a depth map and in a variety of other applications including (but not limited to) super-resolution processing. In several embodiments, a depth map for an image captured from a reference viewpoint generated using a process similar to the processes outlined above is utilized to generate a visibility map for the other images captured by an array camera (i.e. the images captured from alternate viewpoints). In several embodiments, visibility maps can be determined with respect to whether pixels in alternate view images are visible from the reference viewpoint and whether a pixel in a first alternate view image is visible any of the other alternate view images. In a number of embodiments, the process of determining visibility maps for the images captured within a single color channel involves comparing the photometric similarity of pixels corresponding to a pixel in the image captured from the reference viewpoint. Pixels that are considered to have a predetermined level of similarity can be considered visible and pixels that are below a threshold level of similarity can be considered occluded. The threshold utilized to determine the photometric similarity of corresponding pixels can adapt based upon the similarity of the corresponding pixels. In several embodiments, the threshold is determined as a function of the photometric distance between the pixel from the reference image and the corresponding pixel that is most similar to the pixel from the reference image. When an array captures image data in multiple color channels, the visibility of pixels in a single color channel can be utilized to determine the visibility of pixels in other channels.

A process for determining the visibility of corresponding pixels to a pixel within a reference image in accordance with an embodiment of the invention is illustrated in FIG. 11. The process 1100 includes selecting (1102) a pixel from an image captured from the reference viewpoint. A depth map generated using a process similar to the processes described above can be utilized to determine the depth of the selected pixel. Based upon the depth of the selected pixel, the locations of the corresponding pixels in each image captured by the array camera can be determined (1104). The similarity of the corresponding pixels to the selected pixel from the reference image can be utilized to determine the visibility of the corresponding pixels. In a number of embodiments, the photometric distance of the pixels is utilized as a measure of similarity and a threshold used to determine pixels that are likely visible and pixels that are likely occluded. In many embodiments, the threshold varies depending upon the characteristics of the pixels that are compared. In certain embodiments, the threshold value used to determine similarity of corresponding pixels is determined using the intensity of a reference pixel, as the average of a subset of pixel intensity values for corresponding pixels that are found to be visible in a given color channel. In several embodiments, the specific corresponding pixel intensities that are averaged can depend upon corresponding camera baseline and confidence values for the pixels (if available) and associated matching costs for the pixels. In several embodiments, the threshold is determined (1106) as a function of the photometric distance between the selected pixel from the reference image and the corresponding pixel that is photometrically closest to the selected pixel. In a number of embodiments, the threshold is based upon the pixel intensity of the corresponding pixel in the reference image and/or the intensity of the pixel in the alternate view image. In certain embodiments, the threshold is determined using an SNR model for the captured image. In a number of embodiments, the photometric distance of the selected pixel and the closest corresponding pixel is scaled and/or an offset is added to obtain an appropriate threshold. In other embodiments, any of a variety of techniques can be utilized for determining a threshold for determining the visibility of a corresponding pixel including (but not limited to) using a fixed threshold.

The selected pixel from the reference image and the corresponding pixels are compared (1108) and the threshold used to determine (1110) the similarity of the pixels. When the photometric distance of the selected pixel from the reference image and one of the corresponding pixels is less than the threshold, then the corresponding pixel is determined (1112) to be visible. When the photometric distance of the selected pixel from the reference image and one of the corresponding pixels exceeds the threshold, then the corresponding pixel is determined (1114) to be occluded.

The process (1100) illustrated in FIG. 11 can be repeated for a subset or all of the pixels in the reference image to generate visibility maps for the corresponding pixels in other images captured by the array camera. In embodiments where all of the pixels in the camera that captures an image from the reference viewpoint are in a single color channel, then processes similar to the process illustrated in FIG. 11 effectively generate visibility for images captured within a single color channel. When the array camera also includes images captured in other color channels, the visibility of pixels in images that are not in the same color channel as the reference image can be determined by performing similar comparisons to those described above with respect to corresponding pixels from images within the spectral channel that known or are likely visible in the reference viewpoint. In other embodiments, the camera that captures the reference image employs a Bayer filter (or another appropriate filter pattern) that enables the capture of image data in multiple color channels from the reference viewpoint. In which case, processes similar to those illustrated in FIG. 11 can be utilized to generate visibility information for images in multiple color channels, where the process involves demosaicing the Bayer filter pattern to obtain a Red and Blue pixel value at each position in the reference view.

Although specific processes are discussed above in the context of FIG. 11, any of a variety of processes can be utilized to determine the visibility of pixels in images captured by an array camera in accordance with embodiments of the invention including (but not limited to) processes that iteratively refine visibility information as part of the process of generating a depth map. In many embodiments, the process of generating a depth map and a visibility map also includes generating a confidence map that can provide information concerning the reliability of the estimated depths within the confidence map. Processes for determining confidence maps in accordance with embodiments of the invention are discussed further below.

Confidence Maps

Processes for generating depth maps, including those described above, can result in regions within a depth map in which depth estimates are unreliable. A textureless region of an image synthesized using image data captured by an array camera is illustrated in FIG. 18A and the depth map for the image generated using processes similar to those described above in accordance with embodiments of the invention is illustrated in FIG. 18B. In the textureless region 1800, the cost metric used to determine depth in the manner described above is noisy and though a minimum cost (i.e., at a depth where the cameras show maximum correspondence) can be found, the result depends largely on sensor and photon noise and not any significant underlying signal. The pixel correspondence in such regions (as measured by the cost metric) is indeed greatest at the depth shown, but the resulting depth shown is not the correct depth of the object. In contrast, in the edge region 1802, the cost function shows a depth at which the cost is minimized with great certainty. There, the edge signal is much greater than the noise and so the disparity corresponding to the actual depth of the object can be detected with higher confidence.

Depth errors are not limited to textureless regions, however. Another class of depth errors occurs in zones of occlusion, where certain background regions are visible in some cameras, and not others. This sort of error can be seen along the rim of the tire, where the foreground region intersects the background region. In the depth map, there appear to be regions 1804 containing incorrect depth estimates at the interface between the foreground to background.

When generating a depth map, a confidence map can be generated, which describes numerically, through some measure, the reliability of different depth estimates within the depth map. The confidence map can be used by later processing stages, or by third-party applications, to determine which regions of the depth map can be most relied upon for further processing. For example, a depth measurement utility can allow a user to click on regions of an image synthesized using a super-resolution process to obtain the depth of a particular pixel. If the user clicks on a pixel of the image, the confidence map can be checked before returning a result. If the confidence of the depth at the requested location is low, then the user interface can avoid reporting the depth at that location. If the confidence map is high, then the user interface can safely report the depth at the selected location. That is, the confidence map can be used to qualify the results for particular applications and not return an inaccurate value where the depth map is known to contain errors.

The confidence map can be encoded in a variety of ways, and there may be multiple classes or axes of confidence encoded within a single confidence map. A variety of confidence measures that can be utilized to encode a confidence metric in a confidence map in accordance with embodiments of the invention are discussed below.

In several embodiments, a confidence map is encoded with a confidence measure based on whether the depth estimate of a pixel is within a textureless region within an image. As noted above, textureless regions can be detected based upon SNR in the region surrounding a given pixel. If the SNR is above a known tunable threshold, the region may be marked textureless in a binary fashion. Alternatively, the SNR itself (without being thresholded) may be remapped linearly or non-linearly and used to serve as a continuous indicator of confidence.

In many embodiments, a gradient edge map (e.g. Prewitt or Canny) may be calculated and used as a confidence metric within a confidence map. Since edges and texture typically have high confidence, gradient maps and SNR maps often provide a good coarse measure of confidence for a depth map.

In a number of embodiments, the confidence map can be encoded based upon whether a particular region is low confidence due to occlusions and/or mismatch and conflicting measurements between cameras (i.e. there may be texture in a region that is detected by an SNR map, but there may still be a depth error occurring because in that area the parallax detection process detects and/or is unable to resolve occlusions or otherwise confidently estimate depth for any other reason).

In a number of embodiments, a confidence metric is determined as the “best cost” achieved during the depth estimation process, or a linear, non-linear, or quantized remapping of this quantity. If the minimum cost achieved during depth estimation is above a selected threshold, the region may be marked low confidence on the basis of a lack of correspondence between multiple views at the estimated depth.

In a number of embodiments, occlusions may be detected by comparing the best costs between different subgroups or depth maps generated between different groups of cameras and if the difference between best costs is greater than a threshold, marking low confidence for the pixel locations where the costs found in subsets of images differ.

In a number of embodiments, the confidence map can be encoded based upon whether a particular region is low confidence due to adaptive processing steps in the depth estimation process itself. For example, if fewer depths were searched in a particular region, this information can be encoded numerically in the confidence map to highlight that the depth is less reliable. In many embodiments, certain regions of the depth map are actually searched through correspondence search, and other regions of the depth map, the depths are interpolated based on results from a depth search on a sparser set of points. In such a case, the pixels with interpolated depths are given lower confidence values than pixels where the correspondences have actually been searched.

In several embodiments, the expected precision of a depth measurement can also be coded in the confidence map as a numerical quantity. In many instances, depths farther away from the camera are measured with greater error and so should be less trusted. In such cases the confidence map can mark such areas as involving lower confidence depth estimates. The confidence can be proportional to the expected depth error between adjacent search positions at that depth. In certain embodiments, the disparity corresponding to the minimum distance supported by the parallax search (i.e. this will be the maximum disparity observed between any two cameras for all supported depths) can be determined. Once the maximum disparity is found, the search will search a number of disparities up to the maximum disparity. In many embodiments the maximum disparity is D low resolution pixels and the number of depths searched is N. The number of pixels of disparity between adjacent search positions along an epipolar line is D/(N−1). The depth in meters for any one of the N disparities that is searched (indexed by n<N) are d_n=C/(n*D/(N−1)) where C is a constant that incorporates information about the baselines and focal lengths of the individual low resolution cameras having the maximum baselines. If, at a particular point in the depth map, the depth is d_n, then the expected measurement error at d_nis e_n=1/2*max(|d_n−d_n+1|, |d_n−d_n−1|). Basically, the expected measurement error is the error due to searching a fixed, discrete number of points along the epipolar line. The error value itself may be mapped linearly or non-linearly to provide a confidence value with respect to a depth estimate at a particular pixel location within the depth map. The higher the error, the less confident the depth estimate. In several embodiments, disparities searched are not spaced equally, but may be coarser in some regions than others. Accordingly, error can be calculated similarly between adjacent indices (whatever the distribution of search positions along the epipolar line) so that the confidence map reflects the calculated error in depth. In a number of embodiments, the confidence map reflects the maximum error in estimated disparity (not depth), the inverse of the quantity listed above. This may be more useful for applications such as image fusion, whereas the depth error would be more useful for measurement applications that occur in real world coordinates (such as, but not limited to, 3D modeling).

In a number of embodiments, the confidence map can mark regions as low confidence due to known or detected lens or sensor defects that make the depth map unreliable. Defective pixels (a term that includes both defective pixels on the sensor as well as pixels affected by lens defects) may be detected during image processing or offline in a pre-processing calibration step. In one embodiment, if the total number of pixel defects within a radius of a particular pixel (x, y) in any reference camera exceeds a pre-set threshold, the pixel (x, y) is marked low confidence in the depth map due to pixel defects. In another embodiment, a similar confidence value may be defined where confidence increases proportionally (not as a hard threshold) as a function of the number of pixel defects in any camera within a radius and/or region surrounding the reference pixel (x, y) (or pixels known to be affected by lens defects). In another embodiment, the confidence may be a pre-calculated value for specific configurations of defective pixels that are known to create errors of varying severity. In several embodiments the confidence value for the defect takes into account the local image content in calculating the confidence value.

In several embodiments, the confidence map may mark as low confidence areas where the reference image appears textureless but in other color channels there is textured content. In one embodiment, a pixel (x, y) in the reference camera is searched within a local radius and/or region. If within this local radius and/or region the content is considered to be textureless in Green, but if the same search within another (perhaps larger) local radius/region for Red and/or Blue cameras turns up sufficient texture in images within the Red and/or Blue color channels, the region can be marked as lower confidence due to the fact that the Green color channel is less useful in this detection scenario and depth results will be less reliable (though often correct).

In a number of embodiments the confidence map numerically encodes as low confidence scenarios in which there is photometric mismatch due to lens flare or features in the scene. In many embodiments, the local statistics (mean and variance) of a region of interest around the pixel location (x, y) may be calculated and compared to the local statistics of a similar region in another camera. In this way, local image statistics between two neighborhoods in the same general region of multiple images can be compared to detect possible lens flare, the presence of which reduces confidence. In other embodiments, any of a variety of techniques can be utilized to compare neighborhoods in multiple images to detect lens flare. The resulting confidence measure can be a scaled or non-linearly mapped function of the difference between the mean and variance of the regions across images captured by multiple cameras. The greater the mean and variance differences between the images captured by the cameras, the less likely the depth is reliable and the lower the confidence will be.

In a number of embodiments the confidence map adapts to lighting conditions to reduce the confidence when the image is very dark and noisy. In certain embodiments, the sensor gain at the time the image was taken will result in an absolute reduction in confidence for all depths. In another embodiment, the analog gain and exposure time of the sensor are taken into account when computing a SNR ratio, or thresholds for edge gradients at different levels of noise. In many embodiments, the analog gains and exposure times for different focal planes can be utilized for different cameras in a camera array used to capture a set of images.

To detect regions which are of low confidence due to occlusions, the best-achieved cost metric may be stored during the parallax search. For regions which show significant occlusion, the best achieved cost metric usually greatly exceeds the minimum value that would occur if there were no occlusion and all cameras saw the same content. Accordingly, a threshold can be applied to the best achieved costs. If the best achieved cost exceeds the threshold, then the region is marked as likely to have been occluded or to have had some other problem (such as photometric non-uniformity).

For certain similarity metrics, the low-confidence threshold for occlusion can be corrected for the mean intensity of the region as well as the noise statistics of the sensor. In many embodiments, the mean of the region in the reference image is calculated using a spatial box N x N averaging filter centered around the pixel of interest. In other embodiments, once the mean is known, the noise statistics for the color channel containing the reference camera may be calculated by a table lookup which relates a particular mean at a particular exposure and gain to a desired threshold. If the best matching value greatly exceeds the expected noise, then the pixel can be marked as unreliable due to possible occlusion.

A non-binary measure of confidence due to general mismatch can be obtained using the following formula:

Confidence(x,y)=F(Cost_min(x,y),Cost^d(x,y),I(x,y)^cam,Sensor,Camera intrinsics)

where Cost_min(x, y) is the minimum cost of a disparity search over the desired depth range,

Cost^d(x, y) denotes that cost data from any depth or depths (beside the minimum depth),

I(x, y)cam image data captured by any camera can be utilized to augment the confidence;

Sensor is the sensor prior, which can include known properties of the sensor, such as (but not limited to) noise statistics or characterization, defective pixels, properties of the sensor affecting any captured images (such as gain or exposure),

Camera intrinsics is the camera intrinsic, which specifies elements intrinsic to the camera and camera array that can impact confidence including (but not limited to) the baseline separation between cameras in the array (affects precision of depth measurements), and the arrangement of the color filters (affects performance in the occlusion zones in certain scenarios).

In several embodiments, Confidence(x, y) may make use of values neighboring pixel location (x, y) (i.e. spatial neighborhoods) for all the arguments.

In many embodiments, a confidence map can be encoded based upon factors including (but not limited to) one or more of the above factors. Each factor may be encoded in a binary fashion or may be represented as a range of (quantized) degrees of confidence, or may be non-quantized ranges or derivatives thereof. For example, the confidence along a particular axis may be represented not as a single bit, but as multiple bits which represent the level of confidence that a region is textureless. In certain embodiments the confidence along a particular axis may be represented as a continuous or approximately continuous (i.e. floating point) quantity. Other factors considered when determining confidence can be determined using any of a variety of ranges of degrees of confidence as appropriate to the requirements of specific applications in accordance with embodiments of the invention. In several embodiments, an arbitrary number of confidence codes or values are included in a confidence map for a particular pixel where one may specify any or all of these conditions. Specific confidence metrics are discussed further below.

In a particular embodiment where only the minimum cost is considered and noise statistics of the sensor follow a linear model, a simplified form may be used:

$Confidence (x, y) = a \times \frac{{Cost}_{\min} (x, y)}{Avg (x, y)} + offset$

- a and offset are empirically chosen scale and offset factors used to adjust the confidence with prior information about the gain and noise statistics of the sensor.

In this case, higher values would indicate lower confidence, and it would be up to the image processing pipeline to determine how to threshold the results.

In general, the confidence map provides metadata describing the depth estimates contained within the depth map that quantifies numerically the accuracy of detected depths in the image. In many embodiments, the confidence map may be provided in an external delivery format along with the depth map for use with the depth map in applications including (but not limited to) machine vision, gesture recognition, post capture image refocusing, real-time applications, image special effects, super-resolution, or other applications. An example of the manner in which a confidence map can be utilized in a depth estimation process to filter a depth map in accordance with an embodiment of the invention is illustrated in FIGS. 18C-18H. Turning first to FIG. 18C is an image of a scene containing objects at different depths synthesized using a super-resolution process from multiple images captured in different color channels (specifically Red, Green and Blue color channels). A depth map generated from the reference viewpoint using processes similar to those outlined above is illustrated in FIG. 18D. As can be readily appreciated, the depth map is noisy. A confidence map generated using any of a variety of the metrics outlined above can be generated as part of the process of generating a depth map. A binary confidence map generated by thresholding SNR in accordance with an embodiment of the invention is illustrated in FIG. 18E. An 8-bit confidence map generated based upon SNR in accordance with an embodiment of the invention is illustrated in FIG. 18F. A binary confidence map generated by combining a confidence factor generated by thresholding SNR and a confidence factor generated by thresholding the number of corresponding pixels that are occluded in accordance with an embodiment of the invention is illustrated in FIG. 18G. In several embodiments, the confidence map can be utilized to filter the depth map. A depth map that is filtered based upon a binary confidence map generated by thresholding SNR in accordance with an embodiment of the invention is illustrated in FIG. 18H. Comparing the depth maps shown in FIGS. 18D and 18E reveals the value of the use of the confidence map in interpreting depth information generated using any depth estimation process. Although a specific confidence metric and filtering process are described above with reference to FIGS. 18C-18H, any of a variety of confidence metrics can be utilized in the filtering and/or additional processing of depth estimates and/or depth maps in accordance with embodiments of the invention. The generation of confidence maps and the use of confidence maps to filter depth maps in accordance with embodiments of the invention is further illustrated in the close up images shown in FIGS. 18I-18L. With specific regard to FIG. 18I, a close up image of an object synthesized from a light field of images captured in Red, Green, and Blue color channels (each image containing image data in a single color channel) using super-resolution processing is shown. A depth map generated using the techniques outlined above is illustrated in FIG. 18J. A binary confidence map generated by thresholding SNR generated in accordance with an embodiment of the invention is illustrated in FIG. 18K. A multibit resolution confidence map generated in accordance with an embodiment of the invention using SNR is illustrated in FIG. 18L. A binary confidence map generated by thresholding the SNR of the region surrounding each pixel and by thresholding the number of pixels in the images within the light field that correspond to a pixel location in the image from the reference viewpoint that are occluded in accordance with an embodiment of the invention is illustrated in FIG. 18M. A depth map filtered using the binary confidence map shown in FIG. 18M is illustrated in FIG. 18N.

In several embodiments, a confidence map generated using one or more of the metrics described above can be inserted as an additional channel of information into the JPEG-DZ file format or other file formats. In several embodiments, the confidence map is encoded and decoded using processes similar to those outlined in U.S. patent application Ser. No. 13/631,731 to Venkataraman et al. entitled “Systems and Methods for Encoding Light Field Image Files”, filed Sep. 28, 2012. The disclosure of U.S. patent application Ser. No. 13/631,731 is herein incorporated by reference in its entirety. Although specific metrics for determining confidence are described above, any of a variety of metrics for determining confidence appropriate to the requirements of a specific application can be utilized in accordance with embodiments of the invention.

Reducing Computational Complexity

A variety of strategies can be utilized to reduce the computational complexity of the processes outlined above for determining depth maps and for determining the visibility of images captured by a camera array. In several embodiments, a depth map is constructed by only searching for depth at a reduced (i.e. sparse) subset of pixel locations. The depth search is done at fewer points (i.e. a sparser set of points in the image) and for points that depth is not calculated, the depth is assigned through other means. By the end, this sparse depth search provides a depth for every pixel location in a reference image where some pixels are searched and others are filled in through interpolation. As previously stated, not every pixel in the final depth map has a depth obtained by comparing the similarity of the pixel to corresponding pixels in the captured images. Instead, in regions where no correspondence search is done, the depths of many of the pixels are determined using processes including (but not limited to) averaging the depths of surrounding pixels (where the correspondence search has been run) or interpolating the depths of adjacent pixels which have been calculated. By reducing the number of pixels for which depth measurements are performed, the amount of computation used to generate a depth map can be reduced. In several embodiments, the amount of computation used when detecting a depth map can also be reduced by detecting textureless areas of the image and using processes including (but not limited to) assigning a single depth value from the nearest indicator pixel where depth has been calculated, averaging the depths of surrounding pixels or interpolating the depths of adjacent pixels to determine the depth of pixels in the textureless areas of the image. In other embodiments, any of a variety of processes for reducing the computational complexity of generating a depth map can be utilized as appropriate to the requirements of specific applications in accordance with embodiments of the invention including varying the precision of the depth estimates within the depth map based upon characteristics of the scene including (but not limited to) regions containing edges, and/or based upon object distance. Processes for generating depth maps from sparse depth searches and for detecting textureless regions in images in accordance with embodiments of the invention are discussed further below.

Generating Depth Maps From Sparse Depth Search

Processes for generating depth maps through sparse search in accordance with embodiments of the invention typically involve determining depth of a sparse set of pixels spaced or distributed throughout the reference image. Based upon this initial depth map consisting of sparse points, depth transitions can be detected and the depths of pixels surrounding the depth transitions can be directly measured using the processes outlined above. The depths of the remaining pixels can be determined based upon the depths of sparse pixels in the depth map. In many embodiments, the depth measurements are performed using a subset of the pixels in the reference image at the resolution at which they were captured.

A process for determining a depth map through sparse search in accordance with an embodiment of the invention is illustrated in FIG. 13. The process 1300 includes dividing (1302) the reference image into spatial blocks (or groups of associated pixels) and generating (1304) depth measurements for a sparser subset of indicator pixels within the spatial blocks. Here, spatial block may be taken to refer interchangeably to a rectangular block of pixels, or a subset of associated pixels that need not conform to any particular shape.

Indicator pixels are a subset of the pixels within the spatial block (or group of associated pixels) and are typically selected to provide information concerning variation in depth across the spatial block. A spatial block 1400 including a plurality of indicator pixels 1402 in accordance with an embodiment of the invention is illustrated in FIG. 14. The indicator pixels 1402 are selected at the edges and at the center of the spatial block. Although specific indicator pixels are illustrated in FIG. 14, the arrangement of indicators within a spatial block or group of associated pixels can be varied and/or any of a variety of pixels within a spatial block can be selected as indicator pixels as appropriate to the requirements of a specific application. In a number of embodiments, different shaped spatial blocks are utilized and the shape of the spatial block can be varied. In several embodiments, the arrangement of indicator pixels within the spatial blocks can be varied. In many embodiments, the indicator pixels are selected based upon scene content. In certain embodiments, the indicator pixels are selected based on which points within the spatial block have the highest SNR in the reference image to increase the likelihood that the points most likely to give confident depth results are used. In another embodiment, fixed spatial positions are chosen for some indicator pixels (as indicated in FIG. 14) for all blocks, and some subset of indicator pixels are assigned to points with highest SNR in the spatial block (i.e. a mixed configuration). In another embodiment, a segmentation process can be used to create relevant spatial regions based on scene content. Although a rectangular spatial block is shown other techniques could be used for splitting the image into spatial clusters, which contain some indicator pixels as described above. Furthermore spatial blocks can be larger in certain portions of the image than in others.

Referring back to FIG. 13, depth can be assigned (1306) to the pixels in each block based upon the depths of the indicator pixels. In several embodiments, the assigned depth is obtained through interpolation of the neighboring indicator pixels. In several embodiments, the depth of a non-indicator pixel may be calculated as a normalized weighted average of the distances to the nearest indicator pixels within a fixed neighborhood. Alternatively, nearest neighbor interpolation (1308) can be utilized to assign depths to the pixels in the spatial block based upon the depth measurements of the indicator pixels. In another embodiment, weights for the interpolation can incorporate intensity similarity as well as spatial distance to the nearest indicator pixels. In another embodiment, a non-linear regression such as (but not limited to) a Kernel Regression may be used to fill in the missing positions between depths sampled at the indicator pixel positions. In another embodiment, a single depth for the entire block is assigned by minimizing the summed costs of the indicator pixels within the block. In other embodiments, any of a variety of techniques can be utilized to generate depth information for pixels within a spatial block.

In many embodiments, the reliability of each of the spatial blocks in the depth map is determined (1310). Within the spatial block, depths will have been estimated both for indicator pixels (where search has occurred) and non-indicator pixels (where depths have been interpolated based on indicator pixel results). For the indicator and non-indicator pixels, the costs of the estimated depths within the block are determined. The costs of each pixel in the block are summed to create a reliability indicator. If the total cost of all pixels within the block is greater than a threshold, then the spatial block is marked as unreliable due to the fact that the estimated depths for some pixels appear to have poor correspondence. Where a spatial block has been determined to have low reliability of poor spatial correspondence, then the block is likely to contain a depth transition or occlusion. If such is the case, then the full correspondence search and occlusion processing can be run within the spatial block.

If a spatial block is determined to have a depth transition per the criteria above, then the spatial block may be ‘split’ and new sets indicator pixels selected in each of the two child spatial blocks and the process iterated. In one embodiment, the block may be split in half. In another embodiment, the block may be split into unequal regions depending on the depths solved by the indicator pixels within the spatial block.

Where depth transitions are detected within and/or between spatial blocks, the depth map can be refined (1312) by performing additional depth measurements within the spatial block that contains the depth transitions. In this way, the computational complexity of generating the depth map is reduced by reducing the number of depth measurements performed in generating an accurate depth map.

Although a specific process for generating a depth map from sparse searches in accordance with embodiments of the invention is illustrated in FIG. 13, any of a variety of processes that generate a depth map by performing fewer depth measurements in regions of similar or slowly transitioning depth can be utilized in accordance with embodiments of the invention.

Reducing Computation in Textureless Regions of Images

In many embodiments, the process of generating a depth map involves reducing the amount of computation needed for textureless regions of the image. Textureless areas can be ambiguous with parallax, because the corresponding pixels at many hypothesized depths may be similar. Therefore, depth measurements within a textureless area can generate unreliable and noisy results. In many embodiments, the SNR in the region surrounding a pixel is used when determining the depth of the pixel to identify whether the pixel is in a textureless area. An initial depth estimate or a set of initial depth estimates for a given pixel can be determined based upon the depth of at least one adjacent pixel for which a depth has previously been determined. When the variance of the corresponding pixels for the given pixel (or any other similarity measure) is below the SNR threshold in the region surrounding the pixel, the pixel can be assumed to be part of a textureless area and (one of) the approaches described below can be used to select the depth of pixel. Otherwise, a depth measurement can be performed using a process similar to the processes described above.

In many embodiments, textureless regions may be detected using a fixed threshold on the SNR. The computation for the search in such regions may be reduced by reducing the number of depths searched. In many embodiments, the full set of depths will be searched until a minimum cost depth is identified that is below a noise-dependent threshold that takes into account the noise characteristics of the sensor. When the minimum cost is found to be below the threshold the depth is accepted as the depth of the textureless region and no more depths are searched (i.e. the search is terminated as soon as a depth that has “close enough” correspondence is found). In many embodiments, the search in textureless regions may save computation by searching the full range of disparity but at larger increments than are done in the normal search for a region with texture (i.e. reducing the number of depths searched)—the best cost will be selected as the depth of the pixel in the textureless region.

A process for detecting textureless regions using the SNR surrounding a pixel in accordance with an embodiment of the invention is illustrated in FIG. 15. The process 1500 includes selecting (1502) a pixel from the reference image and detecting (1504) the SNR in the region around the selected pixel. An initial hypothesized depth d can be determined (1506) for the pixel. In many embodiments, the initial hypothesized depth d is determined based upon the depth of one or more pixels in the region surrounding the selected pixel. A determination (1508) is then made concerning whether the variance or cost of the corresponding pixels at the hypothesized depth is below a threshold that can be (but is not limited to) predetermined or a function of the SNR in the region surrounding the selected pixel. In other embodiments, any of a variety of similarity measures can be utilized to determine whether the region surrounding the pixel is textureless. In the event that variance or cost of the corresponding pixels is below a noise or predetermined threshold, then the hypothesized depth is selected as the most likely depth on the assumption that the pixel is located within a textureless region. When the variance or cost of the corresponding pixels exceeds the noise or predetermined threshold, then the depth of a pixel is determined in accordance with a process similar to the processes described above.

Although a specific process for detecting textureless areas within a reference image are described above with respect to FIG. 15, any of a variety of processes for detecting textureless areas in an image can be utilized in accordance with embodiments. Furthermore, any of a variety of processes can be utilized to detect other characteristics of an image that can be relied upon to reduce the number of depth measurements that are made in generating a reliable depth map in accordance with embodiments of the invention.

Generating Depth Maps from Virtual Viewpoints

While much of the discussion provided above describes the generation of depth maps with respect to images captured by a reference camera, systems and methods in accordance with embodiments of the invention can synthesize images from virtual viewpoints. A virtual viewpoint is a reference viewpoint that does not correspond to the viewpoint of any cameras within a camera array. Accordingly, irrespective of the number of color channels within a camera array, none of the color channels include a camera in which image data is captured from the reference viewpoint. An example of a virtual viewpoint that can be defined in accordance with an embodiment of the invention is illustrated in FIG. 12. The array camera module 1200 includes a 4×4 array of cameras including 8 Green cameras, 4 Red cameras, and 4 Blue cameras. A virtual camera 1202 is defined with a virtual viewpoint at the center of the array camera module. Although a specific virtual viewpoint is illustrated in FIG. 12, any virtual viewpoint can be arbitrarily defined with respect to the cameras within a camera array.

When determining a virtual viewpoint depth map, there is no explicit reference camera which can be searched and used for cost metric comparisons. In many embodiments, the depth of a given pixel (x, y) in an image synthesized from the virtual viewpoint is determined by calculating the effective baseline from the virtual imager with respect to all other cameras in the array. The baseline for a camera at position (i,j) with respect to the virtual viewpoint would be (i,j)−(i_v,j_v) where (i_v,j_v) is the location of the virtual viewpoint 1202. Once the baselines between the individual cameras is determined with respect to the virtual viewpoint, the process of estimating depth proceeds by searching for depths at which corresponding pixels having the highest similarity. For each pixel (x, y) in the virtual reference camera (i.e. an image from the virtual viewpoint), the search can proceed much as in the typical parallax scenario, where for each depth d, the disparity with respect to each of the alternate view cameras is determined at that depth, and then the similarity of corresponding pixels in one or more of the color channels is determined using an appropriate cost metric. In many embodiments, the combination cost metric described above for determining the similarity of pixels in color channels that do not contain the reference camera can be utilized. In many embodiments, a camera adjacent the virtual viewpoint in a specific color channel can be used as a reference camera for the purpose of determining the similarity of the pixel in the chosen reference camera with corresponding pixels in image data captured by other cameras in the color channel. In many embodiments, a Green camera is chosen as a reference camera for the purpose of determining the similarity of corresponding Green pixels and a combination cost metric is used for corresponding pixels in other color channels. In many embodiments, the process of determining an initial depth map for the virtual viewpoint can involve forming groups of cameras corresponding to patterns of visibility within the scene in a similar manner to that described above with respect to FIGS. 8A-8I.

A depth map generated for a virtual viewpoint can be utilized to synthesize a high resolution image from a virtual viewpoint using a super-resolution process in accordance with embodiments of the invention. The primary difference in the synthesis of a high resolution image from a virtual viewpoint is that the high resolution grid is from a virtual viewpoint, and the pixels are fused to the high resolution grid using correspondences calculated with baselines which are with respect to the virtual view position and not a physical reference camera. In this case there is no physical reference camera having pixels that map regularly on the high resolution grid. As can be readily appreciated, processes for determining confidence maps for depth maps with respect to virtual viewpoints can be determined using similar accommodations related to analyzing the synthesized reference image or choosing an image close to the virtual viewpoint as a proxy for performing SNR and/or other related measurements. Although specific processes for generating depth maps with respect to virtual viewpoints are described above, any of a variety of processes incorporating the cost metrics and techniques outlined above can be utilized to generate depth estimates for virtual viewpoints in accordance with embodiments of the invention. Systems for performing parallax detection and correction and for generating depth maps in accordance with embodiments of the invention are discussed further below.

Systems for Performing Parallax Detection

A system for generating a depth map and visibility information using processes similar to those described above is illustrated in FIG. 16. The system includes a parallax detection module 1600 that takes as an input captured images that form a light field and calibration information for an array camera and outputs a depth map, and the estimated visibility of the pixels of the captured images. In many embodiments, the parallax detection module 1600 also outputs a confidence map indicating the reliability of the depth measurements for specific pixels within the reference image. As is discussed further below, the depth map, estimated visibility information, and/or confidence map can be provided to a super-resolution processing module within an array camera to generate a higher resolution image from the captured images and to any of a variety of applications that can utilize depth, confidence and/or visibility information. In many embodiments, the parallax detection module and the super-resolution module are implemented in software and/or firmware on a microprocessor within the array camera. In several embodiments, the software associated with the parallax detection module and the super-resolution module is stored within memory within the array camera. In other embodiments, the parallax detection module and/or the super-resolution module can be implemented using any appropriately configured hardware and/or software. The generation of high resolution images from a light field captured by an array camera using a depth map generated in accordance with embodiments of the invention is discussed further below.

Super-Resolution Processing Using Depth Maps

As is noted in U.S. patent application Ser. No. 12/967,807 (incorporated by reference above) disparity between images can introduce significant artifacts when performing super-resolution processing. Therefore, the super-resolution processes disclosed in U.S. patent application Ser. No. 12/967,807 can involve applying scene dependent geometric corrections to the location of each of the pixels in the images captured by an array camera prior to using the images to synthesize a higher resolution image. The baseline and back focal length of the cameras in an array camera can be readily determined, therefore, the unknown quantity in estimating the scene dependent geometric shifts observed in the captured images is the distance between the array camera and different portions of the scene. When a depth map and visibility information is generated in accordance with the processes outlined above, the scene dependent geometric shifts resulting from the depths of each of the pixels can be determined and occluded pixels can be ignored when performing super-resolution processing. In many embodiments, a confidence map is generated as part of the process of generating a depth map and the confidence map is provided as an input to the super-resolution process to assist the super-resolution process in evaluating the reliability of depth estimates contained within the depth map when performing fusion of the pixels from the input images.

A process for generating a high resolution image using a light field captured by an array camera involving the generation of a depth map in accordance with an embodiment of the invention is illustrated in FIG. 17. The process 1700 involves capturing (1702) a light field using an array camera and selecting (1704) a reference viewpoint that can be utilized to synthesize a high resolution image. In many embodiments, the reference viewpoint is predetermined based upon the configuration of the array camera. In a number of embodiments, calibration information is utilized to increase (1706) the correspondence between captured images. In many embodiments, the correspondence between captured images involves resampling the images. An initial depth map is determined (1708) and occlusions are determined and used to update (1710) the depth map. In several embodiments, the process of detecting occlusions and updating the depth map is iterative.

In a number of embodiments, the depth map is utilized to generate (1712) information concerning the visibility of the pixels within the captured light field from the reference viewpoint. In several embodiments, a confidence map is (optionally) generated (1713) with respect to the depth estimates contained within the depth map and the depth map, the visibility information, and/or the confidence map are provided (1714) to a super-resolution processing pipeline. In several embodiments, the super-resolution processing pipeline is similar to any of the super-resolution processing pipelines disclosed in U.S. patent application Ser. No. 12/967,807. The super-resolution processing pipeline utilizes information including the light field, the depth map, the visibility information, and the confidence map to synthesize (1718) a high resolution image from the reference viewpoint, which is output (1718) by the array camera. In several embodiments, the process of synthesizing a higher resolution image involves a pilot fusion of the image data from the light field onto a higher resolution grid. The result of the pilot fusion can then be utilized as a starting point to synthesize a higher resolution image using the super-resolution process.

As is described in U.S. patent application Ser. No. 12/967,807, the process illustrated in FIG. 17 can be performed to synthesize a stereoscopic 3D image pair from the captured light field. Although a specific process for synthesizing a high resolution image from a captured light field is illustrated in FIG. 17, any of a variety of processes for synthesizing high resolution images from captured light fields involving the measurement of the depth of pixels within the light field can be utilized in accordance with embodiments of the invention.

While the above description contains many specific embodiments of the invention, these should not be construed as limitations on the scope of the invention, but rather as an example of one embodiment thereof. Accordingly, the scope of the invention should be determined not by the embodiments illustrated, but by the appended claims and their equivalents.

Claims

1. A method of estimating distances to objects within a scene from a light field comprising a set of images captured from different viewpoints using a processor configured by an image processing application, the method comprising: selecting a reference image from a reference viewpoint relative to the viewpoints of the set of images captured from different viewpoints;determining depth estimates for pixel locations in the reference image using at least a subset of the set of images, where a depth estimate for a given pixel location in the reference image is determined by:dividing the reference image into spatial blocks;generating depth measurements for a sparser subset of indicator pixels within the spatial blocks, where the spatial block includes a plurality of indicator pixels;assigning a depth to the pixels in each of the spatial blocks based upon the depths of the indicator pixels.
2. The method of claim 1, further comprising: determine a reliability of the depth assigned to each of the spatial blocks;when a depth transition is detected within spatial blocks, refining the depth of the spatial block by performing additional depth measurements within the spatial bock that includes the depth transitions.
3. The method of claim 1, wherein a depth of a non-indicator pixels is calculated as a normalized weighted average of the distances to the nearest indicator pixels within a fixed neighborhood.
4. The method of claim 1, further comprising using nearest neighbor interpolation to assign depths to the pixels in the spatial block based upon the depth measurements of the indicator pixels.
5. The method of claim 1, wherein assigning a depth to the pixels in each of the spatial blocks is obtained through interpolation of neighboring indicator pixels.
6. The method of claim 5, wherein weights for the interpolation incorporate intensity similarity and spatial distance to the nearest indicator pixel.
7. The method of claim 1, further comprising measuring similarity of corresponding pixels using a block-based similarity measure.
8. The method of claim 1, wherein selecting a reference viewpoint relative to the viewpoints of the set of images captured from different viewpoints comprises selecting a viewpoint from the set consisting of: the viewpoint of one of the images; anda virtual viewpoint.
9. The method of claim 1, further comprising: identifying pixels as corresponding pixels in the at least a subset of the set of images that correspond to the given pixel location in the reference image based upon expected disparity at a plurality of depths, wherein disparity determines an amount an object shifts between images from different cameras and is based on a baseline distance between the different cameras;comparing a similarity of intensity values of the corresponding pixels identified in the subset of the set of images at each of the plurality of depths; andselecting the depth from the plurality of depths at which the identified corresponding pixels have the highest degree of similarity as a depth estimate for the given pixel location in the reference image.
10. The method of claim 9, wherein comparing a similarity of intensity values of the corresponding pixels comprises using a cost function to determine the similarity of corresponding pixels.
11. The method of claim 9, wherein identifying corresponding pixels comprises determining that a pixel in a given image from the set of images corresponds to the pixel location in the reference image from the reference viewpoint by applying a scene dependent shift to the pixel location in the reference image from the reference viewpoint that is determined based upon: a depth estimate of the pixel location in the reference image; anda baseline between the viewpoint of the given image and the reference viewpoint.
12. The method of claim 9, further comprising generating confidence metrics for the current depth estimates for pixel locations in the reference image from the reference viewpoint, wherein a confidence metric for a particular pixel location is based on a signal to noise ratio, SNR, in a region of the particular pixel location with which a depth estimate is associated.
13. The method of claim 9, wherein selecting the depth from the plurality of depths at which the identified corresponding pixels have the highest degree of similarity as the depth estimate for the given pixel location in the reference image from the reference viewpoint further comprises selecting the depth from the plurality of depths at which the cost function for the identified corresponding pixels indicates the highest level of similarity.
14. The method of claim 9, further comprising fusing pixels from the set of images using the processor configured by the image processing application based upon the depth estimates to create a fused image having a resolution that is greater than the resolutions of the images in the set of images.
15. The method of claim 14, further comprising synthesizing an image from the reference viewpoint using the processor configured by the image processing application to perform a super-resolution process based upon the fused image from the reference viewpoint, the set of images captured from different viewpoints, and the depth estimates.
16. The method of claim 1, further comprising: normalizing the set of images, wherein the normalizing comprises using photometric calibration data to perform photometric normalization.
17. The method of claim 16, wherein normalizing the set of images further comprises: utilizing calibration information to correct for photometric variations and scene-independent geometric distortions in the images in the set of images; andrectification of the images in the set of images.
18. The method of claim 17, wherein normalizing the set of images further comprises: resampling the images to increase the similarity of corresponding pixels by correcting for geometric lens distortion; andapplying scene-independent geometric corrections to the images at a sub-pixel resolution.
19. An array camera, comprising: a plurality of cameras;a processing system;memory containing an image processing application; wherein the processor is configured by the image processing application to estimate distances to objects within a scene by:selecting a reference image from a reference viewpoint relative to the viewpoints of the set of images captured from different viewpoints;determining depth estimates for pixel locations in the reference image using at least a subset of the set of images, where a depth estimate for a given pixel location in the reference image is determined by:dividing the reference image into spatial blocks;generating depth measurements for a sparser subset of indicator pixels within the spatial blocks, where the spatial block includes a plurality of indicator pixels;assigning a depth to the pixels in each of the spatial blocks based upon the depths of the indicator pixels.

CROSS-REFERENCE TO RELATED APPLICATIONS

The current application claims priority as a continuation of U.S. patent application Ser. No. 16/537,369 entitled “System and Methods for Measuring Depth Using an Array of Independently Controllable Cameras”, filed Aug. 9, 2019 to Ciurea et al., which is a continuation of U.S. patent application Ser. No. 15/858,974 entitled “Systems and Methods for Estimating Depth and Visibility from a Reference Viewpoint for Pixels in a Set of Images Captured from Different Viewpoints”, filed Dec. 29, 2017 and issued on Aug. 13, 2019 as U.S. Pat. No. 10,380,752 to Ciurea et al., which is a continuation of U.S. patent application Ser. No. 14/988,680 entitled “Systems and Methods for Estimating Depth and Visibility from a Reference Viewpoint for Pixels in a Set of Images Captured from Different Viewpoints”, filed Jan. 5, 2016 and issued on Jan. 2, 2018 as U.S. Pat. No. 9,858,673 to Ciurea et al., which is a continuation of U.S. patent application Ser. No. 14/526,392 entitled “Systems and Methods for Estimating Depth and Visibility from a Reference Viewpoint for Pixels in a Set of Images Captured from Different Viewpoints”, filed Oct. 28, 2014 and issued on Jan. 12, 2016 as U.S. Pat. No. 9,235,900 to Ciurea et al., which is a continuation of U.S. patent application Ser. No. 14/329,754 entitled “Systems and Methods for Estimating Depth and Visibility from a Reference Viewpoint for Pixels in a Set of Images Captured from Different Viewpoints”, filed Jul. 11, 2014 and issued on Jan. 19, 2016 as U.S. Pat. No. 9,240,049 to Ciurea et al., which is a continuation of U.S. patent application Ser. No. 14/144,458 entitled “Systems and Methods for Performing Depth Estimation using Image Data from Multiple Spectral Channels”, filed Dec. 30, 2013 and issued on Jul. 15, 2014 as U.S. Pat. No. 8,780,113 to Ciurea et al., which is a continuation of U.S. patent application Ser. No. 13/972,881 entitled “Systems and Methods for Parallax Detection and Correction in Images Captured Using Array Cameras that Contain Occlusions using Subsets of Images to Perform Depth Estimation”, filed Aug. 21, 2013 and issued on Dec. 31, 2013 as U.S. Pat. No. 8,619,082 to Ciurea et al., which claims priority to U.S. Provisional Patent Application Ser. No. 61/691,666 to Venkataraman et al. entitled “Systems and Methods for Parallax Detection and Correction in Images Captured using Array Cameras”, filed Aug. 21, 2012 and U.S. Provisional Patent Application Ser. No. 61/780,906 to Venkataraman et al. entitled “Systems and Methods for Parallax Detection and Correction in Images Captured using Array Cameras”, filed Mar. 13, 2013, the disclosures of which are incorporated herein by reference in their entireties.

US Referenced Citations (1201)

Number	Name	Date	Kind
4124798	Thompson	Nov 1978	A
4198646	Alexander et al.	Apr 1980	A
4323925	Abell et al.	Apr 1982	A
4460449	Montalbano	Jul 1984	A
4467365	Murayama et al.	Aug 1984	A
4652909	Glenn	Mar 1987	A
4888645	Mitchell et al.	Dec 1989	A
4899060	Lischke	Feb 1990	A
4962425	Rea	Oct 1990	A
5005083	Grage et al.	Apr 1991	A
5070414	Tsutsumi	Dec 1991	A
5144448	Hornbaker et al.	Sep 1992	A
5157499	Oguma et al.	Oct 1992	A
5325449	Burt et al.	Jun 1994	A
5327125	Iwase et al.	Jul 1994	A
5463464	Ladewski	Oct 1995	A
5488674	Burt et al.	Jan 1996	A
5629524	Stettner et al.	May 1997	A
5638461	Fridge	Jun 1997	A
5757425	Barton et al.	May 1998	A
5793900	Nourbakhsh et al.	Aug 1998	A
5801919	Griencewic	Sep 1998	A
5808350	Jack et al.	Sep 1998	A
5832312	Rieger et al.	Nov 1998	A
5833507	Woodgate et al.	Nov 1998	A
5880691	Fossum et al.	Mar 1999	A
5911008	Niikura et al.	Jun 1999	A
5933190	Dierickx et al.	Aug 1999	A
5963664	Kumar et al.	Oct 1999	A
5973844	Burger	Oct 1999	A
6002743	Telymonde	Dec 1999	A
6005607	Uomori et al.	Dec 1999	A
6034690	Gallery et al.	Mar 2000	A
6069351	Mack	May 2000	A
6069365	Chow et al.	May 2000	A
6095989	Hay et al.	Aug 2000	A
6097394	Levoy et al.	Aug 2000	A
6124974	Burger	Sep 2000	A
6130786	Osawa et al.	Oct 2000	A
6137100	Fossum et al.	Oct 2000	A
6137535	Meyers	Oct 2000	A
6141048	Meyers	Oct 2000	A
6160909	Melen	Dec 2000	A
6163414	Kikuchi et al.	Dec 2000	A
6172352	Liu	Jan 2001	B1
6175379	Uomori et al.	Jan 2001	B1
6205241	Melen	Mar 2001	B1
6239909	Hayashi et al.	May 2001	B1
6292713	Jouppi et al.	Sep 2001	B1
6340994	Margulis et al.	Jan 2002	B1
6358862	Ireland et al.	Mar 2002	B1
6373518	Sogawa	Apr 2002	B1
6419638	Hay et al.	Jul 2002	B1
6443579	Myers	Sep 2002	B1
6445815	Sato	Sep 2002	B1
6476805	Shum et al.	Nov 2002	B1
6477260	Shimomura	Nov 2002	B1
6502097	Chan et al.	Dec 2002	B1
6525302	Dowski, Jr. et al.	Feb 2003	B2
6552742	Seta	Apr 2003	B1
6563537	Kawamura et al.	May 2003	B1
6571466	Glenn et al.	Jun 2003	B1
6603513	Berezin	Aug 2003	B1
6611289	Yu et al.	Aug 2003	B1
6627896	Hashimoto et al.	Sep 2003	B1
6628330	Lin	Sep 2003	B1
6628845	Stone et al.	Sep 2003	B1
6635941	Suda	Oct 2003	B2
6639596	Shum et al.	Oct 2003	B1
6647142	Beardsley	Nov 2003	B1
6657218	Noda	Dec 2003	B2
6671399	Berestov	Dec 2003	B1
6674892	Melen	Jan 2004	B1
6750904	Lambert	Jun 2004	B1
6765617	Tangen et al.	Jul 2004	B1
6771833	Edgar	Aug 2004	B1
6774941	Boisvert et al.	Aug 2004	B1
6788338	Dinev et al.	Sep 2004	B1
6795253	Shinohara	Sep 2004	B2
6801653	Wu et al.	Oct 2004	B1
6819328	Moriwaki et al.	Nov 2004	B1
6819358	Kagle et al.	Nov 2004	B1
6833863	Clemens	Dec 2004	B1
6879735	Portniaguine et al.	Apr 2005	B1
6897454	Sasaki et al.	May 2005	B2
6903770	Kobayashi et al.	Jun 2005	B1
6909121	Nishikawa	Jun 2005	B2
6917702	Beardsley	Jul 2005	B2
6927922	George et al.	Aug 2005	B2
6958862	Joseph	Oct 2005	B1
6985175	Iwai et al.	Jan 2006	B2
7015954	Foote et al.	Mar 2006	B1
7085409	Sawhney et al.	Aug 2006	B2
7161614	Yamashita et al.	Jan 2007	B1
7199348	Olsen et al.	Apr 2007	B2
7206449	Raskar et al.	Apr 2007	B2
7215364	Wachtel et al.	May 2007	B2
7235785	Hornback et al.	Jun 2007	B2
7245761	Swaminathan et al.	Jul 2007	B2
7262799	Suda	Aug 2007	B2
7292735	Blake et al.	Nov 2007	B2
7295697	Satoh	Nov 2007	B1
7333651	Kim et al.	Feb 2008	B1
7369165	Bosco et al.	May 2008	B2
7391572	Jacobowitz et al.	Jun 2008	B2
7408725	Sato	Aug 2008	B2
7425984	Chen et al.	Sep 2008	B2
7430312	Gu	Sep 2008	B2
7471765	Jaffray et al.	Dec 2008	B2
7496293	Shamir et al.	Feb 2009	B2
7564019	Olsen et al.	Jul 2009	B2
7599547	Sun et al.	Oct 2009	B2
7606484	Richards et al.	Oct 2009	B1
7620265	Wolff et al.	Nov 2009	B1
7633511	Shum et al.	Dec 2009	B2
7639435	Chiang	Dec 2009	B2
7639838	Nims	Dec 2009	B2
7646549	Zalevsky et al.	Jan 2010	B2
7657090	Omatsu et al.	Feb 2010	B2
7667824	Moran	Feb 2010	B1
7675080	Boettiger	Mar 2010	B2
7675681	Tomikawa et al.	Mar 2010	B2
7706634	Schmitt et al.	Apr 2010	B2
7723662	Levoy et al.	May 2010	B2
7738013	Galambos et al.	Jun 2010	B2
7741620	Doering et al.	Jun 2010	B2
7782364	Smith	Aug 2010	B2
7826153	Hong	Nov 2010	B2
7840067	Shen et al.	Nov 2010	B2
7912673	Hébert et al.	Mar 2011	B2
7924321	Nayar et al.	Apr 2011	B2
7956871	Fainstain et al.	Jun 2011	B2
7965314	Miller et al.	Jun 2011	B1
7973834	Yang	Jul 2011	B2
7986018	Rennie	Jul 2011	B2
7990447	Honda et al.	Aug 2011	B2
8000498	Shih et al.	Aug 2011	B2
8013904	Tan et al.	Sep 2011	B2
8027531	Wilburn et al.	Sep 2011	B2
8044994	Vetro et al.	Oct 2011	B2
8055466	Bryll	Nov 2011	B2
8077245	Adamo et al.	Dec 2011	B2
8089515	Chebil et al.	Jan 2012	B2
8098297	Crisan et al.	Jan 2012	B2
8098304	Pinto et al.	Jan 2012	B2
8106949	Tan et al.	Jan 2012	B2
8111910	Tanaka	Feb 2012	B2
8126279	Marcellin et al.	Feb 2012	B2
8130120	Kawabata et al.	Mar 2012	B2
8131097	Lelescu et al.	Mar 2012	B2
8149323	Li et al.	Apr 2012	B2
8164629	Zhang	Apr 2012	B1
8169486	Corcoran et al.	May 2012	B2
8180145	Wu et al.	May 2012	B2
8189065	Georgiev et al.	May 2012	B2
8189089	Georgiev et al.	May 2012	B1
8194296	Compton et al.	Jun 2012	B2
8212914	Chiu	Jul 2012	B2
8213711	Tam	Jul 2012	B2
8231814	Duparre	Jul 2012	B2
8242426	Ward et al.	Aug 2012	B2
8244027	Takahashi	Aug 2012	B2
8244058	Intwala et al.	Aug 2012	B1
8254668	Mashitani et al.	Aug 2012	B2
8279325	Pitts et al.	Oct 2012	B2
8280194	Wong et al.	Oct 2012	B2
8284240	Saint-Pierre et al.	Oct 2012	B2
8289409	Chang	Oct 2012	B2
8289440	Pitts et al.	Oct 2012	B2
8290358	Georgiev	Oct 2012	B1
8294099	Blackwell, Jr.	Oct 2012	B2
8294754	Jung et al.	Oct 2012	B2
8300085	Yang et al.	Oct 2012	B2
8305456	McMahon	Nov 2012	B1
8315476	Georgiev et al.	Nov 2012	B1
8345144	Georgiev et al.	Jan 2013	B1
8360574	Ishak et al.	Jan 2013	B2
8400555	Georgiev et al.	Mar 2013	B1
8406562	Bassi et al.	Mar 2013	B2
8411146	Twede	Apr 2013	B2
8416282	Lablans	Apr 2013	B2
8446492	Nakano et al.	May 2013	B2
8456517	Spektor et al.	Jun 2013	B2
8493496	Freedman et al.	Jul 2013	B2
8514291	Chang	Aug 2013	B2
8514491	Duparre	Aug 2013	B2
8541730	Inuiya	Sep 2013	B2
8542933	Venkataraman et al.	Sep 2013	B2
8553093	Wong et al.	Oct 2013	B2
8559705	Ng	Oct 2013	B2
8559756	Georgiev et al.	Oct 2013	B2
8565547	Strandemar	Oct 2013	B2
8576302	Yoshikawa	Nov 2013	B2
8577183	Robinson	Nov 2013	B2
8581995	Lin et al.	Nov 2013	B2
8619082	Ciurea et al.	Dec 2013	B1
8648918	Kauker et al.	Feb 2014	B2
8648919	Mantzel et al.	Feb 2014	B2
8655052	Spooner et al.	Feb 2014	B2
8682107	Yoon et al.	Mar 2014	B2
8687087	Pertsel et al.	Apr 2014	B2
8692893	McMahon	Apr 2014	B2
8749620	Knight et al.	Jun 2014	B1
8754941	Sarwari et al.	Jun 2014	B1
8773536	Zhang	Jul 2014	B1
8780113	Ciurea et al.	Jul 2014	B1
8804255	Duparre	Aug 2014	B2
8823813	Mantzel et al.	Sep 2014	B2
8830375	Ludwig	Sep 2014	B2
8831367	Venkataraman et al.	Sep 2014	B2
8831377	Pitts et al.	Sep 2014	B2
8836793	Kriesel et al.	Sep 2014	B1
8842201	Tajiri	Sep 2014	B2
8854462	Herbin et al.	Oct 2014	B2
8861089	Duparre	Oct 2014	B2
8866912	Mullis	Oct 2014	B2
8866920	Venkataraman et al.	Oct 2014	B2
8866951	Keelan	Oct 2014	B2
8878950	Lelescu et al.	Nov 2014	B2
8885059	Venkataraman et al.	Nov 2014	B1
8885922	Ito et al.	Nov 2014	B2
8896594	Xiong et al.	Nov 2014	B2
8896719	Venkataraman et al.	Nov 2014	B1
8902321	Venkataraman et al.	Dec 2014	B2
8928793	McMahon	Jan 2015	B2
8977038	Tian et al.	Mar 2015	B2
9001226	Ng et al.	Apr 2015	B1
9019426	Han et al.	Apr 2015	B2
9025894	Venkataraman et al.	May 2015	B2
9025895	Venkataraman et al.	May 2015	B2
9030528	Pesach et al.	May 2015	B2
9031335	Venkataraman et al.	May 2015	B2
9031342	Venkataraman	May 2015	B2
9031343	Venkataraman	May 2015	B2
9036928	Venkataraman	May 2015	B2
9036931	Venkataraman et al.	May 2015	B2
9041823	Venkataraman et al.	May 2015	B2
9041824	Lelescu et al.	May 2015	B2
9041829	Venkataraman et al.	May 2015	B2
9042667	Venkataraman et al.	May 2015	B2
9047684	Lelescu et al.	Jun 2015	B2
9049367	Venkataraman et al.	Jun 2015	B2
9055233	Venkataraman et al.	Jun 2015	B2
9060120	Venkataraman et al.	Jun 2015	B2
9060124	Venkataraman et al.	Jun 2015	B2
9077893	Venkataraman et al.	Jul 2015	B2
9094661	Venkataraman et al.	Jul 2015	B2
9100586	McMahon et al.	Aug 2015	B2
9100635	Duparre et al.	Aug 2015	B2
9123117	Ciurea et al.	Sep 2015	B2
9123118	Ciurea et al.	Sep 2015	B2
9124815	Venkataraman et al.	Sep 2015	B2
9124831	Mullis	Sep 2015	B2
9124864	Mullis	Sep 2015	B2
9128228	Duparre	Sep 2015	B2
9129183	Venkataraman et al.	Sep 2015	B2
9129377	Ciurea et al.	Sep 2015	B2
9143711	McMahon	Sep 2015	B2
9147254	Florian et al.	Sep 2015	B2
9185276	Rodda et al.	Nov 2015	B2
9188765	Venkataraman et al.	Nov 2015	B2
9191580	Venkataraman et al.	Nov 2015	B2
9197821	McMahon	Nov 2015	B2
9210392	Nisenzon et al.	Dec 2015	B2
9214013	Venkataraman et al.	Dec 2015	B2
9235898	Venkataraman et al.	Jan 2016	B2
9235900	Ciurea et al.	Jan 2016	B2
9240049	Ciurea et al.	Jan 2016	B2
9253380	Venkataraman et al.	Feb 2016	B2
9253397	Lee et al.	Feb 2016	B2
9256974	Hines	Feb 2016	B1
9264592	Rodda et al.	Feb 2016	B2
9264610	Duparre	Feb 2016	B2
9361662	Lelescu et al.	Jun 2016	B2
9374512	Venkataraman et al.	Jun 2016	B2
9412206	McMahon et al.	Aug 2016	B2
9413953	Maeda	Aug 2016	B2
9426343	Rodda et al.	Aug 2016	B2
9426361	Venkataraman et al.	Aug 2016	B2
9438888	Venkataraman et al.	Sep 2016	B2
9445003	Lelescu et al.	Sep 2016	B1
9456134	Venkataraman et al.	Sep 2016	B2
9456196	Kim et al.	Sep 2016	B2
9462164	Venkataraman et al.	Oct 2016	B2
9485496	Venkataraman et al.	Nov 2016	B2
9497370	Venkataraman et al.	Nov 2016	B2
9497429	Mullis et al.	Nov 2016	B2
9516222	Duparre et al.	Dec 2016	B2
9519972	Venkataraman et al.	Dec 2016	B2
9521319	Rodda et al.	Dec 2016	B2
9521416	McMahon et al.	Dec 2016	B1
9536166	Venkataraman et al.	Jan 2017	B2
9576369	Venkataraman et al.	Feb 2017	B2
9578237	Duparre et al.	Feb 2017	B2
9578259	Molina	Feb 2017	B2
9602805	Venkataraman et al.	Mar 2017	B2
9633442	Venkataraman et al.	Apr 2017	B2
9635274	Lin et al.	Apr 2017	B2
9638883	Duparre	May 2017	B1
9661310	Deng et al.	May 2017	B2
9706132	Nisenzon et al.	Jul 2017	B2
9712759	Venkataraman et al.	Jul 2017	B2
9733486	Lelescu et al.	Aug 2017	B2
9741118	Mullis	Aug 2017	B2
9743051	Venkataraman et al.	Aug 2017	B2
9749547	Venkataraman et al.	Aug 2017	B2
9749568	McMahon	Aug 2017	B2
9754422	McMahon et al.	Sep 2017	B2
9766380	Duparre et al.	Sep 2017	B2
9769365	Jannard	Sep 2017	B1
9774789	Ciurea et al.	Sep 2017	B2
9774831	Venkataraman et al.	Sep 2017	B2
9787911	McMahon et al.	Oct 2017	B2
9794476	Nayar et al.	Oct 2017	B2
9800856	Venkataraman et al.	Oct 2017	B2
9800859	Venkataraman et al.	Oct 2017	B2
9807382	Duparre et al.	Oct 2017	B2
9811753	Venkataraman et al.	Nov 2017	B2
9813616	Lelescu et al.	Nov 2017	B2
9813617	Venkataraman et al.	Nov 2017	B2
9826212	Newton et al.	Nov 2017	B2
9858673	Ciurea et al.	Jan 2018	B2
9864921	Venkataraman et al.	Jan 2018	B2
9888194	Duparre	Feb 2018	B2
9898856	Yang et al.	Feb 2018	B2
9917998	Venkataraman et al.	Mar 2018	B2
9924092	Rodda et al.	Mar 2018	B2
9936148	McMahon	Apr 2018	B2
9955070	Lelescu et al.	Apr 2018	B2
9986224	Mullis	May 2018	B2
10009538	Venkataraman et al.	Jun 2018	B2
10019816	Venkataraman et al.	Jul 2018	B2
10027901	Venkataraman et al.	Jul 2018	B2
10089740	Srikanth et al.	Oct 2018	B2
10091405	Molina	Oct 2018	B2
10119808	Venkataraman et al.	Nov 2018	B2
10122993	Venkataraman et al.	Nov 2018	B2
10127682	Mullis	Nov 2018	B2
10142560	Venkataraman et al.	Nov 2018	B2
10182216	Mullis et al.	Jan 2019	B2
10187560	Chen et al.	Jan 2019	B2
10225543	Mullis	Mar 2019	B2
10250871	Ciurea et al.	Apr 2019	B2
10261219	Duparre et al.	Apr 2019	B2
10275676	Venkataraman et al.	Apr 2019	B2
10306120	Duparre	May 2019	B2
10311649	McMohan et al.	Jun 2019	B2
10334241	Duparre et al.	Jun 2019	B2
10366472	Lelescu et al.	Jul 2019	B2
10375302	Nayar et al.	Aug 2019	B2
10375319	Venkataraman et al.	Aug 2019	B2
10380752	Ciurea et al.	Aug 2019	B2
10382665	Sa et al.	Aug 2019	B2
10390005	Nisenzon et al.	Aug 2019	B2
10412314	McMahon et al.	Sep 2019	B2
10430682	Venkataraman et al.	Oct 2019	B2
10455168	McMahon	Oct 2019	B2
10455218	Venkataraman et al.	Oct 2019	B2
10462362	Lelescu et al.	Oct 2019	B2
10540806	Yang et al.	Jan 2020	B2
10542208	Lelescu et al.	Jan 2020	B2
10547772	Molina	Jan 2020	B2
10560684	Mullis	Feb 2020	B2
10574905	Srikanth et al.	Feb 2020	B2
10638099	Mullis et al.	Apr 2020	B2
10674138	Venkataraman et al.	Jun 2020	B2
10694114	Venkataraman et al.	Jun 2020	B2
10708492	Venkataraman et al.	Jul 2020	B2
10735635	Duparre	Aug 2020	B2
10767981	Venkataraman et al.	Sep 2020	B2
10839485	Lelescu et al.	Nov 2020	B2
10909707	Ciurea et al.	Feb 2021	B2
10958892	Mullis	Mar 2021	B2
10984276	Venkataraman et al.	Apr 2021	B2
11022725	Duparre et al.	Jun 2021	B2
11272161	Mullis	Mar 2022	B2
11315321	Lucas et al.	Apr 2022	B2
11412158	Venkataraman et al.	Aug 2022	B2
11423513	Lelescu et al.	Aug 2022	B2
11486698	Venkataraman et al.	Nov 2022	B2
11570423	Mullis	Jan 2023	B2
11729365	Venkataraman et al.	Aug 2023	B2
11792538	Venkataraman et al.	Oct 2023	B2
20010005225	Clark et al.	Jun 2001	A1
20010019621	Hanna et al.	Sep 2001	A1
20010028038	Hamaguchi et al.	Oct 2001	A1
20010038387	Tomooka et al.	Nov 2001	A1
20020012056	Trevino et al.	Jan 2002	A1
20020015536	Warren et al.	Feb 2002	A1
20020027608	Johnson et al.	Mar 2002	A1
20020028014	Ono	Mar 2002	A1
20020039438	Mori et al.	Apr 2002	A1
20020057845	Fossum et al.	May 2002	A1
20020061131	Sawhney et al.	May 2002	A1
20020063807	Margulis	May 2002	A1
20020075450	Aratani et al.	Jun 2002	A1
20020087403	Meyers et al.	Jul 2002	A1
20020089596	Yasuo	Jul 2002	A1
20020094027	Sato et al.	Jul 2002	A1
20020101528	Lee et al.	Aug 2002	A1
20020113867	Takigawa et al.	Aug 2002	A1
20020113888	Sonoda et al.	Aug 2002	A1
20020118113	Oku et al.	Aug 2002	A1
20020120634	Min et al.	Aug 2002	A1
20020122113	Foote	Sep 2002	A1
20020163054	Suda	Nov 2002	A1
20020167537	Trajkovic	Nov 2002	A1
20020171666	Endo et al.	Nov 2002	A1
20020177054	Saitoh et al.	Nov 2002	A1
20020190991	Efran et al.	Dec 2002	A1
20020195548	Dowski, Jr. et al.	Dec 2002	A1
20030025227	Daniell	Feb 2003	A1
20030026474	Yano	Feb 2003	A1
20030086079	Barth et al.	May 2003	A1
20030124763	Fan et al.	Jul 2003	A1
20030140347	Varsa	Jul 2003	A1
20030156189	Utsumi et al.	Aug 2003	A1
20030179418	Wengender et al.	Sep 2003	A1
20030188659	Merry et al.	Oct 2003	A1
20030190072	Adkins et al.	Oct 2003	A1
20030198377	Ng	Oct 2003	A1
20030211405	Venkataraman	Nov 2003	A1
20030231179	Suzuki	Dec 2003	A1
20040003409	Berstis	Jan 2004	A1
20040008271	Hagimori et al.	Jan 2004	A1
20040012689	Tinnerino et al.	Jan 2004	A1
20040027358	Nakao	Feb 2004	A1
20040047274	Amanai	Mar 2004	A1
20040050104	Ghosh et al.	Mar 2004	A1
20040056966	Schechner et al.	Mar 2004	A1
20040061787	Liu et al.	Apr 2004	A1
20040066454	Otani et al.	Apr 2004	A1
20040071367	Irani et al.	Apr 2004	A1
20040075654	Hsiao et al.	Apr 2004	A1
20040096119	Williams et al.	May 2004	A1
20040100570	Shizukuishi	May 2004	A1
20040105021	Hu	Jun 2004	A1
20040114807	Lelescu et al.	Jun 2004	A1
20040141659	Zhang	Jul 2004	A1
20040151401	Sawhney et al.	Aug 2004	A1
20040165090	Ning	Aug 2004	A1
20040169617	Yelton et al.	Sep 2004	A1
20040170340	Tipping et al.	Sep 2004	A1
20040174439	Upton	Sep 2004	A1
20040179008	Gordon et al.	Sep 2004	A1
20040179834	Szajewski et al.	Sep 2004	A1
20040196379	Chen et al.	Oct 2004	A1
20040207600	Zhang et al.	Oct 2004	A1
20040207836	Chhibber et al.	Oct 2004	A1
20040212734	Macinnis et al.	Oct 2004	A1
20040213449	Safaee-Rad et al.	Oct 2004	A1
20040218809	Blake et al.	Nov 2004	A1
20040234873	Venkataraman	Nov 2004	A1
20040239782	Equitz et al.	Dec 2004	A1
20040239885	Jaynes et al.	Dec 2004	A1
20040240052	Minefuji et al.	Dec 2004	A1
20040251509	Choi	Dec 2004	A1
20040264806	Herley	Dec 2004	A1
20050006477	Patel	Jan 2005	A1
20050007461	Chou et al.	Jan 2005	A1
20050009313	Suzuki et al.	Jan 2005	A1
20050010621	Pinto et al.	Jan 2005	A1
20050012035	Miller	Jan 2005	A1
20050036778	DeMonte	Feb 2005	A1
20050047678	Jones et al.	Mar 2005	A1
20050048690	Yamamoto	Mar 2005	A1
20050068436	Fraenkel et al.	Mar 2005	A1
20050083531	Millerd et al.	Apr 2005	A1
20050084179	Hanna et al.	Apr 2005	A1
20050128509	Tokkonen et al.	Jun 2005	A1
20050128595	Shimizu	Jun 2005	A1
20050132098	Sonoda et al.	Jun 2005	A1
20050134698	Schroeder et al.	Jun 2005	A1
20050134699	Nagashima	Jun 2005	A1
20050134712	Gruhlke et al.	Jun 2005	A1
20050147277	Higaki et al.	Jul 2005	A1
20050151759	Gonzalez-Banos et al.	Jul 2005	A1
20050168924	Wu et al.	Aug 2005	A1
20050175257	Kuroki	Aug 2005	A1
20050185711	Pfister et al.	Aug 2005	A1
20050205785	Hornback et al.	Sep 2005	A1
20050219264	Shum et al.	Oct 2005	A1
20050219363	Kohler et al.	Oct 2005	A1
20050224843	Boemler	Oct 2005	A1
20050225654	Feldman et al.	Oct 2005	A1
20050265633	Piacentino et al.	Dec 2005	A1
20050275946	Choo et al.	Dec 2005	A1
20050286612	Takanashi	Dec 2005	A1
20050286756	Hong et al.	Dec 2005	A1
20060002635	Nestares et al.	Jan 2006	A1
20060007331	Izumi et al.	Jan 2006	A1
20060013318	Webb et al.	Jan 2006	A1
20060018509	Miyoshi	Jan 2006	A1
20060023197	Joel	Feb 2006	A1
20060023314	Boettiger et al.	Feb 2006	A1
20060028476	Sobel et al.	Feb 2006	A1
20060029270	Berestov et al.	Feb 2006	A1
20060029271	Miyoshi et al.	Feb 2006	A1
20060033005	Jerdev et al.	Feb 2006	A1
20060034003	Zalevsky	Feb 2006	A1
20060034531	Poon et al.	Feb 2006	A1
20060035415	Wood	Feb 2006	A1
20060038879	Kremen	Feb 2006	A1
20060038891	Okutomi et al.	Feb 2006	A1
20060039611	Rother et al.	Feb 2006	A1
20060046204	Ono et al.	Mar 2006	A1
20060049930	Zruya et al.	Mar 2006	A1
20060050980	Kohashi et al.	Mar 2006	A1
20060054780	Garrood et al.	Mar 2006	A1
20060054782	Olsen et al.	Mar 2006	A1
20060055811	Frtiz et al.	Mar 2006	A1
20060069478	Iwama	Mar 2006	A1
20060072029	Miyatake et al.	Apr 2006	A1
20060087747	Ohzawa et al.	Apr 2006	A1
20060098888	Morishita	May 2006	A1
20060103754	Wenstrand et al.	May 2006	A1
20060119597	Oshino	Jun 2006	A1
20060125936	Gruhike et al.	Jun 2006	A1
20060138322	Costello et al.	Jun 2006	A1
20060139475	Esch et al.	Jun 2006	A1
20060152803	Provitola	Jul 2006	A1
20060157640	Perlman et al.	Jul 2006	A1
20060159369	Young	Jul 2006	A1
20060176566	Boettiger et al.	Aug 2006	A1
20060187322	Janson, Jr. et al.	Aug 2006	A1
20060187338	May et al.	Aug 2006	A1
20060197937	Bamji et al.	Sep 2006	A1
20060203100	Ajito et al.	Sep 2006	A1
20060203113	Wada et al.	Sep 2006	A1
20060210146	Gu	Sep 2006	A1
20060210186	Berkner	Sep 2006	A1
20060214085	Olsen et al.	Sep 2006	A1
20060221250	Rossbach et al.	Oct 2006	A1
20060239549	Kelly et al.	Oct 2006	A1
20060243889	Farnworth et al.	Nov 2006	A1
20060251410	Trutna	Nov 2006	A1
20060274174	Tewinkle	Dec 2006	A1
20060278948	Yamaguchi et al.	Dec 2006	A1
20060279648	Senba et al.	Dec 2006	A1
20060289772	Johnson et al.	Dec 2006	A1
20070002159	Olsen et al.	Jan 2007	A1
20070008575	Yu et al.	Jan 2007	A1
20070009150	Suwa	Jan 2007	A1
20070024614	Tam et al.	Feb 2007	A1
20070030356	Yea et al.	Feb 2007	A1
20070035707	Margulis	Feb 2007	A1
20070036427	Nakamura et al.	Feb 2007	A1
20070040828	Zalevsky et al.	Feb 2007	A1
20070040922	McKee et al.	Feb 2007	A1
20070041391	Lin et al.	Feb 2007	A1
20070052825	Cho	Mar 2007	A1
20070083114	Yang et al.	Apr 2007	A1
20070085917	Kobayashi	Apr 2007	A1
20070092245	Bazakos et al.	Apr 2007	A1
20070102622	Olsen et al.	May 2007	A1
20070116447	Ye	May 2007	A1
20070126898	Feldman et al.	Jun 2007	A1
20070127831	Venkataraman	Jun 2007	A1
20070139333	Sato et al.	Jun 2007	A1
20070140685	Wu	Jun 2007	A1
20070146503	Shiraki	Jun 2007	A1
20070146511	Kinoshita et al.	Jun 2007	A1
20070153335	Hosaka	Jul 2007	A1
20070158427	Zhu et al.	Jul 2007	A1
20070159541	Sparks et al.	Jul 2007	A1
20070160310	Tanida et al.	Jul 2007	A1
20070165931	Higaki	Jul 2007	A1
20070166447	Ur-Rehman et al.	Jul 2007	A1
20070171290	Kroger	Jul 2007	A1
20070177004	Kolehmainen et al.	Aug 2007	A1
20070182843	Shimamura et al.	Aug 2007	A1
20070201859	Sarrat	Aug 2007	A1
20070206241	Smith et al.	Sep 2007	A1
20070211164	Olsen et al.	Sep 2007	A1
20070216765	Wong et al.	Sep 2007	A1
20070225600	Weibrecht et al.	Sep 2007	A1
20070228256	Mentzer et al.	Oct 2007	A1
20070236595	Pan et al.	Oct 2007	A1
20070242141	Ciurea	Oct 2007	A1
20070247517	Zhang et al.	Oct 2007	A1
20070257184	Olsen et al.	Nov 2007	A1
20070258006	Olsen et al.	Nov 2007	A1
20070258706	Raskar et al.	Nov 2007	A1
20070263113	Baek et al.	Nov 2007	A1
20070263114	Gurevich et al.	Nov 2007	A1
20070268374	Robinson	Nov 2007	A1
20070296721	Chang et al.	Dec 2007	A1
20070296832	Ota et al.	Dec 2007	A1
20070296835	Olsen et al.	Dec 2007	A1
20070296847	Chang et al.	Dec 2007	A1
20070297696	Hamza et al.	Dec 2007	A1
20080006859	Mionetto	Jan 2008	A1
20080019611	Larkin et al.	Jan 2008	A1
20080024683	Damera-Venkata et al.	Jan 2008	A1
20080025649	Liu et al.	Jan 2008	A1
20080029714	Olsen et al.	Feb 2008	A1
20080030592	Border et al.	Feb 2008	A1
20080030597	Olsen et al.	Feb 2008	A1
20080043095	Vetro et al.	Feb 2008	A1
20080043096	Vetro et al.	Feb 2008	A1
20080054518	Ra et al.	Mar 2008	A1
20080056302	Erdal et al.	Mar 2008	A1
20080062164	Bassi et al.	Mar 2008	A1
20080079805	Takagi et al.	Apr 2008	A1
20080080028	Bakin et al.	Apr 2008	A1
20080084486	Enge et al.	Apr 2008	A1
20080088793	Sverdrup et al.	Apr 2008	A1
20080095523	Schilling-Benz et al.	Apr 2008	A1
20080099804	Venezia et al.	May 2008	A1
20080106620	Sawachi	May 2008	A1
20080112059	Choi et al.	May 2008	A1
20080112635	Kondo et al.	May 2008	A1
20080117289	Schowengerdt et al.	May 2008	A1
20080118241	TeKolste et al.	May 2008	A1
20080131019	Ng	Jun 2008	A1
20080131107	Ueno	Jun 2008	A1
20080151097	Chen et al.	Jun 2008	A1
20080152215	Horie et al.	Jun 2008	A1
20080152296	Oh et al.	Jun 2008	A1
20080156991	Hu et al.	Jul 2008	A1
20080158259	Kempf et al.	Jul 2008	A1
20080158375	Kakkori et al.	Jul 2008	A1
20080158698	Chang et al.	Jul 2008	A1
20080165257	Boettiger	Jul 2008	A1
20080174670	Olsen et al.	Jul 2008	A1
20080187305	Raskar et al.	Aug 2008	A1
20080193026	Horie et al.	Aug 2008	A1
20080208506	Kuwata	Aug 2008	A1
20080211737	Kim et al.	Sep 2008	A1
20080218610	Chapman et al.	Sep 2008	A1
20080218611	Parulski et al.	Sep 2008	A1
20080218612	Border et al.	Sep 2008	A1
20080218613	Janson et al.	Sep 2008	A1
20080219654	Border et al.	Sep 2008	A1
20080239116	Smith	Oct 2008	A1
20080240598	Hasegawa	Oct 2008	A1
20080247638	Tanida et al.	Oct 2008	A1
20080247653	Moussavi et al.	Oct 2008	A1
20080272416	Yun	Nov 2008	A1
20080273751	Yuan et al.	Nov 2008	A1
20080278591	Barna et al.	Nov 2008	A1
20080278610	Boettiger	Nov 2008	A1
20080284880	Numata	Nov 2008	A1
20080291295	Kato et al.	Nov 2008	A1
20080298674	Baker et al.	Dec 2008	A1
20080310501	Ward et al.	Dec 2008	A1
20090027543	Kanehiro	Jan 2009	A1
20090050946	Duparre et al.	Feb 2009	A1
20090052743	Techmer	Feb 2009	A1
20090060281	Tanida et al.	Mar 2009	A1
20090066693	Carson	Mar 2009	A1
20090079862	Subbotin	Mar 2009	A1
20090086074	Li et al.	Apr 2009	A1
20090091645	Trimeche et al.	Apr 2009	A1
20090091806	Inuiya	Apr 2009	A1
20090092363	Daum et al.	Apr 2009	A1
20090096050	Park	Apr 2009	A1
20090102956	Georgiev	Apr 2009	A1
20090103792	Rahn et al.	Apr 2009	A1
20090109306	Shan et al.	Apr 2009	A1
20090127430	Hirasawa et al.	May 2009	A1
20090128644	Camp, Jr. et al.	May 2009	A1
20090128833	Yahav	May 2009	A1
20090129667	Ho et al.	May 2009	A1
20090140131	Utagawa	Jun 2009	A1
20090141933	Wagg	Jun 2009	A1
20090147919	Goto et al.	Jun 2009	A1
20090152664	Klem et al.	Jun 2009	A1
20090167922	Perlman et al.	Jul 2009	A1
20090167923	Safaee-Rad et al.	Jul 2009	A1
20090167934	Gupta	Jul 2009	A1
20090175349	Ye et al.	Jul 2009	A1
20090179142	Duparre et al.	Jul 2009	A1
20090180021	Kikuchi et al.	Jul 2009	A1
20090200622	Tai et al.	Aug 2009	A1
20090201371	Matsuda et al.	Aug 2009	A1
20090207235	Francini et al.	Aug 2009	A1
20090219435	Yuan	Sep 2009	A1
20090225203	Tanida et al.	Sep 2009	A1
20090237520	Kaneko et al.	Sep 2009	A1
20090245573	Saptharishi et al.	Oct 2009	A1
20090256947	Ciurea et al.	Oct 2009	A1
20090263017	Tanbakuchi	Oct 2009	A1
20090268192	Koenck et al.	Oct 2009	A1
20090268970	Babacan et al.	Oct 2009	A1
20090268983	Stone et al.	Oct 2009	A1
20090273663	Yoshida	Nov 2009	A1
20090274387	Jin	Nov 2009	A1
20090279800	Uetani et al.	Nov 2009	A1
20090284651	Srinivasan	Nov 2009	A1
20090290811	Imai	Nov 2009	A1
20090297056	Lelescu et al.	Dec 2009	A1
20090302205	Olsen et al.	Dec 2009	A9
20090315982	Schmidt et al.	Dec 2009	A1
20090317061	Jung et al.	Dec 2009	A1
20090322876	Lee et al.	Dec 2009	A1
20090323195	Hembree et al.	Dec 2009	A1
20090323206	Oliver et al.	Dec 2009	A1
20090324118	Maslov et al.	Dec 2009	A1
20100002126	Wenstrand et al.	Jan 2010	A1
20100002313	Duparre et al.	Jan 2010	A1
20100002314	Duparre	Jan 2010	A1
20100007714	Kim et al.	Jan 2010	A1
20100013927	Nixon	Jan 2010	A1
20100044815	Chang	Feb 2010	A1
20100045809	Packard	Feb 2010	A1
20100053342	Hwang et al.	Mar 2010	A1
20100053347	Agarwala et al.	Mar 2010	A1
20100053415	Yun	Mar 2010	A1
20100053600	Tanida et al.	Mar 2010	A1
20100060746	Olsen et al.	Mar 2010	A9
20100073463	Momonoi et al.	Mar 2010	A1
20100074532	Gordon et al.	Mar 2010	A1
20100085351	Deb et al.	Apr 2010	A1
20100085425	Tan	Apr 2010	A1
20100086227	Sun et al.	Apr 2010	A1
20100091389	Henriksen et al.	Apr 2010	A1
20100097491	Farina et al.	Apr 2010	A1
20100103175	Okutomi et al.	Apr 2010	A1
20100103259	Tanida et al.	Apr 2010	A1
20100103308	Butterfield et al.	Apr 2010	A1
20100111444	Coffman	May 2010	A1
20100118127	Nam et al.	May 2010	A1
20100128145	Pitts et al.	May 2010	A1
20100129048	Pitts et al.	May 2010	A1
20100133230	Henriksen et al.	Jun 2010	A1
20100133418	Sargent et al.	Jun 2010	A1
20100141802	Knight et al.	Jun 2010	A1
20100142828	Chang et al.	Jun 2010	A1
20100142839	Lakus-Becker	Jun 2010	A1
20100157073	Kondo et al.	Jun 2010	A1
20100165152	Lim	Jul 2010	A1
20100166410	Chang	Jul 2010	A1
20100171866	Brady et al.	Jul 2010	A1
20100177411	Hegde et al.	Jul 2010	A1
20100182406	Benitez	Jul 2010	A1
20100194860	Mentz et al.	Aug 2010	A1
20100194901	van Hoorebeke et al.	Aug 2010	A1
20100195716	Klein Gunnewiek et al.	Aug 2010	A1
20100201809	Oyama et al.	Aug 2010	A1
20100201834	Maruyama et al.	Aug 2010	A1
20100202054	Niederer	Aug 2010	A1
20100202683	Robinson	Aug 2010	A1
20100208100	Olsen et al.	Aug 2010	A9
20100214423	Ogawa	Aug 2010	A1
20100220212	Perlman et al.	Sep 2010	A1
20100223237	Mishra et al.	Sep 2010	A1
20100225740	Jung et al.	Sep 2010	A1
20100231285	Boomer et al.	Sep 2010	A1
20100238327	Griffith et al.	Sep 2010	A1
20100244165	Lake et al.	Sep 2010	A1
20100245684	Xiao et al.	Sep 2010	A1
20100254627	Panahpour Tehrani et al.	Oct 2010	A1
20100259610	Petersen	Oct 2010	A1
20100265346	Iizuka	Oct 2010	A1
20100265381	Yamamoto et al.	Oct 2010	A1
20100265385	Knight et al.	Oct 2010	A1
20100277629	Tanaka	Nov 2010	A1
20100281070	Chan et al.	Nov 2010	A1
20100289941	Ito et al.	Nov 2010	A1
20100290483	Park et al.	Nov 2010	A1
20100302423	Adams, Jr. et al.	Dec 2010	A1
20100309292	Ho et al.	Dec 2010	A1
20100309368	Choi et al.	Dec 2010	A1
20100321595	Chiu	Dec 2010	A1
20100321640	Yeh et al.	Dec 2010	A1
20100328456	Alakarhu	Dec 2010	A1
20100329556	Mitarai et al.	Dec 2010	A1
20100329582	Albu et al.	Dec 2010	A1
20110001037	Tewinkle	Jan 2011	A1
20110018973	Takayama	Jan 2011	A1
20110019048	Raynor et al.	Jan 2011	A1
20110019243	Constant, Jr. et al.	Jan 2011	A1
20110031381	Tay et al.	Feb 2011	A1
20110032341	Ignatov et al.	Feb 2011	A1
20110032370	Ludwig	Feb 2011	A1
20110033129	Robinson	Feb 2011	A1
20110038536	Gong	Feb 2011	A1
20110043604	Peleg et al.	Feb 2011	A1
20110043613	Rohaly et al.	Feb 2011	A1
20110043661	Podoleanu	Feb 2011	A1
20110043665	Ogasahara	Feb 2011	A1
20110043668	McKinnon et al.	Feb 2011	A1
20110044502	Liu et al.	Feb 2011	A1
20110051255	Lee et al.	Mar 2011	A1
20110055729	Mason et al.	Mar 2011	A1
20110064327	Dagher et al.	Mar 2011	A1
20110069189	Venkataraman	Mar 2011	A1
20110080487	Venkataraman et al.	Apr 2011	A1
20110085028	Samadani et al.	Apr 2011	A1
20110090217	Mashitani et al.	Apr 2011	A1
20110108708	Olsen et al.	May 2011	A1
20110115886	Nguyen et al.	May 2011	A1
20110121421	Charbon et al.	May 2011	A1
20110122308	Duparre	May 2011	A1
20110128393	Tavi et al.	Jun 2011	A1
20110128412	Milnes et al.	Jun 2011	A1
20110129165	Lim et al.	Jun 2011	A1
20110141309	Nagashima et al.	Jun 2011	A1
20110142138	Tian et al.	Jun 2011	A1
20110149408	Hahgholt et al.	Jun 2011	A1
20110149409	Haugholt et al.	Jun 2011	A1
20110150321	Cheong et al.	Jun 2011	A1
20110153248	Gu et al.	Jun 2011	A1
20110157321	Nakajima et al.	Jun 2011	A1
20110157451	Chang	Jun 2011	A1
20110169921	Lee et al.	Jul 2011	A1
20110169994	DiFrancesco et al.	Jul 2011	A1
20110176020	Chang	Jul 2011	A1
20110181797	Galstian et al.	Jul 2011	A1
20110193944	Lian et al.	Aug 2011	A1
20110199458	Hayasaka et al.	Aug 2011	A1
20110200319	Kravitz et al.	Aug 2011	A1
20110206291	Kashani et al.	Aug 2011	A1
20110207074	Hall-Holt et al.	Aug 2011	A1
20110211068	Yokota	Sep 2011	A1
20110211077	Nayar et al.	Sep 2011	A1
20110211824	Georgiev et al.	Sep 2011	A1
20110221599	Högasten	Sep 2011	A1
20110221658	Haddick et al.	Sep 2011	A1
20110221939	Jerdev	Sep 2011	A1
20110221950	Oostra et al.	Sep 2011	A1
20110222757	Yeatman, Jr. et al.	Sep 2011	A1
20110228142	Brueckner et al.	Sep 2011	A1
20110228144	Tian et al.	Sep 2011	A1
20110234825	Liu et al.	Sep 2011	A1
20110234841	Akeley et al.	Sep 2011	A1
20110241234	Duparre	Oct 2011	A1
20110242342	Goma et al.	Oct 2011	A1
20110242355	Goma et al.	Oct 2011	A1
20110242356	Aleksic et al.	Oct 2011	A1
20110243428	Das Gupta et al.	Oct 2011	A1
20110255592	Sung et al.	Oct 2011	A1
20110255745	Hodder et al.	Oct 2011	A1
20110261993	Weiming et al.	Oct 2011	A1
20110267264	Mccarthy et al.	Nov 2011	A1
20110267348	Lin et al.	Nov 2011	A1
20110267531	Imai	Nov 2011	A1
20110273531	Ito et al.	Nov 2011	A1
20110274175	Sumitomo	Nov 2011	A1
20110274366	Tardif	Nov 2011	A1
20110279705	Kuang et al.	Nov 2011	A1
20110279721	McMahon	Nov 2011	A1
20110285701	Chen et al.	Nov 2011	A1
20110285866	Bhrugumalla et al.	Nov 2011	A1
20110285910	Bamji et al.	Nov 2011	A1
20110292216	Fergus et al.	Dec 2011	A1
20110298898	Jung et al.	Dec 2011	A1
20110298917	Yanagita	Dec 2011	A1
20110300929	Tardif et al.	Dec 2011	A1
20110310980	Mathew	Dec 2011	A1
20110316968	Taguchi et al.	Dec 2011	A1
20110317766	Lim et al.	Dec 2011	A1
20120012748	Pain et al.	Jan 2012	A1
20120013748	Stanwood et al.	Jan 2012	A1
20120014456	Martinez Bauza et al.	Jan 2012	A1
20120019530	Baker	Jan 2012	A1
20120019700	Gaber	Jan 2012	A1
20120023456	Sun et al.	Jan 2012	A1
20120026297	Sato	Feb 2012	A1
20120026342	Yu et al.	Feb 2012	A1
20120026366	Golan et al.	Feb 2012	A1
20120026451	Nystrom	Feb 2012	A1
20120026478	Chen et al.	Feb 2012	A1
20120038745	Yu et al.	Feb 2012	A1
20120039525	Tian et al.	Feb 2012	A1
20120044249	Mashitani et al.	Feb 2012	A1
20120044372	Côtéet al.	Feb 2012	A1
20120051624	Ando	Mar 2012	A1
20120056982	Katz et al.	Mar 2012	A1
20120057040	Park et al.	Mar 2012	A1
20120062697	Treado et al.	Mar 2012	A1
20120062702	Jiang et al.	Mar 2012	A1
20120062756	Tian et al.	Mar 2012	A1
20120069235	Imai	Mar 2012	A1
20120081519	Goma et al.	Apr 2012	A1
20120086803	Malzbender et al.	Apr 2012	A1
20120105590	Fukumoto et al.	May 2012	A1
20120105654	Kwatra et al.	May 2012	A1
20120105691	Waqas et al.	May 2012	A1
20120113232	Joblove	May 2012	A1
20120113318	Galstian et al.	May 2012	A1
20120113413	Miahczylowicz-Wolski et al.	May 2012	A1
20120114224	Xu et al.	May 2012	A1
20120120264	Lee et al.	May 2012	A1
20120127275	Von Zitzewitz et al.	May 2012	A1
20120147139	Li et al.	Jun 2012	A1
20120147205	Lelescu et al.	Jun 2012	A1
20120153153	Chang et al.	Jun 2012	A1
20120154551	Inoue	Jun 2012	A1
20120155830	Sasaki et al.	Jun 2012	A1
20120163672	McKinnon	Jun 2012	A1
20120163725	Fukuhara	Jun 2012	A1
20120169433	Mullins et al.	Jul 2012	A1
20120170134	Bolis et al.	Jul 2012	A1
20120176479	Mayhew et al.	Jul 2012	A1
20120176481	Lukk et al.	Jul 2012	A1
20120188235	Wu et al.	Jul 2012	A1
20120188341	Klein Gunnewiek et al.	Jul 2012	A1
20120188389	Lin et al.	Jul 2012	A1
20120188420	Black et al.	Jul 2012	A1
20120188634	Kubala et al.	Jul 2012	A1
20120198677	Duparre	Aug 2012	A1
20120200669	Lai et al.	Aug 2012	A1
20120200726	Bugnariu	Aug 2012	A1
20120200734	Tang	Aug 2012	A1
20120206582	DiCarlo et al.	Aug 2012	A1
20120219236	Ali et al.	Aug 2012	A1
20120224083	Jovanovski et al.	Sep 2012	A1
20120229602	Chen et al.	Sep 2012	A1
20120229628	Ishiyama et al.	Sep 2012	A1
20120237114	Park et al.	Sep 2012	A1
20120249550	Akeley et al.	Oct 2012	A1
20120249750	Izzat et al.	Oct 2012	A1
20120249836	Ali et al.	Oct 2012	A1
20120249853	Krolczyk et al.	Oct 2012	A1
20120250990	Bocirnea	Oct 2012	A1
20120262601	Choi et al.	Oct 2012	A1
20120262607	Shimura et al.	Oct 2012	A1
20120268574	Gidon et al.	Oct 2012	A1
20120274626	Hsieh	Nov 2012	A1
20120287291	McMahon	Nov 2012	A1
20120290257	Hodge et al.	Nov 2012	A1
20120293489	Chen et al.	Nov 2012	A1
20120293624	Chen et al.	Nov 2012	A1
20120293695	Tanaka	Nov 2012	A1
20120307093	Miyoshi	Dec 2012	A1
20120307099	Yahata	Dec 2012	A1
20120314033	Lee et al.	Dec 2012	A1
20120314937	Kim et al.	Dec 2012	A1
20120327222	Ng et al.	Dec 2012	A1
20130002828	Ding et al.	Jan 2013	A1
20130003184	Duparre	Jan 2013	A1
20130010073	Do et al.	Jan 2013	A1
20130016245	Yuba	Jan 2013	A1
20130016885	Tsujimoto	Jan 2013	A1
20130022111	Chen et al.	Jan 2013	A1
20130027580	Olsen et al.	Jan 2013	A1
20130033579	Wajs	Feb 2013	A1
20130033585	Li et al.	Feb 2013	A1
20130038696	Ding et al.	Feb 2013	A1
20130047396	Au et al.	Feb 2013	A1
20130050504	Safaee-Rad et al.	Feb 2013	A1
20130050526	Keelan	Feb 2013	A1
20130057710	McMahon	Mar 2013	A1
20130070060	Chatterjee et al.	Mar 2013	A1
20130076967	Brunner et al.	Mar 2013	A1
20130077859	Stauder et al.	Mar 2013	A1
20130077880	Venkataraman et al.	Mar 2013	A1
20130077882	Venkataraman et al.	Mar 2013	A1
20130083172	Baba	Apr 2013	A1
20130088489	Schmeitz et al.	Apr 2013	A1
20130088637	Duparre	Apr 2013	A1
20130093842	Yahata	Apr 2013	A1
20130100254	Morioka et al.	Apr 2013	A1
20130107061	Kumar et al.	May 2013	A1
20130113888	Koguchi	May 2013	A1
20130113899	Morohoshi et al.	May 2013	A1
20130113939	Strandemar	May 2013	A1
20130120536	Song et al.	May 2013	A1
20130120605	Georgiev et al.	May 2013	A1
20130121559	Hu et al.	May 2013	A1
20130127988	Wang et al.	May 2013	A1
20130128068	Georgiev et al.	May 2013	A1
20130128069	Georgiev et al.	May 2013	A1
20130128087	Georgiev et al.	May 2013	A1
20130128121	Agarwala et al.	May 2013	A1
20130135315	Bares et al.	May 2013	A1
20130135448	Nagumo et al.	May 2013	A1
20130147979	McMahon et al.	Jun 2013	A1
20130155050	Rastogi et al.	Jun 2013	A1
20130162641	Zhang et al.	Jun 2013	A1
20130169754	Aronsson et al.	Jul 2013	A1
20130176394	Tian et al.	Jul 2013	A1
20130208138	Li et al.	Aug 2013	A1
20130215108	McMahon et al.	Aug 2013	A1
20130215231	Hiramoto et al.	Aug 2013	A1
20130222556	Shimada	Aug 2013	A1
20130222656	Kaneko	Aug 2013	A1
20130223759	Nishiyama	Aug 2013	A1
20130229540	Farina et al.	Sep 2013	A1
20130230237	Schlosser et al.	Sep 2013	A1
20130250123	Zhang et al.	Sep 2013	A1
20130250150	Malone et al.	Sep 2013	A1
20130258067	Zhang et al.	Oct 2013	A1
20130259317	Gaddy	Oct 2013	A1
20130265459	Duparre et al.	Oct 2013	A1
20130274596	Azizian et al.	Oct 2013	A1
20130274923	By	Oct 2013	A1
20130286236	Mankowski	Oct 2013	A1
20130293760	Nisenzon et al.	Nov 2013	A1
20130308197	Duparre	Nov 2013	A1
20130321581	El-Ghoroury et al.	Dec 2013	A1
20130321589	Kirk et al.	Dec 2013	A1
20130335598	Gustavsson et al.	Dec 2013	A1
20130342641	Morioka et al.	Dec 2013	A1
20140002674	Duparre et al.	Jan 2014	A1
20140002675	Duparre et al.	Jan 2014	A1
20140009586	McNamer et al.	Jan 2014	A1
20140013273	Ng	Jan 2014	A1
20140037137	Broaddus et al.	Feb 2014	A1
20140037140	Benhimane et al.	Feb 2014	A1
20140043507	Wang et al.	Feb 2014	A1
20140059462	Wernersson	Feb 2014	A1
20140076336	Clayton et al.	Mar 2014	A1
20140078333	Miao	Mar 2014	A1
20140079336	Venkataraman et al.	Mar 2014	A1
20140081454	Nuyujukian et al.	Mar 2014	A1
20140085502	Lin et al.	Mar 2014	A1
20140092281	Nisenzon et al.	Apr 2014	A1
20140098266	Nayar et al.	Apr 2014	A1
20140098267	Tian et al.	Apr 2014	A1
20140104490	Hsieh et al.	Apr 2014	A1
20140118493	Sali et al.	May 2014	A1
20140118584	Lee et al.	May 2014	A1
20140125771	Grossmann et al.	May 2014	A1
20140132810	McMahon	May 2014	A1
20140139642	Ni et al.	May 2014	A1
20140139643	Hogasten et al.	May 2014	A1
20140140626	Cho et al.	May 2014	A1
20140146132	Bagnato et al.	May 2014	A1
20140146201	Knight et al.	May 2014	A1
20140168424	Attar	Jun 2014	A1
20140176592	Wilburn et al.	Jun 2014	A1
20140183334	Wang et al.	Jul 2014	A1
20140186045	Poddar et al.	Jul 2014	A1
20140192154	Jeong et al.	Jul 2014	A1
20140192253	Laroia	Jul 2014	A1
20140198188	Izawa	Jul 2014	A1
20140204183	Lee et al.	Jul 2014	A1
20140218546	McMahon	Aug 2014	A1
20140232822	Venkataraman et al.	Aug 2014	A1
20140240528	Venkataraman et al.	Aug 2014	A1
20140240529	Venkataraman et al.	Aug 2014	A1
20140253738	Mullis	Sep 2014	A1
20140267243	Venkataraman et al.	Sep 2014	A1
20140267286	Duparre	Sep 2014	A1
20140267633	Venkataraman et al.	Sep 2014	A1
20140267762	Mullis et al.	Sep 2014	A1
20140267829	McMahon et al.	Sep 2014	A1
20140267890	Lelescu et al.	Sep 2014	A1
20140285675	Mullis	Sep 2014	A1
20140300706	Song	Oct 2014	A1
20140313315	Shoham et al.	Oct 2014	A1
20140321712	Ciurea et al.	Oct 2014	A1
20140333731	Venkataraman et al.	Nov 2014	A1
20140333764	Venkataraman et al.	Nov 2014	A1
20140333787	Venkataraman et al.	Nov 2014	A1
20140340539	Venkataraman et al.	Nov 2014	A1
20140347509	Venkataraman et al.	Nov 2014	A1
20140347748	Duparre	Nov 2014	A1
20140354773	Venkataraman et al.	Dec 2014	A1
20140354843	Venkataraman et al.	Dec 2014	A1
20140354844	Venkataraman et al.	Dec 2014	A1
20140354853	Venkataraman et al.	Dec 2014	A1
20140354854	Venkataraman et al.	Dec 2014	A1
20140354855	Venkataraman et al.	Dec 2014	A1
20140355870	Venkataraman et al.	Dec 2014	A1
20140368662	Venkataraman et al.	Dec 2014	A1
20140368683	Venkataraman et al.	Dec 2014	A1
20140368684	Venkataraman et al.	Dec 2014	A1
20140368685	Venkataraman et al.	Dec 2014	A1
20140368686	Duparre	Dec 2014	A1
20140369612	Venkataraman et al.	Dec 2014	A1
20140369615	Venkataraman et al.	Dec 2014	A1
20140376825	Venkataraman et al.	Dec 2014	A1
20140376826	Venkataraman et al.	Dec 2014	A1
20150002734	Lee	Jan 2015	A1
20150003752	Venkataraman et al.	Jan 2015	A1
20150003753	Venkataraman et al.	Jan 2015	A1
20150009353	Venkataraman et al.	Jan 2015	A1
20150009354	Venkataraman et al.	Jan 2015	A1
20150009362	Venkataraman et al.	Jan 2015	A1
20150015669	Venkataraman et al.	Jan 2015	A1
20150035992	Mullis	Feb 2015	A1
20150036014	Lelescu et al.	Feb 2015	A1
20150036015	Lelescu et al.	Feb 2015	A1
20150042766	Ciurea et al.	Feb 2015	A1
20150042767	Ciurea et al.	Feb 2015	A1
20150042833	Lelescu et al.	Feb 2015	A1
20150049915	Ciurea et al.	Feb 2015	A1
20150049916	Ciurea et al.	Feb 2015	A1
20150049917	Ciurea et al.	Feb 2015	A1
20150055884	Venkataraman et al.	Feb 2015	A1
20150085073	Bruls et al.	Mar 2015	A1
20150085174	Shabtay et al.	Mar 2015	A1
20150091900	Yang et al.	Apr 2015	A1
20150098079	Montgomery et al.	Apr 2015	A1
20150104076	Hayasaka	Apr 2015	A1
20150104101	Bryant et al.	Apr 2015	A1
20150122411	Rodda et al.	May 2015	A1
20150124059	Georgiev et al.	May 2015	A1
20150124113	Rodda et al.	May 2015	A1
20150124151	Rodda et al.	May 2015	A1
20150138346	Venkataraman et al.	May 2015	A1
20150146029	Venkataraman et al.	May 2015	A1
20150146030	Venkataraman et al.	May 2015	A1
20150161798	Venkataraman et al.	Jun 2015	A1
20150199793	Venkataraman et al.	Jul 2015	A1
20150199841	Venkataraman et al.	Jul 2015	A1
20150235476	McMahon et al.	Aug 2015	A1
20150237329	Venkataraman et al.	Aug 2015	A1
20150243480	Yamada	Aug 2015	A1
20150244927	Laroia et al.	Aug 2015	A1
20150245013	Venkataraman et al.	Aug 2015	A1
20150248744	Hayasaka et al.	Sep 2015	A1
20150254868	Srikanth et al.	Sep 2015	A1
20150264337	Venkataraman et al.	Sep 2015	A1
20150296137	Duparre et al.	Oct 2015	A1
20150312455	Venkataraman et al.	Oct 2015	A1
20150326852	Duparre et al.	Nov 2015	A1
20150332468	Hayasaka et al.	Nov 2015	A1
20150373261	Rodda et al.	Dec 2015	A1
20160037097	Duparre	Feb 2016	A1
20160044252	Molina	Feb 2016	A1
20160044257	Venkataraman et al.	Feb 2016	A1
20160057332	Ciurea et al.	Feb 2016	A1
20160065934	Kaza et al.	Mar 2016	A1
20160163051	Mullis	Jun 2016	A1
20160165106	Duparre	Jun 2016	A1
20160165134	Lelescu et al.	Jun 2016	A1
20160165147	Nisenzon et al.	Jun 2016	A1
20160165212	Mullis	Jun 2016	A1
20160191768	Shin et al.	Jun 2016	A1
20160195733	Lelescu et al.	Jul 2016	A1
20160198096	McMahon et al.	Jul 2016	A1
20160227195	Venkataraman et al.	Aug 2016	A1
20160249001	McMahon	Aug 2016	A1
20160255333	Nisenzon et al.	Sep 2016	A1
20160266284	Duparre et al.	Sep 2016	A1
20160267665	Venkataraman et al.	Sep 2016	A1
20160267672	Ciurea et al.	Sep 2016	A1
20160269626	McMahon	Sep 2016	A1
20160269627	McMahon	Sep 2016	A1
20160269650	Venkataraman et al.	Sep 2016	A1
20160269651	Venkataraman et al.	Sep 2016	A1
20160269664	Duparre	Sep 2016	A1
20160316140	Nayar et al.	Oct 2016	A1
20170006233	Venkataraman et al.	Jan 2017	A1
20170048468	Pain et al.	Feb 2017	A1
20170053382	Lelescu et al.	Feb 2017	A1
20170054901	Venkataraman et al.	Feb 2017	A1
20170070672	Rodda et al.	Mar 2017	A1
20170070673	Lelescu et al.	Mar 2017	A1
20170078568	Venkataraman et al.	Mar 2017	A1
20170085845	Venkataraman et al.	Mar 2017	A1
20170094243	Venkataraman et al.	Mar 2017	A1
20170099465	Mullis et al.	Apr 2017	A1
20170163862	Molina	Jun 2017	A1
20170178363	Venkataraman et al.	Jun 2017	A1
20170187933	Duparre	Jun 2017	A1
20170188011	Panescu et al.	Jun 2017	A1
20170244960	Ciurea et al.	Aug 2017	A1
20170257562	Venkataraman et al.	Sep 2017	A1
20170365104	McMahon et al.	Dec 2017	A1
20180007284	Venkataraman et al.	Jan 2018	A1
20180013945	Ciurea et al.	Jan 2018	A1
20180024330	Laroia	Jan 2018	A1
20180035057	McMahon et al.	Feb 2018	A1
20180040135	Mullis	Feb 2018	A1
20180048830	Venkataraman et al.	Feb 2018	A1
20180048879	Venkataraman et al.	Feb 2018	A1
20180081090	Duparre et al.	Mar 2018	A1
20180097993	Nayar et al.	Apr 2018	A1
20180109782	Duparre et al.	Apr 2018	A1
20180124311	Lelescu et al.	May 2018	A1
20180139382	Venkataraman et al.	May 2018	A1
20180197035	Venkataraman et al.	Jul 2018	A1
20180211402	Ciurea et al.	Jul 2018	A1
20180227511	McMahon	Aug 2018	A1
20180240265	Yang et al.	Aug 2018	A1
20180270473	Mullis	Sep 2018	A1
20180302554	Lelescu et al.	Oct 2018	A1
20180330182	Venkataraman et al.	Nov 2018	A1
20190037116	Molina	Jan 2019	A1
20190037150	Srikanth et al.	Jan 2019	A1
20190043253	Lucas et al.	Feb 2019	A1
20190063905	Venkataraman et al.	Feb 2019	A1
20190089947	Venkataraman et al.	Mar 2019	A1
20190098209	Venkataraman et al.	Mar 2019	A1
20190109998	Venkataraman et al.	Apr 2019	A1
20190215496	Mullis et al.	Jul 2019	A1
20190230348	Ciurea et al.	Jul 2019	A1
20190235138	Duparre et al.	Aug 2019	A1
20190268586	Mullis	Aug 2019	A1
20190289176	Duparre	Sep 2019	A1
20190347768	Lelescu et al.	Nov 2019	A1
20190356863	Venkataraman et al.	Nov 2019	A1
20190362515	Ciurea et al.	Nov 2019	A1
20190364263	Jannard et al.	Nov 2019	A1
20200026948	Venkataraman et al.	Jan 2020	A1
20200252597	Mullis	Aug 2020	A1
20200389604	Venkataraman et al.	Dec 2020	A1
20210063141	Venkataraman et al.	Mar 2021	A1
20210133927	Lelescu et al.	May 2021	A1
20210281816	Mullis	Sep 2021	A1
20210312207	Venkataraman et al.	Oct 2021	A1
20220239890	Mullis	Jul 2022	A1
20220385848	Venkataraman et al.	Dec 2022	A1
20220414829	Lelescu et al.	Dec 2022	A1
20230152087	Venkataraman et al.	May 2023	A1

Foreign Referenced Citations (239)

Number	Date	Country
1619358	May 2005	CN
1669332	Sep 2005	CN
1839394	Sep 2006	CN
1985524	Jun 2007	CN
101010619	Aug 2007	CN
101046882	Oct 2007	CN
101064780	Oct 2007	CN
101102388	Jan 2008	CN
101147392	Mar 2008	CN
201043890	Apr 2008	CN
101212566	Jul 2008	CN
101312540	Nov 2008	CN
101427372	May 2009	CN
101593350	Dec 2009	CN
101606086	Dec 2009	CN
101883291	Nov 2010	CN
102037717	Apr 2011	CN
102184720	Sep 2011	CN
102375199	Mar 2012	CN
103004180	Mar 2013	CN
104081414	Oct 2014	CN
104508681	Apr 2015	CN
104662589	May 2015	CN
104685513	Jun 2015	CN
104685860	Jun 2015	CN
104081414	Aug 2017	CN
104662589	Aug 2017	CN
107077743	Aug 2017	CN
107230236	Oct 2017	CN
107346061	Nov 2017	CN
104685513	Apr 2018	CN
104335246	Sep 2018	CN
107346061	Apr 2020	CN
107230236	Dec 2020	CN
602011041799.1	Sep 2017	DE
0677821	Oct 1995	EP
0840502	May 1998	EP
1201407	May 2002	EP
1355274	Oct 2003	EP
1418766	May 2004	EP
1734766	Dec 2006	EP
1243945	Jan 2009	EP
2026563	Feb 2009	EP
2104334	Sep 2009	EP
2244484	Oct 2010	EP
0957642	Apr 2011	EP
2336816	Jun 2011	EP
2339532	Jun 2011	EP
2381418	Oct 2011	EP
2502115	Sep 2012	EP
2569935	Mar 2013	EP
2652678	Oct 2013	EP
2761534	Aug 2014	EP
2777245	Sep 2014	EP
2867718	May 2015	EP
2873028	May 2015	EP
2888698	Jul 2015	EP
2888720	Jul 2015	EP
2901671	Aug 2015	EP
2973476	Jan 2016	EP
3066690	Sep 2016	EP
2569935	Dec 2016	EP
3201877	Aug 2017	EP
2652678	Sep 2017	EP
3201877	Mar 2018	EP
2817955	Apr 2018	EP
3328048	May 2018	EP
3075140	Jun 2018	EP
3201877	Dec 2018	EP
3467776	Apr 2019	EP
2761534	Nov 2020	EP
2888720	Mar 2021	EP
3328048	Apr 2021	EP
3869797	Aug 2021	EP
3876510	Sep 2021	EP
3869797	Jul 2023	EP
2482022	Jan 2012	GB
2708CHENP2014	Aug 2015	IN
361194	Mar 2021	IN
59-025483	Feb 1984	JP
64-037177	Feb 1989	JP
02-285772	Nov 1990	JP
06129851	May 1994	JP
07-015457	Jan 1995	JP
H0756112	Mar 1995	JP
09171075	Jun 1997	JP
09181913	Jul 1997	JP
10253351	Sep 1998	JP
11142609	May 1999	JP
11223708	Aug 1999	JP
11325889	Nov 1999	JP
2000209503	Jul 2000	JP
2001008235	Jan 2001	JP
2001194114	Jul 2001	JP
2001264033	Sep 2001	JP
2001277260	Oct 2001	JP
2001337263	Dec 2001	JP
2002195910	Jul 2002	JP
2002205310	Jul 2002	JP
2002209226	Jul 2002	JP
2002250607	Sep 2002	JP
2002252338	Sep 2002	JP
2003094445	Apr 2003	JP
2003139910	May 2003	JP
2003163938	Jun 2003	JP
2003298920	Oct 2003	JP
2004221585	Aug 2004	JP
2005116022	Apr 2005	JP
2005181460	Jul 2005	JP
2005295381	Oct 2005	JP
2005303694	Oct 2005	JP
2005341569	Dec 2005	JP
2005354124	Dec 2005	JP
2006033228	Feb 2006	JP
2006033493	Feb 2006	JP
2006047944	Feb 2006	JP
2006258930	Sep 2006	JP
2007520107	Jul 2007	JP
2007259136	Oct 2007	JP
2008039852	Feb 2008	JP
2008055908	Mar 2008	JP
2008507874	Mar 2008	JP
2008172735	Jul 2008	JP
2008258885	Oct 2008	JP
2009064421	Mar 2009	JP
2009132010	Jun 2009	JP
2009300268	Dec 2009	JP
2010139288	Jun 2010	JP
2011017764	Jan 2011	JP
2011030184	Feb 2011	JP
2011109484	Jun 2011	JP
2011523538	Aug 2011	JP
2011203238	Oct 2011	JP
2012504805	Feb 2012	JP
2013509022	Mar 2013	JP
WO2011052064	Mar 2013	JP
2013526801	Jun 2013	JP
2014521117	Aug 2014	JP
2014535191	Dec 2014	JP
2015522178	Aug 2015	JP
2015534734	Dec 2015	JP
5848754	Jan 2016	JP
2016524125	Aug 2016	JP
6140709	May 2017	JP
2017163550	Sep 2017	JP
2017163587	Sep 2017	JP
2017531976	Oct 2017	JP
6546613	Jul 2019	JP
2019-220957	Dec 2019	JP
6630891	Dec 2019	JP
2020017999	Jan 2020	JP
6767543	Sep 2020	JP
6767558	Sep 2020	JP
1020110097647	Aug 2011	KR
20170063827	Jun 2017	KR
101824672	Feb 2018	KR
101843994	Mar 2018	KR
10-2002165	Jul 2019	KR
10-2111181	May 2020	KR
191151	Jul 2013	SG
11201500910	Oct 2015	SG
200828994	Jul 2008	TW
200939739	Sep 2009	TW
201228382	Jul 2012	TW
I535292	May 2016	TW
1994020875	Sep 1994	WO
2005057922	Jun 2005	WO
2006039906	Apr 2006	WO
2006039906	Apr 2006	WO
2007013250	Feb 2007	WO
2007052191	May 2007	WO
2007083579	Jul 2007	WO
2007134137	Nov 2007	WO
2008045198	Apr 2008	WO
2008050904	May 2008	WO
2008108271	Sep 2008	WO
2008108926	Sep 2008	WO
2008150817	Dec 2008	WO
2009073950	Jun 2009	WO
2009151903	Dec 2009	WO
2009157273	Dec 2009	WO
2010037512	Apr 2010	WO
2011008443	Jan 2011	WO
2011026527	Mar 2011	WO
2011046607	Apr 2011	WO
2011055655	May 2011	WO
2011063347	May 2011	WO
2011105814	Sep 2011	WO
2011116203	Sep 2011	WO
2011063347	Oct 2011	WO
2011143501	Nov 2011	WO
2012057619	May 2012	WO
2012057620	May 2012	WO
2012057621	May 2012	WO
2012057622	May 2012	WO
2012057623	May 2012	WO
2012057620	Jun 2012	WO
2012074361	Jun 2012	WO
2012078126	Jun 2012	WO
2012082904	Jun 2012	WO
2012155119	Nov 2012	WO
2013003276	Jan 2013	WO
2013043751	Mar 2013	WO
2013043761	Mar 2013	WO
2013049699	Apr 2013	WO
2013055960	Apr 2013	WO
2013119706	Aug 2013	WO
2013126578	Aug 2013	WO
2013166215	Nov 2013	WO
2014004134	Jan 2014	WO
2014005123	Jan 2014	WO
2014031795	Feb 2014	WO
2014052974	Apr 2014	WO
2014032020	May 2014	WO
2014078443	May 2014	WO
2014130849	Aug 2014	WO
2014133974	Sep 2014	WO
2014138695	Sep 2014	WO
2014138697	Sep 2014	WO
2014144157	Sep 2014	WO
2014145856	Sep 2014	WO
2014149403	Sep 2014	WO
2014149902	Sep 2014	WO
2014150856	Sep 2014	WO
2014153098	Sep 2014	WO
2014159721	Oct 2014	WO
2014159779	Oct 2014	WO
2014160142	Oct 2014	WO
2014164550	Oct 2014	WO
2014164909	Oct 2014	WO
2014165244	Oct 2014	WO
2014133974	Apr 2015	WO
2015048694	Apr 2015	WO
2015070105	May 2015	WO
2015074078	May 2015	WO
2015081279	Jun 2015	WO
2015134996	Sep 2015	WO
2015183824	Dec 2015	WO
2016054089	Apr 2016	WO

Non-Patent Literature Citations (312)

Entry
US 8,957,977 B2, 02/2015, Venkataraman et al. (withdrawn)
Extended European Search Report for EP Application No. 11781313.9, Completed Oct. 1, 2013, dated Oct. 8, 2013, 6 pages.
Extended European Search Report for EP Application No. 13810429.4, Completed Jan. 7, 2016, dated Jan. 15, 2016, 6 Pgs.
Extended European Search Report for European Application EP12782935.6, completed Aug. 28, 2014, dated Sep. 4, 2014, 7 Pgs.
Extended European Search Report for European Application EP12804266.0, Report Completed Jan. 27, 2015, dated Feb. 3, 2015, 7 Pgs.
Extended European Search Report for European Application EP12835041.0, Report Completed Jan. 28, 2015, dated Feb. 4, 2015, 6 Pgs.
Extended European Search Report for European Application EP13751714.0, completed Aug. 5, 2015, dated Aug. 18, 2015, 8 Pgs.
Extended European Search Report for European Application EP13810229.8, Report Completed Apr. 14, 2016, dated Apr. 21, 2016, 7 pgs.
Extended European Search Report for European Application No. 10832330.4, completed Sep. 26, 2013, dated Oct. 4, 2013, 7 pgs.
Extended European Search Report for European Application No. 11848308.0, Search completed Jan. 13, 2016, dated Jan. 22, 2016, 10 Pgs.
Extended European Search Report for European Application No. 13830945.5, Search completed Jun. 28, 2016, dated Jul. 7, 2016, 14 Pgs.
Extended European Search Report for European Application No. 13841613.6, Search completed Jul. 18, 2016, dated Jul. 26, 2016, 8 Pgs.
Extended European Search Report for European Application No. 14763087.5, Search completed Dec. 7, 2016, dated Dec. 19, 2016, 9 pgs.
Extended European Search Report for European Application No. 14860103.2, Search completed Feb. 23, 2017, dated Mar. 3, 2017, 7 Pgs.
Extended European Search Report for European Application No. 14865463.5, Search completed May 30, 2017, dated Jun. 8, 2017, 6 Pgs.
Extended European Search Report for European Application No. 15847754.7, Search completed Jan. 25, 2018, dated Feb. 9, 2018, 8 Pgs.
Extended European Search Report for European Application No. 18151530.5, Completed Mar. 28, 2018, dated Apr. 20, 2018, 11 pages.
Extended European Search Report for European Application No. 18205326.4, Search completed Jan. 8, 2019 , dated Jan. 18, 2019, 9 Pgs.
International Preliminary Report on Patentability for International Application No. PCT/US2009/044687, Completed Jul. 30, 2010, 9 pgs.
International Preliminary Report on Patentability for International Application No. PCT/US2012/056151, Report dated Mar. 25, 2014, 9 pgs.
International Preliminary Report on Patentability for International Application No. PCT/US2012/056166, Report dated Mar. 25, 2014, Report dated Apr. 3, 2014 8 pgs.
International Preliminary Report on Patentability for International Application No. PCT/US2012/058093, Report dated Sep. 18, 2013, dated Oct. 22, 2013, 40 pgs.
International Preliminary Report on Patentability for International Application No. PCT/US2012/059813, Search Completed Apr. 15, 2014, 7 pgs.
International Preliminary Report on Patentability for International Application No. PCT/US2013/059991, dated Mar. 17, 2015, dated Mar. 26, 2015, 8 pgs.
International Preliminary Report on Patentability for International Application PCT/US2015/032467, Report dated Nov. 29, 2016, dated Dec. 8, 2016, 9 Pgs.
International Preliminary Report on Patentability for International Application PCT/US10/057661, dated May 22, 2012, dated May 31, 2012, 10 pages.
International Preliminary Report on Patentability for International Application PCT/US11/036349, Report dated Nov. 13, 2012, dated Nov. 22, 2012, 9 pages.
International Preliminary Report on Patentability for International Application PCT/US13/56065, dated Feb. 24, 2015, dated Mar. 5, 2015, 4 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2011/064921, dated Jun. 18, 2013, dated Jun. 27, 2013, 14 pgs.
International Preliminary Report on Patentability for International Application PCT/US2013/024987, dated Aug. 12, 2014, 13 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2013/027146, completed Aug. 26, 2014, dated Sep. 4, 2014, 10 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2013/039155, completed Nov. 4, 2014, dated Nov. 13, 2014, 10 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2013/046002, dated Dec. 31, 2014, dated Jan. 8, 2015, 6 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2013/048772, dated Dec. 31, 2014, dated Jan. 8, 2015, 8 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2013/056502, dated Feb. 24, 2015, dated Mar. 5, 2015, 7 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2013/069932, dated May 19, 2015, dated May 28, 2015, 12 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/017766, dated Aug. 25, 2015, dated Sep. 3, 2015, 8 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/018084, dated Aug. 25, 2015, dated Sep. 3, 2015, 11 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/018116, dated Sep. 15, 2015, dated Sep. 24, 2015, 12 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/021439, dated Sep. 15, 2015, dated Sep. 24, 2015, 9 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/022118, dated Sep. 8, 2015, dated Sep. 17, 2015, 4 pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/022123, dated Sep. 8, 2015, dated Sep. 17, 2015, 4 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/022774, dated Sep. 22, 2015, dated Oct. 1, 2015, 5 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/023762, dated Mar. 2, 2015, dated Mar. 9, 2015, 10 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/024407, dated Sep. 15, 2015, dated Sep. 24, 2015, 8 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/024903, dated Sep. 15, 2015, dated Sep. 24, 2015, 12 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/024947, dated Sep. 15, 2015, dated Sep. 24, 2015, 7 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/025100, dated Sep. 15, 2015, dated Sep. 24, 2015, 4 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/025904, dated Sep. 15, 2015, dated Sep. 24, 2015, 5 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/028447, dated Sep. 15, 2015, dated Sep. 24, 2015, 7 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/029052, dated Sep. 15, 2015, dated Sep. 24, 2015, 9 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/030692, dated Sep. 15, 2015, dated Sep. 24, 2015, 6 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/064693, dated May 10, 2016, dated May 19, 2016, 14 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/066229, dated May 24, 2016, dated Jun. 2, 2016, 9 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2014/067740, dated May 31, 2016, dated Jun. 9, 2016, 9 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2015/019529, dated Sep. 13, 2016, dated Sep. 22, 2016, 9 Pgs.
International Preliminary Report on Patentability for International Application PCT/US2015/053013, dated Apr. 4, 2017, dated Apr. 13, 2017, 8 Pgs.
International Preliminary Report on Patentability for International Application PCT/US13/62720, dated Mar. 31, 2015, dated Apr. 9, 2015, 8 Pgs.
International Search Report and Written Opinion for International Application No. PCT/US13/46002, completed Nov. 13, 2013, dated Nov. 29, 2013, 7 pgs.
International Search Report and Written Opinion for International Application No. PCT/US13/56065, Completed Nov. 25, 2013, dated Nov. 26, 2013, 8 pgs.
International Search Report and Written Opinion for International Application No. PCT/US13/59991, Completed Feb. 6, 2014, dated Feb. 26, 2014, 8 pgs.
International Search Report and Written Opinion for International Application No. PCT/US2012/056166, Report Completed Nov. 10, 2012, dated Nov. 20, 2012, 9 pgs.
International Search Report and Written Opinion for International Application No. PCT/US2013/024987, Completed Mar. 27, 2013, dated Apr. 15, 2013, 14 pgs.
International Search Report and Written Opinion for International Application No. PCT/US2013/027146, completed Apr. 2, 2013, 11 pgs.
International Search Report and Written Opinion for International Application No. PCT/US2013/039155, completed Jul. 1, 2013, dated Jul. 11, 2013, 11 Pgs.
International Search Report and Written Opinion for International Application No. PCT/US2013/048772, Completed Oct. 21, 2013, dated Nov. 8, 2013, 6 pgs.
International Search Report and Written Opinion for International Application No. PCT/US2013/056502, Completed Feb. 18, 2014, dated Mar. 19, 2014, 7 pgs.
International Search Report and Written Opinion for International Application No. PCT/US2013/069932, Completed Mar. 14, 2014, dated Apr. 14, 2014, 12 pgs.
International Search Report and Written Opinion for International Application No. PCT/US2015/019529, completed May 5, 2015, dated Jun. 8, 2015, 11 Pgs.
International Search Report and Written Opinion for International Application No. PCT/US2015/032467, Search completed Jul. 27, 2015, dated Aug. 19, 2015, 10 Pgs.
International Search Report and Written Opinion for International Application No. PCT/US2015/053013, completed Dec. 1, 2015, dated Dec. 30, 2015, 9 Pgs.
International Search Report and Written Opinion for International Application PCT/US11/36349, dated Aug. 22, 2011, 11 pgs.
International Search Report and Written Opinion for International Application PCT/US13/62720, completed Mar. 25, 2014, dated Apr. 21, 2014, 9 Pgs.
International Search Report and Written Opinion for International Application PCT/US14/17766, completed May 28, 2014, dated Jun. 18, 2014, 9 Pgs.
International Search Report and Written Opinion for International Application PCT/US14/18084, completed May 23, 2014, dated Jun. 10, 2014, 12 pgs.
International Search Report and Written Opinion for International Application PCT/US14/18116, Report completed May 13, 2014, 12 pgs.
International Search Report and Written Opinion for International Application PCT/US14/21439, completed Jun. 5, 2014, dated Jun. 20, 2014, 10 Pgs.
International Search Report and Written Opinion for International Application PCT/US14/22118, completed Jun. 9, 2014, dated Jun. 25, 2014, 5 pgs.
International Search Report and Written Opinion for International Application PCT/US14/22774 report completed Jun. 9, 2014, dated Jul. 14, 2014, 6 Pgs.
International Search Report and Written Opinion for International Application PCT/US14/24407, report completed Jun. 11, 2014, dated Jul. 8, 2014, 9 Pgs.
International Search Report and Written Opinion for International Application PCT/US14/25100, report completed Jul. 7, 2014, dated Aug. 7, 2014, 5 Pgs.
International Search Report and Written Opinion for International Application PCT/US14/25904 report completed Jun. 10, 2014, dated Jul. 10, 2014, 6 Pgs.
International Search Report and Written Opinion for International Application PCT/US2009/044687, completed Jan. 5, 2010, dated Jan. 13, 2010, 9 pgs.
International Search Report and Written Opinion for International Application PCT/US2010/057661, completed Mar. 9, 2011, 14 pgs.
International Search Report and Written Opinion for International Application PCT/US2011/064921, completed Feb. 25, 2011, dated Mar. 6, 2012, 17 pgs.
International Search Report and Written Opinion for International Application PCT/US2012/037670, dated Jul. 18, 2012, Completed Jul. 5, 2012, 9 pgs.
International Search Report and Written Opinion for International Application PCT/US2012/044014, completed Oct. 12, 2012, 15 pgs.
International Search Report and Written Opinion for International Application PCT/US2012/056151, completed Nov. 14, 2012, 10 pgs.
International Search Report and Written Opinion for International Application PCT/US2012/058093, Report completed Nov. 15, 2012, 12 pgs.
International Search Report and Written Opinion for International Application PCT/US2012/059813, completed Dec. 17, 2012, 8 pgs.
International Search Report and Written Opinion for International Application PCT/US2014/022123, completed Jun. 9, 2014, dated Jun. 25, 2014, 5 pgs.
International Search Report and Written Opinion for International Application PCT/US2014/023762, Completed May 30, 2014, dated Jul. 3, 2014, 6 Pgs.
International Search Report and Written Opinion for International Application PCT/US2014/024903, completed Jun. 12, 2014, dated Jun. 27, 2014, 13 pgs.
International Search Report and Written Opinion for International Application PCT/US2014/024947, Completed Jul. 8, 2014, dated Aug. 5, 2014, 8 Pgs.
International Search Report and Written Opinion for International Application PCT/US2014/028447, completed Jun. 30, 2014, dated Jul. 21, 2014, 8 Pgs.
International Search Report and Written Opinion for International Application PCT/US2014/029052, completed Jun. 30, 2014, dated Jul. 24, 2014, 10 Pgs.
International Search Report and Written Opinion for International Application PCT/US2014/030692, completed Jul. 28, 2014, dated Aug. 27, 2014, 7 Pgs.
International Search Report and Written Opinion for International Application PCT/US2014/064693, Completed Mar. 7, 2015, dated Apr. 2, 2015, 15 pgs.
International Search Report and Written Opinion for International Application PCT/US2014/066229, Completed Mar. 6, 2015, dated Mar. 19, 2015, 9 Pgs.
International Search Report and Written Opinion for International Application PCT/US2014/067740, Completed Jan. 29, 2015, dated Mar. 3, 2015, 10 pgs.
Office Action for U.S. Appl. No. 12/952,106, dated Aug. 16, 2012, 12 pgs.
Supplementary European Search Report for EP Application No. 13831768.0, Search completed May 18, 2016, dated May 30, 2016, 13 Pgs.
Supplementary European Search Report for European Application 09763194.9, completed Nov. 7, 2011, dated Nov. 29, 2011, 9 pgs.
“Exchangeable image file format for digital still cameras: Exif Version 2.2”, Japan Electronics and Information Technology Industries Association, Prepared by Technical Standardization Committee on AV & IT Storage Systems and Equipment, JEITA CP-3451, Apr. 2002, Retrieved from: http://www.exif.org/Exif2-2.PDF, 154 pgs.
“File Formats Version 6”, Alias Systems, 2004, 40 pgs.
“Light fields and computational photography”, Stanford Computer Graphics Laboratory, Retrieved from: http://graphics.stanford.edu/projects/lightfield/, Earliest publication online: Feb. 10, 1997, 3 pgs.
Aufderheide et al., “A MEMS-based Smart Sensor System for Estimation of Camera Pose for Computer Vision Applications”, Research and Innovation Conference 2011, Jul. 29, 2011, pp. 1-10.
Baker et al., “Limits on Super-Resolution and How to Break Them”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Sep. 2002, vol. 24, No. 9, pp. 1167-1183.
Barron et al., “Intrinsic Scene Properties from a Single RGB-D Image”, 2013 IEEE Conference on Computer Vision and Pattern Recognition, Jun. 23-28, 2013, Portland, OR, USA, pp. 17-24.
Bennett et al., “Multispectral Bilateral Video Fusion”, 2007 IEEE Transactions on Image Processing, May 2007, published Apr. 16, 2007, vol. 16, No. 5, pp. 1185-1194.
Bennett et al., “Multispectral Video Fusion”, Computer Graphics (ACM SIGGRAPH Proceedings), Jul. 25, 2006, published Jul. 30, 2006, 1 pg.
Bertalmio et al., “Image Inpainting”, Proceedings of the 27th Annual Conference on Computer Graphics and Interactive Techniques, 2000, ACM Pres/Addison-Wesley Publishing Co., pp. 417-424.
Bertero et al., “Super-resolution in computational imaging”, Micron, Jan. 1, 2003, vol. 34, Issues 6-7, 17 pgs.
Bishop et al., “Full-Resolution Depth Map Estimation from an Aliased Plenoptic Light Field”, ACCV Nov. 8, 2010, Part II, LNCS 6493, pp. 186-200.
Bishop et al., “Light Field Superresolution”, Computational Photography (ICCP), 2009 IEEE International Conference, Conference Date April 16-17, published Jan. 26, 2009, 9 pgs.
Bishop et al., “The Light Field Camera: Extended Depth of Field, Aliasing, and Superresolution”, IEEE Transactions on Pattern Analysis and Machine Intelligence, May 2012, vol. 34, No. 5, published Aug. 18, 2011, pp. 972-986.
Borman, “Topics in Multiframe Superresolution Restoration”, Thesis of Sean Borman, Apr. 2004, 282 pgs.
Borman et al., “Image Sequence Processing”, Dekker Encyclopedia of Optical Engineering, Oct. 14, 2002, 81 pgs.
Borman et al., “Block-Matching Sub-Pixel Motion Estimation from Noisy, Under-Sampled Frames—An Empirical Performance Evaluation”, Proc SPIE, Dec. 28, 1998, vol. 3653, 10 pgs.
Borman et al., “Image Resampling and Constraint Formulation for Multi- Frame Super-Resolution Restoration”, Proc. SPIE, published Jul. 1, 2003, vol. 5016, 12 pgs.
Borman et al., “Linear models for multi-frame super-resolution restoration under non-affine registration and spatially varying PSF”, Proc. SPIE, May 21, 2004, vol. 5299, 12 pgs.
Borman et al., “Nonlinear Prediction Methods for Estimation of Clique Weighting Parameters in NonGaussian Image Models”, Proc. SPIE, Sep. 22, 1998, vol. 3459, 9 pgs.
Borman et al., “Simultaneous Multi-Frame MAP Super-Resolution Video Enhancement Using Spatio-Temporal Priors”, Image Processing, 1999, ICIP 99 Proceedings, vol. 3, pp. 469-473.
Borman et al., “Super-Resolution from Image Sequences—A Review”, Circuits & Systems, 1998, pp. 374-378.
Bose et al., “Superresolution and Noise Filtering Using Moving Least Squares”, IEEE Transactions on Image Processing, Aug. 2006, vol. 15, Issue 8, published Jul. 17, 2006, pp. 2239-2248.
Boye et al., “Comparison of Subpixel Image Registration Algorithms”, Proc. of SPIE—IS&T Electronic Imaging, Feb. 3, 2009, vol. 7246, pp. 72460X-1-72460X-9; doi: 10.1117/12.810369.
Bruckner et al., “Artificial compound eye applying hyperacuity”, Optics Express, Dec. 11, 2006, vol. 14, No. 25, pp. 12076-12084.
Bruckner et al., “Driving microoptical imaging systems towards miniature camera applications”, Proc. SPIE, Micro-Optics, May 13, 2010, 11 pgs.
Bruckner et al., “Thin wafer-level camera lenses inspired by insect compound eyes”, Optics Express, Nov. 22, 2010, vol. 18, No. 24, pp. 24379-24394.
Bryan et al., “Perspective Distortion from Interpersonal Distance Is an Implicit Visual Cue for Social Judgments of Faces”, PLOS One, vol. 7, Issue 9, Sep. 26, 2012, e45301, doi:10.1371/journal.pone.0045301, 9 pages.
Capel, “Image Mosaicing and Super-resolution”, Retrieved on Nov. 10, 2012, Retrieved from the Internet at URL:<http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.226.2643&rep=rep1 &type=pdf>, 2001, 269 pgs.
Carroll et al., “Image Warps for Artistic Perspective Manipulation”, ACM Transactions on Graphics (TOG), vol. 29, No. 4, Jul. 26, 2010, Article No. 127, 9 pgs.
Chan et al., “Extending the Depth of Field in a Compound-Eye Imaging System with Super-Resolution Reconstruction”, Proceedings—International Conference on Pattern Recognition, Jan. 1, 2006, vol. 3, pp. 623-626.
Chan et al., “Investigation of Computational Compound-Eye Imaging System with Super-Resolution Reconstruction”, IEEE, ISASSP, Jun. 19, 2006, pp. 1177-1180.
Chan et al., “Super-resolution reconstruction in a computational compound-eye imaging system”, Multidim Syst Sign Process, published online Feb. 23, 2007, vol. 18, pp. 83-101.
Chen et al., “Image Matting with Local and Nonlocal Smooth Priors”, CVPR '13 Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, Jun. 23, 2013, pp. 1902-1907.
Chen et al., “Interactive deformation of light fields”, Symposium on Interactive 3D Graphics, 2005, pp. 139-146.
Chen et al., “KNN matting”, 2012 IEEE Conference on Computer Vision and Pattern Recognition, Jun. 16-21, 2012, Providence, RI, USA, pp. 869-876.
Chen et al., “KNN Matting”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Sep. 2013, vol. 35, No. 9, pp. 2175-2188.
Collins et al., “An Active Camera System for Acquiring Multi-View Video”, IEEE 2002 International Conference on Image Processing, Date of Conference: Sep. 22-25, 2002, Rochester, NY, 4 pgs.
Cooper et al., “The perceptual basis of common photographic practice”, Journal of Vision, vol. 12, No. 5, Article 8, May 25, 2012, pp. 1-14.
Crabb et al., “Real-time foreground segmentation via range and color imaging”, 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, Anchorage, AK, USA, Jun. 23-28, 2008, pp. 1-5.
Debevec et al., “Recovering High Dynamic Range Radiance Maps from Photographs”, Computer Graphics (ACM SIGGRAPH Proceedings), Aug. 16, 1997, 10 pgs.
Do, Minh N., “Immersive Visual Communication with Depth”, Presented at Microsoft Research, Jun. 15, 2011, Retrieved from: http://minhdo.ece.illinois.edu/talks/ImmersiveComm.pdf, 42 pgs.
Do et al., “Immersive Visual Communication”, IEEE Signal Processing Magazine, vol. 28, Issue 1, Jan. 2011, DOI: 10.1109/MSP.2010.939075, Retrieved from: http://minhdo.ece.illinois.edu/publications/ImmerComm_SPM.pdf, pp. 58-66.
Drouin et al., “Fast Multiple-Baseline Stereo with Occlusion”, Fifth International Conference on 3-D Digital Imaging and Modeling (3DIM'05), Ottawa, Ontario, Canada, Jun. 13-16, 2005, pp. 540-547.
Drouin et al., “Geo-Consistency for Wide Multi-Camera Stereo”, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05), vol. 1, Jun. 20-25, 2005, pp. 351-358.
Drouin et al., “Improving Border Localization of Multi-Baseline Stereo Using Border-Cut”, International Journal of Computer Vision, Jul. 5, 2006, vol. 83, Issue 3, 8 pgs.
Drulea et al., “Motion Estimation Using the Correlation Transform”, IEEE Transactions on Image Processing, Aug. 2013, vol. 22, No. 8, pp. 3260-3270, first published May 14, 2013.
Duparre et al., “Artificial apposition compound eye fabricated by micro-optics technology”, Applied Optics, Aug. 1, 2004, vol. 43, No. 22, pp. 4303-4310.
Duparre et al., “Artificial compound eye zoom camera”, Bioinspiration & Biomimetics, Nov. 21, 2008, vol. 3, pp. 1-6.
Duparre et al., “Artificial compound eyes—different concepts and their application to ultra flat image acquisition sensors”, MOEMS and Miniaturized Systems IV, Proc. SPIE 5346, Jan. 24, 2004, pp. 89-100.
Duparre et al., “Chirped arrays of refractive ellipsoidal microlenses for aberration correction under oblique incidence”, Optics Express, Dec. 26, 2005, vol. 13, No. 26, pp. 10539-10551.
Duparre et al., “Micro-optical artificial compound eyes”, Bioinspiration & Biomimetics, Apr. 6, 2006, vol. 1, pp. R1-R16.
Duparre et al., “Microoptical artificial compound eyes—from design to experimental verification of two different concepts”, Proc. of SPIE, Optical Design and Engineering II, vol. 5962, Oct. 17, 2005, pp. 59622A-1-59622A-12.
Duparre et al., “Microoptical Artificial Compound Eyes—Two Different Concepts for Compact Imaging Systems”, 11th Microoptics Conference, Oct. 30-Nov. 2, 2005, 2 pgs.
Duparre et al., “Microoptical telescope compound eye”, Optics Express, Feb. 7, 2005, vol. 13, No. 3, pp. 889-903.
Duparre et al., “Micro-optically fabricated artificial apposition compound eye”, Electronic Imaging—Science and Technology, Prod. SPIE 5301, Jan. 2004, pp. 25-33.
Duparre et al., “Novel Optics/Micro-Optics for Miniature Imaging Systems”, Proc. of SPIE, Apr. 21, 2006, vol. 6196, pp. 619607-1-619607-15.
Duparre et al., “Theoretical analysis of an artificial superposition compound eye for application in ultra flat digital image acquisition devices”, Optical Systems Design, Proc. SPIE 5249, Sep. 2003, pp. 408-418.
Duparre et al., “Thin compound-eye camera”, Applied Optics, May 20, 2005, vol. 44, No. 15, pp. 2949-2956.
Duparre et al., “Ultra-Thin Camera Based on Artificial Apposition Compound Eyes”, 10th Microoptics Conference, Sep. 1-3, 2004, 2 pgs.
Eng et al., “Gaze correction for 3D tele-immersive communication system”, IVMSP Workshop, 2013 IEEE 11th. IEEE, Jun. 10, 2013.
Fanaswala, “Regularized Super-Resolution of Multi-View Images”, Retrieved on Nov. 10, 2012 (Nov. 10, 2012). Retrieved from the Internet at URL:<http://www.site.uottawa.ca/-edubois/theses/Fanaswala_thesis.pdf>, 2009, 163 pgs.
Fang et al., “Volume Morphing Methods for Landmark Based 3D Image Deformation”, SPIE vol. 2710, Proc. 1996 SPIE Intl Symposium on Medical Imaging, Newport Beach, CA, Feb. 10, 1996, pp. 404-415.
Farrell et al., “Resolution and Light Sensitivity Tradeoff with Pixel Size”, Proceedings of the SPIE Electronic Imaging 2006 Conference, Feb. 2, 2006, vol. 6069, 8 pgs.
Farsiu et al., “Advances and Challenges in Super-Resolution”, International Journal of Imaging Systems and Technology, Aug. 12, 2004, vol. 14, pp. 47-57.
Farsiu et al., “Fast and Robust Multiframe Super Resolution”, IEEE Transactions on Image Processing, Oct. 2004, published Sep. 3, 2004, vol. 13, No. 10, pp. 1327-1344.
Farsiu et al., “Multiframe Demosaicing and Super-Resolution of Color Images”, IEEE Transactions on Image Processing, Jan. 2006, vol. 15, No. 1, date of publication Dec. 12, 2005, pp. 141-159.
Fecker et al., “Depth Map Compression for Unstructured Lumigraph Rendering”, Proc. SPIE 6077, Proceedings Visual Communications and Image Processing 2006, Jan. 18, 2006, pp. 60770B-1-60770B-8.
Feris et al., “Multi-Flash Stereopsis: Depth Edge Preserving Stereo with Small Baseline Illumination”, IEEE Trans on PAMI, 2006, 31 pgs.
Fife et al., “A 3D Multi-Aperture Image Sensor Architecture”, Custom Integrated Circuits Conference, 2006, CICC '06, IEEE, pp. 281-284.
Fife et al., “A 3MPixel Multi-Aperture Image Sensor with 0.7Mu Pixels in 0.11Mu CMOS”, ISSCC 2008, Session 2, Image Sensors & Technology, 2008, pp. 48-50.
Fischer et al., “Optical System Design”, 2nd Edition, SPIE Press, Feb. 14, 2008, pp. 191-198.
Fischer et al., “Optical System Design”, 2nd Edition, SPIE Press, Feb. 14, 2008, pp. 49-58.
Gastal et al., “Shared Sampling for Real-Time Alpha Matting”, Computer Graphics Forum, EuroGraphics 2010, vol. 29, Issue 2, May 2010, pp. 575-584.
Georgeiv et al., “Light Field Camera Design for Integral View Photography”, Adobe Systems Incorporated, Adobe Technical Report, 2003, 13 pgs.
Georgiev et al., “Light-Field Capture by Multiplexing in the Frequency Domain”, Adobe Systems Incorporated, Adobe Technical Report, 2003, 13 pgs.
Goldman et al., “Video Object Annotation, Navigation, and Composition”, In Proceedings of UIST 2008, Oct. 19-22, 2008, Monterey CA, USA, pp. 3-12.
Gortler et al., “The Lumigraph”, In Proceedings of SIGGRAPH 1996, published Aug. 1, 1996, pp. 43-54.
Gupta et al., “Perceptual Organization and Recognition of Indoor Scenes from RGB-D Images”, 2013 IEEE Conference on Computer Vision and Pattern Recognition, Jun. 23-28, 2013, Portland, OR, USA, pp. 564-571.
Hacohen et al., “Non-Rigid Dense Correspondence with Applications for Image Enhancement”, ACM Transactions on Graphics, vol. 30, No. 4, Aug. 7, 2011, 9 pgs.
Hamilton, “JPEG File Interchange Format, Version 1.02”, Sep. 1, 1992, 9 pgs.
Hardie, “A Fast Image Super-Algorithm Using an Adaptive Wiener Filter”, IEEE Transactions on Image Processing, Dec. 2007, published Nov. 19, 2007, vol. 16, No. 12, pp. 2953-2964.
Hasinoff et al., “Search-and-Replace Editing for Personal Photo Collections”, 2010 International Conference: Computational Photography (ICCP) Mar. 2010, pp. 1-8.
Hernandez-Lopez et al., “Detecting objects using color and depth segmentation with Kinect sensor”, Procedia Technology, vol. 3, Jan. 1, 2012, pp. 196-204, XP055307680, ISSN: 2212-0173, DOI: 10.1016/j.protcy.2012.03.021.
Holoeye Photonics AG, “LC 2012 Spatial Light Modulator (transmissive)”, Sep. 18, 2013, retrieved from https://web.archive.org/web/20130918151716/http://holoeye.com/spatial-light-modulators/lc-2012-spatial-light-modulator/ on Oct. 20, 2017, 3 pages.
Holoeye Photonics AG, “Spatial Light Modulators”, Oct. 2, 2013, Brochure retrieved from https://web.archive.org/web/20131002061028/http://holoeye.com/wp-content/uploads/Spatial_Light_Modulators.pdf on Oct. 13, 2017, 4 pgs.
Holoeye Photonics AG, “Spatial Light Modulators”, Sep. 18, 2013, retrieved from https://web.archive.org/web/20130918113140/http://holoeye.com/spatial-light-modulators/ on Oct. 13, 2017, 4 pages.
Horisaki et al., “Irregular Lens Arrangement Design to Improve Imaging Performance of Compound-Eye Imaging Systems”, Applied Physics Express, Jan. 29, 2010, vol. 3, pp. 022501-1-022501-3.
Horisaki et al., “Superposition Imaging for Three-Dimensionally Space-Invariant Point Spread Functions”, Applied Physics Express, Oct. 13, 2011, vol. 4, pp. 112501-1-112501-3.
Horn et al., “LightShop: Interactive Light Field Manipulation and Rendering”, In Proceedings of I3D, Jan. 1, 2007, pp. 121-128.
Isaksen et al., “Dynamically Reparameterized Light Fields”, In Proceedings of SIGGRAPH 2000, pp. 297-306.
Izadi et al., “KinectFusion: Real-time 3D Reconstruction and Interaction Using a Moving Depth Camera”, UIST'11, Oct. 16-19, 2011, Santa Barbara, CA, pp. 559-568.
Janoch et al., “A category-level 3-D object dataset: Putting the Kinect to work”, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops), Nov. 6-13, 2011, Barcelona, Spain, pp. 1168-1174.
Jarabo et al., “Efficient Propagation of Light Field Edits”, In Proceedings of SIACG 2011, pp. 75-80.
Jiang et al., “Panoramic 3D Reconstruction Using Rotational Stereo Camera with Simple Epipolar Constraints”, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06), vol. 1, Jun. 17-22, 2006, New York, NY, USA, pp. 371-378.
Joshi, Neel S., “Color Calibration for Arrays of Inexpensive Image Sensors”, Master's with Distinction in Research Report, Stanford University, Department of Computer Science, Mar. 2004, 30 pgs.
Joshi et al., “Synthetic Aperture Tracking: Tracking Through Occlusions”, ICCV IEEE 11th International Conference on Computer Vision; Publication [online]. Oct. 2007 [retrieved Jul. 28, 2014]. Retrieved from the Internet: <URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=4409032&isnumber=4408819>; pp. 1-8.
Kang et al., “Handling Occlusions in Dense Multi-view Stereo”, Computer Vision and Pattern Recognition, 2001, vol. 1, pp. I-103-I-110.
Kim, “Scene Reconstruction from a Light Field”, Master Thesis, Sep. 1, 2010 (Sep. 1, 2010), pp. 1-72.
Kim et al., “Scene reconstruction from high spatio-angular resolution light fields”, ACM Transactions on Graphics (TOG)—SIGGRAPH 2013 Conference Proceedings, vol. 32 Issue 4, Article 73, Jul. 21, 2013, 11 pages.
Kitamura et al., “Reconstruction of a high-resolution image on a compound-eye image-capturing system”, Applied Optics, Mar. 10, 2004, vol. 43, No. 8, pp. 1719-1727.
Konolige, Kurt, “Projected Texture Stereo”, 2010 IEEE International Conference on Robotics and Automation, May 3-7, 2010, p. 148-155.
Krishnamurthy et al., “Compression and Transmission of Depth Maps for Image-Based Rendering”, Image Processing, 2001, pp. 828-831.
Kubota et al., “Reconstructing Dense Light Field From Array of Multifocus Images for Novel View Synthesis”, IEEE Transactions on Image Processing, vol. 16, No. 1, Jan. 2007, pp. 269-279.
Kutulakos et al., “Occluding Contour Detection Using Affine Invariants and Purposive Viewpoint Control”, Computer Vision and Pattern Recognition, Proceedings CVPR 94, Seattle, Washington, Jun. 21-23, 1994, 8 pgs.
Lai et al., “A Large-Scale Hierarchical Multi-View RGB-D Object Dataset”, Proceedings—IEEE International Conference on Robotics and Automation, Conference Date May 9-13, 2011, 8 pgs., DOI: 10.1109/ICRA.201135980382.
Lane et al., “A Survey of Mobile Phone Sensing”, IEEE Communications Magazine, vol. 48, Issue 9, Sep. 2010, pp. 140-150.
Lee et al., “Automatic Upright Adjustment of Photographs”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2012, pp. 877-884.
Lee et al., “Electroactive Polymer Actuator for Lens-Drive Unit in Auto-Focus Compact Camera Module”, ETRI Journal, vol. 31, No. 6, Dec. 2009, pp. 695-702.
Lee et al., “Nonlocal matting”, CVPR 2011, Jun. 20-25, 2011, pp. 2193-2200.
LensVector, “How LensVector Autofocus Works”, 2010, printed Nov. 2, 2012 from http://www.lensvector.com/overview.html, 1 pg.
Levin et al., “A Closed Form Solution to Natural Image Matting”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2006, vol. 1, pp. 61-68.
Levin et al., “Spectral Matting”, 2007 IEEE Conference on Computer Vision and Pattern Recognition, Jun. 17-22, 2007, Minneapolis, MN, USA, pp. 1-8.
Levoy, “Light Fields and Computational Imaging”, IEEE Computer Society, Sep. 1, 2006, vol. 39, Issue No. 8, pp. 46-55.
Levoy et al., “Light Field Rendering”, Proc. ADM SIGGRAPH '96, pp. 1-12.
Li et al., “A Hybrid Camera for Motion Deblurring and Depth Map Super-Resolution”, Jun. 23-28, 2008, IEEE Conference on Computer Vision and Pattern Recognition, 8 pgs. Retrieved from www.eecis.udel.edu/˜jye/lab_research/08/deblur-feng.pdf on Feb. 5, 2014.
Li et al., “Fusing Images With Different Focuses Using Support Vector Machines”, IEEE Transactions on Neural Networks, vol. 15, No. 6, Nov. 8, 2004, pp. 1555-1561.
Lim, Jongwoo, “Optimized Projection Pattern Supplementing Stereo Systems”, 2009 IEEE International Conference on Robotics and Automation, May 12-17, 2009, pp. 2823-2829.
Liu et al., “Virtual View Reconstruction Using Temporal Information”, 2012 IEEE International Conference on Multimedia and Expo, 2012, pp. 115-120.
Lo et al., “Stereoscopic 3D Copy & Paste”, ACM Transactions on Graphics, vol. 29, No. 6, Article 147, Dec. 2010, pp. 147:1-147:10.
Martinez et al., “Simple Telemedicine for Developing Regions: Camera Phones and Paper-Based Microfluidic Devices for Real-Time, Off-Site Diagnosis”, Analytical Chemistry (American Chemical Society), vol. 80, No. 10, May 15, 2008, pp. 3699-3707.
McGuire et al., “Defocus video matting”, ACM Transactions on Graphics (TOG)—Proceedings of ACM SIGGRAPH 2005, vol. 24, Issue 3, Jul. 2005, pp. 567-576.
Merkle et al., “Adaptation and optimization of coding algorithms for mobile 3DTV”, Mobile3DTV Project No. 216503, Nov. 2008, 55 pgs.
Mitra et al., “Light Field Denoising, Light Field Superresolution and Stereo Camera Based Refocussing using a GMM Light Field Patch Prior”, Computer Vision and Pattern Recognition Workshops (CVPRW), 2012 IEEE Computer Society Conference on Jun. 16-21, 2012, pp. 22-28.
Moreno-Noguer et al., “Active Refocusing of Images and Videos”, ACM Transactions on Graphics (TOG)—Proceedings of ACM SIGGRAPH 2007, vol. 26, Issue 3, Jul. 2007, 10 pages.
Muehlebach, “Camera Auto Exposure Control for VSLAM Applications”, Studies on Mechatronics, Swiss Federal Institute of Technology Zurich, Autumn Term 2010 course, 67 pgs.
Nayar, “Computational Cameras: Redefining the Image”, IEEE Computer Society, Aug. 14, 2006, pp. 30-38.
Ng, “Digital Light Field Photography”, Thesis, Jul. 2006, 203 pgs.
Ng et al., “Light Field Photography with a Hand-held Plenoptic Camera”, Stanford Tech Report CTSR 2005-02, Apr. 20, 2005, pp. 1-11.
Ng et al., “Super-Resolution Image Restoration from Blurred Low-Resolution Images”, Journal of Mathematical Imaging and Vision, 2005, vol. 23, pp. 367-378.
Nguyen et al., “Error Analysis for Image-Based Rendering with Depth Information”, IEEE Transactions on Image Processing, vol. 18, Issue 4, Apr. 2009, pp. 703-716.
Nguyen et al., “Image-Based Rendering with Depth Information Using the Propagation Algorithm”, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005, vol. 5, Mar. 23-23, 2005, pp. II-589-II-592.
Nishihara, H.K., “Prism: A Practical Real-Time Imaging Stereo Matcher”, Massachusetts Institute of Technology, A.I. Memo 780, May 1984, 32 pgs.
Nitta et al., “Image reconstruction for thin observation module by bound optics by using the iterative backprojection method”, Applied Optics, May 1, 2006, vol. 45, No. 13, pp. 2893-2900.
Nomura et al., “Scene Collages and Flexible Camera Arrays”, Proceedings of Eurographics Symposium on Rendering, Jun. 2007, 12 pgs.
Park et al., “Multispectral Imaging Using Multiplexed Illumination”, 2007 IEEE 11th International Conference on Computer Vision, Oct. 14-21, 2007, Rio de Janeiro, Brazil, pp. 1-8.
Park et al., “Super-Resolution Image Reconstruction”, IEEE Signal Processing Magazine, May 2003, pp. 21-36.
Parkkinen et al., “Characteristic Spectra of Munsell Colors”, Journal of the Optical Society of America A, vol. 6, Issue 2, Feb. 1989, pp. 318-322.
Perwass et al., “Single Lens 3D-Camera with Extended Depth-of-Field”, printed from www.raytrix.de, Jan. 22, 2012, 15 pgs.
Pham et al., “Robust Super-Resolution without Regularization”, Journal of Physics: Conference Series 124, Jul. 2008, pp. 1-19.
Philips 3D Solutions, “3D Interface Specifications, White Paper”, Feb. 15, 2008, 2005-2008 Philips Electronics Nederland B.V., Philips 3D Solutions retrieved from www.philips.com/3dsolutions, 29 pgs., Feb. 15, 2008.
Polight, “Designing Imaging Products Using Reflowable Autofocus Lenses”, printed Nov. 2, 2012 from http://www.polight.no/tunable-polymer-autofocus-lens-html--11.html, 1 pg.
Pouydebasque et al., “Varifocal liquid lenses with integrated actuator, high focusing power and low operating voltage fabricated on 200 mm wafers”, Sensors and Actuators A: Physical, vol. 172, Issue 1, Dec. 2011, pp. 280-286.
Protter et al., “Generalizing the Nonlocal-Means to Super-Resolution Reconstruction”, IEEE Transactions on Image Processing, Dec. 2, 2008, vol. 18, No. 1, pp. 36-51.
Radtke et al., “Laser lithographic fabrication and characterization of a spherical artificial compound eye”, Optics Express, Mar. 19, 2007, vol. 15, No. 6, pp. 3067-3077.
Rajan et al., “Simultaneous Estimation of Super Resolved Scene and Depth Map from Low Resolution Defocused Observations”, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 25, No. 9, Sep. 8, 2003, pp. 1-16.
Rander et al., “Virtualized Reality: Constructing Time-Varying Virtual Worlds From Real World Events”, Proc. of IEEE Visualization '97, Phoenix, Arizona, Oct. 19-24, 1997, pp. 277-283, 552.
Rhemann et al., “Fast Cost-Volume Filtering for Visual Correspondence and Beyond”, IEEE Trans. Pattern Anal. Mach. Intell, 2013, vol. 35, No. 2, pp. 504-511.
Rhemann et al., “A perceptually motivated online benchmark for image matting”, 2009 IEEE Conference on Computer Vision and Pattern Recognition, Jun. 20-25, 2009, Miami, FL, USA, pp. 1826-1833.
Robert et al., “Dense Depth Map Reconstruction :A Minimization and Regularization Approach which Preserves Discontinuities”, European Conference on Computer Vision (ECCV), pp. 439-451, (1996).
Robertson et al., “Dynamic Range Improvement Through Multiple Exposures”, In Proc. of the Int. Conf. on Image Processing, 1999, 5 pgs.
Robertson et al., “Estimation-theoretic approach to dynamic range enhancement using multiple exposures”, Journal of Electronic Imaging, Apr. 2003, vol. 12, No. 2, pp. 219-228.
Roy et al., “Non-Uniform Hierarchical Pyramid Stereo for Large Images”, Computer and Robot Vision, 2002, pp. 208-215.
Sauer et al., “Parallel Computation of Sequential Pixel Updates in Statistical Tomographic Reconstruction”, ICIP 1995 Proceedings of the 1995 International Conference on Image Processing, Date of Conference: Oct. 23-26, 1995, pp. 93-96.
Scharstein et al., “High-Accuracy Stereo Depth Maps Using Structured Light”, IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2003), Jun. 2003, vol. 1, pp. 195-202.
Seitz et al., “Plenoptic Image Editing”, International Journal of Computer Vision 48, Conference Date Jan. 7, 1998, 29 pgs, DOI: 10.1109/ICCV.1998.710696 ⋅ Source: DBLP Conference: Computer Vision, Sixth International Conference.
Shotton et al., “Real-time human pose recognition in parts from single depth images”, CVPR 2011, Jun. 20-25, 2011, Colorado Springs, CO, USA, pp. 1297-1304.
Shum et al., “A Review of Image-based Rendering Techniques”, Visual Communications and Image Processing 2000, May 2000, 12 pgs.
Shum et al., “Pop-Up Light Field: An Interactive Image-Based Modeling and Rendering System”, Apr. 2004, ACM Transactions on Graphics, vol. 23, No. 2, pp. 143-162. Retrieved from http://131.107.65.14/en-us/um/people/jiansun/papers/PopupLightField_TOG.pdf on Feb. 5, 2014.
Silberman et al., “Indoor segmentation and support inference from RGBD images”, ECCV'12 Proceedings of the 12th European conference on Computer Vision, vol. Part V, Oct. 7-13, 2012, Florence, Italy, pp. 746-760.
Stober, “Stanford researchers developing 3-D camera with 12,616 lenses”, Stanford Report, Mar. 19, 2008, Retrieved from: http://news.stanford.edu/news/2008/march19/camera-031908.html, 5 pgs.
Stollberg et al., “The Gabor superlens as an alternative wafer-level camera approach inspired by superposition compound eyes of nocturnal insects”, Optics Express, Aug. 31, 2009, vol. 17, No. 18, pp. 15747-15759.
Sun et al., “Image Super-Resolution Using Gradient Profile Prior”, 2008 IEEE Conference on Computer Vision and Pattern Recognition, Jun. 23-28, 2008, 8 pgs.; DOI: 10.1109/CVPR.2008.4587659.
Taguchi et al., “Rendering-Oriented Decoding for a Distributed Multiview Coding System Using a Coset Code”, Hindawi Publishing Corporation, EURASIP Journal on Image and Video Processing, vol. 2009, Article ID 251081, Online: Apr. 22, 2009, 12 pages.
Takeda et al., “Super-resolution Without Explicit Subpixel Motion Estimation”, IEEE Transaction on Image Processing, Sep. 2009, vol. 18, No. 9, pp. 1958-1975.
Tallon et al., “Upsampling and Denoising of Depth Maps Via Joint-Segmentation”, 20th European Signal Processing Conference, Aug. 27-31, 2012, 5 pgs.
Tanida et al., “Color imaging with an integrated compound imaging system”, Optics Express, Sep. 8, 2003, vol. 11, No. 18, pp. 2109-2117.
Tanida et al., “Thin observation module by bound optics (TOMBO): concept and experimental verification”, Applied Optics, Apr. 10, 2001, vol. 40, No. 11, pp. 1806-1813.
Tao et al., “Depth from Combining Defocus and Correspondence Using Light-Field Cameras”, ICCV '13 Proceedings of the 2013 IEEE International Conference on Computer Vision, Dec. 1, 2013, pp. 673-680.
Taylor, “Virtual camera movement: The way of the future?”, American Cinematographer vol. 77, No. 9, Sep. 1996, 93-100.
Tseng et al., “Automatic 3-D depth recovery from a single urban-scene image”, 2012 Visual Communications and Image Processing, Nov. 27-30, 2012, San Diego, CA, USA, pp. 1-6.
Vaish et al., “Reconstructing Occluded Surfaces Using Synthetic Apertures: Stereo, Focus and Robust Measures”, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06), vol. 2, Jun. 17-22, 2006, pp. 2331-2338.
Vaish et al., “Synthetic Aperture Focusing Using a Shear-Warp Factorization of the Viewing Transform”, IEEE Workshop on A3DISS, CVPR, 2005, 8 pgs.
Vaish et al., “Using Plane + Parallax for Calibrating Dense Camera Arrays”, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2004, 8 pgs.
Van Der Wal et al., “The Acadia Vision Processor”, Proceedings Fifth IEEE International Workshop on Computer Architectures for Machine Perception, Sep. 13, 2000, Padova, Italy, pp. 31-40.
Veilleux, “CCD Gain Lab: The Theory”, University of Maryland, College Park—Observational Astronomy (ASTR 310), Oct. 19, 2006, pp. 1-5 (online], [retrieved on May 13, 2014]. Retrieved from the Internet <URL: http://www.astro.umd.edu/˜veilleux/ASTR310/fall06/ccd_theory.pdf, 5 pgs.
Venkataraman et al., “PiCam: An Ultra-Thin High Performance Monolithic Camera Array”, ACM Transactions on Graphics (TOG), ACM, US, vol. 32, No. 6, 1 Nov. 1, 2013, pp. 1-13.
Vetro et al., “Coding Approaches for End-To-End 3D TV Systems”, Mitsubishi Electric Research Laboratories, Inc., TR2004-137, Dec. 2004, 6 pgs.
Viola et al., “Robust Real-time Object Detection”, Cambridge Research Laboratory, Technical Report Series, Compaq, CRL Jan. 2001, Feb. 2001, Printed from: http://www.hpl.hp.com/techreports/Compaq-DEC/CRL-2001-1.pdf, 30 pgs.
Vuong et al., “A New Auto Exposure and Auto White-Balance Algorithm to Detect High Dynamic Range Conditions Using CMOS Technology”, Proceedings of the World Congress on Engineering and Computer Science 2008, WCECS 2008, Oct. 22-24, 2008.
Wang, “Calculation of Image Position, Size and Orientation Using First Order Properties”, Dec. 29, 2010, OPTI521 Tutorial, 10 pgs.
Wang et al., “Automatic Natural Video Matting with Depth”, 15th Pacific Conference on Computer Graphics and Applications, PG '07, Oct. 29-Nov. 2, 2007, Maui, HI, USA, pp. 469-472.
Wang et al., “Image and Video Matting: A Survey”, Foundations and Trends, Computer Graphics and Vision, vol. 3, No. 2, 2007, pp. 91-175.
Wang et al., “Soft scissors: an interactive tool for realtime high quality matting”, ACM Transactions on Graphics (TOG)—Proceedings of ACM SIGGRAPH 2007, vol. 26, Issue 3, Article 9, Jul. 2007, 6 pages, published Aug. 5, 2007.
Wetzstein et al., “Computational Plenoptic Imaging”, Computer Graphics Forum, 2011, vol. 30, No. 8, pp. 2397-2426.
Wheeler et al., “Super-Resolution Image Synthesis Using Projections Onto Convex Sets in the Frequency Domain”, Proc. SPIE, Mar. 11, 2005, vol. 5674, 12 pgs.
Wieringa et al., “Remote Non-invasive Stereoscopic Imaging of Blood Vessels: First In-vivo Results of a New Multispectral Contrast Enhancement Technology”, Annals of Biomedical Engineering, vol. 34, No. 12, Dec. 2006, pp. 1870-1878, Published online Oct. 12, 2006.
Wikipedia, “Polarizing Filter (Photography)”, retrieved from http://en.wikipedia.org/wiki/Polarizing_filter_(photography) on Dec. 12, 2012, last modified on Sep. 26, 2012, 5 pgs.
Wilburn, “High Performance Imaging Using Arrays of Inexpensive Cameras”, Thesis of Bennett Wilburn, Dec. 2004, 128 pgs.
Wilburn et al., “High Performance Imaging Using Large Camera Arrays”, ACM Transactions on Graphics, Jul. 2005, vol. 24, No. 3, pp. 1-12.
Wilburn et al., “High-Speed Videography Using a Dense Camera Array”, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004., vol. 2, Jun. 27-Jul. 2, 2004, pp. 294-301.
Wilburn et al., “The Light Field Video Camera”, Proceedings of Media Processors 2002, SPIE Electronic Imaging, 2002, 8 pgs.
Wippermann et al., “Design and fabrication of a chirped array of refractive ellipsoidal micro-lenses for an apposition eye camera objective”, Proceedings of SPIE, Optical Design and Engineering II, Oct. 15, 2005, 59622C-1-59622C-11.
Wu et al., “A virtual view synthesis algorithm based on image inpainting”, 2012 Third International Conference on Networking and Distributed Computing, Hangzhou, China, Oct. 21-24, 2012, pp. 153-156.
Xu, “Real-Time Realistic Rendering and High Dynamic Range Image Display and Compression”, Dissertation, School of Computer Science in the College of Engineering and Computer Science at the University of Central Florida, Orlando, Florida, Fall Term 2005, 192 pgs.
Yang et al., “A Real-Time Distributed Light Field Camera”, Eurographics Workshop on Rendering (2002), published Jul. 26, 2002, pp. 1-10.
Yang et al., “Superresolution Using Preconditioned Conjugate Gradient Method”, Proceedings of SPIE—The International Society for Optical Engineering, Jul. 2002, 8 pgs.
Yokochi et al., “Extrinsic Camera Parameter Estimation Based-on Feature Tracking and GPS Data”, 2006, Nara Institute of Science and Technology, Graduate School of Information Science, LNCS 3851, pp. 369-378.
Zhang et al., “A Self-Reconfigurable Camera Array”, Eurographics Symposium on Rendering, published Aug. 8, 2004, 12 pgs.
Zhang et al., “Depth estimation, spatially variant image registration, and super-resolution using a multi-lenslet camera”, Proceedings of SPIE, vol. 7705, Apr. 23, 2010, pp. 770505-770505-8, XP055113797 ISSN: 0277-786X, DOI: 10.1117/12.852171.
Zheng et al., “Balloon Motion Estimation Using Two Frames”, Proceedings of the Asilomar Conference on Signals, Systems and Computers, IEEE, Comp. Soc. Press, US, vol. 2 of 02, Nov. 4, 1991, pp. 1057-1061.
Zhu et al., “Fusion of Time-of-Flight Depth and Stereo for High Accuracy Depth Maps”, 2008 IEEE Conference on Computer Vision and Pattern Recognition, Jun. 23-28, 2008, Anchorage, AK, USA, pp. 1-8.
Zomet et al., “Robust Super-Resolution”, IEEE, 2001, pp. 1-6.
Extended European Search Report for European Application No. 23177057.9, Search completed Dec. 4, 2023, dated Dec. 15, 2023, 14 Pgs.
“Light Field—Wikipedia, the Free Encyclopedia”, The Wayback Machine—Aug. 15, 2012, https://web.archive.Org/web/20120815204221/http://en.wikipedia.org/wiki/Light_field, XP093106694, Retrieved from the Internet , URL: https://web.archive.org/web/20120815204221/https://en.Wikipedia.org/wiki/Light_field#Ways—to_create—light—fields [retrieved on Nov. 28, 2023].
Debevec et al., “Modeling and Rendering Architecture from Photographs: a Hybrid Geometry- and Image-Based Approach”, Computer Graphics Proceedings. Siggraph '96, Jan. 19, 1996, pp. 11-20, XP093106889, New York, US DOI: 10.1145/237170.237191 ISBN: 978-0-89791-746-9 Retrieved from the Internet:, URL:https ://www2.eecs.berkeley.edu/Pubs/TechRpts/1996/CSD-96-893.pdf> [retrieved on Nov. 28, 2023].
Extended European Search Report for European Application No. 21169308.0, Search completed Aug. 2, 2021, dated Aug. 9, 2021, 9 Pgs.
Extended Search Report for European Application No. 21155002.5, Search completed Jun. 7, 2021, dated Jun. 11, 2021, 14 Pgs.
Hasinoff et al., “Burst photography for high dynamic range and low-light imaging on mobile cameras”, ACM Transactions on Graphics, vol. 35, No. 6, Article No. 192, pp. 1-12, https://doi.org/10.1145/2980179.2980254.
Takeda et al., “Robust Kernel Regression for Restoration and Reconstruction of Images from Sparse Noisy Data”, 2006 International Conference on Image Processing, Atlanta, GA, USA, 2006, pp. 1257-1260, doi: 10.1109/ICIP.2006.312573.

Related Publications (1)

	Number	Date	Country
	20210150748 A1	May 2021	US

Provisional Applications (2)

	Number	Date	Country
	61780906	Mar 2013	US
	61691666	Aug 2012	US

Continuations (7)

	Number	Date	Country
Parent	16537369	Aug 2019	US
Child	17162542		US
Parent	15858974	Dec 2017	US
Child	16537369		US
Parent	14988680	Jan 2016	US
Child	15858974		US
Parent	14526392	Oct 2014	US
Child	14988680		US
Parent	14329754	Jul 2014	US
Child	14526392		US
Parent	14144458	Dec 2013	US
Child	14329754		US
Parent	13972881	Aug 2013	US
Child	14144458		US

Systems and methods for estimating depth and visibility from a reference viewpoint for pixels in a set of images captured from different viewpoints

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

Field of Search

CPC

International Classifications

Term Extension

Abstract