Autofocus for stereo images

Description

TECHNICAL FIELD

The present embodiments relate to stereoscopic image capture, and in particular, to methods, apparatus and systems for determining an appropriate focal depth for a stereoscopic image capture device.

BACKGROUND

Stereopsis is the process by which the human brain interprets an object's depth based on the relative displacement of the object as seen from the left and right eyes. The stereoscopic effect may be artificially induced by taking first and second images of a scene from first and second laterally offset viewing positions and presenting the images separately to each of the left and right eyes. By capturing a succession of stereoscopic image pairs in time, the image pairs may be successively presented to the eyes to form a stereoscopic movie that appears to the user as having three-dimensions.

Two traditional cameras may be used to acquire each of the images of a stereoscopic image pair. A traditional camera may be properly focused using an autofocus procedure which captures a plurality of images at different focal depths. The focal depth corresponding to the highest frequency content is then used for subsequent image captures. Traditional movie cameras may use this method to autofocus during video capture. However, the capture of frames will need to be periodically delayed while the autofocus functionality is performed.

While suitable for capturing 2D images with a single traditional camera, this autofocus technique may be unsuitable for stereoscopic image capture. In particular, the technique may disrupt the video stream and may be affected by camera movement, such as by the user's hand motions.

SUMMARY

Certain embodiments contemplate a method in an electronic device for determining a focal depth for an image sensor. They method may comprise: receiving a first image associated with a first viewpoint; receiving a second image associated with a second viewpoint; determining a first plurality of keypoints based on the first image; correlating keypoints from the first plurality of keypoints with positions in the second image; determining a plurality of disparities associated with each of the first plurality of keypoints; and determining a focal depth based upon the plurality of disparities, the position of the first viewpoint and the position of the second viewpoint.

In certain embodiments, the method may further comprise determining a second plurality of keypoints based on the second image. In some embodiments, correlating keypoints from the first plurality of keypoints with positions in the second image comprises correlating keypoints from the first plurality of keypoints with keypoints from the second plurality of keypoints. In some embodiments, correlating keypoints from the first plurality of keypoints with positions in the second image comprises iterating over pixels within a search range in the second image. In some embodiments, correlating keypoints from the first plurality of keypoints with positions in the second image comprises determining the mean square error between pixels in the first image and the second image. In some embodiments, determining a first plurality of keypoints based on the first image comprises determining Scale Invariant Feature Transform (SIFT) keypoints based on the first image. In some embodiments, determining a first plurality of keypoints based on the first image comprises sub-sampling the first image, applying a high-pass filter to the first image, calculating the power of the first image, and thresholding the first image. In some embodiments, correlating keypoints from the first plurality of keypoints with positions in the second image occurs in realtime. In some embodiments, the electronic device comprises a mobile phone.

Certain embodiments contemplate a computer readable medium comprising instructions configured to cause a computer to perform the steps of: receiving a first image associated with a first viewpoint; receiving a second image associated with a second viewpoint; determining a first plurality of keypoints based on the first image; correlating keypoints from the first plurality of keypoints with positions in the second image; determining a plurality of disparities associated with each of the first plurality of keypoints; and determining a focal depth based upon the plurality of disparities, the position of the first viewpoint and the position of the second viewpoint.

In some embodiments, the instructions are also configured to cause the processor to determine a second plurality of keypoints based on the second image. In some embodiments, correlating keypoints from the first plurality of keypoints with positions in the second image comprises correlating keypoints from the first plurality of keypoints with keypoints from the second plurality of keypoints. In some embodiments, correlating keypoints from the first plurality of keypoints with positions in the second image comprises iterating over pixels within a search range in the second image. In some embodiments, correlating keypoints from the first plurality of keypoints with positions in the second image comprises determining the mean square error between pixels in the first image and the second image. In some embodiments, determining a first plurality of keypoints based on the first image comprises determining Scale Invariant Feature Transform (SIFT) keypoints based on the first image. In some embodiments, determining a first plurality of keypoints based on the first image comprises sub-sampling the first image, applying a high-pass filter to the first image, calculating the power of the first image, and thresholding the first image. In some embodiments, correlating keypoints from the first plurality of keypoints with positions in the second image occurs in realtime. In some embodiments, the computer is located in a mobile phone.

Certain embodiments contemplate a system for focusing a stereoscopic capture device. The system may comprise a first image sensor configured to generate a first image associated with a first viewpoint; a second image sensor configured to generate a second image associated with a second viewpoint; a feature generation module configured to determine a first plurality of keypoints based on the first image; a keypoint correlation module configured to correlate keypoints from the first plurality of keypoints with positions in the second image; a disparity determination module configured to determine a plurality of disparities associated with each of the first plurality of keypoints; and a depth determination module configured to determine a focal depth based upon the plurality of disparities, the position of the first viewpoint and the position of the second viewpoint.

In some embodiments, the feature generation module may be configured to determine a second plurality of keypoints based on the second image. In some embodiments, the software module configured to correlate keypoints is configured to correlate keypoints from the first plurality of keypoints with keypoints from the second plurality of keypoints. In some embodiments, the software module configured to correlate keypoints is configured to iterate over pixels within a search range in the second image. In some embodiments, correlating keypoints from the first plurality of keypoints with positions in the second image comprises determining the mean square error between pixels in the first image and the second image.

In some embodiments, the feature generation module is configured to determine Scale Invariant Feature Transform (SIFT) keypoints based on the first image. In some embodiments, the feature generation module is configured to sub-sample the first image, apply a high-pass filter to the first image, calculate the power of the first image, and threshold the first image.

In some embodiments, the software module configured to correlate keypoints correlates keypoints from the first plurality of keypoints with positions in the second image in realtime. In some embodiments, the stereoscopic capture device is located on a mobile phone. In some embodiments, the software module configured to determine a focal depth comprises a disparity histogram.

Certain embodiments contemplate a system for focusing a stereoscopic capture device, the system comprising: means for receiving a first image associated with a first viewpoint; means for receiving a second image associated with a second viewpoint; means for determining a first plurality of keypoints based on the first image; means for correlating keypoints from the first plurality of keypoints with positions in the second image; means for determining a plurality of disparities associated with each of the first plurality of keypoints; and means for determining a focal depth based upon the plurality of disparities, the position of the first viewpoint and the position of the second viewpoint.

In some embodiments the means for receiving a first image comprises a first sensor, the means for receiving a second image comprises a second sensor, the means for determining a first plurality of keypoints comprises a feature generation module, the means for correlating comprises a keypoint correlation module, the means for determining a plurality of disparities comprises a disparity determination module, and the means for determining a focal depth comprises a depth determination module. In some embodiments, the means for determining a first plurality of keypoints is configured to determine a second plurality of keypoints based on the second image. In some embodiments, the means for correlating keypoints from the first plurality of keypoints with positions in the second image is configured to correlate keypoints from the first plurality of keypoints with keypoints from the second plurality of keypoints. In some embodiments, the means for correlating keypoints from the first plurality of keypoints with positions in the second image is configured to iterate over pixels within a search range in the second image. In some embodiments, the means for correlating keypoints from the first plurality of keypoints with positions in the second image is configured to determine the mean square error between pixels in the first image and the second image. In some embodiments, the means for determining a first plurality of keypoints is configured to determine Scale Invariant Feature Transform (SIFT) keypoints based on the first image. In some embodiments, the means for determining a first plurality of keypoints is configured to sub-sample the first image, apply a high-pass filter to the first image, calculate the power of the first image, and threshold the first image. In some embodiments, the means for correlating keypoints correlates the keypoints from the first plurality of keypoints with positions in the second image in realtime. In some embodiments, the stereoscopic capture device is located on a mobile phone.

BRIEF DESCRIPTION OF THE DRAWINGS

The disclosed aspects will hereinafter be described in conjunction with the appended drawings, provided to illustrate and not to limit the disclosed aspects, wherein like designations denote like elements.

FIG. 1 is a generalized diagram depicting one possible mobile device comprising a sensor arrangement facilitating the capture of stereoscopic images.

FIG. 2 is a block diagram of certain of the components in a mobile device, such as the mobile device of FIG. 1.

FIG. 3 depicts the capturing of an object at a first and second position using a stereo pair of capture devices.

FIG. 4 is a graph depicting the relationship between object distance and pixel disparity for a particular camera arrangement.

FIG. 5A is a diagram depicting a top-down view of an arbitrary scene and two image capture sensors positioned so as to achieve a stereoscopic effect.

FIG. 5B depicts one of the pair of stereoscopic images taken of the scene in FIG. 5A with the magnitude and direction of object disparities in the scene overlaid.

FIG. 6 depicts the graph of FIG. 4, but with an object disparity histogram and a corresponding object depth histogram overlaid.

FIG. 7 depicts a flow diagram for the process by which certain embodiments determine a new focal depth.

FIG. 8 depicts a flow diagram for the process by which certain of the embodiments determine keypoints.

FIG. 9 depicts a stereoscopic image pair and a region in which keypoints are correlated between each of the images.

DETAILED DESCRIPTION

Embodiments relate to systems and methods of determining or setting configuration data in a stereoscopic camera. In one embodiment, the configuration data relates to the proper focal length of the two lenses of the stereoscopic camera. In one embodiment, a first camera receives a first image from a scene and a second camera receives a second image of the same scene. A set of keypoints are determined from analysis of the first image. The keypoints, can be, for example, Keypoints may comprise any data structure which can be consistently replicated from a portion of an image and thereby permit unique identification of the image portion. In some embodiments, a keypoint may comprise a plurality of pixels corresponding to a portion of an image. The keypoint may be associated with a position in the image. After determining a keypoint in the first image, the system looks for a similar position in the second image. Once the similar position in the second image is identified, the system calculates the difference between the keypoints in the first image, and the corresponding position in the second image. This allows the system to determine the focal depth of the scene by knowing the disparity between the same keypoint positions in both frames, along with the positions of the stereoscopic lenses.

Present embodiments contemplate systems, apparatus, and methods to determine an appropriate focal depth for a sensor based upon at least a pair of stereoscopic images. Particularly, certain of the embodiments contemplate determining keypoints for each image, identifying correlations between the keypoints, and deriving object distances from the correlations. One skilled in the art will recognize that these embodiments may be implemented in hardware, software, firmware, or any combination thereof. The stereoscopic system may be implemented on a wide range of electronic devices, including mobile wireless communication devices, personal digital assistants (PDAs), laptop computers, desktop computers, digital cameras, digital recording devices, and the like.

FIG. 1 depicts a mobile device 100 comprising a sensor arrangement facilitating the capture of stereoscopic images, or other means for receiving an image. Such a device may be a mobile phone, personal digital assistant, gaming device, or the like. The device 100 may comprise a first sensor 101a and a second sensor 101b separated by a distance d. The device may also comprise user input controls 102 and a display 103. In some embodiments, the sensors 101a and 101b may be situated such that they are horizontally, but not vertically, offset when the users holds device 100 so as to capture a stereoscopic picture or movie.

Although this particular device depicts two sensors 101a and 101b one skilled in the art may readily conceive of a stereoscopic image capture device which comprises more or less than two image sensors. For example, a device with only a single sensor may operate in combination with a series of lenses or reflecting surfaces to acquire two images at the positions of sensors 101a and 101b in rapid succession. This arrangement would likewise be able to acquire a stereoscopic image pair for use with the methods described below and the single sensor could be focused accordingly. Thus, the methods and systems discussed in this application will be applicable to any system which acquires two images from a first and second viewpoint, so long as those viewpoints facilitate a stereoscopic depiction of the image scene. Thus, reference to a pair of image sensors should not be considered to exclude the possibility of a single image sensor receiving images from two viewpoints.

FIG. 2 is a block diagram of certain of the components in a mobile device, such as the mobile device 100 depicted in FIG. 1. Sensor 101a receives a first image of the stereoscopic image pair and sensor 101b receives a second image of the stereoscopic image pair. In some embodiments, the sensors may receive the images simultaneously. The device may comprise a video front end 102 and memory 103. Video front end 102 may process incoming raw image data from sensors 101a and 101b and store the data in memory 103. Memory 103 may also comprise various applications and software drivers for the mobile device 100. For example, a display driver module 104 may be in communication with the display 103. A user input module 106 may similarly be in communication with a user interface 102. A wireless communication driver module 107 may be in communication with wireless communications hardware 112.

The memory may also be in communication with a General Processor 113. The General Processor 113 may comprise sub-processing units, or subprocessors, such as an Advanced RISC Machine (ARM), digital signal processor (DSP), or graphical processing unit (GPU). These processors may communicate with local memory 114 when handling various operations.

Certain of the present embodiments contemplate the addition of a “Focal Depth Analysis Module” 115a, 115b to the system architecture. In some embodiments, the module may take the form of a dedicated processor 115a, or a portion of a processor located on the general processor. In some embodiments the module may comprise software code 115b stored in a computer readable medium such as memory 103. Some embodiments may place portions of the module at a dedicated processor 115a and memory 115b as firmware, or as a software-hardware combination. In some embodiments, the module may reside at any location in FIG. 2 which permits access to a feature generation system, such as a SIFT feature generation system, and to sensors 101a and 101b. Thus, the module may take advantage of preexisting hardware or software configured for feature generation and/or detection. One skilled in the art will recognize that the embodiments described below could be implemented using a subprocessor on the General Processor 113, or could be stored as a separate application in memory 103. In some embodiments the SIFT feature generation system may be found in software, whereas in other embodiments the SIFT feature generation system may be found in hardware.

Certain of the present embodiments provide auto-focus functionality which takes advantage of geometric properties of stereoscopic image capture. FIG. 3 depicts, via a top-down view, the stereoscopic image capture of an object 304 at a first position 300a and second position 300b using a stereoscopic camera arrangement. A first image capture device 301a may be located at a first position laterally separated from a second capture device 301b located at a second position. The first capture device 301a may capture a first image of the scene from the first position and the second capture device 301b may capture a second image of the scene from the second position. The first and second images will accordingly be associated with first and second viewpoints of the scene based on the positions and orientations of capture devices 301a and 301b. Object 304 may appear in both images. In some embodiments, capture device 301a and capture device 301b may be the same as sensors 101a and 101b of FIG. 1 respectively. Capture devices 301a, 301b may be calibrated to have no vertical disparity and to possess fairly close focal distances.

The center of device 301a's viewpoint passes along line 302a. Similarly, the center of device 301b's viewpoint passes along line 302b. These two centerlines intersect at the position 303. As mentioned, the object 304 appears in each of the first and second images. With regard to position 300a, however, the object 304 appears to the right of centerline 302a, by an amount 305a and to the left of centerline 302b by an amount 305b. Conversely, in position 300b, the object 304 appears to the left of centerline 302a by an amount 306a, and to the right of centerline 302b, by an amount 306b. In this manner, the relative position of the object in the z-direction is reflected by the relative displacement in each of the left and right images.

Object disparity may be defined as the difference between an object's position in the first image as compared to the object's position in the second image. Where there is no vertical disparity between the capture devices, the disparity may comprise only the lateral offset from the position in one image to the position in another. One may arbitrarily take the disparity as the difference between the left and right, or right and left images. For the purposes of this description, the disparity is defined as the position of the object in the image from sensor 301b minus the position of the object in the image from sensor 301a (with the x-direction positive as indicated in FIG. 3). Thus, negative disparity results from the depiction of object 304 in the position 300a and positive disparity results from the depiction of object 304 in the position 300b.

With knowledge of the sensor positions and relative orientations one can construct a graph of the relationship between the observed disparity and an object's distance, or depth, from the camera arrangement. FIG. 4, for example, is a graph of this relationship for one particular sensor arrangement. As the disparity 401 increases, the object distance 402 increases as well. An initial negative disparity may be present for objects very near the camera arrangement, i.e. those having little depth 402 in the z-direction of FIG. 3. As the object is moved further from the camera arrangement (i.e. the depth increases) the disparity becomes increasingly positive, beginning to plateau for objects at a considerable distance. One may recognize that the chart of FIG. 4 may depend upon the angles at which sensors 301a, 301b are oriented. Similarly, though the sensors may be parallel to one another as in FIGS. 1 and 3, displacement in the z and y directions between the sensors may also result in modifications to the graph. Such a graph may be stored in memory on the device, or in a similar storage structure for quick reference.

FIG. 5a is a top-down view of a scene comprising several objects. Again, image capture devices 301a and 301b may be used to acquire images 501a and 501b respectively. Objects 502-504, are located at various depths within the scene. Consequently, disparities between the object positions in images 501a and 501b will be observed. FIG. 5B depicts the image 501a with certain of the disparity magnitudes and directions indicated at the pixel positions for which they occur. For example, a plurality of positive disparities 510 emerge for the distant object 504, and a plurality of negative disparities 511 appear for the closer object 502. With reference to the graph of FIG. 4, an automated system may determine the depth associated with each disparity. For example, as shown in FIG. 6, disparities of the same magnitude have been accumulated and plotted to form the disparity histogram 601. The corresponding depths 602 may be derived from the relationship of the sensors to generate depth histogram 602. The depth histogram 602 would suggest the presence of one or more objects in the region of each maximum at the indicated depth.

An autofocus operation comprises the determination of the proper focal depth for one or both of the sensors 101a, 101b. In some embodiments, the proper focal depth may be determined by taking the mean, median, or similar statistic of object depth histogram (or the object disparity histogram in conjunction with a graph such as FIG. 4). The median statistic provides some robustness to outlying value while a special order statistic filter may be used to accommodate a particular application. The statistic selected may depend upon the relative weights to be given very distant and very close objects. For example, focal quality may be roughly the same through one range of depths, but vary dramatically in a second range. These variations are discussed in greater detail below.

While one could generate the disparity histogram 601 by determining the disparity of every pixel for every object found in each of the first and second images, this may be computationally expensive and impractical on a mobile device. Not only would the correlation of every pixel require iterating through a substantial number of pixels, but each image may comprise multiple pixels of the same value, making identification of an individual pixel and its correlation to an object in each image difficult.

In lieu of analyzing every pixel, certain of the present embodiments contemplate creating a “sparse” disparity map or “sparse” corresponding depth map of the image contents. In certain embodiments, keypoints may be determined in each of the images and the disparities between keypoints, or between keypoints and pixels, rather than between all or most of the pixels in the images, may be used to infer object depth. Since there are fewer keypoints than pixels, the consequent disparity or depth map is “sparse”. Keypoints may comprise any data structure which can be consistently replicated from a portion of an image and thereby permit unique identification of the image portion. The keypoint may be associated with a position in the image. The keypoint's unique determination permits the keypoints to be identified from similar or identical portions in a second image. In some embodiments, keypoints may comprise Scale Invariant Feature Transform (SIFT) keypoints, or keypoints of a similar feature generation module. In some embodiments, the system may reuse machine vision components preexisting in the general processor 113 or subprocessors to determine keypoints. For example, high pass filtering blocks may be reused for keypoint detection. Alternatively, software libraries for performing machine vision operations stored in memory 103 may be used to generate keypoints. In this manner, certain implementations may economically take advantage of functionality associated with other applications to generate keypoints for performing autofocus. Alternative means for determining a plurality of keypoints, such as feature generation modules employing algorithms other than SIFT, are described in greater detail below.

FIG. 7 is a flow diagram depicting an autofocus process 700 for stereoscopic image capture, which may be implemented by certain of the present embodiments. The process begins 701 by acquiring, or receiving, at least a pair of stereoscopic images 702. Certain embodiments contemplate cropping a region of interest from each of the images, so as to reduce computation time. Once the images are received, the system determines keypoints from the first image 703. As mentioned, in some embodiments these keypoints may be determined using SIFT or other feature detection hardware, firmware, or software. In some embodiments, the system may also determine keypoints in the second image. The system may then correlate keypoints from the first image with pixel regions (such as a particular pixel position) in the second image 704. A “keypoint correlation” software, firmware, or hardware module may be configured to perform this operation. Certain portions of the operation may distributed among other modules (firmware, hardware, or software), creating other means for correlating keypoints. This operation may serve to identify the same image region in the first image in the second image.

Disparities D may then be calculated between each keypoint position of the first image and the correlated pixel positions of the second image 705. Where keypoints have been calculated for both images, the disparities between the keypoints may be determined by subtracting the relative positions of each of the correlated keypoints. The disparities may then be organized as a disparity histogram similar to 601 of FIG. 6 and a corresponding depth histogram, similar to histogram 602, may be determined. The depth histogram may then be used to determine the optimal focal depth for the sensor based on the selected statistic. A “disparity determination” software, firmware, or hardware module may be configured to perform this operation. Certain portions of the operation may distributed among other modules (firmware, hardware, or software), creating other means for determining disparities.

In the embodiments implementing process 700, to improve computation efficiency, the process 700 determines the statistic (in this case, the average) of the disparities 706 rather than converting to a depth for each disparity and then determining the average of the depths. Only the depth of the single statistical value need then be determined 707 with reference to a graph similar to that of FIG. 4. This depth may then be used as the new camera focus depth 708 during subsequent image captures. As mentioned, other embodiments may instead convert each of the disparities to a depth and then average the depths. Other embodiments may alternatively take the mean, median, or some other statistic to determine the desired focal depth. A “depth determination” software, firmware, or hardware module may be configured to perform this operation. The module may operate in conjunction with a disparity histogram. Certain portions of the operation may distributed among other modules (firmware, hardware, or software), creating other means for determining a focal depth. Once the focus depth has been determined, image sensor 101a may be adjusted. Sensor 101b may also be adjusted by the processor 113, or sensor 101b may track the focal depth of sensor 101a independently. As mentioned above, in certain embodiments, only a single sensor may be adjusted based upon the determined focal depth.

In variations of these embodiments, the system may alternatively use information from the scene to determine the depth, rather than simply take the average of the disparities. For example, in lieu of taking the average, keypoint disparities may be weighted based upon their presence in a single object and upon lighting conditions. For example, the histograms may be enhanced by weighting each point from the histogram by the focal quality associated with a certain focus distance. In certain camera arrangements if the focal point was set to 3 meters, objects between 2 m and infinity may have good focus, objects between 1 m-2 m may have fair focus, and objects between 0.5-1 m may have bad focus. The histograms of FIG. 6 would accordingly be weighted so that a preferred focal range was selected more often than the other ranges. This may be referred to as a “region weighted saliency” in certain embodiments. In other variations, frequency information from the image may be incorporated into the keypoints selection. Objects comprising textures may generate more keypoints than objects without textures or with little texture, and thereby affect the average. Accordingly, keypoints associated with textured objects may receive different weights from non-textured objects. In one variation, regions within a texture may be detected and these regions then used so as to lower the weight for keypoints in that region.

When capturing a stereoscopic movie, the process 700 may be applied to a single frame, i.e. a single pair of stereoscopic images. The determined focal depth may then be used by the image sensors during subsequent image captures until the camera arrangement or scene is modified so as to necessitate reassessment of the proper focus. The operation 700 therefore has the benefit that there need not be any state dependency. That is, a traditional auto-focus system would need to periodically time-out the movie capture process and capture multiple focal depths to reassess the focus. Process 700, in contrast, may be “instantaneous” in that it produces no frame delay. This facilitates seamless focus tracking. The process may further guarantee system stability, as there is no feedback (or dependency) between the current focus position and the focus position estimation. Additionally, since the focus operation may be accomplished with a single frame, the user's hand motion will be less likely to generate any blur.

As mentioned, the keypoints generated in steps 703 and 704 of process 700 may comprise any data structure which can assign an identity to a portion of the image and be consistently recognized when applied to the second image. As mentioned, in some embodiments the keypoints may comprise Scale Invariant Feature Transform (SIFT) keypoints generated from a SIFT feature generation module. FIG. 8 depicts another possible process for generating keypoints in a feature generation module.

The process 800 begins 801 by receiving 802 one of the pair of raw stereoscopic images. The image may then be subsampled 803, possibly to improve the algorithm's robustness to noise and to decrease computational demands. The image may then be passed through a horizontal high pass filter 804. In some embodiments, the filter may comprise a 3×4 kernel with a response given by

$h = [\begin{matrix} - 1 & - 1 & 1 & 1 \\ - 1 & - 1 & 1 & 1 \\ - 1 & - 1 & 1 & 1 \end{matrix}]$

The process may then calculate the power 805 of the image, i.e. by taking the square of each value. Finally, the process may threshold the values 806 to eliminate noise and low-power values. From among the remaining values which exceeded the threshold, the system will identify “maximum values”. In some embodiments the maximum values may be those image portions which exceeded the threshold, while in other embodiments the maximum values may be defined relative to their local neighbors. For example, the delta between neighboring pixels exceeding the threshold may be used to identify a maximum value. The identified maximum values represent keypoint positions which may be used for the disparity determination steps described above. The system may store these keypoint positions 807 before ending 808. By subsampling and thresholding the image in this manner the computation time required to determine pixel locations which may serve as keypoints may be reduced. As a keypoint in these embodiments comprises a pixel position, the pixel position may occasionally be referred to as a “keypoint”. However, one will readily recognize variations, wherein keypoints comprise both a position and an array of the neighboring pixel values and positions. Keypoints may also refer to frequency content of an image portion or to gradients in pixel value, rather than to pixel values or pixel positions directly. SIFT keypoints, for example, may comprise a vector indicating the pixel gradient.

Once the keypoints in the first image have been determined, it may still remain to correlate the keypoints with positions in the second image so that the disparities between image portions may be determined. A possible method for determining whether keypoints, such as the keypoints generated by the process 800 of FIG. 8, are correlated with a position in an image will now be described with respect to FIG. 9. FIG. 9 depicts stereoscopic image pair 900a and 900b. Based on the configuration of the two sensors used to capture images 900a and 900b, a search region 902 around each keypoint 901 may be determined. The search region 902 may specify the maximum distance a keypoint may be displaced in the left and right images (i.e., a maximum expected disparity) as a consequence of the capture device configuration. In some embodiments, as the image sensors may lack vertical disparity, keypoints 901 may generally lie on vertical edges in the scene.

Search region 902 is located in the same absolute position of each of images 900a and 900b. In FIG. 9, search region 902 comprises a rectangle having the height of a single pixel, since it is assumed that no vertical disparity between images 900a and 900b exists (i.e. only pixels in a same row are considered). The height of the region 902 may be increased in relation to the amount of vertical disparity that may be present between images 900a and 900b. The system may iterate through each pixel in the search region 902 of the second image 900b and determine the pixel's correspondence with the portions of image 900a surrounding the keypoint 901. This may be accomplished in some embodiments using a correlation metric described in further detail below.

In certain embodiments, keypoints may have been determined for both image 900a and image 900b, rather than simply for image 900a. When attempting to correlate keypoints between the images, the system may identify keypoints within the search region 902 in the second image 900b. If only one keypoint of image 900b is found in search region 902, this keypoint may be correlated with the keypoint 901 from the first image. Where more than one keypoint of image 900b is present in the region 902, the system may apply a correlation metric to each keypoint of image 900b in the search region 902 to determine which keypoint best corresponds with the keypoint 901 from the first image. As with metrics applied when keypoints are taken for only one image, the metric may consider pixel values neighboring the pixel positions of keypoints 901 and 901b to verify that keypoints 901 and 901b are more likely to refer to the same portion of the scene in each of images 900a and 900b. Where keypoints are created for both images, it may be necessary only to iterate between keypoints in the region 902 rather than between each pixel within the region.

In the embodiments described above, the system iterates through certain of the pixels in the search region 902 corresponding to the determined keypoint 901. The system may apply a correlation metric to each pixel in the region 902. The pixel in the region 902 having the maximum correlation with the region surrounding the position of keypoint 901 may then be correlated with the keypoint 901. The computational cost to iterate through each pixel of image 900b in the range 902 may be less than the cost to compute keypoints for all of image 900b and to then determine the correlations between each keypoint. In some embodiments, however, where only a few keypoints have been generated, the system may determine correlations between all the keypoints directly, rather than iterate between regions 902 associated with the keypoints of one image.

In certain embodiments, the correlation metric used to identify a keypoint or pixel position in image 900b corresponding to a keypoint in 900a may comprise the calculation of the mean square error for pixels surrounding a position in image 900b under consideration and the pixels surrounding the keypoint position in image 900a. That is, the mean square error of pixels neighboring keypoint 901 of the first image and the neighboring pixels for positions in search region 902 in image 900b may be used as a correlation metric. The mean square error may be calculated as:

$R (Δ) = \sum_{i = - 3}^{3} \sum_{j = - 3}^{3} {(S_{left} (i + M, j + N) - S_{right} (i + M + Δ, j + N))}^{2}$

where R is the mean squared error, S_leftcomprises the pixel values in the image 900a, S_rightcomprises the pixel values in the image 900b, M and N comprise the horizontal and vertical offsets into the image to the region 902 for the current keypoint or pixel position of region 902 under investigation, and Δ comprises the horizontal shift applied for the current position in the search region 902 (the first parameter to S_leftand S_rightis a column position/x-axis and the second a row position/y-axis). Although the mean squared error is within a 7×7 window in the above example, one may readily envision a range of window dimensions depending upon the resolution of the image and the subsampling applied. Furthermore, as sensors 101a and 101b are presumed not to have any vertical disparity in the above example, the search region 902 extends only horizontally and Δ appears only in the x-axis/column direction. More robust systems may compensate for errors in sensor positioning by increasing the height of the search region 902 and including a Δ in the vertical direction. As the image has been downsampled, sub pixel resolution may be determined using interpolation, such as polynomial interpolation, in some embodiments to facilitate more accurate determinations of the pixel in region 902 of image 900b correlated with the keypoint 901. That is, the displacement of sensor 101b relative to sensor 101a may not be an exact, integer number of pixels. Thus, particularly after sub-sampling, accurate correlation of the keypoint 901 may require including locations between pixels in the search region 902. The, position in region 902 of image 900b maximally correlated with keypoint 901 may fall between pixel positions, at an interpolated point.

The various illustrative logical blocks, modules, and circuits described in connection with the implementations disclosed herein may be implemented or performed with a general purpose processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. A general purpose processor may be a microprocessor, but in the alternative, the processor may be any conventional processor, controller, microcontroller, or state machine. A processor may also be implemented as a combination of computing devices, e.g., a combination of a DSP and a microprocessor, a plurality of microprocessors, one or more microprocessors in conjunction with a DSP core, or any other such configuration.

The steps of a method or process described in connection with the implementations disclosed herein may be embodied directly in hardware, in a software module executed by a processor, or in a combination of the two. A software module may reside in RAM memory, flash memory, ROM memory, EPROM memory, EEPROM memory, registers, hard disk, a removable disk, a CD-ROM, or any other form of non-transitory storage medium known in the art. An exemplary computer-readable storage medium is coupled to the processor such the processor can read information from, and write information to, the computer-readable storage medium. In the alternative, the storage medium may be integral to the processor. The processor and the storage medium may reside in an ASIC. The ASIC may reside in a user terminal, camera, or other device. In the alternative, the processor and the storage medium may reside as discrete components in a user terminal, camera, or other device.

Headings are included herein for reference and to aid in locating various sections. These headings are not intended to limit the scope of the concepts described with respect thereto. Such concepts may have applicability throughout the entire specification.

The previous description of the disclosed implementations is provided to enable any person skilled in the art to make or use the present invention. Various modifications to these implementations will be readily apparent to those skilled in the art, and the generic principles defined herein may be applied to other implementations without departing from the spirit or scope of the invention. Thus, the present invention is not intended to be limited to the implementations shown herein but is to be accorded the widest scope consistent with the principles and novel features disclosed herein.

Claims

1. A method in an electronic device for determining a focal depth for an image sensor, comprising: receiving a first image in the visible spectrum, the first image associated with a first viewpoint;receiving a second image in the visible spectrum, the second image associated with a second viewpoint;determining a first plurality of keypoints based on the first image;correlating keypoints from the first plurality of keypoints with positions in the second image;determining a plurality of disparities associated with each of the first plurality of keypoints to form a sparse depth map; anddetermining a focal depth for capturing a third image in the visible spectrum based upon the sparse depth map, the position of the first viewpoint and the position of the second viewpoint.
2. The method of claim 1, further comprising determining a second plurality of keypoints based on the second image.
3. The method of claim 2, wherein correlating keypoints from the first plurality of keypoints with positions in the second image comprises correlating keypoints from the first plurality of keypoints with keypoints from the second plurality of keypoints.
4. The method of claim 1, wherein correlating keypoints from the first plurality of keypoints with positions in the second image comprises iterating over pixels within a search range in the second image.
5. The method of claim 4, wherein correlating keypoints from the first plurality of keypoints with positions in the second image comprises determining the mean square error between pixels in the first image and the second image.
6. The method of claim 1, wherein determining a first plurality of keypoints based on the first image comprises determining Scale Invariant Feature Transform (SIFT) keypoints based on the first image.
7. The method of claim 1, wherein determining a first plurality of keypoints based on the first image comprises sub-sampling the first image, applying a high-pass filter to the first image, calculating the power of the first image, and thresholding the first image.
8. The method of claim 1, wherein correlating keypoints from the first plurality of keypoints with positions in the second image occurs in realtime.
9. The method of claim 1, wherein the electronic device comprises a mobile phone.
10. A non-transitory computer readable medium comprising instructions configured to cause a computer to perform the steps of: receiving a first image in the visible spectrum, the first image associated with a first viewpoint;receiving a second image in the visible spectrum, the second image associated with a second viewpoint;determining a first plurality of keypoints based on the first image;correlating keypoints from the first plurality of keypoints with positions in the second image;determining a plurality of disparities associated with each of the first plurality of keypoints to form a sparse depth map; anddetermining a focal depth for capturing a third image in the visible spectrum based upon the sparse depth map, the position of the first viewpoint and the position of the second viewpoint.
11. The non-transitory computer readable medium of claim 10, further comprising determining a second plurality of keypoints based on the second image.
12. The non-transitory computer readable medium of claim 11, wherein correlating keypoints from the first plurality of keypoints with positions in the second image comprises correlating keypoints from the first plurality of keypoints with keypoints from the second plurality of keypoints.
13. The non-transitory computer readable medium of claim 10, wherein correlating keypoints from the first plurality of keypoints with positions in the second image comprises iterating over pixels within a search range in the second image.
14. The non-transitory computer readable medium of claim 13, wherein correlating keypoints from the first plurality of keypoints with positions in the second image comprises determining the mean square error between pixels in the first image and the second image.
15. The non-transitory computer readable medium of claim 10, wherein determining a first plurality of keypoints based on the first image comprises determining Scale Invariant Feature Transform (SIFT) keypoints based on the first image.
16. The non-transitory computer readable medium of claim 10, wherein determining a first plurality of keypoints based on the first image comprises sub-sampling the first image, applying a high-pass filter to the first image, calculating the power of the first image, and thresholding the first image.
17. The non-transitory computer readable medium of claim 10, wherein correlating keypoints from the first plurality of keypoints with positions in the second image occurs in realtime.
18. The non-transitory computer readable medium of claim 10, wherein the computer is located in a mobile phone.
19. A system for focusing a stereoscopic capture device, the system comprising: a first image sensor configured to generate a first image in the visible spectrum, the first image associated with a first viewpoint;a second image sensor configured to generate a second image in the visible spectrum, the second image associated with a second viewpoint;a feature generation module configured to determine a first plurality of keypoints based on the first image;a keypoint correlation module configured to correlate keypoints from the first plurality of keypoints with positions in the second image;a disparity determination module configured to determine a plurality of disparities associated with each of the first plurality of keypoints to form a sparse depth map; anda depth determination module configured to determine a focal depth for capturing a third image in the visible spectrum based upon the sparse depth map, the position of the first viewpoint and the position of the second viewpoint.
20. The system of claim 19, wherein the feature generation module is configured to determine a second plurality of keypoints based on the second image.
21. The system of claim 20, wherein the module configured to correlate keypoints is configured to correlate keypoints from the first plurality of keypoints with keypoints from the second plurality of keypoints.
22. The system of claim 19, wherein the module configured to correlate keypoints is configured to iterate over pixels within a search range in the second image.
23. The system of claim 22, wherein correlating keypoints from the first plurality of keypoints with positions in the second image comprises determining the mean square error between pixels in the first image and the second image.
24. The system of claim 19, wherein the feature generation module is configured to determine Scale Invariant Feature Transform (SIFT) keypoints based on the first image.
25. The system of claim 19, wherein the feature generation module is configured to sub-sample the first image, apply a high-pass filter to the first image, calculate the power of the first image, and threshold the first image.
26. The system of claim 19, wherein the module configured to correlate keypoints correlates keypoints from the first plurality of keypoints with positions in the second image in realtime.
27. The system of claim 19, wherein the stereoscopic capture device is located on a mobile phone.
28. The system of claim 19, wherein the module configured to determine a focal depth comprises a disparity histogram.
29. A system for focusing a stereoscopic capture device, the system comprising: means for receiving a first image in the visible spectrum, the first image associated with a first viewpoint;means for receiving a second image in the visible spectrum, the second image associated with a second viewpoint;means for determining a first plurality of keypoints based on the first image;means for correlating keypoints from the first plurality of keypoints with positions in the second image;means for determining a plurality of disparities associated with each of the first plurality of keypoints to form a sparse depth map; andmeans for determining a focal depth for capturing a third image in the visible spectrum based upon the sparse depth map, the position of the first viewpoint and the position of the second viewpoint.
30. The system of claim 29, wherein the means for receiving a first image comprises a first sensor, the means for receiving a second image comprises a second sensor, the means for determining a first plurality of keypoints comprises a feature generation module, the means for correlating comprises a keypoint correlation module, the means for determining a plurality of disparities comprises a disparity determination module, and the means for determining a focal depth comprises a depth determination module.
31. The system of claim 29, wherein the means for determining a first plurality of keypoints is configured to determine a second plurality of keypoints based on the second image.
32. The system of claim 31, wherein the means for correlating keypoints from the first plurality of keypoints with positions in the second image is configured to correlate keypoints from the first plurality of keypoints with keypoints from the second plurality of keypoints.
33. The system of claim 29, wherein the means for correlating keypoints from the first plurality of keypoints with positions in the second image is configured to iterate over pixels within a search range in the second image.
34. The system of claim 33, wherein the means for correlating keypoints from the first plurality of keypoints with positions in the second image is configured to determine the mean square error between pixels in the first image and the second image.
35. The system of claim 29, wherein the means for determining a first plurality of keypoints is configured to determine Scale Invariant Feature Transform (SIFT) keypoints based on the first image.
36. The system of claim 29, wherein the means for determining a first plurality of keypoints is configured to sub-sample the first image, apply a high-pass filter to the first image, calculate the power of the first image, and threshold the first image.
37. The system of claim 29, wherein the means for correlating keypoints correlates the keypoints from the first plurality of keypoints with positions in the second image in realtime.
38. The system of claim 29, wherein the stereoscopic capture device is located on a mobile phone.

CLAIM OF PRIORITY UNDER 35 U.S.C. §119

The present Application for Patent claims priority to U.S. Provisional Patent Application Ser. No. 61/371,979, entitled “INSTANTANEOUS AUTOFOCUS FROM STEREO IMAGES,” filed Aug. 9, 2010; and U.S. Provisional Patent Application Ser. No. 61/489,231, entitled “AUTOFOCUS FOR STEREO IMAGES,” filed on May 23, 2011, which applications are incorporated by reference herein.

US Referenced Citations (213)

Number	Name	Date	Kind
4114171	Altman	Sep 1978	A
4437745	Hajnal	Mar 1984	A
4639586	Fender et al.	Jan 1987	A
4740780	Brown et al.	Apr 1988	A
4751570	Robinson	Jun 1988	A
5012273	Nakamura et al.	Apr 1991	A
5016109	Gaylord	May 1991	A
5063441	Lipton et al.	Nov 1991	A
5142357	Lipton et al.	Aug 1992	A
5194959	Kaneko et al.	Mar 1993	A
5207000	Chang et al.	May 1993	A
5231461	Silvergate et al.	Jul 1993	A
5243413	Gitlin et al.	Sep 1993	A
5313542	Castonguay	May 1994	A
5475617	Castonguay	Dec 1995	A
5539483	Nalwa	Jul 1996	A
5606627	Kuo	Feb 1997	A
5614941	Hines	Mar 1997	A
5640222	Paul	Jun 1997	A
5642299	Hardin et al.	Jun 1997	A
5686960	Sussman et al.	Nov 1997	A
5721585	Keast et al.	Feb 1998	A
5734507	Harvey	Mar 1998	A
5745305	Nalwa	Apr 1998	A
5793527	Nalwa	Aug 1998	A
5903306	Heckendorn et al.	May 1999	A
5926411	Russell	Jul 1999	A
5990934	Nalwa	Nov 1999	A
6111702	Nalwa	Aug 2000	A
6115176	Nalwa	Sep 2000	A
6128143	Nalwa	Oct 2000	A
6141145	Nalwa	Oct 2000	A
6144501	Nalwa	Nov 2000	A
6195204	Nalwa	Feb 2001	B1
6219090	Nalwa	Apr 2001	B1
6285365	Nalwa	Sep 2001	B1
6356397	Nalwa	Mar 2002	B1
6611289	Yu et al.	Aug 2003	B1
6628897	Suzuki	Sep 2003	B2
6650774	Szeliski	Nov 2003	B1
6700711	Nalwa	Mar 2004	B2
6701081	Dwyer et al.	Mar 2004	B1
6768509	Bradski et al.	Jul 2004	B1
6775437	Kazarinov et al.	Aug 2004	B2
6798406	Jones et al.	Sep 2004	B1
6809887	Gao et al.	Oct 2004	B1
6850279	Scherling	Feb 2005	B1
6855111	Yokoi et al.	Feb 2005	B2
6861633	Osborn	Mar 2005	B2
7006123	Yoshikawa et al.	Feb 2006	B2
7039292	Breiholz	May 2006	B1
7084904	Liu et al.	Aug 2006	B2
7116351	Yoshikawa	Oct 2006	B2
7215479	Bakin	May 2007	B1
7253394	Kang	Aug 2007	B2
7271803	Ejiri et al.	Sep 2007	B2
7336299	Kostrzewski et al.	Feb 2008	B2
7612953	Nagai et al.	Nov 2009	B2
7710463	Foote	May 2010	B2
7805071	Mitani	Sep 2010	B2
7817354	Wilson	Oct 2010	B2
7893957	Peters et al.	Feb 2011	B2
7961398	Tocci	Jun 2011	B2
8004557	Pan	Aug 2011	B2
8098276	Chang et al.	Jan 2012	B2
8115813	Tang	Feb 2012	B2
8139125	Scherling	Mar 2012	B2
8228417	Georgiev et al.	Jul 2012	B1
8267601	Campbell et al.	Sep 2012	B2
8284263	Oohara et al.	Oct 2012	B2
8294073	Vance et al.	Oct 2012	B1
8356035	Baluja	Jan 2013	B1
8400555	Georgiev et al.	Mar 2013	B1
8482813	Kawano et al.	Jul 2013	B2
8791984	Jones et al.	Jul 2014	B2
8988564	Webster et al.	Mar 2015	B2
9316810	Mercado	Apr 2016	B2
20010028482	Nishioka	Oct 2001	A1
20020070365	Karellas	Jun 2002	A1
20020136150	Mihara et al.	Sep 2002	A1
20030024987	Zhu	Feb 2003	A1
20030038814	Blume	Feb 2003	A1
20030214575	Yoshikawa	Nov 2003	A1
20040021767	Endo et al.	Feb 2004	A1
20040066449	Givon	Apr 2004	A1
20040105025	Scherling	Jun 2004	A1
20040183907	Hovanky et al.	Sep 2004	A1
20040246333	Steuart et al.	Dec 2004	A1
20040263611	Cutler	Dec 2004	A1
20050053274	Mayer et al.	Mar 2005	A1
20050057659	Hasegawa	Mar 2005	A1
20050081629	Hoshal	Apr 2005	A1
20050111106	Matsumoto et al.	May 2005	A1
20050185711	Pfister et al.	Aug 2005	A1
20050218297	Suda et al.	Oct 2005	A1
20060023074	Cutler	Feb 2006	A1
20060023106	Yee et al.	Feb 2006	A1
20060023278	Nishioka	Feb 2006	A1
20060140446	Luo et al.	Jun 2006	A1
20060193509	Criminisi et al.	Aug 2006	A1
20060215054	Liang et al.	Sep 2006	A1
20060215903	Nishiyama	Sep 2006	A1
20060238441	Benjamin	Oct 2006	A1
20070024739	Konno	Feb 2007	A1
20070058961	Kobayashi et al.	Mar 2007	A1
20070064142	Misawa	Mar 2007	A1
20070085903	Zhang	Apr 2007	A1
20070164202	Wurz et al.	Jul 2007	A1
20070216796	Lenel et al.	Sep 2007	A1
20070242152	Chen	Oct 2007	A1
20070263115	Horidan et al.	Nov 2007	A1
20070268983	Elam	Nov 2007	A1
20080088702	Linsenmaier et al.	Apr 2008	A1
20080117289	Schowengerdt	May 2008	A1
20080218612	Border et al.	Sep 2008	A1
20080259172	Tamaru	Oct 2008	A1
20080266404	Sato	Oct 2008	A1
20080290435	Oliver et al.	Nov 2008	A1
20080291543	Nomura et al.	Nov 2008	A1
20080297612	Yoshikawa	Dec 2008	A1
20080316301	Givon	Dec 2008	A1
20090003646	Au	Jan 2009	A1
20090005112	Sorek et al.	Jan 2009	A1
20090015812	Schultz et al.	Jan 2009	A1
20090051804	Nomura et al.	Feb 2009	A1
20090085846	Cho	Apr 2009	A1
20090096994	Smits	Apr 2009	A1
20090153726	Lim	Jun 2009	A1
20090160931	Pockett et al.	Jun 2009	A1
20090268983	Stone et al.	Oct 2009	A1
20090268985	Wong et al.	Oct 2009	A1
20090296984	Nijim et al.	Dec 2009	A1
20090315808	Ishii	Dec 2009	A1
20100044555	Ohara et al.	Feb 2010	A1
20100045774	Len et al.	Feb 2010	A1
20100066812	Kajihara et al.	Mar 2010	A1
20100165155	Chang	Jul 2010	A1
20100202766	Takizawa et al.	Aug 2010	A1
20100215249	Heitz	Aug 2010	A1
20100232681	Fujieda et al.	Sep 2010	A1
20100259655	Takayama	Oct 2010	A1
20100265363	Kim	Oct 2010	A1
20100278423	Itoh et al.	Nov 2010	A1
20100289878	Sato	Nov 2010	A1
20100290703	Sim et al.	Nov 2010	A1
20100302396	Golub et al.	Dec 2010	A1
20100309286	Chen et al.	Dec 2010	A1
20100309333	Smith	Dec 2010	A1
20110001789	Wilson et al.	Jan 2011	A1
20110007135	Okada et al.	Jan 2011	A1
20110009163	Fletcher et al.	Jan 2011	A1
20110012998	Pan	Jan 2011	A1
20110043623	Fukuta et al.	Feb 2011	A1
20110090575	Mori	Apr 2011	A1
20110096089	Shenhav et al.	Apr 2011	A1
20110096988	Suen et al.	Apr 2011	A1
20110128412	Milnes et al.	Jun 2011	A1
20110181588	Barenbrug et al.	Jul 2011	A1
20110213664	Osterhout et al.	Sep 2011	A1
20110235899	Tanaka	Sep 2011	A1
20110249341	Difrancesco et al.	Oct 2011	A1
20110304764	Shigemitsu et al.	Dec 2011	A1
20120008148	Pryce et al.	Jan 2012	A1
20120044368	Lin et al.	Feb 2012	A1
20120056987	Fedoroff	Mar 2012	A1
20120075168	Osterhout et al.	Mar 2012	A1
20120249750	Izzat	Oct 2012	A1
20120249815	Bohn et al.	Oct 2012	A1
20120269400	Heyward	Oct 2012	A1
20120281072	Georgiev	Nov 2012	A1
20120293607	Bhogal et al.	Nov 2012	A1
20120293632	Yukich	Nov 2012	A1
20120327195	Cheng	Dec 2012	A1
20130003140	Keniston et al.	Jan 2013	A1
20130010084	Hatano	Jan 2013	A1
20130070055	Atanassov et al.	Mar 2013	A1
20130141802	Yang	Jun 2013	A1
20130222556	Shimada	Aug 2013	A1
20130229529	Lablans	Sep 2013	A1
20130250053	Levy	Sep 2013	A1
20130250123	Zhang et al.	Sep 2013	A1
20130260823	Shukla et al.	Oct 2013	A1
20130278785	Nomura et al.	Oct 2013	A1
20130286451	Verhaegh	Oct 2013	A1
20130335598	Gustavsson et al.	Dec 2013	A1
20130335600	Gustavsson et al.	Dec 2013	A1
20140016832	Kong et al.	Jan 2014	A1
20140085502	Lin et al.	Mar 2014	A1
20140104378	Kauff et al.	Apr 2014	A1
20140111650	Georgiev et al.	Apr 2014	A1
20140139623	McCain et al.	May 2014	A1
20140152852	Ito et al.	Jun 2014	A1
20140184749	Hilliges et al.	Jul 2014	A1
20140192253	Laroia	Jul 2014	A1
20140285673	Hundley et al.	Sep 2014	A1
20150049172	Ramachandra et al.	Feb 2015	A1
20150070562	Nayar et al.	Mar 2015	A1
20150125092	Zhuo et al.	May 2015	A1
20150177524	Webster et al.	Jun 2015	A1
20150244934	Duparre et al.	Aug 2015	A1
20150253647	Mercado	Sep 2015	A1
20150286033	Osborne	Oct 2015	A1
20150288865	Osborne	Oct 2015	A1
20150370040	Georgiev	Dec 2015	A1
20150371387	Atanassov	Dec 2015	A1
20150373252	Georgiev	Dec 2015	A1
20150373262	Georgiev	Dec 2015	A1
20150373263	Georgiev	Dec 2015	A1
20150373268	Osborne	Dec 2015	A1
20150373269	Osborne	Dec 2015	A1
20150373279	Osborne	Dec 2015	A1
20160085059	Mercado	Mar 2016	A1
20160127646	Osborne	May 2016	A1

Foreign Referenced Citations (40)

Number	Date	Country
101201459	Jun 2008	CN
101581828	Nov 2009	CN
0610605	Aug 1994	EP
0751416	Jan 1997	EP
1176812	Jan 2002	EP
1383342	Jan 2004	EP
1816514	Aug 2007	EP
1832912	Sep 2007	EP
2242252	Oct 2010	EP
2354390	Mar 2001	GB
2354391	Mar 2001	GB
H089424	Jan 1996	JP
H08125835	May 1996	JP
8194274	Jul 1996	JP
H08242453	Sep 1996	JP
2001194114	Jul 2001	JP
2003304561	Oct 2003	JP
3791847	Jun 2006	JP
2006279538	Oct 2006	JP
2007147457	Jun 2007	JP
2007323615	Dec 2007	JP
2008009424	Jan 2008	JP
H0847001	Sep 2008	JP
2010041381	Feb 2010	JP
2010067014	Mar 2010	JP
2010128820	Jun 2010	JP
2010524279	Jul 2010	JP
WO-9321560	Oct 1993	WO
WO-9847291	Oct 1998	WO
WO-2006075528	Jul 2006	WO
WO-2007129147	Nov 2007	WO
WO-2008112054	Sep 2008	WO
WO-2009047681	Apr 2009	WO
WO-2009086330	Jul 2009	WO
WO-2010019757	Feb 2010	WO
WO-2012136388	Oct 2012	WO
WO-2012164339	Dec 2012	WO
WO-2013154433	Oct 2013	WO
WO-2014012603	Jan 2014	WO
WO-2014025588	Feb 2014	WO

Non-Patent Literature Citations (13)

Entry
International Search Report and Written Opinion—PCT/US2011/047126—ISA/EPO—Oct. 26, 2011.
Narkhede, et al., “Stereoscopic Imaging: A Real-Time, In Depth Look”, IEEE Potentials, Feb./Mar. 2004, vol. 23, Issue 1, pp. 38-42.
Hoff, et al., “Surfaces from Stereo: Integrating Feature Matching Disparity Estimation, and Contour Detection”, IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 11, No. 2, Feb. 1989.
Arican, et al., “Intermediate View Generation for Perceived Depth Adjustment of Sterio Video”, Mitsubishi Electric Research Laboratories, http://www.merl.com, TR2009-052, Sep. 2009.
Han Y., et al., “Removing Illumination from Image Pair for Stereo Matching”, Audio, Language and Image Processing (ICALIP), 2012 International Conference on, IEEE, Jul. 16, 2012, XP032278010, pp. 508-512.
Krotkov E., et al., “Active vision for reliable ranging: Cooperating focus, stereo, and vergence”, International Journal of Computer Vision. vol. 11, No. 2, Oct. 1, 1993 (Oct. 1, 1993), pp. 187-203, XP055149875, ISSN: 0920-5691. DOI: 10.1007/BF01469228.
Murphy M., et al., “Lens Drivers Focus on Performance in High-Resolution Camera Modules,” Analog Dialogue, Nov. 2006, vol. 40, pp. 1-3.
Ricoh Imagine Change: “New RICOH THETA Model, Capturing 360-degree Images in One Shot, is on Sale Soon—Spherical Video Function, API and SDK (Beta Version)”, News Release, Oct. 28, 2014, 3 pages.
Shuchun Y., et al., “Preprocessing for stereo vision based on LOG filter”, Proceedings of 2011 6th International Forum on Strategic Technology, Aug. 2011, XP055211077, pp. 1074-1077.
Sun W.S., et al., “Single-Lens Camera Based on a Pyramid Prism Array to Capture Four Images,” Optical Review, 2013, vol. 20 (2), pp. 145-152.
Hao M., et al., “Object Location Technique for Binocular Stereo Vision Based on Scale Invariant Feature Points”, SIFT, Journal of Harbin Engineering University, Jun. 2009, vol. 30, No. 6 pp. 649-653.
Kawanishi T., et al., “Generation of High-Resolution Stereo Panoramic Images by Omnidirectional Imaging Sensor Using Hexagonal Pyramidal Mirrors”, Patiern Recognition, 1998, Proceedings, Fourteenth International Conference on Brisbane, QLD., Australia Aug. 16-20, 1998, Los Alamitos, CA, USA,IEEE Comput. Soc, US, Jan. 1, 1998 (Jan. 1, 1998), pp. 485-489, vol. 1, XP031098377, ISBN: 978-0-8186-8512-5.
Tan K-H., et al., “Multiview Panoramic Cameras Using a Pyramid”, Omnidirectional Vision, 2002 Proceedings, Third Workshop on Jun. 2, 2002, Piscataway, NJ, USA,IEEE Jan. 1, 2002 (Jan. 1, 2002), pp. 87-93, XP010611080, ISBN: 978-0-7695-1629-5.

Related Publications (1)

	Number	Date	Country
	20120033051 A1	Feb 2012	US

Provisional Applications (2)

	Number	Date	Country
	61371979	Aug 2010	US
	61489231	May 2011	US

Autofocus for stereo images

Information

Patent Number

Date Filed

Date Issued

Inventors

Original Assignees

Examiners

Agents

CPC

Field of Search

US

CPC

International Classifications

Abstract