Image sensors and sensing methods to obtain time-of-flight and phase detection information

Description

FIELD

Embodiments disclosed herein relate in general to digital cameras and in particular to thin zoom digital cameras.

BACKGROUND

Recently, mobile devices such as cellphones (and in particular smartphones) have started to incorporate projection-and-imaging Time-of-Flight (ToF) systems. ToF systems are used for 3D based (or depth map) applications such as background-foreground segmentation, face modeling, security face identification (such as unlocking the mobile device, authorizing payments, etc.), augmented reality, camera effects (such as Bokeh), avatar animation, etc.

In ToF systems, depth information is obtained by measuring the travel time of a light pulse emitted by a light source by means of an image sensor with very high temporal resolution on a nanosecond (ns) time scale. In general, a ToF system uses light in the near infrared (NIR) region, referring to a wavelength range of about 780 nm-1120 nm, which is invisible to the human eye. Techniques for ToF can be divided into direct ToF (d-ToF) and indirect ToF (i-ToF), see FIG. 1. In d-ToF, an arrival time of a single light pulse is measured by a single photon avalanche diode (SPAD) sensor. In i-ToF, some form of light pulse is sent in several intervals and the arrival time is measured by continuously accumulating charges within buckets. These buckets have defined timestamps called demodulation taps (storage floating diffusion or “FD”). Typical modulation frequencies are about 100 MHz.

According to the emitted light pulse shape, i-ToF techniques are divided into continuous wave i-ToF (“CW ToF”) and pulsed i-ToF (or “gated ToF”). In CW i-ToF, a sine signal is emitted repeatedly. In gated ToF, a rectangular function is emitted repeatedly. A ToF image sensor pixel that acts as the receiver of the back-reflected light pulses can be operated in CW ToF and gated ToF mode. In gated ToF, the signals of an image pixel's demodulation taps (storage FD) refer to spatial “3D slices” of a scene. Each tap is operated at a specific delay time with respect to pulse transmission and thus captures only light returning from a specific depth range. The depth of a 3D slice is defined by the pulse length.

Another 3D imaging method is stereo imaging. Images of a scene are captured from two points of view (POV) that are separated from each other by a vector called “baseline” B (not shown herein). 3D information is extracted by triangulation, which examines the relative positions of objects in the two images.

Phase-Detect Auto-Focus (PDAF) pixels can be used for stereo imaging with one aperture only. The most common implementation is dual-pixel autofocus (“2-PD AF”). In 2-PD AF, a sensor pixel is covered by one on-chip microlens (OCL) and divided into two photodiodes (PDs) I and II, as shown in FIG. 2. The signals of all PD Is and of all PD IIs of an image sensor including the 2-PD pixels correspond to images of a scene as seen through the right side and the left side of a camera's lens respectively. So if one outputs all PD Is for forming a first image and all PD IIs for forming a second image, a stereo image pair is obtained that has a baseline B=aperture radius and a disparity of zero at the focus plane. From the stereo image pair, a stereo depth map can be calculated as known in the art. Here and further on, the terms “left side” and “right side” are used as examples and are representative of any other orientation of the pixel in a camera and scene, such as “bottom side” and “top side” etc. When referring to stereo cameras hereinbelow, the reference is to a PDAF based stereo camera with a single aperture having a “virtual” baseline.

The disclosure below deals solely with i-ToF. For simplicity, the term ToF replaces i-ToF in the entire disclosure. In the following, ToF (i.e. i-ToF) pixels are shown as equivalent circuits (such as in FIG. 3A-B and FIG. 4A-C) for describing electrical properties of the pixel, or as schematic drawings (such as in FIG. 6A-B and FIG. 7) for describing optical and/or visual properties of the pixel. For the sake of simplicity, we refer to the pixel as “ToF pixel” or “pixel” in both representations.

FIG. 3A shows a known art embodiment of a “4-tap” (or “4-phase”) ToF pixel. 4-tap refers to the four “ToF phases” (or simply “phases”) sampled in each measurement cycle. “ToF phase” or “phase” refers here and in the following to the phase relation of 1) a transmitted periodic light signal and 2) the returning periodic light signal captured by a ToF image sensor. It is not to be confused with the definition of “PDAF” given above. Generated charges are stored in 4 “storage nodes” C_A-C_D (sometimes called “charge collection bins”) that are assigned to PGA (“pulse generator A”, sometimes called FDA for “floating diffusion node A”), PGB, PGC and PGD. The charges correspond to light that reaches the sensor having 4 phases (usually 0 deg, 90 deg, 180 deg and 270 deg) with respect to the phase of the transmitted light. The charges in each storage node are converted to output signals V_out via source followers (S/F) acting as buffers. The distance of an object point can be calculated for each pixel from the four output signals V_out, as known in the art. Four phases are necessary for demodulating a sine-like modulation signal. In 4-tap ToF, a depth map can be calculated for each frame (often referred to as “1-shot depth map”). In the following, the terms “storage node”, “PGA” and “phase” etc. may be used interchangeably, implying that a pulse generator such as PGA corresponds to a particular phase and is accompanied by a storage node such as C_A.
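The four-phase depth calculation referred to above as known in the art can be sketched as follows. This is a minimal illustration under an assumed idealized signal model, not the calculation of any particular sensor: each tap signal is taken to be offset + amplitude·cos(phi − theta), where phi is the phase delay of the returning light and theta is the tap phase. The function names and the model are assumptions made for illustration.

```python
import math

C_LIGHT = 299_792_458.0  # speed of light in m/s


def simulate_taps(distance_m, f_mod_hz, amplitude=1.0, offset=2.0):
    """Return idealized tap signals for the 0, 90, 180 and 270 deg phases.

    Assumed model (not from the source): tap(theta) = offset + amplitude * cos(phi - theta),
    with phi = 4 * pi * f_mod * distance / c (round trip phase delay).
    """
    phi = 4.0 * math.pi * f_mod_hz * distance_m / C_LIGHT
    return [offset + amplitude * math.cos(phi - math.radians(theta))
            for theta in (0, 90, 180, 270)]


def depth_from_four_phases(q0, q90, q180, q270, f_mod_hz):
    """Estimate the distance from four phase samples under the model above."""
    # The differences cancel the offset (e.g. background light) and leave
    # 2*amplitude*sin(phi) and 2*amplitude*cos(phi).
    phi = math.atan2(q90 - q270, q0 - q180) % (2.0 * math.pi)
    return C_LIGHT * phi / (4.0 * math.pi * f_mod_hz)


taps = simulate_taps(distance_m=1.0, f_mod_hz=100e6)
estimated = depth_from_four_phases(*taps, f_mod_hz=100e6)
```

Under this model the result is unambiguous only up to c/(2·f_mod), about 1.5 m at 100 MHz. Larger distances wrap around, which is the aliasing that dual-frequency modulation addresses.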

Other embodiments known in the art may use a “2-tap” (or “2-phase”) ToF pixel shown in FIG. 3B. For obtaining the 4 phases necessary for depth calculation, image data from two frames may be required. Typically, one may use two consecutive frames for depth calculation. In a 2-tap implementation, two frames are required for calculating a 1-shot depth map.

In some examples commercially available today, 2-tap and 4-tap ToF cameras are used in a “tap-shuffle” read-out mode in order to mitigate sensor artifacts. For tap-shuffle, in a first frame the 2 phases (or 4 phases) are sampled in a “regular” order, i.e. PGA may sample the 0 deg phase and PGB may sample the 180 deg phase. In a second frame, the 2 phases (or 4 phases) are sampled in a “reversed” order, i.e. PGA may sample the 180 deg phase and PGB may sample the 0 deg phase. For depth map calculation, averaged signals of both frames are used. That is, 4 frames are required for a 2-tap tap-shuffle depth map, and 2 frames are required for a 4-tap tap-shuffle depth map. Tap-shuffle increases the depth map accuracy, which is beneficial, but it increases the capture time of a depth map.

In some examples of ToF cameras commercially available today, a dual-frequency modulation is used for mitigating aliasing effects that lead to ambiguous depth measurements. For dual-frequency modulation the ToF pixel is operated at a first modulation frequency (e.g. 90 MHz) for generating a first depth map. For a second depth map the ToF pixel is operated at a second modulation frequency (e.g. 50 MHz). The final depth map is generated by a pixel-level calculation based on inputs from the first and the second depth map. For generating a depth map that uses both tap-shuffle and dual frequency modulation, a 2-tap depth map requires 8 frames and a 4-tap depth map requires 4 frames.
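The frame counts stated above follow from simple multiplication, which can be sketched as below. The function name is hypothetical. The sketch assumes, as in the text, that 4 phases are needed per depth map, that tap-shuffle doubles the number of frames, and that dual-frequency modulation doubles it again.

```python
def frames_per_depth_map(taps, tap_shuffle=False, dual_frequency=False):
    """Number of frames needed for one depth map built from 4 phases."""
    if taps not in (2, 4):
        raise ValueError("only 2-tap and 4-tap pixels are considered here")
    frames = 4 // taps      # 2-tap: 2 frames, 4-tap: 1 frame
    if tap_shuffle:
        frames *= 2         # one regular and one reversed tap order
    if dual_frequency:
        frames *= 2         # one set of frames per modulation frequency
    return frames


for taps in (2, 4):
    print(taps, "tap:",
          frames_per_depth_map(taps),
          frames_per_depth_map(taps, tap_shuffle=True),
          frames_per_depth_map(taps, tap_shuffle=True, dual_frequency=True))
```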

ToF image sensors can also be used for “regular” 2D imaging, i.e. for generating 2D images not including depth information.

Downsides of stereo imaging are for example missing disparity information for in-focus objects or for scene segments not including textures or any contrast gradients and a small baseline of single-aperture stereo cameras.

Downsides of ToF are for example low signal-to-noise ratios (SNR) for specular objects, scenes with high amount of background light, large lens-object distances as well as artifacts such as “flying pixel”, “multi-path”, multi-user interference and motion blur.

A challenge in smartphone based computational photography is to overcome the downsides of ToF and stereo imaging.

It would be beneficial to have a ToF image sensor that provides both time-of-flight image data as well as 2PD stereo image information and a method based on the output of this image sensor for generating a fused ToF/stereo vision depth map.

SUMMARY

In various embodiments, there are provided image sensor pixels comprising: a plurality of sub-pixels, each sub-pixel including a photodiode; a microlens covering the plurality of sub-pixels; and a read-out circuit (ROC) for extracting indirect time-of-flight (i-ToF) phase signals of each sub-pixel individually, wherein the image sensor pixel is an i-ToF image sensor pixel.

In some embodiments, the plurality of sub-pixels includes 2 sub-pixels.

In some embodiments, the plurality of sub-pixels includes 4 sub-pixels.

In some embodiments, each sub-pixel is a 4-tap pixel including 4 pulse generators.

In some embodiments, an i-ToF image sensor pixel includes a switch, wherein in one state the switch is closed so that the sub-pixels together form one pixel and the ROC reads out the one pixel for generating an i-ToF depth map, and wherein in another state the switch is opened so that the ROC reads out the sub-pixels individually for generating a stereo depth map.

In some embodiments, an image sensor pixel as above or below is included in an image sensor of a camera having a focal length f in the range of 1.5 mm-10 mm.

In some embodiments, an image sensor pixel as above or below is included in an image sensor of a camera having an f-number f/# in the range of 1-3.

In some embodiments, an image sensor pixel as above or below is included in an image sensor of a stereo camera having a baseline B in the range of 0.5 mm-10 mm.

In some embodiments, each sub-pixel has a size of 1 μm-10 μm.

In some embodiments, the i-ToF image sensor pixel is integrated in an image sensor of a stereo camera that has a vertical baseline and a horizontal baseline.

In various embodiments, there are provided image sensors comprising a first image sensor pixel surrounded by regular i-ToF pixels as above or below, and a closest second image sensor pixel as above or below located at least five pixels away from the first image sensor pixel.

In some embodiments, the using a ROC to read out i-ToF phase signals of each sub-pixel individually includes using the ROC to read out all i-ToF signals generated by the plurality of the sub-pixels.

In some embodiments, the using a ROC to read out i-ToF phase signals of each sub-pixel individually includes using the ROC to read out fewer than all i-ToF signals generated by the plurality of the sub-pixels.

In some embodiments, the read out i-ToF phase signals are used to calculate a relative ToF depth map.

In some embodiments, the read out of fewer than all i-ToF phase signals reduces a cycle time required for phase image capturing by more than 50% with respect to a cycle time where all the i-ToF phase signals are read out.

In some embodiments, the read out of fewer than all i-ToF signals includes a read out of one i-ToF signal.

In some embodiments, the one i-ToF phase signal is the i-ToF phase signal that includes the highest amount of image information of a scene.

In some embodiments, the read out i-ToF phase signals correspond to a stereo camera having a vertical or a horizontal baseline.

In some embodiments, the read out i-ToF phase signals correspond to a stereo camera having a vertical and a horizontal baseline.

In some embodiments, the using a read-out circuit to read out i-ToF phase signals of each sub-pixel individually further includes:

obtaining a stereo depth map calculated from the i-ToF phase signals,

obtaining an i-ToF depth map calculated from the i-ToF phase signals,

analyzing the stereo depth map and the i-ToF depth map for assigning stereo scores to segments of the stereo depth map and ToF scores to the segments of the i-ToF depth map, and

generating a fused depth map by using stereo depth map data for segments that have high stereo scores and using ToF depth map data for segments that have high ToF scores.

In some embodiments, the generating the fused depth map is done without using ToF depth map information, and the stereo depth map is calculated from the i-ToF phase signals from a single frame.

In some embodiments, the i-ToF depth map is a 1-shot depth map.

In some embodiments, the i-ToF phase signals that are used for calculating the i-ToF depth map are obtained at 2 different modulation frequencies.

In some embodiments, the i-ToF phase signals that are used for calculating the i-ToF depth map are obtained by shuffling pulse generators in the pixel.

In some embodiments, the using a ROC to read out i-ToF phase signals of each sub-pixel individually includes using the ROC to read out all i-ToF signals generated by the plurality of the sub-pixels.

In some embodiments, fewer than all i-ToF phase signals are read out and used for obtaining the stereo depth map.

In some embodiments, only one of the i-ToF phase signals is read out and used for obtaining the stereo depth map.

In some embodiments, the relative ToF depth map is used to generate a high fps depth map stream having a fps greater than 30.

In some embodiments, the 1-shot depth map is used to generate a high fps depth map stream having a fps greater than 30.

In some embodiments, the fps is greater than 50.

In some embodiments, the fps is greater than 75.

In some embodiments, the only one of the i-ToF phase signals that is read out is the i-ToF phase signal that includes the highest amount of image information of a scene.

BRIEF DESCRIPTION OF THE DRAWINGS

Non-limiting examples of embodiments disclosed herein are described below with reference to figures attached hereto that are listed following this paragraph. Identical structures, elements or parts that appear in more than one figure are generally labeled with a same numeral in all the figures in which they appear. The drawings and descriptions are meant to illuminate and clarify embodiments disclosed herein, and should not be considered limiting in any way.

FIG. 1 illustrates direct (d-ToF) and indirect ToF (i-ToF) techniques;

FIG. 2 illustrates a system for 2PD AF using Phase-Detect Auto-Focus (PDAF) pixels;

FIG. 3A shows a known art embodiment of a “4-tap” ToF pixel;

FIG. 3B shows a known art embodiment of a “2-tap” ToF pixel;

FIG. 4A shows an embodiment of an equivalent circuit of a 2PD ToF pixel disclosed herein;

FIG. 4B shows another embodiment of an equivalent circuit of a 2PD ToF pixel disclosed herein;

FIG. 4C shows an embodiment of an equivalent circuit of a 4PD ToF pixel disclosed herein;

FIG. 4D shows yet another embodiment of a 2PD ToF pixel disclosed herein;

FIG. 4E shows a 2PD ToF depth measurement scenario disclosed herein;

FIG. 5 shows a method for generating a depth map based on a 2PD ToF pixel disclosed herein;

FIG. 6A shows a top view on a 2PD ToF pixel disclosed herein;

FIG. 6B shows a top view on another 2PD ToF pixel disclosed herein;

FIG. 7 shows a top view on a 4PD ToF pixel disclosed herein;

FIG. 8 shows a comparison of the expected disparity versus object-lens distance using an image sensor and sensing method disclosed herein.

DETAILED DESCRIPTION

FIG. 4A shows an embodiment of an equivalent circuit of a 2PD ToF pixel disclosed herein, the pixel numbered 400. Pixel 400 is divided into a 1st sub-pixel (also marked henceforth as SP1) and a 2nd sub-pixel (also marked henceforth as SP2), where each sub-pixel (SP) resembles a “2-tap” ToF pixel. Each SP includes one PD: sub-pixel 1 (SP1) includes PD1 and sub-pixel 2 (SP2) includes PD2. Each PD is connected to two storage nodes, so that charge collected from PD1 is stored in PGA1 and PGB1, and charge collected from PD2 is stored in PGA2 and PGB2. With reference to FIG. 2, PD1 and PD2 may respectively correspond to a left PD and to a right PD that collect light passing through a right side and a left side of a camera's lens respectively. Thus all sub-pixels 1 may correspond to a left side of a camera's lens and all sub-pixels 2 may correspond to a right side of a camera's lens. In conclusion, a stereo image with baseline B=aperture radius may be obtained by considering all or some plurality (of number N) of 1st sub-pixels forming a left-side 2D image and all or some plurality (of number N) of 2nd sub-pixels forming a right-side 2D image.

FIG. 4B shows another embodiment of an equivalent circuit of a 2PD ToF pixel disclosed herein, the pixel numbered 410. Pixel 410 is divided into a 1st and a 2nd SP, where each SP resembles a “4-tap” ToF pixel. SP1 includes PD1 and SP2 includes PD2. Each PD is connected to four storage nodes, so that charge collected from PD1 is stored in PGA1, PGB1, PGC1 and PGD1 and charge collected from PD2 is stored in PGA2, PGB2, PGC2 and PGD2. With reference to FIG. 2, and as in pixel 400, PD1 and PD2 may correspond to a left PD and to a right PD that collect light passing through a right side and a left side of a camera's lens respectively. Pixel 400 or pixel 410 may be included in a 2PD ToF pixel with a pixel layout such as shown in FIG. 6A and FIG. 6B.

In a “binning mode”, SPs of ToF pixels may be combined into a single “effective” pixel. In some examples, a binning mode may be implemented in the analog domain by adding the signals V_out of equal phases, for example and with reference to FIG. 4A by adding V_out,A1 and V_out,A2 as well as adding V_out,B1 and V_out,B2. In other examples, a binning mode may be implemented by adding charges present in the storage nodes C of equal phases, for example and with reference to FIG. 4A by adding C_A1 and C_A2 as well as adding C_B1 and C_B2. In yet other examples, a binning mode may be implemented in the digital domain.
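A digital-domain binning of the kind mentioned above can be sketched as a sum over equal-phase signals of the sub-pixels. The data layout (one dictionary per sub-pixel, keyed by tap name) and the function name are assumptions made for illustration only.

```python
def bin_sub_pixels(sub_pixel_taps):
    """Sum equal-phase tap signals across the sub-pixels of one pixel.

    sub_pixel_taps: list of dicts, one per sub-pixel, mapping a tap name
    (e.g. "A", "B") to its digitized signal value.
    Returns one dict describing the single "effective" pixel.
    """
    binned = {}
    for taps in sub_pixel_taps:
        for name, value in taps.items():
            binned[name] = binned.get(name, 0.0) + value
    return binned


# Hypothetical values for SP1 and SP2 of a 2-tap 2PD pixel.
sp1 = {"A": 1.0, "B": 2.0}
sp2 = {"A": 3.0, "B": 4.5}
effective_pixel = bin_sub_pixels([sp1, sp2])
```

The same function covers a 4PD pixel by passing four dictionaries instead of two.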

FIG. 4C shows an embodiment of an equivalent circuit of a 4PD ToF pixel disclosed herein, the pixel numbered 420. Pixel 420 is divided into four SPs marked SP1, SP2, sub-pixel 3 (SP3) and sub-pixel 4 (SP4), where each SP resembles a “2-tap” ToF pixel. SP1 includes PD1, SP2 includes PD2, SP3 includes PD3 and SP4 includes PD4. Each PD is connected to two storage nodes, so that charge collected from each PD is stored in the two connected storage nodes C_Ai and C_Bi where i=1, . . . , 4. Pixel 420 may be included in a pixel such as shown in FIG. 7. As an example and with reference to FIG. 2 and FIG. 7, SP1 may correspond to a top-left SP such as SP 704 that collects light passing through a bottom-right side of a camera's lens. SP2 may correspond to a bottom-left SP such as SP 706 that collects light passing through a top-right side of a camera's lens etc. In stereo imaging such as phase imaging, a depth in a scene that varies along only one direction can be sensed with a stereo camera having a baseline parallel to that direction, but not with a stereo camera having a baseline which is orthogonal to that direction. As known in the art, with a sensor having only 2PD pixels of identical orientation, only a horizontal or only a vertical depth can be sensed by phase imaging. With a 4PD pixel described herein both a horizontal and a vertical depth can be sensed by phase imaging.

Other 4-PD embodiments may include 4 SPs realized in a 4-tap ToF pixel structure, i.e. each SP i (i=1, . . . , 4) may have 4 storage nodes PGA_i-PGD_i. Charges collected by each PD of the 4 PDs may be stored in the 4 storage nodes PGA_i-PGD_i (i=1, . . . , 4). For example, charges collected in PD1 may be stored in each of C_A1, C_B1, C_C1 and C_D1, etc.

FIG. 4D shows another embodiment of a 2PD ToF pixel disclosed herein, the pixel numbered 430. An image sensor based on pixel 430 is not to be used for simultaneously generating a ToF and a stereo depth map, but for generating a ToF depth map or a stereo depth map.

For generating a ToF depth map, switch 432 is closed (not shown), so that PD1 and PD2 together form one PD. The one PD is driven in a 2-tap ToF pixel and a ToF depth map is calculated as known in the art.

For generating a stereo depth map, switch 432 is opened (as shown in FIG. 4D), and PD1 and PD2 are separated from each other. PD1 and PD2 are read out separately and a stereo depth map is calculated as known in the art.

In some embodiments, the switches of all pixels included in a ToF image sensor may be controlled together, i.e. the switches of all pixels may be opened, or the switches of all pixels may be closed. In other embodiments, each pixel or each group of pixels may be controlled individually. For example, based on information from past images or frames, one may open or close the switch of a particular pixel for calculating a stereo depth or a ToF depth of this particular pixel.

FIG. 4E shows a 2PD ToF depth measurement scenario disclosed herein, with a camera 440 including a 2PD ToF pixel 442. A signal from a first object (“object 1”) distanced at z₁ from camera 440 passes the left half of the camera's lens, and a signal from a second object (“object 2”) distanced at z₂>z₁ from camera 440 passes the right half of the camera's lens.

If a pixel like 430 is used for calculating a ToF depth, the depth signal will suffer from “flying pixel” artifact. For generating a ToF depth, in a pixel like 430 PD1 and PD2 together form one PD. In the given scenario this means that the depth signals of object 1 (at z₁) and object 2 (at z₂) are intermixed, leading to a flying pixel depth signal (“z_FP”) which provides a depth signal z₁<z_FP<z₂.

If a pixel like 400 or like 410 is used for calculating a ToF depth, the depth signal will not suffer from “flying pixel” artifact, as for generating a ToF depth, PD1 and PD2 can be evaluated independently.
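The flying pixel effect in the scenario of FIG. 4E can be illustrated numerically. The sketch below assumes an idealized model that is not taken from the source: each tap signal is amplitude·cos(phi − theta), both objects return equal amplitudes, and a merged PD simply adds the tap signals of the two halves. All names and values are hypothetical.

```python
import math

C_LIGHT = 299_792_458.0  # speed of light in m/s
F_MOD = 50e6             # assumed modulation frequency in Hz


def taps_for(distance_m, amplitude=1.0):
    """Idealized 0/90/180/270 deg tap signals for one object distance."""
    phi = 4.0 * math.pi * F_MOD * distance_m / C_LIGHT
    return [amplitude * math.cos(phi - math.radians(theta))
            for theta in (0, 90, 180, 270)]


def depth_from_taps(taps):
    """Four-phase depth estimate under the model above."""
    q0, q90, q180, q270 = taps
    phi = math.atan2(q90 - q270, q0 - q180) % (2.0 * math.pi)
    return C_LIGHT * phi / (4.0 * math.pi * F_MOD)


z1, z2 = 1.0, 2.0                       # object 1 and object 2 distances in m
taps_pd1, taps_pd2 = taps_for(z1), taps_for(z2)

# PD1 and PD2 merged into one PD: the tap signals of both objects are mixed.
merged = [a + b for a, b in zip(taps_pd1, taps_pd2)]
z_fp = depth_from_taps(merged)          # lies between z1 and z2

# PD1 and PD2 evaluated independently: each depth is recovered on its own.
z_pd1, z_pd2 = depth_from_taps(taps_pd1), depth_from_taps(taps_pd2)
```

With equal amplitudes the mixed depth lands midway between the two objects. Unequal amplitudes would pull it toward the brighter object, but it would still correspond to neither real surface.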

FIG. 5 shows a method for generating a depth map based on a 2-tap 2PD ToF pixel like pixel 400. In 2-tap, two phases (PGA and PGB) are measured in each image (or “frame”). In step 502 a first phase image is captured. In the first image and as an example, PGA1 and PGA2 may both be configured to capture a 0 deg phase, and PGB1 and PGB2 may both be configured to capture a 180 deg phase. In step 504, a second phase image is captured. In the second image and as an example, PGA1 and PGA2 may both be configured to capture a 90 deg phase, and PGB1 and PGB2 may both be configured to capture a 270 deg phase.

In step 506 all phase values are output for further processing. Further processing may be performed by an application processor (AP) or any other processing device, as known in the art. The further processing includes the calculation and analysis of a stereo depth map (steps 508a-512a) as well as the calculation and analysis of a ToF depth map (steps 508b-510b). Steps 508a-512a and steps 508b-510b may be performed sequentially or in parallel such as depicted in FIG. 5.

With reference to first and second images described above, consider a first example (“Example 1”) referring to a 2-tap pixel and a “1-shot depth map” approach. In Example 1, both step 502 and step 504 are performed once for capturing two images that in sum include 4 phases (0 deg, 90 deg, 180 deg and 270 deg). The 4 phases are output (step 506) and a ToF depth map is calculated in 508b. In other examples referring to a 2-tap pixel design, methods known in the art such as tap-shuffle and dual-frequency may be applied. For this, step 502, step 504 (and step 506) may be performed repeatedly, e.g. four times when using dual-frequency and tap-shuffle for each of the two frequencies.

In a second example (“Example 2”), referring to a 4-tap pixel design and a “1-shot depth map” approach, there may be only one image capture required, i.e. only step 502 may be performed before outputting the 4 phases in step 506.

Stereo Depth Map

In step 508a, 2D images of SP1 and of SP2 are generated. 2D images of SP1 correspond to left-side images (i.e. images that contain only image data passing the left side of the camera lens), while 2D images of SP2 correspond to right-side images (i.e. images that contain only image data passing the right side of the camera lens). Generation of 2D images may be performed according to different options. In the following, we refer to Example 1.

In some examples that may be referred to as “single-phase” images, a 2D image may be generated by outputting the values of one of the four storage node signals. Referring as an example only to the left-side 2D image (SP1), the four existing storage node signals are: PGA1 (0 deg), PGA1 (90 deg), PGB1 (180 deg) and PGB1 (270 deg). In some examples of single-phase images, only the storage node signal containing the highest amount of image information may be output for forming the 2D image. As an example for determining a highest amount of image information, one may sum over the particular phase signals of all pixels for each storage node, and define the storage node having the largest sum as the storage node that contains the highest amount of image information.
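The selection rule described above (sum each storage node signal over all pixels, then pick the largest sum) can be sketched as follows. The dictionary layout, the key names and the pixel values are hypothetical.

```python
def tap_with_most_signal(tap_images):
    """Return the name of the tap whose signal summed over all pixels is largest.

    tap_images: dict mapping a tap name to a flat list of per-pixel values.
    """
    totals = {name: sum(values) for name, values in tap_images.items()}
    return max(totals, key=totals.get)


# Hypothetical left-side (SP1) phase signals for a 3-pixel sensor.
sp1_signals = {
    "tap_0deg": [10, 12, 11],
    "tap_90deg": [20, 22, 19],
    "tap_180deg": [15, 14, 16],
    "tap_270deg": [5, 6, 4],
}
best_tap = tap_with_most_signal(sp1_signals)
single_phase_image = sp1_signals[best_tap]
```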

In other examples that may be referred to as “all-phase” images, a 2D image may be generated by outputting the sum over all signals of all the storage nodes. As an example for SP1, the pixel's value may be obtained by summing PGA1 (0 deg), PGA1 (90 deg), PGB1 (180 deg) and PGB1 (270 deg).

In yet other examples of images, a 2D image may be generated by using some combination of single-phase images and all-phase-images. As an example, one may use only two out of the four existing storage node signals for generating the 2D image.

In yet other examples, a 2D image may be generated by using only storage node signals from identical frames, i.e. only from an image captured in step 502, or only from an image captured in step 504. This method for 2D image generation may be beneficial when capturing a dynamic scene where there are significant changes between the two captures in step 502 and 504, as a depth map can be calculated from each frame. In comparison to e.g. a depth map generated by ToF using tap-shuffle and dual-frequency, for 2-tap and 4-tap ToF this corresponds to a ×8 and a ×4 increase in depth map fps respectively.

In yet other examples where more than two frames are captured (i.e. where steps 502-506 are performed repeatedly), a 2D image may be generated by averaging over storage node signals from different frames. For example, one may average over identical phases of all captured frames or one may average over particular phases (e.g. PGA1 and PGA2) of all captured frames or some of the captured frames.

In step 510a, left-side and right-side 2D images are used to calculate a stereo depth map. As known, for a regular stereo vision system having two apertures spatially separated by baseline B, an object's distance can be calculated and/or estimated using equation 1:

$\begin{matrix} Z^{'} = \frac{f \cdot B}{D \cdot ps} & (1) \end{matrix}$

where Z′ is the depth estimation for a particular pixel which may be calculated by a processing unit, f is the camera's focal length, D is the disparity in pixels, and ps is the pixel size of the image sensor. The disparity in pixels refers to the property of stereo vision systems (e.g. a dual-camera) that, after image alignment, an object point in focus is imaged to two different image points in the two output images; the magnitude of this difference is the disparity D. Via the measurement of the disparity D between two aligned stereo images, the depth of an object can be calculated according to equation 1.

For the regular stereo vision system described above, disparity D is given by

$\begin{matrix} D = \frac{f \cdot B}{Z \cdot ps} & (2) \end{matrix}$

with Z being the object-lens-distance of an object point. For an object at infinity, D approaches zero.

For a 2PD camera as described above, the disparity is zero for an object point in focus, i.e. in focus the stereo image pair entirely overlaps. So for the 2PD camera with baseline B=aperture radius, disparity D is given by

$\begin{matrix} D = \frac{f \cdot B}{p s} \cdot (\frac{1}{z} - \frac{1}{z_{0}}) & (3) \end{matrix}$

with z₀ being the distance from the lens to the focus plane.
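Equation 3 and its inverse can be evaluated with a short sketch. The numerical values (f = 4 mm, B = 1 mm, ps = 2 μm, z₀ = 1 m) are hypothetical and chosen only for illustration, and the function names are not from the source. All lengths are in meters.

```python
def disparity_2pd(z, z0, f, baseline, pixel_size):
    """Equation 3: disparity in pixels for an object at distance z, focus plane at z0."""
    return f * baseline / pixel_size * (1.0 / z - 1.0 / z0)


def depth_from_disparity_2pd(d, z0, f, baseline, pixel_size):
    """Equation 3 solved for z, given a measured disparity d in pixels."""
    inv_z = d * pixel_size / (f * baseline) + 1.0 / z0
    if inv_z <= 0.0:
        raise ValueError("disparity does not correspond to a finite positive distance")
    return 1.0 / inv_z


F, B, PS, Z0 = 4e-3, 1e-3, 2e-6, 1.0     # assumed camera parameters

d_near = disparity_2pd(0.5, Z0, F, B, PS)    # object in front of the focus plane
d_focus = disparity_2pd(1.0, Z0, F, B, PS)   # object in the focus plane
d_far = disparity_2pd(2.0, Z0, F, B, PS)     # object behind the focus plane
z_back = depth_from_disparity_2pd(d_near, Z0, F, B, PS)
```

With these assumed values the disparity is positive in front of the focus plane, zero in focus and negative behind it, and it stays within a few pixels. This illustrates the small baseline of single-aperture stereo cameras noted in the background.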

In step 512a, the stereo depth map is analyzed. The analysis may assign a confidence score to particular pixels or segments of pixels of the depth map. A high confidence score may refer to a high quality depth information, and a low confidence score may refer to a low quality depth information. Low quality depth information may e.g. be obtained for captured scene segments that do not include clearly visible textures, contours or any other contrast gradients that are required for aligning the stereo images and for determining disparity D, and/or have medium (3-5 m) or large (>5 m) lens-object distances.

Additionally, the analysis may assign a resolution score to particular pixels or segments of pixels of the depth map. The resolution score may serve as a measure of the depth resolution and/or the spatial resolution (i.e. pixel resolution) of the depth map.

The resolution score and the confidence score of a stereo depth map are together called the “stereo score”.

ToF Depth Map

In step 508b, the object-lens distance (i.e. depth) of all object points in a scene is calculated by using the 4 phases (0 deg, 90 deg, 180 deg and 270 deg) as known in the art for ToF. In some examples, before calculating the ToF depth image, all or some of the phase signals of the SPs that have identical phase relation may be summed (e.g. by “binning” as described above). An identical phase relation may be given for PGA1 and PGA2 as well as for PGB1 and PGB2 etc. In other examples, the ToF depth image may be calculated by using the phase signals of each of the SPs individually, i.e. a plurality of ToF depth images may be calculated. In some examples, one may fuse the plurality of ToF depth images to obtain a single ToF depth image. In other examples, one may average the plurality of ToF depth images to obtain a single ToF depth image.

In step 510b, the ToF depth map is analyzed. The analysis may assign a confidence score to particular pixels or segments of pixels of the depth map. A high confidence score may refer to a high quality depth information, and a low confidence score may refer to a low quality depth information. Low quality depth information may be obtained for ToF depth map segments that include:

- specular objects which do not reflect much light in direction of the ToF sensor;
- a high amount of ambient or background light;
- fast moving objects that lead to motion blur artifacts;
- “flying pixel” and “multi-path” artifacts as known in the art;
- multi-user interference as known in the art; or
- large (>4 m) lens-object distances.

Additionally, the analysis may assign a resolution score to particular pixels or segments of pixels of the ToF depth map. Resolution score and confidence score of a ToF depth map are called ToF score.

Fusion of Stereo and ToF Information

In step 514, a high-quality depth map is generated by fusing stereo and ToF depth map segments as known in the art. In some examples, one may consult measures such as a confidence score or a resolution score in order to decide whether the stereo depth map or the ToF depth map is to be used for the particular segment of the fused depth map.
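One simple way to use such scores is a per-segment selection, sketched below. The source does not prescribe a fusion rule beyond consulting the scores, so the rule used here (take the map with the higher score, ties going to ToF) is an assumption, as are the function name and the flat-list layout with one entry per segment.

```python
def fuse_depth_maps(stereo_depth, stereo_score, tof_depth, tof_score):
    """Per segment, take the depth value from the map with the higher score.

    All arguments are equally long lists with one entry per segment.
    Ties are resolved in favor of the ToF value (an arbitrary choice).
    """
    fused = []
    for z_s, s_s, z_t, s_t in zip(stereo_depth, stereo_score, tof_depth, tof_score):
        fused.append(z_s if s_s > s_t else z_t)
    return fused


# Hypothetical depth values (in m) and scores for three segments.
fused = fuse_depth_maps(
    stereo_depth=[1.0, 2.0, 3.0], stereo_score=[0.9, 0.2, 0.5],
    tof_depth=[1.1, 2.2, 3.3], tof_score=[0.1, 0.8, 0.5],
)
```

A practical fusion might instead blend the two values with score-dependent weights. The hard selection above is only the simplest instance.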

In step 516, the fused depth map generated in step 514 is output to a program or user. In some examples, the fused depth map generated in step 514 may include stereo depth information or ToF depth information only. A depth image including stereo depth information only may e.g. be beneficial for obtaining a stream of depth maps having high fps, i.e. a fast depth map mode, as from the 2PD stereo image pair a depth map can be calculated for each frame.

In examples for fast depth map modes, a ToF pixel such as 2-tap ToF pixel 400 may be operated in a high fps mode that does not support ToF depth calculation.

Consider an example (“Example 3”) for achieving a high fps depth map stream by including stereo depth information only: one may capture a first phase image in step 502 and output the phase of this first phase image in step 506 without capturing a second phase image in step 504. From this first image, a stereo depth map may be calculated in step 510a which is output in step 516.

Another example (“Example 4”) for achieving a high fps depth map stream may be based on a reduced read-out scheme and may include stereo depth information only. Here and in the following, a depth map fps may be called “high” for fps=30 or more, e.g. fps=60 or fps=240. In Example 4, one may expose a pixel such as pixel 400 and collect charges in the storage nodes as known in the art. However, for the sake of higher fps one may e.g. read out only PGA1 and PGA2 and not read out PGB1 and PGB2. This is in contrast with the common practice of reading out PGA1, PGA2, PGB1 and PGB2, all of which are required for ToF depth map generation. The overall cycle time T_cycle required for phase image capturing comprises an “integration” phase lasting the integration time T_int, which may e.g. be about 0.1 ms-5 ms, and a read-out phase lasting the read-out time T_read. In general, T_read takes a significantly larger share of T_cycle than T_int. As an example with relevance for a modern 4-tap ToF image sensor, T_read may e.g. make up about 50%-90% of T_cycle, and T_read may be about 5·T_int to 25·T_int. Here, T_read is the time required for reading out all taps, and it can be reduced by not reading all taps. So referring to a 2-tap pixel where only one tap per SP is read out, T_cycle can be reduced by 10%-100%, leading to a fps increase by 10%-100%. Referring to a 4-tap ToF pixel such as pixel 410 where only one tap per SP is read out, T_cycle can be reduced by 10%-300%, leading to a fps increase by 10%-300%. For example, one may read out only PGA1 and PGA2 but not read out PGB1-PGD1 and PGB2-PGD2. The phase images of only PGA1 and PGA2 may be used for extracting a stereo depth map. Whereas we refer here to reading out PGA1 and PGA2 only, and not reading out all other storage nodes, one may, in an analogous manner, read out only PGB1 and PGB2. Other possibilities may include reading out only PGA1 and PGB2 and not reading out all other storage nodes, etc.
One may select which storage node pair to read out according to a pre-defined read-out scheme, e.g. such as always reading out PGA1 and PGA2 only. In other examples one may select the read-out scheme dynamically, e.g. according to the amount of scene information stored in the respective storage nodes. For example, one may determine in pre-view, i.e. before the actual depth map is captured according to steps 502-516, which storage node pair (such as PGA1 and PGA2, or PGB1 and PGB2 etc.) includes the highest amount of image information.
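
The relation between the share of T_read in T_cycle and the achievable fps gain in Example 4 can be illustrated with a short calculation. The sketch assumes that T_read scales linearly with the number of taps read out and that T_int is unchanged; both are simplifying assumptions, and the function name is hypothetical.

```python
def fps_increase(read_share, taps_total, taps_read):
    """Relative fps gain when only `taps_read` of `taps_total` taps are read out.

    read_share is T_read / T_cycle for the full read-out (between 0 and 1).
    Assumption: T_read is proportional to the number of taps read.
    """
    if not 0.0 <= read_share <= 1.0:
        raise ValueError("read_share must lie between 0 and 1")
    if not 0 < taps_read <= taps_total:
        raise ValueError("taps_read must lie between 1 and taps_total")
    # Cycle time after the reduction, in units of the original T_cycle.
    reduced_cycle = (1.0 - read_share) + read_share * taps_read / taps_total
    return 1.0 / reduced_cycle - 1.0
```

Under these assumptions, a T_read share of 50%-90% gives an fps gain of roughly 33%-82% when one of two taps is read, and roughly 60%-208% when one of four taps is read, which lies within the fps ranges quoted above.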

In other examples for fast depth map modes, a ToF pixel may be operated in a high fps mode that supports calculation of a relative ToF depth map. A relative depth map provides a depth value for a particular pixel not as an absolute depth value (e.g. a depth of 1 m or 1.5 m), but only as a ratio to the depth of other pixels in the sensor. As an example, the depth value of a particular pixel located at a position (i, j) in the sensor array may be d_ij. Value d_ij may have no absolute depth assigned, but may be expressed in terms of other pixels in the sensor, e.g. depth value d_ij may be 75% of the depth value of a neighboring pixel at a position (i+1, j), i.e. d_ij=0.75·d_(i+1)j. Whereas four phase signals are required for calculating an absolute depth map, only two (or more) phase signals are required for calculating a relative depth map.

Consider an example (“Example 5”) relevant for a 4-tap pixel such as pixel 410: for achieving a high fps depth map stream including a relative ToF depth map, a reduced read-out scheme as described in Example 4 may be used. The 4-tap pixel may be integrated in a “gated ToF” system as known in the art, i.e. the light source of the ToF system may emit a rectangular pulse. In gated ToF, the storage nodes correspond to particular depth slices in a scene. One may therefore select which storage node pairs to read out according to which depth slices are considered to carry the most relevant or important information of a scene. E.g. one may read out only the pairs PGA1 and PGA2 as well as PGB1 and PGB2, but one may not read out the pairs PGC1 and PGC2 as well as PGD1 and PGD2. This may allow for a fps increase of the depth map stream of 10%-100%.

Another example (“Example 6”) is relevant for a 2-tap pixel such as pixel 400 and for achieving a high fps depth map stream including a relative ToF depth map. A reduced read out scheme may e.g. be:

- in step 502, read out only PGA1 and PGA2 (which may sample the 0 deg phase) but do not read out PGB1 and PGB2 (which may sample the 180 deg phase).
- in step 504, read out only PGA1 and PGA2 (which may sample the 90 deg phase) but do not read out PGB1 and PGB2 (which may sample the 270 deg phase).

This may allow for a fps increase of the depth map stream of 10%-100%.

In some examples, the combination or fusion of stereo depth and ToF depth may be used for overcoming the ToF depth ambiguity, e.g. instead of using the dual-frequency modulation. So instead of using a second and additional modulation/demodulation frequency, mitigating depth ambiguity may be performed by using the stereo depth map calculated in step 510a. Also this can be used for increasing fps of a depth map stream.
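
One way to picture how a stereo depth map could resolve the ToF depth ambiguity is to pick the number of phase wraps that brings the ToF value closest to the coarser stereo estimate. The sketch below uses the standard unambiguous range c/(2·f_mod) of a continuous-wave system; the function names and the nearest-wrap rule are assumptions for illustration, not a method stated in this disclosure.

```python
C = 299_792_458.0  # speed of light in m/s


def unambiguous_range(f_mod_hz):
    """Largest depth a single modulation frequency can measure without wrapping."""
    return C / (2.0 * f_mod_hz)


def resolve_ambiguity(tof_depth_m, stereo_depth_m, f_mod_hz):
    """Add the whole number of wraps that best matches the stereo estimate."""
    d_amb = unambiguous_range(f_mod_hz)
    # Nearest non-negative integer number of wraps.
    wraps = max(0, round((stereo_depth_m - tof_depth_m) / d_amb))
    return tof_depth_m + wraps * d_amb
```

At an assumed 100 MHz the unambiguous range is about 1.5 m, so an object at 2.0 m appears at about 0.5 m in the wrapped ToF map; a stereo estimate of 2.2 m is sufficient to select one wrap and recover 2.0 m. The stereo estimate only needs to be accurate to within half the unambiguous range for this to work.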

Yet another example (“Example 7”) is especially relevant for a pixel like 2-tap pixel 430. In a first example of Example 7 (switch 432 open) for generating a stereo depth map, only steps 508a, 510a and 512a may be performed, and steps 508b and 510b may not be performed. In a second example of Example 7 (switch 432 closed) for generating a ToF depth map, only steps 508b and 510b may be performed, and steps 508a, 510a and 512a may not be performed.

In yet another example (“Example 8”) and for a pixel like 2-tap pixel 430, in a further step that precedes step 502, it may be decided for each pixel (or group of pixels) whether it is used as a ToF pixel or as a stereo pixel. For pixel 430 used as a ToF pixel, switch 432 is closed; for pixel 430 used as a stereo pixel, switch 432 is opened. The decision whether to use a particular pixel as a ToF pixel or as a stereo pixel may e.g. be based on the ToF score and/or the stereo score obtained from prior depth images. In some examples for generating a depth map only using stereo image data, one may operate a 2PD ToF pixel as described herein in a “passive” manner, i.e. one may not use the light source of the ToF system but may rely on the ambient or background illumination only.

FIG. 6A shows a top view of an exemplary 2PD ToF pixel layout disclosed herein, the pixel layout numbered 602. “Layout” refers here to the physical or visual appearance of a pixel. Pixel layout 602 comprises a first SP 604 hosting a first PD (PD1) and a second SP 606 hosting a second PD (PD2). Each of the two SPs may be realized in a 2-tap pixel design (such as shown in FIG. 4A) or in a 4-tap pixel design (such as shown in FIG. 4B). An OCL 608 covers both 604 and 606. The SPs and the OCL are oriented horizontally, corresponding to a horizontal baseline B (not shown).

FIG. 6B shows an exemplary top view of another 2PD ToF pixel layout 602′ as disclosed herein. Pixel layout 602′ comprises a first subpixel 604′ (hosting PD1) and a second subpixel 606′ (hosting PD2). An OCL 608′ covers both 604′ and 606′. The SPs and the OCL are oriented vertically, corresponding to a vertical baseline B (not shown). Here “vertical” is defined by assuming a ToF image sensor as disclosed herein included in a ToF camera so that the vertical OCL is oriented parallel to a vertical line in the scene. The same holds for the definition of a horizontal orientation of the OCL.

In some examples, pixels with pixel layout 602 or 602′ may be “sparsely” integrated into an image sensor, i.e. these 2PD ToF pixels may be surrounded by regular (i.e. non-2PD) ToF pixels. A “next” 2PD ToF pixel may e.g. be located 5 or 10 or 25 or 50 pixels away from a 2PD pixel with a pixel layout such as 602 or 602′. In other examples and such as shown in FIG. 6A, all ToF pixels may be 2PD pixels that are covered by a joint OCL.

FIG. 7 shows a top view of an exemplary 4PD ToF pixel layout 702 as disclosed herein. Pixel layout 702 comprises a first SP 704 hosting a first PD (PD1), a second SP 706 hosting a second PD (PD2), a third SP 708 hosting a third PD (PD3) and a fourth SP 710 hosting a fourth PD (PD4). Each of the four SPs may be realized in a 2-tap pixel design (such as shown in FIG. 4C) or in a 4-tap pixel design (not shown). An OCL 712 covers 704, 706, 708 and 710. In some examples, pixels with pixel layout 702 may be “sparsely” integrated into an image sensor, i.e. the 4PD ToF pixels may be surrounded by regular (i.e. non-4PD) ToF pixels, and a next 4PD ToF pixel may be located 5 or 10 or 25 or 50 pixels away from a 4PD pixel with a pixel layout such as 702. In other examples and such as shown in FIG. 7, all ToF pixels may be 4PD pixels covered by a joint OCL. There are SP pairs having horizontal orientation and there are SP pairs having vertical orientation, so the 4PD pixel design corresponds to horizontal and vertical baselines B (not shown).

FIG. 8 shows a comparison of the expected disparity versus object-lens distance. The disparity on the y-axis is given in units of pixels for two different PD sizes (p_size) of 3.5 μm and 1.5 μm. Object-lens distances “z” from 0 m to 5 m are shown on the x-axis in units of meters. The expected disparity is calculated by using the following values, which may resemble a ToF camera such as used in today's smartphones:

- Focal length f=3.9 mm
- Lens focused at infinity (i.e. z₀⁻¹=0 in equation 3)
- f/#=1.6
- Aperture radius=1.21 mm (=baseline B)
- ToF pixel size: 7 μm (3.5 μm PD size: “p_size=3.5[μm]”) vs. 3 μm (1.5 μm PD size: “p_size=1.5[μm]”)
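
The baseline value in the list above follows from the focal length and the f-number, since the f-number is the focal length divided by the aperture diameter. A short check, in which the function name is hypothetical and the baseline is taken to equal the aperture radius as stated in the list:

```python
def aperture_radius_mm(focal_length_mm, f_number):
    """Aperture radius from f/# = focal length / aperture diameter."""
    return focal_length_mm / (2.0 * f_number)


# Values from the list: f = 3.9 mm and f/# = 1.6.
baseline_mm = aperture_radius_mm(3.9, 1.6)
```

This gives about 1.22 mm, which agrees with the listed 1.21 mm up to rounding.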

As a rule of thumb known in the art, a disparity of about 0.5 pixel or more is required for meaningful depth estimation. Accordingly and with reference to FIG. 8, we expect that a meaningful depth sensing range d may be about d≤3 m for a 3 μm ToF pixel and about d≤1.5 m for a 7 μm ToF pixel. Depth sensing range d refers here to the object-lens distance of an object point. This implies that miniaturization of the ToF pixel size may be beneficial for 2PD (or 4PD) based depth map estimation. Furthermore and with reference to equation 3, a ToF camera having a larger f·B product may also have an increased depth sensing range d.

In some examples, techniques for stereo baseline magnification such as e.g. described by Zhou et al. in “Stereo Magnification: Learning view synthesis using multiplane images” published in [ACM Trans. Graph., Vol. 37, No. 4, Article 65. Publication date: August 2018] may be used.

While this disclosure describes a limited number of embodiments, it will be appreciated that many variations, modifications and other applications of such embodiments may be made. In general, the disclosure is to be understood as not limited by the specific embodiments described herein, but only by the scope of the appended claims.

All references mentioned in this specification are herein incorporated in their entirety by reference into the specification, to the same extent as if each individual reference was specifically and individually indicated to be incorporated herein by reference. In addition, citation or identification of any reference in this application shall not be construed as an admission that such reference is available as prior art to the present application.

Claims

1. An image sensor pixel, comprising: a) a plurality of sub-pixels, each sub-pixel including a photodiode; b) a microlens covering the plurality of sub-pixels; and c) a read-out circuit (ROC) for extracting indirect time-of-flight (i-ToF) phase signals of each sub-pixel individually, wherein the image sensor pixel is an i-ToF image sensor pixel, wherein the i-ToF image sensor pixel includes a switch, wherein in one state the switch is closed so that the sub-pixels together form one pixel and the ROC reads out the one pixel for generating an i-ToF depth map, and wherein in another state the switch is opened so that the ROC reads out the sub-pixels individually for generating a stereo depth map.
2. The image sensor pixel of claim 1, wherein the plurality of sub-pixels includes 2 sub-pixels.
3. The image sensor pixel of claim 1, wherein the plurality of sub-pixels includes 4 sub-pixels.
4. The image sensor pixel of claim 1, wherein each sub-pixel is a 4-tap pixel including 4 pulse generators.
5. The image sensor pixel of claim 1, wherein each sub-pixel is a 2-tap pixel including 2 pulse generators.
6. The image sensor pixel of claim 1, included in an image sensor of a camera having a focal length f in the range of 1.5 mm-10 mm.
7. The image sensor pixel of claim 1, included in an image sensor of a stereo camera having a baseline B in the range of 0.5 mm-10 mm.
8. The image sensor pixel of claim 1, wherein each sub-pixel has a size of 1 μm-10 μm.
9. The image sensor pixel of claim 1, included in an image sensor of a stereo camera that has a vertical or a horizontal baseline.
10. A method, comprising: a) providing an indirect time-of-flight (i-ToF) image sensor pixel that includes a plurality of sub-pixels, each sub-pixel including a photodiode, wherein the plurality of sub-pixels is covered by a microlens; and b) using a read-out circuit (ROC) to read out i-ToF phase signals of each sub-pixel individually, by: obtaining a stereo depth map calculated from the i-ToF phase signals, obtaining an i-ToF depth map calculated from the i-ToF phase signals, analyzing the stereo depth map and the i-ToF depth map for assigning stereo scores to segments of the stereo depth map and ToF scores to the segments of the i-ToF depth map, and generating a fused depth map by using stereo depth map data for segments that have high stereo scores and using ToF depth map data for segments that have high ToF scores.
11. The method of claim 10, wherein the using a ROC to read out i-ToF phase signals of each sub-pixel individually includes using the ROC to read out fewer than all i-ToF signals generated by the plurality of the sub-pixels.
12. The method of claim 11, wherein the read out i-ToF phase signals are used to calculate a relative ToF depth map.
13. The method of claim 11, wherein the read out of the fewer than all i-ToF phase signals reduces a cycle time required for phase image capturing by more than 50% with respect to a cycle time where all the i-ToF phase signals are read out.
14. The method of claim 10, wherein the i-ToF depth map is a 1-shot depth map.
15. The method of claim 10, wherein the using a ROC to read out i-ToF phase signals of each sub-pixel individually includes using the ROC to read out fewer than all i-ToF signals generated by the plurality of the sub-pixels.
16. The method of claim 10, wherein fewer than all i-ToF phase signals are read out and used for obtaining the stereo depth map.
17. The method of claim 14, wherein the 1-shot depth map is used to generate a high fps depth map stream having a fps≥35.

CROSS REFERENCE TO RELATED APPLICATIONS

This application claims priority from US Provisional Patent Applications Nos. 63/052,001 filed Jul. 15, 2020, and 63/055,912 filed Jul. 24, 2020, both of which are expressly incorporated herein by reference in their entirety.

Related Publications (1)

	Number	Date	Country
	20220021832 A1	Jan 2022	US

Provisional Applications (2)

	Number	Date	Country
	63055912	Jul 2020	US
	63052001	Jul 2020	US
