The present invention is related to three-dimensional (3D) imaging, and in particular to 3D light-field cameras, methods and systems for capturing and presenting 3D images.
There is an emerging interest in developing a light-field (LF) camera, also called a plenoptic camera. An LF camera uses a microlens array to capture four-dimensional (4D) light field information about the scene. Such light field information may be used to improve the resolution of computer graphics and computer vision applications.
Aspects of the present invention relate to a method of generating an image of a scene. Light representing the scene is directed through a lens module coupled to an imaging sensor. The lens module includes a surface having a slit-shaped aperture and a cylindrical lens array positioned along an optical axis of the imaging sensor. A longitudinal direction of the slit-shaped aperture is arranged orthogonal to a cylindrical axis of the cylindrical lens array. Light directed through the lens module is captured by the imaging sensor to form a 3D LF image.
Aspects of the present invention also relate to a 3D LF camera. The 3D LF camera includes a surface having a slit-shaped aperture mounted on a lens, an imaging sensor and a cylindrical lens array disposed between the imaging sensor and the lens. The cylindrical lens array is arranged along an optical axis of the imaging sensor. A longitudinal direction of the slit-shaped aperture is arranged orthogonal to a cylindrical axis of the cylindrical lens array. The imaging sensor is configured to capture at least one 3D LF image of a scene.
Aspects of the present invention also relate to a 3D photograph. The 3D photograph includes a 3D light field printed image of a scene and a cylindrical lens array disposed on the 3D light field printed image. The combination of the 3D light field printed image and the cylindrical lens array forms a 3D stereoscopic image.
The invention may be understood from the following detailed description when read in connection with the accompanying drawings. It is emphasized that, according to common practice, various features/elements of the drawings may not be drawn to scale. On the contrary, the dimensions of the various features/elements may be arbitrarily expanded or reduced for clarity. Moreover, in the drawings, common numerical references are used to represent like features/elements. The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawings will be provided by the Office upon request and payment of the necessary fee. Included in the drawing are the following figures:
Current light field cameras suffer from poor resolution. For example, light field cameras with a 10 megapixel sensor only produce images at a very low resolution (e.g., about 1 megapixel). The low resolution resultant image is inherent to the design of all current light field cameras: they sacrifice spatial resolution for angular resolution. The spatial resolution is defined as the sampling rate in space. In a conventional (non-light field) camera, the spatial resolution amounts to the sensor's resolution. In a light field camera, the total number of sampling points in space is equal to the number of lenses. Given that the size of the lens is usually several times larger than the pixel pitch, the spatial resolution may be reduced. However, pixels underneath each lens will record rays passing through its common sampling point with different directions. The directional specificity defines the camera's angular resolution. Under the assumption that the sensor has limited resolution, there may be a trade-off between the spatial resolution and the angular resolution, which equates to a balance between image resolution and the number of views.
Current cameras which capture a light field include 4D light field cameras which record both angular and spatial information in all directions. For example, one current 4D light field camera includes a 328×328 microlens array attached to an imaging sensor, where each microlens covers about 100 pixels. In this example, a light field of about 328×328 spatial resolution and about 10×10 angular resolution may be obtained. The inherent tradeoff made by the light field camera provides more angular resolution at the expense of lowering spatial resolution. Although the camera in this example is equipped with an 11 megapixel sensor, it only delivers images with an effective resolution of about 700×700. Other 4D light field cameras share a similar design and similar limitations.
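The numbers in this example can be checked with a few lines of arithmetic. The sketch below is illustrative only: the function name `lf_budget` is hypothetical, and it assumes that every sensor pixel under a microlens records exactly one ray direction.

```python
def lf_budget(lenses_x, lenses_y, views_x, views_y):
    """Return (spatial samples, angular samples, sensor pixels consumed).

    Assumes one spatial sample per microlens and one angular sample per
    pixel underneath each microlens (a simplification for illustration).
    """
    spatial = lenses_x * lenses_y
    angular = views_x * views_y
    return spatial, angular, spatial * angular


# 328 x 328 microlenses, each covering about 10 x 10 = 100 pixels.
spatial, angular, pixels = lf_budget(328, 328, 10, 10)
print(spatial, angular, pixels)
```

Under these assumptions the array consumes roughly 10.8 million sensor pixels to deliver about 0.1 million spatial samples, which is the trade-off described above.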
With respect to 3D display, most existing 3D televisions use shutter-glass technology to display stereoscopic 3D images. A disadvantage of this technique is that it produces flickering, which can be noticed except at very high refresh rates. In addition, current 3D viewing techniques (such as shutter glasses) are inconvenient and expensive for viewing 3D photographs.
Aspects of the present invention include a 3D light field camera that combines a camera, a cylindrical lens array attached to the imaging sensor of the camera and a modified lens with a narrow-slit aperture. In some examples, the camera may include a digital single-lens reflex (DSLR) camera. In some examples, a 3D light field camera uses a vertical cylindrical lens array. The vertical cylindrical lens array may be used to maintain the vertical resolution while only trading between the horizontal resolution and the angular resolution. To reduce defocus blur, the cylindrical lens array may be coupled with a slit-shaped aperture.
With the rapid growth of 3D display technology, people are more likely to watch 3D content instead of two dimensional (2D) images. Example 3D light field cameras of the present invention go beyond the capability of merely watching 3D content. With exemplary 3D light field cameras and exemplary systems for capturing and presenting 3D images, the 3D content may be captured directly from the scene and then displayed. By attaching a cylindrical lens array to the sensor and a narrow-slit mask to the aperture, a consumer DSLR camera may be converted to a 3D light field camera. Users can take pictures with an exemplary 3D light field camera similarly to a conventional camera.
Aspects of the present invention also relate to exemplary methods and systems for rendering 3D stereoscopic images from a raw light field image. With the captured raw light field image, 3D stereoscopic images may be rendered from different perspectives with view-dependent features such as occlusion and reflection. Because the 3D light field camera can simultaneously capture the scene from different viewpoints in a single shot, the acquired views will exhibit parallax, i.e., closer objects exhibit larger disparity across views. The capability of preserving parallax enables naked-eye 3D visualization of the scene/object. The same capability enables preservation of view-dependent features such as reflections where each view (i.e., sub-image) captures a slightly different image of the scene. In some examples, the system may render a refocused image at a predetermined focus depth from the raw light field image. Example methods use image-based rendering (IBR) techniques. Specifically, simple geometry (such as a 3D plane) may be used as a proxy to scene geometry. All captured views can be warped onto the geometry and re-rendered (e.g., via ray-tracing or texture mapping) to the desired view. The process is analogous to specifying a focal depth in commodity SLR cameras. When all views are combined after warping, the results emulate the defocus blur of conventional wide-aperture photography.
Aspects of the present invention also relate to methods and devices for 3D viewing. According to some examples, the device falls into the category of an autostereoscopic 3D display, i.e., viewing 3D without glasses.
Referring to
3D LF camera 102 includes 3D LF lens module 118 and camera 120. As described further below with respect to
3D LF camera 102 may be configured to capture (raw) 3D LF image 128 of a scene. In some examples, 3D LF camera 102 may capture two or more 3D LF images 128 of the scene, such as over a predetermined time period. Thus, in some examples, 3D LF camera 102 may include a video camera. In general, 3D LF camera 102 may capture at least one 3D LF image 128 of the scene.
Controller 104 may be coupled to one or more of 3D LF camera 102, rendering module 106, storage 108, display 110, user interface 112 and printer 114, to control capture, storage, display, printing and/or processing of 3D LF image(s) 128. Controller 104 may include, for example, a logic circuit, a digital signal processor or a microprocessor. It is understood that one or more functions of rendering module 106 may be performed by controller 104.
Rendering module 106 may be configured to process 3D LF image(s) 128 to form rendered image(s) 130. Rendering module 106 may be configured to calibrate 3D LF camera 102 to locate a lens center of each lens 212 (
Storage 108 may be configured to store at least one of raw 3D LF image(s) 128 (from 3D LF camera 102 or via controller 104) or rendered image(s) 130 (from rendering module 106). Storage 108 may also store parameters associated with controller 104 and/or rendering module 106. Although storage 108 is shown separate from 3D LF camera 102, in some examples, storage 108 may be part of 3D LF camera 102. Storage 108 may include any suitable tangible, non-transitory computer readable medium, for example, a magnetic disk, an optical disk or a hard drive.
Raw 3D LF image(s) 128 (from 3D LF camera 102) and/or rendered image(s) 130 (from rendering module 106) may be displayed on display 110. Display 110 may include any suitable display device configured to display raw 3D LF image(s) 128/rendered image(s) 130.
User interface 112 may include any suitable user interface capable of receiving user input associated with, for example, selection of rendering to be performed by rendering module 106, parameters associated with rendering module 106, storage selection in storage 108 for captured images 128/rendered images 130, display selection for images 128, 130 and/or print selection for images 128, 130. User interface 112 may include, for example, a pointing device, a keyboard and/or a display device. Although user interface 112 and display 110 are illustrated as separate devices, it is understood that the functions of user interface 112 and display 110 may be combined into one device.
Raw 3D LF image 128 and/or rendered image 130 may be printed by printer 114, to form printed image 122. Printer 114 may include any suitable printer device configured to print raw 3D LF image 128/rendered image 130. In some examples, printer 114 may include a laser printer configured to print a color and/or a black and white printed image 122. In some examples, printed image 122 includes a glossy finish paper.
Referring to
3D photograph 116 may be used to capture other objects, such as a sculpture, food, etc. For example, a restaurant may use 3D photograph 116 to generate a 3D menu or display of their food. 3D photograph 116 may be inexpensive and portable, making it suitable for product advertising.
Referring back to
A suitable 3D LF camera 102, controller 104, rendering module 106, storage 108, display 110, user interface 112, printer 114 and 3D photograph 116 may be understood by the skilled person from the description herein.
Referring next to
As shown in
As an example, aperture 206 has a width of about 1.3 mm. Cylindrical lens array 210 includes 40 lenses 212. Lens array 210 is of size 10 mm by 10 mm, where each lens 212 has a pitch of about 0.25 mm and a focal length of about 1.6 mm. In general, the width of aperture 206, the number of lenses 212, the pitch of each lens 212, the focal length of each lens and the size of lens array 210 may be selected to produce a desired resolution for 3D LF lens module 118. In the example above, the selected parameters of lens module 118 produce an effective resolution of about 2000×2000. The number of rays captured by 3D LF camera 102 may also depend on the resolution of imaging sensor 214. In an example, the resolution of imaging sensor 214 is approximately 5,184×3,456.
As shown in
By attaching 3D LF lens module 118 to camera 120, users may capture images with 3D LF camera 102 in the same way as with a conventional camera (such as a DSLR camera). Thus, by simply pressing a shutter button of camera 120, at least one 3D LF image may be captured, the same way a 2D image is typically captured. Accordingly, there may be a minimal learning curve for using 3D LF camera 102. 3D LF images 128 (
Referring to
In a conventional camera, the value of each pixel is the integral of many rays across the aperture, which results in a high spatial resolution but very low angular resolution. 3D LF camera 102 is capable of diverging rays in one direction, while maintaining high spatial resolution in the other direction. Specifically, cone of rays 220 emitted from object 402 will be converged and partially blocked by slit mask 204 on main lens 208, becoming sheet of rays 222. Rays 222 may be optically sorted by direction via cylindrical lens array 210, to form sorted rays 412. Sorted rays 412 from cylindrical lens array 210 are then directed onto pixels (not shown) of imaging sensor 214.
As shown in
As shown in
Referring to
At step 600, a 3D LF image of a reference scene is captured, for example, via 3D LF camera 102 (
3D LF camera 102 may generate images 128 with parallax. In general, the exact placement of cylindrical lens array 210 is unknown, and the baseline between cylindrical lenses may be a non-integer multiple of the pixel pitch. Therefore, to locate the image centers of lenses 212, an image of a white scene is captured in step 600. Because of vignetting, the brightest line along each lenslet image is taken, in step 602, to approximate the center of the corresponding cylindrical lens. The lenslet image refers to the image formed by pixels lying right beneath a cylindrical lenslet 212 (
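The calibration idea in steps 600-602 can be sketched as a search for brightness peaks in the white-scene image. This is a minimal sketch under stated assumptions, not the calibration procedure itself: the function name `find_lens_centers` and the parameter `min_separation` are hypothetical, the image is assumed to be grayscale, and the cylindrical axis is assumed to be vertical so that vignetting varies only with column index.

```python
import numpy as np


def find_lens_centers(white, min_separation):
    """Estimate the column of each lenslet center from a white-scene image.

    white: 2D array (rows, cols), grayscale image of a uniform white scene.
    min_separation: smallest allowed gap (in pixels) between adjacent centers.
    Returns a 1D integer array of column indices, one per detected lenslet.
    """
    # Averaging over rows suppresses noise; with a vertical cylindrical
    # axis (assumed), the vignetting pattern depends only on the column.
    profile = white.astype(float).mean(axis=0)
    centers = []
    for c in range(1, profile.size - 1):
        is_peak = profile[c] > profile[c - 1] and profile[c] >= profile[c + 1]
        if not is_peak:
            continue
        if centers and c - centers[-1] < min_separation:
            # Two peaks closer than allowed: keep only the brighter one.
            if profile[c] > profile[centers[-1]]:
                centers[-1] = c
            continue
        centers.append(c)
    return np.array(centers, dtype=int)
```

Because the centers are found per lenslet rather than extrapolated from a nominal pitch, the sketch tolerates a lens spacing that is a non-integer multiple of the pixel pitch; sub-pixel refinement is left out for brevity.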
At step 604, a 3D LF image 128 is captured of a desired scene, for example, via 3D LF camera 102 (
At step 608, a focus depth is selected, for example, via user interface 112 (
Based on classical radiometry, the irradiance of a point on the film (or image plane where the film is positioned) is the integral of all the rays across the aperture reaching the point:
Where F is the separation between lens 208 (
To focus at a different plane, the separation between the lens plane and the film plane is changed. For example, to focus at a new depth F′, as shown in
Using similar triangles, the ray (u, x′), where x′ is the coordinate on the film plane, can be re-parameterized as
at the original x plane. As a result, if α=F′/F is defined as the relative depth of the film plane, then
Therefore, the final equation for pixel value (x′, y′) in the film at the depth F′=α·F from the lens plane becomes:
Because each object emits sheet of rays 222 (
Thus, rays may be traced through the center of each lens (located in step 602) and used to render the refocused image. Here, the term LF corresponds to the sub-aperture images and the integral can be interpreted as adding transformed sub-aperture images.
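For orientation, the refocusing relations described above are commonly written in the light-field literature in the form sketched below. The symbols are assumptions introduced for illustration: $L_F(x, y, u, v)$ is the radiance along the ray through aperture point $(u, v)$ and film point $(x, y)$ when the film is at separation $F$, $E_F$ is the irradiance on the film, and $\theta$ is the angle between that ray and the film-plane normal. From the fourth line on, the $\cos^4\theta$ factor is assumed to be absorbed into $L_F$. The last line further assumes that the slit-shaped aperture is narrow enough that only the integral over $u$ remains and the vertical coordinate is unchanged by refocusing; the exact form intended for the 3D LF camera may differ.

```latex
\begin{align}
E_F(x, y) &= \frac{1}{F^2} \iint L_F(x, y, u, v)\,\cos^4\theta \; du\, dv \\
x &= u + \frac{x' - u}{\alpha}, \qquad \alpha = \frac{F'}{F} \\
L_{F'}(x', y', u, v) &= L_F\!\left(u + \frac{x' - u}{\alpha},\; v + \frac{y' - v}{\alpha},\; u,\; v\right) \\
E_{\alpha F}(x', y') &= \frac{1}{\alpha^2 F^2} \iint
  L_F\!\left(u\Bigl(1 - \frac{1}{\alpha}\Bigr) + \frac{x'}{\alpha},\;
             v\Bigl(1 - \frac{1}{\alpha}\Bigr) + \frac{y'}{\alpha},\; u,\; v\right) du\, dv \\
E_{\alpha F}(x', y') &\approx \frac{1}{\alpha^2 F^2} \int
  L_F\!\left(u\Bigl(1 - \frac{1}{\alpha}\Bigr) + \frac{x'}{\alpha},\; y',\; u\right) du
\end{align}
```

In the last line, each fixed $u$ selects one sub-aperture image, and the argument $u(1 - 1/\alpha) + x'/\alpha$ shifts that image by an amount proportional to $u$ (and rescales it by $1/\alpha$) before the images are summed.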
At step 612, the shifted sub-aperture images (step 610) are combined to form refocused (rendered) image 130 (
It is contemplated that a non-transitory computer readable medium may store computer readable instructions for machine execution of the steps 602 and 606-612.
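The refocusing pipeline above can be sketched as a shift-and-add over sub-aperture images. This is a minimal sketch under stated assumptions rather than the rendering module's implementation: the function names `extract_subaperture_images` and `refocus` are hypothetical, lenslets are assumed to be vertical with known integer center columns, the shift is restricted to whole pixels, and `shift_per_view` stands in for the selected focus depth.

```python
import numpy as np


def extract_subaperture_images(raw, centers, half_views):
    """Build one image per view by sampling a fixed offset from each lens center.

    raw: 2D array (rows, cols) captured behind vertical cylindrical lenslets.
    centers: integer column index of each lenslet center.
    half_views: views are taken at offsets -half_views..+half_views pixels.
    Returns an array of shape (views, rows, number of lenslets).
    """
    centers = np.asarray(centers)
    offsets = np.arange(-half_views, half_views + 1)
    # raw[:, centers + k] picks, for every lenslet, the pixel that lies
    # k columns away from its center, i.e. one viewing direction.
    return np.stack([raw[:, centers + k] for k in offsets])


def refocus(views, shift_per_view):
    """Shift each view horizontally in proportion to its index, then average."""
    n = views.shape[0]
    mid = n // 2
    acc = np.zeros(views.shape[1:], dtype=float)
    for i in range(n):
        # np.roll wraps around at the border; a fuller renderer would pad or crop.
        acc += np.roll(views[i], (i - mid) * shift_per_view, axis=1)
    return acc / n
```

With `shift_per_view=0` the views are averaged without alignment; a nonzero value brings objects at the corresponding disparity into alignment, so they appear sharp while other depths blur.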
Referring to
At step 620, steps 604-606 are repeated, to form a set of sub-aperture images. At step 622, a viewpoint for the image is selected, for example, via user interface 112 (
At step 624, instead of using a uniform weight, a different weight may be assigned to different sub-aperture images. For example, higher weight(s) may be assigned to sub-aperture image(s) closer to the selected (synthetic) viewpoint, for example, via rendering module 106. At step 626, lower weight(s) may be assigned to other sub-aperture images in the set of sub-aperture images that are farther away from the selected viewpoint, for example, via rendering module 106. At step 628, rendering module 106 may apply a shift-and-add algorithm to the weighted sub-aperture images (steps 624-626) to form perspective (rendered) image 130 (
At optional step 630, rendering module 106 may generate a stereoscopic view image from the perspective image (step 628) (or from raw 3D LF image 128 or the refocused image in step 612 of
It is contemplated that a non-transitory computer readable medium may store computer readable instructions for machine execution of the steps 624-630.
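The weighting in steps 624-628 can be sketched as a weighted variant of shift-and-add. This is a minimal sketch under stated assumptions: the function name `render_view` is hypothetical, the Gaussian falloff controlled by `sigma` is an assumed choice of weighting (any function that decreases with distance from the viewpoint would fit the description), and shifts are rounded to whole pixels.

```python
import numpy as np


def render_view(views, viewpoint, sigma=1.0, shift_per_view=0):
    """Blend sub-aperture images, favoring those nearest a synthetic viewpoint.

    views: array of shape (views, rows, cols) of sub-aperture images.
    viewpoint: possibly fractional index of the desired view.
    sigma: width of the assumed Gaussian weighting, in view-index units.
    shift_per_view: horizontal shift (pixels) applied per unit view distance.
    """
    n = views.shape[0]
    idx = np.arange(n)
    # Higher weight for views close to the viewpoint, lower for distant ones.
    w = np.exp(-0.5 * ((idx - viewpoint) / sigma) ** 2)
    w /= w.sum()
    acc = np.zeros(views.shape[1:], dtype=float)
    for i in range(n):
        shift = int(round((i - viewpoint) * shift_per_view))
        # np.roll wraps around at the border; a fuller renderer would pad or crop.
        acc += w[i] * np.roll(views[i], shift, axis=1)
    return acc
```

Rendering two viewpoints on either side of the central view would give a left/right image pair; whether the stereoscopic step is performed this way is an assumption.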
Referring to
Example 3D LF camera 102 (
Referring next to
In the example, an XSi DSLR camera (e.g., camera 120) manufactured by Canon Inc. (Tokyo, Japan) with a sensor resolution of 5,184×3,456 is used to capture the data. The width of slit 206 (
To generate the refocused image, shown in
Although the invention has been described in terms of methods and systems for capturing, processing and presenting 3D images, it is contemplated that one or more steps and/or components may be implemented in software for use with microprocessors/general purpose computers (not shown). In this embodiment, one or more of the functions of the various components and/or steps described above may be implemented in software that controls a computer. The software may be embodied in non-transitory tangible computer readable media (such as, by way of non-limiting example, a magnetic disk, optical disk, hard drive, etc.) for execution by the computer. As described herein, devices 104, 106, 110, 112 and 114, shown in
Although the invention is illustrated and described herein with reference to specific embodiments, the invention is not intended to be limited to the details shown. Rather, various modifications may be made in the details within the scope and range of equivalents of the claims and without departing from the invention.
This application is a U.S. National Phase application of PCT Application No. US2014/072099, filed Dec. 23, 2014 and published on Jul. 2, 2015, as WO/2015/100301, which claims priority to U.S. Provisional Application No. 61/920,074 entitled 3-D LIGHT FIELD CAMERA AND PHOTOGRAPHY METHOD, filed on Dec. 23, 2013, and U.S. Provisional Application No. 61/931,051 entitled 3-D LIGHT FIELD CAMERA AND PHOTOGRAPHY METHOD, filed on Jan. 24, 2014, the contents of which are incorporated herein by reference.
The present invention was supported in part by Grant Number 0845268 from the National Science Foundation. The United States Government may have certain rights to the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2014/072099 | 12/23/2014 | WO | 00 |

Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2015/100301 | 7/2/2015 | WO | A |

Number | Date | Country |
---|---|---|
20160330432 A1 | Nov 2016 | US |

Number | Date | Country |
---|---|---|
61920074 | Dec 2013 | US |
61931051 | Jan 2014 | US |