METHOD AND AN APPARATUS FOR GENERATING DATA REPRESENTATIVE OF A PIXEL BEAM

TECHNICAL FIELD

The present invention relates to generation of data representing a light field.

BACKGROUND

The acquisition of four-dimensional or 4D light-field data, which can be viewed as a sampling of a 4D light field, i.e. the recording of light rays, is explained in the article “Understanding camera trade-offs through a Bayesian analysis of light field projections” by Anat Levin and al., published in the conference proceedings of ECCV 2008 is an hectic research subject.

Compared to classical two-dimensional or 2D images obtained from a camera, 4D light-field data enable a user to have access to more post-processing features that enhance the rendering of images and the interactivity with the user. For example, with 4D light-field data, it is possible to perform refocusing of images with freely selected distances of focalization meaning that the position of a focal plane can be specified/selected a posteriori, as well as changing slightly the point of view in the scene of an image. In order to acquire 4D light-field data, several techniques can be used. For example, a plenoptic camera is able to acquire 4D light-field data. Details of the architecture of a plenoptic camera are provided in FIG. 1A. FIG. 1A is a diagram schematically representing a plenoptic camera 100. The plenoptic camera 100 comprises a main lens 101, a microlens array 102 comprising a plurality of micro-lenses 103 arranged in a two-dimensional array and an image sensor 104.

Another way to acquire 4D light-field data is to use a camera array as depicted in FIG. 1B. FIG. 1B represents a multi-array camera 110. The multi-array camera 110 comprises a lens array 112 and an image sensor 114.

In the example of the plenoptic camera 100 as shown in FIG. 1A, the main lens 101 receives light from an object (not shown on the figure) in an object field of the main lens 101 and passes the light through an image field of the main lens 101.

At last, another way of acquiring a 4D light field is to use a conventional camera that is configured to capture a sequence of 2D images of a same scene at different focal planes. For example, the technique described in the document “Light field capture using focal plane sweeping and its optical reconstruction using 3D displays” by J.-H. Park et al., published in OPTICS EXPRESS, Vol. 22, No. 21, in October 2014, may be used to achieve the acquisition of 4D light field data by means of a conventional camera.

There are several ways to represent 4D light-field data. Indeed, in the Chapter 3.3 of the Ph.D dissertation thesis entitled “Digital Light Field Photography” by Ren Ng, published in July 2006, three different ways to represent 4D light-field data are described. Firstly, 4D light-field data can be represented, when recorded by a plenoptic camera by a collection of micro-lens images. 4D light-field data in this representation are named raw images or raw 4D light-field data. Secondly, 4D light-field data can be represented, either when recorded by a plenoptic camera or by a camera array, by a set of sub-aperture images. A sub-aperture image corresponds to a captured image of a scene from a point of view, the point of view being slightly different between two sub-aperture images. These sub-aperture images give information about the parallax and depth of the imaged scene. Thirdly, 4D light-field data can be represented by a set of epipolar images see for example the article entitled: “Generating EPI Representation of a 4D Fields with a Single Lens Focused Plenoptic Camera”, by S. Wanner and al., published in the conference proceedings of ISVC 2011. Light-field data can take up large amounts of storage space which can make storage cumbersome and processing less efficient. In addition light-field acquisition devices are extremely heterogeneous. Light-field cameras are of different types for example plenoptic or camera arrays. Within each type there are many differences such as different optical arrangements, or micro-lenses of different focal lengths. Each camera has its own proprietary file format. At present there is no standard supporting the acquisition and transmission of multi-dimensional information for an exhaustive over-view of the different parameters upon which a light-field depends. As such, acquired light-field data for different cameras have a diversity of formats. The present invention has been devised with the foregoing in mind.

SUMMARY OF INVENTION

According to a first aspect of the invention there is provided a computer implemented method for sorting a collection of rays of light, called generating rays, each generating ray of said collection generates, in an object space of an optical acquisition system, a surface of a volume occupied by a set of rays of light passing through a pupil of said optical acquisition system and a conjugate of at least one pixel of a sensor of said optical acquisition system, by rotating around a revolution axis of said volume, said volume occupied by said set of rays of light being called a pixel beam, the method comprising:

- selecting, for at least one pixel beam, a generating ray generating the surface of said pixel beam which crosses a reference straight line,
- generating a collection of sorted generating rays comprising the selected generating ray.

According to an embodiment of the invention, in the method for sorting a collection of rays of light, the surface of the pixel beam generated by a generating ray is a hyperboloid of one sheet.

According to an embodiment of the invention, in the method for sorting a collection of rays of light, the reference straight line is parallel to an optical axis of a main lens of the optical acquisition system.

According to an embodiment of the invention, in the method for sorting a collection of rays of light, the reference straight line is parallel to a central axis of a lens-array of the optical acquisition system.

According to an embodiment of the invention, in the method for sorting a collection of rays of light, selecting the generating ray consists in:

- computing coordinates of a vector defining the selected generating ray by multiplying a vector defining another generating ray of said pixel beam, belonging to the collection of generating rays, with a rotation matrix,
- the distance between the selected generating ray and the reference straight line being equal to zero since said selected generating ray and said reference straight line cross each other, computing the value of the rotation angle of the rotation matrix based on the points and vectors defining said selected generating ray and said reference straight line.

Another objet of the invention is a method for generating data representative of a volume, in an object space of an optical acquisition system, occupied by a set of rays of light passing through a pupil of said optical acquisition system and a conjugate of at least one pixel of a sensor of said optical acquisition system said volume occupied by said set of rays of light being called a pixel beam, the method comprising:

- acquiring a first collection of rays of light and a second collection of rays of light, called generating rays, each generating ray of said collection of generating rays generates a surface of a pixel beam by rotating around a revolution axis of said pixel beam, the revolution axis of said pixel beam being a ray of light belonging to the first collection of rays of light;
- computing intersection data defining intersections of a generating ray and the revolution axis of a given pixel beam with a plurality of given reference planes, said reference planes being parallel to one another and corresponding to different depths in the object space;
- obtaining ray diagram parameters defining the graphical representation of the intersection data in a 2D ray diagram to provide data representative of said pixel beam.

According to an embodiment of the invention, in the method for generating data representative of a pixel beam, the intersection data corresponding to the generating ray and the revolution axis are graphically represented in the ray diagram as datalines and the ray diagram parameters include data representative of at least one of:

- the slope of a dataline; and
- an intersection of a dataline with an axis of the ray diagram.

According to an embodiment of the invention, in the method for generating data representative of a pixel beam, the data representative of the pixel beam further comprises colour data representing the colour of the corresponding second generating ray and revolution axis.

According to an embodiment of the invention, in the method for generating data representative of a pixel beam, the acquired collection of generating rays is obtained in accordance with the method for generating data representative of a pixel beam.

Another object of the invention concerns an apparatus for sorting a collection of rays of light, called generating rays, each generating ray of said collection generates, in an object space of an optical acquisition system, a surface of a volume occupied by a set of rays of light passing through a pupil of said optical acquisition system and a conjugate of at least one pixel of a sensor of said optical acquisition system, by rotating around a revolution axis of said volume, said volume occupied by said set of rays of light being called a pixel beam, said apparatus comprising a processor configured to:

- select, for at least one pixel beam, a generating ray generating the surface of said pixel beam which crosses a reference straight line,
- generate a collection of sorted generating rays comprising the selected generating ray.

According to an embodiment of the invention, the processor of the apparatus for sorting a collection of rays of light is configured to select the generating ray by:

- computing coordinates of a vector defining the selected generating ray by multiplying a vector defining another generating ray of said pixel beam, belonging to the collection of generating rays, with a rotation matrix,
- the distance between the selected generating ray and the reference straight line being equal to zero since said selected generating ray and said reference straight line cross each other, computing the value of the rotation angle of the rotation matrix based on the points and vectors defining said selected generating ray and said reference straight line.

Another object of the invention concerns an apparatus for generating data representative of a volume, in an object space of an optical acquisition system, occupied by a set of rays of light passing through a pupil of said optical acquisition system and a conjugate of at least one pixel of a sensor of said optical acquisition system said volume occupied by said set of rays of light being called a pixel beam, the apparatus comprising a processor configured to:

- acquire a first collection of rays of light and a second collection of rays of light, called generating rays, each generating ray of said collection of generating rays generates a surface of a pixel beam by rotating around a revolution axis of said pixel beam, the revolution axis of said pixel beam being a ray of light belonging to the first collection of rays of light;
- computing intersection data defining intersections of a generating ray and the revolution axis of a given pixel beam with a plurality of given reference planes, said reference planes being parallel to one another and corresponding to different depths in the object space;
- obtain ray diagram parameters defining the graphical representation of the intersection data in a 2D ray diagram to provide data representative of said pixel beam.

According to an embodiment of the invention, the processor of the apparatus for generating data representative of a pixel beam, the acquired collection of generating rays is obtained in accordance with the method of any one of claims 1 to 5.

Another objet of the invention is a light field imaging device comprising:

- an array of micro lenses arranged in a regular lattice structure;
- a photosensor configured to capture light projected on the photosensor from the array of micro lenses, the photosensor comprising sets of pixels, each set of pixels being optically associated with a respective micro lens of the array of micro lenses; and
- a device for providing metadata in accordance with claim 8.

Another objet of the invention is a digital file comprising data representative of a volume, in an object space of an optical acquisition system, occupied by a set of rays of light passing through a pupil of said optical acquisition system and a conjugate of at least one pixel of a sensor of said optical acquisition system said volume occupied by said set of rays of light being called a pixel beam said data comprising:

- a ray diagram parameters defining a graphical representation in a 2D ray diagram of intersection data of a first ray of light, called a generating ray, and second ray of light, said generating ray generating a surface of a pixel beam by rotating around the other ray of light, said other ray of light being a revolution axis of said pixel beam, the intersection data defining intersections of the generating ray and the revolution axis of the pixel beam with a plurality of given reference planes, said reference planes being parallel to one another and corresponding to different depths in the object space;
- colour data defining colours of the generating ray and the revolution axis of the pixel beam. Some processes implemented by elements of the invention may be computer implemented. Accordingly, such elements may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as a “circuit”, “module” or “system’. Furthermore, such elements may take the form of a computer program product embodied in any tangible medium of expression having computer usable program code embodied in the medium.

Since elements of the present invention can be implemented in software, the present invention can be embodied as computer readable code for provision to a programmable apparatus on any suitable carrier medium. A tangible carrier medium may comprise a storage medium such as a floppy disk, a CD-ROM, a hard disk drive, a magnetic tape device or a solid state memory device and the like. A transient carrier medium may include a signal such as an electrical signal, an electronic signal, an optical signal, an acoustic signal, a magnetic signal or an electromagnetic signal, e.g. a microwave or RF signal.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the invention will now be described, by way of example only, and with reference to the following drawings in which:

FIG. 1A is a diagram schematically representing a plenoptic camera;

FIG. 1B represents a multi-array camera;

FIG. 2A is a functional diagram of a light-field camera according to an embodiment of the invention;

FIG. 2B is a functional diagram of a light-field data formator and light-field data processor according to an embodiment of the invention;

FIG. 3 is an example of a raw light-field image formed on a photosensor array;

FIG. 4 represents a volume occupied by a set of rays of light in an object space of an optical system of a camera or optical acquisition system;

FIG. 5 represents a hyperboloid of one sheet;

FIG. 6A is a functional block diagram illustrating modules of a device for sorting generating rays of a pixel beam in accordance with one or more embodiments of the invention;

FIG. 6B is a flow chart illustrating steps of a method for sorting the generating rays of a pixel beam in accordance with one or more embodiments of the invention;

FIGS. 7A and 7B graphically illustrate the use of reference planes for parameterisation of light-field data in accordance with one or more embodiments of the invention;

FIG. 8 schematically illustrates representation of light-field rays with respect to reference planes in accordance with embodiments of the invention;

FIG. 9A is a flow chart illustrating steps of a method in accordance with one or more embodiments of the invention;

FIG. 9B is a functional block diagram illustrating modules of a device for providing a light data format in accordance with one or more embodiments of the invention;

FIG. 10 schematically illustrates parameters for representation of light-field rays in accordance with embodiments of the invention;

FIG. 11 is a 2D ray diagram graphically illustrating intersection data in accordance with embodiments of the invention;

FIG. 12 graphically illustrates a digital line generated in accordance with embodiments of the invention;

FIG. 13 graphically illustrates digitals line generated in accordance with embodiments of the invention;

FIG. 14A-14C graphically illustrate Radon transforms applied to a digital line in accordance with embodiments of the invention; and

FIG. 15 is a 2D ray diagram graphically illustrating intersection data for a plurality of cameras in accordance with embodiments of the invention;

FIG. 16 represents the geometric shape of a Gaussian beam.

DETAILED DESCRIPTION

As will be appreciated by one skilled in the art, aspects of the present principles can be embodied as a system, method or computer readable medium. Accordingly, aspects of the present principles can take the form of an entirely hardware embodiment, an entirely software embodiment, (including firmware, resident software, micro-code, and so forth) or an embodiment combining software and hardware aspects that can all generally be referred to herein as a “circuit”, “module”, or “system”. Furthermore, aspects of the present principles can take the form of a computer readable storage medium. Any combination of one or more computer readable storage medium(a) may be utilized.

Embodiments of the invention provide formatting of light-field data for further processing applications such as format conversion, refocusing, viewpoint change and 3D image generation.

FIG. 2A is a block diagram of a light-field camera device in accordance with an embodiment of the invention. The light-field camera comprises an aperture/shutter 202, a main (objective) lens 201, a micro lens array 210 and a photosensor array 220 in accordance with the light-field camera of FIG. 1A. In some embodiments the light-field camera includes a shutter release that is activated to capture a light-field image of a subject or scene. It will be appreciated that the functional features may also be applied to the light-field camera of FIG. 1B.

The photosensor array 220 provides light-field image data which is acquired by LF Data acquisition module 240 for generation of a light-field data format by light-field data formatting module 250 and/or for processing by light-field data processor 255. Light-field data may be stored, after acquisition and after processing, in memory 290 in a raw data format, as sub aperture images or focal stacks, or in a light-field data format in accordance with embodiments of the invention.

In the illustrated example, the light-field data formatting module 150 and the light-field data processor 255 are disposed in or integrated into the light-field camera 200. In other embodiments of the invention the light-field data formatting module 250 and/or the light-field data processor 255 may be provided in a separate component external to the light-field capture camera. The separate component may be local or remote with respect to the light-field image capture device. It will be appreciated that any suitable wired or wireless protocol may be used for transmitting light-field image data to the formatting module 250 or light-field data processor 255; for example the light-field data processor may transfer captured light-field image data and/or other data via the Internet, a cellular data network, a WiFi network, a BlueTooth communication protocol, and/or any other suitable means.

The light-field data formatting module 250 is configured to generate data representative of the acquired light-field, in accordance with embodiments of the invention. The light-field data formatting module 250 may be implemented in software, hardware or a combination thereof.

The light-field data processor 255 is configured to operate on raw light-field image data received directly from the LF data acquisition module 240 for example to generate focal stacks or a matrix of views in accordance with embodiments of the invention. Output data, such as, for example, still images, 2D video streams, and the like of the captured scene may be generated. The light-field data processor may be implemented in software, hardware or a combination thereof.

In at least one embodiment, the light-field camera 200 may also include a user interface 260 for enabling a user to provide user input to control operation of camera 100 by controller 270. Control of the camera may include one or more of control of optical parameters of the camera such as shutter speed, or in the case of an adjustable light-field camera, control of the relative distance between the microlens array and the photosensor, or the relative distance between the objective lens and the microlens array.

In some embodiments the relative distances between optical elements of the light-field camera may be manually adjusted. Control of the camera may also include control of other light-field data acquisition parameters, light-field data formatting parameters or light-field processing parameters of the camera. The user interface 260 may comprise any suitable user input device(s) such as a touchscreen, buttons, keyboard, pointing device, and/or the like. In this way, input received by the user interface can be used to control and/or configure the LF data formatting module 250 for controlling the data formatting, the LF data processor 255 for controlling the processing of the acquired light-field data and controller 270 for controlling the light-field camera 200.

The light-field camera includes a power source 280, such as one or more replaceable or rechargeable batteries. The light-field camera comprises memory 290 for storing captured light-field data and/or rendered final images or other data such as software for implementing methods of embodiments of the invention. The memory can include external and/or internal memory. In at least one embodiment, the memory can be provided at a separate device and/or location from camera 200. In one embodiment, the memory includes a removable/swappable storage device such as a memory stick.

The light-field camera may also include a display unit 265 (e.g., an LCD screen) for viewing scenes in front of the camera prior to capture and/or for viewing previously captured and/or rendered images. The screen 265 may also be used to display one or more menus or other information to the user. The light-field camera may further include one or more I/O interfaces 295, such as FireWire or Universal Serial Bus (USB) interfaces, or wired or wireless communication interfaces for data communication via the Internet, a cellular data network, a WiFi network, a BlueTooth communication protocol, and/or any other suitable means. The I/O interface 295 may be used for transferring data, such as light-field representative data generated by LF data formatting module in accordance with embodiments of the invention and light-field data such as raw light-field data or data processed by LF data processor 255, to and from external devices such as computer systems or display units, for rendering applications.

FIG. 2B is a block diagram illustrating a particular embodiment of a potential implementation of light-field data formatting module 250 and the light-field data processor 253.

The circuit 2000 includes memory 2090, a memory controller 2045 and processing circuitry 2040 comprising one or more processing units (CPU(s)). The one or more processing units 2040 are configured to run various software programs and/or sets of instructions stored in the memory 2090 to perform various functions including light-field data formatting and light-field data processing. Software components stored in the memory include a data formatting module (or set of instructions) 2050 for generating data representative of acquired light data in accordance with embodiments of the invention and a light-field data processing module (or set of instructions) 2055 for processing light-field data in accordance with embodiments of the invention. Other modules may be included in the memory for applications of the light-field camera device such as an operating system module 2051 for controlling general system tasks (e.g. power management, memory management) and for facilitating communication between the various hardware and software components of the device 2000, and an interface module 2052 for controlling and managing communication with other devices via I/O interface ports.

FIG. 3 illustrates an example of a 2D image formed on the photosensor array 104 of FIG. 1A or the photosensor array 114 of FIG. 1B. The 2D image, often referred to as a raw image representing 4D light-field, is composed of an array of micro images MI, each micro image being produced by the respective micro lens (i, j) of the microlens array 102,112. The micro images are arranged in the array in a rectangular lattice structure defined by axes i and j. A micro lens image may be referenced by the respective micro lens coordinates (i, j). A pixel PI of the photosensor 104, 114 may be referenced by its spatial coordinates (x, y). 4D light-field data associated with a given pixel may be referenced as (x, y, i, j).

There are several ways of representing (or defining) a 4D light-field image. For example, a 4D light-field image can be represented, by a collection of micro-lens images as previously described with reference to FIG. 3. A 4D light-field image may also be represented, when recorded by a plenoptic camera by a set of sub-aperture images. Each sub-aperture image of composed of pixels of the same position selected from each microlens image. Furthermore, a 4D light-field image may be represented by a set of epipolar images, which is not the case of the pixel beam.

Embodiments of the invention provide a representation of light-field data based on the notion of pixel beam. In this way the diversity in formats and light-field devices may be taken into account. Indeed, one drawback of ray based formats, is that the parametrization planes have to be sampled to reflect the pixel formats and sizes. Therefore, the sampling needs to be defined along other data in order to recover physical meaningful information.

A pixel beam 40, as shown on FIG. 4, represents a volume occupied by a set of rays of light in an object space of an optical system 41 of a camera. The set of rays of light is sensed by a pixel 42 of a sensor 43 of the camera through a pupil 44 of said optical system 41. Contrary to rays, pixel beams 40 may be sample at will since they convey per se the “etendul” which corresponds to the preservation of the energy across sections of the physical light rays.

A pupil of an optical system is defined as the image of an aperture stop as seen through part of said optical system, i.e. the lenses of the camera which precedes said aperture stop. An aperture stop is an opening which limits the amount of light which passes through the optical system of the camera. For example, an adjustable diaphragm located inside a camera lens is the aperture stop for the lens. The amount of light admitted through the diaphragm is controlled by the diameter of the diaphragm opening which may be adapted depending of the amount of light a user of the camera wishes to admit, or the depth of field the user wishes. For example, making the aperture smaller reduces the amount of light admitted through the diaphragm, but increases the depth of field. The apparent size of a stop may be larger or smaller than its physical size because of the refractive action of a lens. Formally, a pupil is the image of the aperture stop through the optical system of the camera.

A pixel beam 40 is defined as a pencil of rays of light that reaches a given pixel 42 when propagating through the optical system 41 via an entrance pupil 44. As light travels on straight lines in free space, the shape of such a pixel beam 40 can be defined by two sections, one being the conjugate 45 of the pixel 42, and the other being the entrance pupil 44. The pixel 42 is defined by its non-null surface and its sensitivity map.

Thus, a pixel beam may be represented by an hyperboloid of one sheet 50, as shown on FIG. 5, supported by two elements: the pupil 54 and the conjugate 55 of the pixel 42 in the object space of the camera.

A hyperboloid of one sheet is a ruled surface that can support the notion of pencil of rays of light and is compatible with the notion of “etendul” of physical light beams.

A hyperboloid of one sheet corresponds to the geometry of a Gaussian beam. Indeed, in optics, a Gaussian beam is a beam of monochromatic electromagnetic radiation whose transverse magnetic and electric field amplitude profiles are given by a Gaussian function; this also implies a Gaussian intensity profile. This fundamental transverse Gaussian mode describes an intended output of most lasers, since such a beam of light can be focused into the most concentrated spot.

The equations below assume a beam with a circular cross-section at all values of custom-character this can be seen by noting that a single transverse dimension, r, appears.

At a position custom-character along the beam (measured from the focus), the spot size parameter w is given by¹

$w (z) = w_{0} \sqrt{1 + {(\frac{z}{z_{R}})}^{2}}$

where w₀is the waist size.

As represented on FIG. 16, at a distance from the waist equal to custom-character _R, the width w of the beam is equal to √{square root over (2)}w₀.

Although the tails of a Gaussian function never actually reach zero, for. This means that far from the waist, the beam “edge” is cone-shaped. The angle between lines along that cone (whose r=w(z)) and the central axis of the beam (r=0) is called the divergence of the beam.

The total angular spread of the beam far from the waist is then given by Θ=20.

Since a hyperboloid of one sheet is a ruled surface, one family of straight lines, called generating rays, rotating around a revolution axis, called chief ray, of the hyperboloid, describe such a surface. The chief ray of a hyperboloid and the generating rays describing its surface are not parallel to each other and they never intercept each other. The knowledge of parameters defining the chief ray and any generating ray belonging to the family of generating lines of the hyperboloid are sufficient to define a pixel beam 40, 50.

The general equation of a hyperboloid of one sheet representing a pixel beam 40, 50 is:

$\begin{matrix} \frac{{(x - x_{0} - z \cdot \tan θ_{x})}^{2}}{a^{2}} + \frac{{(y - y_{0} - z \cdot \tan θ_{y})}^{2}}{b^{2}} - \frac{{(z - z_{0})}^{2}}{c^{2}} = 1 & (1) \end{matrix}$

where (x₀, y₀, z₀) are the coordinates of a centre of the waist of the pixel beam in a (x, y, custom-character ) coordinates system centred on the centre of the pupil, a, b, c are homologous to the length of semi-axes along Ox, Oy, Oz respectively, where a represents the radius of the of waist along Ox; b represents the radius of the waist along Oy and c defines an angular aperture of the pixel beam, and θ_x, θ_y, define the chief ray directions relative to the entrance of the pupil 44 centre. They depend on the pixel 42 position on the sensor 43 and on the optical elements of the optical system 41. More precisely, the parameters θ_x, θ_yrepresent shear angles defining a direction of the conjugate 45 of the pixel 42 from the centre of the pupil 44. In some embodiments of the invention, a and b have identical values, in these cases, the waist has a circular shape.

The parameter c defines the angular aperture a of the pixel beam 40, 50 and is given by the formula

$\tan (α) = \frac{a}{c},$

or by the expression given by the following

- equation:

$c^{2} = \frac{a^{2} z_{P}^{2}}{r^{2} - a^{2}}$

where r represents the pupil radius, and z_Prepresents the position of the waist of the pixel beam in front of the pupil.

However, such a representation of a pixel beam 40, 50 takes up large amounts of storage space since the classical file format for storing rays consists in storing a position and a direction in a 3D space. A solution for reducing the amount of storage space required to store a representation a pixel beam is described hereinafter in reference to FIG. 9B.

By nature, while chief rays will behave smoothly passing through the microlenses centres of the microlens array of the camera, generating rays suffer from stronger deviations on the borders of the microlenses. Thus the generating rays tend to hit the main lens of the camera at larger aperture angles. Aberrations like distortion will thus disturb the generating rays, while the chief rays are less affected by such aberrations. This disturbance of the generating rays makes it difficult to run the method described in reference to FIG. 9B since said method works with ordered collections of rays. To this end the inventors of the present invention propose a method for sorting the generating rays of a collection of pixel beams of a camera in order to feed the method according to FIG. 6B with such a sorted collection of generating rays.

FIG. 6A is a block diagram schematically illustrating the main modules of an apparatus for sorting the generating rays of a collection of pixel beams of a camera according to one or more embodiments of the invention. In an embodiment of the invention, such an apparatus may be embedded in the light-field data formatting module 250.

The apparatus 600 comprises a processor 601, a storage unit 602, an input device 603, a display device 604, and an interface unit 605 which are connected by a bus 606. Of course, constituent elements of the computer apparatus 600 may be connected by a connection other than a bus connection.

The processor 601 controls operations of the apparatus 600. The storage unit 602 stores at least one program capable of sorting the generating rays of a collection of pixel beams of a camera to be executed by the processor 601, and various data, including parameters related to the optical system 21 of the optical acquisition system, parameters used by computations performed by the processor 601, intermediate data of computations performed by the processor 601, and so on. The processor 601 may be formed by any known and suitable hardware, or software, or a combination of hardware and software. For example, the processor 601 may be formed by dedicated hardware such as a processing circuit, or by a programmable processing unit such as a CPU (Central Processing Unit) that executes a program stored in a memory thereof.

The storage unit 602 may be formed by any suitable storage or means capable of storing the program, data, or the like in a computer-readable manner. Examples of the storage unit 602 include non-transitory computer-readable storage media such as semiconductor memory devices, and magnetic, optical, or magneto-optical recording media loaded into a read and write unit. The program causes the processor 601 to perform a process for computing parameters representing a volume occupied by a set of rays of light in an object space of an optical system and encoding these parameters with an image captured by the optical acquisition system according to an embodiment of the present disclosure as described hereinafter with reference to FIG. 9B.

The input device 603 may be formed by a keyboard, a pointing device such as a mouse, or the like for use by the user to input commands, to make user's selections of parameters used for generating a parametric representation of a volume occupied by a set of rays of light in an object space of an optical system. The output device 604 may be formed by a display device to display, for example, a Graphical User Interface (GUI), images generated according to an embodiment of the present disclosure. The input device 603 and the output device 604 may be formed integrally by a touchscreen panel, for example.

The interface unit 605 provides an interface between the apparatus 600 and an external apparatus. The interface unit 605 may be communicable with the external apparatus via cable or wireless communication. In an embodiment, the external apparatus may be a camera, or a portable device embedding such a camera like a mobile phone, a tablet, etc.

FIG. 6B is a flow chart illustrating the steps of a method for sorting the generating rays of a collection of pixel beams of a camera according to one or more embodiments of the invention.

In a preliminary step S601, parameters (x₀, y₀, z₀), a, b, c and θ_x, θ_ydefining the different pixel beams associated to the pixels of the sensor of the camera are acquired either by calibrating the camera of by retrieving such parameters from a data file stored in a remote server or on a local storage unit such as the memory 290 of the camera or a flash disk connected to the camera. This acquisition or calibration may be executed by the processor 601 of the apparatus 600.

The computation of the values of the parameters (x₀, y₀, z₀), a, b, c and θ_x, θ_yis realized, for example, by running a program capable of modelling a propagation of rays of light through the optical system of the camera. Such a program is for example an optical design program such as Zemax, ©, ASAP © or CodeV©. An optical design program is used to design and analyze optical systems. An optical design program models the propagation of rays of light through the optical system; and can model the effect of optical elements such as simple lenses, aspheric lenses, gradient index lenses, mirrors, and diffractive optical elements, etc. The optical design program may be executed by the processor 601 of the apparatus 600.

In a step S602 executed by the processor 601, the shear of the chief ray of a pixel beam is removed. Unshearing the chief ray consists in writing:

${\begin{matrix} \overline{x} = x - z \cdot \tan θ_{x} \\ \overline{y} = y - z \cdot \tan θ_{y} \end{matrix}$

which gives a hyperboloid of one sheet which chief ray is parallel to the Oz axis:

$\begin{matrix} \frac{{(\overline{x} - x_{0})}^{2}}{a^{2}} + \frac{{(\overline{y} - y_{0})}^{2}}{b^{2}} - \frac{{(z - z_{0})}^{2}}{c^{2}} = 1 & (2) \end{matrix}$

where (x, y, z) are the coordinates of a point belonging to the surface of the hyperboloid, and (x₀, y₀, z₀) are the coordinates of the centre of the waist of the considered pixel beam.

Ina step S603, the processor 601 computes the centering of the hyperboloid on the point of coordinates (x₀, y₀, z₀) and then compute the normalization of the hyperboloid which gives:

$\begin{matrix} {\begin{matrix} X = \frac{(\overline{x} - x_{0})}{a} \\ Y = \frac{(\overline{y} - y_{0})}{b} \\ Z = \frac{(z - z_{0})}{c} \end{matrix} & (3) \end{matrix}$

Thus equation (1) now reads

X
²
+Y
²
−Z
²=1 (4)

Unshearing then centering and normalizing a pixel beam reverts to use the function

$T = {\begin{matrix} X = \frac{(x - z \cdot \tan θ_{x} - x_{0})}{a} \\ Y = \frac{(y - z \cdot \tan θ_{y} - y_{0})}{b} \\ Z = \frac{(z - z_{0})}{c} \end{matrix}$

transforming (x, y, z) coordinates into (X, Y, Z) coordinates.

Since the central axis of the hyperboloid is the Oz axis, two points belonging to this axis have the following set of coordinates (0,0,0) and (0,0,1) in the (XYZ) coordinate system. This central axis of the hyperboloid, transformed back in the original coordinate system (x, y, z), is the chief ray ρ_Cof the pixel beam.

The hyperboloid defined by equation (4) has two families of generating rays:

- a first family of generating rays is given by the rotation around the OZ axis of a straight line joining a first point of coordinates (1, 0, 0) and a second point of coordinates (1, ζ, ζ) for any ζ∈*, example ζ=1, and
- a second family of generating rays is given by the rotation around the OZ axis of a straight line joining the point of coordinates (1, 0, 0) and a third point of coordinates (1, ζ, ζ) for any ζ∈*.

Any of these generating rays, transformed back in the original coordinate system, can be selected as ρ_G0a generating ray of a pixel beam.

In the following description of the method according to an embodiment of the invention, two points G₀which coordinantes are (1,0,0) and I₀which coordiantes are (1, 1,1) in the (XYZ) coordinate system as defining the initial generating ray ρ_G0in the (XYZ) coordinate system.

In a step S604, the processor 601 applies the function T as defined above to a reference straight line Δ in the object space of the camera. In an embodiment of the invention, the reference straight line Δ is an optical axis of a main lens of the camera. In another embodiment of the invention, the reference straight line Δ is a central axis of a lens-array of the camera, in a third embodiment the reference straight line Δ is a line with a direction forming an angle inferior or equal to

$\frac{π}{4}$

with the optical axis of the main lens of the camera.

The reference straight line Δ is defined by two points of coordinates P_Δ0(x_Δ0, y_Δ0, z_Δ0) and P_Δ1(x_Δ1, y_Δ1, z_Δ1) in the (x, y, z) coordinates system.

Applying the function T to the coordinates of P_Δ0and P_Δ1gives in the (XYZ) coordinate system:

${\begin{matrix} P_{Δ 0} & (X_{Δ 0}, Y_{Δ0}, Z_{Δ 0}) = T ((x_{Δ0}, y_{Δ0}, z_{Δ 0})) \\ P_{Δ 1} & (X_{Δ 1}, Y_{Δ1}, Z_{Δ 1}) = T ((x_{Δ1}, y_{Δ 1}, z_{Δ 1})) \end{matrix}$

As the chief ray ρ_Cof the pixel beam is the Oz axis in the (XYZ) coordinate system, a rotation of angle φ around the chief ray ρ_Cis given by the rotation matrix:

$R_{φ} = [\begin{matrix} \cos φ & - s in φ & 0 \\ \sin φ & \cos φ & 0 \\ 0 & 0 & 1 \end{matrix}] .$

Naming ρ_Gφ(G_φ, I_φ) the image of the generating ray ρ_G0(G₀, I₀) by the rotation of angle φ around the chief ray ρ_Cthe coordinates of the points G_φ, and I_φ are given by:

$G_{φ} = R_{φ} G_{0} = [\begin{matrix} \cos φ & - s in φ & 0 \\ \sin φ & \cos φ & 0 \\ 0 & 0 & 1 \end{matrix}] [\begin{matrix} 1 \\ 0 \\ 0 \end{matrix}] = [\begin{matrix} \cos φ \\ \sin φ \\ 0 \end{matrix}]$

$I_{φ} = R_{φ} I_{0} = [\begin{matrix} \cos φ & - s in φ & 0 \\ \sin φ & \cos φ & 0 \\ 0 & 0 & 1 \end{matrix}] [\begin{matrix} 1 \\ 1 \\ 1 \end{matrix}] = [\begin{matrix} \cos φ - \sin φ \\ \sin φ + \cos φ \\ 1 \end{matrix}]$

During a step S605, the processor 601 computes the value of the rotation angle φ.

The value of the rotation angle φ is obtained when the generating ray ρ_Gφ crosses the reference straight line Δ, i.e the distance between the reference straight line Δ and the generating ray ρ_Gφis equal to 0.

The distance between two skew lines with equations

${\begin{matrix} \vec{x} = \vec{x_{1}} + (\vec{x_{2}} - \vec{x_{1}}) \cdot s \\ \vec{y} = \vec{x_{3}} + (\vec{x_{4}} - \vec{x_{3}}) \cdot t \end{matrix}$

is given by (Gellert et al. 1989, p. 538).

$D = \frac{\langle (\vec{x_{3}} - \vec{x_{1}}) \cdot [(\vec{x_{2}} - \vec{x_{1}}) \times (\vec{x_{4}} - \vec{x_{3}})] \rangle}{\langle (\vec{x_{2}} - \vec{x_{1}}) \times (\vec{x_{4}} - \vec{x_{3}}) \rangle}$

This can be written in the concise form:

$D = \frac{\langle \vec{c} \cdot (\vec{a} \times \vec{b}) \rangle}{\langle \vec{a} \times \vec{b} \rangle} by defining (\begin{matrix} \vec{a} = \vec{x_{2}} - \vec{x_{1}} \\ \vec{b} = \vec{x_{4}} - \vec{x_{3}} \\ \vec{c} = \vec{x_{3}} - \vec{x_{1}} \end{matrix}$

In the current case, for

${\begin{matrix} \vec{x} = \vec{x_{1}} + (\vec{x_{2}} - \vec{x_{1}}) \cdot s \\ \vec{x} = \vec{x_{3}} + (\vec{x_{4}} - \vec{x_{3}}) \cdot t \end{matrix}$

x₁=P_Δ0and x₂=P_Δ1, and x₃=G_φ, and x₄=I_φ

A distance between two straight lines being equals to zero reverts to compute:

$D = 0 \Leftrightarrow \vec{c} \cdot (\vec{a} \times \vec{b}) = 0 \Leftrightarrow \det (\vec{a} \vec{b} \vec{c}) = 0 \Leftrightarrow \langle \begin{matrix} a_{1} & a_{2} & a_{3} \\ b_{1} & b_{2} & b_{3} \\ c_{1} & c_{2} & c_{3} \end{matrix} \rangle = 0$

In the case of the reference straight line Δ and the generating ray ρ_Gφ:

$(\begin{matrix} \vec{a} = \vec{x_{2}} - \vec{x_{1}} = \vec{P_{Δ 0} P_{Δ 1}} = {[\begin{matrix} X_{Δ 1} - X_{Δ 0} & Y_{Δ 1} - Y_{Δ 0} & Z_{Δ 1} - Z_{Δ 0} \end{matrix}]}^{T} \\ \vec{b} = \vec{x_{4}} - \vec{x_{3}} = \vec{G_{φ} I_{φ}} = {[\begin{matrix} - \sin φ & \cos φ & 1 \end{matrix}]}^{T} \\ \vec{c} = \vec{x_{3}} - \vec{x_{1}} = P_{Δ 0} G_{φ} = {[\begin{matrix} \cos φ - X_{Δ 0} & \sin φ - Y_{Δ 0} & - Z_{Δ 0} \end{matrix}]}^{T} \end{matrix} D = 0 \Leftrightarrow \langle \begin{matrix} X_{Δ 1} - X_{Δ 0} & Y_{Δ 1} - Y_{Δ 0} & Z_{Δ 1} - Z_{Δ 0} \\ - s in φ & \cos φ & 1 \\ \cos φ - X_{Δ 0} & \sin φ - Y_{Δ 0} & - Z_{Δ 0} \end{matrix} \rangle = 0 \Leftrightarrow \langle \begin{matrix} d X_{Δ} & {dY}_{Δ} & d Z_{Δ} \\ - s in φ & \cos φ & 1 \\ \cos φ - X_{Δ 0} & \sin φ - Y_{Δ 0} & - Z_{Δ 0} \end{matrix} \rangle = 0 \langle \begin{matrix} a_{1} & a_{2} & a_{3} \\ b_{1} & b_{2} & b_{3} \\ c_{1} & c_{2} & c_{3} \end{matrix} \rangle = a_{1} b_{2} c_{3} - a_{1} b_{3} c_{2} - a_{2} b_{1} c_{3} + a_{2} b_{3} c_{1} + a_{3} b_{1} c_{2} - a_{3} b_{2} c_{1} D = - Z_{Δ 0} \cdot \cos φ \cdot {dX}_{Δ} - (\sin φ - Y_{Δ 0}) \cdot {dX}_{Δ} - Z_{Δ 0} \cdot \sin φ \cdot {dY}_{Δ} + (\cos φ - X_{Δ 0}) \cdot {dY}_{Δ} - (\sin φ - Y_{Δ 0}) \cdot \sin φ \cdot {dZ}_{Δ} - (\cos φ - X_{Δ 0}) \cdot \cos φ \cdot {dZ}_{Δ} D = \cos φ \cdot ({dY}_{Δ} - Z_{Δ 0}, {dX}_{Δ} + X_{Δ 0} \cdot {dZ}_{Δ}) - \sin φ \cdot ({dX}_{Δ} + Z_{Δ 0}, \cdot Z_{Δ 0} \cdot {dY}_{Δ} + Y_{Δ 0} \cdot {dZ}_{Δ}) + Y_{Δ 0} \cdot {dX}_{Δ} - X_{Δ 0}, \cdot {dY}_{Δ} - \sin^{2} φ \cdot {dZ}_{Δ} - \cos^{2} φ \cdot {dZ}_{Δ} D = \cos φ \cdot (- Z_{Δ 0} \cdot {dX}_{Δ} + {dY}_{Δ} + X_{Δ 0} \cdot {dZ}_{Δ}) - \sin φ \cdot ({dX}_{Δ} + Z_{Δ 0} \cdot {dY}_{Δ} + Y_{Δ 0} \cdot {dZ}_{Δ}) + (Y_{Δ 0} \cdot {dX}_{Δ} - X_{Δ 0} \cdot {dY}_{Δ} - d Z_{Δ}) Writing D = A \cdot \cos φ + B \cdot \sin φ - C with : (\begin{matrix} A = (- Z_{Δ 0} \cdot {dX}_{Δ} + {dY}_{Δ} + X_{Δ 0} \cdot {dZ}_{Δ}) \\ B = - ({dX}_{Δ} + Z_{Δ 0} \cdot {dY}_{Δ} + Y_{Δ 0}, \cdot {dZ}_{Δ}) \\ C = - (Y_{Δ 0} \cdot {dX}_{Δ} - X_{Δ 0}, \cdot {dY}_{Δ} - {dZ}_{Δ}) \end{matrix}$

Two solutions for φ giving D=0 can be found by:

$A + B \cdot \tan φ = \frac{C}{\cos φ} \Leftrightarrow {(A + B \cdot \tan φ)}^{2} = \frac{C^{2}}{\cos^{2} φ} \Leftrightarrow A^{2} + 2 \cdot A \cdot B \cdot \tan φ + B^{2} \cdot \tan^{2} φ = C^{2} (1 + \tan^{2} φ) \Leftrightarrow (B^{2} - C^{2}) \cdot \tan^{2} φ + 2 \cdot A \cdot B \cdot \tan φ + (A^{2} - C^{2}) = 0$

Discriminant: Λ=4. A². B²−4. (B²−C²). (A²−C²)=4. (A²+B²−C²).C²

This leads to two solutions:

$\tan φ^{\pm} = \frac{- 2 \cdot A \cdot B \pm \sqrt{⩓}}{2 \cdot (B^{2} - C^{2})} = \frac{- A B \pm \langle C \rangle \sqrt{A^{2} + B^{2} - C^{2}}}{B^{2} - C^{2}}$

Thus, two angles φ⁺ and φ⁻are identified for the generating ray ρ_Gφto cross the reference straight line Δ.

Since there are two families of generating rays for a same hyperboloid, four different rays generating the pixel beam cross the reference straight line Δ, two for each of the two families of generating rays. One generating ray is chosen among those four generating rays crossing the reference straight line. Then the selected generating ray is transformed back in the original coordinate system in a step S606.

This method is run on the different pixel beams of the camera. Thus, two collections of rays are obtained: a first collection of rays comprising the chief rays of the pixel beams of the camera and a second collection of rays comprising for the different pixel beams of the camera the generating rays crossing the reference straight line. Those two collections of rays are used in the method describe hereafter in order to provide a compact format for representing the pixel beams.

In order to propose a file format for storing rays which needs less storage space, a method for parametrizing the four dimensions of light-field radiance may be with reference to the cube illustrated in FIG. 7A. All six faces of the cube may be used to parameterize the light-field. In order to parameterize direction, a second set of planes parallel to the cube faces, may be added. In this way the light-field may be defined with respect to six pairs of planes with normals along the axis directions as:

{right arrow over (i)},−{right arrow over (i)},
custom-character
,−

,{right arrow over (k)},−{right arrow over (k)}

FIG. 7B illustrates a light-field ray, such as a chief ray or a generating ray defining a pixel beam, passing through two reference planes P1 and P2 used for parameterization positioned parallel to one another and located at known depths custom-character ₁and ₂respectively. The light-field ray intersects the first reference plane P₁at depth ₁at intersection point (x₁, y₁) and intersects the second reference plane P₂at depth ₂at intersection point (x₂, y₂). In this way the light-field ray may be identified by four coordinates (x₁, y₁, x₂, y₂). The light-field can thus be parameterized by a pair of reference planes for parameterization P₁, Pz also referred herein as parametrization planes, with each light-field ray being represented as a point (x₁,y₁,x₂,x₂,)∈R⁴in 4D ray space.

Thus this is done for each chief ray and each generating ray of a collection of pixel beams of a camera.

For example an origin of the reference co-ordinate system may be placed at the center of a plane P₁generated by the basis vectors of the coordinate axis system ({right arrow over (i)}₁, {right arrow over (j)}₁). The {right arrow over (k)} axis is normal to the generated plane P₁and the second plane P₂can be placed for the sake of simplicity at a distance custom-character =Δ from plane P₁along the {right arrow over (k)} axis. In order to take into account the six different directions of propagation the entire light-field may be characterized by six pairs of such planes. A pair of planes, often referred to as a light slab characterizes the light-field interacting with the sensor or sensor array of the light-field camera along a direction of propagation.

The position of a reference plane for parameterization can be given as:

{right arrow over (x₀)}=d{right arrow over (n)} where {right arrow over (n)} is the normal and d is an offset from the origin of the 3D coordinate system along the direction of the normal.

A Cartesian equation of a reference plane for parameterization can be given as:

{right arrow over (b)}({right arrow over (x)}−{right arrow over (x₀)})=0

If a light-field ray has a known position:

- {right arrow over (x_i)}(x_i, y_i, z_i) and a normalized propagation vector:
- {right arrow over (u)}(u₁, u₂, u₃) the general parametric equation of a ray in 3D may be given as:

{right arrow over (x)}=t{right arrow over (u)}+{right arrow over (x_i)}

The co-ordinates of the intersection {right arrow over (x1)} between the light-field ray and a reference plane are given as:

$\begin{matrix} \vec{x_{1}} = \vec{x_{i}} + \vec{u} \frac{\vec{n} (\vec{x_{0}} - \vec{x_{i}})}{\vec{u} \vec{n}} & (A) \end{matrix}$

There is no intersection between the light-field rays and the reference parameterization if the following condition is not satisfied:

({right arrow over (x₁)}−{right arrow over (x₀)}){right arrow over (u)}>0

Due to the perpendicularity with one of the axes of the system of the pair of reference planes used to parameterize the light-field, one of the components of the ray intersection is always constant for each plane. Hence if there is an intersection of a light-field ray {right arrow over (x1)} with the first reference plane, and the intersection {right arrow over (x2)} of the said light-field with the second reference plane, four coordinates vary and equation A can be used to calculate the four parameters of a light-field ray. These four parameters can be used to build up a 4D ray diagram of the light-field.

Assuming parameterization of the light-field with reference to two parameterization reference planes, data representing the light-field may be obtained as follows. If a reference system is set as pictured in FIG. 8 a first parametrization plane P1 is perpendicular to z axis at z=z1, a second parametrization plane P2 is arranged perpendicular to the z axis at z=z2 and a ray whose light-field parameters are L(x1; y1; x2; y2) are to be rendered at a location z=z3 where a photosensor array of a light-field camera is positioned. From equation (A):

$\vec{x_{3}} = \vec{x_{2}} + \vec{u} \frac{\vec{n} (z_{3} \vec{n} - \vec{x_{2}})}{\vec{u} \vec{n}}$

$\vec{x_{3}} = \vec{x_{1}} + \vec{u} \frac{\vec{n} (z_{3} \vec{n} - \vec{x_{1}})}{\vec{u} \vec{n}} with$

$\vec{u} = \frac{\vec{x_{2}} = \vec{x_{1}}}{ \vec{x_{2}} = \vec{x_{1}} } = (u_{x}, u_{y}, u_{z})$

$\vec{n} (0, 0, 1)$

Developing the above expression gives:

$x_{3} = x_{2} + \frac{u_{x}}{u_{z}} (z_{3} - z_{2})$

$y_{3} = y_{2} + \frac{u_{y}}{u_{z}} (z_{3} - z_{2})$

$z_{3} = z_{3}$

$x_{3} = x_{1} + \frac{u_{x}}{u_{z}} (z_{3} - z_{1})$

$y_{3} = y_{1} + \frac{u_{y}}{u_{z}} (z_{3} - z_{1})$

$z_{3} = z_{3}$

Both sets of equation should deliver the same point {right arrow over (x3)} as the rendered light-field ray at the new location. By replacing u_x; u_y; u_zwith their corresponding expression as functions of {right arrow over (x1)} and {right arrow over (x2)}, if the second set of equation from the previous block is used and x3 and y3 are added together:

$x_{1} + \frac{z_{3} - z_{1}}{z_{2} - z_{1}} (x_{2} - x_{1}) + y_{1} + \frac{z_{3} - z_{1}}{z_{2} - z_{1}} (y_{2} - y_{1}) = x_{3} + y_{3}$

Leading to the expression:

(z₂−z₃)(x₁+y₁)+(z₃−z₁)(x₂+y₂)=(z₂−z₁)(x₃+y₃) (B)

Co-ordinates with a subscript ₃relate to a known point (x₃, y₃, custom-character ₃) where the light-field is rendered. All depth co-ordinates Z are known. The parameterisation planes are in the direction of propagation or rendering. The light-field data parameters L are (x₁, y₁, x₂, y₂).

The light-field rays that form an image at point (x₃, y₃, custom-character ₃) are linked by expression (B) which defines a hyper plane in ⁴.

This signifies that if images are to be rendered from a two-plane parametrized light-field, only the rays in the vicinity of hyperplanes need to be rendered, there is no need to trace them. FIG. 9A is a flow chart illustrating the steps of a method for generating data representative of a light-field according to one or more embodiments of the invention. FIG. 9B is a block diagram schematically illustrating the main modules of a system for generating data representative of a light-field according to one or more embodiments of the invention.

In a preliminary step S801 of the method parameters defining the chief rays and the generating rays of the different pixel beams associated to the pixels of the sensor of the camera are acquired. These parameters are obtained as a result of the method for sorting the generating rays described above.

Such parameters are the coordinates of the chief rays and of a generating ray crossing the reference straight line of the different pixel.

In another preliminary step S802 raw light-field data is acquired by a light-field camera 801. The raw light-field data may for example be in the form of micro images as described with reference to FIG. 3. The light-field camera may be a light-field camera device such as shown in FIG. 1A or 1B and 2A and 2B.

In step S803 the acquired light-field data is processed by ray parameter module 802 to provide intersection data (x₁, y₁, x_z, y₂) defining intersection of captured light-field rays, which correspond to the chief rays and the generating rays crossing the reference straight line of pixel beams 40, 50, with a pair of reference planes for parameterization P₁, P₂at respective depths custom-character ₁, ₂.

From calibration of the camera the following parameters can be determined: the custom-character centre of projection (x₃, y₃, ₃) the orientation of the optical axis of the camera and the distance f from the pinhole of the camera to the plane of the photosensor. The light-field camera parameters are illustrated in FIG. 10. The photosensor plane is located at depth _p. The pixel output of the photosensor is converted into geometrical representation of light-field rays. A light-slab comprising the two reference planes P₁and P₂is located at depths custom-character ₁and ₂, respectively, beyond ₃, at the other side of the centre of projection of the camera to the photosensor. By applying a triangle principle to the light rays, pixel coordinates (x_p, y_p, _p) recording the light projected from the array of microlenses can be mapped to ray parameters i.e. reference plane intersection points (x₁, y₁, x₂, y₂) by applying the following expression:

$x_{1} = \frac{z_{3} - z_{1}}{z_{3} - z_{p}} x_{p} + \frac{z_{1} - z_{p}}{z_{3} - z_{p}} x_{3}$

$y_{1} = \frac{z_{3} - z_{1}}{z_{3} - z_{p}} y_{p} + \frac{z_{1} - z_{p}}{z_{3} - z_{p}} y_{3}$

$x_{2} = \frac{z_{3} - z_{2}}{z_{3} - z_{p}} x_{p} + \frac{z_{1} - z_{p}}{z_{3} - z_{p}} x_{3}$

$y_{2} = \frac{z_{3} - z_{2}}{z_{3} - z_{p}} y_{p} + \frac{z_{1} - z_{p}}{z_{3} - z_{p}} y_{3}$

The above calculation may be extended to multiple cameras with different pairs of triplets (x_p, y_p, custom-character _p) (x₃, y₃, ₃):

In the case of a plenoptic camera, a camera model with an aperture is used and a light-field ray is described in the phase space as having an origin (x_p, y_p, custom-character _p) and a direction (x′₃, y′₃, 1). Its propagation unto the plane (x₃, y₃) at depth ₃can be described as a matrix transform. The lens will act as an ABCD matrix to refract the ray and another ABCD propagation matrix will bring the ray onto the light-slab reference planes P₁and P₂.

From this step intersection data (x₁, y₁, x₂, y₂) geometrically defining intersection of the chief rays and the generating ray crossing the reference straight line with reference planes P₁, P₂is obtained.

In step S804 2D ray a diagram graphically representing the intersection data (x₁, y₁, x₂, y₂) is obtained by ray diagram generator module 803.

FIG. 11 is a 2D ray diagram graphically representing intersection data (x₁, x₂) of light-field rays captured by a camera at location x₃=2 and depth custom-character ₃=2 with an aperture |A|<0.5. The data lines of the ray diagram used to parameterise are sampled by 256 cells providing an image of 256×256 pixels.

If the ray diagram illustrated in FIG. 11 is interpreted as a matrix, it can be seen that it is sparsely populated. If the rays were to be saved individually in a file instead of the 4D phase space matrix, this would require saving for each ray, at least 2 bytes (int16) for each position x_ior x₃plus 3 bytes for the color, i.e. 7 bytes per ray for a 2D slice light-field, and 11 bytes per ray for its full 4D representation. Even then, the rays would be stored randomly in the file which might be unsuitable for applications that need to manipulate the representation. The inventors of the present invention have determined how to extract only the representative data from the ray diagram matrix and to store the data in a file in a structured manner.

Since the light-field rays are mapped along data lines of the 2D ray diagram, it is more efficient to store parameters defining the data line rather than the line values themselves. Parameters defining the data line such as, for example, a slope defining parameter s and an axis intercept d may be stored with the set of light-field rays belonging to that data line.

This could require for example as little as 2 bytes for slope parameter s, 2 bytes for intercept parameter d and then only 3 bytes per ray, Moreover, the rays may be ordered along lines in the file. In order to set lines through matrix cells so called digital lines are generated which approximate the ray lines with minimum error.

To locate the data lines and to obtain slope parameters and intercept parameter d step S805 a Radon transform is performed by line detection module 804 on the ray diagram generated in step S804.

From the obtained slope parameter s and intercept parameter d a representative digital line is generated by digital line generation module 805 in step S806. In this step digital lines are generated by approximating an analytical line to its nearest grid point, for example by applying Bresenham's algorithm. Indeed Bresenham's algorithm provides a way to provide a digital line with minimal operation. Other methods may apply a fast discrete Radon transform calculation. An example of Bresenham application is one adapted from the following reference:

- http://www.cs.helsinki.fi/group/goa/mallinnus/lines/bresenh.html.

The digital format defines the data line by two points of a grid (0,d) and (N−1, s) d being the intersection corresponding to the value of x₂when x₁=0 and s being the slope parameter corresponding to the value of x₂when x₁=N−1. From the digital format generated the slope a of each individual line may be expressed as a function of d, N and s, as:

$a = \frac{s - d}{N - 1}$

where:

s∈{0, 1, . . . N−1} and d∈{0, 1, . . . , N−1}

FIG. 12 illustrates an example of a digital line generated by application of Bresenham's algorithm.

FIG. 13 illustrates a group of digital lines having the same slope a (or s−d) but different intercepts d, the group of data lines being contiguous. The group of data lines is referred to herein as a bundle of lines and corresponds to a beam resulting from the camera not being ideally a pinhole camera. Each line addresses different pixels. In other words, one pixel belongs only to a unique line of a bundle with the same slope but different intercepts. The upper and lower boundaries of the axis intersections d are given as d_maxand d_minrespectively.

Ray data parameterized by a sampled pair of lines (in 2D) and belonging to one camera, belong to a family of digital lines (beam) in the phase space used for representing the data. The header of the beam can simply contain the slope a and the thickness of the beam defined by the upper and lower boundaries of the axis intersections d_max−d_min. The ray values will be stored as RGB colors along digital lines whose header can be d and s. Void cells of the ray diagram in the sampled space do not need to be stored. Coordinates x1; x2 of the rays can be deduced from the parameters d, s and from the position of the cell along the digital line.

Parameters to be estimated from the light-field or from camera's geometry are the slope a the lower and upper bounds of the digital line intercepts (d_min, d_max), and the digital line parameters (d_i, s_i). The discrete Radon transform has already been discussed as a tool to measure the support location of the light-field in the ray diagram.

FIG. 14B shows the discrete Radon transform in the digital line parameter space (d, s) of the datalines of FIG. 14A. FIG. 14C is a zoom of the region of interest comprised in FIG. 14B. The beam of digital lines is located by the search for the maximum value parameters. There could be some offset between the geometrical center of symmetry of the DRT and the actual position of the maximum due to image content so that later on, an algorithm is used to pin-point the center of symmetry instead of the maximum. Then, the waist of the beam transform as shown on FIG. 13C is easy to find to give the values (d_min, d_max). Point (d_min=74, s=201) is the lower envelope of the beam of digital lines from FIG. 12A, and point (d_max=81, s=208) is the upper envelope of the beam of digital lines.

The equations of two orthogonal 2D sliced spaces from equation B is given as.

(z₂−z₃)(x₁+y₁)+(z₃−z₁)(x₂+y₂)=(z₂−z₁)(x₃+y₃) (C)

If a 2D slice for x_icoordinates is taken, the equation of the beam of lines where ray data through an aperture of size A at (x₃, y₃, z₃) will map is given as:

$\begin{matrix} x_{2} = \frac{(z_{3} - z_{2})}{(z_{3} - z_{1})} x_{1} + \frac{(z_{2} - z_{1})}{(z_{3} - z_{1})} (x_{3} \pm A) = m x_{1} + (d_{\max_{x}} - d_{\min_{x}}) & (D) \end{matrix}$

Similarly, if a 2D slice is taken for y_icoordinates:

$\begin{matrix} y_{2} = \frac{(z_{3} - z_{2})}{(z_{3} - z_{1})} y_{1} + \frac{(z_{2} - z_{1})}{(z_{3} - z_{1})} (y_{3} \pm A) = m y_{1} + (d_{\max_{y}} - d_{\min_{y}}) & (E) \end{matrix}$

As previously described, the values of m and d_max_x, d_min_x, d_max_y, d_min_ymay be evaluated in the discrete domain. To localize the characteristics of a light-field as defined by the format discussed previously, there is no need to perform a 4D discrete Radon transform (DRT). If two orthogonal 2D DRT are obtained, measurements can be performed of the slope m of the hyper-plane and the beam width of the digital hyper-planes where all data concentrates in the 4D ray-diagram.

This simpler procedure of location assumes a circular entrance pupil A so that d_max_x, d_min_x, d_max_y, d_min_ywill encompass all hyper-planes intercepts, some values written in the format will contain no values.

It would be interesting to obtain a format for the 4D case which is similar to what was proposed for the 2D case. To do so, it would be interesting to associate the 2D lines found on the Π(x₁, x₂), plane with the lines found on the Π(y₁, y₂) place, i.e., the lines that are the results of the intersection of the corresponding hyper plane with the two orthogonal slices of Π(x₁, x₂), and Π(y₁, y₂), From expressions D and E, it is known that the corresponding lines have the same slope m. This is the first parameter that associates each line in Π(x₁, x₂) to a line in Π(y₁, y₂), for a camera at a certain depth. If there are multiple cameras at the same depth (i.e., the case of FIG. 14A), there are three lines in Π(x₁, x₂), and three lines in Π(y₁, y₂), with the same estimated slope of m. The correspondences in the line offsets between the lines in these two planes are then determined. To do this, the formulation of the lines in expressions D and E are exploited. In particular, denoting

$k = \frac{z_{2} - z_{1}}{z_{3} - z_{1}},$

- the offsets are as follows:

$\begin{matrix} {\begin{matrix} k x_{3} + k A = d_{\max_{x}} \\ k x_{3} - k A = d_{\min_{x}} \end{matrix} and & (F) \\ {\begin{matrix} k y_{3} + k A = d_{\max_{y}} \\ k y_{3} - k A = d_{\min_{y}} \end{matrix} & (G) \end{matrix}$

The sets of the equations may be solved for k, x₃and y₃. Note that (X₃, y₃, z₃) correspond to the coordinates of the camera, or in other words the voxel where the corresponding bundle of light is focused into a circle of the radius A. We have supposed that the aperture on the plane positioned at z₃is circular, so that d_max_x−d_min_x=d_max_y−d_min_y, =2kA, and by solving the previous sets of equations:

$\begin{matrix} k = \frac{d_{\max_{x}} - d_{\min_{x}}}{2 A} & (G) \\ x_{3} = A \frac{d_{\max_{x}} + d_{\min_{x}}}{d_{\max_{x}} - d_{\min_{x}}} & (H) \\ y_{3} = A \frac{d_{\max_{y}} + d_{\min_{y}}}{d_{\max_{y}} - d_{\min_{y}}} & (I) \\ z_{3} = \frac{z_{2} + (k - 1) z_{1}}{k} & (J) \end{matrix}$

The digital lines may be scanned as before on Π(x₁, x₂) using the Bresenham digital lines; For each individual (x₁, x₂), value, the corresponding (y₁, y₂) values captured in the light-field are stored. To find such values, expression C is exploited. All the following are either known or estimated from expressions F and G x3; y3; z3; z1; z2

Moving on each line in Π(x₁, x₂), for each (x₁^q, x₂^q), the following relationship in (y₁, y₂) is obtained:

$y_{2} = \frac{z_{3} - z_{2}}{z_{3} - z_{1}} y_{1} + \frac{z_{3} - z_{2}}{z_{3} - z_{1}} x_{1}^{q} + \frac{z_{2} - z_{1}}{z_{3} - z_{1}} (x_{3} + y_{3}) - x_{2}^{q}$

$or, y_{2} = m y_{1} + m x_{1}^{q} + k (x_{3} + y_{3}^{*}) - x_{2}^{q} = m y_{1} + d_{off} (x_{1}^{q}, x_{2}^{q}, x_{3}, y_{3}^{*})$

For each point in Π(x₁, x₂), a collection of lines in Π(y₁, y₂) is saved. d_offcorresponds to the offset of the lines scanned and saved for (x₁^q, x₂^q). It is noted that:

d
_off(x₁^q,x₂^q)=mx₁^q+k(x₃+y₃*)−x₂^q

With reference to FIG. 12 each square is a (x₁^q, x₂^q), point, and for each one of these points, there is a set of Bresenham digital lines running out of the plane of the figure along a digital bundle defined by equation:

y
₂
=my
₁
+d
_off(x₁^q,x₂^q,x₃,y₃′) (K)

perpendicular to the depicted datalines, but in a 4D space.

An exemplary data format for a bundle of data lines per camera is illustrated in Table 1.

TABLE 1

Tags
Data

Light slab orientation 1
one tag w from (−x, +x, −y, +y, −z, +z)

Parametrization plane offset w₂− w₁(w₃
□ e

is relative to w₁) (float)

Width in number of
min u₁
max u₁
N
u_1min
u_1Max

u₁cells (integer)
value
value

(float)
(float)

Width in number of
min u₂
max u₂
M
u_2min
u_2Max

u₂cells (integer)
value
value

(float)
(float)

Width in number of
min v₁
max v₁
P
v_1min
v_1Max

v₁cells (integer)
value
value

(float)
(float)

Width in number of
min v₂
max v₂
Q
v_2min
v_2Max

v₂cells (integer)
value
value

(float)
(float)

Number of cameras

n

(integer)

Aperture size for

A₁

cam₁

Cam₁focus (3

u3₁
v3₁
w3₁

floats)

lowest cam₁
cam₁
cam₁number
d₁
m₁
l^x₁

intercept in (u₁, u₂)
steepness
of digital lines

in u₁, u₂

cam₁number

l^y₁

of digital lines

in v₁, v₂

data line

(RGB z_pa)₀
(RGB z_pa)₁
. . .
(RGB Z_pa)_n

(end₁= l^x₁*N*l^y₁*P) *

(3 bytes + 2floats)

. . .

. . .
. . .

. . .

. . .
. . .
. . .
. . .

Aperture size for

A_n

cam_n

Cam_nfocus (3

u3_n
v3_n
w3_n

floats)

lowest cam_n
cam_n
cam_nnumber
d_n
m_n
l^x_n

intercept in (u₁, u₂)
steepness
of digital lines

in u₁, u₂

cam_nnumber

l^y_n

of digital lines

in v₁, v₂

data line

(RGB z_pa)₀
(RGB z_pa)₁
. . .
(RGB z_pa)_n

(end_n= l^x_n*N*l^y_n*P) *

(3 bytes + 3floats)

Firstly general metadata of the 4D space is provided: including boundaries of the 4 axes x₁, x₂, y₁, y₂and their corresponding sampling. The number of cameras (bundles) is also provided. For each camera j the following parameters are saved:

- the size of the aperture: A_j, which corresponds to the diameter of the pupil of a pixel beam,
- the focus point of the camera: cam_j; focusPoint=(u₃, u₃, w₃)
- lowest d intercept in (x1_x,2)=d_j
- steepness=m_jl
- number of digital lines in (x₁, x₂)=l_j^x
- number of digital lines in (y₁, y₂)=l_j^xy

On each camera, for each (x^q₁; x^q₂), scanning is started on (y₁, y₂) with respect to expression (K) using the Bresenham digital lines, and the RGB values of each light-field rays are saved. In particular y₃*−A to y₃*+A and the corresponding d_offis calculated according to expression (K).

The same calculations are performed in the decoding step using the stored metadata. In particular, k is found using equation (H). Hence the format remains compact. There is no need to store four indexes for each ray in the system. It is noted that the sampling of the hyper-plane above is the sampling of the 4D ray-space and thus a single x1; y1; x2; y2 location is not missed. This is only one example of a systematic scanning of the 4D ray-space for saving all data in a very compact form. Other processes may of course be applied. The parametric form seems to be adapted to explore the hyper-plane because it permits an inter-leaved space exploration.

In the case of multiple cameras to work on data that contains several bundles of hyper-planes (several maxima in the Radon transform due to multiple cameras), a more complex algorithm may be used. As a pre-processing step, the parameters (m, k) are found for all the peaks in the radon transform of Π(x₁, x₂), and put in one set. The same is done for the peaks in (y₁, y₂) and the parameters are put in another set. Now in each iteration of the greedy algorithm, the maximum peak intensity is found in the 2D radon transform of (x₁, x₂) and the corresponding peak in (y₁, y₂) is found by matching the previously found parameters (m, k). After saving the data as mentioned in the last section, these peaks are cleaned from the radon transforms, and the next iteration is started, until nothing meaningful remains in the light-field

Although the present invention has been described hereinabove with reference to specific embodiments, the present invention is not limited to the specific embodiments, and modifications will be apparent to a skilled person in the art which lie within the scope of the present invention.

Many further modifications and variations will suggest themselves to those versed in the art upon making reference to the foregoing illustrative embodiments, which are given by way of example only and which are not intended to limit the scope of the invention, that being determined solely by the appended claims. In particular the different features from different embodiments may be interchanged, where appropriate.

METHOD AND AN APPARATUS FOR GENERATING DATA REPRESENTATIVE OF A PIXEL BEAM

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

Priority Claims (1)

PCT Information