This Application is a Section 371 National Stage Application of International Application No. PCT/FR2013/050284, filed Feb. 12, 2013, the content of which is incorporated herein by reference in its entirety, and published as WO 2013/121140 on Aug. 22, 2013, not in English.
The present invention relates in general to the field of image processing, and more precisely to coding three-dimensional (3D) images and sequences of three-dimensional images.
More particularly, the present invention relates to coding at least one holographic pattern storing a light signal representative of the light received by at least one perspective object in a scene.
The invention may apply particularly, but not exclusively, to video coding performed using current video coding schemes and amendments thereto (MPEG, H.264, H.264 SVC, H.264 MVC, etc.) or future amendments (ITU-T/VCEG (H.265) or ISO/MPEG (HEVC)), and to the corresponding decoding.
In known manner, a holographic pattern is the result of a holographic method that consists in recording a light wavefront coming from a scene or an object by recording the value of this wavefront of monochromatic type on a semitransparent plate. Such recording is made possible by eliminating time variations in the phase of the light front by interference with a reference light wavefront at the same frequency as said light wavefront.
The holographic pattern as created in this way is a monochromatic image containing details of wavelength order of the light wavefront coming from the scene or the object. The image may be modeled as the Fresnel transform of the wavefront coming from the scene or the object, which transform has the property of dispersing spatial frequency details. Information in a holographic pattern is thus very poorly localized, and therefore lends itself poorly to conventional coding techniques.
A first example of coding such patterns is described in the document by E. Darakis and T. Naughton, entitled “Compression of digital hologram sequence MPEG-4”, Proc. SPIE, Vol. 7358, 2009. In that document, coding is performed on a frame containing a set of time-indexed holographic patterns, said frame thus defining a holographic video. In the same manner as an image in a video sequence, each pattern of the frame is placed in succession as input to a video coder complying with the MPEG-4 standard in order to be subjected to block coding using a predictive scheme that makes use of spatial redundancies between patterns (intra coding) or by making use of time redundancies between the patterns (inter coding).
A second example of coding said patterns is described in the document by M. Liebling and M. Unser, entitled “Fresnelets: new multiresolution wavelet bases for digital holography”, IEEE Trans. Image Processing, Vol. 12, No. 1, January 2003. As in the above-mentioned document, a frame is coded using a set of time-indexed holographic patterns, said frame thus defining a holographic video. Each pattern of the frame is modeled by a function to which filter banks are applied in order to obtain a decomposition in a given function base, e.g. wavelets.
A drawback of such examples of coding holographic patterns is that they remain very expensive in terms of calculation and the coded data obtained is very expensive to transmit in a stream that is intended in particular to be decoded. In addition, such coding methods do not take account of the fact that holographic patterns include dispersed information that was originally well localized in the space domain.
In a first aspect, the present invention provides a method of coding at least one holographic pattern recording a light signal representative of the light received by at least one perspective object in a scene.
The coding method of the invention is remarkable in that it performs the steps consisting in:
By means of reconstructing the light signal in this way in the form of a plurality of rays in the space domain, instead of reconstructing said signal by applying an inverse Fresnel transform, it becomes possible to reduce in non-negligible manner the time and the complexity required for performing the calculations for coding a holographic pattern.
In addition, such reconstruction coupled to making use of rays in the form of a plurality of views respectively representing a plurality of perspectives of the same 3D scene thus makes it possible to take advantage in optimum manner of the multiple redundancies that exist in space and time between the different views in order to code the holographic pattern effectively.
By not including redundant coding information in the coded signal for transmission, i.e. information that there is no point in transmitting, it is possible to obtain a significant reduction in the cost of signaling.
In a particular implementation, the determining step consists in demodulating the light signal in amplitude and in frequency so as to deliver for each point xj(1≦j≦M) of the holographic pattern a set of N data pairs, the data in a pair under consideration being associated respectively with the amplitude and with the direction of a light ray passing via a point under consideration.
Such a provision serves to optimize the cost of coding significantly.
In another particular implementation, demodulation is performed by a Gabor transform or a wavelet transform.
Such transforms have the advantage of effectively localizing the spatial frequencies associated with a given position in the hologram.
In yet another particular implementation, during the step of generating the N views, for a current point xj(1≦j≦M) of the holographic pattern, the viewing plane is subdivided into a plurality of N partitions, with the centroid of each partition corresponding to the point of intersection with the viewing plane of the light ray passing through the point xj.
Such a provision has the advantage of associating clusters of pixels with the intensity value of a reference pixel.
In a variant of this other embodiment, during the step of generating the N views, for a current point x′j(1≦j≦M) of the holographic pattern, the viewing plane is subdivided recursively into four partitions until each subpartition obtained by the subdivision contains only one centroid, the centroid of each subpartition corresponding to the point of intersection with the viewing plane of the light ray passing through the point x′j.
Such a provision also has the advantage of allocating to pixel clusters the intensity value of a reference pixel.
According to yet another particular implementation, the step of compressing the multi-view image uses the following substeps:
Such a provision makes it possible during coding to optimize the elimination of space and time redundancies between the N views of the resulting multi-view image in order significantly to reduce the cost of coding.
In yet another particular implementation, the coding is of the multi-view coding (MVC) type.
Such a provision makes it possible for holographic patterns to have applied thereto the standard MVC technique that until now has been devoted to coding multi-view images.
In corresponding manner, the invention provides a coding device suitable for performing the coding method of the invention. The coding device of the invention is thus for coding at least one holographic pattern having stored thereon a light signal representative of the light received by at least one perspective object in a scene.
Such a coding device is remarkable in that it comprises processor means that are suitable for:
In another aspect, the invention provides a computer program including instructions for performing the coding method of the invention when it is executed on a computer. The invention also provides a computer program on a data medium, the program including instructions adapted to performing the coding method of the invention, as described above.
The program may use any programming language and be in the form of source code, object code, or code intermediate between source code and object code, such as in a partially compiled form, or in any other desirable form.
The invention also provides a computer readable data medium including instructions of a computer program as mentioned above.
The data medium may be any entity or device capable of storing the program. For example, the medium may comprise storage means such as read-only memory (ROM), e.g. a compact disk (CD) ROM, or a microelectronic circuit ROM, or magnetic recording means, e.g. a floppy disk or a hard disk.
Furthermore, the data medium may be a transmissible medium such as an electrical or optical signal suitable for being conveyed via an electrical or optical cable, by radio, or by other means. The program of the invention may in particular be downloaded from a network of the Internet type.
Alternatively, the data medium may be an integrated circuit in which the program is incorporated, the circuit being adapted to execute or to be used in the execution of the method in question.
The above-mentioned computer program and coding device present at least the same advantages as those conferred by the coding method of the present invention.
Other characteristics and advantages appear on reading about several preferred embodiments described with reference to the figures, in which:
There follows a description of an implementation in which the coding method of the invention is used to code a current holographic pattern MH as shown in
The holographic pattern MH may be single or it may form part of a set of patterns representing the same object and indexed in time, in other words a frame of a holographic video.
In the implementation of the invention, the coding method of the invention is performed in a coder device C0 shown in
A determination, first step C1 shown in
With reference to
By way of example, such demodulation may be performed by a Gabor transform, a wavelet transform, a development as a series of wavelets, etc.
The determination step C1 also includes a conversion second substep C12 during which the N resulting signals are converted into N respective rays of given amplitude and direction.
In the preferred implementation, the demodulation substep C11 performs a Gabor transform. To this end, at each point xj(1≦j≦M) of said current holographic pattern MH, Gabor filtering is applied in a set of 2D directions (θ0, . . . , θN). A set Sj of N amplitude and oriented spatial frequency data pairs is then obtained at each point xj(1≦j≦M) such that:
Sj={(aj1,fj1),(aj2,fj2), . . . , (ajN,fjN)}
when the amplitudes are greater than a given threshold.
At the end of the demodulation substep C11, M sets S1 to SM of data pairs are obtained.
Thereafter, the conversion substep C12 converts each of the M sets Sj to SM into M respective sets SC1 to SCM of amplitude and 3D direction data pairs. For a point xj of the current holographic pattern MH, the set:
SCj={(aj1,dj1),(aj2,dj2), . . . , (ajN,djN)}
is obtained in application of the equation sin(α)=λ/f associating the spatial frequency f with the diffraction direction α for an incident plane wave perpendicular to a plane containing said current holographic pattern MH, α being considered in an azimuth plane defined by said set of 2D directions (θ0, . . . , θN).
As an alternative, the demodulation substep C11 applies a 2D wavelet transform to each point xj(1≦j≦M) of said current holographic pattern MH. For this purpose, consideration is given to a generator wavelet φ and the transform:
ajN=∫φθ
in order to obtain sufficient discretization of said set of 2D directions (θ0, . . . , θN), with xc being a current point of the holographic pattern MH and T being an integration variable. The generator wavelet φ may be any function satisfying the conventional acceptability criterion:
where designates the Fourier transform of φ.
During a generation step C2 shown in
To this end, and as shown in
As shown in
In a first implementation as shown in
In a second implementation as shown in
During a following step C3 shown in
During a following compression step C4, as shown in
Such a step is performed by a compression software module CMP shown in
Given the particular arrangement of the N views making up the multi-view image IE, such compression is reliable and accurate since said compression makes use of redundancies both in space and in time between the N generated views.
In a first implementation shown in
To this end, the compression step C4 comprises substeps as described below.
During a substep C41 shown in
During a substep C42, shown in
During a substep C43, shown in
During a substep C44, shown in
During a substep C45, shown in
In a second implementation shown in
At the end of the compression step C4, a coded view of signal SG1 (coded using blocks predictive coding) or SG2 (coded using MVC coding) is produced. Said signal is stored or else transmitted to a decoder (not shown).
The above-described coding method is reiterated for a plurality of holographic patterns belonging to a determined frame.
When the coded signal SG1 or SG2 is transmitted to a decoder, which decoder is adapted to reconstitute said number N of views from the encoded data contained in the signal SG1 or SG2.
In a first implementation shown in
Alternatively, the N reconstructed views V1 to VN are transformed respectively into N individual wavefronts by means of a Fourier transform. The N wavefronts are then transported by a Fresnel transform into a common plane in order to reconstitute the initial holographic pattern MH. By way of example, such a method is described in the document by K. Yamamoto, T. Senoh, R. Oi, and T. Kurita, entitled “84K4-size computer generated hologram for 3D visual system using rendering technology”, 2010 4th International Universal Communication Symposium (IUCS), October 2010, pp. 193-196.
Naturally, the implementations described above are given purely by way of non-limiting indication, and numerous modifications can easily be performed by the person skilled in the art without thereby going beyond the ambit of the invention.
Number | Date | Country | Kind |
---|---|---|---|
12 51402 | Feb 2012 | FR | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/FR2013/050284 | 2/12/2013 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2013/121140 | 8/22/2013 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
20040021768 | Payne | Feb 2004 | A1 |
20080259419 | Wilson | Oct 2008 | A1 |
20100033780 | Gitter | Feb 2010 | A1 |
20100149610 | Schwerdtner | Jun 2010 | A1 |
20130250382 | Wiltshire | Sep 2013 | A1 |
Number | Date | Country |
---|---|---|
WO 2011121130 | Oct 2011 | DE |
Entry |
---|
Darakis et al. (Visually lossless compression of digital hologram sequences, SPIE vol. 7529, pp. 752912-1-8) (2010). |
Seo et al. (A new coding technique of Digital Hologram Video based on View-Point MCTF, Proc. of SPIE vol. 6392, pp. 63920S-1-8) (2006). |
Liebling et al., “Fresnelets: new multiresolution wavelet bases for digital holography”, IEEE Trans. Image Processing, vol. 12, No. 1, Jan. 2003. |
Plesniak et al., “Reconfigurable image projection holograms”, Optical Engineering 45(11), 115801 (Nov. 2006). |
Yamamoto et al., “84K4 size computer generated hologram for 3D visual system using rendering technology”, 2010 4th International Universal Communication Symposium (IUCS), Oct. 2010, pp. 193 196. |
International Search Report and Written Opinion dated May 28, 2013 for corresponding International Patent Application No. PCT/FR2013/050284, filed Feb. 12, 2013. |
Darakis et al.: “Compression of Digital Hologram SEquences using MPEG-4”, Proceedings of SPIE, Jan. 1, 2009 (Jan. 1, 2009), pp. 735811-735811-8 , XP055032800. |
Le Thanh Bang et al.: “Compression of Digital Hologram for 3D Object using Wavelet-Bandelets Transform”, Proceedings of SPIE, Jan. 1, 2011 (Jan. 1, 2011), pp. 795704-1, XP055032803. |
Number | Date | Country | |
---|---|---|---|
20150029566 A1 | Jan 2015 | US |