This Application is a Section 371 National Stage Application of International Application No. PCT/FR2013/051625, filed Jul. 8, 2013, the content of which is incorporated herein by reference in its entirety, and published as WO 2014/009644 on Jan. 16, 2014, not in English.
The present invention relates in general to the field of image processing and more particularly to coding digital images and digital image sequences.
The invention can thus apply in particular to the video coding performed by present-day and future video coders (ITU-T/ISO MPEG HEVC), and extensions thereto.
The HEVC standard that is currently being prepared and that is described in the document by: B. Bross, W. J Han, J. R. Ohm, G. J. Sullivan, and T. Wiegand, entitled “High efficiency video coding (HEVC) text specification draft 6”, Document JCTVC-H1003 of JCT-VC, San Jose, Calif., USA, February 2012; is similar to the preceding H.264 standard in the sense that it makes use of block partitioning of the video sequence. Nevertheless, the HEVC standard differs from the H.264 standard by the fact that the partitioning that is used complies with a tree structure referred to as a “quadtree”. For that purpose, and as shown in
In an HEVC compatible coder, the partitioning of the block CTBi is iterated to a predetermined partitioning depth level.
At the end of the above-mentioned successive partitionings of the block CTBi, as shown in
The size of said coding blocks may be selected in adaptive manner with the help of block partitioning complying with a quadtree type structure in which the leaves of the tree structure represent respectively the coding blocks CB1, CB2, . . . , CBj, . . . , CBM obtained at various partitioning depth levels.
With reference to
There exist various partitioning modes for a coding block CBj under consideration. By way of example,
After predictive coding of the coding block CBj under consideration, it may be partitioned once more into a plurality of smaller blocks TB1, TB2, . . . . TBv, . . . , TBQ (1≤v≤Q) which are referred to as “transform” blocks. Such partitioning complies with a quadtree structure known as a “residual quadtree” in which the leaves represent the respective coding blocks TB1, TB2, . . . , TBv, . . . , TBQ obtained at the various partitioning depth levels.
In the tree structure constituted in this way, there exist:
In an HEVC compatible coder, for a block CTBi under consideration, a plurality of different partitionings for that block are put into competition in the coder, i.e. different respective combinations of partitioning iterations are put into competition, in order to select the best partitioning, i.e. the partitioning that optimizes the coding of the block CTBi under consideration in application of a predetermined coding performance criterion, e.g. the data rate/distortion cost or an efficiency/complexity compromise, which are criteria well known to the person skilled in the art.
Once optimum partitioning of a block CTBi under consideration has been performed, a digital information sequence, e.g. a run of bits, that is representative of this optimum partitioning is transmitted in a stream that is to be read by a video decoder.
Such a stream likewise comprises:
In certain circumstances, it can happen that the partitioning into a plurality of subblocks as performed on a block that has already been coded and then decoded resembles the partitioning of the block CTBi under consideration that is to be coded. As a result, the digital information sequence representative of the partitioning of the block CTBi that is to be coded also resembles the digital information sequence representative of the partitioning of the block that has already been coded and then decoded. It should be understood that the digital information sequence representative of the partitioning of the block CTBi that is to be coded contains a non-negligible amount of digital information that represents high signaling cost, thereby reducing the performance of the coding.
The present invention provides, in one aspect, a method of coding a current image that has previously been partitioned into blocks, the method performing the following steps for a current block that is to be coded:
Such a coding method is remarkable in that it comprises the steps consisting in:
Such a provision thus makes it possible to encode only the difference between the partitioning of the current block and the reference partitioning, i.e. the partitioning of the block that has already been coded and then decoded, thereby significantly reducing the amount of digital information needed on decoding in order to reconstruct the partitioning of the current block.
In a particular implementation, the coding method includes a step consisting in transmitting a data signal to a decoder, which signal includes digital information indicating whether or not the partitioning prediction step has been performed.
Such a provision enables any present or future coders to have the option of activating or deactivating prediction of the partitioning of the current block, for an image sequence under consideration, for an image under consideration, or indeed for an image portion (or “slice”) under consideration. As a result, such a partitioning prediction method is particularly flexible since it can be adapted to the current video context.
In another particular implementation, when the above-mentioned partitioning prediction step is performed, the data signal includes the encoded residual digital information as mentioned above.
Such a provision makes it possible for the purpose of reconstructing the current block, to inform the decoder about the result of the difference between the partitioning of the current block and the reference partitioning.
In yet another particular implementation, the current block and the already coded and then decoded block are respectively partitioned at least once to obtain two partitioning depth levels that are less than or equal to a predetermined partitioning depth level, and the predetermined comparison criterion is as follows:
Such a provision makes it possible to reduce the cost of signaling when:
In yet another particular implementation, said comparison criterion is the following:
In a variant, the comparison criterion is as follows:
In yet another particular implementation, the coded and then decoded block has a pre-identified position:
Such a provision thus makes it possible to implement the invention in the context of coding a two-dimensional image or image portion equally well by using an intra prediction mode or by using an inter prediction mode.
With inter coding, the other image that has already been decoded may be an image that occurs before or after the current image.
Such a provision thus makes it possible to perform the invention in the context of coding a two-dimensional image while using either a uni-predictive or a bi-predictive scheme.
Finally, such a provision makes it possible for the invention to be performed for three-dimensional (3D) video coding, such as multiview 3D coding, or HEVC 3D coding, where proposals have been made to code a first image component relative to at least one second image component that has already been coded and then decoded.
It should be observed that the first and second image components are not pieces of one image but, in the meaning of the invention, represent two different views of a complete image.
Such first and second image components may for example be respectively a texture component and the associated depth component, as used in the new multiview video plus depth (MVD) video coding format that is presently under development.
Alternatively, the above-mentioned first and second image components may be respectively a depth component and the associated texture component.
Other types of first and second image components may naturally be envisaged.
Thus, the first and second image components may be respectively:
It is also possible to envisage coding a first image component relative to a second image component and a third image component. Under such circumstances, and by way of example:
The invention also provides a coder device for coding a current image that has previously been subdivided into blocks, the device being designed to perform the above-specified coding method. For a current block to be coded, such a device comprises:
Such a coder device is remarkable in that it comprises:
The invention also provides a data signal representative of an image coded using the above-specified coding method. Such a signal is remarkable in that it includes said encoded residual digital information, as mentioned above.
The invention also provides a method of decoding a data signal representative of a current image that has previously been partitioned into blocks and that has been coded, by performing for a current block for decoding a step that consists in initially reconstructing the partitioning of a decoded block.
Such a decoding method is remarkable in that it comprises the steps consisting in:
On receiving and reading coding information, and more particularly residual digital information representative of the difference between the partitioning of the current block and the reference partitioning, such a decoding technique makes it possible to reconstruct the current block by applying a prediction that is the inverse of the prediction performed in the above-specified coding method.
The invention also provides decoder device for performing the above-specified decoding method, said decoder device being adapted to decoding a data signal representative of a current image that has previously been partitioned into blocks and that has been coded, the device comprising, for a current block for decoding, reconstruction means suitable for initially reconstructing the partitioning of a decoded block.
Such a decoder device is remarkable in that it comprises:
The invention also provides a computer program including instructions for performing the coding method or the decoding method of the invention when the program is executed on a computer.
The program may use any programming language and be in the form of source code, object code, or code intermediate between source code and object code, such as in a partially compiled form, or in any other desirable form.
The invention also provides a computer readable data medium storing a computer program, the program including instructions adapted to performing the coding or the decoding method of the invention, as described above.
The data medium may be any entity or device capable of storing the program. For example, the medium may comprise storage means such as a read only memory (ROM), e.g. a compact disk (CD) ROM, or a microelectronic circuit ROM, or indeed magnetic recording means, e.g. a universal serial bus (USB) stick or a hard disk.
Furthermore, the data medium may be a transmissible medium such as an electrical or optical signal that may be conveyed by an electrical or optical cable, by radio, or by other means. The program of the invention may in particular be downloaded from an Internet type network.
Alternatively, the data medium may be an integrated circuit in which the program is incorporated, the circuit being adapted to execute or to be used in the execution of the above-specified coding or decoding method.
The above-mentioned coder device and the corresponding computer program present at least the same advantages as those imparted by the coding method of the present invention.
The above-mentioned decoder device, and corresponding computer program and data medium present at least the same advantages as those imparted by the decoding method of the present invention.
Other characteristics and advantages appear on reading the description of preferred implementations made with reference to the figures, in which:
1. A Coding Method
There follows a description of an implementation of the invention in which the coding method of the invention is used for coding an image or a sequence of images as a binary stream similar to that obtained by coding that complies for example with the HEVC 2D standard that is in preparation.
In this implementation, the coding method of the invention is implemented by way of example in software or hardware form by making modifications to a coder initially in compliance with the HEVC 2D standard. The coding method of the invention is shown in the form of an algorithm comprising steps C1 to C7, as shown in
In the embodiment of the invention, the coding method of the invention is implemented in a coder device CO as shown in
During a prior step C1 shown in
It should be observed that in the meaning of the invention, the term “block” has the same meaning as the “coding unit” in the terminology used in particular in the HEVC standard, e.g. in the document by: B. Bross, W. J Han, J. R. Ohm, G. J. Sullivan, and T. Wiegand, entitled “High efficiency video coding (HEVC) text specification draft 6”, Document JCTVC-H1003 of JCT-VC, San Jose, Calif., USA, February 2012.
In particular, such a coding unit groups together sets of pixels that are rectangular or square in shape and that are also known as blocks, or macroblocks, or indeed sets of pixels presenting other geometrical shapes.
During a step C2 shown in
During a step C3 shown in
In accordance with the invention, the block CTBu is partitioned at least once until reaching a partitioning depth level k1 that is less than or equal to a predetermined partitioning depth level kP (k1≥0, kP≥0).
Said partitioning is performed by a second partitioning software module MP2 shown in
The steps C2 to C3 are reiterated for all of the blocks CTB1, CTB2, . . . , CTBS.
An example of partitioning the block CTBu is shown in
In the example shown, the current block CTBu is partitioned to a depth level k1=2 that is set by the coder, said level being less than a predetermined partitioning depth level kP, such as for example kP=4. In the example shown, the partitioning depth level k1 is determined as being the level for which the final partitioning obtained for the selected block CTBu optimizes a coding performance criterion, in particular the data rate/distortion cost, for example.
With reference to
During a step C4 shown in
Such a step of producing binary sequences is performed by a data processor software module PSB shown in
The above-mentioned decoding step is performed by the decoding module MD, also shown in
In accordance with the invention, during a step C5 shown in
Such a prediction step C5 is performed in a prediction software module MPR shown in
By way of example, the selected block that has already been coded and then decoded is the block CTBri.
The selected block CTBri was previously partitioned into a plurality of coding subblocks Br1, Br2, . . . , Brf, . . . , BrG with Such partitioning complies with a quadtree type structure, as described above.
Other types of tree structure could naturally be envisaged.
In accordance with the invention, the partitioning of the block CTBri is performed at least once until a partitioning depth level k2 is obtained that is less than or equal to a predetermined partitioning depth level kP (k2≥0, kP≥0).
Said partitioning is performed by the second partitioning software module MP2 shown in
An example of the partitioning of the block CTBri is shown in
In the example shown, the partitioning of the current block CTBri is performed to a depth level k2=3 set by the coder and less than the predetermined partitioning depth level kP, such as kP=4. The partitioning depth level k2 is determined as being the level for which the final partitioning obtained for the selected block CTBri optimizes a coding performance criterion, in particular the data rate/distortion cost, for example.
With reference to
The partitioning of the block CTBri that has been coded and then decoded is represented in the form of a sequence of bits Sri.
The table below shows an example of content for a sequence Sri representative of the partitioning of the coded and then decoded block CTBri and content for the sequence Su representative of the partitioning of the current block CTBu.
For the sequence Sri:
For the sequence Su:
The last three bits of the sequence Su are offset to the end of the sequence in order to indicate that there is a difference of partitioning between the subblock B7 and the subblock Br7 of block CTBri.
Said prediction step C5 shown in
At the end of the prediction step C5, a sequence S′u is obtained that is representative of the predicted partitioning of each current block CTBu.
The prediction step C5 is reiterated for all of the sequences S1 to SS so as to obtain sequences S′2 to S′S that are representative of the predicted partitioning of the current blocks CTBi to CTBS.
Whether an HEVC compatible coder has or has not performed said prediction step is signaled by digital information such as a “1” bit to indicate that the prediction of the partitioning has been performed, or else a “0” bit to indicate that the prediction of the partitioning has not been performed. Depending on circumstances, one such bit may be signaled per image sequence, or per image, or indeed per image portion or “slice”.
In accordance with a first comparison criterion, the following steps are performed:
To this end, said comparison is performed in compliance with the table T1 below down to the partitioning depth level kP, which is equal to 4 in the presently-described example:
In this example, the residual information with the first value is a “0” bit and the residual information with a second value is “1” bit.
The table T1 comprises:
The partitionings at equal partitioning depth level are compared in a predetermined order for ordering the subblocks of the current block CTBu and of the coded and then decoded block CTBri such as for example the “raster scan” order that is well known to the person skilled in the art.
At the end of the comparison in accordance with table T1, the sequence of residual bits representative of the predicted partitioning for the current block CTBu is as follows:
The last “0” bit in the sequence S′u indicates that neither of the blocks CTBu and CTBri reach the partitioning depth level kP=4.
As an alternative, another convention for bit values may be used in table T1, such as that given below:
In application of a second comparison criterion, which is similar to the above-described first comparison criterion, said comparison is performed in compliance with table T2 below down to a partitioning depth level kP, which is equal to 4 in the presently-described example:
In this example, the residual information having a first value is a “0” bit, and the residual information having a second value is a “1” bit followed by a bit giving the location of the different partitioning of the subblock of the current block CTBu or of the coded and then decoded block CTBri.
Table T2 comprises:
At the end of the comparison in compliance with table T2, the sequence of residual bits representative of the predicted partitioning of the current block CTBu is as follows:
In this sequence S′u, and with reference to
As an alternative, another convention for bit values may be used in table T2, as given below:
In application of a third comparison criterion, said comparison is performed in compliance with table T3 below:
Table T3 comprises:
The above-mentioned sequences Sri and Su are then compared with each other, the first bit of the sequence Sri being compared with the first bit of the sequence Su, and so on to the last bit of each of the sequences.
At the end of the comparison performed in accordance with table T3, the sequence of residual bits representative of the predicted partitioning of the current block CTBu is as follows:
In application of a fourth comparison criterion, said comparison is performed in compliance with table T4 below:
Table T4 comprises:
The above-mentioned sequences Sri and Su are then compared with each other, the first bit of the sequence Sri being compared with the first bit of the sequence Su, and so on to the last bit of each of the sequences.
At the end of the comparison in accordance with table T4, the sequence of residual bits representative of the predicted partitioning of the current block CTBu is as follows:
In this sequence S′u and with reference to
In application of a fifth comparison criterion, which is a combination of the above-described second and third comparison criteria, said comparison is performed in compliance with table T5 below, starting from the second partitioning depth level and continuing to the partitioning depth level kP which is equal to 4 in the presently-described example:
Table T5 uses the same conventions as table T3. It differs therefrom by having an additional column labeled “depth level bit” indicating that when at equal partitioning depth level a subblock of the current block CTBu and a corresponding subblock of the coded and then decoded block CTBri have been partitioned differently, then the depth level bit is set to “1” to indicate explicitly a partitioning difference between a subblock of the current block CTBu and a subblock of the coded and then decoded block CTBri.
At the end of the comparison in accordance with table T5, the sequence of residual bits representative of the predicted partitioning of the current block CTBu is as follows:
In this sequence, and with reference to
With reference to
Such an encoding step is performed by an entropic coder CE shown in
Thereafter, during a step C7 shown in
The data signal F is constructed in a stream-constructing software module CF as shown in
The data signal F is then transmitted over a communications network (not shown) to a remote terminal. The remote terminal has a decoder that is described in greater detail below.
With reference to
With reference to
The index for the selected block CTBri is transmitted in the data signal F.
With reference to
In a possible alternative to the example shown in
With reference to
Such first and second image components may for example be respectively a texture component and an associated depth component, as implemented in the new video coding format known as multiview video plus depth (MVD) which is currently under development.
Alternatively, the above-mentioned first and second image components may be respectively a depth component and its associated texture component.
It is naturally possible to envisage using other types of first and second image component.
Thus, the first and second image components could respectively be:
It is also possible to envisage coding a first image component relative to a second image component and a third image component. By way of example, this might give:
2. A Decoding Method
An implementation of the invention is described below in which the decoding method of the invention is used for decoding an image sequence in a binary stream similar to that obtained by coding in compliance, for example, with the HEVC 2D standard presently under development.
In this implementation, the decoding method of the invention is implemented for example in software or hardware manner by modifying a decoder that was initially in compliance with the HEVC 2D standard. The decoding method of the invention is shown in the form of an algorithm comprising steps D1 to D2 as shown in
In the implementation of the invention, the decoding method of the invention is implemented in a decoder device DO as shown in
During a step D1 shown in
Such a step is performed by a selection software module MS, as shown in
During a step D2 shown in
Such a step is performed by the entropic decoder module DE, as shown in
During that same step D2 shown in
More precisely, such reconstruction consists in performing prediction that is the inverse of the prediction performed in the prediction step C5 of
For this purpose, the sequence Sri representative of the previously reconstructed partitioning of the decoded block CTBri is added to the sequence S′u as read, thereby providing the reconstructed sequence Su representative of the partitioning of the current block CTBu.
The substeps D21 and D22 may be performed simultaneously, or alternatively they may be performed in either order.
The steps D1 and D2 are reiterated for all of the sequences S′1 to S′S identified in the data signal F.
Naturally, the above-described implementations are given purely by way of non-limiting indication and numerous modifications may easily be applied thereto by the person skilled in the art without thereby going beyond the ambit of the invention.
Number | Date | Country | Kind |
---|---|---|---|
12 56604 | Jul 2012 | FR | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/FR2013/051625 | 7/8/2013 | WO | 00 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2014/009644 | 1/16/2014 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
5351095 | Kerdranvat | Sep 1994 | A |
7499492 | Ameres et al. | Mar 2009 | B1 |
20110274170 | Paz | Nov 2011 | A1 |
20110310976 | Wang | Dec 2011 | A1 |
20120269274 | Kim | Oct 2012 | A1 |
20140240456 | Kang | Aug 2014 | A1 |
20150049817 | Liu | Feb 2015 | A1 |
20150208067 | Jung | Jul 2015 | A1 |
Number | Date | Country |
---|---|---|
2011046607 | Apr 2011 | WO |
2011061089 | May 2011 | WO |
WO 2011061089 | May 2011 | WO |
2011127966 | Oct 2011 | WO |
WO 2011127966 | Oct 2011 | WO |
Entry |
---|
Bross, Benjamin et al: “High Efficiency Video Coding (HEVC) Text Specification Draft 6”, Joint-Collaborative Team on Video Coding (JCT-VC), Document JCTVC-H1003, San Jose, CA, USA, Feb. 2012. |
International Search Report and Written Opinion dated Sep. 30, 2013 for corresponding International Patent Application No. PCT/FR2013/051625, filed Jul. 8, 2013. |
Chia-Yuan Teng et al: “A New Quadtree Predictive Image Coder”, Proceedings of the International Conference on Image Processing. (ICIP). Washington, Oct. 23-26, 19956; vol. 2, Oct. 23, 1995, pp. 73-76, XP010196975. |
Merkle P. Et al: “The Effects of Multiview Depth Video Compression on Multiview Rendering,” Signal Processing, Image Communication, Elsevier Science Publishers, Amsterdam, NL, vol. 24, No. 1-2, Jan. 1, 2009, pp. 73-88, XP025884346. |
Number | Date | Country | |
---|---|---|---|
20150208067 A1 | Jul 2015 | US |