The invention relates to a method and a device for displaying a sequence of pictures. More particularly, the invention relates to a method for displaying a sequence of pictures arising in the form of a multilayer stream and a display device for displaying such a sequence of pictures.
In order to allow a user to change the program displayed on its terminal, i.e. change from the display of a first sequence of pictures to a second sequence of pictures, it is known to add in the stream representative of the second sequence of pictures of the random access points (RAP). In order to increase the rapidity of access to the second stream from the first stream representative of the first sequence of pictures, it is necessary to distribute into this second stream random access points in a close manner. Such a solution has the disadvantage of increasing the bitrate of the second stream into which these random access points are inserted.
Moreover, it is known to represent a sequence of pictures in the form of a multilayer stream comprising a base layer representative of the sequence pictures at a first resolution and/or quality, called pictures of the base layer, and at least one enhancement layer representative of the pictures of said sequence at a second resolution and/or quality, called pictures of the enhancement layer. In the particular case where the second sequence of pictures that the user wants to display on its terminal comes in the form of a multilayer stream, it is known to add random access points both in the base layer and in the enhancement layer. Generally, random access points are distributed in the base layer, either in a more frequent manner, or in a similar manner as in the enhancement layer in order to limit the increase of the multilayer stream bit rate and to accelerate the display of the second stream. For example, it is recommended to insert random access points into the base layer every 2 seconds, or even every 500 milliseconds in the case where a rapid access is required, whereas random access points can be inserted into the enhancement layer on average at least every 5 seconds.
When the user indicates to the terminal that it wants to display the second sequence of pictures, the terminal waits for the arrival of a random access point of the second sequence of pictures. A terminal generally comprises a decoding device linked to a display device (e.g. a Set-Top-Box (STB) linked to a SDTV or HDTV screen). In the case where the base layer comprises more frequent random access points than the enhancement layer, the terminal therefore decodes the data of the base layer until the arrival and decoding of a random access point of the enhancement layer. From the data thus decoded, the decoding device reconstructs the corresponding pictures of the base layer. These are then displayed on the display device. After the decoding of the random access point of the enhancement layer, the terminal decodes the enhancement layer to reconstruct the pictures of the enhancement layer. These are then displayed on the display device.
Such a display is, however, unsatisfactory from a visual point of view since, at the moment when the random access point of the enhancement layer is decoded, the terminal changes from the display of the pictures of the base layer to the pictures of the enhancement layer, which can correspond to a sudden jump in terms of quality and/or content.
The purpose of the invention is to compensate for at least one disadvantage of the prior art.
For this purpose, the invention relates to a method to display a sequence of pictures coming in the form of a multilayer stream comprising a base layer representative of the pictures of the sequence at a first resolution and/or first quality, called pictures of the base layer, and at least one enhancement layer representative of the pictures of the sequence at a second resolution and/or second quality, called pictures of the enhancement layer. The base and enhancement layers comprise random access points. The method comprises the following steps:
According to a particular embodiment, the processing step is a filtering step of each of the pictures of the enhancement layer at an intermediate resolution and/or quality gradually increasing over a predefined time interval from the first resolution and/or first quality to the second resolution and/or second quality.
According to a particular characteristic, the filtering step is a sub-sampling step.
According to another particular embodiment, the processing step is a reframing step of each of the pictures of the enhancement layer by a cropping window, the size of which gradually increases over a predefined time interval between the size of a cropping window associated with the pictures of the base layer and the size of the pictures of the enhancement layer.
According to a particular characteristic, the processing step further comprises a sub-sampling step.
The invention also relates to a method for displaying a sequence of pictures coming in the form of a multilayer stream comprising a base layer representative of the pictures of the sequence at a first resolution and/or first quality, called pictures of the base layer, and at least one enhancement layer representative of the pictures of the sequence at a second resolution and/or second quality, called pictures of the enhancement layer, the base and enhancement layers comprising random access points. The device comprises:
The invention will be better understood and illustrated by means of non-restrictive embodiments and advantageous implementations, with reference to the accompanying drawings, wherein:
The invention relates to a method to display a multilayer stream representative of a sequence of pictures comprising at least two layers. According to the invention, the various layers of the multilayer stream can be fully coded independently from each other. The multilayer stream can be a scalable stream comprising a base layer representative of the pictures of the sequence at a first resolution and/or quality, called pictures of the base layer, and at least one enhancement layer representative of the pictures of the sequence at a second resolution and/or quality, called pictures of the enhancement layer. However, the invention is not limited in any way to this configuration and the “multilayer stream” term is to be understood in the widest sense. It also includes the simulcast streams and the multi-view streams.
A scalable stream is for example a stream complying with the SVC video coding standard described in the JVT-AC205 document published in October 2008 and entitled Joint Draft ITU-T Rec. H.264/ISO/IEC 14496-10/Amd.3 Scalable video coding. However, the invention is not limited in any way to this standard and can be used with any multilayer stream to improve the visual comfort of the user during the display of a sequence of pictures, particularly when changing from the display of the pictures of the base layer to the pictures of the enhancement layer. The pictures of the base layer have a size (w,h), where w is the width and h the height and the pictures of the enhancement layer have a size (W,H), where W is the width and H the height. The picture content of the pictures of the base layer can be identical to the pictures of the enhancement layer even if they have a lower resolution. This is the case when the pictures of the base layer are generated by sub-sampling of the pictures of the enhancement layer. However, the picture contents can be different as illustrated in
According to a first embodiment described with reference to
At step 100, the base layer is decoded from one of its random access points (corresponding to time T1 in
At step 110, the reconstructed pictures of the base layer are displayed. This step can require a spatial filtering of the pictures of the base layer to put them at the resolution of the display device. This is particularly the case if the pictures of the base layer are in the 720p format and must be displayed on a screen in the 1080p high definition (HD) format. In this particular case, the pictures of the base layer are up-sampled before display.
At step 120, if a random access point of the enhancement layer is decoded, then the method continues at step 130 otherwise it resumes at step 100.
At step 130, the enhancement layer is decoded from the random access point (corresponding to time T2 in
At step 140, the reconstructed pictures of the enhancement layer are processed. They are gradually sub-sampled over a predefined period of time, e.g. 500 ms. In
At step 150, the processed pictures of the enhancement layer are displayed. This step can require a spatial filtering of the processed pictures of the enhancement layer to put them at the resolution of the display device. This is particularly the case if the processed pictures of the enhancement layer have a resolution lower than that of the screen on which they must be displayed. In this particular case, the processed pictures of the enhancement layer, i.e. after sub-sampling, are up-sampled before their display.
This embodiment is preferentially used when the picture content of the pictures of the base layer and of the enhancement layer is identical, i.e. when no reframing tool is used to generate the pictures of the base layer from the pictures of the enhancement layer as illustrated in
According to a second embodiment described with reference to
At step 100, the base layer is decoded from one of its random access points (corresponding to time T1 in
At step 110, the reconstructed pictures of the base layer are displayed. This step can require a spatial filtering of the pictures of the base layer to put them at the resolution of the display device. This is particularly the case if the pictures of the base layer are in the 720p format and must be displayed on a screen in the 1080p high definition (HD) format. In this particular case, the pictures of the base layer are up-sampled before display.
At step 120, if a random access point of the enhancement layer is decoded, then the method continues at step 130 otherwise it resumes at step 100.
At step 130, the enhancement layer is decoded from the random access point (corresponding to time T2 in
At step 140, the reconstructed pictures of the enhancement layer are processed. They are gradually reframed over a predefined period of time, e.g. 500 ms. In
The following picture of the enhancement layer is reframed at the size (w+dw,h+dh), i.e. at a size slightly higher than that of the pictures of the base layer. The method is repeated on each reconstructed picture of the enhancement layer by increasing the size of the cropping window each time until reaching the size (H, W) of the pictures of the enhancement layer. At each new picture of the enhancement layer, the cropping window size can be increased by dh pixels in height and dw pixels in width, e.g. dh=4 and dw=4. Thus, the additional picture content between the pictures of the base layer and the pictures of the enhancement layer is gradually added over a period of time between T2 and T3 instead of being suddenly added if, at time T2, there is a direct change from the display of the pictures of the base layer to the display of the pictures of the enhancement layer.
At step 150, the reframed pictures of the enhancement layer are displayed. This step can require a spatial filtering of the reframed pictures of the enhancement layer to put them at the resolution of the display device. This is particularly the case if the reframed pictures of the enhancement layer have a resolution lower than that of the screen on which they must be displayed. In this particular case, the reframed pictures of the enhancement layer are up-sampled before display.
A third embodiment described with reference to
At step 100, the base layer is decoded from one of its random access points (corresponding to time T1 in
At step 110, the reconstructed pictures of the base layer are displayed. This step can require a spatial filtering of the pictures of the base layer to put them at the resolution of the display device. This is particularly the case if the pictures of the base layer are in the 720p format and must be displayed on a screen in the 1080p high definition (HD) format. In this particular case, the pictures of the base layer are up-sampled before display.
At step 120, if a random access point of the enhancement layer is decoded, then the method continues at step 130 otherwise it resumes at step 100.
At step 130, the enhancement layer is decoded from the random access point (corresponding to time T2 in
At step 140, the pictures of the enhancement layer are processed. They are gradually reframed and sub-sampled over a predefined period of time, e.g. 500 ms. In
The picture of the enhancement layer reframed in this way is then sub-sampled at the size (w,h) of the picture of the base layer.
The following picture of the enhancement layer is reframed at the size (w′+dw,h′+dh) then sub-sampled at a size (w+dw, h+dh). The method is repeated on each reconstructed picture of the enhancement layer by increasing the size of the cropping window and of the sub-sampled picture each time until reaching the size (H, W) of the pictures of the enhancement layer. Thus, the additional picture content between the pictures of the base layer and the pictures of the enhancement layer is gradually added over a period of time between T2 and T3 instead of being suddenly added if, at time T2, there is a direct change from the display of the pictures of the base layer to the display of the pictures of the enhancement layer. Likewise, the quality of the pictures displayed between the pictures of the base layer and the pictures of the enhancement layer gradually increases over a period of time between T2 and T3 instead of suddenly changing if, at time T2, there is a direct change from the display of the pictures of the base layer to the display of the pictures of the enhancement layer.
At step 150, the processed pictures of the enhancement layer are displayed. This step can require a spatial filtering of the processed pictures of the enhancement layer to put them at the resolution of the display device. This is particularly the case if the processed pictures of the enhancement layer have a resolution lower than that of the screen on which they must be displayed. In this particular case, the reframed pictures of the enhancement layer are up-sampled before display.
These last two embodiments are preferentially used when the picture content of the pictures of the base layer and of the enhancement layer is different, particularly when the pictures of the base layer are generated from a part only of the pictures of the enhancement layer, said part being delimited in the pictures of the enhancement layer by a cropping window as illustrated in
A fourth embodiment is described with reference to
At step 100, the base layer is decoded from one of its random access points (corresponding to time T1 in
At step 110, the reconstructed pictures of the base layer are displayed. This step can require a spatial filtering of the pictures of the base layer to put them at the resolution of the display device. This is particularly the case if the pictures of the base layer are in the 720p format and must be displayed on a screen in the 1080p high definition (HD) format. In this particular case, the pictures of the base layer are up-sampled before display.
At step 120, if a random access point of the enhancement layer is decoded, then the method continues at step 130 otherwise it resumes at step 100.
At step 130, the enhancement layer is decoded from the random access point (corresponding to time T2 in
At step 140, the pictures of the enhancement layer are processed. They are gradually filtered over a predefined period of time, e.g. 500 ms. In
The following picture of the enhancement layer is filtered so as to obtain an intermediate quality between that of the picture of the base layer and that of the picture of the enhancement layer. This filtering can be performed by sub-sampling at a lower resolution (w″+dw″,h″+dw″) followed by up-sampling at the resolution (W,H). The method is repeated on each picture of the enhancement layer reconstructed by increasing the quality of the filtered picture each time until reaching the quality of the pictures of the enhancement layer. Thus, the picture quality between the pictures of the base layer and the pictures of the enhancement layer is gradually improved over a period of time between T2 and T3 instead of being suddenly improved if, at time T2, there is a direct change from the display of the pictures of the base layer to the display of the pictures of the enhancement layer.
This embodiment is preferentially used when the pictures of the base layer and of the enhancement layer have the same resolution, i.e. when (w, h)=(W, H) but when the quality/fidelity of the pictures of the base layer in comparison with the pictures of the original sequence (also source sequence) is lower than that of the pictures of the enhancement layer. The gradual processing of the pictures of the enhancement layer during the transitional period between T2 and T3 is performed in such a way that the quality of the processed pictures gradually increases between the quality of the pictures of the base layer before or at time T2 and the quality of the pictures of the enhancement layer at or after time T3.
These four embodiments advantageously allow the visual comfort on display to be improved. Indeed, the transition between the display of the pictures of the base layer and the pictures of the enhancement layer is gradual in terms of content and/or quality, i.e. fidelity to the pictures of the source sequence. Note that, for these four embodiments, it is possible to display at time T2 either the picture of the corresponding base layer or a picture of the processed enhancement layer. Likewise, it is possible to display at time T3 either a picture of the processed enhancement layer or the corresponding picture of the enhancement layer.
The method according to the invention is advantageously used to improve the display from a visual point of view in the case where a user wants to change from the display of a first sequence to a second sequence. Hence, with reference to
At step 80, if a random access point is decoded for the base layer, then the method continues at step 100 otherwise it continues at step 90.
At step 90, a predefined picture is displayed on the screen as long as a random access point of the base layer is not decoded. Steps 100 to 150 are identical to steps 100 to 150 previously described with reference to one of the embodiments and are not further described.
The predefined picture is for example a black image or still the last picture of sequence A displayed before the receipt of a signal indicating the user's will to change sequences.
The invention also relates to a display device 20 represented in
Number | Date | Country | Kind |
---|---|---|---|
0951298 | Mar 2009 | FR | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP2010/052487 | 2/26/2010 | WO | 00 | 6/25/2012 |