This application claims priority from European Patent Application No. 17306267.0, entitled “METHOD AND TERMINAL FOR TILE SELECTION AMONGST A SET OF TILES”, filed on Sep. 26, 2017, the contents of which are hereby incorporated by reference in its entirety.
The present disclosure relates to the streaming of Virtual Reality (VR) multimedia content (such as VR 360° videos or VR panoramic videos) to an end device through a delivery network, and especially to a tile based video streaming.
This section is intended to introduce the reader to various aspects of art, which may be related to various aspects of the present disclosure that are described and/or claimed below. This discussion is believed to be helpful in providing the reader with background information to facilitate a better understanding of the various aspects of the present disclosure. Accordingly, it should be understood that these statements are to be read in this light, and not as admissions of prior art.
VR 360° videos offer an immersive experience wherein a user can look around using a VR head-mounted display (HMD) or can navigate freely within a scene on a flat display by controlling the viewport with a controlling apparatus (such as a mouse or a remote control).
Such a freedom in spatial navigation requires that the whole 360° scene is delivered to a player (embedded within the HMD or TV set) configured to extract the video portion to be visualized depending on the position of the viewport within the scene. Therefore, a high throughput is necessary to deliver the video. Indeed, it is commonly admitted that a physical space field of vision surrounded by the 360° horizontal direction and 180° vertical direction can be entirely covered by a user within a minimum set of twelve viewports. To offer an unrestricted VR 360° video service in 4K resolution, a video stream equivalent to twelve 4K videos has to be provided.
Therefore, one main issue relies on the efficient transmission of VR 360° videos over bandwidth constrained network with an acceptable quality of immersive experience (i.e. avoiding freeze screen, blockiness, black screen, etc.). Currently, for delivering a VR 360° video service in streaming, a compromise has to be reached between immersive experience, resolution of video and available throughput of the content delivery network.
The majority of known solutions streaming VR 360° videos provides the full 360° scene to the end device, but only less than 10% of the whole scene is presented to the user. Since delivery networks have limited throughput, the video resolution is decreased to meet bandwidth constraints.
Other known solutions mitigate the degradation of the video quality by reducing the resolution of the portion of the 360° scene arranged outside of the current viewport of the end device. Nevertheless, when the viewport of the end device is moved upon user's action to a low resolution area, the displayed video suffers from a sudden degradation.
The present disclosure has been devised with the foregoing in mind.
The disclosure concerns a method for selecting, at a terminal, a tile amongst a set of tiles associated with a multimedia content requested by the terminal, said multimedia content being divided into a plurality of tile segments, said method comprising:
In an embodiment, the processing parameter can be a scalar product of a distance vector with a movement vector, and the selection criterion can be met when the scalar product of an identified tile is higher than the scalar products of the other identified tiles.
In an embodiment, the distance vector can be defined between said next position of the viewport and a center of an identified tile, and the movement vector can be defined from the current position of the viewport and the next position of the viewport.
In an embodiment, the processing parameter can be a distance between said next position of the viewport and a center of an identified tile, and the selection criterion can be met when the distance of an identified tile is smaller than the distance of the other identified tiles.
In an embodiment, the identified tiles can comprise at least the viewport.
In an embodiment, each identified tile can present an overlap with a tile associated with the current tile segment.
In an embodiment, the estimation of the trajectory of the viewport can be obtained by linear approximation.
In an embodiment, the next position of the viewport can be obtained by extrapolation from the estimated trajectory.
In an embodiment, the selecting of an identified tile can rely on the determination of a scalar product except:
The present disclosure further concerns a terminal configured to select a tile amongst a set of tiles associated with a multimedia content requested by the terminal, said multimedia content being divided into a plurality of tile segments, wherein said terminal comprises one memory and one processing circuitry configured to perform:
In an embodiment, the processing parameter can be a scalar product of a distance vector with a movement vector, and the selection criterion can be met when the scalar product of an identified tile is higher than the scalar products of the other identified tiles.
In an embodiment, the distance vector can be defined between said next position of the viewport and a center of an identified tile, and the movement vector can be defined from the current position of the viewport and the next position of the viewport.
In an embodiment, the processing parameter can be a distance between said next position of the viewport and a center of an identified tile, and the selection criterion can be met when the distance of an identified tile is smaller than the distance of the other identified tiles.
In an embodiment, the identified tiles can comprise at least the viewport.
In an embodiment, each identified tile can present an overlap with a tile associated with the current tile segment.
In an embodiment, the estimation of the trajectory of the viewport can be obtained by linear approximation.
In an embodiment, the next position of the viewport can be obtained by extrapolation from the estimated trajectory.
Besides, the present disclosure is further directed to a non-transitory program storage device, readable by a computer, tangibly embodying a program of instructions executable by the computer to perform a method for selecting, at a terminal, a tile amongst a set of tiles associated with a multimedia content requested by the terminal, said multimedia content being divided into a plurality of tile segments, which comprises:
The present disclosure also concerns a computer program product which is stored on a non-transitory computer readable medium and comprises program code instructions executable by a processor for implementing a method for selecting, at a terminal, a tile amongst a set of tiles associated with a multimedia content requested by the terminal, said multimedia content being divided into a plurality of tile segments, which comprises:
The method according to the disclosure may be implemented in software on a programmable apparatus. It may be implemented solely in hardware or in software, or in a combination thereof.
Some processes implemented by elements of the present disclosure may be computer implemented. Accordingly, such elements may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.) or an embodiment combining software and hardware aspects that may all generally be referred to herein as “circuit”, “module” or “system”. Furthermore, such elements may take the form of a computer program product embodied in any tangible medium of expression having computer usable program code embodied in the medium.
Since elements of the present disclosure can be implemented in software, the present disclosure can be embodied as computer readable code for provision to a programmable apparatus on any suitable carrier medium. A tangible carrier medium may comprise a storage medium such as a floppy disk, a CD-ROM, a hard disk drive, a magnetic tape device or a solid state memory device and the like.
The disclosure thus provides a computer-readable program comprising computer-executable instructions to enable a computer to perform the method for tile selection according to the disclosure.
Certain aspects commensurate in scope with the disclosed embodiments are set forth below. It should be understood that these aspects are presented merely to provide the reader with a brief summary of certain forms the disclosure might take and that these aspects are not intended to limit the scope of the disclosure. Indeed, the disclosure may encompass a variety of aspects that may not be set forth below.
The disclosure will be better understood and illustrated by means of the following embodiment and execution examples, in no way limitative, with reference to the appended figures on which:
Wherever possible, the same reference numerals will be used throughout the figures to refer to the same or like parts.
The following description illustrates the principles of the present disclosure. It will thus be appreciated that those skilled in the art will be able to devise various arrangements that, although not explicitly described or shown herein, embody the principles of the disclosure and are included within its scope.
All examples and conditional language recited herein are intended for educational purposes to aid the reader in understanding the principles of the disclosure and are to be construed as being without limitation to such specifically recited examples and conditions.
Moreover, all statements herein reciting principles, aspects, and embodiments of the disclosure, as well as specific examples thereof, are intended to encompass both structural and functional equivalents thereof. Additionally, it is intended that such equivalents include both currently known equivalents as well as equivalents developed in the future, i.e., any elements developed that perform the same function, regardless of structure.
Thus, for example, it will be appreciated by those skilled in the art that the block diagrams presented herein represent conceptual views of illustrative circuitry embodying the principles of the disclosure. Similarly, it will be appreciated that any flow charts, flow diagrams, state transition diagrams, pseudocode, and the like represent various processes which may be substantially represented in computer readable media and so executed by a computer or processor, whether or not such computer or processor is explicitly shown.
The functions of the various elements shown in the figures may be provided through the use of dedicated hardware as well as hardware capable of executing software in association with appropriate software. When provided by a processor, the functions may be provided by a single dedicated processor, by a single shared processor, or by a plurality of individual processors, some of which may be shared. Moreover, explicit use of the term “processor” or “controller” should not be construed to refer exclusively to hardware capable of executing software, and may implicitly include, without limitation, digital signal processor (DSP) hardware, read only memory (ROM) for storing software, random access memory (RAM), and nonvolatile storage.
In the claims hereof, any element expressed as a means and/or module for performing a specified function is intended to encompass any way of performing that function including, for example, a) a combination of circuit elements that performs that function or b) software in any form, including, therefore, firmware, microcode or the like, combined with appropriate circuitry for executing that software to perform the function. It is thus regarded that any means that can provide those functionalities are equivalent to those shown herein.
In addition, it is to be understood that the figures and descriptions of the present disclosure have been simplified to illustrate elements that are relevant for a clear understanding of the present disclosure, while eliminating, for purposes of clarity, many other elements found in typical digital multimedia content delivery methods, devices and systems. However, because such elements are well known in the art, a detailed discussion of such elements is not provided herein. The disclosure herein is directed to all such variations and modifications known to those skilled in the art.
The present disclosure is depicted with regard to a streaming environment to deliver a VR multimedia content (such as a 360° video or a panoramic video) to a client terminal through a delivery network.
As shown in
The client terminal 100—connected to the gateway 200 through a first network N1 (such as a home network or an enterprise network)—may wish to request a VR video stored on a remote server 300 through a second network N2 (such as the Internet network). The first network N1 is connected to the second network N2 thanks to the gateway 200.
The server 300 streams segments to the client terminal 100, upon the client request, using a streaming protocol (such as the HTTP adaptive streaming protocol, so called HAS, like MPEG-DASH).
As shown in the example of
As an example, the client terminal 100 is a portable media device, a mobile phone, a tablet or a laptop, a head mounted device, a set-top box or the like. Naturally, the client terminal 100 might not comprise a complete video player, but only some sub-elements such as the ones for demultiplexing and decoding the media content and might rely upon an external means to display the decoded content to the end user.
As depicted in
It is assumed hereafter that only one tile can be delivered to the terminal 100, such a tile covering a larger part of the scene than the viewport VP associated with the client terminal 100 requesting the VR video 10 (as shown in the example of
It is further assumed that an overlap exists between consecutive tiles of the set of tiles as depicted in the example of
As shown in the example of
According to the present principles, as shown in
The client terminal (e.g. thanks to its streaming controller 103 or processor(s) 105) can obtain, in a step 601, the next position of the viewport VP of the terminal 100—at the expiration of a current tile segment—based on an estimation of the trajectory of the viewport VP.
As shown in the example of
To that end, the trajectory 700 of the center Vi of the viewport VP can, for instance, be obtained by linear approximation thanks to the movement vector {right arrow over (M)} determined, for instance, from intermediate positions Vi (as shown in the example of
Once the trajectory 700 of the viewport VP has been estimated, the viewport position V2 at expiration of the current tile segment (time t2 in the example of
In a step 602, the client terminal 100 (e.g. thanks to its streaming controller 103 or processor(s) 105) can identify one or several tiles 400 amongst the set of tiles comprising at least the viewport VP at the next position V2. Each identified tile (references 400A, 400B, 400C, 400D, 400E in the example of
In a step 603, the client terminal 100 (e.g. thanks to its streaming controller 103 or processor(s) 105) can determine, for each identified tile 400A, 400B, 400C, 400D and 400E, a processing parameter depending, for instance, on the center of the identified tile and the position of the viewport VP at time t2.
In a first embodiment compliant with the present principles, the processing parameter is a scalar product SP of a distance vector {right arrow over (V)} with the movement vector {right arrow over (M)}.
In the example of
For each distance vector to , the following scalar products can be computed:
PA=·{right arrow over (M)}
PB=·{right arrow over (M)}
PC=·{right arrow over (M)}
PD=·{right arrow over (M)}
PE=·{right arrow over (M)}
Once the scalar products associated with the identified tiles have been computed, the client terminal 100 can select, in a step 603, the identified tile for which the corresponding computed scalar product is higher than the scalar products of the other identified tiles. In the example of
In a second embodiment compliant with the present principles, the processing parameter is the distance between the position V2 of the viewport VP at the expiration of the current segment (time t2) and the center of an identified tile.
Then, the client terminal 100 (e.g. thanks to its streaming controller 103 or processor(s) 105) selects (in step 603) the tile for which the associated distance is smaller than the distances computed for the other identified tile. In the example of
In a third embodiment, the tile selection (step 603) combines both first and second embodiments. In particular, the tile selection can be implemented according to the first embodiment except:
In a tile based video streaming (such as adaptive streaming), the present principles can allow to select a tile not only by considering its distance from the current viewport but also by taking the user's movement into account. By selecting a video tile as far as possible in the movement direction, the duration of the recovery between viewport and tile can be improved, reducing the risk of leaving the covered scene area too early. In adaptive streaming, moving from a tile to another one can be achieved only at the beginning of (sub)segment, i.e. at a periodicity of the (sub)segment duration which can vary for a few hundreds of milliseconds to several seconds. Therefore, the process to choose the tile according to the user's aiming point is decisive for the quality of the user experience.
Thanks to the present principles, a user can then be allowed to pan across the VR video without risking the occurrence of a black areas displayed due to a lack of video data.
References disclosed in the description, the claims and the drawings may be provided independently or in any appropriate combination. Features may, where appropriate, be implemented in hardware, software, or a combination of the two.
Reference herein to “one embodiment” or “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment can be included in at least one implementation of the method and device described. The appearances of the phrase “in one embodiment” in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments necessarily mutually exclusive of other embodiments.
Reference numerals appearing in the claims are by way of illustration only and shall have no limiting effect on the scope of the claims.
Although certain embodiments only of the disclosure have been described herein, it will be understood by any person skilled in the art that other modifications, variations, and possibilities of the disclosure are possible. Such modifications, variations and possibilities are therefore to be considered as falling within the spirit and scope of the disclosure and hence forming part of the disclosure as herein described and/or exemplified.
The flowchart and/or block diagrams in the Figures illustrate the configuration, operation and functionality of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, or blocks may be executed in an alternative order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of the blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts, or combinations of special purpose hardware and computer instructions. While not explicitly described, the present embodiments may be employed in any combination or sub-combination.
Number | Date | Country | Kind |
---|---|---|---|
17306267 | Sep 2017 | EP | regional |
Number | Name | Date | Kind |
---|---|---|---|
20130258036 | Tong et al. | Oct 2013 | A1 |
20170118540 | Thomas et al. | Apr 2017 | A1 |
Number | Date | Country |
---|---|---|
2434772 | Mar 2012 | EP |
Entry |
---|
Champel et al., “Key factors for a highh-quality VR experience”, Application of Digital Image Processing XL, SPIE vol. 10396, Sep. 19, 2017, pp. 1-12. |
Number | Date | Country | |
---|---|---|---|
20190098341 A1 | Mar 2019 | US |