The invention relates to a system and method for media synchronization. The invention further relates to a play-out device for use in the system and to data representing a reporting of the play-out device. The invention further relates to a computer program product for causing a processor system to perform the method.
Media content such as video content and audio content is commonly delivered to users in a digital form. If media content has a temporal aspect, and in particular is associated with a timeline which indicates how the media content is to be played out or otherwise processed over time, such digital form is typically referred to as a media stream. Examples of media streams include video streams such as camera-recorded or computer-rendered streams, audio streams such as microphone-recorded streams, timed text streams such as subtitle streams or social-media streams, timed events streams which show an advertisement image or perform an action at a receiver, and multimedia streams comprising different types of media streams.
There may be a need to synchronize the play-out of a media stream to a reference play-out. For example, a group of soccer fans may be located at different locations but may like to watch together a soccer match by streaming the match online from one or more media servers. During the soccer match, the soccer fans may then communicate with each other, e.g., using chat or voice services. In such a case, it may be noticeable and even detrimental to the shared experience of the soccer match if the play-out at the different locations is asynchronous. Namely, a user of which the play-out is running ahead may spoil the fun of other users of which the play-out is running behind, e.g., by shouting ‘Gooooaal’ and thereby giving away the imminent goal. As such, it may be desirable to synchronize the play-out of the soccer match across the different locations to a common target, such as a reference play-out timestamp.
In general, the reference play-out to which the play-out of a media stream is to be synchronized may take various forms, including but not limited to: a) the play-out of the media stream at one or more further destinations, b) the play-out of one or more further media streams, and c) clock information indicating a reference timing of the play-out of the media stream. These use-cases are known per se from the field of media synchronization, and also referred to as a) inter-destination synchronization, b) inter-stream synchronization and c) intra-stream synchronization, respectively. Together with related synchronization use-cases and techniques, these synchronization use-cases will be in short referred to as media synchronization.
For example, inter-destination media synchronization may involve carrying out a synchronization between play-out of content at different destinations. Here, the play-out at one destination may constitute a reference for the play-out at another destination. With inter-stream synchronization, the play-out of one media stream, e.g., a video stream, may constitute a reference for the play-out of another media stream, e.g., an audio stream, or vice versa, e.g., to achieve lip synchronization. With intra-stream synchronization, the original timing of a recording, e.g., 25 frames per second or 44 kHz audio, may constitute a reference for the play-out of a media stream in time. With event-based synchronization, an original timing of an event, e.g., the timing of questions to an audience in a play-along quiz show, may constitute a reference for an actual presentation of such events during the broadcast of such a quiz show.
It is noted that media synchronization may be carried out to a level where the play-out of a media stream is fully synchronized with a reference play-out, or it may at least be intended to carry out the synchronization to said level. However, it may also be desired to perform synchronization to a level other than said full synchronization. For example, if it is known that a video frame buffer causes a 25 ms delay further onward in a media processing chain, it may be desirable to synchronize a video stream and an audio stream to the level of the video stream being ahead of the audio stream by 25 ms. Accordingly, media synchronization may purposefully establish a predetermined time delay to establish a target level of synchrony.
Media synchronization may involve performing one or more synchronization actions. For example, if the play-out of the media stream by the play-out device is ahead of the reference play-out, the former play-out may need to be delayed. Conversely, if the play-out of the media stream by the play-out device is behind the reference play-out, the former play-out may need to be advanced. Synchronization actions may take various forms, including seeking forward or backward in the media stream (also referred to as ‘jumping’) and temporarily adjusting the play-out rate (also referred to as ‘adaptive media play-out’). Synchronization actions may be performed by the play-out device, but also by another entity, e.g., a stream source or network buffer.
Disadvantageously, there may be a structural deficiency in the obtaining of synchronicity between the play-out of the media stream by the play-out device and the reference play-out. Here, the term ‘structural deficiency’ may refer to a limitation in the obtaining of the synchronicity which is not addressed by performing a synchronization action once, or by adjusting the common target in the media synchronization. Such a limitation may be associated with the play-out device, or with the delivery of the media stream to the play-out device. Namely, the limitation may be such that the synchronization action cannot be performed, that the synchronization action only establishes an insufficient degree of synchronicity, or that a sufficient degree of synchronicity is only established temporarily before reverting to an insufficient degree.
The existence of a structural deficiency in the obtaining of synchronicity with a reference play-out may trigger synchronization actions which in turn may cause frequent and/or large play-out distortions. Here, the term ‘play-out distortion’ refers to an irregularity in the play-out caused by the synchronization action, resulting in a deviation from the regular, typically continuous, play-out of the media stream at the regular play-out rate. For example, the user may notice the play-out of the media stream frequently pausing as a consequence of a structural deficiency in the obtaining of synchronicity, e.g., in case the play-out is structurally running ahead of the reference play-out. The structural deficiency may also increase the delays involved in starting up streaming of a media stream, or when changing to another media stream. The structural deficiency may also result in sufficient synchronicity not being established at all. This may be the case when a buffer associated with the play-out device is of an insufficient size to accommodate the buffering required by the synchronization actions. Accordingly, it may not be possible to sufficiently pause the play-out of the media stream to allow the reference play-out to catch up.
It would be advantageous to obtain a system or method which addresses a structural deficiency in the obtaining of a degree of synchronicity between the play-out of a media stream by a play-out device and a reference play-out out.
The following aspects of the invention involve a play-out device reporting a quality of sync metric to a metric subsystem, which may then be analyzed to identify the structural deficiency. A corrective action may then be scheduled to address the structural deficiency. Examples of corrective actions include, but are not limited to, re-clustering of play-out devices, adjusting priority and/or bandwidth in the streaming, etc. As such, the quality of experience of one or more users experiencing the play-out may be improved. The invention is applicable to various forms of media synchronization, such as inter-destination synchronization and/or inter-media synchronization.
A first aspect of the invention may provide a system for media synchronization, the system comprising:
A further aspect of the invention may provide the play-out device.
A further aspect of the invention may provide data representing the quality of sync metric reported by the play-out device.
A further aspect of the invention may provide a method for media synchronization in a system comprising a play-out device for play-out of a media stream, wherein the play-out of the media stream by the play-out device is synchronized with a reference play-out, thereby obtaining a degree of synchronicity with the reference play-out, the method comprising,
in the play-out device:
and in a metric system:
A further aspect of the invention may provide a computer program product comprising instructions for causing a processor system to perform the method.
Embodiments are defined in the dependent claims.
The above measures involve a play-out device which may render a media stream to a user, e.g., by rendering an audio/video component of the media stream, by rendering a quiz question represented by an event stream, etc. A non-limiting example may be that a user may be provided with a display of a video stream on a display.
A synchronization subsystem may be provided which may, at least to a certain degree, synchronize the play-out of the media stream by the play-out device to a reference play-out. Such synchronization may involve one or more synchronization actions being performed. It will be appreciated that depending on the type of media synchronization, the reference play-out to which the play-out is synchronized may take various forms, including play-out of the media stream at a further destination (inter-destination synchronization), play-out of a further media stream (inter-stream synchronization), and clock information indicating a reference timing of the play-out of the media stream (intra-stream synchronization). A non-limiting example may be that the play-out of the media stream may be synchronized across multiple play-out devices, with the reference play-out being represented by timing information provided by the synchronization subsystem defining a common target for the play-out devices. An example of a common target is a reference playback timestamp. It will be appreciated and further elucidated that the location of the synchronization subsystem, as well as the possible synchronization action(s) being performed, may take various forms, as are known per se from the technical field of media synchronization.
The play-out device may, during the play-out of the media stream, determine a quality of sync metric which may be indicative of a structural deficiency in the obtaining of the synchronicity with the reference play-out. As such, data may be obtained which directly or indirectly identifies the structural deficiency. The data may be externally reported by the play-out subsystem and received by a metric subsystem which may then analyse the quality of sync metric to identify the structural deficiency. A non-limiting example is that if the quality of sync metric is constituted by a plurality of values associated with the play-out of the media stream, the metric subsystem may employ heuristics to explicitly identify the structural deficiency from the reported values. Having identified the structural deficiency, the metric subsystem may then schedule a corrective action which may be known or expected to address the structural deficiency. Here, the term ‘corrective action’ may refer to an action which, contrary to a synchronization action, structurally addresses a limitation associated with the play-out device, or with the delivery of the media stream to the play-out device.
The inventors have recognized that a structural deficiency in the obtaining of a sufficient degree of synchronicity between the play-out of the media stream by the play-out device and the reference play-out may be observable to one or more users. For example, the structural deficiency may result in frequent buffering, skipping or pausing of the play-out of the media stream. Another example is that an insufficient degree of synchronicity may be directly observable by the user(s), e.g., when experiencing the play-out on a television and a ‘second screen’ such as a tablet device or smartphone. As such, the structural deficiency in the obtaining of synchronicity may cause the user(s) experiencing the play-out of the media stream to have an insufficient quality of experience despite or in consequence of synchronization actions taking place. By being indicative of the structural deficiency, the reported quality of sync metric may at the same time be indicative of the quality of experience obtained by the user(s). Accordingly, a corrective action may be scheduled which addresses the structural deficiency and thereby an insufficient quality of experience. Compared to the mere (repeated) performing of synchronization actions, an advantage may be that fewer synchronization actions may be needed after performing the corrective action. Since a synchronization action typically results in a play-out distortion in the play-out, the corrective action may result in fewer or less noticeable play-out distortions. In fact, it may not even be possible to perform a synchronization action in view of the structural deficiency, but upon determining the structural deficiency, a corrective action may be possible. A non-limiting example may be that an insufficient buffer size may prevent the play-out of a broadcast stream being paused for a sufficient time so as to allow the play-out of a corresponding HTTP-based stream by another play-out device to ‘catch up’. However, it may be possible to take a corrective action, e.g., by increasing the bandwidth of the HTTP-based streaming to the other play-out device. Accordingly, it may not be needed anymore to pause the play-out of the broadcast stream for a prolonged time. Namely, the increase in bandwidth may enable the HTTP-based stream to ‘catch up’. It will be appreciated that various other types of corrective actions may be taken so as to address different types of structural deficiencies.
In an embodiment, the play-out device may be configured for determining the quality of sync metric based on:
The term ‘playback operation’ may refer to an operation associated with the play-out which is initiated by the play-out device rather than by a user. Examples of playback operations initiated by the play-out device may include the following: a pausing of the play-out, a stopping of the play-out, a resuming of the play-out, a starting of the play-out, a seeking in the media stream, a change in speed of the play-out, a frame drop during the play-out, and a representation switch of the media stream in adaptive streaming. The inventors have recognized that one or more occurrences of a playback operation may be indicative of a structural deficiency in the obtaining of the synchronicity. Likewise, the degree of synchronicity obtained by the play-out device with the reference play-out may be indicative of such a structural deficiency. Accordingly, the quality of sync metric may be generated based on a numerical measure of the degree of synchronicity. A non-limiting example may be that a difference between a current playback timestamp of the play-out device and a reference playback timestamp provided by the synchronization subsystem may be reported, or that a mean and variance of the difference may be reported.
In an embodiment, the play-out device may be configured for i) when performing the playback operation, registering a reason for performing the playback operation, thereby obtaining a registered reason, and ii) determining the quality of sync metric further based on the registered reason. There may be a reason for the play-out device initiating the playback operation. For example, the play-out may be stopped by the play-out device in response to, e.g., the media stream ending or a re-buffering. The latter may be indicative of a structural deficiency in obtaining synchronous play-out, whereas this may not need to be the case for the former. By determining the quality of sync metric further based on the registered reason, the quality of sync metric may be generated to be more accurately indicative of the structural deficiency.
In an embodiment, the play-out device may be configured for formatting the quality of sync metric to indicate at least one of:
The data reported by the play-out device may thus indicate the abovementioned information, e.g., in a textual or non-textual data format. The indication of the time period to which the quality of sync metric pertains may be of relevance in a periodic reporting of the quality of sync metric in that it may enable the reporting of the quality of sync metric to be distinguished from earlier and/or later reports. The time period may also provide a context to the number of occurrences of a type of playback operation. The number of occurrences may be indicative of a structural deficiency in obtaining synchronous play-out. For example, numerous occurrences of pausing and resuming may be indicative of an insufficient priority and/or bandwidth in the streaming of the media stream. Moreover, the claimed difference, or its mean and variance, may represent a numerical measure of the degree of synchronicity obtained by the play-out device with the reference play-out, and may thereby be indicative of a structural deficiency in obtaining synchronous play-out.
In an embodiment, the play-out device may be part of one or more play-out devices for play-out of the media stream, and the corrective action may be an adjustment in at least one of:
The play-out device may be part of a group of one or more play-out devices playing-out the media stream. The corrective action scheduled by the metric subsystem may adjust a delivery of the media stream to at least one of the one or more play-out devices. As such, the adjustment may be in what is being delivered, or how the delivery is performed. Additionally or alternatively, the corrective action may adjust a boundary condition associated with the synchronizing of the play-out of the media stream by the play-out device with the reference play-out. Both types of corrective actions are well suited for addressing structural deficiencies in obtaining synchronous play-out.
In an embodiment, the adjustment in the delivery of the media stream may comprise at least one of:
The structural deficiency may be associated with the delivery of a particular media stream and/or the processing of the particular media stream by the play-out device. By switching to a different media stream, this type of structural deficiency may be addressed. The structural deficiency may also be associated with a stream source, in that the stream source may not be able to deliver or reliably deliver the media stream. By switching to a different stream source for the same media stream, this type of structural deficiency may be addressed. The structural efficiency may also be associated with an insufficient priority and/or bandwidth allocated as part of the streaming of the media stream. By adjusting the priority and/or bandwidth being allocated to the media stream, this type of structural deficiency may be addressed. The structural deficiency may also be associated with a too early or too late delivery of the media stream, causing the play-out device not being able to obtain a sufficient degree of synchronicity with the reference play-out. By adjusting the timing of the delivery of the media stream, this type of structural deficiency may be addressed.
In an embodiment, the adjustment in the boundary condition may comprise at least one of:
In case of there being multiple play-out devices, a clustering or re-clustering of play-out devices in two or more clusters to which the media synchronization is separately applied may result in a more homogenous quality of sync than would have been obtained when applying the media synchronization to a single cluster comprising all of the play-out devices. For example, play-out devices having a high latency with respect to a stream source, and thus needing a long time to perform, e.g., a program or channel switch, may result in play-out devices having a low latency to also take a long time to perform the program or channel switch in order to maintain a sufficient degree of synchronicity. By clustering all ‘high-latency’ play-out devices in a separate cluster from the low-latency′ play-out devices, a more homogenous quality of sync may be obtained in each cluster individually and thereby an overall higher quality of experience. It is noted that clustering based on latency is merely an example and that there are also various other suitable clustering criteria, such as Quality of Service parameters known per se within the field of telecommunications. Additionally or alternatively, the synchronization threshold in at least one of the one or more play-out devices may be adjusted. For example, if a low quality of sync is reported, the synchronization threshold may be adjusted to allow a larger deviation from the reference play-out, thereby reducing the frequency of the synchronization actions being performed. Conversely, if a high quality of sync is reported, the synchronization threshold may be adjusted to maintain a smaller deviation from the reference play-out.
In an embodiment, the metric subsystem may be configured for effecting the corrective action by sending an instruction indicative of the adjustment to be performed to at least one of: at least one of the one or more play-out devices, a stream source providing the media stream, the synchronization subsystem, and a network entity involved in the streaming of the media stream to at least one of the one or more play-out devices. The corrective action may be scheduled by the metric subsystem but performed by another entity. For that purpose, the metric subsystem may send an instruction indicative of the adjustment to be performed to the respective entity.
In an embodiment, the metric subsystem may be configured for scheduling the corrective action in response to the quality of sync metric falling below a predetermined quality threshold. The quality of sync metric may be reported or representable as a numerical value. The corrective action may be scheduled when the numerical value falls below a particular threshold. The particular threshold may represent, e.g., the quality of experience being deemed to become insufficient. As such, the metric subsystem may timely schedule a corrective action.
In an embodiment, communication between the play-out device and the metric subsystem may take place in accordance with a client-server communication scheme, and the play-out device may represent the client and the metric subsystem may represent the server in the client-server communication scheme.
It will be appreciated by those skilled in the art that two or more of the above-mentioned embodiments, implementations, and/or aspects of the invention may be combined in any way deemed useful.
Modifications and variations of the play-out device, the data representing the quality of sync metric, the method and/or the computer program product, which correspond to the described modifications and variations of the system, can be carried out by a person skilled in the art on the basis of the present description.
The invention is defined in the independent claims. Advantageous yet optional embodiments are defined in the dependent claims.
These and other aspects of the invention are apparent from and will be elucidated with reference to the embodiments described hereinafter. In the drawings,
It should be noted that items which have the same reference numbers in different Figures, have the same structural features and the same functions, or are the same signals. Where the function and/or structure of such an item has been explained, there is no necessity for repeated explanation thereof in the detailed description.
The following list of reference numbers is provided for facilitating the interpretation of the drawings and shall not be construed as limiting the claims.
The play-out of a media stream by a play-out device may be synchronized by a synchronization subsystem with a reference play-out, thereby obtaining a degree of synchronicity with the reference play-out. To address a structural deficiency in the obtaining of the synchronicity with the reference play-out, the play-out device may report a quality of sync metric to a metric subsystem, which may then be analysed to identify the structural deficiency. A corrective action may then be scheduled to address the structural deficiency. Examples of corrective actions include re-clustering of play-out devices, adjusting priority and/or bandwidth in the streaming, etc. As such, the quality of experience of one or more users experiencing the play-out may be improved.
An embodiment of a system comprising the play-out device and the metric subsystem will be described with reference to
As will be further elucidated with reference to
The system may further comprise a metric subsystem 140 for receiving the quality of sync metric from the play-out device, analysing the quality of sync metric to identify the structural deficiency, and scheduling a corrective action to address the structural deficiency. In the example of
The quality of sync metrics as reported by the play-out devices may be stored as data 032 in a metrics database 030. Conversely, data 034 may be retrieved from the metrics database 030, e.g., for analysis or determining the corrective action.
The metric subsystem 140 may be configured for scheduling the corrective action in accordance with a predefined criterion. For example, the corrective action may be scheduled in response to the quality of sync metric, or a measure derived therefrom, falling below a predetermined quality threshold. As will be elucidated with further reference to
It is noted that the communication between the play-out device(s) 101-103 and the metric subsystem 140 may take place in accordance with a client-server communication scheme. Herein, a play-out device may represent the client and the metric subsystem 140 may represent the server in the client-server communication scheme. Accordingly, the play-out device(s) 101-103 may also be referred to as ‘client’ in the following. It is further noted that in the example of
An example of a quality of sync metric may be the following. Here, the quality of sync metric is defined as an extension of an existing metric, namely of a playlist metric defined within MPEG—Dynamic Adaptive Streaming over HTTP (DASH), e.g., in ISO/IEC 23009-1:2014. The playlist metric is an example of a metric that a DASH client may be asked to report. Although the reporting mechanism is not defined within ISO/IEC 23009-1:2014, the metrics are. The definition of the playlist metric is user-centric as the definition specifies: “A playback period is the time interval between a user action and whichever occurs soonest of the next user action, the end of playback or a failure that stops playback”, referring to a user-initiated operation performed by the play-out device. The playlist metric may be modified so as to additionally or alternatively serve as quality of sync metric. Below, underlined text represents a modification of the playlist metric defined in ISO/IEC 23009-1:2014. Keys of the playlist metric which have not been modified have been omitted below.
or a player action and whichever
the next player action, the end of playback,
Actiontype
Enum
Type of action that initiated the play-out:
user action
player action
behind the IDMS reference playback
timestamp
ahead of the IDMS reference playback
timestamp
Additionally or alternatively to the quality of sync metric being determined based on an occurrence of one or more playback operations performed by the play-out device during play-out of the media stream, the quality of sync metric may be determined based on a numerical measure of the degree of synchronicity obtained by the play-out device with the reference play-out. An example of such a numerical measure is a difference, or a mean and a variance of the difference, between a current playback timestamp of the play-out device and a reference playback timestamp provided by the synchronization subsystem. Such a difference may also be referred to as ‘skew’. Accordingly, the data 110 shown in
In an embodiment, the quality of sync metric may be as follows:
By reporting the skew, or the mean and variance of the skew, between a reference and actual playback timestamp, the metric subsystem may determine an existence of a structural deficiency in the obtaining of the synchronicity with the reference play-out. This is graphically illustrated in
It is noted that the quality of sync metric may combine the occurrence of one or more playback operations performed by the play-out device during play-out of the media stream with a numerical measure of the degree of synchronicity obtained by the play-out device with the reference play-out. Effectively, the quality of sync metric may be comprised of two parts, e.g., two ‘sub-metrics’. Here, the reporting of occurrences of playback operations may be deemed to be a ‘subjective’ metric, in that it may reflect a consequences of structural difficulties encountered by the play-out in obtaining synchronicity with the reference play-out, which may decrease the quality of experience of the user(s) experiencing the play-out. The reporting of the numerical measure of the degree of synchronicity may be termed an ‘objective’ metric in that it may directly represent the degree of synchronicity obtained by the play-out device.
1. Measuring the play-out timing of the media streams at clients within a synchronization cluster, and communicating this timing to the other participants or to a centralised synchronization server.
2. Determining the relative play-out timing between clients, and in general determining the slowest clients. In case of a central synchronization server, this information may be communicated as synchronization instructions to all clients.
3. Delaying of the play-out at all clients to match the slowest client, to synchronise the play-out across all clients.
These steps may be performed in various architectures, including but not limited to A) Sync Maestro Scheme, where all clients may report about their play-out to a single server, and the single server may distribute instructions to all clients, B) Distributed Control Scheme, where all clients may report about their play-out to all other clients, and each client may autonomously make synchronization decisions, and C) Master/Slave scheme, where one client is designated as master client which reports about its play-out to all other clients, who must adhere to the master's play-out timing.
The inventors have recognized that due to fast technical developments, the characteristics of the clients to be synchronised are getting more and more diverse in terms of screen capabilities, available bandwidth, computation power, networking capabilities, etc. As a result, the quality of the experience that the users perceive in a synchronised media session may be poor for various reasons.
For example, the slowest client may contribute to lowering the quality of experience not only of its user but of everyone in the session. As is known from the technical field of media synchronization, the reference point in time where the clients of the same session have to adjust to may be determined by the position of the slowest client. As such, when watching live content, the slowest client may force all other clients in, e.g., an IDMS cluster to slow down. The extent of this slow-down may be such that the delay compared to the live edge may become unacceptably large.
Moreover, if there are large play-out differences between clients, a significant amount of buffering may be needed. For example, when changing channels during a synchronised session, channel changing delays may increase due to this buffering (if synchronised channel change is desired along the synchronised viewing), which may lead to excessively large channel changing delays. Also, the start-up delay for a synchronization session may be large if the play-out differences are large. In this respect, it is noted that typical delay differences for TV broadcasts may range from a few seconds delay differences to over one minute when including internet broadcasts.
Yet another reason for a poor quality of experience may be that due to, e.g., clock drift at some of the clients, many synchronization actions may be carried out, resulting in, e.g., continuous skipping ahead or pausing of the play-out to synchronise the play-out across clients. This may also hold for varying network circumstances. For example, network congestion may lead to buffer underruns and thus play-out discontinuities for one client, which may disturb the play-out of all clients in the session.
These synchronization actions may be visible to the user(s), and thus negatively impact their quality of experience. Artefacts such as the play-out temporarily pausing, jumping to another part of the media stream, or playing faster or slower than normal, or large waiting times during a channel change, are all examples of visible synchronization actions. It may thus be a problem to carry out IDMS with a high level of quality of experience for the user(s).
In particular,
The quality of sync analysis unit 140B may then evaluate the Quality of Synchronization for a subset or all of the clients based on the reported data. For example, the reported quality of sync metrics may be considered as ‘raw’ data from which the overall Quality of Synchronization may be computed, e.g., based on the number of player-initiated playback pauses or skipping events per minutes, etc. In case the Quality of Synchronization is deemed unsatisfactory, the session management unit 140C may then schedule a corrective action to be taken. Different types of corrective actions may be taken so as to address different types of structural deficiencies in the obtaining of the synchronicity with the reference play-out, which may thereby improve the Quality of Synchronization. It is noted that corrective actions differ from synchronization actions, in that they specifically address a structural deficiency in the obtaining of the synchronicity. An example of a corrective action is an adjustment in a delivery of the media stream to at least one of the one or more play-out devices. Another example is that the corrective action may comprise an adjustment in a boundary condition associated with the synchronizing of the play-out of the media stream by the play-out device with the reference play-out.
Accordingly, the metric subsystem may perform as corrective action a clustering or re-clustering of the play-out devices 101-104 in two or more clusters 160B, 160C to which the media synchronization may be separately applied to obtain a more homogenous quality of sync than would have been obtained when applying the media synchronization to the single cluster 160A comprising all of the play-out devices.
Such corrective action may involve the following: the play-out devices 101-104 may be separated and put together in separate clusters 160B, 160C. The clustering may be based on similarities which appear from the reported data. For example, ‘fast’ clients may be put together in a first cluster while ‘slow’ clients may be put together in second cluster. After such re-clustering, the Quality of Synchronization obtained in each of the clusters 160B, 160C may be greater than the one provided previously by the single cluster 160A, e.g., resulting in a ‘Syncness’ value of 5 instead of 2. Accordingly, a higher quality of experience may be obtained for the users of the play-out devices 101-104 in both clusters, e.g., by there being less buffering time for the fastest play-out devices 103-104 and thus there being less “wait” messages for the fastest play-out devices 103-104, the fastest ones being closer to the live edge, etc.
Within the context of MPEG-DASH, a workflow may be the following (here, further reference is made to units of the metric subsystem 140 shown in
1. The session management unit 140C, acting as or representing an IDMS session manager, may communicate Session Information to clients. The Session Information may be part of a Media Presentation Description (MPD). It may comprise the quality of sync (QoSync) and quality of service (QoS) metrics to be reported by the clients, and identify the way and to whom the clients have to report said metrics. For example, the Session Information may identify the metrics server 140A.
2. The clients may join the session, e.g., by a client sending a “Join” message to the IDMS session manager to signal its participation.
3. The clients may start retrieving the media content, e.g., a media stream.
4. The clients may report the quality of sync and quality of service metrics to the metrics server 140A.
5. The quality of sync analysis unit 140B may determine whether some clients experience problems obtaining synchronicity with the reference playback timestamp, and in particular, whether there appears to exist a structural deficiency.
6. For example, clients A and B may report poor QoSync and may lag behind the reference playback timestamp. As such, the users of clients A and B may have a poor quality of experience. Clients C, D and F may report good QoSync metrics.
7. Because the session offers live content, the IDMS session manager may attempt to maintain the reference playback as close as possible to the live edge.
8. The IDMS session manager may therefore create a new session for clients A and B and send new Session Information to them. Additionally, the IDMS session manager may have verified that clients A and B have similar quality of service which increases the chance for them to synchronise together.
9. Clients A and B may join the new session and start synchronising.
10. Clients A and B may report good quality of sync metrics
11. Now two IDMS sessions are open, each representing a different cluster of clients. Session 1 comprising clients C, D and F may be the closest one to the live edge while Session 2 comprising clients A and B may be few seconds behind.
In the above example, the synchronization subsystem 120 is preferably aware of the availability of alternative streams and their (general) output timing. This may be the case if the synchronization subsystem 120 is co-located with a media server, or if the media server sends information about the availability and output timing to the synchronization subsystem 120. Also, in case of live content, the synchronization subsystem 120 may learn about the availability and output timing from reports of other clients. The synchronization subsystem 120 is preferably further aware of which streams offer the same content. Note that the instructions may also be general instructions to ‘switch’. A client, upon receiving these instructions, may know if alternative streams are available, and may then switch. This may lead to an improvement of the quality of sync, or it may not. This way, quality of sync may be improved in a trial-and-error manner.
As shown in
Other examples of adjusting the priority or bandwidth of the delivery of the media stream may be the following. For example, in a managed network, a 3GPP (3rd Generation Partnership Project) PCC (Policy Control & Charging) or RACS (Resource & Admission Control Subsystem) architecture may be used to guarantee bandwidth for certain streams. The same may be accomplished using IntServ (Integrated services). Also, in the example of
Note that there may be different ways of adjusting a synchronization threshold. In the example of
It is noted that delaying the delivery of a media stream by 10 seconds typically constitutes a corrective action which is visible for client 101. As such, the performing of the corrective action may be weighed against other aspects of synchronization quality. In this case, the synchronization algorithm may determine that the additional delay for clients 102 and 103 may have a higher impact on the quality of synchronization than the delay action for client 101. It is also noted that it may be the streaming source 010 that performs the delay action, but it may also be another entity.
The present invention may have been described within the context of one or more media streams being streamed. However, such media streams may also be accessed by a play-out device in a non-streaming manner, e.g., from a file.
The following scenario exemplifies an advantageous use of the present invention. An overseas Dutch fan group may gather online to enjoy the 2016 Euro cup finale which opposes the Netherlands and the host country, France. The group may consist of 20 users that are logged into a streaming service. An IDMS cluster may then be created for these 20 users. Before the game starts, the users may be invited to switch on the video feed to start synchronized play-out among the group members. Because the fans are scattered all over the world the quality of synchronization of the various users may be quite different. It may turn out that there are too many resynchronization events that occur to keep the group in sync which ultimately leads to a poor quality of experience. As a result, the service may create two new clusters where clients with similar quality of sync metrics are migrated to. It will be appreciated that in general, quality of sync metrics as reported in the past may be used to preemptively schedule a corrective action. For example, if a certain client always had a fast connection, the client is likely to have one again during a current session. Such historical data may be updated with new measurements during the session. It is noted that the present invention is applicable to various forms of media synchronization, such as to, e.g., for inter-media synchronization in an HbbTV setting. An example is synchronizing an auxiliary stream delivered over the internet containing a sign language interpreter with a main stream representing a regular TV broadcast.
It will be appreciated that a method according to the invention may be implemented in the form of a computer program which comprises instructions for causing a processor system to perform the method. The method may also be implemented in dedicated hardware, or as a combination of the above.
The computer program may be stored in a non-transitory manner on a computer readable medium. Said non-transitory storing may comprise providing a series of machine readable physical marks and/or a series of elements having different electrical, e.g., magnetic, or optical properties or values.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design many alternative embodiments.
In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. Use of the verb “comprise” and its conjugations does not exclude the presence of elements or steps other than those stated in a claim. The article “a” or “an” preceding an element does not exclude the presence of a plurality of such elements. The invention may be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the device claim enumerating several means, several of these means may be embodied by one and the same item of hardware. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage.
Number | Date | Country | Kind |
---|---|---|---|
14199743.7 | Dec 2014 | EP | regional |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2015/079590 | 12/14/2015 | WO | 00 |