The present invention relates generally to adaptive streaming techniques, and more particularly to techniques for requesting chunks for adaptive streaming applications.
HTTP (Hypertext Transfer Protocol) Adaptive Streaming is a technique used to stream multimedia over a computer network, such as a computer network employing TCP (Transmission Control Protocol) connections. Current HTTP Adaptive Streaming client implementations may not fully utilize the available throughput of TCP connections. Thus, the client may select a bandwidth level for a given TCP connection that is lower than necessary (in turn leading to a reduced video quality). For each TCP connection, a “congestion window” limits the total number of unacknowledged packets that may be in transit. The size of the congestion window is determined by a “slow start” phase (also referred to as an exponential growth phase) and a “congestion avoidance” phase (also referred to as a linear growth phase).
During the exponential growth phase, the slow start mechanism increases the size of the congestion window each time an acknowledgment is received. The window size is increased by the number of segments that are acknowledged. The window size is increased until either an acknowledgment is not received for a given segment (e.g., a segment is lost) or a predetermined threshold value is reached. If a segment is lost, TCP assumes that the loss is due to network congestion and attempts to reduce the load on the network. Once a loss event has occurred (or the threshold has been reached), TCP enters the linear growth phase. During the linear growth phase, the window is increased by one segment for each round trip time (RTT), until a loss event occurs.
Microsoft Silverlight™ is an application framework for writing and running Internet applications. The HTTP Adaptive Streaming client, for example, within Microsoft Silverlight opens two persistent TCP connections at the beginning of a streaming session. The client uses both connections to request audio and video chunks, sometimes simultaneously. The client may switch between the two connections to request either audio or video chunks. A new chunk is not requested until the client has fully received the previously requested chunk. The client may also introduce gaps in between chunk requests in order to prevent its internal buffer from overflowing. Thus, the HTTP Adaptive Streaming client within Microsoft Silverlight may not fully utilize the available throughput because of undesired interactions at the TCP layer that lead to “TCP Slow Start” or “TCP Congestion Avoidance” (or both).
A need therefore exists for a mechanism to increase the data throughput between an HTTP Adaptive Streaming client and server.
Generally, a chunk request scheduler is provided for HTTP adaptive streaming. According to one aspect of the invention, requests for media chunks (e.g., audio and/or video chunks) are scheduled over a network by requesting the media chunks over the network using at least one connection; storing the media chunks in at least one buffer; monitoring a level of the at least one buffer; and selectively switching between at least two predefined download strategies for the requesting step based on the buffer level.
A number of exemplary download strategies are addressed. For example, the chunk request scheduler can switch between a download strategy of scheduling audio and video chunk requests over multiple TCP connections and a download strategy of scheduling audio and video chunk requests over a single TCP connection. In addition, the chunk request scheduler can switch between a download strategy of pipelining multiple audio and video chunk requests over a single TCP connection and a download strategy of pipelining single audio and video chunk requests over a single TCP connection.
In yet another variation, the chunk request scheduler can switch between a download strategy of pipelining multiple audio and video chunk requests over a single TCP connection and a download strategy of sequentially scheduling audio and video chunk requests over a single TCP connection. In addition, the chunk request scheduler can switch between a download strategy of requesting audio and video chunk requests over one or more TCP connections and a download strategy of waiting to schedule N chunk requests until a predefined lower buffer threshold is satisfied.
According to another aspect of the invention, requests for media chunks (e.g., audio and/or video chunks) are scheduled over a network by obtaining an ordering of the plurality of connections based on a rate of each of the plurality of connections; storing the media chunks in at least one buffer; and requesting the media chunks over the ordered plurality of connections based on a size of the media chunks. For example, audio chunk requests can be scheduled over or more TCP connections having a lower rate order and video chunk requests can be scheduled over TCP connections having a higher rate order.
A more complete understanding of the present invention, as well as further features and advantages of the present invention, will be obtained by reference to the following detailed description and drawings.
According to one aspect of the invention, data throughput between the HTTP Adaptive Streaming client 120 and the server 180 is increased through efficient scheduling of chunk downloads over one or more TCP connections through the network 100. As a result, the HTTP Adaptive Streaming client 120 may be able to select a higher video quality level or reduce the number of quality oscillations during the video playback.
According to another aspect of the invention, a Chunk Request Scheduler 130 is provided for HTTP Adaptive Streaming clients 120. The disclosed Chunk Request Scheduler 130 improves the scheduling of audio and video chunk requests over one or more TCP connections. The Chunk Request Scheduler 130 opens and maintains one or more TCP connections between the HTTP Adaptive Streaming client 120 and server 180. The Chunk Request Scheduler 130 accepts requests for audio and video chunks from the HTTP Adaptive Streaming client 120 and schedules them over the opened TCP connections. The Chunk Request Scheduler 130 can optionally be integrated with the HTTP Adaptive Streaming client 120 in order to have access to internal variables, such as those related to the rate determination algorithm of the HTTP Adaptive Streaming client 120.
As discussed hereinafter, the exemplary Chunk Request Scheduler 130 may follow different scheduling strategies to maximize the data throughput between the server 180 and the client 120. In one exemplary embodiment, the Chunk Request Scheduler 130 can dynamically switch between a plurality of strategies, for example, based on observed network conditions.
The exemplary adaptive streaming client 120 typically employs one or more buffers 140 to store downloaded chunks, in a known manner.
The exemplary adaptive streaming client 120 may be implemented as a media player executing, for example, on a general purpose computer. The media player may be implemented, for example, using the Microsoft Silverlight™ application framework, as modified herein to provide the features and functions of the present invention. In an alternate implementation, the exemplary adaptive streaming client 120 may be implemented as a media player executing, for example, on dedicated hardware, such as a set-top terminal (STT).
The exemplary Chunk Request Scheduler 130 may pursue any of the following scheduling strategies below (or a combination thereof):
Interleaving Video and Audio Chunk Requests
The Chunk Request Scheduler 130 can optionally interleave video and audio chunk requests by inserting audio chunk requests in between video chunk request such that any gap that the client 120 would otherwise introduce in between video chunk requests is filled by one or more audio chunk requests.
Pipelining Chunk Requests
The Chunk Request Scheduler 130 can optionally pipeline chunk requests by scheduling chunk requests back-to-back such that there is always at least one outstanding chunk request.
Sending Chunk Requests Over Multiple Connections
The Chunk Request Scheduler 130 can optionally open several TCP connections and simultaneously sends chunk requests (or requests for partial chunks) over multiple connections in order to reduce the impact of TCP congestion events on any single TCP connection during a chunk download. This strategy may also help in cases where the initial TCP receive window is set to a small value.
Chunk Request Scheduler Implementations
A test is performed during step 330 to determine if the buffer 140 is full. If it is determined during step 330 that the buffer 140 is full, then the Chunk Request Scheduler 130 switches to using a single TCP connection during step 340 and starts interleaving requests for audio and video chunks in order to minimize request gaps. If, however, it is determined during step 330 that the buffer 140 is not full, then the Chunk Request Scheduler 130 continues to schedule audio and video chunk requests over multiple TCP connections during step 320, until the buffer is full.
A test is performed during step 420 to determine if the buffer 140 is full. If it is determined during step 420 that the buffer 140 is not full, then the exemplary Chunk Request Scheduler 130 continues to pipeline multiple audio and video chunk requests over a single connection during step 410 until the buffer 140 is full.
If, however, it is determined during step 420 that the buffer 140 is full, then the Chunk Request Scheduler 130 pipelines single audio and video chunk requests over a single connection during step 430. In this manner, the Chunk Request Scheduler 130 can interleave audio and video chunk requests, but multiple outstanding audio chunk requests or multiple outstanding video chunk requests are not permitted at a given time.
A further test is performed during step 440 to determine if there is a sudden change in the download bandwidth (either up or down). Generally, when multiple chunk requests are pipelined, it reduces the client's reaction time to changes in bandwidth. As long as the bandwidth is slowly varying, it is not a problem but if there are sudden changes, it is advantageous to switch to single chunk requests so that the client can react faster to the changes. It is more important that the client do it when there is a drop in bandwidth so that its buffer isn't starved, but the strategy can also be applied when conditions improve to allow faster increases in bit rate.
If it is determined during step 440 that there is a sudden change in the download bandwidth, then the Chunk Request Scheduler 130 continues to pipeline single audio and video chunk requests over a single connection during step 430. If, however, it is determined during step 440 that there is not a sudden change in the download bandwidth, then the Chunk Request Scheduler 130 returns to pipelining multiple audio and video chunk requests over a single connection during step 410.
Generally, as long as the average time for downloading the chunks equals the fixed playout time of the chunk, the pipelining method will eliminate all gaps. However, if the average chunk download time is smaller than the chunk playout time (and the client buffer 140 is of limited size), pipelining will enter a self-clocking regime. More precisely, when the client buffer 140 is full, the next request has to wait until a chunk in the client buffer has been consumed to allow for more data to arrive. This might easily lead to the introduction of gaps of similar frequency as without pipelining. In order to avoid the above problem, a hysteresis is introduced to control the pipelining behavior.
As shown in
If, however, it is determined during step 520 that the level of buffer 140 is above the upper threshold, then the exemplary Chunk Request Scheduler 130 switches to sequentially scheduling audio and video chunk requests over the same TCP connection during step 530. A further test is performed during step 540, to determine if the level of buffer 140 is below the lower threshold. If it is determined during step 540 that the level of buffer 140 is not below the lower threshold, then the exemplary Chunk Request Scheduler 130 continues to sequentially schedule audio and video chunk requests over the same TCP connection during step 530.
If it is determined during step 540 that that the level of buffer 140 is below the lower threshold, then the exemplary Chunk Request Scheduler 130 switches back to pipelining multiple audio and video chunk requests over a single connection during step 510 in order to maximize the data throughput.
As shown in
The exemplary Chunk Request Scheduler 130 then schedules audio chunk requests over the slower TCP connection(s) during step 630 and schedules video chunk requests over the faster TCP connection(s) during step 640.
Alternatively, the exemplary Chunk Request Scheduler 130 can send requests for smaller chunks on the slower connection(s) during step 630 (to build up the congestion window) and send requests for larger chunks on the faster connection(s) during step 640, as would be apparent to a person of ordinary skill in the art.
Generally, as long as the average time for downloading the chunks equals the fixed playout time of the chunk, the pipelining method will eliminate all gaps. However, if the average chunk download time is smaller than the chunk playout time (and the client buffer 140 is of limited size), pipelining will enter a self-clocking regime. More precisely, when the client buffer 140 is full, the next request has to wait until a chunk in the client buffer has been consumed to allow for more data to arrive. This might easily lead to the introduction of gaps of similar frequency as without pipelining. In order to avoid the above problem, a hysteresis is introduced to control the pipelining behavior.
As shown in
If, however, it is determined during step 720 that the level of buffer 140 is above the upper threshold, then the exemplary Chunk Request Scheduler 130 enters a waiting mode during step 730 until there is room for N chunks (e.g., rather than scheduling each new chunk as each chunk is consumed from the buffer).
A further test is performed during step 740, to determine if the level of buffer 140 is below the lower threshold. If it is determined during step 740 that the level of buffer 140 is not below the lower threshold, then the exemplary Chunk Request Scheduler 130 continues to wait during step 730.
If it is determined during step 740 that that the level of buffer 140 is below the lower threshold, then the exemplary Chunk Request Scheduler 130 switches back to requesting audio and video chunks over one or more connections during step 710, as discussed above. In this manner, the lower threshold specified in step 740 can trigger the exemplary Chunk Request Scheduler 130 to request N chunks at once during step 710 or to pipeline the requests over a single connection.
Conclusion
Among other benefits, aspects of the present invention can help HTTP Adaptive Streaming applications to achieve a higher data throughput between client and server and hence deliver higher quality video to end users.
While
While exemplary embodiments of the present invention have been described with respect to processing steps in a software program, as would be apparent to one skilled in the art, various functions may be implemented in the digital domain as processing steps in a software program, in hardware by a programmed general-purpose computer, circuit elements or state machines, or in combination of both software and hardware. Such software may be employed in, for example, a hardware device, such as a digital signal processor, application specific integrated circuit, micro-controller, or general-purpose computer. Such hardware and software may be embodied within circuits implemented within an integrated circuit.
Thus, the functions of the present invention can be embodied in the form of methods and apparatuses for practicing those methods. One or more aspects of the present invention can be embodied in the form of program code, for example, whether stored in a storage medium, loaded into and/or executed by a machine, or transmitted over some transmission medium, wherein, when the program code is loaded into and executed by a machine, such as a computer, the machine becomes an apparatus for practicing the invention. When implemented on a general-purpose processor, the program code segments combine with the processor to provide a device that operates analogously to specific logic circuits. The invention can also be implemented in one or more of an integrated circuit, a digital signal processor, a microprocessor, and a micro-controller.
System and Article of Manufacture Details
As is known in the art, the methods and apparatus discussed herein may be distributed as an article of manufacture that itself comprises a computer readable medium having computer readable code means embodied thereon. The computer readable program code means is operable, in conjunction with a computer system, to carry out all or some of the steps to perform the methods or create the apparatuses discussed herein. The computer readable medium may be a recordable medium (e.g., floppy disks, hard drives, compact disks, memory cards, semiconductor devices, chips, application specific integrated circuits (ASICs)) or may be a transmission medium (e.g., a network comprising fiber-optics, the world-wide web, cables, or a wireless channel using time-division multiple access, code-division multiple access, or other radio-frequency channel). Any medium known or developed that can store information suitable for use with a computer system may be used. The computer-readable code means is any mechanism for allowing a computer to read instructions and data, such as magnetic variations on a magnetic media or height variations on the surface of a compact disk.
The computer systems and servers described herein each contain a memory that will configure associated processors to implement the methods, steps, and functions disclosed herein. The memories could be distributed or local and the processors could be distributed or singular. The memories could be implemented as an electrical, magnetic or optical memory, or any combination of these or other types of storage devices. Moreover, the term “memory” should be construed broadly enough to encompass any information able to be read from or written to an address in the addressable space accessed by an associated processor. With this definition, information on a network is still within a memory because the associated processor can retrieve the information from the network.
It is to be understood that the embodiments and variations shown and described herein are merely illustrative of the principles of this invention and that various modifications may be implemented by those skilled in the art without departing from the scope and spirit of the invention.