The present invention relates generally to communication networks, and more particularly to delivery of content within such networks.
In order to facilitate the delivery of streaming video and other types of content, communication networks are typically configured to include network caches. For example, such caches may be deployed at multiple locations distributed throughout a given communication network. The network caches store content that has been previously requested from network servers by user devices, such as computers, mobile phones or other communication devices, and may additionally or alternatively store content that is expected to be requested at high volume by such devices.
Network caching arrangements of this type advantageously allow future requests for the cached content to be served directly from the caches rather than from the servers. This limits the congestion on the servers while also avoiding potentially long delays in transporting content from the servers to the user devices. Cache misses are handled by quickly transferring the requested content from the corresponding server to an appropriate network cache.
A potential drawback of conventional network caching arrangements is that such arrangements are not able to address local impairments that can arise on the access side of the network and adversely impact content streaming to the user devices. These impairments are particularly pronounced on wireless access portions of the network due to a number of factors including the scarcity of air link resources and channel variability due to fading and user device mobility.
Although well-known content streaming protocols such as Progressive Download (PD) streaming and Adaptive Bit Rate (ABR) streaming can dynamically adjust content delivery rates to adapt to varying network conditions, these protocols also have significant drawbacks. For example, such content streaming protocols generally provide no control over the particular network paths that are utilized for streaming of content to user devices. Also, each user device running an instance of one of these content streaming protocols typically makes its own independent decision regarding the content delivery rate to be utilized at a given point in time, which can lead to an inefficient allocation of network resources across multiple user devices.
Illustrative embodiments of the present invention provide improved delivery of streaming video and other types of content from network caches to user devices in a communication network. For example, these embodiments provide content delivery techniques that overcome the above-noted drawbacks of conventional network caching and content streaming protocols by opportunistically delivering the content to selected user devices at rates determined based on monitored conditions such as buffer occupancy and channel quality.
In one embodiment, at least one processing device of a communication network is configured to implement a content delivery system. The content delivery system is configured to identify a set of user devices to receive content in a scheduling interval, to initiate delivery of the content to the set of user devices at respective delivery rates for a first portion of the scheduling interval, to monitor conditions associated with delivery of the content to the set of user devices, and to adjust a delivery rate of at least one of the user devices in the set for a second portion of the scheduling interval based at least in part on the monitored conditions. As indicated above, the monitored conditions may comprise, for example, buffer occupancy and channel quality for each of the user devices. The identifying, initiating, monitoring and adjusting are repeated for each of a plurality of additional scheduling intervals.
The content delivery system may additionally be configured to select particular network caches from which the content will be delivered to the set of user devices, to select particular network paths over which the content will be delivered from the selected network caches to the set of user devices, and to control delivery of the content to the user devices from the selected network caches over the selected network paths. The selection of particular network caches and particular network paths may be performed at least in part responsive to the monitored conditions.
By way of example, the above-noted selection of network paths may involve selection of multiple network paths over which the content will be delivered to a given one of the user devices. In such an arrangement, the multiple network paths may be in different access networks, such as a cellular access network and a wireless local area network. The content delivery system may be configured to switch from a first one of the multiple network paths to a second one of the multiple network paths responsive to a change in at least one of the monitored conditions.
Illustrative embodiments of the invention will be described herein with reference to exemplary communication networks, content delivery systems and associated processing devices. It should be understood, however, that the invention is not limited to use with the particular networks, systems and devices described, but is instead more generally applicable to any network-based content delivery application in which it is desirable to provide improved performance in terms of parameters such as network resource utilization and user experience.
The user devices 106 may comprise, for example, computers, mobile telephones or other communication devices configured to receive content from content delivery system 102 via base station 105. A given such user device 106 will therefore generally comprise a processor and a memory coupled to the processor, as well as a transceiver which allows the user device to communicate with one or more network caches via the base station 105 and access network 104.
Content is delivered under the control of the content delivery system 102 to user devices 106-1 and 106-2 over respective network paths 112-1 and 112-2. The user devices 106 are also denoted herein as respective user devices A and B. In addition, a user device is also referred to herein as simply a “user,” although the latter term in certain contexts herein may additionally or alternatively refer to an actual human user associated with a corresponding device.
It should be noted that “content delivery” as the term is broadly used herein may refer to video streaming as well as other types of content streaming, as well as non-real-time content delivery.
In some embodiment, the user devices 106 are referred to as respective clients, and the content delivery system 102 is associated with one or more servers. However, other embodiments do not require the use of such a client-server model.
The content delivery system 102 controls delivery of content to the user devices 106 from a set 115 of network caches 115-1, 115-2, . . . 115-N. Although shown as separate from the access network 104 in the present embodiment, one or more of the network caches 115 may be implemented at least in part within the access network 104. Alternatively, the network caches 115 may be implemented elsewhere in the communication network 100 so as to be readily accessible to content delivery system 102. In the present embodiment, the content delivery system 102 is coupled between the network caches 115 and the access network 104, although other arrangements are possible.
Although only single instances of content delivery system 102, access network 104 and base station 105 are shown in
Also, there may be a significantly larger number of user devices 106 than the two exemplary devices shown in
In operation, the content delivery system 102 identifies a set of user devices 106 to receive content in a scheduling interval, initiates delivery of the content to the set of user devices at respective delivery rates for a first portion of the scheduling interval, monitors conditions associated with delivery of the content to the set of user devices, and adjusts a delivery rate of at least one of the user devices in the set for a second portion of the scheduling interval based at least in part on the monitored conditions. These operations are repeated for each of a plurality of additional scheduling intervals. The monitored conditions may comprise, for example, buffer occupancy and channel quality for each of the user devices.
The first and second portions of the scheduling interval may comprise respective measurement and regulation phases of the scheduling interval. A more detailed example of an arrangement of this type will be described in conjunction with the illustrative embodiment of
In identifying a set of user devices to receive content in a scheduling interval, the content delivery system 102 utilizes user device state information such as user device buffer occupancies provided as part of the monitored conditions. Thus, for example, the content delivery system 102 may identify user devices having respective buffer occupancies at or below a low watermark threshold and include those user devices in the set, and identify user devices having respective buffer occupancies at or above a high watermark threshold and exclude those user devices from the set. Those user devices having respective buffer occupancies between the low watermark threshold and the high watermark threshold may also be included in the set.
In adjusting a delivery rate of at least one of the user devices in the set for the second portion of the scheduling interval, the content delivery system 102 identifies at least one of the user devices in the set as having an above average channel quality for the first portion of the scheduling interval based on the monitored conditions. The content delivery system 102 then increases the delivery rate in the second portion of the scheduling interval for that device or devices, while also decreasing the delivery rate in the second portion of the scheduling interval for one or more other user devices in the set that are not identified as having an above average channel quality.
The adjusted delivery rate for a given user device may be an increased delivery rate selected to allow the buffer occupancy of the given user device to reach a specified level within the second portion of the scheduling interval.
Accordingly, the content delivery system 102 in the present embodiment opportunistically delivers content at higher rates to one or more user devices that are currently experiencing above average channel conditions, while reducing the rates for other user devices that may be experiencing below average channel conditions. As will be described in greater detail below, such an arrangement provides significant improvements in network resource utilization and user experience in the communication network 100 relative to conventional techniques.
The scheduling intervals are configured in one or more embodiments to have durations on the order of seconds or minutes, so as to take advantage of slow fading effects in the channels. This is distinct from conventional base station scheduling arrangements in which scheduling intervals are on the order of milliseconds in order to take advantage of fast fading effects in the channels.
Due to slow fading effects, also referred to herein as “shadow” fading, user device channels tend to oscillate slowly between above average channel quality supporting high rates and below average channel quality supporting low delivery rates. Examples of such time-varying channels are illustrated in
The content delivery system 102 in the present embodiment makes opportunistic use of these slow oscillations in channel quality by identifying in the first portion of a given scheduling interval which user device or devices are currently experiencing above average channel quality, and then increasing delivery rates for that device or devices, while reducing delivery rates for one or more other devices that are currently experiencing below average channel quality. This tends to result in a significant increase in content delivery throughput relative to conventional streaming in which each user device independently determines its delivery rate.
In implementing this opportunistic content delivery process, the content delivery system 102 makes use of the above-noted user state information relating to buffer fullness, which in the present embodiment corresponds to fullness levels of the device caches 110. More particularly, the content delivery system 102 more aggressively fills the device caches 110 at times when their corresponding user devices 106 are experiencing above average channel quality.
In delivering content to a set of user devices determined in the manner described above, the content delivery system 102 also selects particular network caches 115 from which the content will be delivered to the set of user devices, selects particular network paths 112 over which the content will be delivered from the selected network caches 115 to the set of user devices 106, and controls delivery of the content to respective device caches 110 of the set of user devices 106 from the selected network caches 115 over the selected network paths 112.
For example, in the
In selecting particular network caches and network paths, the content delivery system 102 utilizes user device state information and network state information, which may be obtained at least in part by monitoring conditions associated with content delivery in the manner previously described. Based on the user device station information and network state information, the content delivery system 102 dynamically selects the best network caches 115 and network paths 112 for delivering content to each of the user devices 106. This process may involve selecting particular user devices 106 to receive content from particular network caches 115 over particular network paths 112. The content delivery system 102 reacts to changing user device and network conditions by updating its selections and the associated content delivery schedule over multiple scheduling intervals.
It should be noted that the content delivery system 102 may select the same network cache for use in delivering content to multiple user devices 106. Thus, for example, a single one of the network caches 115 may be selected to deliver content to user devices 106-1 and 106-2 over respective network paths 112-1 and 112-2.
As indicated above, the content delivery system 102 in the present embodiment selectively assigns content delivery resources among contending user devices 106 based on channel quality measures or other monitored device or network state information relating to those user devices. Accordingly, at particular opportunistic times corresponding to favorable channel conditions for the selected user devices, the content delivery system 102 attempts to fill the device caches 110 of selected user devices 106 with delivered content in order to avoid content “starvation” at other times when their respective channel conditions are less favorable. As indicated previously, such arrangements can provide significant performance gains relative to conventional techniques. For example, by dynamically prioritizing resource allocations to user devices in accordance with their respective channel qualities, much higher network efficiency can be obtained.
The particular configuration of communication network 100 as shown in
For example, in the
More particularly, the NSC 102′ in the
In the
Another possible arrangement of communication network 100 is shown in
The CSCs 120 can monitor conditions such as buffer occupancy, channel quality (e.g., SNR, RSSI), session performance (e.g., throughput, delays, losses) and user device location (e.g., cell ID) and report this information to the NSC 102′. Note that when the CSC is deployed on the user device the NSC would not need to interface with the network elements to obtain additional data about the session (e.g., cell ID) since such information can be provided by the CSC.
It is also possible to implement the previously-described rate adjustments at least in part within the CSC itself by making the data flow to a media streaming application on the user device be proxied via the CSC. In this case, the NSC would not necessarily have to be in the data path but will only be responsible for adaptively selecting the data rates with the actual rate enforcement being implemented by the CSC.
Accordingly, the NSC in such an embodiment would control when each of the CSCs is allowed to download data, by providing appropriate control signals to the respective CSCs. Benefits of utilizing the CSC include better visibility into user device state and network state, and also more scalable distributed rate adjustment enforcement. However, implementation of the CSC within the user device may require changes to existing client side media streaming applications.
Again, the particular embodiments illustrated in
Referring now to
The scheduler 200 is configured to identify a set of user devices to receive content in a scheduling interval and to initiate delivery of the content to the set of user devices at respective delivery rates for a first portion of the scheduling interval. The monitor 204 is configured to monitor conditions associated with delivery of the content to the set of user devices, such as buffer occupancy and channel quality. The regulator 202 is configured to adjust a delivery rate of at least one of the user devices in the set for a second portion of the scheduling interval based at least in part on the monitored conditions.
The scheduler 200 in the present embodiment is also configured to select particular network caches 115 from which the content will be delivered to the set of user devices 106, and to select particular network paths 112, such as network paths 112-1 and 112-2 in the
Again, selection of particular network caches 115 may involve selection of a single network cache to deliver content to multiple user devices 106. Arrangements of this type are illustrated in
As indicated previously, the selection operations of the scheduler 200 may occur in respective scheduling intervals, also referred to in some embodiments herein as time slots. Thus, for each of a plurality of such scheduling intervals, the scheduler 200 may identify one or more user devices that are experiencing above average channel quality and for which content will therefore be delivered at increased rates.
Content delivery in some embodiments may occur in “sessions” established with respective user devices 106. Thus, in some embodiments, content is delivered to user devices in respective sessions associated with those devices. A given session generally refers to delivery of content to a particular user device.
The scheduler 200 may take into account not only buffer occupancy and channel quality but also other user state information such as buffer drain rate. For example, it can maintain higher rates for those user devices 106 in which their respective device caches 110 are in danger of reaching the above-noted low watermark threshold. This ensures that in addition to high network efficiency, high quality user experience is also maintained for each session.
The regulator 202 adaptively adjusts the rate of data transfer to one or more of the sessions by, for example, slowing down or suspending selected sessions while letting other sessions proceed unconstrained at the highest possible rate. This rate limiting can be implemented in many ways, including slowing down TCP acknowledgements sent from the user devices to the content delivery network.
The above-described functionality of scheduler 200 and regulator 202 are exemplary only, and additional or alternative functionality may be provided in such elements. For example, in other embodiments, the regulator 202 may be eliminated and the rate adjustment functionality may instead be implemented in the scheduler. Also, a scheduler in another embodiment can be configured to use different video resolutions. For example, in some cases different lower resolution videos may be available and besides selecting the data rate for each user the scheduler may also dynamically select the video resolution for improved performance.
Embodiments of the invention are flexible in terms of the scheduling intervals that are used to control content delivery to user devices. However, as indicated above, embodiments of the invention can utilize scheduling intervals on the order of seconds or minutes in order to take advantage of shadow fading. Such arrangements can advantageously complement existing base station scheduling mechanisms that utilize scheduling intervals on the order of milliseconds to take advantage of fast fading.
The monitor 204 is configured to monitor at least one of user device state information and network state information such that selection operations performed by the scheduler 200 are performed at least in part responsive to at least one of the monitored user device state information and the monitored network state information.
One possible example of such monitored information is illustratively shown in
The user device state information conveyed from a given user device 106 to content delivery system 102 or NSC 102′ may more particularly comprise information such as buffer occupancy, channel quality and available access networks.
At least a portion of the network state information may be obtained directly by the content delivery system 102 or NSC 102′ from appropriate network elements rather than via feedback from the user devices 106.
The network state information may include information such as access network state information and network cache state information. The access network state information may comprise, for example, information indicative of utilization and congestion of the access network 104 and possibly one or more additional access networks that may be utilized to deliver content to the user devices 106. The network cache state information may comprise, for example, processing load information for each of the network caches 115. Numerous other types of network state information may be used in other embodiments.
Thus, in a given embodiment the monitor 204 may be configured to track the per-session network state including throughput, packet losses and retransmissions as well as the user device state. In addition, the monitor 204 may obtain from base station 105 or other network elements (e.g., an RNC) one or more additional parameters including the cell ID and other location information of the user device. The collected information is used to identify the set of sessions with impaired channel quality and whose data transfer can be curtailed to favor other contending sessions in the same cell that are getting above average channel quality and hence data rate.
As indicated previously, selection operations performed by the scheduler 200 are performed at least in part responsive to at least one of the monitored user device state information and the monitored network state information.
These operations may be carried out so as to ensure that the respective device caches 110 of the selected user devices 106 can be substantially filled to a designated level within a designated amount of time, or to ensure that a designated amount of the content can be delivered to the respective device caches 110 of the selected user devices 106 within a designated amount of time.
For example, selecting in scheduler 200 particular ones of the user devices 106 to receive content at respective delivery rates may involve selecting a subset of the user devices having higher respective ratios of current channel quality to average channel quality relative to other ones of the user devices not in that subset. In other words, the scheduler 200 selects for the current scheduling interval those user devices that have the highest ratios of current channel quality to average channel quality. Other types of channel quality measures or selection criteria may be used in other embodiments.
Accordingly, in this embodiment the content delivery system 102 may be viewed as selecting for an increased delivery rate in a current scheduling interval only those user devices that are currently experiencing channel quality above their average channel quality. This helps to ensure the best use of available channel capacity in filling up device caches 110 with delivered content. The particular user devices that are selected to receive content at increased rates based on channel quality measures will typically vary from scheduling interval to scheduling interval, as network conditions change. The content delivery system 102 is therefore able to react to changing network conditions by selecting different user devices, network caches and network paths for content delivery.
For a given user device 106-1 selected for content delivery in a current scheduling interval based on its channel quality as described above, the scheduler 200 may select the network cache 115 having the highest available bandwidth.
In the embodiment shown in
Such an arrangement provides an improved user experience relative to conventional arrangements, independent of short term changes in network state. Also, there is no need for the network to support specialized streaming mechanisms, as the streaming in this embodiment is supported by the device cache 110-1 within the user device 106-1.
As indicated previously, the scheduler 200 selects particular network paths over which content will be delivered from the selected network caches 115 to the set of user devices 106. In some embodiments, this may involve selecting multiple network paths over which the content will be delivered to a given one of the user devices 106. For example, the content delivery system 102 may be configured to switch from a first one of the network paths to a second one of the network paths responsive to a change in at least one designated network condition.
An example of an embodiment of this type is shown in
Based on changing network conditions as detected using such monitoring, the content delivery system 102 switches to a second selected network path from network cache 115-N, as illustrated by downward dashed arrow (1) in the figure. The content delivery system 102 therefore adapts to changes in network conditions by switching over to better network caches and network paths during data transfer to the selected user device 106-1. Again, this ensures that content continues to be delivered at an appropriate rate, without adversely impacting user experience.
In such an arrangement, for example, the multiple network paths over which the content will be delivered to the given selected user device 106-1 may comprise a first network path through a first access network such as access network 104 and a second network path through a second access network different than the first access network. As one possible illustration, the first and second access networks in such an arrangement may comprise a cellular access network and a wireless local area network, respectively. This is illustrated in the embodiment of
Based on monitored user device state information and monitored network state information, the content delivery system 102 switches between these multiple paths, possibly within a given scheduling interval, in delivering content to the device cache 110-1 of the selected user device 106-1. For example, different portions of the content to be delivered can be delivered over different ones of the multiple paths. Alternatively, portions of the content may be delivered simultaneously over two or more of the multiple paths. The portions may be downloaded out of order and at least partially in parallel and combined in the device cache 110-1 for streaming to the media player 108-1. Again, this ensures that the content is delivered at an appropriate rate to the selected user device.
It is to be appreciated that the various delivery arrangements shown in
Referring again to
The processor 210 may be implemented as a microprocessor, a microcontroller, an application-specific integrated circuit (ASIC), field-programmable gate array (FPGA), or other type of processing device, as well as portions or combinations of such devices.
The memory 212 may comprise an electronic random access memory (RAM), a read-only memory (ROM), a disk-based memory, or other type of storage device, as well as portions or combinations of such devices.
The processor 210 and memory 212 may be used in storage and execution of one or more software programs for performance of operations such as scheduling, regulating and monitoring within the content delivery system 102 or NSC 102′. Accordingly, one or more of the scheduler 200, regulator 202 and monitor 204 or portions thereof may be implemented at least in part using such software programs.
The memory 212 may be viewed as an example of what is more generally referred to herein as a computer program product or still more generally as a computer-readable storage medium that has executable program code embodied therein. Other examples of computer-readable storage media may include disks or other types of magnetic or optical media, in any combination. These computer-readable storage media and a wide variety of other articles of manufacture comprising such computer-readable storage media are considered embodiments of the present invention.
The processor 210, memory 212 and network interface 214 may comprise well-known conventional circuitry suitably modified to operate in the manner described herein. Conventional aspects of such circuitry are well known to those skilled in the art and therefore will not be described in detail herein.
It is to be appreciated that an NSC or other type of content delivery system as disclosed herein may be implemented using components and modules other than those specifically shown in the exemplary arrangement of
In order to illustrate the performance gains possible using communication networks configured as illustrated in
Although channel variations for different user are typically uncorrelated, it is assumed in this example that the user A channel is above its average rate whenever the user B channel is below its average rate and vice versa. More particularly, the user A channel toggles between an above average rate of 7 Mbps and a below average rate of 5 Mbps, while the user B channel toggles between an above average rate of 3 Mbps and a below average rate of 1 Mbps.
Thus, if A were the only user to be scheduled, its rate would toggle between 7 and 5 Mbps for an average rate of 6 Mbps. Likewise if B were the only user to be scheduled it would get an average rate of 2 Mbps. However, if both users are active at the same time then they will share the network proportionally resulting in an average rate of (6+2)/2=4 Mbps, with A getting a rate of 6/2=3 Mbps and B getting a rate of 2/2=1 Mbps.
Next assume that users are selected for data transfer based on how much better their channel quality is compared to their average channel quality, in accordance with content delivery techniques disclosed herein. In the current two-user example, this corresponds to selecting that user for data transfer that maximizes the ratio of its current rate to its average rate.
Accordingly, user A will be selected for data transfer in the first time interval, user B will be selected for data transfer in the second time interval, user A will be selected for data transfer in the third time interval, and so on. Thus, only one user is active at any given time and when active user A will get a rate of 7 Mbps and user B will get a rate of 3 Mbps. Since each user will only be active half the time, users A and B will get average rates of 7/2 and 3/2 Mbps respectively for an overall average rate of 7/2+3/2=5 Mbps.
Thus, in the present example, by prioritizing users with above average channel quality for data transfer under the control of content delivery system 102, not only does each user get a higher rate (16.7% higher for A and 50% higher for B) but also the overall throughput of the network is improved (by 25%).
The foregoing is a simple example of an arrangement in which content delivery system 102 dynamically schedules users for data transfer depending on monitored conditions such as channel quality.
This example may be viewed more generally as a type of arrangement in which data transfer for users having temporarily impaired channel quality is curtailed in favor of those users whose channels are currently above their average channel quality. Techniques of this type can be effective even for streaming applications as long as there is high variability in the channels (e.g., due to shadow fading) to ensure that a given user does not stay in below average channel conditions for long periods of time and hence gets scheduled for data transfer often enough to make good progress without adversely impacting the user playback experience.
The content delivery system 102 in these embodiments utilizes the device caches 110 to help deal with playback disruptions that might otherwise occur during periods when data transfer to one or more users is suspended or otherwise performed at a reduced rate due to their below average channel quality. More particularly, opportunistic data transfers by the content delivery system keep the devices caches 110 well replenished. This is because users selected to receive data transfers get high rates, not only due to their above average channel quality, but also due to the fact that they are sharing the available network capacity with fewer other users.
It should be noted that embodiments of the invention can be combined with other mechanisms for buffering large amounts of data to prevent disruptions. For example, user movements in the increasingly common HetNet environment (e.g., an integrated network of LTE, WiFi and small cells) can provide pockets of high capacity areas with very high data rates where fast buffer filling is possible. Pre-loading large portions of the content in advance of playback during off-peak times is also a viable option particularly given the large amounts of client side storage.
An embodiment that prioritizes user devices based on their channel quality and variability should generally not penalize other user devices, particularly those user devices that happen to be located in regions with relatively poor channel quality (e.g., the edge of a cell) or have low channel variability (e.g., static users). Such user devices should continue to receive equal or higher treatment from the network to avoid any deterioration in their user experience, particularly for streaming applications.
This can be accomplished in one or more embodiments of the present invention by additionally incorporating buffer occupancy of the user devices in the scheduling process. For example, as mentioned previously, the content delivery system 102 can be configured such that data transfer for user devices whose buffer occupancy is below a low watermark threshold is not curtailed. As a result, the dynamic prioritizing of data transfer based on channel conditions is only applied to user devices with buffer occupancy above the low watermark threshold. These user devices can deal with disruptions in data transfers during scheduling intervals when they are suspended or otherwise reduced in rate by the scheduler 200.
In addition, some user devices are suspended irrespective of their channel quality once they have built up sufficient buffer occupancy that can continue to play delivered content for some time without needing additional data transfers. Any excess capacity generated either by dynamic prioritization of user devices or by suspending user devices with high buffer occupancy can be made available to user devices with low buffer occupancy to ensure enhanced user experience.
An exemplary scheduling algorithm that may be implemented in content delivery system 102 to provide prioritization of user devices based on buffer occupancy and channel quality will now be described in greater detail, with reference to
In this exemplary scheduling algorithm, the dynamic scheduling of data transfers is performed once per time slot. The time slot length t is selected to be on the order of tens of seconds. Not considering the impact of fast fading at very short time scales on the order of milliseconds, the channel impairments due to slow fading can be assumed to be relatively static in each time slot as defined above, since a mobile user device can typically only travel a few tens of meters within each time slot, and hence the data rates stay substantially constant within each time slot. In addition, the average channel quality (ACQ), which is related to the distance based constant radio link power loss, can be assumed to be static even for much longer time scales, on the order of tens or even hundreds of time slots, especially since streaming is mainly carried out by semi-static users.
Each scheduling time slot is further sub-divided into two sub-slots of length t1 and t2 where t=t1+t2. The sub-slot of length t1 is a measurement phase which precedes a regulation phase corresponding to the sub-slot of length t2. In the regulation phase, the data transfer for one or more sessions is adjusted based on the buffer occupancy and channel quality of each of the corresponding user devices 106. The buffer occupancy in this context refers to the amount of delivered content stored in the device cache 110 of the corresponding user device.
Consider a cell C and a time slot T. Assume there are n streaming users S={Ui,1≦i≦n} in cell C at this time. We use Qi to denote the ACQ for user i. Let Bi denote the buffer occupancy for user Ui at the beginning of time slot T. Bi is measured in units of time slots of size t. This means that the playback for user Ui can continue for Bi*t seconds from its buffer alone, without any further data transfer. Note that B, depends not only on the number of bytes of data buffered but also on the minimum required streaming rate for user Ui.
Let Mi denote the minimum required streaming rate for user Ui In case of video streaming, Mi is rate at which the video is encoded. Let θi(k) (in units of time slots each t seconds long) denote the buffer occupancy threshold for user Ui for it to be scheduled for dynamic prioritizing of data transfer with k users. Specifically, it means that for Ui to be selected for dynamic prioritization with k−1 other users, its buffer occupancy must be larger than this threshold: Bi≧θi(k). The threshold θi(k) depends on k and additional details regarding computation of the threshold will be provided below.
Let θH, θL denote high and low watermark thresholds respectively such that users with buffer occupancy Bi exceeding the high watermark threshold are not scheduled in time slot T while those with buffer occupancy of at most the low watermark threshold are scheduled in time slot T irrespective of their channel conditions. Exemplary values for these thresholds will be provided below.
As indicated previously, the scheduling algorithm in the present example is illustrated in
Let the rates of these users Ui in the measurement phase be denoted by Ri and their channel qualities be denoted by CQi. Next, the largest set of users S4⊂S2 is determined for which data transfer can be dynamically prioritized based on channel conditions. This is done by finding the largest value k (at most the size of S2) such that there are at least k users Ui in S2 whose buffer occupancy exceeds their buffer occupancy threshold θi(k) at average data transfer rates Ri. If there are more than k users that satisfy this condition then among them the k best users who have the highest excess buffer occupancy Bi−θi(k) are selected. S4 then consists of all these k users. All the remaining users in S2-S4 are moved to set S3 to be scheduled for data transfer in this slot. Among the users in S4 the user Uj with the best current channel quality ratio CQi/Qi (and hence whose rate is highest compared to its average rate) is selected for data transfer in this time slot.
In the regulation phase, all users in S3 and user Uj are allowed to do data transfer. We start out by capping the rates of the users Ui in S3 to their measured rates Ri. The user Uj is allowed the rest of the capacity of the cell. At this point, if the rate that Uj is getting far exceeds kMj, then some of this excess capacity is given to the users Ui in S3 who may not be getting their minimum required rate Mi.
We use two parameters δ1,δ2 to control this rate boosting. We always pick the user for rate boosting who is furthest behind in getting its minimum required rate. The rate boosting is done by increasing the rate cap of the user Ui. Note that this will typically trigger a proportional fair mechanism in the cell to try to equalize the bandwidth allocation between users Uj and user Ui thus lowering the rate of user Uj and increasing the rate of user Ui up to its bandwidth cap.
In the regulation phase of the above-described algorithm, the rate of Uj is compared to kMj rather than Mj because even though Rj is the rate user Uj gets in this time slot in the long run its average rate is expected to be only Rj/k. This is because if the set S4 were not to change over the next few time slots then only one user from the set S4 will be scheduled in a time slot. This follows from the assumed independence in the channel variations among the users. As a result we can expect Uj to be scheduled in only 1/k-th of the time slots on the average resulting in an average rate of Rj/k. Hence, the rate of Uj in this time slot should be at least kMj so that on average its rate is at least Mj.
The scheduling algorithm of
The fast fading happens at very short time granularity on the order of milliseconds and is exploited by conventional base station scheduling. However, in order to effectively deal with the shadow fading, the content delivery system 102 in illustrative embodiments is configured to operate at a much coarser time granularity, and may therefore be complementary to conventional base station scheduling. Shadow fading is generally known to have a lognormal distribution with an autocorrelation function that decays exponentially with distance. In particular the correlation between two points x meters apart is given by e−αx where α= 1/20 for environments intermediate between urban and suburban microcellular. Accordingly, the shadow fading function stays substantially stationary for small distances.
The
The manner in which buffer occupancy thresholds can be determined for the
The
Let BS (k) be the additional buffering component attributable to use of the
It can be shown that if the channel fading variations of the k users are independent then the probability that the next data transfer for user Ui would be scheduled after n time intervals is at most e−n/k. Thus, at n=BS(k)=3k, this probability is less than 0.05.
Although user Ui is only involved in data transfer once every k time intervals on average, when it is scheduled it is the only user among the k users doing the data transfer and hence gets k times more rate. In addition, since user Ui is scheduled only at time intervals when its channel quality and hence data rate are above average its data transfer stays higher than it would be if all k users were allowed data transfer at all times. This suggests that the data transfer of user Ui can fall behind its average data transfer by at most BT+BS(k) time intervals, thus motivating the above-described threshold θi(k)=BT+BS(k).
As a more particular example, assume that there are k=10 users whose data transfers are being dynamically prioritized. Let the scheduling time slot length be t=10 seconds. Thus for user Ui we have BS(k)=3k=30 time intervals or a buffer of size 30*10=300 seconds or 5 minutes. Thus 5 minutes of additional buffer occupancy, beyond that required by the conventional buffer threshold BT, is needed for implementation of the
As indicated above, the
The length of the measurement phase in the
Also, the number of users n selected out of a total of NU users for data transfer in the regulation phase can be varied. Thus embodiments of the invention can opportunistically and dynamically select the best n<NU users in every measurement phase and only allow data transfers to those n users in the regulation phase.
Again, the foregoing are merely examples of possible implementations of certain embodiments, and should not be construed as limiting in any way.
Embodiments of the invention can provide significant advantages relative to conventional techniques. For example, by giving priority to the user devices with the best channel quality in each scheduling interval, the content delivery system 102 with scheduler 200 implementing the
The scheduler also quickly reacts to the changes in the network and user device conditions to dynamically update the set of user devices selected for data transfer. The dynamic nature of the radio link (e.g., due to shadow fading) ensures that a mobile user is not stuck in a poor state for long periods of time and hence that user gets picked for data transfer by the scheduler often enough to keep its device cache well stocked with data. The scheduler is therefore able to opportunistically deliver content to all users while making sure that the content is delivered at the highest possible rates thus resulting in very efficient utilization of network resources.
As mentioned above, embodiments of the present invention may be implemented at least in part in the form of one or more software programs that are stored in a memory or other computer-readable storage medium of a processing device of a communication network. As an example, components of content delivery system 102 such as scheduler 200, regulator 202 and monitor 204 may be implemented at least in part using one or more software programs.
Of course, numerous alternative arrangements of hardware, software or firmware in any combination may be utilized in implementing these and other system elements in accordance with the invention. For example, embodiments of the present invention may be implemented in one or more ASICS, FPGAs or other types of integrated circuit devices, in any combination. Such integrated circuit devices, as well as portions or combinations thereof, are examples of “circuitry” as the latter term is used herein.
It should again be emphasized that the embodiments described above are for purposes of illustration only, and should not be interpreted as limiting in any way. Other embodiments may use different types of communication networks, access networks, content delivery systems, server components, client components, schedulers, monitors, user devices, buffers and other network elements, depending on the needs of the particular application. Alternative embodiments may therefore utilize the techniques described herein in other contexts in which it is desirable to provide improved throughput for content delivery to multiple user devices in a communication network. Also, it should be understood that the particular assumptions made in the context of describing the illustrative embodiments should not be construed as requirements of the invention. The invention can be implemented in other embodiments in which these particular assumptions do not apply. These and numerous other alternative embodiments within the scope of the appended claims will be readily apparent to those skilled in the art.