Jitter buffer adjustment algorithm

Information

  • Patent Grant
  • 6747999
  • Patent Number
    6,747,999
  • Date Filed
    Monday, November 15, 1999
    25 years ago
  • Date Issued
    Tuesday, June 8, 2004
    20 years ago
Abstract
A jitter buffer controller (110) allows the depth of the jitter buffer (113) to be adjusted dynamically according to the varying jitter of the current sequence. The jitter buffer controller (110) maintains a cache (109) of previous jitter values. This cache contains exponentially averaged values that represent the long-term behavior of specific destination's jitter characteristics, i.e., a jitter characteristic memory. These values are used to initialize the jitter buffer's depth. The jitter cache prevents the buffer's depth from being initialized with an arbitrary value that disregards the specific destination's jitter characteristics.
Description




BACKGROUND OF THE INVENTION




1. Field of the Invention




The present invention relates to packet transmission and, particularly, to a system and method for optimizing a jitter buffer.




2. Description of the Related Art




When sending voice data across packet networks, such as telephony over LAN (ToL) or Voice over IP (VoIP) networks, the voice is usually compressed, packetized and, finally, sent across the network to the destination. When the packets are sent into the network, they are generated at a constant rate. However, due to behavior of the packet network, the even time intervals between the packets are lost as the packets transit the network. This irregularity in packet separation is referred to as “jitter.” Jitter can cause clicks, delays and other annoyances in multimedia transmission, creating overall poor reproduction quality.




A jitter buffer is often used to even out the packet separation. A jitter buffer is a FIFO (first in, first out) buffer in which packets leave the buffer at a predetermined, constant rate. Minimizing the amount of actual jitter buffering is important because the jitter buffering process introduces delays in the reproduced signal. Excessive buffering delays transmission output, while under-buffering causes gaps in the data.




Jitter rates vary throughout a transmission sequence. A jitter rate is the average variance in packet arrival times. It is measured as packets arrive over a specific implementation defined interval. The actual jitter rate reported (in accordance with IETF RFC 1889) is an exponentially averaged value of the jitter for each packet over the interval. The distribution of the averaged jitter rate is significantly different from the actual jitter values, so common queueing theory solutions are not applicable.




A Jitter Buffer designed with a constant predetermined depth is referred to as a Static Jitter Buffer. A Static Jitter Buffer does not recognize each sequence's unique jitter characteristics and can not adjust itself to meet the needs of individual sequences. The inflexibility of the Static Jitter Buffer degrades the smoothing capability of the jitter buffering process, thereby failing to provide sufficient buffering for some sequences while unnecessarily delaying others.




SUMMARY OF THE INVENTION




These and other drawbacks in the prior art are overcome in large part by a system and method according to the present invention. A jitter buffer controller according to the present invention allows the depth of the jitter buffer to be adjusted dynamically according to the varying jitter of the current sequence. The jitter buffer controller maintains a cache of previous jitter values. This cache contains exponentially averaged values that represent the long-term behavior of specific destination's jitter characteristics, i.e., a jitter characteristic memory. These values are used to initialize the jitter buffer's depth. The jitter cache prevents the buffer's depth from being initialized with an arbitrary value that disregards the specific destination's past jitter characteristics.




The jitter buffer controller uses two main variables to adjust the depth of the jitter buffer: current_depth and rise. The current_depth accounts for the current jitter while the rise “cushions” increases to the depth as a result of having a history of past jitter. These two variables are used to create a more accurate depth that is adaptive to varying jitter rates. The jitter buffer depth can change throughout a single data sequence based on the variance of the effective jitter.











BRIEF DESCRIPTION OF THE DRAWINGS




A better understanding of the invention is obtained when the following detailed description is considered in conjunction with the following drawings in which:





FIG. 1

is a block diagram of an exemplary telephony over LAN (ToL) network according to an embodiment of the invention; and





FIG. 2

is a block diagram of an exemplary codec according to an embodiment of the present invention.











DETAILED DESCRIPTION OF THE INVENTION





FIGS. 1-2

illustrate a system and method for adjusting jitter buffers according to an implementation of the present invention. According to an embodiment of the invention, the depth of a jitter buffer is adjusted based on current jitter and a history of past jitter. It is noted that the teachings of the present invention are applicable to any transmission medium in which data is produced at a constant rate but where the transmission medium perturbs the rate. Thus, the figures are exemplary only.




Turning now to

FIG. 1

, an exemplary telecommunications system


100


according to an embodiment of the invention is shown therein. The telecommunications system


100


includes a local area network (LAN) or packet network


101


. As shown, the telecommunications network is embodied as an H.323 compliant network. It is noted, however, that any type of multimedia packet network or any network for transmission of time dependent data may be employed. As shown, coupled to the LAN


101


are a variety of H.323 terminals


102




a


,


102




b


, a multi-point control unit (MCU)


104


, an H.323 gateway


106


, an H.323 gatekeeper


108


, a LAN server


112


, and a plurality of other devices such as personal computers (not shown).




The H.323 terminals


102




a


,


102




b


are in compliance with the H.323 Recommendation. Thus, the H.323 terminals


102




a


,


102




b


support H.245 control signaling for negotiation of media channel usage, Q.931 (H.225.0) for call signaling and call setup, H.225.0 Registration, Admission, and Status (RAS), and RTP/RTCP for sequencing audio and video packets. The H.323 terminals


102




a


,


102




b


may further implement audio and video codecs, T.120 data conferencing protocols and MCU capabilities. Further details concerning the H.323 Recommendation may be obtained from the International Telecommunications Union; the H.323 Recommendation is hereby incorporated by reference in its entirety as if fully set forth herein.




Further, as will be explained in greater detail below, one or more of the network entities such as the gateway


106


may include a jitter buffer


113


and jitter buffer control


110


according to the present invention. As will be described in greater detail below, the jitter buffer control


110


functions to identify current and past jitter behavior. Jitter buffer depth is adjusted based on this analysis.




An exemplary interface including a jitter buffer system according to an embodiment of the present invention is shown in

FIG. 2. A

codec


14


includes an encoder


88


for encoding audio data and a decoder


96


for decoding incoming audio data. The decoder


86


is coupled to a digital-to-analog converter


82


. Similarly, the encoder


88


is coupled to an analog-to-digital converter


84


. A jitter buffer


113


is provided at the input to the decoder


86


. A packetizer


80


is provided at the output of the encoder


88


. The packetizer


80


formats outgoing audio data into data packets for transmission over the data network. A controller


110


, which may be embodied as a known microcontroller, controls operation of the jitter buffer


113


and the packetizer


80


. As will be explained in greater detail below, the controller


110


, in conjunction with the jitter buffer cache


109


, monitors jitter behavior and adjusts jitter buffer depth based on an analysis of current and past jitter. The jitter cache contains an exponentially averaged value representing specific destinations' statistical histories of jitter.




A dynamic jitter buffer control according to the present invention dynamically adjusts a jitter buffer depth in terms of a “current depth” and a “rise.” Current depth is a parameter related to the measured size of the jitter buffer. Rise is the calculated “cushion” appended to the jitter buffer depth to account for a connection's previous history of jitter. Each time an increased level of jitter is sampled, a predetermined penalty is added to the rise, increasing the amount of “extra” buffer space provided. The rise penalty is a constant positive value which can only serve to increase the value of the rise. When the algorithm encounters a jitter rate higher than what the buffer provides for, a predetermined rise penalty is added to the current rise thereby increasing the overall depth of the jitter buffer. The increase in the overall rise of the jitter buffer prepares the buffer to handle an increase in jitter throughout a transmission. The rise variable allows the system to “learn” that a bad path might become bad again and more quickly adapt to it, while the current_depth adjusts to the current jitter characteristics. The delay will be minimized on good connections but the rise variable then allows the system to remain stable if the connection begins to encounter jitter again.













TABLE 1









Parameter




Description











Current_depth




Current depth of the algorithm. The current_depth







should not be confused with the overall depth of the jitter







buffer. The overall depth of the jitter buffer includes both







the current_depth and the rise.






rise




The “cushion” to increase the depth as a result of







having a history of past jitter. This is a monotonically







increasing variable.














Parameters used include maximum and minimum thresholds, as well as rise increments, as described in Table 2 below:














TABLE 2









Parameter




Description




Value Range











Minimum









This is the absolute lowest depth




Minimum buffering






depth




to which the jitter buffer may be




required for “optimal”







adjusted.




network configuration.






Maximum









This is the highest depth to




Maximum buffering






depth




which the jitter buffer may be




value compatible with







adjusted.




acceptable








regeneration of the








original data.






Initial_depth




This is the initial value that will




Tuned value based on







be assigned to current_depth




mean network delivery







before the first jitter sample is




time divided by the







received if no information is




nominal packet rate.







available in the Jitter cache.






Initial_rise




This is the initial value for the




Tuned value based on







rise penalty cushion if no




the standard deviation







information is available in




of the mean network







the Jitter cache.




packet delivery time








(doubled) divided by








the nominal packet








rate.






Maximum_-




This is the maximum assessed




Tuned value, to permit






constant_rise




rise penalty that can be




quickly adapting to







administered to the jitter buffer




degrading network







depth. This value limits the




conditions, while







amount of buffering “over” the




permitting acceptable







current jitter-derived value




performance for







applied.




nominal network








behavior. Generally,








a bit less than the








maximum depth less








the initial depth.






Rise_penalty




A configurable constant which is




Tuned value providing







added to the rise to account for




for several increments







jitter. The rise_penalty is used




of rise.







to increase the rise if deemed







necessary by the algorithm and







is the amount the cached rise is







decremented initially for long-







term decay














As noted above, the jitter buffer depth is a combination of a current_depth and a rise value. These are used to allow the jitter buffer controller to adjust jitter buffer depth to meet the specific needs of each sequence.




The jitter buffer depth is initialized when a new sequence begins. During the initial jitter measurements, the jitter sample data is insufficient to derive a new jitter buffer depth customized to the current transmission. A certain number of jitter samples must be received before an accurate jitter buffer depth can be determined. The jitter cache prevents this lack of data from degrading jitter performance during this initial period.




To initialize the system for a particular transmission sequence using a particular path identification, the following operation is performed:














TABLE 2











Maximum_-




This is the maximum assessed




Tuned value, to permit






constant









rise penalty that can be




quickly adapting to






rise




administered to the jitter buffer




degrading network







depth. This value limits the




conditions, while







amount of buffering “over” the




permitting acceptable







current jitter-derived value




performance for







applied.




nominal network








behavior. Generally,








a bit less than the








maximum depth less








the initial depth.






Rise









A configurable constant which is




Tuned value providing






penalty




added to the rise to account for




for several increments







jitter. The rise_penalty is used to




of rise.







increase the rise if deemed







necessary by the algorithm and







is the amount the cached rise is







decremented initially for long-







term decay














As noted above, the jitter buffer depth is a combination of a current_depth and a rise value. These are used to allow the jitter buffer controller to adjust jitter buffer depth to meet the specific needs of each sequence.




The jitter buffer depth is initialized when a new sequence begins. During the initial jitter measurements, the jitter sample data is insufficient to derive a new jitter buffer depth customized to the current transmission. A certain number of jitter samples must be received before an accurate jitter buffer depth can be determined. The jitter cache prevents this lack of data from degrading jitter performance during this initial period.




To initialize the system for a particular transmission sequence using a particular path identification, the following operation is performed:




















if the path identification is in the Jitter cache then







{







 current_depth = cached value (depth) + rise_penalty







 rise   = cached value (rise) − rise_penalty







}







else







{







 current_depth = initial_depth







 rise   = initial_rise







}







 new_depth=minimum_depth







if new_depth >= maximum_depth then







{







 if current_depth = maximum_depth then







 {







  (Algorithm bound reached, the network is unreliable)







  Do not continue (No adjustment needed)







 }







 new_depth = maximum_depth







}







if new_depth + 1 > current_depth then







{







 new_rise = rise + rise_penalty







 if new_rise <= maximum_constant_rise then







  rise = new_rise







}







current_depth = new_depth + rise







PostSmoothingOption







Adjust jitter buffer depth to current_depth







if an adjustment has been made then







{







 new_cache_depth = (cache_alpha * prev_cache_depth)







     + ((1 − cache_alpha) * current_depth)







 new_cache_rise = (rise_alpha * prev_cache_rise)







     + ((1 − rise_alpha) * rise)















A variable, called new_depth, is calculated by taking the current sample value and dividing it by the frame duration. The frame duration represents the quantization division of the originally continuous data. A pre-smoothing operation also may be performed. Pre-smoothing is optional and is described in greater detail below. The variable new_depth is applied to the depth of the jitter buffer if it is deemed necessary by the controller. Once the possible new_depth has been determined, a series of comparisons are made to determine if an adjustment to the jitter buffer depth is required.




If the new_depth is less than the pre-configured minimum depth then the new depth is changed to the value of the minimum depth. If the new_depth is equal to the maximum_depth value, then it is determined if the current_depth equals the maximum_depth, and no adjustment is made. In this situation, a warning is issued stating that the jitter for this destination is very high. If, however, the new_depth has exceeded the acceptable maximum_depth value, then new_depth is set equal to the maximum_depth. This provides for improved accuracy in the jitter buffer depth, without exceeding the limits which would result in excessive delay.




If the computed new


13


depth plus one is greater than the current_depth, but does not meet any of the previous conditions, adjustments to both the rise and current_depth need to be made. A rise penalty is added to the current rise value. If the new_rise is less than or equal to the maximum_constant_rise, the rise is set to the new rise. As described above, the rise penalty is a monotonically increasing value that accounts for the variance of past jitter. This value also prevents the occurrence of sporadic or cyclic jitter from skewing the accuracy of the jitter buffer adjustment. Next, the current_depth is set to the new_depth plus rise. A post-smoothing operation may occur, as will be described in greater detail below. The cache depth is updated to the current depth.




Each time the current_depth or rise is changed, the jitter cache is updated to reflect the most recent states of these variables. That is, a new_cache depth and a new_cache


13


rise are determined.




In particular, as noted above, the jitter cache contains exponentially averaged values representing a specific destination's history of depth and rise values. Each value is a combination of the current state of the variable being entered and a value representing the past states of the variable as a whole. A “sliding window exponential average” equation is used in the updating of both the depth and rise values in the cache. This allows the current rise and depth values to have a predetermined weight when used to update the jitter cache. The parameter Alpha determines the exponential behavior of this cache when new data is added. The Alpha value is indirectly proportional to the effect of the new values on the cache. Alpha may be between 0 and 1 inclusive. A large value of Alpha indicates that new updates have a small effect on the cache and vice versa.




As noted above, either (or both) the input data (new_depth) or the output data (current_depth) may be exponentially smoothed at the points indicated (PreSmoothing or PostSmoothing, or both). Smoothing may provide greater stability by changing the jitter buffer depth less frequently. However, smoothing generally makes the average jitter buffer depth somewhat higher, so the decision to smooth the data is application dependent.




PreSmoothing is computed as follows:






smoothed_depth=depth_alpha*smoothed_depth+(1−depth_alpha)*new_depth






Then, the smoothed_depth value is used in place of the new_depth value. The depth_alpha value may vary from 0 to 1.




Postsmoothing is computed as follows:






jitter_depth=jitter_alpha*jitter_depth+(1−jitter_alpha)*current_depth






The actual jitter buffer depth is then set to the computed jitter depth in place of the current_depth value. The jitter_alpha value may vary from 0 to 1.



Claims
  • 1. A method, comprising:receiving data into a jitter buffer; and adjusting a depth of said jitter buffer by determining a parameter related to a measured size of the jitter buffer and increasing a size of the jitter buffer using a rise value, said rise value comprising a monotonically increasing value that accounts for the variance of past jitter.
  • 2. A method according to claim 1, said adjusting comprising analyzing jitter of a current transmission.
  • 3. A method, comprising:receiving data into a jitter buffer; and adjusting a depth of said jitter buffer by analyzing a jitter history; said adjusting comprising analyzing jitter of a current transmission and determining a current depth of said jitter buffer and a rise value, said rise value being a monotonically increasing variable and defining an amount by which said current depth may be increased, said rise and said current depth defining an overall jitter buffer depth.
  • 4. A method according to claim 3, said adjusting comprising determining an initial current depth and rise from a jitter cache prior to reception of said current transmission.
  • 5. A method according to claim 3, said adjusting comprising updating said jitter cache during said transmission.
  • 6. A telecommunications system, comprising:a packet network; and a jitter buffer system, said jitter buffer system including a jitter buffer, a jitter buffer cache, and a jitter buffer controller, said jitter buffer cache configured to store a history of network jitter, and said jitter buffer controller is configured to adjust a depth of said jitter buffer by analyzing said history and determining a parameter related to a measured size of the jitter buffer and increasing a size of the jitter buffer using a rise value, said rise value comprising a monotonically increasing value that accounts for the variance of past jitter.
  • 7. A telecommunications system according to claim 6, said jitter buffer controller further configured to analyze jitter of a current transmission.
  • 8. A telecommunications system, comprising:a packet network; and a jitter buffer system, said jitter buffer system including a jitter buffer, a jitter buffer cache, and a jitter buffer controller, said jitter buffer cache configured to store a history of network jitter, and said jitter buffer controller is configured to adjust a depth of said jitter buffer by analyzing said history; said jitter buffer controller configured to analyze jitter of a current transmission and determine a current depth of said jitter buffer and a rise value, said rise value being a monotonically increasing variable and defining an amount by which said current depth may be increased, said rise and said current depth defining an overall jitter buffer depth.
  • 9. A telecommunications system according to claim 8, said jitter buffer controller configured to determine an initial current depth and rise from a jitter cache prior to reception of said current transmission.
  • 10. A telecommunications system according to claim 9, said jitter buffer controller configured to update said jitter cache during said transmission.
  • 11. A device, comprising:a jitter buffer; a jitter buffer cache configured to store a history of jitter of transmitted data; and a jitter buffer controller configured to adjust a depth of said jitter buffer by determining a parameter related to a measured size of the jitter buffer and increasing a size of the jitter buffer using a rise value, said rise value comprising a monotonically increasing value that accounts for the variance of past jitter.
  • 12. A device according to claim 11, said jitter buffer controller further configured to analyze jitter of a current transmission.
  • 13. A device, comprising:a jitter buffer; a jitter buffer cache configured to store a history of jitter of transmitted data; and a jitter buffer controller configured to adjust a depth of said jitter buffer by analyzing said history; said jitter buffer controller configured to analyze jitter of a current transmission and determine a current depth of said jitter buffer and a rise value, said rise value being a monotonically increasing variable and defining an amount by which said current depth may be increased, said rise and said current depth defining an overall jitter buffer depth.
  • 14. A device according to claim 13, said jitter buffer controller configured to determine an initial current depth and rise from a jitter cache prior to reception of said current transmission.
  • 15. A device according to claim 14, said jitter buffer controller configured to update said jitter cache during said transmission.
  • 16. A method, comprising:receiving data into a jitter buffer; determining a depth value of said jitter buffer; and adjusting said depth value by performing an exponential smoothing on said depth value.
CROSS REFERENCE TO RELATED APPLICATIONS

This application is related to application Ser. No. 09/440,456, titled “Apparatus and Method for Adaptive Jitter Buffers”, filed concurrently herewith, and incorporated by reference in its entirety as if fully set forth herein.

US Referenced Citations (22)
Number Name Date Kind
4147895 Fenoglio Apr 1979 A
5146477 Cantoni et al. Sep 1992 A
5423020 Vojnovich Jun 1995 A
5528598 Ng Jun 1996 A
5534937 Zhu et al. Jul 1996 A
5550877 Waters Aug 1996 A
5621775 Etienne Apr 1997 A
5640388 Woodhead et al. Jun 1997 A
5652627 Allen Jul 1997 A
5757871 Furukawa et al. May 1998 A
5764298 Morrison Jun 1998 A
5778218 Gulick Jul 1998 A
5872789 Orleth et al. Feb 1999 A
5872823 Sutton Feb 1999 A
5881245 Thompson Mar 1999 A
5940479 Guy et al. Aug 1999 A
6097698 Yang et al. Aug 2000 A
6167465 Parvin et al. Dec 2000 A
6301251 Kim et al. Oct 2001 B1
6377931 Shlomot Apr 2002 B1
6452950 Ohlsson et al. Sep 2002 B1
6473432 Nishimura et al. Oct 2002 B1
Non-Patent Literature Citations (1)
Entry
Patent application, “Apparatus and Method for Adaptive Jitter Buffer Depletion”, filed Oct. 5, 1999, Ser. No. 09/412,876, Atty Dkt No. 99P7889.