The present invention relates generally to the field of Voice-Over-Internet-Protocol (VoIP). More particularly, the present invention provides means for adapting the bandwidth requirement of a real-time voice-communication to the available bandwidth of the underlying transport network.
In traditional circuit switched telephony, a continuous data “pipe” is provided through the Public Switched Telephone Network (PSTN) to guarantee the flow of the PCM voice data. Internet telephony on the other hand must overcome a variety of impairments to the regular and timely delivery of voice data packets to the far end. These impairments are inherent in current Internet architecture, which provides a best-effort delivery service without any guarantees regarding the delivery of voice packets. Additionally, the transport of the voice packets is constrained by the amount of bandwidth available in the network connection, the delay that the packet experiences and any packet loss or corruption that occurs. In general, the measure of the quality of a data network to transport voice data packets quickly and consistently is referred to as the network's quality of Service (QoS).
A variety of network conditions affect the QoS of a connection. The bandwidth (BW) is the measure of the number of bits per second that can flow through a network link at a given time. Available bandwidth is limited by both the inherent capacity of the underlying network as well as other traffic along that route. End-to-End bandwidth from sender to receiver (the “call path”) will be determined by the slowest link on the entire route. For example, a dialup connection to the ISP with and ideal bandwidth of 56 kilobits per second (kb/s) may be the slowest link for a user. However, the bandwidth actually available to a VoIP application on this link at a particular time will be lower if a larger file transfer is taking place at that time.
The bandwidth usage per channel is determined primarily by the compressor/decompressor (CODEC) used to digitize and compress voice data and its associated overhead. Table 1 lists the one-way bandwidth requirements of three popular CODECs and a Mean Opinion Score (MOS) based on the ITU-T recommendation for measuring voice quality (higher MOS values indicate better quality).
As illustrated in Table 1, CODECs such as G.723 and G.729 significantly reduce the data bandwidth required. There is, however a general tradeoff, between using a high compression CODEC (with its low bandwidth usage) and voice quality. The high compression CODECs typically have slightly reduced voice quality (as reflected in the MOS rating), and introduce additional delay due to the added computational effort. The highest bandwidth is required by the minimal compression G.711 CODEC, which is the standard toll quality CODEC.
Another factor in bandwidth usage is the overhead introduced by different IP layers. Most CODECs operate by collecting a block of voice samples and then compressing this block to produce a frame of coded voice. As this media frame is prepared for transport over IP, different protocol layers add their own headers to the data to be able to recreate the voice stream at the destination.
Protocol overhead can be reduced by including more than one media frame per datagram (or packet). This also reduces the number of packets sent per second and hence the bandwidth usage.
Delay along the voice transmission call path can significantly affect voice quality. If the delay is too large, for example greater than 400 ms (ITU-T recommendation), interactive communication will be impossible. Many factors contribute to delay in VoIP, the most important being the delay experienced by VoIP media packets on the network. Another source of delay is the CODEC used for processing voice. High compression CODECs introduce more delay than low compression CODECs.
VoIP media packets comprising a data stream may not experience the same delay. Some packets may be delayed more than others due to instantaneous network usage and congestion or as a result of traversing different routes through the network. This variance from the average delay is called jitter. Voice CODECS will produce poor voice output if the input packet stream is not delivered at the exact play-out time. A jitter buffer at the receiver can smooth out this variation but it adds some more delay. If the jitter is larger than what the buffer can handle, the jitter buffer may underflow or overflow resulting in packet loss.
QoS is also degraded by packet loss. The most common cause of packet loss on land-based networks is the overloading of a router queue along the transmission call path. In this case the router will discard packets. On land-based networks, packet loss is therefore a sign of network congestion. Packets can also be lost because of corruption. Internet routers are programmed to discard corrupted packets. Voice CODECS can generally cope with small random packet losses, by interpolating the lost data. Large packet loss ratio or burst packet loss can severely degrade voice quality. The exact limits vary by the CODEC used but generally, low compression CODECS are more tolerant to packet loss.
The lack of QoS guarantees on the Internet has been a major challenge in developing VoIP applications. IETF is working on a number of proposals to help guarantee the quality of service that time critical data such as VoIP services require, including:
These services are, however, not currently available on the present day Internet. VoIP applications on end systems are required to work around the hurdles presented to regular and timely data flow. The Internet offers a best effort delivery service. So long as sufficient bandwidth is available, VoIP traffic can flow smoothly with an acceptable QoS. If the bandwidth is constrained, the effects described above will result in degraded voice quality.
What would be desirable are means to allow VoIP applications to sense the current call path bandwidth and to adapt in real-time the transmission rate to utilize that bandwidth.
Embodiments of the present invention provide a real-time bandwidth monitor (RTBM) for VoIP applications to sense the available bandwidth between two endpoints of a VoIP communication (herein, a “call path”) and to adapt in real-time the transmission rate to utilize that bandwidth. If sufficient bandwidth is available, the RTBM selects a low compression, low latency CODEC to offer best possible voice quality to the user. If the bandwidth is constrained, the RTBM, instead of allowing the VoIP application to fail, degrades gracefully by switching to a high compression CODEC. On further bandwidth reduction, the RTBM increases the media frames per packet. Because the bandwidth reduction may be transitory, the RTBM constantly monitors the end-to-end available bandwidth so as to invoke the CODEC/frame per packet combination that provides the best QoS achievable over the current end-to-end available bandwidth.
It is therefore an aspect of the present invention to monitor current end-to-end available bandwidth in a VoIP communication using a real-time bandwidth monitor (RTBM) and to adapt in real-time the transmission rate of a VoIP application to utilize that bandwidth.
It is another aspect of the present invention that if the RTBM determines that sufficient bandwidth is available, to select a low compression, low latency CODEC to offer the best possible voice quality to the user.
It is still another aspect of the present invention that if the RTBM determines that bandwidth is limited, to switch to a high compression CODEC.
It is yet another aspect of the present invention that if the RTBM determines that the bandwidth is highly restricted, to increase the media frames per packet.
It is an aspect of the present invention to constantly monitor the call path available bandwidth so as to invoke the CODEC/frame per packet combination that provides the best QoS achievable over the current call path available bandwidth.
It is another aspect of the present invention to determine improvements in bandwidth for VoIP media communications by making specialized measurements via “probe packets” sent prior to media startup and during conversation “silence periods” so that no additional network bandwidth is consumed for making the measurement.
It is still another aspect of the present invention to provide a RTBM that is application independent and able to adjust the send rate automatically in a plug and play fashion.
These and other aspects of the present invention will become apparent from a review of the general and detailed descriptions that follow.
An embodiment of the present invention provides a method for adapting the transmission rate of media packets between endpoints in a voice over Internet protocol (VoIP) communication. A starting bandwidth measure at a starting endpoint is determined. A starting CODEC at the starting endpoint is selected based on the starting bandwidth measure. The starting CODEC is associated with a starting CODEC nominal data rate. An ending bandwidth measure at the ending endpoint is determined. An ending CODEC at the ending endpoint is selected based on the ending bandwidth measure. The ending CODEC is associated with an ending CODEC nominal data rate. The ending endpoint is informed of the starting CODEC nominal data rate. The starting endpoint is informed of the ending CODEC nominal data rate. A current CODEC comprising a data rate equal to the lower of the starting CODEC nominal data rate and the ending CODEC nominal data rate is selected and used at the starting and ending end points.
In another embodiment of the present invention, the starting bandwidth measure is determined by sending a starting probe packet from the starting endpoint to a network device. According to embodiments of the present invention, the network device is selected from the group consisting of a STUN server, a SIP server, and an echo server. The starting probe packet is echoed by the network device to the starting endpoint. The bandwidth of the path from the starting endpoint to the network device is then determined.
The starting CODEC is associated with a bandwidth range. A determination is made whether the starting bandwidth measure is within the bandwidth range. If so, the starting CODEC is selected.
In another embodiment of the present invention, a packet loss ratio of a media packet stream between the starting endpoint and the ending endpoint is obtained. A determination is made whether the packet loss ratio exceeds a maximum packet loss ratio associated with the current CODEC. If the packet loss ratio exceeds the maximum packet loss ratio, then a nominal in-use data rate of the current CODEC is determined. A determination is made whether the current CODEC is associated with an alternate nominal data rate that is lower than the nominal in-use data rate. If current CODEC is associated with an alternate nominal data rate that is lower than the in-use data rate, the alternate nominal data rate is substituted for the in-use nominal data rate.
If the current CODEC is not associated with an alternate nominal data rate that is lower than the in-use nominal data rate, a determination is made whether a current frames per packet measure is less than a maximum frames per packet measure associated with the current CODEC. If the current frames per packet measure is less than the maximum frames per packet measure associated with the current CODEC, then the frames per packet measure of the media packet is increased.
If the current frames per packet measure is greater than or equal to the maximum frames per packet, then a determination is made whether a substitute CODEC having a substitute nominal data rate that is lower than the nominal data rate of the current CODEC is available at the starting and ending endpoints. If the substitute CODEC is available at the starting and ending endpoints, then the substitute CODEC is used at the starting and ending endpoints.
Embodiments of the present invention provide a real-time bandwidth monitor (RTBM) for VoIP applications to sense the available bandwidth between two endpoints of a VoIP communication (herein, a “call path”) and to adapt in real-time the transmission rate to utilize that bandwidth. If sufficient bandwidth is available, the RTBM selects a low compression, low latency CODEC to offer best possible voice quality to the user. If the bandwidth is constrained, the RTBM, instead of allowing the VoIP application to fail, degrades gracefully by switching to a high compression CODEC. On further bandwidth reduction, the RTBM increases the media frames per packet. Because the bandwidth reduction may be transitory, the RTBM constantly monitors the end-to-end available bandwidth of the path so as to invoke the CODEC/frame per packet combination that provides the best QoS achievable over the current end-to-end available bandwidth.
When a call is placed from telephone 120 to telephone 150, the voice quality of the audio signal is affected by the CODEC used and the bandwidth of the network path between them. In an embodiment of the present invention, VoIP endpoint 100 and VoIP endpoint 130 each comprise an optimization database (115 and 145 respectively). Each entry in the database maps a range of bandwidth calculations to a set of pre-computed optimizations for CODEC and frames per packet.
In an embodiment of the present invention, optimization databases 115 and 145 list all usable CODEC and frames per packet combinations. For each CODEC and frame rate combination, optimization databases 115 and 145 further lists the minimum required bandwidth and the maximum tolerable packet loss ratio. The required bandwidth entries are pre-computed values. The maximum tolerable packet loss ratio is an experimentally determined quantity.
In order to establish a VoIP call, the endpoints will typically use a signaling protocol such as IETF's SIP or ITU-Ts H323. If a calling endpoint knows the address of a destination endpoint, the calling endpoint sends a setup request directly to the destination endpoint. If the calling endpoint only knows an alias or “telephone number,” the calling endpoint resolves the alias or telephone number into an IP address by using a directory service. Alternatively, the calling endpoint may forward the setup request to a proxy server that will perform the address resolution and forward the setup request to the destination end-point on behalf of the sender. Once the call setup negotiations are complete, the two endpoints exchange media using the RTP protocol, which provides all the necessary information to reassemble a media stream from packets. When the media session is in progress, each receiver uses RTCP to send feedback to the sender about the quality of the packet stream it is receiving.
In addition to these protocols VoIP devices may require to implement supplementary protocols to function properly. One such protocol is STUN that is used by an endpoint on a private LAN to determine an external routable IP address.
In an exemplary embodiment, during the time when the calling endpoint has sent a call setup request and the called endpoint has not yet responded with the final acknowledgment, the endpoints measure the bandwidth of the actual media path by bouncing probe packets off each other. Prior to this measurement, the two endpoints exchange media channel information. Both SIP and H.323 provide mechanisms for achieving this. Additionally, the two end points start echo servers on the same port as they intend to receive media on. When the above two conditions are met, both endpoints “ping” the peer and measure the path RTT, which can be used to calculate the available bandwidth. This gives a more accurate measure of the path bandwidth and can be used to fine-tune the frames per packet for the media stream.
In another embodiment of the present invention, the bandwidth is measured using a fixed number of probe packets. By way of illustration and not as a limitation, in an exemplary embodiment of the present invention, five packets of different sizes are used to determine the bandwidth. The Round Trip Time (RTT) for each packet size is measured is measured twice and then the minimum of the two is used. Using linear regressions, the slope of the line that fits a plot of RTT samples against packet size is determined using the following formula:
m=(n*sigmaXY−sigmaX*sigmaY)/(n*sigma(X^2)−(sigmaX)^2),
where Y=RTT, X=size of packet, n=number of samples, m=slope, and sigma is a summing function.
The slope m can be calculated as the samples are collected therefore there is no need to first collect all samples and then process them afterwards. The bandwidth is then calculated is as follows:
bandwidth=l/m
In this exemplary embodiment, when a call session is established, the calling VoIP endpoint presents its preferred CODEC to the called endpoint and the called endpoint presents its preferred CODEC to the calling endpoint. The CODEC associated with the lower nominal data rate is used by both endpoints for the media stream. For the most cases this is a good choice and the media path can easily provide the bandwidth required by the media stream.
RTP and RTCP protocols are used for the media exchange. The RTP protocol provides mechanisms for transporting the actual voice payload. The RTP header includes sequence number, timestamp and source identifier, this information is used to reconstruct the stream from the individual packets and to detect lost delayed or out of sequence packets. Each receiving endpoint collects information about the total number of lost packets and packet arrival jitter (variation in packet arrival times) and conveys this information back to the sending endpoint using RTCP protocol at regular intervals. The jitter buffer in each endpoint will smooth out jitter within a certain range and rearrange out of sequence packets. However, if a packet is delayed beyond the capability of the jitter buffer, it will be considered a lost packet. Similarly, a burst of packets that causes the jitter buffer to overflow will result in lost packets. According to the exemplary embodiment, each receiving end point also collects the number of packets lost due to jitter buffer overflow and underflow and passes this information to the sending endpoint through RTCP as jitter buffer packet loss.
The jitter packet loss provides a measure of network jitter and delay. Excessive packet loss is an indication of the fact that the media path is not able to support the bandwidth requirements of the media stream. If the packet loss ratio exceeds the acceptable packet loss ratio for the current CODEC configuration as established in the optimization databases (see,
In still another embodiment of the present invention, if action 1 or 2 above has been taken, the bandwidth is periodically measured during silence intervals to determine if the conditions are again suitable for restoring the previous CODEC configuration.
Systems and methods for dynamically adapting the transmission rate for real-time voice over IP communications to the available bandwidth have been disclosed. It will be understood by those skilled in the art that the present invention may be embodied in other specific forms without departing from the scope of the invention disclosed and that the examples and embodiments described herein are in all respects illustrative and not restrictive. Those skilled in the art of the present invention will recognize that other embodiments using the concepts described herein are also possible. Additionally, as will be appreciated by those skilled in the art, references to specific network protocols are illustrative and not limiting. Further, any reference to claim elements in the singular, for example, using the articles “a,” “an,” or “the” is not to be construed as limiting the element to the singular.
This application claims priority under 35 U.S.C. § 119(e) from provisional application No. 60/552,359 filed Mar. 11, 2004. The 60/552,359 application is incorporated by reference herein, in its entirety, for all purposes.
Number | Name | Date | Kind |
---|---|---|---|
5402481 | Waldman | Mar 1995 | A |
5809128 | McMullin | Sep 1998 | A |
5987103 | Martino | Nov 1999 | A |
6014440 | Melkild et al. | Jan 2000 | A |
6091732 | Alexander, Jr. et al. | Jul 2000 | A |
6104757 | Rhee | Aug 2000 | A |
6118768 | Bhatia et al. | Sep 2000 | A |
6125113 | Farris et al. | Sep 2000 | A |
6141345 | Goeddel et al. | Oct 2000 | A |
6185288 | Wong | Feb 2001 | B1 |
6256778 | Oliver | Jul 2001 | B1 |
6307853 | Storch et al. | Oct 2001 | B1 |
6351464 | Galvin et al. | Feb 2002 | B1 |
6359880 | Curry et al. | Mar 2002 | B1 |
6389005 | Cruickshank | May 2002 | B1 |
6389038 | Goldberg et al. | May 2002 | B1 |
6434139 | Liu et al. | Aug 2002 | B1 |
6445694 | Swartz | Sep 2002 | B1 |
6449251 | Awadallah et al. | Sep 2002 | B1 |
6496477 | Perkins et al. | Dec 2002 | B1 |
6542497 | Curry et al. | Apr 2003 | B1 |
6597686 | Smyk | Jul 2003 | B1 |
6603774 | Knappe et al. | Aug 2003 | B1 |
6618761 | Munger et al. | Sep 2003 | B2 |
6636504 | Albers et al. | Oct 2003 | B1 |
6658496 | Minakata et al. | Dec 2003 | B1 |
6700956 | Chang et al. | Mar 2004 | B2 |
6760324 | Scott et al. | Jul 2004 | B1 |
6763226 | McZeal, Jr. | Jul 2004 | B1 |
6771594 | Upadrasta | Aug 2004 | B1 |
6788769 | Waites | Sep 2004 | B1 |
6795540 | Mow | Sep 2004 | B1 |
6822957 | Schuster et al. | Nov 2004 | B1 |
6826174 | Erekson et al. | Nov 2004 | B1 |
6856612 | Bjelland et al. | Feb 2005 | B1 |
6895000 | Lai et al. | May 2005 | B2 |
6907031 | Ehlinger et al. | Jun 2005 | B1 |
6947417 | Laursen et al. | Sep 2005 | B2 |
6954454 | Schuster et al. | Oct 2005 | B1 |
7012888 | Schoeneberger et al. | Mar 2006 | B2 |
7016481 | McElvaney | Mar 2006 | B2 |
7046683 | Zhao | May 2006 | B1 |
7092380 | Chen et al. | Aug 2006 | B1 |
7113500 | Bollinger et al. | Sep 2006 | B1 |
7145900 | Nix et al. | Dec 2006 | B2 |
7212622 | Delaney et al. | May 2007 | B2 |
7213766 | Ryan et al. | May 2007 | B2 |
7283542 | Mitchell | Oct 2007 | B2 |
7302053 | Chang et al. | Nov 2007 | B2 |
20010038033 | Habib | Nov 2001 | A1 |
20020007273 | Chen | Jan 2002 | A1 |
20020052965 | Dowling | May 2002 | A1 |
20020097843 | Krol et al. | Jul 2002 | A1 |
20020131604 | Amine | Sep 2002 | A1 |
20020147912 | Shmueli et al. | Oct 2002 | A1 |
20020184376 | Sternagle | Dec 2002 | A1 |
20020191621 | Jha | Dec 2002 | A1 |
20020191768 | Stoughton | Dec 2002 | A1 |
20030002479 | Vortman et al. | Jan 2003 | A1 |
20030012137 | Abdelilah et al. | Jan 2003 | A1 |
20030023669 | DeLima et al. | Jan 2003 | A1 |
20030093606 | Mambakkam et al. | May 2003 | A1 |
20030110257 | Hyun et al. | Jun 2003 | A1 |
20030112820 | Beach | Jun 2003 | A1 |
20030123388 | Bradd | Jul 2003 | A1 |
20030135376 | Harada | Jul 2003 | A1 |
20030161453 | Veschi | Aug 2003 | A1 |
20030204619 | Bays | Oct 2003 | A1 |
20030214939 | Eldumiati et al. | Nov 2003 | A1 |
20030219006 | Har | Nov 2003 | A1 |
20030224780 | Rodman et al. | Dec 2003 | A1 |
20040019539 | Raman et al. | Jan 2004 | A1 |
20040032860 | Mundra et al. | Feb 2004 | A1 |
20040032862 | Schoeneberger et al. | Feb 2004 | A1 |
20040047451 | Barker et al. | Mar 2004 | A1 |
20040086093 | Schranz | May 2004 | A1 |
20040114581 | Hans et al. | Jun 2004 | A1 |
20040133668 | Nicholas, III | Jul 2004 | A1 |
20040141508 | Schoeneberger et al. | Jul 2004 | A1 |
20040141758 | El-Reedy | Jul 2004 | A1 |
20040165578 | Burritt et al. | Aug 2004 | A1 |
20040205023 | Hafer et al. | Oct 2004 | A1 |
20040205777 | Zalenski et al. | Oct 2004 | A1 |
20040218583 | Adan et al. | Nov 2004 | A1 |
20040223458 | Gentie | Nov 2004 | A1 |
20040248590 | Chan et al. | Dec 2004 | A1 |
20040258003 | Kokot et al. | Dec 2004 | A1 |
20050002506 | Bender et al. | Jan 2005 | A1 |
20050074031 | Sunstrum | Apr 2005 | A1 |
20050074122 | Fascenda | Apr 2005 | A1 |
20050089052 | Chen et al. | Apr 2005 | A1 |
20050091392 | Gesswein et al. | Apr 2005 | A1 |
20050094621 | Acharya et al. | May 2005 | A1 |
20050138183 | O'Rourke et al. | Jun 2005 | A1 |
20050180464 | McConnell et al. | Aug 2005 | A1 |
20050195799 | Burne et al. | Sep 2005 | A1 |
20050220083 | Takeuchi | Oct 2005 | A1 |
20050243733 | Crawford et al. | Nov 2005 | A1 |
20060008059 | Ying et al. | Jan 2006 | A1 |
20060029062 | Rao et al. | Feb 2006 | A1 |
20060029063 | Rao et al. | Feb 2006 | A1 |
20060031393 | Cooney et al. | Feb 2006 | A1 |
20060034296 | Talucci | Feb 2006 | A1 |
20060037071 | Rao et al. | Feb 2006 | A1 |
20060039356 | Rao et al. | Feb 2006 | A1 |
20060088025 | Barkley et al. | Apr 2006 | A1 |
20060208066 | Finn et al. | Sep 2006 | A1 |
20060276230 | McConnell | Dec 2006 | A1 |
20070248081 | Barkley et al. | Oct 2007 | A1 |
20080025291 | Barkley et al. | Jan 2008 | A1 |
Number | Date | Country | |
---|---|---|---|
20050201414 A1 | Sep 2005 | US |
Number | Date | Country | |
---|---|---|---|
60552359 | Mar 2004 | US |