The present invention relates generally to communication networks, and more particularly to load balancing and overload control techniques for use in Session Initiation Protocol (SIP)-based networks, such as IP Multimedia Subsystem (IMS) networks, and other types of communication networks.
Session Initiation Protocol (SIP) is rapidly becoming the de facto signaling protocol for establishing, modifying and terminating multimedia sessions between users in a communication network. SIP is described in J. Rosenberg et al., “SIP: Session Initiation Protocol,” Internet Engineering Task Force (IETF) RFC 3261, June 2002, which is incorporated by reference herein. SIP has also been adopted for the IP Multimedia Subsystem (IMS), which is the next-generation core network architecture for mobile and fixed services defined by the 3rd Generation Partnership Project (3GPP).
A network element that processes and forwards SIP messages is called a proxy server in SIP terminology, and a Call Session Control Function (CSCF) in IMS terminology. 3GPP defines three types of CSCF elements: Proxy CSCF (P-CSCF) which is the interface to the user, Interrogating CSCF (I-CSCF) which provides an interface to other servers in different administration domains, and Serving CSCF (S-CSCF) which handles registration, enforces policy and provides an interface to application servers. Such network elements are referred herein as SIP/IMS servers, and a signaling network comprising these and other network elements is referred to as a SIP-based network.
In order to achieve high levels of performance in a SIP-based network, it is important to distribute the traffic load evenly over the network elements. Unfortunately, conventional load balancing techniques are often not well suited for use in the SIP context, and may fail to provide the desired performance levels.
A related problem in SIP-based networks involves overload control. Like other network elements, a SIP/IMS server can become overloaded when traffic demand exceeds its available resources, for example, its available processing resources. Even with over-provisioning, overload may still occur for various reasons including temporary traffic surges due to “flash crowd” effect, node or link failures, poor routing, traffic diversion due to maintenance and denial-of-service attacks, etc.
A variety of techniques have been developed for addressing overload control in communication networks. These include, for example, overload control based on M/M/1 queuing systems, and overload control techniques developed for use in the Signaling System 7 (SS7) context.
Unfortunately, these and other conventional overload control techniques fail to address the quantitative impact of overload on SIP performance and fail to provide specific approaches for handling overload in SIP-based networks, which are often more complex in terms of messaging services and signaling topologies.
It is therefore apparent that a need exists for improved load balancing and overload control techniques, particularly in SIP-based networks.
The present invention in an illustrative embodiment provides improved techniques for load balancing and overload control in a SIP-based network or other type of communication network.
In accordance with one aspect of the invention, a load balancing technique is provided in which a first server receives feedback information from downstream servers of the network, the downstream servers including at least first and second downstream servers associated with respective first and second paths between the first server and a target server, the feedback information comprising congestion measures for the respective downstream servers. The congestion measures may be, for example, processor utilization measures, message processing loads, buffer occupancy measures, message processing delays, or any other type of information indicative of congestion, or combinations thereof. The feedback information may be transmitted from the downstream servers to the first server in one or more SIP 100 response messages, for example, encoded in an extension header. A message routing process in the first server is adjusted based on the received feedback information to compensate for imbalance among the congestions measures of the downstream servers. The adjustment in an illustrative embodiment is dynamic, so as to ensure that the message routing process keeps track of prevailing network conditions, thereby improving capacity utilization in the network.
The above-described receipt of feedback information and associated adjustment of the message routing process may be replicated at each server in a network. In other words, each of the servers may operate as the first server relative to other servers of the network.
The feedback information may comprise a highest congestion measure among the congestion measures of a plurality of servers in the first or second paths between the first server and the target server.
One of the first and second downstream servers may be the target server itself, or a nearest neighboring server of the first server.
The message routing process may be adjusted by, for example, adjusting routing information which specifies relative percentages of a given set of messages to be routed on the first and second paths. The routing information may comprise at least first and second routing probabilities for the respective first and second paths, stored in a routing table or other suitable data structure.
In accordance with another aspect of the invention, an overload control technique is provided in which a first server receives feedback information from at least one downstream server of the network, the downstream server being associated with a path between the first server and a target server, the feedback information comprising a congestion measure of the downstream server. The first server generates a blocking message for delivery to a user agent based on the feedback information.
The downstream server may be the target server itself, or a nearest neighboring server of the first server. The first server may be an ingress server of the network, or a core network server that is the nearest upstream neighbor of the downstream server.
Again, the operations associated with the first server above may be replicated at other servers of the network. Thus, the load balancing and overload control techniques can be implemented in a distributed manner, without the need for any centralized controller.
The load balancing and overload control techniques of the invention may be used alone or in combination. An illustrative embodiment of the invention combines both techniques to provide an enhanced communication protocol referred to herein as “Overload-Safe SIP” or OS-SIP. Advantageously, OS-SIP avoids a congestion collapse problem typically exhibited by conventional SIP, while also providing higher capacity and reduced ring delay and call setup time. Thus, OS-SIP delivers significant performance improvement and offers high-reliability service independent of traffic loads.
These and other features and advantages of the present invention will become more apparent from the accompanying drawings and the following detailed description.
The present invention will be illustrated below in conjunction with exemplary SIP-based networks and associated load balancing and overload control techniques. It should be understood, however, that the invention is not limited to use with the particular load balancing or overload control techniques of the illustrative embodiments, nor with any particular type of network or other communication network. The disclosed techniques are suitable for use with a wide variety of other systems and in numerous alternative applications.
SIP messages from a UAC to a UAS are called requests and those in the reverse direction are called responses. In this particular example, the first end user, corresponding to UAC 102, represents a caller who transmits a request (e.g., initiating a call), while the second end user, corresponding to UAS 108, is a callee who receives the request from the caller and responds accordingly. The request and response are shown by the respective solid line 110 and dashed line 112. As is apparent, a given request from a UAC to a UAS may traverse multiple servers whose main purpose is to route messages closer to the end user. A server may rely on a domain name system (DNS) to resolve an IP address from a SIP address which is similar to an email address.
The SIP protocol is structured into multiple layers. The bottom layer is the transport (TR) layer which currently may utilize User Datagram Protocol (UDP) or Transmission Control Protocol (TCP). The transaction layer, which is the heart of SIP, uses the service of the transport layer and reliably delivers messages from one SIP entity to another through an IP-based network, which as noted previously will typically include a multiplicity of servers not explicitly shown in the figure. In particular, the transaction layer provides message retransmissions, matches responses to requests and facilitates timeouts. The transaction layer comprises client transaction (CT) and server transaction (ST) portions. The client transaction receives requests from its upper layer, which is the transaction user or the core, and reliably transmits the requests to its peer server transaction. The client transaction relies on timers and retransmissions to ensure that messages are received by its peer. The server transaction receives requests from the transport layer and delivers them to its core. In addition, the server transaction also provides filtering of retransmissions by transmitting appropriate responses to its peer client transaction. The interaction between the client and server transactions is governed by a set of finite-state machines (FSMs).
In the SIP-based network 100, there are two types of servers, namely, stateless server 104 and stateful server 106. A stateless server does not contain a transaction layer. Its function is merely to forward messages to the next hop. A stateful server, on the other hand, terminates a transaction layer and thus can also generate additional messages. For example, upon receiving a request from its upstream neighbor, a stateful server may generate multiple requests to multiple destinations, a technique known as “forking,” in order to determine an appropriate location at which to contact the end user.
When UA A initiates the call to UA B, UA A typically sends an INVITE request containing UA B's SIP address to the outbound server (Server A) that serves UA A's domain. The INVITE request also contains other pertinent information needed by SIP, as well as additional information such as media and codec types needed for the bearer session. Upon receiving the INVITE request, Server A possibly performs a DNS query (not shown) to locate the inbound server (Server B) that serves UA B. Server A then forwards the INVITE request to Server B. In addition, Server A sends a 100 Trying response to UA A to indicate that INVITE processing is in progress.
Assume that the INVITE request is lost because Server B is congested. If the transport layer is unreliable (e.g., UDP), the transaction layer at Server A would detect the loss from the absence of 100 Trying, and retransmit the INVITE request. Eventually, when the INVITE request reaches the destination, UA B responds with a 180 Ringing response. If the callee decides to answer the call, a 200 OK response is sent to the caller, which may confirm the 200 OK response by returning an ACK. At this point, the bearer channel is established and communication or other data transfer between the caller and callee can begin. At the end of the session, either party can terminate the session by sending a BYE request. In this example, UA A terminates the session by sending a BYE request that is acknowledged by a 200 OK response from UA B.
A congestion collapse problem that can arise when SIP-based networks become overloaded will now be described with reference to
SIP uses various timers, denoted A through K, to ensure reliable delivery of messages. When a server is congested, the timers may trigger more retransmissions which may cause more congestion.
The present invention provides techniques which avoid the congestion collapse problem illustrated in
A number of exemplary overload control algorithms suitable for use in conjunction with the present invention will now be described. For purposes of illustration, the algorithms are described as operating at a single server, rather than over a network of servers. Conventional aspects of the first two of these algorithms, known as the occupancy algorithm (OCC) and the acceptance rate algorithm, are respectively described in U.S. Pat. No. 4,974,256, issued Nov. 27, 1990 in the name of B. L. Cyr et al. and entitled “Load balancing and overload control in a distributed processing telecommunication system,” and S. Kasera et al., “Fast and robust signaling overload control,” International Conference on Network Protocols, 2001. However, such algorithms have not heretofore been adapted for use in the SIP context. The final overload control algorithm to be described is an improved version of the acceptance rate algorithm that we have determined is particularly well suited for providing overload control in SIP-based networks. It should be understood that embodiments of the invention may utilize the occupancy algorithm, the acceptance rate algorithm, the improved acceptance rate algorithm, or another overload control algorithm.
In the occupancy algorithm, incoming calls to a server are controlled by a variable f which denotes the fraction of calls that are accepted. Thus a new call is accepted with probability f or, equivalently, blocked with probability 1−f. In applying this algorithm to the SIP context, INVITE requests may be accepted with probability f, while other messages are always accepted as long as the message buffer in the server is not full. Based on current system overload conditions, the objective of the occupancy algorithm is to dynamically adjust f to maintain high call throughput. The overload condition is based on processor utilization, ρ, which is periodically probed at every τ seconds. In each n-th probed epoch, the average processor utilization {tilde over (ρ)}(n) is updated and compared with a target utilization ρt arg. The average utilization can be computed as a moving average (MA) over the previous k epochs
or by exponentially weighted moving average (EWMA)
The basic idea of the occupancy algorithm is to increase f if {tilde over (ρ)}<ρt arg, and to decrease it otherwise. Let f(n) denote the newly updated f in the current epoch n, while f(n−1) denote f updated in epoch n−1. The algorithm that updates fin each epoch is described as follows.
where fmin represents the threshold for the minimum fraction of traffic accepted. The multiplicative factor φ is given by
φ=min{ρt arg/{tilde over (ρ)},φmax},
where φmax defines the maximum possible multiplicative increase in f from one epoch to the next.
In the above-cited S. Kasera et al. reference, it is argued that because ρ cannot exceed 1, the occupancy algorithm cannot decrease f by more than 10% when the system is overloaded, and thus the algorithm may react too slowly under sudden traffic surge. The basic idea of the acceptance rate algorithm is to use {tilde over (α)} in place of {tilde over (ρ)}, where {tilde over (α)} represents the average call acceptance rate into the system. The target acceptance rate αt arg can be set to αt arg=μρt arg, where μ is the system call-carrying capacity, which can be estimated by μ={tilde over (α)}/{tilde over (ρ)}. It is suggested that αt arg is updated by a EWMA with a smoother average than that for {tilde over (α)}. The acceptance rate algorithm uses the following multiplicative factor:
φ=αTARG/{tilde over (α)}
We have recognized that conventional implementations of the occupancy algorithm and the acceptance rate algorithm are problematic in that they do not take into account unfinished work in the system. In particular, if {tilde over (α)}=αt arg, then f(n)=f(n−1) independent of the message queue content. Instead, when {tilde over (α)}=αt arg, we want to decrease f(n) if the queue content is too high and increase f(n) if the queue content is too low. A second observation is that the above algorithms tend to increase f(n) more than to decrease it for the same amount of differences (positive or negative) between the variable to be compared with the target parameter. Hence we modify φ for the improved acceptance algorithm as follows.
where q is the average queue length, in number of messages, updated using EWMA at each message arrival, qt arg is the queue target, and N is the average number of messages per call. The updating of average queue length at each message arrival may be viewed as a type of event-driven updating. Other examples of such event-driven updating are described in S. Floyd et al., “Random early detection gateways for congestion avoidance,” IEEE Transactions on Networking, Vol. 1, No. 4, pp. 397-413, August 1993.
To evaluate the performance of the preceding overload control algorithms in a SIP environment, one may simulate a server that implements the full transaction layer of SIP, such as the stateful server 106 of
In evaluating the performance of a SIP-based network having the topology shown in
We will now describe a number of overload control techniques for use in a SIP-based network or other type of network in an illustrative embodiment of the invention.
There are a number of approaches that may be used to notify overload using otherwise conventional SIP messages. One approach is to provide notification of an overloaded server by sending a 503 Service Unavailable response from the overloaded server to its upstream neighbor server. This response can state, via a Retry-After header field, an amount of time for which the overloaded server will be unavailable. Upon receipt of this message, the upstream neighbor server will not transmit any other requests to the overloaded server, regardless of the destination of the requests, for the given duration. The upstream neighbor server, however, can still transmit responses to the overloaded server. We found this mechanism to react poorly to overload since the 503 response typically causes a large volume of traffic to be diverted to other alternate servers, which in turn results in overload elsewhere. If other servers also implement the same mechanism, it is likely that overload will oscillate from one server to another.
Another message that can be used to notify overload is 500 Server Internal Error. Unlike the 503 response which is global in nature, the 500 response is only applicable locally for a given call. To control overload, the 500 response is most effectively applied in response to an INVITE request to reject a new call.
An alternate approach is not to explicitly send a notification message to indicate an overload, but to simply drop INVITE requests to block new calls. This approach in general may not work well since it may cause a large number of retransmissions.
Another important issue is with respect to the location of the server that initiates the overload notification.
The simplest approach, referred to herein as local overload control, is for each overloaded server to initiate the notification autonomously. An example is shown in
Another approach, called ingress overload control, is to propagate upstream the overload status information for each target, for example, via a new header in the 100 Trying response. Each server forwarding this information will compare its own overload status value with the received downstream overload status value and propagate the maximum value of the two overload status values upstream. For a given target, an ingress server decides to accept or block a new call based on the overload status information. An example is shown in
A third approach intermediate between the previous two is called penultimate overload control. Here the server previous to the overloaded server is the one that blocks new calls. With reference again to
As noted above, illustrative embodiments of the invention may incorporate both load balancing and overload control techniques. Exemplary load balancing techniques will now be described in greater detail with reference to
Referring now to
In the next-hop load balancing approach, each server independently and dynamically adjusts the routing probabilities to its downstream neighbors based on congestion feedback information received from those neighbors. For example, as illustrated in
Although the congestion measures in this example are utilization measures, a wide variety of other types of congestion measures may be used. The term “congestion measure” as used herein is therefore intended to be construed generally, so as to encompass, for example, processor utilization measures, message processing loads, buffer occupancy measures, message processing delays, or any other type of information indicative of congestion, as well as combinations of such measures or information.
From the feedback information received from S3 and S4, S1 adjusts its routing probabilities with the objective of equalizing the congestion measures at S3 and S4. Such an adjustment in routing probabilities is shown in
Another load balancing approach that may be utilized in a given embodiment of the invention is referred to herein as target-based load balancing.
The difference between the next-hop and target-based load balancing techniques described above is illustrated in
In the next-hop load balancing approach, because the 0.5 utilization value of server 903 is higher than the 0.2 utilization value of server 901, the routing probabilities q1 and q2 at server S will be adjusted such that the routing probability q1 will be increased while the routing probability q2 will be decreased until the utilization values at server 901 and server 903 become substantially equal, that is, load balanced.
In the target-based load balancing approach, the highest utilization values in the first and second paths are the 0.6 utilization value of server 902 and the 0.5 utilization value of server 903, respectively. Since the highest utilization value of the first path is higher than the highest utilization value of the second path, the routing probabilities q1 and q2 at server S will be adjusted such that the routing probability q2 will be increased while the routing probability q1 will be decreased until a load balanced condition results. It can be seen that the two approaches may produce different routing probability results for the same set of server utilization values. Although next-hop load balancing may not perform as well as target-based load balancing under certain conditions, next-hop load balancing is simpler to implement than target-based load balancing.
In accordance with the target-based load balancing approach described previously, the highest utilization of the upper paths between node a and node z is in the range 0.4 to 0.5, while the highest utilization of the lower path between node a and node z is 0.3. This feedback information is propagated back through the network to node a, where is it stored in routing table 1000. Various approaches may be used to specify a single value of congestion measure for the upper paths. A simple approach is to take the worst case value of 0.5 for the upper path congestion measure. The routing table is considerably simplified for clarity of illustration, but generally includes columns for the target node, the via node indicative of a particular path to the target, the highest utilization for the particular path, and the routing probability. Of course, numerous alternative routing table formats may be used in implementing the invention.
One possible example of a distributed load balancing algorithm that may be used in implementing the next-hop or target-based load balancing approaches described above, within a given server denoted server i, is as follows:
Let xij(d)=fraction of traffic from i via next hop j (destined to target d)
Let uij(d)=“smoothed” utilization via j (to target d) observed by server i
At each update,
compute Δxij(d)=αxij(d)(Ui−uij(d)),
where Ui=Σjxij(d)uij(d)
New traffic assignments are then given by:
Xij(d)=max(0,xij(d)+Δxij(d)),
xij(d)=Xij(d)/ΣjXij(d).
In this example, the xij(d) values correspond generally to the routing probabilities described previously. The algorithm may be executed at each server periodically, for example, every T seconds. Other suitable algorithms for implementing the next-hop or target-based load balancing approaches described herein will be apparent to those skilled in the art.
As mentioned above, the previously-described local, penultimate and ingress overload control techniques will now be illustrated with reference to
Referring initially to
Also illustrated in
As is apparent from the foregoing description, the illustrative embodiments described in conjunction with the examples of
Referring now to
With reference to
Again, it is to be appreciated that the particular parameters, assumptions, network topologies and other features of the illustrative embodiments described above are presented by way of example only. Although particularly useful with SIP-based networks, such as IMS networks, the techniques described herein can be applied to a wide variety of other types of communication networks, using any of a number of different communication protocols. These and numerous other alternative embodiments within the scope of the appended claims will be readily apparent to those skilled in the art.
The present application is a continuation of U.S. patent application Ser. No. 11/395,455, filed Mar. 31, 2006, the disclosure of which is hereby incorporated by reference herein.
Number | Name | Date | Kind |
---|---|---|---|
4914650 | Sriram | Apr 1990 | A |
4974256 | Cyr | Nov 1990 | A |
5727051 | Holender | Mar 1998 | A |
6134218 | Holden | Oct 2000 | A |
6311065 | Ushiki et al. | Oct 2001 | B1 |
6469991 | Chuah | Oct 2002 | B1 |
6496811 | Bodnar | Dec 2002 | B1 |
6507562 | Kadansky | Jan 2003 | B1 |
6542462 | Sohraby et al. | Apr 2003 | B1 |
6578068 | Bowman-Amuah | Jun 2003 | B1 |
6618378 | Giroux et al. | Sep 2003 | B1 |
6680943 | Gibson | Jan 2004 | B1 |
6778496 | Meempat et al. | Aug 2004 | B1 |
6888797 | Cao et al. | May 2005 | B1 |
6904017 | Meempat et al. | Jun 2005 | B1 |
7050424 | Cheng | May 2006 | B2 |
7069342 | Biederman | Jun 2006 | B1 |
7170905 | Baum | Jan 2007 | B1 |
7225271 | DiBiasio | May 2007 | B1 |
7277423 | Welch | Oct 2007 | B1 |
7305477 | Gbadegesin | Dec 2007 | B2 |
7319667 | Biederman | Jan 2008 | B1 |
7409441 | Kake et al. | Aug 2008 | B2 |
7447194 | Schlesener | Nov 2008 | B1 |
7483374 | Nilakantan | Jan 2009 | B2 |
7509229 | Wen | Mar 2009 | B1 |
7555560 | Hassan | Jun 2009 | B2 |
7689223 | Lewis | Mar 2010 | B1 |
7864939 | Burg | Jan 2011 | B1 |
7961715 | Dhesikan | Jun 2011 | B1 |
9124465 | Findlay | Sep 2015 | B1 |
20010014847 | Keenan | Aug 2001 | A1 |
20010025310 | Krishnamurthy | Sep 2001 | A1 |
20020035642 | Clarke | Mar 2002 | A1 |
20020120729 | Faccin et al. | Aug 2002 | A1 |
20020186657 | Jain et al. | Dec 2002 | A1 |
20030043742 | De Maria | Mar 2003 | A1 |
20030093462 | Koskelainen | May 2003 | A1 |
20030123432 | Cheng | Jul 2003 | A1 |
20030174648 | Wang | Sep 2003 | A1 |
20030198183 | Henriques et al. | Oct 2003 | A1 |
20040148423 | Key | Jul 2004 | A1 |
20040152469 | Yla-Outinen | Aug 2004 | A1 |
20040205190 | Chong | Oct 2004 | A1 |
20050003824 | Siris | Jan 2005 | A1 |
20050015492 | Kumbalimutt | Jan 2005 | A1 |
20050055436 | Yamada et al. | Mar 2005 | A1 |
20050117576 | McDysan | Jun 2005 | A1 |
20050163126 | Bugenhagen et al. | Jul 2005 | A1 |
20050220095 | Narayanan | Oct 2005 | A1 |
20060002312 | Delattre | Jan 2006 | A1 |
20060050640 | Jin et al. | Mar 2006 | A1 |
20060198309 | Cortes | Sep 2006 | A1 |
20070025301 | Petersson | Feb 2007 | A1 |
20070037581 | Morita | Feb 2007 | A1 |
20070121673 | Hammer | May 2007 | A1 |
20070153813 | Terpstra | Jul 2007 | A1 |
20070180113 | Van Bemmel | Aug 2007 | A1 |
20080114850 | Skog et al. | May 2008 | A1 |
Number | Date | Country |
---|---|---|
PCTUS2007008244 | Aug 2007 | WO |
Entry |
---|
Blake et al., “An Architecture for Differentiated Services”, 1998. |
Braden et al., “Recommendations on Queue Management and Congestion Avoidance in the Internet”, RFC 2309, 1998. |
Braden et al., “Resource ReSerVation Protocol (RSVP)—Version 1 Functional Specification”, RFC 2205, 1997. |
Floyd, “Congestion Control Principles”, RFC 2914, 2000. |
Geneiatakis et al., “Survey of Security Vulnerabilities in Session Initiation Protocol”, 2006. |
Jacobson et al., “Congestion Avoidance and Control”, 1988. |
Nagle, “Congestion Control in IP/TCP Internetworks”, RFC 896, 1984. |
Ohta, “Overload Protection in a SIP Signaling Network”, 2006. |
Ramakrishnan et al., “A Proposal to add Explicit Congestion Notification (ECN) to IP”, 1999. |
Rosenberg et al., “SIP: Session Initiation Protocol”, RFC 3261, 2002. |
Shen et al., “Session Initiation Protocol (SIP) Server Overload Control: Design and Evaluation”, 2008. |
Shen et al., “On TCP-based SIP Server Overload Control”, 2010. |
Turanyi et al., “Load Control: Congestion Notifications for Real-time Traffic”, 2001. |
Wang, “SIP Overload Control: A Backpressure-based Approach”, 2008. |
Welzl et al., “Congestion Control in the RFC Series”, RFC 5783, 2010. |
Wijnen et al., “View-based Access Control Model (VACM) for the Simple Network Management Protocol (SNMP)”, RFC 2275, 1998. |
Yang et al., “An Optimized Algorithm for Overload Control of SIP signaling Network”, 2009. |
Ejzak et al., “Network Overload and Congestion: A Comparison of ISUP and SIP”, 2004. |
Geng et al., “A SIP Message Overload Transfer Scheme”, 2006. |
Handley et al., “SIP: Session Initiation Protocol”, RFC 2543, 1999. |
Hilt et al., “Controlling Overload in Networks of SIP Servers”, 2008. |
Hilt et al., “Design Considerations for Session Initiation Protocol (SIP) Overload Control”, RFC 6357, 2011. |
Hilt et al., “Session Initiation Protocol (SIP) Overload Control”, 2010. |
Hong et al., “A Comparative Study of SIP Overlaod Control Algorithms”, 2012. |
Rosenberg, “Requirements for Management of Overload in the Session Initiation Protocol”, RFC5390, 2008. |
Shen et al., “On TCP-based SIP Server Overload Control”, 2009. |
Camarillo et al., “Integration of Resource Management and Session Initiation Protocol (SIP)”, RFC3312, 2002. |
Carmarillo et al., “Integrated Services Digital Network (ISDN) User Part (ISUP) to Session Initiation Protocol (SIP) Mapping”, RFC3398, 2002. |
Cisco, “SIP Troubleshooting: SIP Calls Receives 500 Internal Server Error ‘Routing Failed’ Event”, 1992-2015. |
Handley et al., “SIP: Session Initiation Protocol”, RFC2543, 1999. |
Mosavat et al., “SIP Extensions for the eXtensible Service Protocol”, 2003. |
Nagle, “Congestion Control in IP/TCP Internetworks”, RFC896, 1984. |
Walker et al., “Multiserivce Switching Forum Implementation Agreement for SIP interface between Call Agent and Service Broker”, MSF-IA-SIP .005-FINAL, 2004. |
Wikipedia, “List of SIP response codes”, 2017. |
Katrinis et al., “Dynamic Adaptation of Source Specific Distribution Trees for Multiparty Teleconferencing,” Proceedings of the ACM Conference on Emerging Network Experiment and Technology (CoNEXT), Oct. 2005, pp. 156-165, Toulouse, France. |
S. Yin et al., “MALB: MANET Adaptive Load Balancing,” IEEE 60th Vehicular Technology Conference (VTC), Sep. 2004, pp. 2843-2847, vol. 4. |
Joseph Y. Hui, “A Congestion Measure for Call Admission and Traffic Engineering for Multi-Layer Multi-Rate Traffic,” International Journal of Digital and Analog Communication Systems, Apr. 1990, pp. 127-135, vol. 3, No. 2. |
M. Talla et al., “Global Congestion Control in Hybrid ATM/TDMA Networks,” Communications: The Key to Global Prosperity, Global Telecommunications Conference (GLOBECOM), Nov. 1996, pp. 147-151, vol. 1, London, United Kingdom. |
S. Kasera et al., “Robust Multiclass Signaling Overload Control,” Proceedings of the 13th IEEE International Conference on Network Protocols (ICNP), Nov. 2005, pp. 246-258. |
J. Rosenberg et al., “SIP: Session Initiation Protocol,” Network Working Group, Request for Comments: 3261, Jun. 2002, 269 pages. |
J. Rosenberg et al., “Session Initiation Protocol (SIP): Locating SIP Servers,” Network Working Group, Request for Comments 3263, Jun. 2002, 17 pages. |
B.T. Doshi et al., “Overload Performance of Several Processor Queueing Disciplines for the M/M/1 Queue,” IEEE Transactions on Communications, Jun. 1986, pp. 538-546, vol. COM-34, No. 6. |
R. Radhakrishna Pillai, “A Distributed Overload Control Algorithm for Delay-Bounded Call Setup,” IEEE/ACM Transactions on Networking, Dec. 2001, pp. 780-789, vol. 9, No. 6. |
D.R. Manfield et al., “Performance Analysis of SS7 Congestion Controls Under Sustained Overload,” IEEE Journal on Selected Areas in Communications, Apr. 1994, pp. 405-414, vol. 12, No. 3. |
Michael Rumsewicz, “On the Efficacy of Using the Transfer-Controlled Procedure During Periods of STP Processor Overload in SS7 Networks,” IEEE Journal on Selected Areas in Communications, Apr. 1994, pp. 415-423, vol. 12, No. 3. |
R.P. Ejzak et al., “Network Overload and Congestion: A Comparison of ISUP and SIP,” Bell Labs Technical Journal, Sep. 2004, pp. 173-182, vol. 8, No. 3. |
S. Blake et al., “An Architecture for Differentiated Services,” Network Working Group, Request for Comments: 2475, Dec. 1998, 36 pages. |
John Nagle, “Congestion Control in IP/TCP Internetworks,” Network Working Group, Request for Comments: 896, Jan. 1984, 9 pages. |
S. Kasera et al., “Fast and Robust Signaling Overload Control,” 9th International Conference on Network Protocols, Nov. 2001, pp. 323-331. |
S. Floyd et al., “Random Early Detection Gateways for Congestion Avoidance,” IEEE/ACM Transactions on Networking (TON), Aug. 1993, pp. 397-413, vol. 1, No. 4. |
Number | Date | Country | |
---|---|---|---|
20160065475 A1 | Mar 2016 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 11395455 | Mar 2006 | US |
Child | 14939587 | US |