The described technology relates generally to data communications networks and, more particularly, to sustaining session connections.
Users of computing devices sometimes use their computing devices to communicate with other users. As an example, a user may communicate with another user using a computing device by sending or receiving typed messages, such as by using the MICROSOFT MESSENGER (“MESSENGER”) instant messaging product. As another example, a user may communicate with another user by speaking into a microphone and hearing the other user on a speaker or headset, such as by using a Voice over Internet Protocol (“VoIP”) application. These users may use a variety of computing devices such as personal computers, personal digital assistants, cellular telephones, VoIP-enabled telephones, etc.
Applications providing these types of communications services to users may also need to provide presence information. Presence information conveys an ability or willingness of a user to communicate using a computing device. Presence information can be detected automatically by computing devices or actively selected by users. A computing device may automatically observe and communicate presence information, such as a user's “status.” As an example, when a user is not logged into any computing device or is not using (or logged into) an application that observes and conveys presence information, the user's status may be automatically indicated as “off-line.” When the user starts or logs into an application that observes and conveys presence information, such as MESSENGER, the user's status may be automatically indicated as “on-line.” When the user performs no actions on the computing device for a period of time, the application may automatically indicate that the user is “away.” Users can also actively select their status. As examples, users may indicate that they are “out for lunch” or “busy,” which could be indications that they may not immediately respond to messages from other users.
Multiple computing devices may register this presence information with a registration server computing device so that a “watcher,” which is an application desiring to determine a user's status, can determine meaningful presence information by querying the registration server or subscribing to automatically receive presence information. As an example, MESSENGER may indicate to the registration server that the user has not performed any actions on the computing device for a period of time, and so the user's presence information could be “away.” A VoIP-enabled phone may indicate to the registration server that the user has placed a phone call, and so the user's presence information could be “on the phone.” When a watcher receives this presence information from the registration server, it may determine that the user's status is “on the phone and unable to respond to messages on the computing device.” If, however, the user begins to perform actions on the computing device, the watcher may determine that the user's status is “on the phone, but able to respond to messages on the computing device.” If the user actively indicates on the computing device that the user's status is “away” and uses a VoIP-enabled phone, then the watcher may determine that the user's status is “roaming.”
Communications applications sometimes need to establish and manage sessions between computing devices. A session is a set of interactions between computing devices that occurs over a period of time. As an example, real-time communications applications such as MESSENGER or VoIP establish sessions between communicating devices on behalf of users. These applications may use various mechanisms to establish sessions, such as a “Session Initiation Protocol” (“SIP”). SIP is an application-layer control protocol that computing devices can use to discover one another and to establish, modify, and terminate sessions between computing devices.
Applications may employ SIP with another protocol to send or receive information. By using SIP with other protocols, applications can create and manage a session and exchange information during the session. The protocol employed with SIP to exchange information may segment the information into messages. As an example, a VoIP application may segment a long narration into shorter messages. Exchanging messages during a session is referred to as a “dialog.” SIP may use lower-level communications layers to transport a dialog's messages, such as Transmission Control Protocol/Internet Protocol (“TCP/IP”), which are commonly employed transport- and network-layer protocols.
Transmission Control Protocol (“TCP”) is a connection-oriented, reliable delivery transport layer protocol. TCP is typically described as a transport layer that provides an interface between an application layer (e.g., an application using SIP) and a network layer. The application layer generally communicates with the TCP layer by sending or receiving a stream of data (e.g., a number of bytes of data). TCP organizes this datastream into segments that can be carried by the protocol employed at the network layer, e.g., the Internet Protocol (“IP”). These segments of data are commonly referred to as “packets,” “frames,” or “messages.” Each message generally comprises a header and payload. The header comprises data necessary for routing and interpreting the message. The payload comprises the actual data that is being sent or received. The application, transport, and network layers, together with other layers, are jointly referred to as a data communications stack.
Messages in a connection may transit intermediary computing devices. When a transport or network layer of a sending computing device's data communications stack establishes a connection with a corresponding layer of a data communications stack in a receiving computing device, messages exchanged by the computing devices may transit several intermediary computing devices. As examples, messages may transit proxy servers, network address translators, or gateways. These intermediary computing devices receive, process, and forward messages. As an example, a proxy server may be an intermediary between computing devices connected to an intranet and computing devices connected to the Internet. This proxy server may attempt to make the intranet more secure by, e.g., ignoring connection requests from outside the intranet.
Intermediary computing devices may terminate idle connections between two computing devices to use network resources efficiently. As an example, a MESSENGER or VoIP application executing on a user's computing device connected to an intranet (“computing device A”) may establish a SIP-based session with an application executing on another computing device connected to the Internet (“computing device B”). This session may use a TCP/IP connection between computing devices A and B, and messages exchanged by the computing devices may transit a proxy server. When no messages are exchanged in the connection for a period of time, the proxy server may close the connection between it and computing device B to conserve network resources. However, computing device A may be unaware that the connection is closed because its connection to the proxy server remains open. As a result, the application executing on computing device B may mistakenly assume that the application (or user) of computing device A is no longer online or using the connection. Moreover, to make the intranet to which the proxy server is connected more secure, the proxy server may ignore connection requests from devices not connected to the intranet, such as computing devices connected to the Internet. Consequently, the application executing on computing device B, which is connected to the Internet, may be unable to query the application executing on computing device A for its status or request to reestablish a session.
Intermediary computing devices may close connections even when computing devices use a “keep-alive” mechanism. Some computing devices employ a keep-alive mechanism to keep a connection alive despite a lack of messages. This mechanism involves sending “keep-alive” messages to indicate that the sender has not closed the connection. However, efficient intermediary computing devices may recognize keep-alive messages and, while keeping alive connections between the sending computing devices and the intermediary computing device, may terminate the connection to the recipient indicated in the keep-alive message (e.g., to computing devices connected to the Internet).
A connection may be sustained by sending a valid message of an application layer. As an example, a SIP application employing TCP in a transport layer may periodically send a valid SIP message, such as a REGISTER message. A REGISTER message may be used to enable a SIP server to associate a Uniform Resource Identifier (“URI”) of the sender of the message with the computing device used by the sender. When the valid SIP message cannot be sent or received, the SIP application may detect that its TCP connection is unavailable. A problem with sending REGISTER, or indeed any valid SIP message containing valid data, is that doing so could be computationally intensive when multiple clients and servers need to do so to sustain connections. In the case of REGISTER messages, a server receiving such a message may need to parse the message to determine the URI and the identity of the sender's computing device, and may further need to store the data in a database that is possibly on another server.
Thus, an effective approach to sustaining session connections that does not rely on TCP keep-alives or extensive computational processing of messages would have significant utility.
SIP is an Internet proposed standard. Its specification, “RFC 3261,” is available at <http://www.ieff.org/rfc/rfc3261.txt>. A specification for extensions to SIP relating to event notifications, “RFC 3265,” is available at <http://www.ieff.org/rfc/rfc3265.txt>. A specification relating to presence information in instant messaging systems, “RFC 2778,” is available at <http://www.ieff.org/rfc/rfc2778.txt>. A draft of a proposed specification relating to presence information in SIP is available at <http://www.ieff.org/internet-drafts/draft-ieff-simple-presence-10.txt>. All four of these specifications are incorporated herein in their entirety by reference.
In an embodiment, techniques for sustaining session connections are provided. The techniques send heartbeat messages when not sending a message may cause the session connection to close because of a timeout condition. Heartbeat messages are valid transport layer messages that will be ignored by protocols at higher levels of a data communications stack. As an example, the techniques may send a TCP message containing only a carriage return and line feed (“CRLF”) in its payload. Because the TCP layer considers a message containing only a CRLF to be a valid TCP message, intermediary computing devices such as proxy servers may not interpret heartbeat messages as “keep alive” messages, and may sustain session connections.
In an embodiment, techniques for sustaining session connections are provided. The techniques send heartbeat messages when not sending a message may cause the session connection to close because of a timeout condition. Heartbeat messages are valid transport layer messages containing a non-empty payload that will be ignored by protocols at higher levels of a data communications stack. As an example, the techniques may send a TCP message containing only a carriage return and line feed (“CRLF”) in its payload. Because the TCP layer considers a message containing only a CRLF to be a valid TCP message, intermediary computing devices such as proxy servers may reset a message clock relating to the connection in which the message was sent or received. The message clock indicates an amount of time that has elapsed since a message was last sent or received in the connection. When the message clock of a connection indicates that a threshold amount of time has elapsed (e.g., indicating that a timeout condition has occurred), an intermediary computing device may close the connection. Because the message clock is reset when a message is sent or received, the connection may not be closed unless another message is not sent or received before the threshold amount of time elapses. When the TCP layer of the recipient computing device forwards the received heartbeat message to a higher layer of its data communications stack, the higher layer may ignore the heartbeat message. As an example, SIP may ignore otherwise valid SIP messages that contain only white space, such as spaces, tabs, carriage returns, line feeds, or CRLFs. If messages containing information other than CRLFs are received, the higher layer may attempt to interpret these messages as containing information that cannot be ignored. In various embodiments, the heartbeat may contain merely carriage returns, line feeds, or any data that would be valid at the transport layer but ignored by higher layers of the data communications stack. Thus, by sending valid transport layer messages, such as heartbeat messages that are ignored by SIP, the techniques can sustain session connections even when no information is exchanged between computing devices at layers of the data communications stack higher than the transport layer. What sorts of messages are valid but ignored may be defined in a protocol's specification or definition.
In an embodiment, messages ignored by layers above a transportation layer may include, e.g., carriage returns, line feeds, spaces, tabs, or any white space characters in general. In an embodiment, such messages containing merely white space characters may end with a carriage return or line feed. Characters that can be used in messages to sustain connections without causing excessive computational burden may be determined by analyzing the protocols used above the protocol that could close connections (e.g., TCP).
Turning now to the Figures,
Computing device 300 has a data communications stack comprising an application layer 304, transport layer 306, network layer 308, data link layer 310, and physical layer 312. The application layer may comprise a SIP application and other applications. The SIP application may have a component or layer that communicates with the transport layer. This communications layer of the SIP application may send or receive heartbeat messages. Computing devices 301, 302, and 303 may have similar data communications stacks. Specifically, computing device 303 may have a data communications stack comprising an application layer 314, transport layer 316, network layer 318, data link layer 320, and physical layer 322. Some or all of these application layers may additionally comprise a SIP application.
When two computing devices are connected, layers of their data communications stacks may have logical or physical connections between one another. As an example, when computing device 300 is connected to computing device 303 via computing devices 301 and 302, physical layer 312 may be connected via a physical connection 324 to the physical layer of computing device 301, which may be connected via a physical connection to the physical layer of computing device 302, and which may be connected via a physical connection to the physical layer of computing device 303. The computing devices may each be connected to one another via, e.g., a modem, network interface card, or other connection, such as over the Internet. The Internet is comprised of various physical connections, such as telephone lines, microwave links, and other forms of wired and wireless physical connections that together form multiple routes for messages. Indeed, an early goal for the Internet was to make it possible for a sender to communicate with a recipient despite network outages. When computing device 300 sends data to computing device 303, the data may travel on different physical routes through the Internet.
Higher layers of data communications stacks such as data link layers 310 and 320, network layers 308 and 318, transport layers 306 and 316, and application layers 304 and 314 may be connected by logical connections, such as logical connections 326, 328, 330, and 332. When two layers of a data communications stack have a logical connection and exchange data, the data may be sent “down” the data communications stack of the sending computing device to the physical layer of the data communications stack, across a physical connection, and then “up” the data communications stack of the receiving computing device. As an example, when a SIP application 304 of computing device 300 communicates with a SIP application 314 of computing device 303, the SIP application 304 may communicate data to transport layer 306, which may communicate the data to network layer 308, which may communicate the data to data link layer 310, which may communicate the data to physical layer 312. The physical layer 312 may communicate the data over the physical connection 324, and ultimately may communicate the data to the physical layer 322. The physical layer 322 may communicate data it receives up to data link layer 320, which may communicate the data to network layer 318, which may communicate the data to transport layer 316, which ultimately may communicate the data to the SIP application 314.
When computing devices 301 and 302 receive data on their physical connections, they may also send the data up their data communications stacks. It is possible that a lower level of the data communications stack than the application layer may be able to handle the received data without sending it to a higher layer. As an example, when SIP application 304 sends a SIP message to SIP application 314, it may send data corresponding to the SIP message to transport layer 306. The data communications stack of computing devices 301 and 302, which both receive the data on their physical layers, may communicate the received up their data communications stack to their respective transport layers. The transport layers, which recognize from the received data that the destination indicated in the data is computing device 303, may simply forward the data to transport layer 316 (via their corresponding lower levels). Thus, the application/SIP layers of computing devices 301 and 302 may not need to handle or even view the messages.
When no data is received by a transport layer of either computing device 301 or 302 over a period of time, the computing device may close the connection corresponding to a SIP session between computing devices 300 and 303. As an example, if computing device 301 fails to receive any messages on a connection from either computing device 300 or computing device 302, it may assume that the connection is no longer necessary and close the connection. Thus, further SIP messages between computing device 300 and 303 may not be able to transit a previously opened connection that transits computing devices 301 and 302.
The sustain_connection routine may be called repeatedly in a loop of a thread that is separate from threads that send or receive messages. Alternatively, the sustain_connection routine may be called in response to a clock event that is triggered at some specified time interval prior to the timeout.
Alternatively, the sustain_connection routine may simply send a heartbeat message at some interval less than the timeout value regardless of the clock value. As an example, if a proxy server may time out after one minute of inactivity, the routine may send a heartbeat message every 30 seconds even if a message was sent or received just a few seconds earlier.
At block 606, the subroutine calls the send_message subroutine and passes to it the created heartbeat message as a parameter. The send_message subroutine is described above in relation to
Upon receiving the message, the subroutine may reset a clock value (not shown) that may be used to determine whether a heartbeat message needs to be sent to sustain the session.
In an embodiment, a computing device may send heartbeat messages in some connections and may employ another mechanism with other connections to sustain its connections. As an example, the computing device may employ heartbeat messages when sending messages to its own “home” server or other computing devices connected thereto, because the home server may be able to associate the heartbeat message with the computing device's connection, and store an indication of the association. By doing so, the server may be able to efficiently update the computing device's presence information if the connection is subsequently lost. The computing device may send REGISTER messages in other connections when it may be advantageous to do so. As an example, the computing device may send REGISTER messages when an intermediate device fails to forward heartbeat messages.
Although particular examples discussed herein refer to using SIP and TCP, alternate embodiments may use other equivalent or similar protocols.
The computing device on which the techniques for sustaining session connections are implemented may include a central processing unit, memory, input devices (e.g., keyboard and pointing devices), output devices (e.g., display devices), and storage devices (e.g., disk drives). The memory and storage devices are computer-readable media that may contain instructions that implement the security system. In addition, the data structures and message structures may be stored or transmitted via a data transmission medium, such as a signal on a communications link. Various communications links may be used, such as the Internet, a local area network, a wide area network, or a point-to-point dial-up connection.
The techniques for sustaining session connections may be described in the general context of computer-executable instructions, such as program modules, executed by one or more computers or other devices. Generally, program modules include routines, programs, objects, components, data structures, etc., that perform particular tasks or implement particular abstract data types. Typically, the functionality of the program modules may be combined or distributed as desired in various embodiments.
From the foregoing, it will be appreciated that specific embodiments of the invention have been described herein for purposes of illustration, but that various modifications may be made without deviating from the spirit and scope of the invention. Accordingly, the invention is not limited except as by the appended claims.