This application relates generally to methods and apparatuses, including computer program products, for automatically creating Interactive Connectivity Establishment (ICE) relay candidates without using Traversal Using Relays around NAT (TURN).
An Internet Protocol (IP) session involves the connection between two devices across a network of routers, cables, and switches for the purpose of exchanging packets of information. For example, a web browser can establish an IP-based HTTPS session with a website for the purpose of retrieving information. In another example, a device can establish a Session Initiation Protocol (SIP) session with another computing device to, e.g., conduct a phone call.
Web browsers have recently begun adopting the Web Real-Time Communication (WebRTC) protocol for the purpose of establishing real-time audio and video sessions between browser clients. The WebRTC protocol as defined by the IETF relies on the ICE (RFC 5245) methods for establishing a direct communication link between the two clients. Under certain network topologies, the only means for a successful communication link is through the use of a media relay server placed out in the network.
For both SIP and WebRTC sessions, ICE defines that media relay server to be a Traversal Using Relays around NAT (TURN) server running the TURN protocol (RFC 5766). The TURN protocol however is susceptible to many types of attacks such as theft of service and distributed denial of service.
The ICE protocol is designed to allow two client devices to automatically discover the best way to send voice and video media streams across an IP network. Certain network topologies such as those using Firewall/NATs can prevent the devices from directly communicating due to the way in which some Firewall/NATs create and enforce IP address and port access. The NAT function automatically maps an external IP address and port to every outbound message stream the client produces. The procedures of ICE allow the client to learn what IP address and port the NAT has assigned. When a client decides to initiate a real-time session, the client must first determine which IP address and port combinations are available for the purpose of establishing a media connection with another client. These IP and port combinations are called “candidates” in the terminology of ICE.
In the simplest model, a client begins its candidate discovery by sending a STUN (RFC 5389) binding request to a STUN server somewhere out in the network. The STUN server responds to the client binding request and provides the IP address and port information of where the STUN server saw the binding request originate from. If the client is behind a Firewall/NAT, the STUN server sees the external IP address and port that the NAT assigned to this outbound message transmission. This is called the Server Reflexive candidate.
According to ICE, a client in need of a network media relay issues a TURN allocation request using a procedure similar to STUN. In addition to providing the Server Reflexive candidate, the TURN request asks the TURN server to allocate a media relay port for the client to use.
The website application server 102 hands the TURN service credentials to Client A 108 when Client A initiates a call request (e.g., a SIP INVITE, a WebRTC call request) by clicking on a web page link, for example. Client A 108 then issues the necessary resource allocation messages to the TURN server 104 using the TURN protocol. The allocation response from the TURN server 104 contains the ICE Relay candidate, referred to as (r1) in
In the TURN model, the client device then creates a Session Description Protocol (SDP) of its possible media candidates. In
After the STUN binding exchange, Client B 110 creates its SDP using the Host candidate (b1) and the Server Reflexive candidate (b2). The SDP is handed up to the website application server 102 using the media protocol and the website application server 102 delivers the SDP to Client A 108. Both Client A 108 and Client B 110 now have the other client's SDP and the STUN connectivity checks begin. ICE defines the priority of the various permutations that arise when each client systematically tries to communicate between candidates. Client A 108, for instance, attempts to send a STUN message from its Host candidate to Client B's (110) Host candidate (a1-b1). If Client A 108 does not see a response, then Client A tries to send a STUN connectivity message to the Server Reflexive candidate of Client B 110 and make the attempt occur between (a2-b2), as shown in
At the same time, Client B 110 performs STUN connectivity checks towards Client A's (108) candidates. If the Host and Reflexive candidates do not succeed, then Client B 110 sends a STUN connectivity check to the Relay candidate (r1). The TURN server 104 then encapsulates the request inside a TURN header and forwards the encapsulated request to Client A 108 using the established TURN binding (a2-t1). Because Client A 108 created that TURN binding, Client A receives the encapsulated STUN connectivity message and responds using the reverse path (a2-t1-r1-b2). Client B 110 then successfully receives the connectivity response and both clients 108, 110 decide to use that connection for the media stream. This entire process must happen for each media stream. For example, a call using audio and video will have to perform the above process twice before exchanging audio and video data.
The TURN server 104 shown in
The systems and methods described herein allow for a session border controller (SBC) or a Web Session Border Controller (WebSBC™) server to be used in place of a TURN server such that the media relay function is not subject to the service-affecting attacks of TURN. The WebSBC server, available from Sansay, Inc. of San Diego, Calif., differs from a TURN server fundamentally by only allowing resource allocations to occur directly from the website application server. In contrast, ICE deployments that rely on TURN servers initiate resource allocations directly from the client which result in the inherent security problems.
The WebSBC server, with its direct interface to the website application server, allows for secure media relay allocations to occur outside of the client's direct involvement or knowledge. The WebSBC server allocation information is provided to the website application server application for the purpose of manipulating the media relay binding description, also called the Session Description Protocol (SDP) (see RFC 4566) by adding the allocated relay candidate to the client-provided SDP. The SDP that is exchanged between the clients, through the website application server, therefore offers a pre-allocated media relay candidate that the clients naturally use according to the procedures of ICE. The WebSBC server performs the media relay function securely and effectively, without using the TURN protocol.
The invention, in one aspect, features a method for securely allocating media relay candidates without using TURN. A website application server receives a media relay binding description from a first client device, the binding description including relay candidates associated with the first client device. The website application server determines a subscription profile associated with the first client device and creates an allocation link to a session border controller based on the subscription profile. The website application server modifies the media relay binding description to include one or more relay ports located on the session border controller, and transmits the media relay binding description to a second client device. The website application server receives one or more relay candidates associated with the second client device, and transmits to the first client device, the media relay binding description including the one or more relay candidates associated with the second client device. A media relay connection is established between the first client device and the second client device based on the media relay binding description, the connection established via the relay ports located on the session border controller.
The invention, in another aspect, features a system for securely allocating media relay candidates without using TURN. The system includes a website application server configured to receive a media relay binding description from a first client device, the binding description including relay candidates associated with the first client device. The website application server is further configured to determine a subscription profile associated with the first client device and create an allocation link to a session border controller based on the subscription profile. The website application server is further configured to modify the media relay binding description to include one or more relay ports located on the session border controller, and transmit the media relay binding description to a second client device. The website application server is further configured to receive one or more relay candidates associated with the second client device, and transmit, to the first client device, the media relay binding description including the one or more relay candidates associated with the second client device. A media relay connection is established between the first client device and the second client device based on the media relay binding description, the connection established via the relay ports located on the session border controller.
The invention, in another aspect, features a computer program product, tangibly embodied in a non-transitory computer readable medium, for securely allocating media relay candidates without using TURN. The computer program product includes instructions operable to cause a data processing apparatus to receive a media relay binding description from a first client device, the binding description including relay candidates associated with the first client device. The computer program product includes instructions operable to cause the data processing apparatus to determine a subscription profile associated with the first client device and create an allocation link to a session border controller based on the subscription profile. The computer program product includes instructions operable to cause the data processing apparatus to modify the media relay binding description to include one or more relay ports located on the session border controller, and transmit the media relay binding description to a second client device. The computer program product includes instructions operable to cause the data processing apparatus to receive one or more relay candidates associated with the second client device, and transmit, to the first client device, the media relay binding description including the one or more relay candidates associated with the second client device. A media relay connection is established between the first client device and the second client device based on the media relay binding description, the connection established via the relay ports located on the session border controller.
Any of the above aspects can include one or more of the following features. In some embodiments, the relay candidates associated with the first client device and the relay candidates associated with the second client device are Interactive Connectivity Establishment (ICE) candidates. In some embodiments, the allocation link is created using a secure protocol. In some embodiments, the secure protocol is Representational State Transfer (REST) over Hypertext Transfer Protocol Secure (HTTPS). In some embodiments, the media relay connection uses the Web Real-Time Communication (WebRTC) protocol and the session border controller is a WebSBC server. In some embodiments, the media relay connection uses Session Initiation Protocol (SIP) and the session border controller is a SIP session border controller.
In some embodiments, establishing a media relay connection includes the session border controller receiving a connectivity message from the first client device via a first one of the relay ports located on the session border controller, and receiving a connectivity message from the second client device via a second one of the relay ports located on the session border controller. The session border controller authenticates the connectivity messages using credential information received from the website application server via the allocation link, and latches address information associated with the first client device to the first one of the relay ports. The session border controller latches address information associated with the second client device to the second one of the relay ports. The session border controller forwards the connectivity message from the first client device to the second client device, and forwards the connectivity message from the second client device to the first client device.
In some embodiments, the connectivity messages are based on the Session Traversal Utilities for NAT (STUN) protocol. In some embodiments, the connectivity message from the first client device includes the address information associated with the first client device and the connectivity message from the second client device includes the address information associated with the second client device. In some embodiments, the address information associated with the first client device represents a new IP address and port number generated by the first client device upon transmitting the connectivity message to the session border controller. In some embodiments, the address information associated with the second client device represents a new IP address and port number generated by the second client device upon transmitting the connectivity message to the session border controller.
In some embodiments, the media relay binding description is a Session Description Protocol (SDP) message. In some embodiments, the relay candidates associated with the first client device include a server reflexive address associated with a Network Address Translation (NAT) device coupled between the first client device and the website application server. In some embodiments, the relay candidates associated with the second client device include a server reflexive address associated with a Network Address Translation (NAT) device coupled between the second client device and the website application server.
Other aspects and advantages of the technology will become apparent from the following detailed description, taken in conjunction with the accompanying drawings, illustrating the principles of the technology by way of example only.
The advantages of the technology described above, together with further advantages, may be better understood by referring to the following description taken in conjunction with the accompanying drawings. The drawings are not necessarily to scale, emphasis instead generally being placed upon illustrating the principles of the technology.
The systems and methods described herein do not require a TURN server or TURN protocol to be run by the clients.
The system 200 includes a website application server 202 that is connected to a plurality of client computing devices (e.g., Client A 208, Client B 210) via Firewall/NAT devices 206a, 206b respectively. Example client devices can include, but are not limited to, personal computers, tablets, mobile computing devices, smart phones, and the like. The system also includes a WebSBC server 204 that is also connected to the client devices 208, 210 via the Firewall/NAT devices 206a, 206b.
The system 200 requires an Allocation Protocol link to be established between the website application server 202 and the WebSBC server 204. This link should use a secure protocol API such as Representational State Transfer (REST) over HTTP/HTTPS, but other secure protocols can be used as well. The link can be initiated by either the website application server 202 or the WebSBC server 204 based on preconfigured addresses. The link is expected to stay up for the duration of service and can provide for multiple client allocations over the common interface.
The binding request sent by Client A 208 results in the Server Reflexive address (a2) being created. Client A 208 then constructs its media relay binding description (also called SDP) using the candidates (a1) and (a2) and passes the media relay binding description up to the website application server 202 via WebSockets or HTTP. The website application server 202 receives (302) the media relay binding description that includes the ICE relay candidates associated with Client A 208.
The website application server 202 then determines (304) a subscription profile associated with Client A 208. The website application server 202 decides, based on the subscription profile, to create (306) an allocation link (denoted by the Allocation Protocol link in
The WebSBC server 204 then allocates two relay ports (r1) and (r2) and returns those candidates to the website application server 202 in an Allocation Protocol response. The website application server 202 then modifies (308) the media relay binding description by adding the relay candidate (r2) to Client A's (a1) and (a2) candidates, and transmits (310) the media relay binding description down to Client B 210. Client B responds by transmitting its two ICE relay candidates (b1) and (b2) to the website application server 202, and the website application server receives (312) the two ICE relay candidates from Client B 210. The website application server 202 then modifies the media relay binding description to add Relay candidate (r1) to the (b1) and (b2) candidates from Client B 210. The website application server 202 transmits (314) the media relay binding description, including the relay candidate (r1), to Client A 208. A media relay connection is established (316) between Client A 208 and Client B 210 based on the media relay binding description, via the relay ports (r1) and (r2) located on the WebSBC server 204.
The result of this network-based Relay candidate allocation is that Client A 208 is told to try to connect with Client B 210 using (b1), (b2), and (r1) candidates. Client B 210 subsequently tries to connect using (a1), (a2), and (r2) candidates. If the Host and Reflexive candidates are not able to communicate directly, then both clients 208, 210 begin sending STUN connectivity messages to the relay candidates (r1) and (r2). The act of sending those messages creates two new Firewall/NAT bindings (a3) and (b3).
The WebSBC server 204 in
For example, assume Client A 208 happens to attempt a connectivity check to the relay candidate (r1) before Client B 210 does its check to (r2). The WebSBC server 204 verifies that Client A 208 actually sent the connectivity message by performing the STUN short-term credential check using the information provided in Allocation Protocol request. If the authentication check passes, the WebSBC server 204 binds the (r1) relay port to the source IP and port (a3) of Client A's message. The WebSBC server 204 can either store that STUN message and deliver it to Client B 210 at a later time or the WebSBC server 204 can discard the message, relying on the fact that Client B's attempt should cause a retransmit from Client A 208. Once Client B 210 attempts a connectivity check to port (r2) and it is verified and latched, the WebSBC server 204 forwards it to the other latched connection on port (r1). From this point on, the two clients 208, 210 have a secure and authenticated path to complete the STUN connectivity handshake and can begin sending the media streams over the ports (a3-r1-r2-b3), as shown in
In a second embodiment, the system 200 of
In a third embodiment, the ICE relay candidate creation techniques described above can be applied to a system using a SIP SBC model.
The system 600 uses the same ICE relay candidate creation techniques as described above with respect to
Allocation Protocol Attributes
The following section provides additional detail on the attributes of the allocation protocol used by the system 200 in
ICE Media Relay Binding Description Examples Using a WebSBC Server
The following are examples of the media relay binding description, also called the SDP, transmitted between the client devices 208, 210 and the website application server 202, as described above with respect to
(1) Client A's SDP as offered to the website application server after its STUN binding request:
(2) Website application server modified SDP after WebSBC server relay allocation as sent to Client B:
b=RS:0
b=RR:0
(3) Client B's SDP as offered to the website application server after its STUN binding request:
(4) Website application server modified SDP as sent to Client A:
The above-described techniques can be implemented in digital and/or analog electronic circuitry, or in computer hardware, firmware, software, or in combinations of them. The implementation can be as a computer program product, i.e., a computer program tangibly embodied in a machine-readable storage device, for execution by, or to control the operation of, a data processing apparatus, e.g., a programmable processor, a computer, and/or multiple computers. A computer program can be written in any form of computer or programming language, including source code, compiled code, interpreted code and/or machine code, and the computer program can be deployed in any form, including as a stand-alone program or as a subroutine, element, or other unit suitable for use in a computing environment. A computer program can be deployed to be executed on one computer or on multiple computers at one or more sites.
Method steps can be performed by one or more processors executing a computer program to perform functions by operating on input data and/or generating output data. Method steps can also be performed by, and an apparatus can be implemented as, special purpose logic circuitry, e.g., a FPGA (field programmable gate array), a FPAA (field-programmable analog array), a CPLD (complex programmable logic device), a PSoC (Programmable System-on-Chip), ASIP (application-specific instruction-set processor), or an ASIC (application-specific integrated circuit), or the like. Subroutines can refer to portions of the stored computer program and/or the processor, and/or the special circuitry that implement one or more functions.
Processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital or analog computer. Generally, a processor receives instructions and data from a read-only memory or a random access memory or both. The essential elements of a computer are a processor for executing instructions and one or more memory devices for storing instructions and/or data. Memory devices, such as a cache, can be used to temporarily store data. Memory devices can also be used for long-term data storage. Generally, a computer also includes, or is operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto-optical disks, or optical disks. A computer can also be operatively coupled to a communications network in order to receive instructions and/or data from the network and/or to transfer instructions and/or data to the network. Computer-readable storage mediums suitable for embodying computer program instructions and data include all forms of volatile and non-volatile memory, including by way of example semiconductor memory devices, e.g., DRAM, SRAM, EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto-optical disks; and optical disks, e.g., CD, DVD, HD-DVD, and Blu-ray disks. The processor and the memory can be supplemented by and/or incorporated in special purpose logic circuitry.
To provide for interaction with a user, the above described techniques can be implemented on a computer in communication with a display device, e.g., a CRT (cathode ray tube), plasma, or LCD (liquid crystal display) monitor, for displaying information to the user and a keyboard and a pointing device, e.g., a mouse, a trackball, a touchpad, or a motion sensor, by which the user can provide input to the computer (e.g., interact with a user interface element). Other kinds of devices can be used to provide for interaction with a user as well; for example, feedback provided to the user can be any form of sensory feedback, e.g., visual feedback, auditory feedback, or tactile feedback; and input from the user can be received in any form, including acoustic, speech, and/or tactile input.
The above described techniques can be implemented in a distributed computing system that includes a back-end component. The back-end component can, for example, be a data server, a middleware component, and/or an application server. The above described techniques can be implemented in a distributed computing system that includes a front-end component. The front-end component can, for example, be a client computer having a graphical user interface, a Web browser through which a user can interact with an example implementation, and/or other graphical user interfaces for a transmitting device. The above described techniques can be implemented in a distributed computing system that includes any combination of such back-end, middleware, or front-end components.
The components of the computing system can be interconnected by transmission medium, which can include any form or medium of digital or analog data communication (e.g., a communication network). Transmission medium can include one or more packet-based networks and/or one or more circuit-based networks in any configuration. Packet-based networks can include, for example, the Internet, a carrier internet protocol (IP) network (e.g., local area network (LAN), wide area network (WAN), campus area network (CAN), metropolitan area network (MAN), home area network (HAN)), a private IP network, an IP private branch exchange (IPBX), a wireless network (e.g., radio access network (RAN), Bluetooth, Wi-Fi, WiMAX, general packet radio service (GPRS) network, HiperLAN), and/or other packet-based networks. Circuit-based networks can include, for example, the public switched telephone network (PSTN), a legacy private branch exchange (PBX), a wireless network (e.g., RAN, code-division multiple access (CDMA) network, time division multiple access (TDMA) network, global system for mobile communications (GSM) network), and/or other circuit-based networks.
Information transfer over transmission medium can be based on one or more communication protocols. Communication protocols can include, for example, Ethernet protocol, Internet Protocol (IP), Voice over IP (VoIP), a Peer-to-Peer (P2P) protocol, Hypertext Transfer Protocol (HTTP), Session Initiation Protocol (SIP), H.323, Media Gateway Control Protocol (MGCP), Signaling System #7 (SS7), a Global System for Mobile Communications (GSM) protocol, a Push-to-Talk (PTT) protocol, a PTT over Cellular (POC) protocol, Universal Mobile Telecommunications System (UMTS), 3GPP Long Term Evolution (LTE) and/or other communication protocols.
Devices of the computing system can include, for example, a computer, a computer with a browser device, a telephone, an IP phone, a mobile device (e.g., cellular phone, personal digital assistant (PDA) device, smart phone, tablet, laptop computer, electronic mail device), and/or other communication devices. The browser device includes, for example, a computer (e.g., desktop computer and/or laptop computer) with a World Wide Web browser (e.g., Chrome™ from Google, Inc., Microsoft® Internet Explorer® available from Microsoft Corporation, and/or Mozilla® Firefox available from Mozilla Corporation). Mobile computing device include, for example, a Blackberry® from Research in Motion, an iPhone® from Apple Corporation, and/or an Android™-based device. IP phones include, for example, a Cisco® Unified IP Phone 7985G and/or a Cisco® Unified Wireless Phone 7920 available from Cisco Systems, Inc.
Comprise, include, and/or plural forms of each are open ended and include the listed parts and can include additional parts that are not listed. And/or is open ended and includes one or more of the listed parts and combinations of the listed parts.
One skilled in the art will realize the technology may be embodied in other specific forms without departing from the spirit or essential characteristics thereof. The foregoing embodiments are therefore to be considered in all respects illustrative rather than limiting of the technology described herein.
This application claims priority to U.S. Provisional Patent Application No. 61/730,345, filed on Nov. 27, 2012, the contents of which are incorporated herein in their entirety.
Number | Name | Date | Kind |
---|---|---|---|
7983254 | Alt et al. | Jul 2011 | B2 |
8054827 | Riley et al. | Nov 2011 | B2 |
8204064 | MeLampy et al. | Jun 2012 | B2 |
8218458 | Flynn et al. | Jul 2012 | B2 |
8233484 | Hiribarren et al. | Jul 2012 | B2 |
8249076 | Bertone et al. | Aug 2012 | B1 |
8312169 | Perumal et al. | Nov 2012 | B2 |
8325741 | Zhu | Dec 2012 | B2 |
20020120749 | Widegren et al. | Aug 2002 | A1 |
20020184510 | Shieh | Dec 2002 | A1 |
20030120135 | Gopinathan et al. | Jun 2003 | A1 |
20060015463 | Gupta et al. | Jan 2006 | A1 |
20080259943 | Miyajima et al. | Oct 2008 | A1 |
20090006633 | Moore et al. | Jan 2009 | A1 |
20090135842 | Zhu | May 2009 | A1 |
20090245233 | Prasad et al. | Oct 2009 | A1 |
20090274146 | Zhu | Nov 2009 | A1 |
20100131621 | Zetterlund et al. | May 2010 | A1 |
20100217874 | Anantharaman et al. | Aug 2010 | A1 |
20100293297 | Perumal et al. | Nov 2010 | A1 |
20100312880 | Veits | Dec 2010 | A1 |
20110060835 | Dorso et al. | Mar 2011 | A1 |
20110122885 | Hedman et al. | May 2011 | A1 |
20110191493 | Lee et al. | Aug 2011 | A1 |
20110289221 | Lowekamp | Nov 2011 | A1 |
20120065813 | Nguyen et al. | Mar 2012 | A1 |
20120142341 | Nagpal et al. | Jun 2012 | A1 |
20120158974 | Perumal et al. | Jun 2012 | A1 |
20120300242 | Meike et al. | Nov 2012 | A1 |
20130089187 | Stahl | Apr 2013 | A1 |
20130163446 | Kruger et al. | Jun 2013 | A1 |
20130185440 | Blau et al. | Jul 2013 | A1 |
Number | Date | Country | |
---|---|---|---|
61730345 | Nov 2012 | US |