Audio-video telephony with firewalls and network address translation

Abstract
The present invention relates to a communications system (1) for making multimedia calls. The system comprises two multimedia terminals (10,12) and communication means for making a multimedia call over a shared communications network (20), including a firewall (26) through which the multimedia call must pass, and which restricts certain types of communication. Each terminal (10,12) has a number of logical communication ports for the multimedia call, including at least one dynamically assigned port. In the course of setting up the multimedia call, at least one of the terminals (10,12) is adapted to send a request to the other of the terminals to open up one or more of the dynamic ports in the other terminal. The system includes a proxy server (40) between the terminals (10,12) that acts for each terminal as a proxy for the other terminal during the course of the call. The proxy server (40) has logical communication ports for communication with the terminals including one or more pre-assigned ports. The firewall (26) is configured not to restrict communication between one or both terminals (10,12) and the pre-assigned port(s) of the proxy server (40). The proxy server (40) is configured to receive and forward the request(s) to open up said dynamic port(s) via one of its pre-assigned ports.
Description


[0001] The present invention relates to a communications system for making multimedia calls.


[0002] The rapidly evolving IP (Internet Protocol) data network is creating new opportunities and challenges for multimedia and voice communications service providers. Unprecedented levels of investment are being made in the data network backbone by incumbent telecommunication operators and next generation carriers and service providers. At the same time, broadband access technologies such as DSL and cable modems are bringing high speed Internet access to a wide community of users. The vision of service providers is to make use of the IP data network to deliver new voice, video and data services right to the desktop, the office and the home alongside high speed Internet access.


[0003] The importance of standards for wide spread communications is fundamental if terminals from different manufacturers are to inter-operate. In the multimedia arena, the current standard for real-time communications over packet networks (such as IP data networks) is the ITU standard H.323. H.323 is now a relatively mature standard having support from the multimedia communications industry that includes companies such as Microsoft, Cisco and Intel. For example, it is estimated that 75% of PCs have Microsoft's NetMeeting (trade mark) program installed. NetMeeting is an H.323 compliant software application used for multimedia (voice, video and data) communication.


[0004] Interoperability between equipment from different manufacturers is also now being achieved. Over 120 companies world-wide attended the last interoperability event hosted by the International Multimedia Telecommunications Consortium (IMTC), an independent organisation that exists to promote the interoperability of multimedia communications equipment. The event is a regular one that allows manufacturers to test and resolve inter-working issues.


[0005] Hitherto, there had been a number of barriers to the mass uptake of multimedia (particularly video) communications. Ease of use, quality, cost and communications bandwidth had all hampered growth in the market. Technological advances in video encoding, the ubiquity of cheap IP access and the current investment in the data network coupled with the rollout of DSL together with ISDN and Cable modem now alleviates most of these issues making multimedia and voice communications readily available.


[0006] As H.323 was being defined as a standard, it was assumed that there would be H.323-H.320 gateways that exist at the edge of network domains converting H.323 to H.320 for transport over the wide area between private networks. Therefore, implementations of H.323 over IP concentrated on communications within a single network.


[0007] However, IP continues to find favour as the wide area protocol. More and more organisations continue to base their entire data networks on IP. High speed Internet access, managed Intranets, Virtual Private Networks (VPNs) all based on IP are commonplace. The IP trend is causing H.320 as a multimedia protocol to decline. The market demand is to replace H.320 completely with H.323 over IP.


[0008] Unfortunately, unforeseen technical barriers to the real-world, wide area deployment of H.323 still exist. The technical barriers relate to the communications infrastructure at the boundaries of IP data networks.


[0009] The H.323 standard applies to multimedia communications over Packet Based Networks that have no guaranteed quality of service. It has been designed to be independent of the underlying transport network and protocols. Today the IP data network is the default and ubiquitous packet network and the majority (if not all) of implementations of H.323 are over an IP data network. Nevertheless, today, successful implementation of multimedia and voice communications are confined to Intranets or private managed IP networks because there are IP topological problems preventing the widespread deployment of H.323 between private IP networks and the public Internet or shared or managed IP networks.


[0010] The problems arise because of two IP technologies—Network Address Translation (NAT) and Firewalls.


[0011] NAT came about to solve the ‘shortage of addresses’ problem. Any endpoint or ‘host’ in an IP network has an ‘IP address’ to identify that endpoint so that data packets can be correctly sent or routed to it and packets received from it can be identified from where they originate. At the time of defining the IP address field no-one predicted the massive growth in desktop equipment. After a number of years of global IP deployment, it was soon realised that the number of endpoints wanting to communicate using the IP protocol would exceed the number of unique IP addresses possible from the address field. To increase the address field and make more addresses available would have required the entire IP infrastructure to be upgraded. (The industry is planning to do this with IP Version 6 at some point).


[0012] The solution of the day is now referred to as NAT. The first NAT solution, which is referred to as simple NAT in IETF RFC1631, uses a one-to-one mapping, came about before the World-Wide Web existed and when only a few hosts (e.g. email server, file transfer server) within an organisation needed to communicate externally to that organisation. NAT allows an enterprise to create a private IP network where each endpoint within that enterprise has an address that is unique only within the enterprise but is not globally unique. These are private IP addresses. This allows each host within an organisation to communicate (i.e. address) any other host within the organisation. For external communication, a public or globally unique IP address is needed. At the edge of that private IP network is a device that is responsible for translating a private IP address to/from a public IP address—the NAT function. The enterprise will have one or more public addresses belonging exclusively to the enterprise but in general fewer public addresses than hosts are needed either because only a few hosts need to communicate externally or because the number of simultaneous external communications is smaller. A more sophisticated embodiment of NAT has a pool of public IP addresses that are assigned dynamically on a first come first served basis for hosts needing to communicate externally. Fixed network address rules are required in the case where external equipment needs to send unsolicited packets to specific internal equipment.


[0013] Today we find that most private networks use private IP addresses from the 10.x.x.x address range. External communications are usually via a service provider that offers a service via a managed or shared IP network or via the public Internet. At the boundaries between each of the networks NAT is applied to change addresses to be unique within the IP network the packets are traversing. Simple NAT changes the complete IP address on a one-to-one mapping that may be permanent or dynamically created for the life of the communication session.


[0014] A consequence of NAT is that the private IP address of a host is not visible externally. This adds a level of security.


[0015] Web Servers, Mail Servers and Proxy Servers are examples of hosts that would need a static one-to-one NAT mapping to allow external communications to reach them.


[0016] While computers and networks connected via a common IP protocol made communications easier, the common protocol also made breaches in privacy and security much easier too. With relatively little computing skill it became possible to access private or confidential data and files and also to corrupt that business information maliciously. The industry's solution to such attacks is to deploy ‘firewalls’ at the boundaries of their private networks.


[0017] Firewalls are designed to restrict or ‘filter’ the type of IP traffic that may pass between the private and public IP networks. Firewalls can apply restrictions through rules at several levels. Restrictions may be applied at the IP address, the Port, the IP transport protocol (TCP or UDP for example) or the application. Restrictions are not symmetrical. Typically a firewall will be programmed to allow more communications from the private network (inside the firewall) to the public network (outside the firewall) than in the other direction.


[0018] With the birth of the World-Wide Web it has become increasingly difficult to apply firewall rules just to IP addresses. Any inside host (i.e. your PC) may want to connect to any outside host (the web server) dotted around the globe. The concept of a ‘well known port’ is applied to the problem. A port identifies one end of a point-to-point transport connection between 2 hosts. A ‘well-known port’ is a port that carries one ‘known’ type of traffic. IANA, the Internet Assigned Number Authority specifies the pre-assigned well-known ports and the type of traffic carried over them. For example port 80 has been assigned for web surfing (http protocol) traffic, port 25 Simple Mail Transport Protocol etc.


[0019] An example of a firewall filtering rule for Web Surfing would be:


[0020] Any inside IP address/any port number may connect to any outside IP address/Port 80 using TCP (Transport Connection protocol) and HTTP (the application protocol for Web Surfing).


[0021] The connection is bi-directional so traffic may flow back from the Web Server on the same path. The point is that the connection has to be initiated from the inside.


[0022] An example of a firewall filtering rule for email may be:


[0023] Any outside IP address/any port number may connect to IP address 192.3.4.5/port 25 using TCP and SMTP. The NAT function may change the destination IP address 192.3.4.5 to 10.6.7.8 which is the inside address of the mail server.


[0024] Filtering rules such as the following are frowned upon by IT managers:


[0025] Any inside IP address/any port number may connect to any outside IP address/any port number for TCP or UDP and vice versa.


[0026] Such rules are tantamount to opening up the firewall, as it is too broad a filter.


[0027] Both NAT and firewall functions prevent H.323 communication working where NAT and firewall functions exist between the endpoints. This will typically be the case when the endpoints are in different private networks, when one endpoint is in a private network and the other endpoint is in the Internet or when the endpoints are in different managed IP networks.


[0028] H.323 has been designed to be independent of the underlying network and transport protocols. Nevertheless, implementation of H.323 in an IP network is possible with the following mapping of the main concepts:
1H.323 address:IP addressH.323 logical channel:TCP/UDP Port connection


[0029] In the implementation of H.323 over IP, H.323 protocol messages are sent as the payload in IP packets using either TCP or UDP transport protocols. Many of the H.323 messages contain the H.323 address of the originating endpoint or the destination endpoint or both endpoints. In the IP world this means we have IP addresses inside an H.323 message that is sent in an IP packet whose header contains the IP addresses of source and destination hosts.


[0030] However, a problem arises in that simple NAT functions will change the IP addresses of source and destination hosts without changing the H.323 addresses in the H.323 payload. This causes the H.323 protocol to break and requires intermediary intelligence to manipulate H.323 payload addresses.


[0031] Because of the complexity of multimedia communications, H.323 requires several logical channels to be opened between the endpoint. Logical channels are needed for call control, capabilities exchange, audio, video and data. In a simple point-to-point H.323 multimedia session involving just audio and video, at least six logical channels are needed. In the IP implementation of H.323, logical channels are mapped to TCP or UDP port connection, many of which are assigned dynamically.


[0032] Another problem arises in that firewall functions filter out traffic on ports that they have no rules for. Either the firewall is opened, which defeats the purpose of the firewall, or much of the H.323 traffic will not pass through.


[0033] H.323 communication is therefore an anathema to firewalls. Either a firewall must become H.323 aware or some intermediary intelligence must manipulate the port assignments in a secure manner.


[0034] One possible solution to this problem would be a complete IP H.323 upgrade. This requires:


[0035] H.323 upgrade to the simple NAT function at each IP network boundary. The NAT function must scan all H.323 payloads and consistently change IP addresses.


[0036] H.323 upgrade to the firewall function at each IP network boundary. The firewall must understand and watch all H.323 communication so that it can open up the ports that are dynamically assigned and must filter all non-H.323 traffic on those ports.


[0037] Deployment of H.323 intelligence at the boundary or in the shared IP network to resolve and arbitrate addresses. IP addresses are rarely used directly by users. In practice, IP address aliases are used. Intelligence is needed to resolve aliases to an IP address. This H.323 function is contained within H.323 entities called GateKeepers.


[0038] The disadvantages of this possible solution are:


[0039] Each organisation/private network must have the same level of upgrade for H.323 communication to exist.


[0040] The upgrade is costly. New functionality or new equipment must be purchased, planned and deployed. IT managers must learn about H.323.


[0041] The continual parsing of H.323 packets to resolve the simple NAT and firewall function places a latency burden on the signal at each network boundary. The latency tolerance for audio and video is very small.


[0042] As a result of these problems, the H.323 protocol is not being used for voice and multimedia communications when there is a firewall or network address translation (NAT). One approach has been to place H.323 systems on the public side of the firewall and NAT functions. This allows them to use H.323 while also allowing them to protect the remainder of their network. The disadvantages of this are:


[0043] 1. The most ubiquitous device for video communications is the desktop PC. It is nonsensical to place all desktop computers on the public side!


[0044] 2. The H.323 systems are not protected from attackers on the public side of the firewall.


[0045] 3. The companies are not able to take advantage of the potentially ubiquitous nature of H.323, since only the special systems will be allowed to conduct H.323 communications.


[0046] 4. The companies will not be able to take full advantage of the data-sharing facilities in H.323 because the firewall will prevent the H.323 systems from accessing the data. Opening the firewall to allow data-transfer functions from the H.323 system is not an option because it would allow an attacker to use the H.323 system as a relay.


[0047] H.323 is not the only protocol now being used for real-time voice and multimedia communications over IP networks. SIP (Session Initiation Protocol as define in the IETF RFC 2543), MGCP (Media Gateway Control Protocol), H.248 (sometimes referred to as Megaco—ITU's equivalent of MGCP,) have gained industry acceptance. All these protocols suffer the identical infrastructural problems caused by firewalls and NATs and cannot traverse them unaided.


[0048] It is an object of the present invention to address the problems caused by firewalls and NATs for voice and multimedia communications through firewalls and NATs in a common way for all protocols. Therefore, although the present invention is described with reference to the H.323 protocol it applies to all the real-time IP communication protocols including H.323, SIP, MGCP, Megaco and others. Likewise, the present invention applies to voice only as well as multimedia communications. Therefore, within the context of the description ‘multimedia’ means any combination of voice and/or video communications.


[0049] Accordingly, the invention provides a communications system for making a multimedia call, comprising, a first multimedia terminal, a second multimedia terminal, communication means for making a multimedia call over a shared communications network, said communication means including a first communication means and a second communication means associated respectively with the first multimedia terminal and the second multimedia terminal, the first communication means including a first firewall through which the multimedia call must pass, in which:


[0050] i) the first firewall is configured to restrict certain types of communication between the first terminal and the public shared communications network;


[0051] ii) each terminal has a number of logical communication ports for transmitting and/or receiving the multimedia call, including at least one dynamically assigned port;


[0052] iii) in the course of setting up a multimedia call, at least one of the terminals is adapted to send a request to the other of the terminals to open up one or more of the dynamic ports in the terminal receiving said request;


[0053] characterised in that:


[0054] iv) the system includes a proxy server between the first terminal and the second terminal that acts for each terminal as a proxy for the other terminal during the course of a multimedia call;


[0055] v) the proxy server has logical communication ports for communication with the terminals including one or more pre-assigned ports for communication with the first terminal;


[0056] vi) the first firewall is configured not to restrict communication between the first terminal and the pre-assigned port(s) of the proxy server; and


[0057] vii) the proxy server is configured to receive and forward the request(s) to open up said dynamic port(s) via one of its pre-assigned ports. Also according to the invention, there is provided a method of making a multimedia call using a communications system that comprises a first multimedia terminal, a second multimedia terminal, communication means including a first communication means and a second communication means associated respectively with the first multimedia terminal and the second multimedia terminal, wherein each terminal has a number of logical communication ports for transmitting and/or receiving the multimedia call, including at least one dynamically assigned port, and the first communication means includes a first firewall configured to restrict certain types of communication between the first terminal and the shared communications network, in which the method comprises the steps of:


[0058] a) setting up a multimedia call over a shared communications network with the first communications means and the second communications means between the first multimedia terminal and the second multimedia via the first firewall;


[0059] b) in the course of setting up a multimedia call, at least one of the terminals sends a request to the other of the terminals to open up one or more of the dynamic ports in the terminal receiving said request;


[0060] characterised in that the method comprises the steps of:


[0061] c) including a proxy server between the first terminal and the second terminal that acts for each terminal as a proxy for the other terminal during the course of a multimedia call, the proxy server having logical communication ports for communication with the terminals including one or more pre-assigned ports for communication with the first terminal;


[0062] d) configuring the first firewall not to restrict communication between the first terminal and the pre-assigned port(s) of the proxy server; and


[0063] e) configuring the proxy server to receive and forward the request(s) to open up said dynamic port(s) via one of its pre-assigned ports.


[0064] Such a system may be used for making a multimedia call according to the H.323 or H.248 standard of the International Telecommunications Union. Alternatively, the system may be used for making a multimedia call according to the SIP or MGCP standard of the Internet Engineering Task Force. Furthermore, the proxy may support mixed protocol environments.


[0065] The shared communications network may comprise a public network such as the public switched telephone network (PSTN) and the public Internet data network, or it may be any other IP network where firewalls may be deployed to demarcate and restrict traffic crossing network boundaries. For example, in one embodiment of the invention, the proxy server is place in the de-militarised zone (DMZ) of an enterprise's network, where the firewall is restricting traffic passing into the private network from the DMZ.


[0066] The proxy server will usually be remote from both of the first multimedia terminal and the second multimedia terminal, for example being connected to both the first and second multimedia terminals through the shared IP network.


[0067] In many cases, the second communication means will also include a second firewall through which the multimedia call must pass. The second firewall may then be configured to restrict certain types of communication between the second terminal and the shared communications network. The proxy server will then have logical communication ports for communication with the terminals including one or more pre-assigned ports for communication with the second terminal. The second firewall can then be configured not to restrict communication between the second terminal and the pre-assigned port(s) of the proxy server.


[0068] In a preferred embodiment of the invention, the number of pre-assigned ports of the proxy server is less than or equal to the total number of dynamically assigned ports for the terminal(s). For example, the proxy server may have two pre-assigned ports, and preferably three pre-assigned ports, one for TCP and two for UDP.


[0069] The terminals may be adapted to transmit and/or receive multimedia media signals together with associated multimedia control signals, the control signals being sent to one of the pre-assigned ports and the media signals being sent to the other pre-assigned ports.


[0070] Preferably, at least one the logical communications ports is a pre-assigned port, said request being sent to the pre-assigned port as an initial request to initiate communication over the communication link.


[0071] The communication means may be adapted for making a multimedia call at least in part via the internet, in which case the proxy server will have a public internet protocol address by which the or each of the terminals communicate with the proxy server, the firewall(s) being configured not to restrict communication between the terminal(s) and the pre-assigned port(s)of the proxy server.


[0072] The invention is applicable to the case where there is one or more pair(s) of first terminals and of second terminals. For example, several first multimedia terminals at one site may each connect to corresponding other second multimedia terminals at a variety of other sites.






[0073] The invention will be described by way of example, with reference to the accompanying drawings, in which:


[0074]
FIG. 1 is a schematic diagram of a communications system according to a preferred embodiment of the invention for making a multimedia call; and


[0075] FIGS. 2 to 9 are schematic diagrams showing the method of setting up a multimedia call according to a preferred embodiment of the invention.






[0076] The alternative to a complete H.323 upgrade is presented in the example described with reference to FIG. 1. This shows a communication system 1 having a first enterprise 2 and a second enterprise 4, each of which include private networks 6,8 both of which have one or more H.323 terminals 10,12. Each private network 6,8 has private IP addresses 14,16 coincidentally within the 10.x.x.x address range. The private IP addresses 14,16 may result from a static assignment or dynamic assignment through normal DHCP procedures. External communication is via a shared, managed or public Internet 20. For external communication, the first enterprise 2 has a public IP address pool 22 beginning at 192.1.1.1 and the second enterprise 4 has a public IP address pool 24 beginning at 206.1.1.1. Each enterprise has a router 32,34 that is programmed with translation rules to perform a simple Network Address Translation (NAT) function, either a standing mapping between inside addresses 14,16 (private) and outside addresses 22,24 (public) or to make dynamic mappings based on which H.323 of the terminals 10,12 on the private network 6,8 connects first to the shared network 20 via the corresponding router 32,34.


[0077] The private networks 6,8 are each protected at their edge edges with firewall functions 26,28. The firewall functions are configured with the rules shown in Table 1 to allow H.323 traffic. The rules take into account the well known ports for H.323 and T.120, which are 1718, 1719, 1720 and 1503 and also two new well known ports proposed under the invention, referred to as X and Y.
2TABLE 1From IPFromTo IPToIPRuleAddressPortAddressPortprotocolApplication1AnyAny224.0.1.411718UDPGK discoveryRequests2Proxy1718AnyAnyUDPGK discoveryserverResponses3AnyAnyProxy1719UDPGKserverRegistrationrequests4Proxy1719AnyAnyUDPGatekeeperserverRegistrationresponses5AnyAnyProxy1720TCPOutbound CallserverControl (Q.931)6Proxy1720AnyAnyTCPInbound CallserverControl (Q.931)7AnyAnyProxyYTCPOutbound MediaserverControl (H.245)8ProxyYAnyAnyTCPInbound MediaserverControl (H.245)9AnyAnyProxyXUDPOutbound Mediaserver(RTP)10ProxyXAnyAnyUDPInbound Mediaserver(RTP)11AnyAnyProxyYUDPOutbound Mediaserver(RTCP)12ProxyYAnyAnyUDPInbound Mediaserver(RTCP)13AnyAnyProxy1503TCPOutbound Dataserver(T.120)14Proxy1503AnyAnyTCPInbound Dataserver(T.120)


[0078] In the above table, the specifically listed port numbers are the registered port numbers according to standards agreed to by IANA.


[0079] In order for H.323 terminals 10 in the first enterprise 2 to communicate with other H.323 terminals 12 in the second enterprise 4, there must exist a shared network 20 to which a proxy server 40 is connected, for example, via a router 38. The proxy server 40 has a public IP address 44, for example 45.6.7.8. The proxy server would also have two new well known ports numbers X,Y 46 that would have to be agreed and registered in advance with IANA.


[0080] The proxy server 40 appears to H.323 terminals as if it is their H.323 gatekeeper (or SIP registrar for SIP terminals, etc.). During a call, to any one terminal the proxy server appears as the other or remote terminal. To gatekeepers, the proxy server appears as all their endpoints. The gatekeeper function (not shown) may be co-resident with the proxy server or remote from it.


[0081] When an H.323 terminal 10,12 is switched on it first discovers and then registers with the gatekeeper function through the proxy server 40 in order to make known that it is ready to make or receive multimedia calls. The registration process requires the terminal 10,12 to pass its own IP address 14,16 in an H.323 message to the gatekeeper function through the proxy server 40. When leaving the terminal, the source address field of the IP packet is the private IP address of the terminal 14,16. However, as that IP packet passes through the simple NAT function the source address in the IP packet is changed to its public equivalent 22,24. Because the NAT function is unaware of the H.323 payload containing the private IP address of the terminal, this IP address is not changed. As the registration messages pass through the proxy server on the way to the gatekeeper, the proxy server 40 stores both the terminal's ‘apparent’ IP address 22,24 (i.e. where the packet appeared to come from following the NAT change) as well as the terminal's private or ‘real’ IP address 14,16. During future call control requests to the gatekeeper function, the proxy server would mandate that all call control will be handled by the various functions (call control, media control and media processing) within the proxy server at the proxy server's IP address 40. In this illustration of the invention, we have assumed that the proxy server is a single device with a single IP address. In other embodiments of the invention the ‘proxy server’ may be several co-operating devices. Additionally, the proxy server device(s) may each have one or multiple IP addresses. Where multiple IP addresses are used, the normal practise is allocate them from a single subnet, then the programming of the firewall rules becomes specifying the allowed ports to and from a subnet rather than individual IP addresses.


[0082] If the H.323 terminal 10,12 does not support a gatekeeper registration function, the terminal must be given a static private IP address. A static NAT rule then must be made in the simple NAT function of the router 32,34 and the proxy server 40 must be programmed with the static apparent IP address 14,16 and the real IP address 22,24 of the terminal 10,12. The terminal 10,12 is programmed to pass all call control requests to the proxy server 40 as in the previous case.


[0083] FIGS. 2 to 9 show a method of setting up a multimedia call according to a preferred embodiment of the invention. These drawings show seven steps or stages, as described below:


[0084] Step 1. FIGS. 2 and 3:


[0085] The user A at terminal A110 uses H.323 software to place a multimedia call to the user B at terminal B112. Software running in terminal A110 composes an H.323 setup message containing the identities of A and B, and the true IP address 14 of terminal A110 and the true IP address 44 of the proxy server 42. This message 50 is then placed from a local port PA1 11 in one or more TCP IP packets, which are labelled with terminal A1's 10 IP address 16 as source, and the proxy server's 42 IP address 44 as destination. The setup message is sent to a pre-assigned port 41 of the proxy server 42, here port number 1720. As these packets 50 pass through the simple Network Address Translation (NAT) function in router 32, the source IP address 14 in the IP packet is changed to the public equivalent IP address 18 (e.g. 10.1.1.1 becomes 192.1.1.1). The H.323 message 50 itself is unchanged.


[0086] This setup message transmitted by terminal A110 is represented by:
3TCP Packet|SourceIP/Port: 10.1.1.1/PA1|DestinationIP/Port: 45.6.7.8/1720H.323|SourceIP/Port: 10.1.1.1/PA1|DestinationIP/Port: 45.6.7.8/1720


[0087] The setup message is altered by the router 32, and is then represented by:
4TCP Packet|SourceIP/Port: 192.1.1.1/PA1|DestinationIP/Port: 45.6.7.8/1720H.323|SourceIP/Port: 10.1.1.1/PA1|DestinationIP/Port: 45.6.7.8/1720


[0088] Step 2. FIGS. 3 and 4:


[0089] The H.323 setup message 51 reaches the proxy server 42, which determines the location of user B (in conjunction with some gatekeeper function, for example), and composes a similar H.323 new setup message 52 to send there. This new setup message 52 contains the identities of A and B, and the true IP address 44 (e.g. 45.6.7.8) of the proxy server 42 and the true IP address 16 (e.g. 10.1.1.1) of the terminal B112. The proxy server 42 then sends this message 52 from a pre-assigned port 55, here port number 2777, to the public IP address 17 (e.g. 206.1.1.1) of terminal B112; the IP packets are labelled with the IP address 44 of the proxy server 42 as source, and the public IP address 17 of terminal B112 as destination.


[0090] The new setup message 52 forwarded by the proxy server 42 can be represented by:
5TCP Packet|SourceIP/Port: 45.6.7.8/2777|DestinationIP/Port: 206.2.1.1/1720H.323|SourceIP/Port: 45.6.7.8/2777|DestinationIP/Port: 10.1.1.1/1720


[0091] Step 3, FIGS. 4 and 5:


[0092] The simple NAT function in the router 34 changes the IP packets so that their destination address 17 becomes the true IP address 16 of terminal B112. The H.323 message 53 contained in the packets is not changed, but because the proxy server 42 inserted the true IP address 16 before sending the message 52, the message 53 forwarded by the router 34 now has the correct IP address 16. This forwarded message 53 contains information that identifies the call as originating with the user at terminal A110.


[0093] The setup message altered by the router 34 is then represented by:
6TCP Packet|SourceIP/Port: 45.6.7.8/2777|DestinationIP/Port: 10.1.1.1/1720H.323|SourceIP/Port: 45.6.7.8/2777|DestinationIP/Port: 10.1.1.1/1720


[0094] Step 4. FIG. 6:


[0095] The terminals A110 and B112 decide, for example in a process as set out by well-known internationally agreed standards, that they will send audio and/or video signals.


[0096] The process is the same in either direction, and is also the same for audio as it is for video.


[0097] Terminal B112 prepares a new TCP port PB1 13 on which it will receive a connection from H.245 communication. It then sends an H.323 “connect” message 54 back to the proxy server 42. The address of the new port 10.1.1.1/PB1 is in the message 54.


[0098] This is represented by:
7TCP Packet|SourceIP/Port: 10.1.1.1/1720|DestinationIP/Port: 45.6.7.8/2777H.323|H.245 addressIP/Port: 10.1.1.1/PB1


[0099] The router 34 translates the message as:
8TCP Packet|SourceIP/Port: 206.1.1.1/1720|DestinationIP/Port: 45.6.7.8/2777H.323|H.245 addressIP/Port: 10.1.1.1/PB1


[0100] Step 5. FIG. 7:


[0101] The proxy server 42 sends an H.323 “connect” message 56 to terminal A110 at IP address 192.1.1.1/PA1. The message names the IP address 45.6.7.8/2777 as the port to which terminal A110 should make an H.245 connection.


[0102] This is represented by:
9TCP Packet|SourceIP/Port: 45.6.7.8/1720|DestinationIP/Port: 192.1.1.1/PA1H.323|H.245 addressIP/Port: 45.6.7.8/2777


[0103] The router 34 translates the message 57 and forwards this to the terminal A110 as:
10TCP Packet|SourceIP/Port: 45.6.7.8/1720|DestinationIP/Port: 10.1.1.1/PA1H.323|H.245 addressIP/Port: 45.6.7.8/2777


[0104] Step 6, FIG. 8:


[0105] Next, two events take place, either independently, or one after the other. First, terminal A110 establishes H.245 communications with the proxy server 42. The IP packets 58,59 that carry the H.245 communication are subject to the translations at the router 32 as the initial setup messages described above. Second, the proxy server 42 makes a similar H.245 connection 60,61 to the terminal B112 via the router 34, with address translation in the same manner as described above. At this stage, there are no IP address carried in the H245 messages 58,59;60,61.


[0106] Step 7. FIGS. 9A to 9E:


[0107] Terminals A110 and B112 follow normal H.245 protocols to open logical channels to carry audio and/or video signals. Each channel carries either audio or video, but never both. The process is the same for all channels. A number of ports 27,29 are opened in both the terminals 10,12 and the proxy server 42, as shown in summary form in FIG. 9A.


[0108] The order in which the various ports are opened can vary, and one particular example is described here. In particular, although the steps shown in FIG. 9E are shown occurring after those of FIG. 9D, the steps of FIG. E may happen either before those shown in FIG. 9C, or between those shown in FIGS. 9C and 9D.


[0109] First as shown in FIG. 9B, terminal A110 sets up a dynamic port pair PA3/UDP and PA4/UDP 31 as an audio channel for sending audio. Numerically, according to the rules for RTP communication (standard IETF RFC 1889), PA4=PA3+1, and PA3 is an even number. Port PA3 is used for RTP communication, and port PA4 is used for RTCP communication.


[0110] Terminal A110 sends the necessary “open logical channel” message 62 to the proxy server 42. The NAT function in the router 32 forwards a translated message 63 and IP packets as:
11BeforeAfterTCP|SourceIP/Port:10.1.1.1/PA2192.1.1.1/PA2|DestinationIP/Port:45.6.7.8/277745.6.7.8/2777RTCP|Address10.1.1.1/PB4unchanged


[0111] Then as shown in FIG. 9C, the proxy server 42 composes a similar new message 64 to terminal B112. The proxy server 42 places the identity of the pre-assigned ports in this message, along with information about the nature of the signal. The message 64 is constructed with 45.6.7.8/2777 (UDP) as the RTCP address at the proxy server. The encoding method may be the same as the encoding method selected by terminal A1, or it may be different. The proxy server 42 then transmits the message 64 in IP packets to terminal B112's public IP address 17.


[0112] The message passes through the simple NAT function at the router 34. This changes the destination IP address in the packets to be terminal B112's true address 16. The terminal receives the message 65, and opens a pair of dynamic ports 35 to receive the signal.


[0113] This is represented by:
12BeforeAfterTCP|SourceIP/Port:45.6.7.8/277745.6.7.8/2777|DestinationIP/Port:206.1.1.1/PB110.1.1.1/PB1RTCP|Address45.6.7.8/2777unchanged


[0114] Then, as shown in FIG. 9D, the terminal B112 replies with an “open logical channel acknowledge, response 66 that contains the true IP addresses 16 of terminal B112, and the port numbers of the dynamic ports 35 that the terminal B1 has opened.


[0115] The “open logical channel acknowledge” message 66 gives the RTP and RTCP addresses of the terminal B112, here 10.1.1.1/PB2 and 10.1.1.1/PB3. In this example, PB2 is an even number, and PB3=PB2+1. This message 66 is placed into IP packets having a source IP address equal to the true IP address 16 of the terminal B112, and a destination address equal to the IP address 44 of the proxy server 42. The message 66 passes through the router 34 which uses the simple NAT function to forward a translated message 67 to the proxy server 42 having the true IP address 16 of terminal B112 changed to the public IP address 17. The packet reaches the proxy server 42, which uses the dynamic port numbers from the message plus the public IP address (206.1.1.1) of terminal B112 to open its pre-assigned ports 33 to send the audio signal to terminal B112. The router 34 does not change the addresses in the H.323 message.


[0116] This is represented by:
13BeforeAfterTCP|SourceIP/Port:10.11.1/PB1206.1.1.1/PB1|DestinationIP/Port:45.6.7.8/277745.6.7.8/2777RTP|Address10.1.1.1/PB2 (UDP)unchangedRTCP|Address10.1.1.1/PB3 (UDP)unchanged


[0117] Finally, as shown in FIG. 9E, the proxy server 42 transmits an “open logical channel acknowledge” response 68 to the public IP address 18 of terminal A110 to tell the terminal the ports that will receive the audio signal. In this example, the message lists the pre-assigned ports 2776/UDP and 2777/UDP at the proxy server 42 as the ports for RTP and RTCP respectively. The router 32 modifies the IP address of the terminal in the IP packet of the forwarded message 69, but makes no change to the response itself. The terminal receives this message 69, and begins to send the audio signal.


[0118] The setup message altered by the router 32 is translated as follows:
14BeforeAfterTCP|SourceIP/Port:45.6.7.8/277745.6.7.8/2777|DestinationIP/Port:192.1.1.1/PA210.1.1.1/PA2RTP|Address45.6.7.8/2776unchangedRTCP|Address45.6.7.8/2777unchanged


[0119] Multimedia communication (“media data”) may then flow between the two terminals 10,12. As terminal A110 generates media data for the new channel, it sends it from a new third port 10.1.1.1/PA3 to the proxy server 42 at 45.6.7.8/2776. The proxy server 42 receives the media data, and determines from the apparent source address that the packets are intended for the logical channel, and forwards them to B112 by sending them from 45.6.7.8/2776 to terminal B112 at 206.1.1.1/PB2. The proxy server 42 may perform processing before sending the media data onwards, or it may forward the media data unaltered.


[0120] In this example, the proxy server 42 must record the apparent or “public” IP address 18, here 192.1.1.1, of terminal A110 because it will not have direct access to the true originating address 14, here 10.1.1.1, as it receives the packets of media data.


[0121] The invention described above allows H.323 endpoints located in different secure and private IP data networks to communicate with each other without compromising the data privacy and data security of the individual private networks. The invention relates to a method and apparatus that has the advantage of working with existing firewalls, routers and proxies thus saving the costs of upgrading those devices to be fully H.323 compliant or deploying additional H.323 devices. One aspect of the invention presented herein applies to those deployments where simple (1-to-1) NAT (Network Address translation) mapping may be applied at the edge of the private networks, or where NAT may be bypassed. A separate aspect of the invention applies to deployments where NAPT (Network Address and Port Translation) is applied at the edge of the private networks. The two aspects of the invention can coexist and the apparatus can allow communications to take place between private networks following one method and private networks following the other method. Similarly within a single private network, some terminals may use one method (e.g. dedicated room systems) whereas other terminals may use the second method (e.g. desktop client PCs).


[0122] The invention presented herein are illustrated with reference to the ITU H.323 standard as that is the predominant standard for multimedia communications over packet networks including IP networks. However, it is equally applicable to other standards or methods that need to dynamically assign ports to carry bi-directional information , for example SIP, MGCP and H.248.


[0123] In summary, the invention provides a method and a system for allowing H.323 terminals located in private IP networks that: does not compromise the existing security procedures and measures; that avoids the need to upgrade existing firewalls, routers and proxies; and that avoids the deployment in the private network of additional specialist H.323 equipment. The invention also permits standard H.323 equipment in one private network to communicate with other H.323 terminals in the same or different private and/or public IP networks via an H.323 proxy server using a shared or public IP network.


[0124] Note that the static private IP address of an H.323 terminal may in fact be the same as the public IP address to which it is mapped, in which case the one-to-one mapping is transparent.


[0125] The advantages of the approach described above are that:


[0126] NAT and firewall functions do not need to be upgraded.


[0127] Connectivity may be provided by a service provider through a shared network, or by enterprises themselves using the public internet


[0128] Latency of the signal is kept to a minimum.


[0129] Organisations can therefore subscribe to a shared resource in a shared IP network. Costs are kept to a minimum/shared and security is not compromised.

Claims
  • 1. A communications system (1) for making a multimedia call, comprising, a first multimedia terminal (10), a second multimedia terminal (12), communication means for making a multimedia call over a shared communications network (20), said communication means including a first communication means and a second communication means associated respectively with the first multimedia terminal (10) and the second multimedia terminal (12), the first communication means including a first firewall (26) through which the multimedia call must pass, in which: i) the first firewall (26) is configured to restrict certain types of communication between the first terminal (10) and the shared communications network (20); ii) each terminal (10,12) has a number of logical communication ports (27,29) for transmitting and/or receiving the multimedia call, including at least one dynamically assigned port (31,35); iii) in the course of setting up a multimedia call, at least one of the terminals (10,12) is adapted to send a request (62) to the other of the terminals to open up one or more of the dynamic ports (35) in the terminal receiving said request; characterised in that: iv) the system (1) includes a proxy server (40) between the first terminal (10) and the second terminal (12) that acts for each terminal (10,12) as a proxy for the other terminal during the course of a multimedia call; v) the proxy server (40) has logical communication ports (33) for communication with the terminals (10,12) including one or more pre-assigned ports (41,55) for communication with the first terminal (10); vi) the first firewall (26) is configured not to restrict communication between the first terminal (10) and the pre-assigned port(s) (41,55) of the proxy server (40); and vii) the proxy server (40) is configured to receive and forward (64) the request(s) (62) to open up said dynamic port(s) (35) via one of its pre-assigned ports (41,55).
  • 2. A communication system (1) as claimed in claim 1, in which: viii) the second communication means includes a second firewall (28) through which the multimedia call must pass; ix) the second firewall (28) is configured to restrict certain types of communication between the second terminal (12) and the shared communications network (20); x) the proxy server (40) has logical communication ports (33) for communication with the terminals (10,12) including one or more pre-assigned ports (41,55) for communication with the second terminal (12); and xi) the second firewall (28) is configured not to restrict communication between the second terminal (12) and the pre-assigned port(s) (41,55) of the proxy server (40).
  • 3. A communication system (1) as claimed in claim 1 or claim 2, in which the number of pre-assigned ports (41,55) of the proxy server (40) is less than or equal to the total number of dynamically assigned ports (31,35) for the terminal(s) (10,12).
  • 4. A communication system (1) as claimed in claim 3, in which the proxy server (40) has at least one pre-assigned port number.
  • 5. A communication system (1) as claimed in claim 4, in which the proxy server (40) has two pre-assigned port numbers.
  • 6. A communication system (1) as claimed in any preceding claim, in which the terminals (10,12) are adapted to transmit and/or receive multimedia media signals together with associated multimedia control signals (59,60), the control signals being sent to one of the pre-assigned ports (41) and the media signals being sent to the other of the pre-assigned ports (55).
  • 7. A communication system (1) as claimed in any preceding claim, in which at least one of the logical communications ports is a pre-assigned port, said request (62) being sent to the pre-assigned port (41) as an initial request to initiate communication over the communication link.
  • 8. A communication system (1) as claimed in any preceding claim, in which the communication means is adapted for making a multimedia call at least in part via the internet, and the proxy server (40) has one or multiple public internet protocol address(es) by which the or each of the terminals (10,12) communicate with the proxy server (40), the firewall(s) (26,28) being configured not to restrict communication between the terminal(s) (10,12) and the internet protocol address(es) and pre-assigned logical port numbers (41,55) of the proxy server (40).
  • 9. A communication system (1) as claimed in any preceding claim, in which there is a plurality of pairs of first terminals (10) and of second terminals (12).
  • 10. A communication system (1) as claimed in any preceding claim, in which the system (1) is for making a multimedia call according to the H.323 standard of the International Telecommunications Union.
  • 11. A communications system (1) as claimed in any preceding claim, in which the system (1) is for making a multimedia call according to the SIP standard of the Internet Engineering Task Force.
  • 12. A communications system (1) as claimed in any preceding claim, in which the system (1) is for making a multimedia call according to the MGCP standard of the Internet Engineering Task Force.
  • 13. A communications system (1) as claimed in any preceding claim, in which the system (1) is for making a multimedia call according to the H.248 standard of the ITU.
  • 14. A communications system (1) as claimed in any preceding claim, in which the second terminal (12) is another proxy server (40) serving a remote community of terminals and endpoints.
  • 15. A communications system (1) as claimed in any preceding claim, in which a third party deploys the proxy server (40) for the provision of communication services between enterprises.
  • 16. A communications system (1) as claimed in any preceding claim, in which the first terminal's enterprise deploys the proxy server (40) for the provision of external communication service with other enterprises, service providers or its remote branches.
  • 17. A communications system (1) as claimed in any preceding claim, in which the gatekeeper function is co-resident with the proxy server (40).
  • 18. A communications system (1) as claimed in any of clams 1 to 16, in which the gatekeeper function is a separate system from the proxy server (40).
  • 19. A method of making a multimedia call using a communications system (1) that comprises a first multimedia terminal (10), a second multimedia terminal (12), communication means including a first communication means and a second communication means associated respectively with the first multimedia terminal (10) and the second multimedia terminal (12), wherein each terminal (10,12) has a number of logical communication ports (11,13) for transmitting and/or receiving the multimedia call, including at least one dynamically assigned port (31,35), and the first communication means includes a first firewall (26) configured to restrict certain types of communication between the first terminal (10) and the shared communications network (20), in which the method comprises the steps of: a) setting up a multimedia call over a shared communications network (20) with the first communications means and the second communications means between the first multimedia terminal (10) and the second multimedia via the first firewall (26); b) in the course of setting up a multimedia call, at least one of the terminals (10,12) sends a request (62) to the other of the terminals to open up one or more of the dynamic ports (35) in the terminal receiving said request; characterised in that the method comprises the steps of: c) including a proxy server (40) between the first terminal (10) and the second terminal (12) that acts for each terminal (10,12) as a proxy for the other terminal during the course of a multimedia call, the proxy server (40) having logical communication ports (33) for communication with the terminals (10,12) including one or more pre-assigned ports (41,55) for communication with the first terminal (10); d) configuring the first firewall (26) not to restrict communication between the first terminal (10) and the pre-assigned port(s) (41,55) of the proxy server (40); and e) configuring the proxy server (40) to receive and forward (64) the request(s) (62) to open up said dynamic port(s) (35) via one of its pre-assigned ports (41,55).
Priority Claims (1)
Number Date Country Kind
0018547.0 Jul 2000 GB
PCT Information
Filing Document Filing Date Country Kind
PCT/GB01/03308 7/24/2001 WO