Embodiments described herein relate generally to conference calling, and more specifically to systems and methods for facilitating conference calls using security tokens.
Some embodiments described herein make use of a mobile station. A mobile station is a two-way communication device with advanced data communication capabilities having the capability to communicate with other computer systems, and is also referred to herein generally as a mobile device. A mobile device may also include the capability for voice communications. Depending on the functionality provided by a mobile device, it may be referred to as a data messaging device, a two-way pager, a cellular telephone with data messaging capabilities a PDA, a Smartphone, a wireless Internet appliance, or a data communication device (with or without telephony capabilities). A mobile device communicates with other devices through a network of transceiver stations.
Most applications for use with such mobile devices have been designed to be stand-alone applications (that generally do not interact with other applications), with a centralized email server providing email, a telephony system providing voice services, an instant messenger service allowing short, informal chats, etc. However, it has been recognised that these services or tools may be enhanced and may improve efficiency if greater interaction between such services was facilitated.
Consider a situation in which clicking on an email while at home automatically initiated a call from the users enterprise PBX (Private Branch Exchange) to the email sender, or launched an IM (Instant Messaging) session from a problem tracking system to allow informal communications between a support engineer and the customer. This inter-working has become known as “unified communications”.
One way to implement a unified communications system within an enterprise may be through the introduction of proprietary protocols. “Glue” applications may be written to tie together the administration API (Application Programming Interface) published by one company with an equivalent API from another. However, such solutions require substantial effort to introduce inter-operability with services.
For a better understanding of embodiments described herein, and to show more clearly how they may be carried into effect, reference will now be made, by way of example, to the accompanying drawings in which:
The difficulty in implementing a unified communications system within an enterprise has been recognized and a protocol created that allows the establishment, control and release of sessions between users and servers in a generic and extensible fashion. The Session Initiation Protocol (SIP) has been designed and further enhanced through the IETF (Internet Engineering Task Force). The applicants have recognized that SIP provides a flexible environment that can be leveraged to bring unified communications to mobile devices.
SIP is an application-layer control (signalling) protocol for creating, modifying and terminating sessions with one or more participants. These sessions include Internet multimedia conferences, Internet telephone calls and multimedia distribution. Members in a session can communicate via multicast or via a mesh of unicast relations, or a combination of these.
SIP as defined in RFC 2543 and superseded by RFC 3261 is the IETF's standard for multimedia session management. SIP is an ASCII-based, application-layer control protocol that supports user mobility. It is used to establish, maintain, modify and terminate multimedia sessions between two or more end points. It is important to note that SIP provides the control plane for these sessions. The data plane, in SIP is described by Session Description Protocol (SDP). This contains information pertaining to the session itself (i.e. subject, time-to-live, media info). RTP is one of many (possible) transports which may be described by SDP (as carried in a corresponding SIP message). Real-time Transport Protocol in the context of SIP, would be an ‘out of band’ means for delivering audio and/or video. Note other SDP transports could include IP, UDP, H.320 etc.
There is no requirement that the data plane and control plane follow the same path through the IP domain.
The SIP protocol allows:
To aid the reader in understanding the implementation of SIP in a telephony application, reference is made to
During any one SIP session, a UA will function either as a UAC or a UAS but not as both simultaneously. SIP provides a means to establish, control and terminate one or more multimedia sessions. However, SIP itself is not an application but a platform on which applications can be built. A SIP application may provide simple voice calling functionality in a low-end (minimal featured) softphone, or large and complex functionality such as for an eLearning application that would involve the transmission of voice, video and slides to a multi-participant conference.
Embodiments described herein are generally directed to systems and methods for authenticating participants in a conference call.
In a broad aspect, there is provided a method of facilitating a conference call between a plurality of communication devices, the method comprising: providing a first primary communication device; wherein the first primary communication device comprises a first security token generator configured to generate first security tokens; providing a second primary communication device; providing a conference call controller; wherein the conference call controller is configured to receive and authenticate security tokens; establishing a first control link between the first primary communication device and the conference call controller; generating a first security token; communicating the first security token between the first primary communication device and the conference call controller via the first control link; authenticating the first security token; and establishing a media link between the first and second primary communication devices via the conference call controller. In some embodiments, the first control link may comprise a WiFi connection.
Further, in some embodiments, the second primary communication device comprises a second security token generator configured to generate second security tokens. In such embodiments, the method may further comprise: establishing a second control link between the second primary communication device and the conference call controller; generating a second security token; communicating the second security token between the second primary communication device and the conference call controller via the second control link; and authenticating the second security token. In some embodiments, the second control link may comprise a WiFi connection.
In some implementations, the media link may comprise a voice signal and/or a multimedia signal.
A computer-readable medium may also be provided which may comprise instructions executable on the conference call controller for implementing steps of the method(s).
Some further embodiments provide a method of facilitating a conference call between a plurality of communication devices, the method comprising: providing a first primary communication device; providing a second primary communication device; providing a conference call controller; wherein the conference call controller comprises a controller security token generator configured to generate security tokens; wherein the first primary communication device is configured to receive and authenticate security tokens; establishing a first control link between the first primary communication device and the conference call controller; generating a first security token; communicating the first security token between the conference call controller and the first primary communication device via the first control link; authenticating the first security token; and establishing a media link between the first and second primary communication devices via the conference call controller.
In certain implementations, the second primary communication device may be configured to receive and authenticate security tokens. In such implementations, the method may further comprise: establishing a second control link between the second primary communication device and the conference call controller; generating a second security token; communicating the second security token between the conference call controller and the second primary communication device via the second control link; and authenticating the second security token. The first and/or second control links may comprise a WiFi connection.
In some implementations, the media link may comprise a voice signal and/or a multimedia signal.
A computer-readable medium may also be provided which may comprise instructions executable on the conference call controller for implementing steps of the method(s).
In yet another aspect, a system may be provided for facilitating a conference call between a plurality of communication devices. The system comprises a first primary communication device, the first primary communication device comprising a first security token generator configured to generate first security tokens. The system also includes a conference call controller. The conference call controller is configured to: establish a first control link with a first primary communication device; to receive and authenticate security tokens; and to establish a media link between the first primary communication device and at least one second primary communication device, upon authenticating a first security token. The conference call controller and the first primary communication device are configured to communicate a first security token via the first control link. The conference call controller is also configured to establish a media link between the first and second primary communication devices subsequent to authenticating a first security token.
In some embodiments, the first primary communication device is configured for WiFi communication, and the conference call controller is also configured for WiFi communication. In such embodiments the first control link may comprise a WiFi connection.
The conference call controller may be operatively coupled to a telecommunications network.
The first (and in some instances the second) primary communication device(s) may comprise a portable or mobile communication device.
The media link may comprise a voice signal. In addition or in the alternative, the media link may comprise a multimedia signal. As well, the media link may comprise a telecommunications link.
Another aspect is directed to a system for facilitating a conference call between a plurality of communication devices. The system comprises a first primary communication device, the first primary communication device being configured to receive and authenticate security tokens; and a conference call controller configured to establish a first control link with a first primary communication device. The conference call controller comprises a controller security token generator configured to generate security tokens. The conference call controller and the first primary communication device are configured to communicate a first security token via the first control link. Additionally, the conference call controller is configured to establish a media link between the first primary communication device and at least one second primary communication device, subsequent to the first primary communication device authenticating a first security token.
In some embodiments, the first primary communication device is configured for WiFi communication, and the conference call controller is also configured for WiFi communication. In such embodiments the first control link may comprise a WiFi connection.
These and other aspects and features of various embodiments will be described in greater detail below.
To aid the reader in understanding the structure of a mobile device and how it communicates with other devices, reference is made to
Referring first to
Although the wireless network associated with mobile device 100 is a GSM/GPRS wireless network in one example implementation of mobile device 100, other wireless networks may also be associated with mobile device 100 in variant implementations. Different types of wireless networks that may be employed include, for example, data-centric wireless networks, voice-centric wireless networks, and dual-mode networks that can support both voice and data communications over the same physical base stations. Combined dual-mode networks include, but are not limited to, Code Division Multiple Access (CDMA) or CDMA2000 networks, GSM/GPRS networks (as mentioned above), and future third-generation (3G) networks like EDGE and UMTS. Some older examples of data-centric networks include the Mobitex™ Radio Network and the DataTAC™ Radio Network. Examples of older voice-centric data networks include Personal Communication Systems (PCS) networks like GSM and Time Division Multiple Access (TDMA) systems.
Microprocessor 102 also interacts with additional subsystems such as a Random Access Memory (RAM) 106, flash memory 108, display 110, auxiliary input/output (I/O) subsystem 112, serial port 114, keyboard 116, speaker 118, microphone 120, short-range communications 122 and other device subsystems 124.
Some of the subsystems of mobile device 100 perform communication-related functions, whereas other subsystems may provide “resident” or on-device functions. By way of example, display 110 and keyboard 116 may be used for both communication-related functions, such as entering a text message for transmission over network 200, and device-resident functions such as a calculator or task list. Operating system software used by microprocessor 102 is typically stored in a persistent store such as flash memory 108, which may alternatively be a read-only memory (ROM) or similar storage element (not shown). Those skilled in the art will appreciate that the operating system, specific device applications, or parts thereof, may be temporarily loaded into a volatile store such as RAM 106.
Mobile device 100 may send and receive communication signals over network 200 after required network registration or activation procedures have been completed. Network access is associated with a subscriber or user of a mobile device 100. To identify a subscriber, mobile device 100 requires a Subscriber Identity Module or “SIM” card 126 to be inserted in a SIM interface 128 in order to communicate with a network. SIM 126 is one type of a conventional “smart card” used to identify a subscriber of mobile device 100 and to personalize the mobile device 100, among other things. Alternatively, by way of example only, other types of “smart cards” which might be used may include an R-UIM (removable user identity module) or a CSIM (CDMA (code division multiple access) subscriber identity module) or a USIM (universal subscriber identity module) card. Without SIM 126, mobile device 100 is not fully operational for communication with network 200. By inserting SIM 126 into SIM interface 128, a subscriber can access all subscribed services. Services could include: web browsing and messaging such as e-mail, voice mail, Short Message Service (SMS), and Multimedia Messaging Services (MMS). More advanced services may include: point of sale, field service and sales force automation. SIM 126 includes a processor and memory for storing information. Once SIM 126 is inserted in SIM interface 128, it is coupled to microprocessor 102. In order to identify the subscriber, SIM 126 contains some user parameters such as an International Mobile Subscriber Identity (IMSI). An advantage of using SIM 126 is that a subscriber is not necessarily bound by any single physical mobile device. SIM 126 may store additional subscriber information for a mobile device as well, including datebook (or calendar) information and recent call information.
Mobile device 100 is a battery-powered device and includes a battery interface 132 for receiving one or more rechargeable batteries 130. Battery interface 132 is coupled to a regulator (not shown), which assists battery 130 in providing power V+ to mobile device 100. Although current technology makes use of a battery, future technologies such as micro fuel cells may provide the power to mobile device 100.
Microprocessor 102, in addition to its operating system functions, enables execution of software applications on mobile device 100. A set of applications that control basic device operations, including data and voice communication applications, will normally be installed on mobile device 100 during its manufacture. Another application that may be loaded onto mobile device 100 would be a personal information manager (PIM). A PIM has functionality to organize and manage data items of interest to a subscriber, such as, but not limited to, e-mail, calendar events, voice mails, appointments, and task items. A PIM application has the ability to send and receive data items via wireless network 200. PIM data items may be seamlessly integrated, synchronized, and updated via wireless network 200 with the mobile device subscribers corresponding data items stored and/or associated with a host computer system. This functionality creates a mirrored host computer on mobile device 100 with respect to such items. This can be particularly advantageous where the host computer system is the mobile device subscribers office computer system.
Additional applications may also be loaded onto mobile device 100 through network 200, auxiliary I/O subsystem 112, serial port 114, short-range communications subsystem 122, or any other suitable subsystem 124. This flexibility in application installation increases the functionality of mobile device 100 and may provide enhanced on-device functions, communication-related functions, or both. For example, secure communication applications may enable electronic commerce functions and other such financial transactions to be performed using mobile device 100.
Serial port 114 enables a subscriber to set preferences through an external device or software application and extends the capabilities of mobile device 100 by providing for information or software downloads to mobile device 100 other than through a wireless communication network. The alternate download path may, for example, be used to load an encryption key onto mobile device 100 through a direct and thus reliable and trusted connection to provide secure device communication.
Short-range communications subsystem 122 provides for communication between mobile device 100 and different systems or devices, without the use of network 200. For example, subsystem 122 may include an infrared device and associated circuits and components for short-range communication. Examples of short range communication would include standards developed by the Infrared Data Association (IrDA), Bluetooth, and the 802.11 family of standards developed by IEEE.
In use, a received signal such as a text message, an e-mail message, or web page download will be processed by communication subsystem 104 and input to microprocessor 102. Microprocessor 102 will then process the received signal for output to display 110 or alternatively to auxiliary I/O subsystem 112. A subscriber may also compose data items, such as e-mail messages, for example, using keyboard 116 in conjunction with display 110 and possibly auxiliary I/O subsystem 112. Auxiliary subsystem 112 may include devices such as: a touch screen, mouse, track ball, infrared fingerprint detector, or a roller wheel with dynamic button pressing capability. Keyboard 116 is an alphanumeric keyboard and/or telephone-type keypad. A composed item may be transmitted over network 200 through communication subsystem 104.
For voice communications, the overall operation of mobile device 100 is substantially similar, except that the received signals would be output to speaker 118, and signals for transmission would be generated by microphone 120. Alternative voice or audio I/O subsystems, such as a voice message recording subsystem, may also be implemented on mobile device 100. Although voice or audio signal output is accomplished primarily through speaker 118, display 110 may also be used to provide additional information such as the identity of a calling party, duration of a voice call, or other voice call related information.
Referring now to
The particular design of communication subsystem 104 is dependent upon the network 200 in which mobile device 100 is intended to operate, thus it should be understood that the design illustrated in
The wireless link between mobile device 100 and a network 200 may contain one or more different channels, typically different RF channels, and associated protocols used between mobile device 100 and network 200. An RF channel is a limited resource that must be conserved, typically due to limits in overall bandwidth and limited battery power of mobile device 100.
When mobile device 100 is fully operational, transmitter 152 is typically keyed or turned on only when it is sending to network 200 and is otherwise turned off to conserve resources. Similarly, receiver 150 is periodically turned off to conserve power until it is needed to receive signals or information (if at all) during designated time periods.
Referring now to
In a GSM network, MSC 210 is coupled to BSC 204 and to a landline network, such as a Public Switched Telephone Network (PSTN) 222 to satisfy circuit switched requirements. The connection through PCU 208, SGSN 216 and GGSN 218 to the public or private network (Internet) 224 (also referred to herein generally as a shared network infrastructure) represents the data path for GPRS capable mobile devices. In a GSM network extended with GPRS capabilities, BSC 204 also contains a Packet Control Unit (PCU) 208 that connects to SGSN 216 to control segmentation, radio channel allocation and to satisfy packet switched requirements. To track mobile device location and availability for both circuit switched and packet switched management, HLR 212 is shared between MSC 210 and SGSN 216. Access to VLR 214 is controlled by MSC 210.
Station 206 is a fixed transceiver station. Station 206 and BSC 204 together form the fixed transceiver equipment. The fixed transceiver equipment provides wireless network coverage for a particular coverage area commonly referred to as a “cell”. The fixed transceiver equipment transmits communication signals to and receives communication signals from mobile devices within its cell via station 206. The fixed transceiver equipment normally performs such functions as modulation and possibly encoding and/or encryption of signals to be transmitted to the mobile device in accordance with particular, usually predetermined, communication protocols and parameters, under control of its controller. The fixed transceiver equipment similarly demodulates and possibly decodes and decrypts, if necessary, any communication signals received from mobile device 100 within its cell. Communication protocols and parameters may vary between different nodes. For example, one node may employ a different modulation scheme and operate at different frequencies than other nodes.
For all mobile devices 100 registered with a specific network, permanent configuration data such as a user profile is stored in HLR 212. HLR 212 also contains location information for each registered mobile device and can be queried to determine the current location of a mobile device. MSC 210 is responsible for a group of location areas and stores the data of the mobile devices currently in its area of responsibility in VLR 214. Further VLR 214 also contains information on mobile devices that are visiting other networks. The information in VLR 214 includes part of the permanent mobile device data transmitted from HLR 212 to VLR 214 for faster access. By moving additional information from a remote HLR 212 node to VLR 214, the amount of traffic between these nodes can be reduced so that voice and data services can be provided with faster response times and at the same time requiring less use of computing resources.
SGSN 216 and GGSN 218 are elements added for GPRS support; namely packet switched data support, within GSM. SGSN 216 and MSC 210 have similar responsibilities within wireless network 200 by keeping track of the location of each mobile device 100. SGSN 216 also performs security functions and access control for data traffic on network 200. GGSN 218 provides internetworking connections with external packet switched networks and connects to one or more SGSN's 216 via an Internet Protocol (IP) backbone network operated within the network 200. During normal operations, a given mobile device 100 must perform a “GPRS Attach” to acquire an IP address and to access data services. This requirement is not present in circuit switched voice channels as Integrated Services Digital Network (ISDN) addresses are used for routing incoming and outgoing calls. Currently, all GPRS capable networks use private, dynamically assigned IP addresses, thus requiring a DHCP server 220 connected to the GGSN 218. There are many mechanisms for dynamic IP assignment, including using a combination of a Remote Authentication Dial-In User Service (RADIUS) server and DHCP server. Once the GPRS Attach is complete, a logical connection is established from a mobile device 100, through PCU 208, and SGSN 216 to an Access Point Node (APN) within GGSN 218. The APN represents a logical end of an IP tunnel that can either access direct Internet compatible services or private network connections. The APN also represents a security mechanism for network 200, insofar as each mobile device 100 must be assigned to one or more APNs and mobile devices 100 cannot exchange data without first performing a GPRS Attach to an APN that it has been authorized to use. The APN may be considered to be similar to an Internet domain name such as “myconnection.wireless.com”.
Once the GPRS Attach is complete, a tunnel is created and all traffic is exchanged within standard IP packets using any protocol that can be supported in IP packets. This includes tunneling methods such as IP over IP as in the case with some IPSecurity (IPsec) connections used with Virtual Private Networks (VPN). These tunnels are also referred to as Packet Data Protocol (PDP) Contexts and there are a limited number of these available in the network 200. To maximize use of the PDP Contexts, network 200 will run an idle timer for each PDP Context to determine if there is a lack of activity. When a mobile device 100 is not using its PDP Context, the PDP Context can be deallocated and the IP address returned to the IP address pool managed by DHCP server 220.
Referring now to
As illustrated in
To support multiple SIP applications on a mobile device 100A, 100B, 100C a SIP UA API (SIP User Agent Application Programming Interface) is preferably introduced. This API abstracts the applications from the SIP implementation, thus removing the need for the application programmer to know about the details of the protocol.
The SIP UA API will provide methods to construct, control and delete dialogs, a dialog being a single session between the device and some endpoint. For example, in a VoIP call a dialog is a call leg between the device and the PBX. A dialog may have none, one or multiple media streams associated. For example, a video/audio call will have two bidirectional media streams.
In addition the SIP AU API provides means to register, reregister and deregister SIP applications from the associated registrar server. This will be implemented in such a way to abstract the details of the registration from the application, so the application is unable to modify the registration parameters or the registrar information.
Finally the SIP UA API will provide a set of methods to allow applications a way to subscribe for events from a remote server and to notify a remote server of local application events.
The connectivity of certain embodiments of the mobile devices 100A, 100B, 100C are also illustrated in
A Service Delivery Platform (SDP) 412 is located within the enterprise LAN 410 behind the corporate firewall 414. A SIP enabled mobile device 100A, 100B, 100C communicates with the SDP 412 usually over the GME connection either through the Relay 416 or directly with the Mobile Enterprise Server (MES) 418 if operating in serial bypass mode (e.g. WLAN Enterprise Data). On the other side of the firewall 414, the SDP 412 communicates with existing enterprise servers.
The SDP 412 typically will be involved in the control flow. The media flow, the RTP session in the embodiment illustrated in
The SDP 412 is designed to be a platform upon which any number of applications may be executed. The control towards the device 100A, 100B, 100C will typically utilize a custom or enterprise-specific SIP (ESSIP), but the SDP 412 may utilize different protocols in communicating with other servers. This is illustrated in
The MES 418 may comprise various software and/or hardware elements for administering certain communication functionality of the mobile devices 100A, 100B, 100C. For example, the MES 418 may comprise an administration server 442, a mobile data server 444, a message server 268 (discussed in greater detail below), a database 419, a security module 446 which may be configured to encrypt and decrypt data and/or messages, an IM server 452 and a media server 454.
LAN 410 may comprise a number of network components connected to each other by LAN connections. For instance, one or more users' desktop computers (not shown), each of which may comprise a cradle, may be situated on LAN 410. Cradles for mobile device 100A, 100B, 100C may be coupled to a desktop computer by a serial or a Universal Serial Bus (USB) connection, for example. Such cradles may facilitate the loading of information (e.g. PIM data, private symmetric encryption keys to facilitate secure communications between mobile device 100A, 100B, 100C and LAN 410) from a desktop computer to mobile device 100A, 100B, 100C, and may be particularly useful for bulk information updates often performed in initializing mobile device 100A, 100B, 100C for use. The information downloaded to mobile device 100A, 100B, 100C may include certificates used in the exchange of messages. It will be understood by persons skilled in the art that user computers may also be connected to other peripheral devices not explicitly shown in
Furthermore, only a subset of network components of LAN 410 are shown in
In one example implementation, LAN 410 may comprise a wireless VPN router [not shown] to facilitate data exchange between the LAN 410 and mobile device 100B, 100C. A wireless VPN router may permit a VPN connection to be established directly through a specific wireless network to mobile device 100A, 100B, 100C. With the implementation of Internet Protocol (IP) Version 6 (IPV6) into IP-based wireless networks, enough IP addresses will be available to dedicate an IP address to every mobile device 100B, 100C, making it possible to push information to a mobile device 100B, 100C at any time. An advantage of using a wireless VPN router is that it could be an off-the-shelf VPN component, not requiring a separate wireless gateway and separate wireless infrastructure to be used. A VPN connection might utilize Transmission Control Protocol (TCP)/IP or User Datagram Protocol (UDP)/IP connection to deliver the messages directly to mobile device 100A, 100B, 100C in such implementation.
The communication system 400 shall preferably comprise the VoIP application 436 which is configured to utilize SIP to provide VoIP functionality. The SDP 412 is configured to route VoIP ESSIP requests from the mobile device 100B, 100C to the VoIP application 436, thereby enabling IP calling from a mobile device 100B, 100C connected on the WLAN to an existing SIP enabled gateway or PBX server 418A, 418B, 418C, 418D, 418E. For example, VoIP functionality may include basic calling features such as make and take a VoIP call, hold and resume, transfer (attended and semi attended), ad-hoc conferencing, among others.
The VoIP telephony functionality in some embodiments may be limited to those devices (such as, for example, devices 100B, 100C) that are connected to the WLAN. The use of VPN may allow devices 100B, 100C that are outside the enterprise to access enterprise VoIP services in a secure fashion.
The inventors have recognized the non-uniform way each third-party manufacturers PBX (or other gateway server) 418A, 418B, 418C, 418D, 418E uses SIP. Typically, each such gateway 418A, 418B, 418C, 418D, 418E uses its own version of SIP call flow to establish, control and release calls. As a result, the SIP call flow between the endpoint (typically a communication device, such as for example, mobile device 100A, 100B, 100C) and the PBX (or gateway) 418A, 418B, 418C, 418D, 418E needs to be customized for that particular PBX (or gateway) 418A, 418B, 418C, 418D, 418E.
The VoIP application 436 incorporates a customized Back-to-Back User Agent (B2BUA) (not shown) in the Service Delivery Platform 412, thereby positioned between the mobile device 100B, 100C and the gateway 418A, 418B, 418C, 418D, 418E. The B2BUA abstracts the details of the PBX call flows, registration, call control and configuration from the mobile device 100B, 100C. The B2BUA implements a defined set of ESSIP call flows to the mobile device 100B, 100C that can support a basic set of telephony procedures. The B2BUA also satisfies the SIP call flows that are specific to the gateway 418A, 418B, 418C, 418D, 418E for the same set of telephony procedures.
As each manufacturers gateway server 418A, 418B, 418C, 418D, 418E typically requires a different set of call flows for the same feature, the B2BUA encapsulates the gateway 418A, 418B, 418C, 418D, 418E specifics for the basic calling feature set into a PBX Abstraction Layer (PAL), each gateway 418A, 418B, 418C, 418D, 418E having its own specific PAL.
In addition, if necessary the B2BUA can support other PBX-specific feature extensions, which may be made available to communication devices coupled to the network 410, such as the mobile devices 110B, 100C. These extensions are handled through a PBX Extension Layer (PEL) in the B2BUA, which, like the PAL, abstracts the complexities of each PBX 418A, 418B, 418C, 418D, 418E for a given extension feature set. However, as the extension feature sets between different PBX 418A, 418B, 418C, 418D, 418E will not be the same, it may not be possible to develop a common user interface (UI). Accordingly, a plug-in application may be downloaded to the communication devices coupled to the network 410, such as the mobile devices 110B, 100C, to extend the UI and to provide communication device the necessary SIP Application information on how to handle new features. This plug-in is the Menu and Signalling Extension Plug-in (MSP). As will be understood, the PAL, PEL and MSP are all part of Extensible Signalling Framework (ESF).
With respect to the instant messaging services, the MES 418 may comprise an XMPP2SIMPLE (Extensible Messaging and Presence Protocol to SIP Instant Messaging and Presence Leveraging Extensions) SIP application to enable integration of SIP with an IM session. For example, a voice call may be established over VoIP or over a traditional circuit switched medium directly from an IM session screen. The voice connection may be requested by either party in the IM session. As well as voice, the XMPP2SIMPLE application may also interface SIMPLE (SIP Instant Messaging and Presence Leveraging Extensions) based IM systems to the IM internal architecture of the mobile devices 100A, 100B, 100C.
The MES 418 may use an XMPP (Extensible Messaging and Presence Protocol) based API (Application Programming Interface) over an IPe (IP endpoint) secured socket provided by the XMPP2SIMPLE Application to request that SIP functions be accessed. This API may provide any user identifications that are required and routing information to the VoIP gateway. The gateway might be the VoIP PBX 418D or it might be a VoIP enabled server. The SDP 412 establishes a SIP session to the device 100A, 100B, 100C and a second to the gateway (such as the PBX 418D). The RTP media flow is routed directly to the VoIP gateway (such as the PBX 418D).
Consider a situation in which an IM session is in process between a first mobile device eg. 100B, and a second mobile device 100C. The session may use the enterprise-specific IM protocol between the devices 100B, 100C and an IM Proxy Server in the MES 418, and the third-party IM protocol between the IM Proxy Server and the IM server (eg. IM PBX 418A).
At some point in time, either device 100B, 100C, may request that the session be converted into a voice connection. The MES IM Server 452 requests over the XMPP based API that XMPP2SIMPLE set up an SIP based call. For each mobile device 100B, 100C, the XMPP2SIMPLE acts as a B2BUA, setting up one SIP session with the mobile device 100B, 100C using the ESSIP flows, and a second session with the IM Server 418A using the IM Server 418A specific SIP. These connections are then manipulated to connect the RTP media flow between the two mobile devices 100B, 100C. Communication may also be established between mobile devices 100A, 100B, 100C and other networked devices, such as, for example, computer 450 (which may be equipped to provide voice communication, for example using VoIP) and electronic “whiteboard” 456 (via the internet 224), and telephones 18 (via the PSTN).
Alternatively, a call may be established over circuit switched media. For example, an IM session running on a WAN mobile device 100A may request the establishment of a voice connection. In this case the MES IM Server 452 could request directly to the Fixed Mobile PBX 418E for a circuit switched call, or through the SDP 412 which would establish two circuit switched call legs, one to each party, via the PBX 418E.
The communication system 400 may also provide for certain applications to interact directly with other application services, e.g. applications that provide media streaming capabilities such as e-learning or MP3/video playback, downloading and sharing. Consider a scenario in which an enterprise-wide announcement is to be made. Here the announcement is stored in a MES service which proceeds to call out to all enterprise mobile devices 100A, 100B, 100C.
These services may require a multimedia session to be established between a server and the ESSIP enabled devices 100A, 100B, 100C. In addition there are a number of other servers such as Lightweight Directory Access Protocol (LDAP) servers, location servers, a database application, or an extensible markup language (XML) application. These application services provide back-end services such as directory, authentication, and billing services.
In this case the MES media application or server 454 might again be configured to use an API to set up the multimedia session or to obtain information from the SDP 412. The SDP 412 acts as a UAS, controlling the session and setting the RTP or similar stream directly to the MES Media Server 454. Once the multimedia streaming session has finished, the MES Media Server 454 terminates the SIP session via an API call.
The communication system 400 may also be configured with a voice mobility module 460 (such as the Voice Mobility Management system distributed by Ascendent Systems) which may comprise software and hardware to offer voice mobility anchored at the network between WLAN 404 and cellular 402 networks. The system 400 may offer enhancements such as single number in and out of the enterprise, conferencing, single voice mailbox, etc.
The voice mobility module 460 may use the SIP server through CSTA (Computer Supported Telecommunication Applications) interface that allows first party call control. The interface between the SDP 412 and the PBX 418D, 418E may be SIP Trunk.
In this environment, the voice mobility module 460 controls the media flow passing over the RTP session.
The SDP 412 may interface to the MES 418 for signaling to the device 100A, 100B, 100C and database support, and to the application servers such as the gateway or PBX servers 418A, 418B, 418C, 418D, 418E for application support. This section shall describe in more detail how those interfaces are to be managed.
The SDP 412 may interface to the MES 418 through an ESSIP Connector, a service that communicates directly with a Dispatcher. The ESSIP Connector terminates the GME protocol and is responsible for pushing the SIP signals to an SIP Server (not shown) over a TLS secured socket. This arrangement requires that a new content type be created for SIP, and allows a new ESSIP service book to be pushed to a mobile device 100A, 100B, 100C.
On the other side of the SIP/TLS link, the SDP 412 may also comprise a Unified Communications (UC) Server (not shown). The UC Server executes the SIP applications and communicates to the gateway and PBX servers 418A, 418B, 418C, 418D, 418E, MES IM Server 452, IM server 418A, and voice mobility module 460, etc.
Any number of ESSIP Connectors may support access a single UC Server, the exact number being limited by the configuration of the components over hardware platforms. All configurations using a single UC Server must be connected to the same mobile device database domain.
Both the ESSIP Connector and the UC Server may read data for configuration from the database 419 via an SDP MES Management Server using a web services interface. This component also offers the SDP administration UI.
The UC Server stores information on the MES database 419, which is used at reset to configure the UC Services and users. The following items may be included in the basic server configuration: Sip Realm; Sip Domain Name; Sip Server Address; Sip Server Port; Sip Server Transport; Proxy Server Address; Proxy Server Port; and Proxy Server Transport.
The following items may also be included as part of the database 419 per user: Sip User Display Name; Sip User ID; Sip User Password; Sip Realm; Sip Registration Timeout; Sip Local Port; Sip RTP Media Port; Sip Domain Name; Sip Server Type; Sip Server Address; Sip Server Port; Sip Server Transport; Emergency Number; Sip Secondary Server Type; Sip Secondary Server Address; Sip Secondary Server Port; and Sip Secondary Server Transport.
The UC Server may also require notification from the database 419 when an administrator adds a user into the system 400 so that it can update the internal table without scanning the whole database 419.
The SDP Management Server (SDP MS) (not shown) may abstract the MES database 419 from the SDP 412 components and provides a user interface for administration purposes. The ESSIP Connector and the UC Server will both obtain configuration through the SDP BMS. As the users of UC Services will also be the general MES users, then those configuration items that are specific to each user will require additions to existing user records.
The communications network 400 is preferably also provided with a conference call controller module 440 configured to facilitate and control conference calls between 2 or more parties. As will be discussed in greater detail, below, the controller module 440 may comprise an application or other programming and is configured to coordinate the conference call functionality and to facilitate the exchange of voice and other media between conference call participants. The controller module 440 may comprise conference application 440A and conference services modules 400B and may reside in or otherwise form part of the SDP 412. A corresponding conference application 140 (which may be programmed to indirectly interact with the conference application 440A) resident on the mobile device 100 may also be provided to facilitate the provision of conference call functionality to the user. Such conference application 140 may, for example, manage the generation of GUIs (graphical user interface) and the interaction of the user in a conference call, as well as local security features, etc. A token application 142, operatively coupled to the conference application 140 may also be provided.
In certain embodiments in which the mobile device 100 is intended to generate security tokens, the token application 142 will comprise a security token generator 144. The generator 144 is preferably configured to generate a security token or “password” (or otherwise sufficiently unique data) which can be authenticated as originating from the generator 144. Once the token has been generated by the generator 144, it may be communicated via the communication subsystem 104. As will be understood, a security token typically uniquely identifies (and hence authenticates) the device generating the token.
As will be understood, for example, the generator 144 may utilize an algorithm which generates a “One Time Password” (OTP) based in part on the time and/or date. An authenticator will utilize a corresponding algorithm and must be synchronized with the generator 144, in order to confirm the authenticity of the OTP. As will be understood, different embodiments may use alternate techniques to generate security tokens such as OTPs.
By way of example, RSA's SECURID™ hardware tokens utilize time synchronization in generating OTPs which may be used to identify and/or authenticate a user of a SECURID™ hardware token.
The conference application 440A may be configured to receive and authenticate security tokens generated by, for example, mobile devices 100A, 100B, 100C. The conference application 440A may alternately or in addition comprise a conference security token generator 440C configured to generate security tokens. In some embodiments, the token application 142 will be configured to receive and authenticate security tokens, for example, as may be generated by the conference security token generator 440C.
Messages intended for a user of mobile device 100 are initially received by a message server 268 of LAN 410, which may form part of the MES 418. Such messages may originate from any of a number of sources. For instance, a message may have been sent by a sender from a computer 450 within LAN 410, from a different mobile device [not shown] connected to wireless network 200 (or 404) or to a different wireless network, or from a different computing device (such as computer 450) or other device capable of sending messages, via the shared network infrastructure 224, and possibly through an application service provider (ASP) or Internet service provider (ISP), for example.
Message server 268 typically acts as the primary interface for the exchange of messages, particularly e-mail messages, within the organization and over the shared network infrastructure 224. Each user in the organization that has been set up to send and receive messages is typically associated with a user account managed by message server 268. One example of a message server 268 is a Microsoft Exchange™ Server. In some implementations, LAN 410 may comprise multiple message servers 268. Message server 268 may also be adapted to provide additional functions beyond message management, including the management of data associated with calendars and task lists, for example.
Referring now to
In one embodiment, at least some of the steps of the method are performed by a conference call application that executes and resides on a conference call controller (e.g. conference call controller 440 of
Method 500 commences at Block 510 in which a first primary communication device 610 has been provided. For example, mobile communication device 100B may be selected for use as a first primary communication device 610 in a conference call as contemplated herein. The first primary communication device 610 comprises a first security token generator 144B. Similarly, a second primary communication device 612, for example mobile communication device 100C, may be provided (Block 512), which comprises a second security token generator 144C. A conference call controller, such as controller 440, may also be provided (Block 514). As noted previously, the conference call controller is configured to receive and authenticate security tokens generated by the first and second security token generators 144B, 144C.
The conference call may then be initiated, typically utilizing both SIP and RTP protocols, as discussed above (Block 516). A first control link (as indicated by line 614 in
A first security token 620 may then be generated by the first security token generator 144B and communicated to the conference call controller 440 via the first control link for verification (Block 518). In some embodiments, the user of the first primary communication device 610 might also be required to input a personal identification number (PIN), or speak a particular word or phrase which could be analyzed by a voice recognizer on the conference call controller 440, for additional identification/security.
The first security token 620 may then be authenticated by the conference call controller 440 (Block 520). As will be understood, if the first security token 620 is determined not to be authentic, the steps of the method will discontinue and the conference call will not proceed.
A media link (as represented by line 616 in
While such media link 616 may comprise a standard voice stream as may be established for typical voice telephony or other communications, as will be understood, the media link 616 may comprise other types of media data signals (for example, for multimedia presentations, or videophone applications). In some embodiments, preferably the media link 616 is encrypted.
In some embodiments, a second control link (as indicated by line 618 in
In some implementations, a second security token 624 may also be generated by the second security token generator 144C and communicated to the conference call controller 440 via the second control link for verification (Block 526). It should be understood that not every embodiment will require the authentication of a second primary communication device 612 in order for a conference call involving such second primary communication device 612, to proceed.
The second security token 624 may then be authenticated by the conference call controller 440 (Block 528). In some embodiments, the creation of the second control link, and the generation and authentication of the second security token 624 (as discussed in relation to Blocks 522-526) may be required to be completed prior to the establishing of the media link 616 in Block 522.
In embodiments in which multiple control links eg. 614, 618 are established between multiple communication devices 610, 612, one control link (eg. second control link 618) may be designated as the moderator link 615. Typically, the intended moderators identity will be one of the parameters of the conference call data, and may by default be assigned to a communication device 610, 612 initiating the conference call (if appropriate). The communication device 612 having the moderator control link 615 may be provided with top level control over the conference call and amending its parameters, including for example, adding or removing parties as necessary or amending privilege levels, or even assigning/delegating the moderator privileges. So for example, if the communication device 612 having the moderator control link 615 assigns the moderator privileges to communication device 610, the control link 615 may shift to the first control link 614, thereby providing the user of the first primary communication device 610 with the moderator privileges to control the conference call.
Once the media link 616 has been established, users of the first and second primary communication devices 610, 612, may communicate with each other.
Referring now to
Method 500′ commences at Block 510′ in which a first primary communication device 610 has been provided. For example, mobile communication device 100B may be selected for use as a first primary communication device 610 in a conference call as contemplated herein. The first primary communication device 610 comprises a first token application 142B, configured to receive and authenticate security tokens. Similarly, a second primary communication device 612, for example mobile communication device 100C, may be provided (Block 512′), which comprises a second token application 142C configured to receive and authenticate security tokens. A conference call controller, such as controller 440, may also be provided (Block 514′). As noted previously, in an embodiment implementing the method 500′, the conference call controller 440 is configured with a conference token generator 440C to generate and communicate security tokens.
The conference call may then be initiated, typically utilizing both SIP and RTP protocols, as discussed above (Block 516′). A first control link (as indicated by line 614 in
A first security token 620 may then be generated by the controller 440 and communicated to the first primary communication device 610 via the first control link 614 for verification (Block 518′). The first security token 620 may then be authenticated by the first token application 142B (Block 520′).
A media link (as represented by line 616 in
As previously noted, while such media link 616 may comprise a standard voice stream as may be established for typical voice telephony or other communications, the media link 616 may comprise other types of media data signals (for example, for multimedia presentations, or videophone applications). In some embodiments, preferably the media link 616 is encrypted.
In some embodiments, a second control link (as indicated by line 618 in
In some implementations, a second security token 624 may also be generated by the controller 440 and communicated to the second primary communication device 610 via the second control link for verification (Block 526′).
The second security token may then be authenticated by the second token application 142C (Block 528′). The authentication of the second security token 624 may also be required to be established prior to the establishing of the media link 616 in Block 522′.
Once the media link 616 has been established, users of the first and second primary communication devices 610, 612, may communicate with each other.
As will be understood, while two primary communication devices 610, 612 were illustrated and described as participating in the conference call, additional communication devices may also participate in the conference call.
As will also be understood, while the communication system and embodiments described herein have been illustrated as utilizing SIP, it should be understood that other protocols (including those which may be developed in the future) may be utilized for establishing and controlling sessions as contemplated herein. In addition to “other protocols” it is possible that some embodiments may utilize mediation layers (eg. JAIN/SIP or JAIN/CC) to establish and control sessions as contemplated herein. As well, other embodiments may utilize other signaling mechanisms, such as IMS, SS7, ISDN and H323.
As will further be understood, while the communication system and embodiments described herein have been illustrated as requiring the generation and authentication of at least one security token in order to establish a conference call (and that not all devices need to be authenticated via a security token in order to participate in a conference call), in some embodiments the location of a mobile device 100 may be used to determine whether the authentication of a security token is required in order to participate in a conference call. For example, if the mobile device 100 is in a certain geographical location (eg. in close proximity to, and networked with, LAN 410), the use of a security token to authenticate the device 100 may not be required.
Some or all of the steps of the method of facilitating a conference call in accordance with any of the embodiments described herein may be provided as executable software instructions stored on computer-readable media, which may include transmission-type media.
The invention has been described with regard to a number of embodiments. However, it will be understood by persons skilled in the art that other variants and modifications may be made without departing from the scope of the invention as defined in the claims appended hereto.