The present invention relates to the communications field. More particularly, the present invention is directed to methods, apparatus, and systems which allow an end-point device, e.g., a video telephony device, to resolve the simultaneous utilization of multiple mutually exclusive signaling protocols into a single Internet protocol (IP) telephony connection.
High-speed data-over-cable networks have high bandwidth capacity and deliver a mixture of analog and digital television and toll-quality telephony. Programming provided by cable service companies flows downstream to users over a cable operator's network (“the cable network”). The cable network includes a headend (the transmission source), a distribution network, and a set-top box located at the point of service. Information is carried within the distribution network using coaxial cable, fiber-optic cable, Ethernet cable (currently “Category 5” cable), microwave communication, satellite communication, and/or wireless RF communication.
A two-way cable system, allowing information to flow both downstream and upstream, can be implemented using, e.g., cable modems. An Internet service provider (ISP) gateway connects the cable network to the Internet, allowing two way communications with users that are not part of the cable network. Information flowing to and from the ISP gateway must be packetized and conform to standard Internet protocols.
Internet protocol (IP) telephony allows individuals in different locations to communicate with each other over an IP network, just as users have traditionally communicated over voice telephones using Public Switched Telephone Networks (PSTN). Additionally, IP telephony may include a combination of video, still image, and data information during a communication session. Voice telephony involves the recordation, transmission, reception, and issuance of sound. Video telephony entails the communication of both dynamic visual information and audio information and may include additional data streams for still images, slides, documents, and computer files.
Early video telephony systems—also referred to as videoconferencing systems—were expensive and required large amounts of communication bandwidth. Introduction of new audio and video compression techniques and the advent of high-speed communications networks, such as the Internet, have allowed video telephony to become more economical and popular.
Video conferencing systems have been primarily room-based, where participants go to a specially equipped conference room. In more advanced video telephony applications, it is desired to provide individual stations where participants can engage in calls at their desk, in their home, or at any other private or public location. With the proliferation of advanced wireless networks, wireless video telephony is beginning to emerge as a viable option.
A traditional video telephony system consists of video terminals for initiating, transmitting, receiving, and displaying a communication session and a network for connecting the video terminals, as illustrated in
A typical video terminal consists of an audio input device (microphone) and a video input device (camera) for capturing the image and sound of a user and his surroundings. An optional data input device (such as a hard drive containing computer files or a digital scanner for still pictures and documents) is used to acquire non-A/V information.
An encoder is used to convert analog information, such as audio and analog video, into digital information for transmission over the network. Once the digital information is received by another video terminal, a decoder is used to convert the digital information back into analog representations. These analog representations are then displayed to a party using speakers (audio) and monitors (video). A device which both encodes and decodes signals is referred to as a codec. For most signaling protocols, codec devices are defined by protocol standards.
Because A/V signals occur in real-time, it is important that the network provide a low and predictable delay connection, referred to as Quality of Service (QoS). There are two basic types of networks for transmitting video telephony information: (1) circuit-switched networks and (2) packet-switched networks. Circuit-switched networks, such as integrated services digital network (ISDN) and general switched telephone networks (GSTN), allocate a dedicated amount of bandwidth and a predictable delay connection.
Packet-switched networks, such as local area networks (LAN) and the Internet, break input data streams into uniform data packets and append addressing information, sequence counts, and error controls. Each packet is transmitted independently through a shared non-dedicated bandwidth network. At the receiving end, the packets are checked for errors, re-sequenced as necessary, and combined into an output data stream.
Several different types of signaling protocols are used to establish connections (e.g., point-to-point conversations) between packet-switched point-to-point devices. The most common signaling protocols are the Session Initiation Protocol (SIP), the H.323 protocol, and PacketCable network-based call signaling (NCS). However, these protocols are mutually exclusive, as they were designed independently of each other. As currently utilized, these protocols are not interoperable. Other such protocols will be apparent to those skilled in the art.
Each information stream (audio/video/data) of a video telephony session requires a negotiated codec. Video telephony requires the use of at least two codecs: one to handle the audio information stream and one to handle the video information stream. Some signaling protocols, such as SIP and H.323, allow for the negotiation of multiple codecs during a communication session. However, PacketCable NCS only supports one codec (audio) and, therefore, cannot be used to establish a video telephony connection. Accordingly, PacketCable NCS, in its present form, is not a viable protocol for video telephony.
Notwithstanding this issue, several cable service companies are striving to provide customers with telephone service based on the PacketCable NCS signaling protocol. Eventually, these cable companies may wish to provide video telephony, requiring the use of another, mutually exclusive, signaling protocol.
It would be advantageous to provide methods and systems for combining these mutually exclusive signaling protocols into a single IP telephony connection. It would also be advantageous if the receiving end-point device was able to resolve conflicts between the mutually exclusive signaling protocols without user intervention and without creating extensions to the signaling protocols. This would allow a cable service company to offer video telephony to its customers as an extension of its PacketCable NCS telephone service rather than as a completely different service.
The methods, apparatus, and systems of the present invention provide the foregoing and other advantages.
The present invention provides methods, systems, and apparatus that allow at least two end-point IP telephony devices to combine multiple data streams conforming to mutually-exclusive signaling protocols into a single video telephony connection. An audio-only data stream conforming to an audio-only signaling protocol (such as PacketCable NCS) is combined with at least one video data stream conforming to one or more video signaling protocols (such as SIP or H.323) to produce a single IP telephony connection.
Methods, systems, and apparatus are provided to establish an audio portion, a video portion, and optional data stream portions of a single IP telephony session. In a point-to-point communication session an audio codec is negotiated between the originating end-point and the terminal end-point according to a voice signaling protocol. A voice-only call management server (CMS or “call agent”) is used to establish the audio portion of the session. According to a video signaling protocol, the end-points negotiate a video codec and a video call agent establishes the video portion of the session. The establishment of the video portion may occur simultaneously or subsequent to the establishment of the audio portion. The end-points are responsible for resolving conflicts between the voice and video signaling protocols.
The present invention will hereinafter be described in conjunction with the appended drawing figures, wherein like numerals denote like elements, and:
The ensuing detailed description provides preferred exemplary embodiments only, and is not intended to limit the scope, applicability, or configuration of the invention. Rather, the ensuing detailed description of the preferred exemplary embodiments will provide those skilled in the art with an enabling description for implementing a preferred embodiment of the invention. It should be understood that various changes may be made in the function and arrangement of elements without departing from the spirit and scope of the invention as set forth in the appended claims.
The protocols used to implement the present invention may include, but are not limited to, PacketCable Network-based Call Signaling (NCS), Packet Cable Duos, Session Initiation Protocol (SIP), Session Description Protocol (SDP), SGCP, MGCP, H.323, and the like.
In an example embodiment of the invention, as shown by the block diagram of
An exemplary cable network 31 is illustrated by the block diagram of
As shown in the example implementation of
In a preferred embodiment of a managed IP network 44, as illustrated by the block diagram of
The elements of the MTA 34 (
An MTA protocol stack 68, roughly corresponding to the open system interconnection (OSI) protocol standard, is illustrated in
UDP is a connectionless protocol that provides an unreliable communication channel. Messages, in the form of datagrams, are not guaranteed to arrive at their destination or to arrive in the same order they were sent. A function of the managed IP network 44 (
Those skilled in the art will appreciate that implementation of the present invention is independent of the type of transport protocol used. Depending on the specific implementation of the combined protocol telephony system 30 (
The block diagram of
The UDP protocol application 100, the IP protocol application 102, and the Ethernet protocol application 104 are also software constructs. It is a common practice in the industry to store software applications such as the MTA application 86 and these protocol applications on one or more memory devices from which they can be retrieved and loaded into a computer processing device. For example, these applications may be stored on a hard-disk drive (HDD), moved into random access memory (RAM) and run on a micro-processor or central processing unit (CPU).
Audio, video, and optional information captured by the customer premises equipment 36 is passed through the ports 80, 82, 84, and coded by the codecs 88, 90, 92, respectively. Audio information is routed to the NCS call signaling application 94, while video and optional data are processed by the SIP user agent application 96. This dual data path is necessitated by the fact that PacketCable NCS signaling protocols support only one codec, and this single codec is customarily used for audio information.
Some signaling protocols, such as SIP or H.323, are capable of supporting multiple codecs, including one or more audio and video codecs and additional codecs for optional information streams. On the other hand, PacketCable NCS signaling protocols, as currently implemented in the industry, are not interoperable with these other signaling protocols and, therefore, are mutually exclusive. The incompatibility of PacketCable NCS and the flexibility of these other signaling protocols would appear to obviate the need for using PacketCable NCS call signaling for IP telephony.
However, cable service companies have a substantial infrastructure dedicated to delivering cable television service and audio only telephony utilizing PacketCable NCS. Accordingly, cable service providers are interested in supplementing, not replacing, these services. To meet these needs, it is an important aspect of the invention that the MTA 34 includes a mutually-exclusive signaling protocol data combiner (MSPDC) 98. This MSPDC is a software application or hardware device for combining (e.g., synchronizing) multiple data streams conforming to mutually exclusive signaling protocols into a single IP telephony session.
Corresponding methods are also provided in accordance with the invention.
It is noted that the above description of the invention should in no way be interpreted as limiting the scope of the present invention as other alternative embodiments are contemplated. For example, different protocols can be used for audio and video, as noted above. Further, the invention can be implemented to allow two different audio-only systems to be used, e.g., for three-way calling.
It should now be appreciated that the present invention provides advantageous methods, systems, and apparatus for combining multiple data streams conforming to mutually exclusive signaling protocols into a single IP telephony session.
Although the invention has been described in connection with various illustrated example embodiments, numerous modifications and adaptations may be made thereto without departing from the spirit and scope of the invention as set forth in the claims.