The present invention relates to video telephony, and more particularly, to a voice over IP video telephone capable of operating over an IP network.
This application claims priority from U.S. Provisional Application No. 60/641,684 filed Jan. 5, 2005 entitled “INNER PROCESSOR COMMUNICATION IN A MULTIPROCESSOR DEVICE,” which is incorporated herein by reference; U.S. Provisional Application No. 60/641,883 filed Jan. 5, 2005 entitled “INNER PROCESSOR COMMUNICATION IN A VOICE OVER IP VIDEO TELEPHONY DEVICE,” which is incorporated herein by reference; U.S. Provisional Application No. 60/641,326 filed Jan. 4, 2005 entitled “METHOD FOR SYNCHRONIZATION OF AUDIO AND VIDEO PACKETS WITHIN AN IP VIDEO TELEPHONE,” which is incorporated herein by reference; and U.S. Provisional Application No. 60/641,328 filed Jan. 4, 2005 entitled “IP VIDEO TELEPHONE WITH POTS TELEPHONE CONNECTIVITY,” which is incorporated herein by reference.
The combination of video and audio channels provides a unique platform for interpersonal communication. With the availability of broadband Internet network connections in the home, there is an opportunity to further methods of interaction between content providers and consumers.
An IP telephone is a telephone device that transmits voice over a network using data packets instead of circuit switch connections over voice only networks. An IP telephone refers to the transfer of voice over the Internet protocol (IP) of the TCP/IP protocol suite. Other voice over packet (VOP) standards exist for frame relay and ATM networks but many people use the terms voice over IP (VOIP) or IP telephone to mean voice over any packet network
IP telephones originally existed in the form of client software running on multi-media PCs for low cost PC to PC communications over the Internet. Quality of service (QOS) problems associated with the Internet and the PC platform itself resulted in poor voice quality due to excessive delay, variable delay, and network congestion resulting in lost packets, thus relegating VOIP primarily to hobby status. The QOS provided by the Internet continues to improve as the infrastructure is augmented with faster backbone links and switches to avoid congestion, higher access connections to the end users such as XDSL cut-down latency, and new protocols like RSVP and techniques like tag switching give priority to delay sensitive data such as voice and video. IP telephones include one wire systems for transmitting both voice and data. The data may comprise video data of the user of the IP phone in some embodiments. IP telephones provide better scalability as additional stations are added to the system, and the ability to mix and match IP telephones from different manufacturers.
IP telephones have several advantages over multimedia PCs with client software including lower latencies due to an embedded system implementation, a familiar user paradigm of using a telephone versus a PC enabled phone, greater reliability, and lower station costs where a PC is not required.
When considering IP telephones for home use, the network interface that is available is typically a DSL or cable broadband connection. Typically, IP telephones connect to a cable modem or DSL modem via a high speed interface such as Ethernet or universal serial bus (USB). There are also emerging home communication standards such as being presented by home RF, which provides wireless communication within the home. In this new residential environment, IP telephones will attach to the home LAN and have access to the data network and the PSTN via either a DSL or cable modem which communicates to DSLAM or cable system equipment.
A home voice over IP telephone including video capabilities would provide a platform for providing a number of different services and opportunities to the home user. A platform for implementing this service would be greatly desirable.
The present invention disclosed and claimed herein, in one aspect thereof, comprises an IP video telephone. The IP video telephone includes a video display and speaker for playing received video and audio signals respectively. A group of processors includes a gateway processor for connecting the IP video telephone to an external IP network and receiving the audio and video signals, a video processor for processing received video signals and an audio processor for processing received audio signals. An ethernet network interconnects each of the gateway processor, the video processor and the audio processor within the IP video telephone.
For a more complete understanding of the present invention and the advantages thereof, reference is now made to the following description taken in conjunction with the accompanying Drawings in which:
a-8c indicate the various manners in which processing components of the IP video telephone may be interconnected via an Ethernet network;
a and 9b illustrate analog telephone connections to the IP video telephone;
Referring now to the drawings, and more particularly to
The display of the IP video telephone 402 and the browser operating within the video processor 111 are typically left in a powered state to enable content to be received by the IP video telephone 402 at any point. In this manner, when the IP video telephone is not presently operating with supporting an audio/visual telephone call, the browser may be used to display content to an individual on the screen of the IP video telephone. This enables a host server interconnected with the IP video telephone 402 through a network such as the Internet to consistently provide advertising or other types of directed information on the display of the IP video telephone through the browser. This information may be determined in such a manner that the displayed information is of particular interest to the individual.
Base unit 101 may include manual inputs 102. Typically the manual inputs 102 for a video telephone include a standard telephone keypad with ten numeric buttons plus a“#” and “*” buttons. Manual inputs may further include any number of other button switches, thumb wheels, pointing devices or other appropriate manual input devices. A wide variety of functions and features may be controlled using the manual inputs 102. Manual inputs 102 may include navigation keys or a joy stick for up, down, right and left selections and programmable soft keys. Power and status LEDs may also be provided to display information to a user.
A base unit 102 maybe connected to a handset 104. Handset 104 maybe substantially a standard telephone handset including a microphone and speaker. Handset 104 may be directly connected to the base unit 101. A handset 104 directly connected to the base unit 101 may be called a “tethered” or “wired” handset. Handset 104 may also include a wireless transceiver, a wireless connection to the base unit 101 including (or connected to) a wireless transceiver. The wireless transceivers may be a 2.4 GHz transceiver or any other suitable wireless transceiver frequency. The wireless transceivers may be spread spectrum transceivers. A handset 104 wirelessly connected to the base unit may be called a wireless handset.
Base unit 101 is connected to an interface 105. Typically, interface 105 is integral with base unit 101. Interface 105 includes an interface for connection to a network 106 such as an IP network. The network 106 may comprise an open network such as the Internet. Interface 105 includes interface connections 101 for connecting the base unit 101 to a variety of peripherals or networks. Typically, the interface 105 will provide Ethernet ports, telephone handset and keypad support, video capture and display ports including NTSC composite input and output ports, S video ports, NTSC camera ports and LCD display ports. The interface 105 may include audio capture and reproduction ports, an external microphone port, an external speaker port, two audio line level inputs, and a hands-free speaker phone.
A digital video camera 115 is connected to the base unit 101. Typical digital video camera 115 comprises a CCD camera device. The digital video camera 115 may be integral with the base unit 101 or the display 103. An additional digital video camera 137 may be integral with the handset 104. A privacy shield 141 may be a cover provided to disable the digital video camera 137 by covering the lens of the digital video camera 137.
Referring now to
Referring now to
Referring now to
Referring now to
An advanced memory controller 526 includes an external input/output controller 528, a flash/ROM/SRAM controller 530 and an SDRAM controller 532. These controllers provide programmable 8/16/32 bit data and 22 bit address bus with up to 64 megabytes of total memory space for flash, ROM, SRAM, SDRAM and external peripherals. The PCI host bridge 534 supports three external PCI masters or guest mode and further a mini PCI and card bus peripheral. The PCI host bridge 534 supports a 33 MHz, 32 bit PCI interface. The gateway processor 408 further includes an interrupt controller 536 for generating interrupts in response to various interrupt conditions, 16 GPIOs for inputting and outputting data, a UART transceiver 540 and timer/watchdog circuitry 542 for timing various events.
Referring now back to
Referring now to
The TNETV 1050 VOIP processor is a communications processor based on a MIPS 32 reduced instruction set computer (RISC) processor 600, along with a C55X digital signal processor (DSP) 601. The VOIP processor 420 has a rich peripheral set architect specifically for IP phone applications, which reduced the build materials costs, time and complexity associated with developing an IP phone. The RISC processor 600 supplies the overall system services and performs user interface, network management, protocol stack management, call processing and task scheduling functions. The DSP processor 601 provides real time voice processing functions such as echo cancellation, compression, PCM processing and tone generation/detection.
The external memory interface 632 supports two SDRAM chip selects providing 120 megabytes of memory space. The external memory interface 632 also supports three chip selects providing 16 megabytes each of RAM or ROM memory. Finally, the interface provides one chip select for providing a 32 megabyte flash memory.
Referring now back to
An embedded terminal adaptor 440 is interconnected with the VOIP processor 420 through a digital-to-analog and analog-to-digital interface 442. Information transmitted from the embedded terminal adaptor 504 is converted from analog into digital data by an analog-to-digital converter within the interface 442. Likewise, digital data coming from the VOIP processor 420 is converted into analog data for use by an analog telephone connected to the embedded terminal adaptor 440 by the interface 442. Information provided to the VOIP processor 420 by an analog telephone connected to the embedded terminal adaptor 440 is routed from the VOIP processor 420 to the gateway processor 408. The gateway processor 408 allows the data to be packetized and transmitted over the IP network 404 such that ultimately the data can be routed to another VOIP device connected to the IP network 404 or to an analog telephone connected to a PSTN network which is interconnected to the IP network 404.
The video processor 422 is connected to the Ethernet link 418 to provide video capabilities for the IP video telephone 402. The video processor 422 includes a video Codec and LCD panel controller. The video processor 422 may in one embodiment comprise a TI TMS320DM642 digital signal processor. Referring now to
The video processor 422 includes three configurable video port peripherals 708 (VP0, VP1 and VP2). These video port peripherals provide a glueless interface to common video decoder and encoder devices. The DSP video port peripherals support multiple resolutions and video standards. The video ports peripherals are configurable and can support video capture and video display modes. Each video port may include two channels with a 5120 byte capture/display buffer that is split-able between the two channels. The DSP video ports include a capture port interfaced with a Philips decoder with integrated multiplexer for NTSC, S-video sources; a display port interfaced with Philips SAA7105 NTSC and S-video encoder and a third port dedicated to the LCD panel.
The peripheral set further includes a 10/100 Mb/s Ethernet MAC; a management data input/output 711; a VCXO interpolated control port 712; a multichannel buffered audio serial port 714; an inter-integrated circuit bus module; two multichannel buffered serial ports 718; three 32-bit general purpose timers 720; a user-configurable 16-bit or 32-bit host port interface 722; a peripheral component interconnect 724; a 16-bit general-purpose input/output port 726 with programmable interrupt/even generation modes; and a 16-bit glueless external memory interface 728 which is capable of interfacing to synchronous and asynchronous memories and peripherals.
The multichannel buffered audio serial port transmitter 714 is programmed to output multiple encoded data channels simultaneously with a single RAM containing the full implementation of user data and channel status field. The multichannel buffered audio serial port 714 also provides extensive error checking and error features, such as bad clock deterioration circuit for each high frequency master clock which verifies that the master clock is within a program frequency range.
The Ethernet media access controller 710 provides an efficient interface between the DSP core processor and the Ethernet network 418. The media access controller 710 supports both 10-base T and 100-base T in either have or full duplex with hardware flow control and quality of service support. The Ethernet MAC 710 makes use of a customer interface to the DSP core that allows efficient data transmission and reception.
The management data input/output (MDIO) module 711 continuously pulls all 32 MDIO addresses in order to enumerate all PHY devices in the system. Once a PHY candidate has been selected by the DSP, the MDIO module transparently monitors its link state by rating the rating the PHY status register. Link change events are stored in the MDIO module 711 and can optionally interrupt the DSP, allowing the DSP to pull the link status of the device without continuously performing costly MDIO accesses.
The VCXO interpolated control (VIC) 712 port provides a digital-to-analog conversion with resolution from 9-bits to up to 16-bits. The output of the VIC 712 is a single bit interpolated D/A output.
The I2C0 port 728 on the video processor 422 enables the DSP to easily control peripheral devices and communicate with a host processor. Additionally, the standard multichannel buffered serial port (MCBSP) 718 may be used to communicate with serial peripheral interface (SPI) mode peripheral devices.
The video processor 422 connects with a video memory 446. The video memory 446 may comprise a flash memory, SDRAM, or other suitable memory device. The video processor 422 also connects to a video decoder 448. The video decoder may comprise an NTSC decoder for decoding provided video data. The video decoder 448 receives video signals from an external NTSC source 450 or from a video camera 452. The video processor 422 is also connected with a video encoder 454 that may comprise an NTSC encoder. The video encoder 454 may be integral with a CSC 156 to provide video signals to a RGB/LCD panel 158. The video encoder 454 may also provide video signals to an LCD panel 163 and a CV/S/RGB output 162.
Referring now to
b illustrates a configuration wherein only the gateway processor 408 may communicate with each of the voice over IP processor 420 and the video processor 422. When the video processor wishes to converse with the voice over IP processor 420, it must do so through the gateway processor 408. Thus, IP packet messages are transmitted from the video processor 422 to the gateway processor 408, and the gateway processor 408 then forwards the IP packets to the voice over IP processor 420. Likewise, when the voice over IP processor 420 desires to communicate with the video processor 422, it must forward packets to the gateway processor 408 which then forwards the packets onward to the video processor 422. As can be seen, each of the voice over IP processor 420 and video processor 422 may communicate directly with the gateway processor 408.
Finally,
The use of processing devices on the same board having packet network communications functionalities associated therewith enables an ease of configuration and updating with respect to the IP video telephone board. In this configuration, any of the processing chips used for either the voice over IP processor 420, gateway processor 408 and video processor 422 may be upgraded to a different chip or component by merely implementing the new chip within the board design. The only requirement is that the newly implemented chip must have the ability to transceive over an Ethernet network. Since the processors within the IP telephone board are each designed to carry out their various functionalities and communicate with the outside world using IP packets via an IP network, the use of differing components for these processors does not adversely affect the operation of the IP video telephone board.
Referring now to
Referring now to
When connected in the manners illustrated in
Referring now to
A ring signal 1024 is provided from the external world back to the call control and the call control forwards the ring signal to the gateway processor and the call control 1000 forwards the ring signal to the gateway processor 408 at 1026. After the call is answered at the receiving end, an answer signal 1028 is provided from the external world to the call control 1006. The call control 1006 notifies the gateway 408 that the call is connected using a call connection signal 1030. The call controller 1006 notifies the audio processor 420 at 1032 that the call is connected and sets the capabilities for the call with the audio processor. The video processor 422 is notified at 1034 that the call is connected and sets the capabilities for the video processor 422. The call control 1006 transmits an acknowledge signal 1036 back to the external world to where the call has been answered. The call is supported by the IP video telephone during the time period 1038 for which the call is active.
Once the user has completed the call and hangs up the receiver ofthe IP video telephone, a hang-up signal 1040 is provided from the gateway 408 to call control 1006. The call control 1006 initiates a hang-up notification 1042 to the external world to the unit to which the IP video phone is connected. The call control 1006 initiates a stop signal 1044 to the audio processor 420 and a stop signal 1046 to the video processor 422 to indicate that the call has been disconnected. An acknowledgment 1048 is received from the external world at the call control 1006, and the call control notifies the gateway processor 408 that the call is disconnected at 1050.
Referring now to
During transmission ofthe packets over the IP network, the assumption is that the packets sent at the same time will be grouped together as they are received and arrive at substantially the same time. However, when arriving at the second location 1104, the decoding ofthe video packet will again take longer than the decoding ofthe audio packet at the second location. Thus, the initial delay D1 between the audio and video packets is caused by the encoding delays at the first location 1102 and the second delay D2 is associated with the inherent decoding delay differences between the audio and video packets. Thus, a total delay of D1+D2 will be introduced between the audio and video packets resulting in a lack of synchronization between the audio and video packets at the receiving end.
One manner for minimizing or eliminating the lack of synchronization between the audio and video packets is illustrated in the flow chart of
Referring now to
The packets are transmitted over the associated IP network 1312 and statistically the packets will take the same pathway and arrive at a receive gateway 1314 at substantially the same time. The audio packets received at the receive gateway 1314 are delayed for 50 milliseconds while the video packets are not delayed at all and are passed on directly to the video decoder 1316. The provided video packets are decoded by the video decoder 1316 which takes approximately 100 milliseconds. After a delay of 50 milliseconds, the associated audio packets are forwarded to the audio decoder 1318 wherein the packets are decoded in approximately 50 milliseconds. Due to the induced delay of 50 milliseconds at the receive gateway 1314 for the audio packets, the audio packets provided from the audio decoder 1318 and the associated video packets from the video decoder 1316 will be output as associated video and audio packets at substantially the same time. This provides for a synchronized output at the IP video telephone 1104.
Referring now to
The active display 1406 provides various information to the user. A message portion 1408 provides an indication of stored voice and video messages to the user. The calendar portion 1410 provides an abbreviated version ofthe user's calendar for the day and the ability to select a particular day ofthe week to view activities scheduled for that day. A reminders section 1412 provides various reminders that the user has programmed into the IP video telephone enabling them to be reminded of particular events or appointments. A weather display 1414 provides various information to the user on current and coming weather conditions for various days ofthe week. Finally, an ad window 1416 provides for the placement of banner ads that have been purchased by various advertisers that have a business relationship with the service provider ofthe IP video telephone. While the foregoing description describes one particular embodiment ofthe display associated with the IP video telephone, it will be realized by one skilled in the art that the above-described displays and the particular descriptions ofthe displays following herewith comprise only a single embodiment and numerous changes and alterations to the display may be made to suit a particular user and/or service provider.
Referring now to
Referring now to
The call log pane 1604 additionally provides information on previously dialed calls and missed calls. The messages pane 1606 provides listing of both video and voice messages that have been received and stored for a user. The directories pane 1608 provides access to various telephone directories including a personally created phone book, a white pages or a yellow pages. An alerts pane 1610 may provide either information previously indicated by the user as important to the user for which they wish to wish alerts upon, or alternatively, may be directed information pushed to the user based upon data mining analysis with respect to the user's call and/or interest activities.
In addition to the displays described above, the IP video telephone may also include the displays illustrated in
Referring now to
Referring now to
SIP is a signaling protocol for Internet conference, telephony, presence, event notification and instant messaging. SIP provides the necessary protocol mechanisms so that systems and proxy server can provide services such as call forwarding; callee and calling “number” delivery, where numbers can be any (preferably unique) naming scheme; personal mobility, i.e., the ability to reach a called party under a single, location independent address even when the user changes terminals; terminal type negotiation and selection wherein a caller can be given a choice how to reach the party such as via Internet telephone, mobile phone, an answering service, etc.; terminal capability negotiation; caller and callee authentication; blind and supervised call transfer, and invitations to multicast conferences. Extensions of SIP allow third party signaling such as quick to dial services, fully meshed conferences and connections to multipoint control units, as well as mixed mode and the transition between those. SIP addresses users by an email-like address and reuses some ofthe infrastructure of electronic mail delivery such as DNSMX records or using SMTPEXPN for address expansions. SIP addresses (URLs) can also be embedded in web pages. SIP is addressing neutral, with addresses expressed as URLs of various types such as SIP, H.323 or telephone (E.164). SIP is independent ofthe packet layer and only requires an unreliable datagram service, as it provides its own reliability mechanism.
The data port 1904 of the audio processor 422, the data port 1906 of the video processor 420 and the data port 1908 ofthe gateway processor 408 each have unique internal IP addresses associated therewith that are used only within the video telephone 402. These unique IP addresses are different from the IP address associated with the data port 1910 with which the IP video telephone 402 is connected with the external world from the gateway processor 408. In order for data packets to be transmitted between the audio processor 422 and the video processor 420 to the external IP network through the gateway processor 408. The Ethernet and SIP addresses used within the internal Ethernet network and over the external IP network must be translated. Thus, when data packets are transmitted to the gateway processor 408, the SIP proxy 1912 is responsible for converting the SIP protocol addresses from the address utilized by the audio processor 422 to the SIP protocol address used at the output ofthe gateway processor 408. The SIP proxy module 1912 additionally converts the address of video packets from the video processor 420 to the address of the output of the gateway processor 408. The SIP proxy 1912 additionally includes the capability for routing audio stream packets to/from the audio processor 422 and video stream packets coming to/from the video processor 420 at the same time. The SIP proxy 1912 achieves this by transmitting the video packets as a second audio stream of larger audio packets. The SIP proxy 1912 believes it is transmitting a second audio stream when in fact it is transmitting the stream of video packets from the video processor. The router/firewall/NAT 1914 is responsible for translating addresses from packets received from the audio processor 422 and the video processor 420 in the ethernet domain. The packets from the audio and video processors have the IP port addresses from the outputs of both the audio and video processors. The router/firewall/NAT 1914 converts the addresses of these output ports to the address of the output port 1910 of the gateway processor 408 at the Ethernet level.
The stun module 1916 is utilized to enable the gateway processor 408 ofthe video telephone 402 to determine the IP address by which the outside world views the video telephone. The stun module 1916 does this by transmitting messages to a stun server 1918 associated with the SIP server 1920 enabling call connections. The stun server 1918 transmits a response back to the stun module 1916 indicating the outside world's view of data from the IP video telephone 402.
Referring now also to
The content and configuration module 1922 enables control of the configuration of the audio processor 422 and the video processor 420. All operating parameters within these two processors are controllable through the content and configuration module 1922. One parameter the content and configuration module 1922 may set is the codec with which the audio and video processors process incoming and outgoing data packets. The audio processor may be configured to operate according to the G.711, G.722, G.720 or any other available audio codec with which the audio processor 422 may operate. Likewise, the video processor 420 may be configured to code/decode video packet data according to H.264, H.263 or other types of video codecs. In the preferred embodiment, the configuration parameters may be set within the content and configuration module 1922 from an external host server 1924. This external server may download these parameters into the content and configuration module 1922 and the content and configuration module 1922 may then download the appropriate parameters to the video processor 420 and the audio processor 422 through the internal ethernet.
The content and configuration module 1922 is also able to control the content which is displayed by the browser 1926 within the video processor 420. The browser 1926 operates as an Internet browser providing the ability for the video processor 420 to display various web page content upon the display ofthe video telephone 402. Content may be established within the content and configuration module 1922 either by the user ofthe video telephone 402 selecting display preferences or controlling browsing ofthe Internet through the browser 1926 using, for example, the handset 104. Alternatively, the external server 1924 may push content to the content and configuration module 1922 in order to enable external content providers to display, for example, directed advertising information on the browser 1926 of the video telephone 402. Thus, the content portion ofthe content and configuration module 1922 may be either controlled locally via the user of the video telephone 402 or externally via a content provider providing a server 1924 interconnected with the video telephone 402.
Referring now to
The audio processor 422 sends at step 2112 a SIP message to the gateway processor 408. The SIP message includes the audio and visual codec capabilities of the calling video telephone 402. The gateway processor 408 converts the IP addresses associated with the SIP protocol and the IP addresses associated with the Ethernet protocol to the appropriate addresses using the SIP proxy 1912 and router/firewall/NAT module 1914 and forwards this information to the SIP server 1920. The SIP server 1920 generates a SIP invite at 2115 which is forwarded to the called party. The called party responds to the received SIP invite at step 2116, and the gateway processor 408 receives at step 2118 the called party's response. The appropriate address conversions are made by the router/firewall/NAT module 1914 and SIP proxy 1912 at the gateway 408 such that the audio processor 422 may be notified at step 1920 ofthe completion or non-completion of the call. Once the call is connected, the video processor 420 is notified at step 2122 by the audio processor 422 ofthe call connection. Inquiry step 1924 determines if the called party enables provision of an audio only or an audio/video call. If only audio is provided, an audio call is provided at step 2126. If an audio/video call is indicative, the video call is provided at step 2128. The call continues until the call is ended at step 2130.
If the called party is another video telephone according to the type described hereinabove, the receipt of a SIP server invite would cause the operation as illustrated in the flow chart of
It will be appreciated by those skilled in The art having The benefit of this disclosure that this invention provides a broadband information appliance. It should be understood that The drawings and detailed description herein are to be regarded in an illustrative rather than a restrictive manner, and are not intended to limit The invention to The particular forms and examples disclosed. On The contrary, The invention includes any further modifications, changes, rearrangements, substitutions, alternatives, design choices, and embodiments apparent to those of ordinary skill in The art, without departing from The spirit and scope of this invention, as defined by The following claims. Thus, it is intended that The following claims be interpreted to embrace all such further modifications, changes, rearrangements, substitutions, alternatives, design choices, and embodiments.
Number | Date | Country | |
---|---|---|---|
60641684 | Jan 2005 | US | |
60641883 | Jan 2005 | US | |
60641326 | Jan 2005 | US | |
60641328 | Jan 2005 | US |