This invention relates generally to the field of telecommunications and more specifically to a system and method for displaying users in a visual conference between locations.
There are many methods available for groups of individuals to engage in conferencing. One common method, videoconferencing, involves one or more individuals located in one location engaging in a video conference with one or more individuals located in at least one other location, remote from the first. Videoconferencing involves the use of video equipment, such as cameras and displays. When the remote location has more cameras than the local location has displays, individuals at the local location have an artificial and unrealistic experience during the videoconference. Similarly, individuals at the local location have a diminished experience if the aggregate number of cameras at several remote locations outnumbers the displays at the local location.
In accordance with particular embodiments, disadvantages and problems associated with previous techniques for displaying users in a visual conference between locations may be reduced or eliminated.
According to one embodiment of the present invention, a method for displaying a visual conference includes receiving a plurality of video signals from at least two cameras located at one or more remote sites. The method also includes assigning each video signal to a respective display of a plurality of displays at a local site for the duration of a visual conference. At least one of the plurality of displays has more than one video signal assigned thereto. The method further includes selecting, for each of the at least one of the plurality of displays having more than one video signal assigned thereto, a video signal of the more than one video signal assigned to the display. The method also includes displaying each selected video signal on its respective display.
In some embodiments selecting a video signal of the more than one video signal may include selecting the video signal having the highest audio level of the more than one video signal. Depending on the embodiments, the method may include assigning each video signal to a respective display based on the relative location and alignment of a first camera in relation to at least one other camera of the at least two cameras or assigning each video signal to a respective display based on a total number of video signals of the plurality of video signals assigned to the respective display.
In some embodiments the method may further include, for each of the at least one of the plurality of displays having more than one video signal assigned thereto, switching between the more than one video signal assigned to the display. In particular embodiments switching between the more than one video signal assigned to the display may include switching to the video signal having the highest audio level of the more than one video signal assigned to the display.
According to another embodiment of the present invention, a system for displaying a visual conference includes an interface operable to receive a plurality of video signals from at least two cameras located at one or more remote sites. The system also includes a processor coupled to the interface and operable to assign each video signal to a respective display of a plurality of displays at a local site for the duration of a visual conference. At least one of the plurality of displays having more than one video signal assigned thereto. The processor is further operable to select, for each of the at least one of the plurality of displays having more than one video signal assigned thereto, a video signal of the more than one video signal assigned to the display. Wherein the plurality of displays are operable to display each selected video signal on the respective display.
Certain embodiments of the invention may provide one or more technical advantages. A technical advantage of one embodiment includes providing a realistic experience to users of a visual conference. In a visual conference environment in which users at a local site view users at a remote site, providing a realistic experience is desirable. The realistic experience may include providing a life-size image of a user at a remote site or simulating a conference table that the users surround. The display mimics an actual conference table where remote users appear as if they are across the table from the local users. Another technical advantage of some embodiments includes providing a realistic, natural experience to local users even though the local site has fewer resources than the remote site(s). Even if a local site has fewer displays to display the video signals, the local site's experience is not compromised. Instead, the quality of the site's experience is maintained.
Certain embodiments of the invention may include none, some, or all of the above technical advantages. One or more other technical advantages may be readily apparent to one skilled in the art from the figures, descriptions, and claims included herein.
For a more complete understanding of the present invention and its features and advantages, reference is now made to the following description, taken in conjunction with the accompanying drawings, in which:
Embodiments of the present invention and its advantages are best understood by referring to
Network 102 represents communication equipment, including hardware and any appropriate controlling logic, for interconnecting elements coupled to network 102 and facilitating communication between sites 100. Network 102 may include a local area network (LAN), a metropolitan area network (MAN), a wide area network (WAN), any other public or private network, a local, regional, or global communication network, an enterprise intranet, other suitable wireline or wireless communication link, or any combination of the preceding. Network 102 may include any combination of gateways, routers, hubs, switches, access points, base stations, and any other hardware, software, or a combination of the preceding that may implement any suitable protocol or communication.
User 104 represents one or more individuals or groups of individuals who are present for the visual conference. Users 104 participate in the visual conference using any suitable device and/or component, such as an audio Internet Protocol (IP) phones, video phone appliances, personal computer (PC) based video phones, and streaming clients. During the visual conference, users 104 engage in the session as speakers or participate as non-speakers.
Videoconferencing equipment 106 facilitates the videoconferencing among users 104. Videoconferencing equipment 106 may include any suitable elements to establish and facilitate the visual conference. For example, videoconferencing equipment 106 includes speakers, microphones, or a speakerphone. In the illustrated embodiment, videoconferencing equipment 106 includes cameras 108, displays 110, a processor 112, and a network interface 114.
Cameras 108 include any suitable hardware and/or software to facilitate capturing an image of user 104 and providing the image to other users 104. Cameras 108 capture and transmit the image of user 104 as a video signal. Displays 110 include any suitable hardware and/or software to facilitate receiving the video signal and displaying the image of user 104 to other users 104. For example, displays 110 may include a notebook PC or a wall mounted display. Displays 110 display the image of user 104 using any suitable technology that provides a realistic image, such as high definition, high-power compression hardware, and efficient encoding/decoding standards. Videoconferencing equipment 106 establishes the visual conference session using any suitable technology and/or protocol, such as Session Initiation Protocol (SIP) or H.323. Additionally, videoconferencing equipment 106 may support and be interoperable with other video systems supporting other standards, such as H.261, H.263, and/or H.264.
Processor 112 controls the operation and administration of videoconferencing equipment 106 by processing information and signals received from cameras 108 and interface 114. Processor 112 includes any suitable hardware, software, or both that operate to control and process signals. For example, processor 112 may be a programmable logic device, a microcontroller, a microprocessor, any suitable processing device, or any combination of the preceding. Interface 114 communicates information and signals to and receives information and signals from network 102. Interface 114 represents any port or connection, real or virtual, including any suitable hardware and/or software that allow videoconferencing equipment 106 to exchange information and signals with network 102, other videoconferencing equipment 106, or and/or other elements of system 10.
In an example embodiment of operation, users 104 at sites 100a and 100d participate in a visual conference. When users 104 join the visual conference, a video signal is generated for each camera 108 that is assigned to a monitor 110. This assignment may persist for the duration of the visual conference. Thus, a remote user may always be displayed on the same local monitor. This may make it easier for local users to identify who and where the remote user is. More specifically, in the embodiment depicted in
Modifications, additions, or omissions may be made to system 10. For example, system 10 may include any suitable number of sites 100 and may facilitate a visual conference between any suitable number of sites 100. As another example, sites 100 may include any suitable number of cameras 108 and displays 110 to facilitate a visual conference. As yet another example, the visual conference between sites 100 may be point-to-point conferences or multipoint conferences. For point-to-point conferences, the number of displays 110 at local site 100 is less than the number of cameras 108 at remote site 100. For multipoint conferences, the aggregate number of cameras 108 at remote sites 100 is greater than the number of displays 110 at local site 100. Moreover, the operations of system 10 may be performed by more, fewer, or other components. Additionally, operations of system 10 may be performed using any suitable logic.
Each video signal may be assigned to only one monitor, and thus each user may only be displayed on one monitor. This may allow local users viewing local monitors to become familiar with the location of each user.
Remote site 210 has three video signals, left video signal 211, center video signal 212 and right video signal 213. The assignment in this situation may be relatively straight forward because the number of local displays is equal to the number of video signals. However, it may be desirable to determine to which particular display each video signal is assigned. One factor that may be used by some embodiments is the relative position of the remote cameras generating the video signals in relation to each other. More specifically, left video signal 211 may be assigned to the left local display 201, center video signal 212 may be assigned to center display 203, right video signal 213 may be assigned to right monitor 205. This may provide for a more natural appearance in eye contact between the remote users displayed on the local monitors and the local users viewing the local monitors.
Remote sites 220 through 240 each have two video signals, left video signal 221, 231 and 241, respectively, and right video signals 222, 232, and 242, respectively. Left video signal 221 may be assigned to left monitor 201 and right video signal 222 may be assigned to right monitor 205 to optimize the natural appearance of the eye contact between local and remote users.
In assigning the video signals of remote sites 230 and 240 it may not always be desirable to automatically assign them to left monitor 201 and right monitor 205. This may be because doing so may cause there to be an excess of video signals assigned to left monitor 201 and right monitor 205 and a shortage of video signals assigned to center monitor 203. This may be an inefficient use of monitor resources. Thus it may be desirable to use alternate assignment. However, it may still be desirable to optimize the eye contact between local and remote users in the alternate assignments. By maintaining the relative position of the video signals at least two additional alternate assignments may be used. For example, remote site 230 may have its video signals assigned to the two left most displays, while remote site 240 may have its two video signals assigned to the two right most displays. More specifically, left video signal 231 is assigned to left monitor 201, right video signal 232 is assigned to center monitor 203, left video signal 241 is assigned to center monitor 203 and right video signal 242 is assigned to right monitor 205. Thus, the left video signals (e.g., 231 and 241) remain to the left of the right video signals (e.g., 232 and 242).
Remote sites 250 through 270 each have a single video signal. Because there may not be an concerns with the position of the video signal in relation to other video signals from the same site, assigning the video signals to a monitor may be relatively straight forward. In assigning the single video signal from each remote site, one factor that may be considered is the relative priority of the user associated with the video signal. For example, if the president of a company is associated with video signal 251 it may be desirable to have video signal 251 assigned to center monitor 203. Another factor that may be considered in assigning a single video signal from a remote site may be the number of video signals that have already been assigned to a local monitor. For example, in assigning video signal 261 (assuming that the video signals from remote sites 210-250 have already been assigned) it may not be desirable to assign video signal 261 to center monitor 203 because center monitor 203 already has four video signals assigned to it, while monitors 201 and 205 only have three video signals assigned to them. Thus, in this situation it may be desirable to assign video signal 261 to one of the side monitors, such as left monitor 201.
Modifications, additions, or omissions may be made while displaying video signals on local monitors 201, 203 and 205. For example, there may be a different number of local monitors (e.g., 1, 2, 4 or more monitors) to which video signals may be assigned. As another example, a remote site may have more video signals than there are local monitors. For example, the video signals of remote site 220 may be part of remote site 210 (thus remote site 210 may have five video signals, 211, 212, 213, 221 and 222). In such a case the assignment may take into account the factors discussed above in assigning the video signals (e.g., video signal 221 may be the left most video signal, video signal 211 may be the second left most video signal, video signal 212 may be the center video signal, video signal 213 may be the second right most video signal and video signal 222 may be the right most video signal). Similarly, the assignment may last for the duration of the visual conference, thereby increasing the comfort and familiarity that local users may have in knowing who and where the speaker is.
Because each video signal is assigned to a respective display for the duration of the visual conference, those users participating in a visual conferencing locally may be able quickly identify who and where the speaker is. This may be because the video signal containing the image and sound of a particular user may always be displayed on the same display, so local users may not need to search multiple monitors to locate the speaker.
If the number of video signals is less than or equal to the number of displays then for each display that is to be used (if there are five displays and two video signals, two displays will have a video signal assigned thereto while the remaining three displays may be unused or may display other content) a video signal may be assigned. In assigning video signals to displays one or more of several different factors may be used. Some example factors may include: the relative position of the remote user in relation to the other remote users; the relative position and alignment of the remote camera to the remote user and/or the other remote cameras; the associated priority of the user (e.g., the president of the company may be displayed in a center display); the order in which the video signals joined the visual conference; or any other factor that may be deemed advantageous in determining where to assign a video signal.
Because of the possibility of there being more remote video signals received and assigned than there are local displays, it may be that one or more of the displays may have more than one video signal assigned thereto. In such a case, the assignment of each video signal may, in addition to the above factors, also consider such factors as the number of video signals that have been assigned to a particular display.
In the situation in which there are more displays than there are video signals, steps 330 through 350 may be fairly straightforward. More specifically, the video signal selected at step 330 and displayed at 350 may be the only video signal assigned to the respective display. Additionally, because there may not be other video signals assigned to the display, there may not be a need to switch between video signals at step 350.
In the situation in which there are more video signals than there are displays, at step 330, one video signal for each display may be selected from the plurality of video signals assigned to the display. The selected signal may then be displayed on the respective display at step 340. In making the initial selection (e.g., before the visual conference begins) as to which video signal to display such factors as the order in which the video signals were assigned to the display or the relative priority of the user assigned to the display may be considered.
At step 350 it may be possible to switch from the currently displayed video signal to a new video signal. Recall that both the current video signal and the new video signal were previously assigned to the display. In some embodiments the switch may occur when it is detected that a user, other than the user currently displayed, has talked. As soon as it is detected that a new user has begun to talk, the switch may occur. Thus, each display may currently be displaying the video signal, of the plurality of video signals assigned to the display, associated with the last user to speak.
Modifications, additions, or omissions may be made to the method depicted in
Although the present invention has been described in several embodiments, a myriad of changes, variations, alterations, transformations, and modifications may be suggested to one skilled in the art, and it is intended that the present invention encompass such changes, variations, alterations, transformations, and modifications as fall within the scope of the appended claims.
This application claims priority to U.S. Patent Application Ser. No. 60/794,016, entitled “VIDEOCONFERENCING SYSTEM,” which was filed on Apr. 20, 2006.
Number | Name | Date | Kind |
---|---|---|---|
4400724 | Fields | Aug 1983 | A |
4494144 | Brown | Jan 1985 | A |
4961211 | Tsugane et al. | Oct 1990 | A |
4965819 | Kannes | Oct 1990 | A |
5272526 | Yoneta et al. | Dec 1993 | A |
5491797 | Thompson et al. | Feb 1996 | A |
5508733 | Kassatly | Apr 1996 | A |
5541639 | Takatsuki et al. | Jul 1996 | A |
5673256 | Maine | Sep 1997 | A |
5675374 | Kohda | Oct 1997 | A |
5737011 | Lukacs | Apr 1998 | A |
5751337 | Allen et al. | May 1998 | A |
5790179 | Shibata et al. | Aug 1998 | A |
5801756 | Iizawa | Sep 1998 | A |
5802294 | Ludwig et al. | Sep 1998 | A |
5903637 | Hogan et al. | May 1999 | A |
6049694 | Kassatly | Apr 2000 | A |
6172703 | Lee | Jan 2001 | B1 |
6346962 | Goodridge | Feb 2002 | B1 |
6396531 | Gerszberg et al. | May 2002 | B1 |
6577807 | Yaegashi et al. | Jun 2003 | B1 |
6611503 | Fitzgerald et al. | Aug 2003 | B1 |
6710797 | McNelley et al. | Mar 2004 | B1 |
6711212 | Lin | Mar 2004 | B1 |
6757277 | Shaffer et al. | Jun 2004 | B1 |
6774927 | Cohen et al. | Aug 2004 | B1 |
6775247 | Shaffer et al. | Aug 2004 | B1 |
6795108 | Jarboe et al. | Sep 2004 | B2 |
6798441 | Hartman et al. | Sep 2004 | B2 |
6882358 | Schuster et al. | Apr 2005 | B1 |
6886036 | Santamäki et al. | Apr 2005 | B1 |
6922718 | Chang | Jul 2005 | B2 |
6981047 | Hanson et al. | Dec 2005 | B2 |
6989836 | Ramsey | Jan 2006 | B2 |
6992702 | Foote et al. | Jan 2006 | B1 |
6999829 | Bazzocchi et al. | Feb 2006 | B2 |
7027659 | Thomas | Apr 2006 | B1 |
7038588 | Boone et al. | May 2006 | B2 |
7039027 | Bridgelall | May 2006 | B2 |
7043528 | Schmitt et al. | May 2006 | B2 |
7050425 | Richter et al. | May 2006 | B2 |
7054268 | Paranteinen et al. | May 2006 | B1 |
7057636 | Cohen-Solal et al. | Jun 2006 | B1 |
7068299 | Lemieux et al. | Jun 2006 | B2 |
7080105 | Nakanishi et al. | Jul 2006 | B2 |
7092002 | Ferren et al. | Aug 2006 | B2 |
7111045 | Kato et al. | Sep 2006 | B2 |
7151758 | Kumaki et al. | Dec 2006 | B2 |
7154526 | Foote et al. | Dec 2006 | B2 |
7203904 | Lee | Apr 2007 | B2 |
7245272 | Shiuan et al. | Jul 2007 | B2 |
7256822 | Suga et al. | Aug 2007 | B2 |
7277117 | Takashima et al. | Oct 2007 | B2 |
7532232 | Shah et al. | May 2009 | B2 |
20020099682 | Stanton | Jul 2002 | A1 |
20030071890 | McClure | Apr 2003 | A1 |
20030149724 | Chang | Aug 2003 | A1 |
20040004942 | Nebiker et al. | Jan 2004 | A1 |
20040010464 | Boaz | Jan 2004 | A1 |
20050024484 | Leonard et al. | Feb 2005 | A1 |
20050248652 | Firestone et al. | Nov 2005 | A1 |
20050260976 | Khartabil et al. | Nov 2005 | A1 |
20060041571 | Kubokawa | Feb 2006 | A1 |
20060066717 | Miceli | Mar 2006 | A1 |
20060129626 | Fitzpatrick | Jun 2006 | A1 |
20060152575 | Amiel et al. | Jul 2006 | A1 |
20060158509 | Kenoyer et al. | Jul 2006 | A1 |
20060168302 | Boskovic et al. | Jul 2006 | A1 |
20060200518 | Sinclair et al. | Sep 2006 | A1 |
20060251038 | Tamura et al. | Nov 2006 | A1 |
20060259193 | Wang et al. | Nov 2006 | A1 |
20060264207 | Tamura et al. | Nov 2006 | A1 |
20070070940 | Vander Veen et al. | Mar 2007 | A1 |
20070250567 | Graham | Oct 2007 | A1 |
20080062625 | Batio | Mar 2008 | A1 |
20090174764 | Chadha et al. | Jul 2009 | A1 |
20090213207 | Shah et al. | Aug 2009 | A1 |
Number | Date | Country | |
---|---|---|---|
20070250568 A1 | Oct 2007 | US |
Number | Date | Country | |
---|---|---|---|
60794016 | Apr 2006 | US |