The manner in which communication sessions with remote parties occur is currently limited in functionality and flexibility. Accordingly, what is needed is a system and method that address these issues.
In one example embodiment, a method for managing a conference call using a selective transmission unit (STU) to manage a plurality of client devices participating in the conference call includes receiving, by the STU, video transmission parameters from each of a plurality of client devices, receiving, by the STU, video reception parameters from each of the client devices, instructing, by the STU, a first client device of the plurality of client devices to start transmitting a first video stream at a first resolution based on at least some of the video transmission parameters and the video reception parameters, receiving, by the STU, the first video stream, and forwarding the first video stream to at least a second client device of the client devices based on the video reception parameters of the second client device.
In one or more of the above examples, the method further includes determining, by the STU, that a second video stream from a third client device is not being viewed by any of the other client devices, and instructing, by the STU, the third client device to stop sending the second video stream.
In one or more of the above examples, the method further includes determining, by the STU, that a second video stream having a second resolution is desired from the first client device, instructing, by the STU, the first client device to start transmitting the second video stream, receiving, by the STU, the second video stream, and forwarding the second video stream to at least one of the other client devices based on the video reception parameters of the respective client device.
In one or more of the above examples, the method further includes determining, by the STU, that the second video stream is not being received from the first client device prior to instructing the first client device to start transmitting the second video stream.
In one or more of the above examples, determining that a second video stream having a second resolution is desired from the first client device occurs when a request for the second resolution is received from one of the other client devices.
In one or more of the above examples, the method further includes receiving, by the STU, a modification of the video reception parameters from the second client device, wherein the modification requests a second video stream having a second resolution from the first client device, determining, by the STU, that the second video stream is already being received from the first client device, and forwarding the second video stream to the second client device.
In one or more of the above examples, the method further includes monitoring, by the STU, a network performance level associated with the first client device, and modifying, by the STU, at least one of a video stream being received from the first client device or a video stream being sent to the first client device based on the network performance level.
In one or more of the above examples, the STU manages the conference call by communicating only with a browser on each of the client devices, and wherein the client devices do not communicate with the STU using an application other than the browser and do not use any browser plug-ins to communicate with the STU for the conference call.
In another example embodiment, a method for managing a conference call using a selective transmission unit (STU) to manage a plurality of client devices participating in the conference call includes receiving, by the STU, video transmission parameters and video reception parameters from each of a plurality of client devices, managing, by the STU, a plurality of video streams received by the STU from the client devices, wherein the managing includes controlling a resolution of each video stream being received from the client devices based on the video transmission parameters and video reception parameters, and managing, by the STU, which of the received video streams are sent to each of the client devices based on the video reception parameters of the respective client device.
In one or more of the above examples, managing the plurality of video streams received by the STU from the client devices includes instructing at least one of the client devices to stop sending a video stream having a particular resolution.
In one or more of the above examples, managing the plurality of video streams received by the STU from the client devices includes instructing at least one of the client devices to start sending a video stream having a particular resolution.
In one or more of the above examples, the method further includes monitoring, by the STU, a network performance level associated with a first client device of the client devices, and modifying, by the STU, at least one of a video stream being received from the first client device or a video stream being sent to the first client device based on the network performance level.
In one or more of the above examples, the method further includes receiving, by the STU, a change in the video reception parameters of a first client device of the client devices, and modifying, by the STU, a video stream being sent to the first client device based on the change.
In one or more of the above examples, the STU communicates with at least one of the client devices for the conference call using only a browser present on the client device, and wherein the client device does not communicate with the STU using an application other than the browser and does not use any browser plug-ins to communicate with the STU for the conference call.
In another example embodiment, a system for managing a conference call includes a server having a processor and a memory coupled to the processor, the memory containing a plurality of instructions for execution by the processor, the instructions including instructions for providing a selective transmission unit (STU) and configuring the STU for receiving video transmission parameters and video reception parameters from each of a plurality of client devices, managing a plurality of video streams received by the STU from the client devices, wherein the managing includes controlling a resolution of each video stream being received from the client devices based on the video transmission parameters and video reception parameters, and managing which of the received video streams are sent to each of the client devices based on the video reception parameters of the respective client device.
In one or more of the above examples, the instructions for managing the plurality of video streams received by the STU from the client devices include instructions for instructing at least one of the client devices to stop sending a video stream having a particular resolution.
In one or more of the above examples, the instructions for managing the plurality of video streams received by the STU from the client devices include instructions for instructing at least one of the client devices to start sending a video stream having a particular resolution.
In one or more of the above examples, the system further includes instructions for monitoring, by the STU, a network performance level associated with a first client device of the client devices, and modifying, by the STU, at least one of a video stream being received from the first client device or a video stream being sent to the first client device based on the network performance level.
In one or more of the above examples, the system further includes instructions for receiving, by the STU, a change in the video reception parameters of a first client device of the client devices, and modifying, by the STU, a video stream being sent to the first client device based on the change.
In one or more of the above examples, the STU is configured to communicate with at least one of the client devices for the conference call using only a browser present on the client device, and wherein the client device does not communicate with the STU using an application other than the browser and does not use any browser plug-ins to communicate with the STU for the conference call.
Other technical features may be readily apparent to one skilled in the art from the following figures, descriptions, and claims.
Before undertaking the DETAILED DESCRIPTION below, it may be advantageous to set forth definitions of certain words and phrases used throughout this patent document. The terms “transmit,” “receive,” and “communicate,” as well as derivatives thereof, encompass both direct and indirect communication. The terms “include” and “comprise,” as well as derivatives thereof, mean inclusion without limitation. The term “or” is inclusive, meaning and/or. The phrase “associated with,” as well as derivatives thereof, means to include, be included within, interconnect with, contain, be contained within, connect to or with, couple to or with, be communicable with, cooperate with, interleave, juxtapose, be proximate to, be bound to or with, have, have a property of, have a relationship to or with, or the like.
Moreover, various functions described below can be implemented or supported by one or more computer programs, each of which is formed from computer readable program code and embodied in a computer readable medium. The terms “application” and “program” refer to one or more computer programs, software components, sets of instructions, procedures, functions, objects, classes, instances, related data, or a portion thereof adapted for implementation in a suitable computer readable program code. The phrase “computer readable program code” includes any type of computer code, including source code, object code, and executable code. The phrase “computer readable medium” includes any type of medium capable of being accessed by a computer, such as read only memory (ROM), random access memory (RAM), a hard disk drive, a compact disc (CD), a digital video disc (DVD), or any other type of memory. A “non-transitory” computer readable medium excludes wired, wireless, optical, or other communication links that transport transitory electrical or other signals. A non-transitory computer readable medium includes media where data can be permanently stored and media where data can be stored and later overwritten, such as a rewritable optical disc or an erasable memory device.
As used here, terms and phrases such as “have,” “may have,” “include,” or “may include” a feature (like a number, function, operation, or component such as a part) indicate the existence of the feature and do not exclude the existence of other features. Also, as used here, the phrases “A or B,” “at least one of A and/or B,” or “one or more of A and/or B” may include all possible combinations of A and B. For example, “A or B,” “at least one of A and B,” and “at least one of A or B” may indicate all of (1) including at least one A, (2) including at least one B, or (3) including at least one A and at least one B. Further, as used here, the terms “first” and “second” may modify various components regardless of importance and do not limit the components. These terms are only used to distinguish one component from another. For example, a first user device and a second user device may indicate different user devices from each other, regardless of the order or importance of the devices. A first component may be denoted a second component and vice versa without departing from the scope of this disclosure.
It will be understood that, when an element (such as a first element) is referred to as being (operatively or communicatively) “coupled with/to” or “connected with/to” another element (such as a second element), it can be coupled or connected with/to the other element directly or via a third element. In contrast, it will be understood that, when an element (such as a first element) is referred to as being “directly coupled with/to” or “directly connected with/to” another element (such as a second element), no other element (such as a third element) intervenes between the element and the other element.
The terms and phrases as used here are provided merely to describe some embodiments of this disclosure but not to limit the scope of other embodiments of this disclosure. It is to be understood that the singular forms “a,” “an,” and “the” include plural references unless the context clearly dictates otherwise. All terms and phrases, including technical and scientific terms and phrases, used here have the same meanings as commonly understood by one of ordinary skill in the art to which the embodiments of this disclosure belong. It will be further understood that terms and phrases, such as those defined in commonly-used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the relevant art and will not be interpreted in an idealized or overly formal sense unless expressly so defined here. In some cases, the terms and phrases defined here may be interpreted to exclude embodiments of this disclosure.
Definitions for certain other words and phrases may be provided throughout this patent document. Those of ordinary skill in the art should understand that in many if not most instances, such definitions apply to prior as well as future uses of such defined words and phrases.
None of the description in this application should be read as implying that any particular element, step, or function is an essential element that must be included in the claim scope. The scope of patented subject matter is defined only by the claims. Moreover, none of the claims is intended to invoke 35 U.S.C. § 112(f) unless the exact words “means for” are followed by a participle. Use of any other term, including without limitation “mechanism,” “module,” “device,” “unit,” “component,” “element,” “member,” “apparatus,” “machine,” “system,” “processor,” or “controller,” within a claim is understood by the Applicant to refer to structures known to those skilled in the relevant art and is not intended to invoke 35 U.S.C. § 112(f).
For a more complete understanding, reference is now made to the following description taken in conjunction with the accompanying Drawings in which:
It is understood that the following disclosure provides many different embodiments or examples. Specific examples of components and arrangements are described below to simplify the present disclosure. These are, of course, merely examples and are not intended to be limiting. In addition, the present disclosure may repeat reference numerals and/or letters in the various examples. This repetition is for the purpose of simplicity and clarity and does not in itself dictate a relationship between the various embodiments and/or configurations discussed.
Referring to
The server 102 includes one or more selective transmission units (STUs) 116. As is known, some media control units (MCUs) operate by merging video streams for a conference call or other media session, and then forwarding the merged video stream to the clients. However, the merging of the video streams prevents the streams from being independently manipulated by the clients and prevents more granular control by the MCU. Selective forwarding units (SFUs) use an architecture for conferencing servers where the server does not merge videos, but instead forwards the video streams based on which client is interested in viewing which video stream. However, an SFU may receive multiple video streams of different resolutions from a single client, and then forward the desired video stream to each of the other clients. As the SFU may have no way to control the video streams sent from each client, unnecessary resources (e.g., bandwidth and processing resources) may be used for sending, receiving, and processing video streams that are not needed or desired, which may in turn reduce the number of client devices that can be involved in a single conference call.
In contrast, the STU 116 of the present disclosure extends the SFU architecture to provide additional granularity and control over the video streams by enabling the server 102 to interact with the client devices and control individual video streams sent by the client devices. As the number of participants in a conference call increases, the STU 116 may reduce the resolution of some video streams based on the available network bandwidth, the number of participants, whether one or more users are being viewed in larger windows, etc. This enables the STU 116 to scale the conference call dynamically based on the number of participants and the available resources.
The client devices 104, 106, and 108 may be mobile devices (e.g., tablets, smartphones, personal digital assistants (PDAs), or netbooks), laptops, desktops, workstations, smart televisions, and/or any other computing device capable of receiving and sending electronic communications via a wired or wireless network connection. Such communications may be direct (e.g., via a peer-to-peer network, an ad hoc network, or using a direct connection), indirect, such as through a server or other proxy (e.g., in a client-server model), or may use a combination of direct and indirect communications.
With additional reference to
Each client 104, 106, and 108 may have the ability to transmit at multiple resolutions and to receive multiple streams of video of different participants. Accordingly, the STU 116 includes logic to determine such factors as what resolution(s) each client device should use to send its video to the server 102, how many video streams each client device should receive from the server 102, and how many “small” videos and “large” videos should be sent to a client device.
With respect to which resolutions a client device should send to the server 102, various factors may be taken into account for both the sending client device and other client devices involved in the conference call. For example, the bandwidth available for sending video from the client device 104 affects the device's ability to stream higher quality video (e.g., a 480p video stream takes less bandwidth than a 1080p video stream). The client device's processing power, memory, camera resolution, available power, and/or other device attributes may also affect the client's ability to send video data. Accordingly, in step 202, the client device may advertise (e.g., send one or more messages) to the server 102 detailing certain sending capabilities of the client, such as available resolutions, framerates, etc. For example, the client device 104 may indicate that it can send video in 90p (160×90), 180p (320×180), 240p (352×240), 360p (640×360), 480p (720×480), 720p (1280×720), and 1080p (1920×1080). It is understood that the resolutions that may be sent may vary from device to device, and these are merely examples and are not intended to be limiting.
Another factor is how many video streams the server 102 should send to the client device, and how many of those streams should be “small” videos and “large” videos. Small videos may use lower resolution video, while large videos may use higher resolution video relative to small videos. The presentation of large and small videos may initially be based on user preferences. For example, the client device 104 may indicate that it wants “small” videos to be 720p, while it wants “large” videos to be 1080p. The client device 106 may indicate that it wants “small” videos to be 480p, while it wants “large” videos to be 720p. The client device 108 may indicate that it wants both “small” and “large” videos to be 480p. This enables each client device to establish its preferences, which may then be adjusted by the STU 116 based on factors such as available resolutions and bandwidth. Accordingly, in step 204, the client may advertise (e.g., send one or more messages) to the server 102 requesting certain large and/or small videos.
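By way of illustration only, the advertisements of steps 202 and 204 might be modeled as two small records, one for sending capabilities and one for reception preferences. The following is a minimal sketch and not a definitive message format: the class names, field names, and the `validate` helper are hypothetical, and the resolution table simply restates the example resolutions listed above.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple

# Example resolution labels and their dimensions, as listed in the text.
RESOLUTIONS: Dict[str, Tuple[int, int]] = {
    "90p": (160, 90),
    "180p": (320, 180),
    "240p": (352, 240),
    "360p": (640, 360),
    "480p": (720, 480),
    "720p": (1280, 720),
    "1080p": (1920, 1080),
}


@dataclass
class SendCapabilities:
    """Step 202 (hypothetical shape): what a client says it can transmit."""
    client_id: str
    resolutions: List[str]
    framerates: List[int] = field(default_factory=lambda: [30])  # assumed default


@dataclass
class ReceivePreferences:
    """Step 204 (hypothetical shape): the sizes a client would like to receive."""
    client_id: str
    small: str
    large: str


def pixel_count(label: str) -> int:
    """Return width * height for a known resolution label."""
    width, height = RESOLUTIONS[label]
    return width * height


def validate(prefs: ReceivePreferences) -> None:
    """Reject unknown labels and a 'small' size that is larger than 'large'."""
    for label in (prefs.small, prefs.large):
        if label not in RESOLUTIONS:
            raise ValueError(f"unknown resolution: {label}")
    if pixel_count(prefs.small) > pixel_count(prefs.large):
        raise ValueError("small video must not exceed large video")


# The three example preferences described in the text.
EXAMPLE_PREFS = [
    ReceivePreferences("104", small="720p", large="1080p"),
    ReceivePreferences("106", small="480p", large="720p"),
    ReceivePreferences("108", small="480p", large="480p"),
]
for prefs in EXAMPLE_PREFS:
    validate(prefs)
```

In this sketch the preferences are only checked for internal consistency; any adjustment based on available resolutions and bandwidth would remain with the STU 116.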
In some embodiments, a default video size may be sent initially (e.g., only small videos) and later adjustments may be made. Such adjustments may be made by the client device itself or by the STU 116. For example, in step 206, a change may be requested by the client that affects the grid windows being viewed, their respective sizes, etc., which may result in changes to the viewing grid and corresponding changes to received video streams (if needed).
In steps 208 and 210, the STU 116 may indicate that a particular resolution is to be stopped or started (step 208) or the view is to be changed (step 210). These steps may be based on information known to the STU 116 about how other client devices are viewing this user and/or based on changing network information (as will be described later in greater detail). For example, as will be discussed in greater detail below, a client device may be showing users in a grid (generally “small” videos), may have one user in a spotlight (a “large” video), etc. Accordingly, the resolution needed to display the client device's video stream in the manner selected by other client devices may be used to determine the resolution of the video stream sent by the client device. In some embodiments, a client device may not send any video stream to the STU 116 if no other client device is viewing the user of the sending client device.
Accordingly, to achieve the control desired for the different video streams, the STU 116 may track a number of items for each client device that is participating in the conference call. These items may include (1) the resolutions capable of being transmitted; (2) for each resolution being transmitted (and not transmitted) by the client device, the other client devices interested in that resolution; (3) the number of small videos (and their resolution) the client device is receiving and from which client devices those videos are being sent; and (4) the number of large videos (and their resolution) and from which client devices those videos are being sent. If a participant is interested in a higher resolution than what the sending participant is transmitting, the STU 116 may send the highest available resolution. For example, if the client device 104 wants to receive video at 1080p, but the highest resolution video from the client device 106 is 720p, then the server 102 will send the 720p video to the client device 104. It is understood that fewer or more items may be tracked for each client device, and some described items may be combined or further separated.
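The fallback to the highest available resolution may be sketched as follows. This is a simplified illustration under stated assumptions: the function name `select_resolution` is hypothetical, resolutions are compared by the number in their label, and the case in which every available resolution exceeds the request (returning `None`) is an assumption that the text does not address.

```python
from typing import Iterable, Optional


def height(label: str) -> int:
    """Parse a label such as '720p' into its numeric height (720)."""
    return int(label.rstrip("p"))


def select_resolution(requested: str, available: Iterable[str]) -> Optional[str]:
    """Pick the highest available resolution that does not exceed the request.

    Returns None when every available resolution is larger than the request;
    how the STU would handle that case is not specified in the text.
    """
    candidates = [r for r in available if height(r) <= height(requested)]
    if not candidates:
        return None
    return max(candidates, key=height)


# Client device 104 wants 1080p, but client device 106 tops out at 720p.
choice = select_resolution("1080p", ["180p", "480p", "720p"])
```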
By tracking these items, the STU 116 may determine what video frames to forward to various participants. For example, when a 720p video frame arrives from the client device 104, the STU 116 would identify all participants who are interested in viewing this resolution or a higher resolution that is not being transmitted and send it to them. If the number of participants interested in viewing a particular resolution drops to zero, the STU 116 may instruct the client device to stop sending that resolution. It is understood that if no participants are interested in video of any resolution from a particular client device, that device may not send any video until at least one participant requests it. This tracking and video stream control of each participant provides the STU 116 with the ability to support very large conferences with every participant participating with audio and video. It may also reduce the utilization of resources (e.g., server/client CPU and network loads) for a given size and quality of meeting compared to the previously described MCU and SFU approaches, thereby resulting in more optimized conference calls.
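One way to picture this interest tracking is a table keyed by sender and resolution, where the first interested viewer triggers a start instruction and the departure of the last viewer triggers a stop instruction. The sketch below is illustrative only: the `InterestTable` class, its method names, and the "start"/"stop" return values are hypothetical, and it omits the fallback for viewers who requested a higher resolution that is not being transmitted.

```python
from collections import defaultdict
from typing import Dict, Optional, Set, Tuple

Key = Tuple[str, str]  # (sending client id, resolution label)


class InterestTable:
    """Tracks which viewers want which resolution from which sender."""

    def __init__(self) -> None:
        self._viewers: Dict[Key, Set[str]] = defaultdict(set)

    def subscribe(self, viewer: str, sender: str, resolution: str) -> Optional[str]:
        """Record interest; return 'start' if this is the first viewer."""
        key = (sender, resolution)
        first = not self._viewers[key]
        self._viewers[key].add(viewer)
        return "start" if first else None

    def unsubscribe(self, viewer: str, sender: str, resolution: str) -> Optional[str]:
        """Drop interest; return 'stop' when the viewer count reaches zero."""
        key = (sender, resolution)
        if key not in self._viewers:
            return None
        self._viewers[key].discard(viewer)
        if not self._viewers[key]:
            del self._viewers[key]
            return "stop"
        return None

    def recipients(self, sender: str, resolution: str) -> Set[str]:
        """Viewers to whom an arriving frame of this resolution is forwarded."""
        return set(self._viewers.get((sender, resolution), set()))


table = InterestTable()
first = table.subscribe("106", "104", "720p")    # first viewer: start the stream
second = table.subscribe("108", "104", "720p")   # stream is already running
```

When a 720p frame arrives from the client device 104 in this sketch, `table.recipients("104", "720p")` yields the set of client devices to which it would be forwarded.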
Referring to
In
While a client device may be capable of transmitting at a particular resolution (e.g., 720p), its ability to do so may be limited by network conditions and/or available bandwidth. In the present embodiment, the client device and the STU 116 may track the network conditions for that device in the conference call and, if needed, make adjustments to the video streams for that particular client device. For example, both the client device and the STU 116 may monitor for errors in the incoming packets and take actions based on detected errors.
In the present embodiment, aspects of the error calculation may include: (1) computing errors across all video streams and audio streams, and averaging those errors on a time interval (e.g., every five seconds); (2) if the average error rate exceeds a certain percentage over a certain number of periods (e.g., five percent (5%) for three consecutive periods), a step-down action may be executed; (3) if the average error rate is below a certain percentage for a certain number of periods (e.g., one percent (1%) for three consecutive periods), a step-up action may be executed; (4) there may be no step-down timer (e.g., immediately after one step-down and three more time periods of five percent loss, there can be another step-down action); and/or (5) there may be a step-up timer (e.g., of a time interval of one minute or some other period), in which case after the execution of a step-down or step-up action, the next step-up action cannot be taken for a certain period of time (e.g., one minute). It is understood that these aspects may be varied in many ways, and this is only one possible set of actions that may be taken. For example, percentages may be increased or decreased, and time intervals may be lengthened or shortened, or may be implemented with a dynamic aspect such as a sliding window.
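The five aspects above may be sketched as a small state machine that is fed one averaged error rate per period. This is a minimal sketch using the example values from the text (five-second periods, 5% and 1% thresholds, three consecutive periods, a one-minute step-up timer); the `StepController` class, its field names, and the "step-down"/"step-up" return values are hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class StepController:
    down_threshold: float = 0.05     # step down above 5% average error
    up_threshold: float = 0.01       # step up below 1% average error
    periods_required: int = 3        # consecutive periods needed to act
    period_seconds: float = 5.0      # averaging interval
    step_up_holdoff: float = 60.0    # step-up timer after any action
    _high: int = 0                   # consecutive periods above down_threshold
    _low: int = 0                    # consecutive periods below up_threshold
    _now: float = 0.0                # elapsed time, advanced once per period
    _last_action_at: Optional[float] = None

    def observe(self, avg_error_rate: float) -> Optional[str]:
        """Feed one period's averaged error rate; return an action or None."""
        self._now += self.period_seconds
        if avg_error_rate > self.down_threshold:
            self._high, self._low = self._high + 1, 0
        elif avg_error_rate < self.up_threshold:
            self._high, self._low = 0, self._low + 1
        else:
            self._high, self._low = 0, 0

        # No step-down timer: three more bad periods allow another step-down.
        if self._high >= self.periods_required:
            self._high = 0
            self._last_action_at = self._now
            return "step-down"

        # Step-up is blocked until the holdoff since the last action expires.
        if self._low >= self.periods_required:
            held = (
                self._last_action_at is not None
                and self._now - self._last_action_at < self.step_up_holdoff
            )
            if not held:
                self._low = 0
                self._last_action_at = self._now
                return "step-up"
        return None


controller = StepController()
lossy = [controller.observe(0.08) for _ in range(6)]   # 30 seconds of 8% loss
clean = [controller.observe(0.0) for _ in range(12)]   # 60 seconds of no loss
```

With these example values, six lossy periods produce two step-down actions, and the following step-up is deferred until one minute has elapsed since the second step-down, even though the error rate has been low for more than three periods.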
The parameters may be configurable for a particular conference call or as applied to the STU 116 and client devices in general. One possible implementation of the above aspects may be as follows:
The values may evolve over time during a conference call. For example, the default values may be used at the start of the call and then gradually modified as the call continues, so that suitable parameters are discovered quickly without constant changes being made in response to intermittent errors.
The STU 116 may check for loss across all video streams and the audio stream for each participant and decide on whether to execute step-down or step-up actions. It may instruct the client device to take these actions through streamcontrol messages (e.g., via a signaling channel rather than Real-Time Transport Control Protocol (RTCP)). One possible implementation of step-up and step-down levels for the STU 116 may be as follows:
Each client device may check for loss across all video streams and the audio stream and decide on whether to execute step-down or step-up actions. The client device may instruct the STU 116 to take these actions through changeview messages (e.g., via a signaling channel rather than RTCP). One possible implementation of step-up and step-down levels for a client device may be as follows and may vary depending on whether the user is in grid view or spotlight/VAD view.
For the spotlight/VAD view:
For the grid view:
Referring to
Referring to
Referring to
Referring to
Referring to
In the present example, the grid 900 is displayed via a browser (e.g., using Chrome, Safari, Internet Explorer, Brave, Opera, or a similar browser) without the use of a client-side application or plug-in on the client device 104. It is understood that in other embodiments the present disclosure may be applied to environments in which a client device uses an application or browser plug-in to communicate with the STU 116, and the description of browser-only communications is not intended to be limiting.
By relying strictly on the browser's inherent capabilities without the use of applications or plug-ins, the ability to join and participate in the conference call may be available to any client device with a browser. This simplifies joining a conference call, and enables joining even if the client device does not permit the download or installation of applications or browser plug-ins (e.g., due to security restrictions). Furthermore, this provides a level of security to the client device, as there are no downloads to be installed or authorized in order to access the conference call. In addition, as many browsers are widely used and frequently updated for security reasons, the user of the client device need not be concerned about potential application or plug-in flaws that might compromise the client device's security if not updated. In addition, by relying only on the client device's browser, there is less chance of needing an update before joining a conference call, as might happen if an application or a plug-in has not been used for a while. This also enables mobile devices to fully participate in a conference call using only their built-in browser (or another browser that is selected by the user).
To accomplish this browser-focused conferencing, the server 102 may rely on a defined framework, such as the WebRTC framework, to provide complete conference functionality. Other technologies may also be supported by the solution, such as the Unified Plan for SDPs as supported by Safari, and/or Plan-B that is supported by Chrome and other platforms. Further support may be provided using a cross-platform JavaScript SDK that is fully featured. It is understood that any suitable framework or technology may be used with the present disclosure, and the selection and use of such frameworks and technologies may depend on a variety of factors and may change over time due to the adoption and/or deprecation of such frameworks and technologies.
Each video stream that is displayed can be manipulated as though it is a thumbnail in a folder, with each video tile in the grid 900 able to be manipulated in various ways individually. Such manipulation may include selecting, moving, resizing, and dragging-and-dropping (e.g., to form a break-out session or to rearrange the display arrangement) the video displayed in one or more tiles. Multi-selection of tiles may be possible (e.g., using Ctrl-Click). Once selected, various actions may be performed on the selected user/video using input devices such as a keyboard, left click/right click mouse menus, and/or buttons in the frame. Furthermore, for an individual video or for all the videos as a whole, a user may be able to zoom in/out and/or scroll to see other participants who are currently not displayed.
To achieve this seamless and intuitive user interface (UI), the solution may use the ability of browsers to manage divs. For example, HTML5 allows floating divs that are placed optimally by the browser, as well as enabling various decorations on a div and the ability to vary the size and position of each of these divs independently. By displaying each video stream as a div, the browser can be used to manipulate these divs.
As shown in
With additional reference to
Referring to
Referring to
If the user wants a particular tile or tiles to remain on the grid for viewing, those tiles may be “pinned” to prevent them from moving. This is shown in
It is understood that many different arrangements of tiles may be used, with different grid sizes and shapes, different numbering orders for tiles, different replacement patterns, and other variations. Accordingly, the examples provided herein are for purposes of illustration only and are not intended to be limiting.
Referring to
The message flow 1200 is generally inefficient because Real-Time Transport Protocol (RTP) establishment takes a non-trivial amount of time (e.g., two to five seconds) to complete the following: (1) gather candidates; (2) exchange initial candidates through SDP exchange over a signaling channel; (3) manage ICE connectivity; and (4) perform DTLS channel setup and key exchange. In this model, when a user is moved to a different conference, the RTP is torn down and re-established, which slows down the user experience.
Referring to
Accordingly, in step 1302, the client device 104 sends a message to the server 102 indicating that the client device 104 wants to move to a new conference call. In steps 1304 and 1306, respectively, the client device 104 and the server 102 transfer the existing ports/sockets to the new conference call. In step 1308, SRTP audio/video flow begins. In step 1310, signaling occurs to set up the new conference.
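The idea of transferring existing ports/sockets rather than re-establishing them may be pictured with a purely in-memory model in which an already negotiated transport object is handed from one conference to another. The sketch below is a conceptual illustration only: it opens no real sockets, and the `Transport` and `Server` classes, their fields, and the `setups` counter are hypothetical names introduced for this example.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class Transport:
    """Stands in for an established ICE/DTLS session bound to a local port."""
    client_id: str
    local_port: int
    setups: int = 1  # number of times the costly establishment has run


class Server:
    """Holds, per conference, the transport already established for each client."""

    def __init__(self) -> None:
        self.conferences: Dict[str, Dict[str, Transport]] = {}

    def join(self, conference: str, client_id: str, port: int) -> Transport:
        """Initial join: this is where the full establishment would occur."""
        transport = Transport(client_id, port)
        self.conferences.setdefault(conference, {})[client_id] = transport
        return transport

    def move(self, client_id: str, source: str, target: str) -> Transport:
        """Hand the existing transport to the target conference unchanged."""
        transport = self.conferences[source].pop(client_id)
        self.conferences.setdefault(target, {})[client_id] = transport
        return transport


server = Server()
original = server.join("main", "104", 50000)
moved = server.move("104", "main", "breakout")
```

Because `moved` is the same object as `original`, the establishment count stays at one, which mirrors the point that the move does not repeat candidate gathering, ICE connectivity checks, or the DTLS handshake.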
This model also provides the capability to switch one media (e.g., audio) to a different conference call while keeping other media (e.g., video and other sessions) with the original conference call. This enables participants to view video of a conference call while holding a side conversation via another conference call, all using the same browser-based interface.
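The bookkeeping behind this model can be sketched as follows. This is a simplified illustration under stated assumptions: `Transport`, `Session`, and `move` are hypothetical names, a port number stands in for the existing port/socket, and the `negotiations` counter merely records that ICE/DTLS setup happened once and is not repeated when a medium is reassigned.

```python
from dataclasses import dataclass
from typing import Dict, Iterable, Optional


@dataclass
class Transport:
    port: int              # stands in for the already-open port/socket
    negotiations: int = 1  # ICE/DTLS setup was performed once, at creation


class Session:
    """Hypothetical client-side record of which conference each medium feeds."""

    def __init__(self, conference: str, media_ports: Dict[str, int]) -> None:
        self.transports = {m: Transport(p) for m, p in media_ports.items()}
        self.conference_of = {m: conference for m in media_ports}

    def move(self, new_conference: str,
             media: Optional[Iterable[str]] = None) -> None:
        """Reassign all media, or only the listed media, to another conference.

        The existing transports are reused, so no candidate gathering, SDP
        exchange, ICE check, or DTLS handshake is repeated.
        """
        targets = list(self.transports) if media is None else list(media)
        for name in targets:
            if name not in self.transports:
                raise KeyError(f"unknown medium: {name}")
        for name in targets:
            self.conference_of[name] = new_conference


session = Session("main", {"audio": 5004, "video": 5006})
session.move("breakout", media=["audio"])  # side conversation; video stays put
```

Validating every medium name before changing any assignment keeps a failed request from leaving the session half moved.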
Referring to
Accordingly, in steps 1404 and 1406, respectively, the client device 104 and server 102a send TURN allocate messages to the TURN server 1402. In steps 1408 and 1410, respectively, the client device 104 and server 102a establish DTLS/ICE connectivity with the TURN server 1402. In steps 1412 and 1414, media flow begins between the client device 104 and server 102a via the TURN server 1402. In step 1416, the client device 104 sends a message to the server 102a indicating that the client device 104 wants to move to a new conference call on the server 102b.
In step 1418, the server 102a sends a message to the server 102b requesting admittance of the client device 104 using the original DTLS information. In step 1420, TURN allocation occurs between the servers 102a and 102b. In step 1422, the previous ICE is paired with the new STU of the server 102b. In steps 1424 and 1426, media flow begins between the client device 104 and server 102b via the TURN server 1402.
This model enables a number of features such as fast seamless breakout sessions. Another enabled feature may provide events with multiple conferences where people can hover over a conference to temporarily see the video and hear audio. Participants may do a virtual walk over the floor experiencing the different conferences. The participant may also be a potential collaborator. Still another enabled feature may provide split audio and video that allows a participant to temporarily hear what is going on in another conference. This may be useful in breakout sessions. In such cases, the video may still be with the main conference, but the audio split enables the participant to temporarily listen to a breakout session.
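The cross-server move described above (steps 1416 through 1426) can be sketched as a hand-off of state rather than a new negotiation. The sketch below is an assumption-laden illustration: `DtlsContext`, `TurnRelay`, `ConferenceServer`, and `move_client` are hypothetical names, the fingerprint string and port numbers are placeholders, and the actual message formats are not modeled.

```python
from dataclasses import dataclass, field
from typing import Dict, Tuple


@dataclass
class DtlsContext:
    fingerprint: str  # placeholder for the original DTLS information


@dataclass
class TurnRelay:
    """Hypothetical TURN server that hands out one relay port per owner."""
    allocations: Dict[str, int] = field(default_factory=dict)
    next_port: int = 50000

    def allocate(self, owner: str) -> int:
        if owner not in self.allocations:
            self.allocations[owner] = self.next_port
            self.next_port += 1
        return self.allocations[owner]


@dataclass
class ConferenceServer:
    name: str
    admitted: Dict[str, DtlsContext] = field(default_factory=dict)

    def admit(self, client_id: str, dtls: DtlsContext) -> None:
        self.admitted[client_id] = dtls

    def release(self, client_id: str) -> DtlsContext:
        return self.admitted.pop(client_id)


def move_client(client_id: str, source: ConferenceServer,
                target: ConferenceServer, relay: TurnRelay) -> Tuple[int, int]:
    """Move a client between servers while keeping its relay allocation."""
    # Step 1418 (sketched): the source asks the target to admit the client
    # using the original DTLS information.
    target.admit(client_id, source.release(client_id))
    # Step 1420 (sketched): the target obtains its allocation on the relay.
    target_port = relay.allocate(target.name)
    # Step 1422 (sketched): the client's existing allocation is paired with
    # the target; the client side is not re-allocated.
    return relay.allocate(client_id), target_port
```

Because `allocate` returns the existing port for a known owner, the client's side of the returned pair is the same before and after the move, which is the property that lets media resume without a fresh setup.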
In other embodiments (not necessarily associated with
Referring to
The computer system 1500 may use any operating system (or multiple operating systems), including various versions of operating systems provided by Microsoft (such as WINDOWS), Apple (such as iOS or Mac OS X), Google (Android), UNIX, and LINUX, and may include operating systems specifically developed for handheld devices, personal computers, and servers depending on the use of the computer system 1500. The operating system, as well as other instructions (e.g., for the processes and message sequences described herein), may be stored in the memory unit 1504 and executed by the processor 1502. For example, if the computer system 1500 is the server 102 or a client device 104, 106, 108, the memory unit 1504 may include instructions for performing some or all of the message sequences and methods described with respect to such devices in the present disclosure.
The network 1516 may be a single network or may represent multiple networks, including networks of different types. For example, the server 102 or a client device 104, 106, 108 may be coupled to a network that includes a cellular link coupled to a data packet network, or a data packet link such as a wireless local area network (WLAN) coupled to a data packet network. Accordingly, many different network types and configurations may be used to establish communications between the server 102, client devices 104, 106, 108, servers, and/or other components described herein.
Exemplary network, system, and connection types include the internet, local area networks (LANs) (e.g., IEEE 802.11a and 802.11g Wi-Fi networks), digital audio broadcasting systems (e.g., HD Radio, T-DMB and ISDB-TSB), terrestrial digital television systems (e.g., DVB-T, DVB-H, T-DMB and ISDB-T), WiMax wireless metropolitan area networks (MANs) (e.g., IEEE 802.16 networks), Mobile Broadband Wireless Access (MBWA) networks (e.g., IEEE 802.20 networks), Ultra Mobile Broadband (UMB) systems, Flash-OFDM cellular systems, and Ultra wideband (UWB) systems. Furthermore, the present disclosure may be used with communications systems such as Global System for Mobile communications (GSM) and/or code division multiple access (CDMA) communications systems. Connections to such networks may be wireless or may use a line (e.g., digital subscriber lines (DSL), cable lines, and fiber optic lines).
Communication among the server 102, client devices 104, 106, 108, servers, and/or other components described herein may be accomplished using predefined and publicly available (i.e., non-proprietary) communication standards or protocols (e.g., those defined by the Internet Engineering Task Force (IETF) or the International Telecommunication Union Telecommunication Standardization Sector (ITU-T)), and/or proprietary protocols. For example, signaling communications (e.g., session setup, management, and teardown) may use a protocol such as the Session Initiation Protocol (SIP), while data traffic may be communicated using a protocol such as the Real-time Transport Protocol (RTP), File Transfer Protocol (FTP), and/or Hypertext Transfer Protocol (HTTP). A sharing session and other communications as described herein may be connection-based (e.g., using a protocol such as the transmission control protocol/internet protocol (TCP/IP)) or connection-less (e.g., using a protocol such as the user datagram protocol (UDP)). It is understood that various types of communications may occur simultaneously, including, but not limited to, voice calls, instant messages, audio and video, emails, document sharing, and any other type of resource transfer, where a resource represents any digital data.
While the preceding description shows and describes one or more embodiments, it will be understood by those skilled in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the present disclosure. For example, various steps illustrated within a particular sequence diagram or flow chart may be combined or further divided. In addition, steps described in one diagram or flow chart may be incorporated into another diagram or flow chart. Furthermore, the described functionality may be provided by hardware and/or software, and may be distributed or combined into a single platform. Additionally, functionality described in a particular example may be achieved in a manner different than that illustrated, but is still encompassed within the present disclosure. Therefore, the claims should be interpreted in a broad manner, consistent with the present disclosure.
This application claims the benefit of U.S. Provisional Application 63/176,419, filed on Apr. 19, 2021, and entitled SYSTEM AND METHOD FOR HIGHLY SCALABLE BROWSER-BASED AUDIO/VIDEO CONFERENCING, which is incorporated by reference herein in its entirety.