People use their cellular telephones (e.g., iPhone, Droid, etc.) and other electronic devices to play content, such as music or videos. Herein, a device that provides media is referred to as a “media source device.” Other media source devices include a tablet computer, a laptop computer, a personal computer, etc. The user may have an application such as an MP3 player, a Web Browser, a media player, etc. that allows them to play media that is either stored locally or retrieved from another source, such as the Internet.
Often media source devices do not render the media adequately. For example, the display on a cellular telephone may be too small or the speaker may not be of sufficient quality or volume. Moreover, output of the media source device may not be easily viewable or listenable to more than one person. Furthermore, absent carrying the media source device with them, the user is unable to enjoy the media in various locations throughout their home.
It would be beneficial to the user to be able to view or listen to media content anywhere in their home or other environment. It would be beneficial to the user to be able to selectively choose exactly where the media is rendered. It would also be beneficial if the solution worked with whatever application runs on the media source device in order to play the media.
The technology described herein provides an architecture for distributing media content. A wired and wireless media transport technology is provided that allows for the simultaneous transmission of media to multiple zones while maintaining precise timing synchronization. A user can have a network of speakers, and independently select which ones are actively playing and have their playback synchronized. This network of speakers is referred to herein as a virtual media network. Note that the media signal itself can be audio or video. Therefore, the virtual media network may include display devices.
The media source device can be a cell phone, tablet, stereo, set-top box, PC or other device. The transmission method of media into the network can be wired, as through an auxiliary cable, or wireless as with Bluetooth or WiFi. The speakers themselves may be governed in a self-forming network. Audio may be injected into the network from media source device and the end-point network itself controls audio/video distribution, timing, and rendering. In one embodiment, the audio that is injected into the network is the audio portion of an audio-video signal. The video signal may be played on the media source device (e.g., tablet computer). Note that the audio signal may be kept in sync with the video signal.
In one embodiment, a user can select any media application to serve as a source of the media. For example, the user could select an MP3 application, an Internet radio application, etc. The user then simply selects an output device, such as a speaker in their living room, to cause the media to be sent to the selected output device. The audio may be sent to the selected output device by the operating system. The user can call up a second application to add other speakers to the virtual media network, as well as to control volume of the speakers, etc. The second application never touches the audio, in one embodiment. The devices in the network may handle the audio/video distribution, timing, and rendering. Therefore, the media source device is not burdened with this processing. Moreover, note that this solution allows the user to select whatever media application they like as the source of the media. No modifications are needed to the media source application.
The following definitions will be used throughout this description:
Broadcaster—Any device that can transmit a media stream that is formatted for the virtual media network. May also refer to a broadcasting mechanism within the device.
Renderer—Any device that can render a media stream that is formatted for the virtual media network. May also refer to a rendering mechanism within the device.
Media Node—Any device that contains a renderer or a broadcaster. Nodes of one embodiment are responsible for maintaining network time synchronization and the state of the network including media routing information.
Media source device—Any device that transmits original media to a sink.
Sink—Any device that receives originating media from a source. May also refer to a mechanism within the device for receiving a media signal.
Gateway Capable Media Node—Any device that combines a sink and broadcaster. Gateways accept media from a sink and re-broadcast into the virtual media network to renderers.
Virtual Media Network—A group of one or more nodes having at least one gateway. A virtual media network may be established by a user and renders a media signal that is synchronized between all rendering devices in the network. Note that only one media node serves as an active gateway in one embodiment of a virtual media network.
In one embodiment, the system allows for simultaneous transmission of media to multiple zones while maintaining precise timing synchronization. As one example, a user can have a network of speakers, independently select which ones are actively playing and have their playback synchronized. The transmission method of media into the network can be wired, as through an auxiliary cable, or wireless as with Bluetooth, WiFi or another network communication protocol. As one example, the living room gateway may have an auxiliary out line to provide the media signal to the stereo receiver by one of its auxiliary in lines. On the other hand, the living room gateway may provide the media signal to the office renderer and the kitchen renderer via wireless transmission. Thus, note that the living room gateway may or may not have its own renderer.
The media nodes 104 themselves may be governed in a self-forming network, in one embodiment. Note that the media nodes 104 themselves may control audio/video distribution, timing, and rendering. Therefore, much of the processing load is removed from the media source device 102. Therefore, a device such as a cellular telephone, which may have limited processing power, is not burdened. The example of
In step 204, a media source device 102 is paired with a gateway media node 104. As noted above, each virtual media network has one gateway media node 104, in one embodiment. A user may specifically select one media node 104, which will serve as the gateway, or the gateway may be determined automatically without user intervention. For example, the user of smartphone 102a may select the living room media node as a primary listening device, which results in it becoming the gateway. In one embodiment, the gateway media node is selected based on its status as a currently active output device for the media source node 102. In one embodiment, the gateway media node serves as an active output device for the media source node 102 while acting as the gateway. In one embodiment, the gateway media node reports the device or state information to the media source device 102. Further details are discussed with respect to
In step 206, a virtual media network is formed. Step 206 may be formed in response to a user selecting media nodes 104. For example, the user accesses a software program on media source dice 102 (e.g., smartphone) that allows the user to select media nodes 104. Note that if a media node 104 is already a part of a different virtual media network, this media node 104 might be indicated as unavailable. Alternatively, the user might be allowed to request that this media node 104 be freed up. In one embodiment, step 206 results in instructing the gateway media node 104 to forward the media signal to other media nodes 104 in the virtual media network. Further details are discussed with respect to
In step 208, media is transferred from the media source device 102 to the gateway media node 104. This step 208 could be initiated in response to a user selecting that media be presented on an output device associated with the media source. For example, the user could have any application running on the smartphone 102a that plays media. The user simply selects the gateway media node 104 as the output device and the media is transferred to the gateway media node 104. Note that this media transfer could happen at the operating system (O/S) level. An implication of this transfer is that any media application can be selected by the user as the media source for the virtual media network.
In step 210, the gateway media node 104 broadcasts the media signal to other media nodes 104 in the virtual media network. For example, the living room gateway broadcasts the media signal it received from smartphone 102a to office renderer and kitchen renderer. Note that each media node 104 may play the media at its own user-controllable level (e.g., volume). Thus, there may be some commands sent from the media source device 102 to the gateway media node 104. However, the gateway may perform much, if not most of the processing. Therefore, the media source device 102 is not bogged down with a heavy processing load.
Some of the media nodes 104 include a broadcaster 304. Such nodes may be referred to herein as broadcasting nodes. A broadcaster 304 may be implemented by any combination of hardware and/or software. In one embodiment, broadcasters 304 transmit media in an airtime broadcast format that is understood by other media nodes 104. Note that this format may be different from the one used to send the media signal from the media source 102. Broadcasters 304 and renderers 306 may co-exist in the same media node 104 so that local playback can be synchronized with playback on remote renderers. Source injection may be done via a source-sink link. Unlike source to sink transmission, airtime broadcasts can be used for point-to-multipoint media transmission with synchronous playback.
As noted, a gateway capable media node 104 has the combination of a sink 302 and a broadcaster 304. In one embodiment, gateways receive media from the media source device 102 and re-broadcast the media in a format that is compatible with the virtual media network. Gateways can also include a renderer 306. In one embodiment, a gateway media node 104 is considered to be an endpoint.
Multiple gateway capable media nodes 104 can exist on the network. In one embodiment, an election method exists to determine the best gateway for a media source device 102 to use. For example, in the event only one media node 104 with a renderer 306 is active for the media source device 102, that rendering node may also be the best gateway, conserving network bandwidth for other sources. On the other hand, if multiple renderers are active for the media source device 102 the best gateway may be the one with the strongest/best network connection. An election scheme may occur to identify the best candidate and, if necessary, a stream handoff may occur to a different gateway in which case the original gateway becomes the source's sink. This can occur during stream construction or mid-stream. In the event that an active gateway is disabled, the network can self-heal and elect a new gateway to re-establish airtime broadcast streams.
Some of the media nodes 104 include a renderer 306. Such media nodes 104 may be referred to herein as rendering nodes. A renderer 306 may be implemented by any combination of hardware and/or software. Renderers 306 can decode and play the media stream through an internally powered speaker, or via analog or digital outs to another amplifier/speaker device, using the example of audio for the media signal. For video, the renderer 306 can decode and play the media stream through an internally powered display, or via analog or digital outs to another display or device having or driving a display. In one embodiment, a media node 104 with a renderer 306 supports the creation, maintenance, and distribution of a virtual wall clock. Renderers 306 may use the wall clock to precisely render the stream at the timestamp specified in the airtime stream format.
A brief discussion will now be provided of different virtual media networks of
In
In the example of
In one embodiment, the broadcaster 304 transmits the media signal using a different network protocol than the one used to send the media signal to it. For example, the media source 102 might send the media signal using a Bluetooth protocol. The broadcaster 304 might reformat this signal and send it using a Wi-Fi protocol.
In the example of
In the example of
The example of
The example of
In the example of
As previously noted, media source devices 102 inject media into the virtual media network. Examples include a PC or a SmartPhone. Methods of media injection include cables supporting analog or digital transmission, Bluetooth, and WiFi. In one embodiment, the media source 102 can be a broadcaster (as in
Note that many formats and connections may be used for the transmission from media source device 102 to sink 302. A media source 102 can transmit via wire, BT A2DP, or a specific protocol via Wi-Fi to a sink 302, as some non-limiting examples. A WiFi protocol can be designed to give a tradeoff between quality and latency, or to guarantee accuracy. For example, the protocol can detect errors and request retransmission of data. Often this may not be the goal of the broadcast; however, it is important that the media arrives reliably prior to broadcasting. Embodiments disclosed herein maintain compatibility with existing devices. Note that most smartphones support BT and wired connections.
The network is based on standard Wi-Fi infrastructure, in one embodiment. Each media node can connect to an access point 310 where it acquires an IP address via DHCP. Often nodes will not have a UI (display, keyboard entry, etc.) that allows for the entering of a wireless access key. In such cases, WPS-PBC can be used to achieve a connection. Other methods can include ad-hoc mode, whereby the user connects to the endpoint directly from a GUI enabled device and inputs network parameters via a webpage served by the node, or an application page that communicates directly with the node. Another method is for an application running on a phone or other device to communicate with the media node via Bluetooth. An application can prompt the user for which access point to connect to and the corresponding network access code. In one embodiment, the media node 104 is provided a name by the user during this set-up phase.
In the absence of infrastructure such as access points 310, a node can turn itself into a virtual access point. Other nodes can discover the access point 310 and connect to form a private network. WPS-PBC and ad-hoc methods can be used to make secure connections.
In step 402, the network media node 104 broadcasts its device status and state information. Step 402 may be performed periodically. The device status and/or state information may include the type of device it is, what capabilities it has, and the amount of processing bandwidth available. Device status and/or state information may also include whether the media node 104 is currently serving as a gateway, whether it is currently part of a virtual media network, its volume level, etc.
In step 404, a new media node is found. In one embodiment, the media node 104 receives device status from other media nodes 104. In one embodiment, step 404 is analogous to step 402, but describes receiving device status, as opposed to providing device status. Typically, media nodes 104 both provide their status and receive status from other media nodes 104.
In step 406, the newly found media node is added to a list. This list may include various device status and state information. The device description could include a name that has been assigned to the newly found device (e.g., kitchen, living room, etc.), its IP address, and its MAC address. The device description may also indicate whether the newly found node has a broadcaster 304, and whether it has a sink 302. Therefore, this information may indicate whether the newly found node 104 has the physical ability to act as a gateway. The device description may further indicate such things as whether it has its own speaker, or whether it has an auxiliary line out to send the media signal to a stereo receiver or the like. The state information for a particular media node 104 may include, but it is not limited to, a virtual network name of which it is a part (which may have been provided to it by a media source), whether it presently has communication links to other devices in a virtual network, volume level. The media node 104 may store information for all of the media nodes 104, such that it can provide the media source 102 with whatever information is necessary. Also, the media node 104 is able to control the virtual media network using this state information. Note that each media node 104 may store the state information such that it is capable of taking over as the gateway media node 104.
From time to time a media node may disappear. If this happens (step 408), then an asymmetric verification is performed in step 410. The asymmetric verification may guard against incorrect state transitions due to transient network outages. Pending the outcome of the asymmetric verification, the media node may be removed from the list.
Step 412 indicates that the media node 104 may pause for a pre-defined period of time before broadcasting its device status again.
In step 502, the media source device 102 sends a request to a media node 104 for state information. Note that this media node 104 may be one that is being targeted to become the gateway for a virtual media network.
In step 504, the media source device 102 receives the state information from the media node 104. At this time, the virtual media network could include any number of active media nodes 104. However, for the sake of discussion an example will be discussed in which the gateway is the only media node 104 that is active.
In step 506, the media source device 102 pairs with the media node 104. Pairing refers to establishing the media node 104 as a gateway for a virtual media network being served by the media source device 102. Numerous techniques can be used to determine which media node 104 should serve as the gateway. Further details are discussed with respect to
The pairing protocol starts with the media source 102 sending a request challenge to the potential gateway media node 104. As the quality of the network is dependent on the information made available from the nodes, a security mechanism exists in one embodiment to prevent un-sanctioned nodes from joining the virtual media network. Media nodes 104 are required to pass a challenge-response query when joining the virtual media network in one embodiment. If a device does not have the proper security keys to complete the challenge-response, it will not be allowed to join the virtual media network. The security mechanism prevents the attachment of counterfeit devices and helps maintain the integrity of the virtual media network.
If the gateway media node 104 responds correctly, then the media source 104 sends a pair request message to the gateway media node 104. The gateway media node 104 determines whether it is able to serve as the gateway. If so, in it sends a grant response to indicate that it will serve as a gateway. If it cannot serve as the gateway it indicates this in its response.
Assuming that the pairing was granted, the media source device 102 sends an encrypted block cypher. Media streams can be optionally encrypted prior to transmission preventing streams from being sniffed from the network. The media source device 102 may now send encrypted audio to the gateway media node 104.
Referring back to
In step 602, the media source device 102 presents a list of available media nodes 104 to add to the virtual media network. This list may be based on the state information that was received in process 500. Step 602 may be performed by a virtual media network application (
In step 604, a selection of a media node 104 is received. This may be received by the virtual media network application 740. As one example, the user selects the bedroom speaker.
In step 606, the media source device 102 sends a link request to the gateway media node 104 to add the new media node 104 to the virtual media network. In one embodiment, virtual media network application 740 sends the link request.
In step 608, the gateway media node 104 links with the new node 104. In step 610, the gateway media node 104 sends back the response to the media source 102 that the new node has been linked. The user is able to add any number of media nodes to the virtual media network by selection of additional media nodes 104.
The protocol starts with the media source 102 sending an add link request to the gateway media node 104. This request may identify the potential new media node 104 using any of the state information that is stored at the gateway media node 104, in one embodiment. The new node might be identified by speaker name, MAC address, IP address, etc.
Similar to how a gateway node may need to pass a challenge-response query when joining the network in one embodiment, the new media node 104 may also be required to do so. Thus, the gateway node 104 sends a request challenge to the potential new media node 104. If the node media node 104 responds correctly, then the gateway media node 104 sends a link request message to the new media node 104. The new media node 104 may determine whether it is able to take part in the virtual media network. For example, if it is already in another virtual media network it may decline the invitation to join the network, in one embodiment. If it decides to join, it sends a link granted response.
Assuming that the link was granted, the gateway media node 104 informs the media source device 102 that the link was granted. Also, the gateway media node 104 may send an encrypted block cypher to the new media node 104. This may or may not be the same cypher that the gateway was sent from the media source device 102. Note that the gateway media node 104 may use a different encryption than is used by the media source device 102. The gateway media node 104 may now send encrypted audio to the new media node 104.
The rendering module 306 is responsible for processing the media signal for presentation on the speakers or other output device. Optionally, the media node 104 has or is connected to a video display 712. In this case, the rendering module is responsible for processing the media signal for presentation on the display. The rendering module may receive the media signal from any of the network interfaces.
The broadcasting module 304 is able to forward a media signal to appropriate media nodes 104. The auxiliary output may be used to provide a media signal to a device such as a home stereo system. In one embodiment, the broadcaster 304 handles forwarding media signals to the auxiliary output.
The command module is able to process commands to control the media signal. These commands could include volume, play, pause, etc. The synchronization module is responsible for precise synchronization of the media signal during playback on the various media nodes in the network.
Media nodes 104 can be controlled through a variety of mechanisms. Controllers can include a SmartPhone App, Tablet App, a UI on a TV or set-top box, buttons with or without a display on the node, or a PC app. In one embodiment, these devices can control whether a renderer 306 renders a particular stream, the volume output of the renderer 306, and a master volume.
In one embodiment, all media nodes 104 support a command protocol. The command protocol may include methods to turn on/off audio playback, aggregate audio playback into synchronized zones, transport controls such as play, forward, reverse, and seek, metadata transmission to nodes, announcement of network state to devices joining the network, updates of state when devices leave the network, control via remote user interfaces, and other messages and method to maintain the airtime network.
Note that the elements of the media node 104 may be implemented with software, hardware, or a combination of software and hardware. The media node 104 may have one or more processors and computer readable storage media with instructions thereon, which when executed on the one or more processors, implement functionality of various elements of the media node 104. An example device having a processor and computer storage is discussed later.
A user can access the virtual network media application 740 to control the virtual media network. As one example, the virtual network media application 740 may present a user interface to allow the user to select media nodes 104, control their volume, playback etc. In one embodiment, there is a master volume for the network and individual volumes for each media node 104.
The media source application 742 could be any application that is capable of playing audio on the media source device 102. For example, it could be a MP3 player, an Internet audio, a web browser, etc. In one embodiment, the media will be played on whatever output device is selected by the user. This output device selection may be under control of the O/S 750. For example, the O/S 750 may provide for a pop-up window that allows the user to select the output device. One or more of the media nodes 104 may appear as selections. By simply selecting one of the media nodes 104, the media signal associated with the audio application is sent from the media source device 102 to the selected media node 104 over network interface 722B. In one embodiment, the media library 752 is used to decode the media. The media library sends the decoded media to the network media driver 754, which sends the media signal to the selected output device. If the media node 104 is selected as the output device, the media signal is sent over network interface 722B. In one embodiment, the network media driver 754 is a Bluetooth driver. However, network media driver 754 may be compliant with any protocol.
Note that with the foregoing embodiment, the virtual media application 740 never touches the media signal. This has the advantage that any media source application 742 may be used when sending the media signal to the media node 104 simply by selecting the appropriate output device for the media source device 102. Thus, one embodiment of a virtual network media application is compatible with any media source applications 742. Moreover, no changes are required of to the media source application 742.
As has been previously discussed, one embodiment of a gateway media node 104 has the ability to perform any needed reformatting and processing of the media signal so that it is compatible with the virtual media network. Thus, the gateway media node 104 offloads much of the processing from the media source device 102.
In step 804, a network link is established between the media source device 102 and the selected speaker using network interface 722B. Note that this link may be established at the O/S/level.
In step 806, the user begins to play audio using the media source application 742. In step 808, the media library 752 decodes the audio and sends it to the network media driver 754. In step 810, the network media driver 754 streams the audio to the selected speaker over network interface 722B. In one embodiment, the audio is the audio portion of an audio-video signal. The video signal may be played on the media source device 102 (e.g., tablet computer). Note that the audio signal may be kept in sync with the video signal.
In step 812, the user selects the virtual network media application 740. In step 814, a link is established between the media source device 102 and the speaker using network interface 722A. The virtual network media application 740 may initiate this link. In one embodiment, the authentication protocol of
In order to identify the proper speaker in step 814, in one embodiment, the virtual network media application 740 queries the O/S using an API to determine which speaker the user is presently streaming audio to. In one embodiment, the virtual network media application 740 asks the user for the name of the speaker that they are presently streaming audio to. Since the speaker stores its name, the virtual network media application 740 can learn that when it receives state information from media nodes (e.g., step 504,
In step 816, the user enters commands into a UI that is provided by the virtual network media application 740. These commands could be to add new speakers, control the volume, send commands such as “play,” “pause,” “rewind”, etc. Note that commands may be entered in many ways such as checking a box, moving a slider, using a remote control, etc. In step 818, the commands are sent to the speaker using network interface 722A.
Note that although
The virtual network media application 740 may be similar to the one described in
In this embodiment, a command channel is used to send commands using network interface 720. A data channel may be used to send the media signal using network interface 720. In one embodiment, the network interface 720 is compliant with Wi-Fi. However, the network interface 720 could be compliant with another protocol. Moreover, it is not required that the commands and data be sent using the same network protocol.
Note that by having a driver in the O/S, media signals from any media source application 742 may be sent to the media node 104. All the user needs to do is to select one of the media nodes 104. In response, the virtual network media driver 784 is used. Therefore, the virtual media network can be used with any media source application 742 that runs on the media source device 102.
In step 904, a network link is established between the media source device 102 and the selected speaker using network interface 722. In one embodiment, the virtual network media driver 784 initiates this link. In one embodiment, the authentication protocol of
In step 906, the user begins to play audio using the media source application 742. In step 908, the media library 752 decodes the audio and sends it to the virtual network media driver 784. In step 910, the virtual network media driver 754 streams the audio to the selected speaker over network interface 722. In one embodiment, the audio is sent using Wi-Fi, although another protocol may be used. In one embodiment, the audio is the audio portion of an audio-video signal. The video signal may be played on the media source device 102 (e.g., tablet computer). Note that the audio signal may be kept in sync with the video signal.
In optional step 912, the user selects the virtual network media application 740. In step 914, the user enters commands into a UI that is provided by either the virtual network media application 740 or the virtual network media driver 784. These commands could be to add new speakers, control the volume, send commands such as “play,” “pause,” “rewind”, etc. In step 916, the commands are sent to the speaker using network interface 722. In one embodiment, this is the same communication link that was established by the virtual network driver 784. However, another communication link could be established. There may be two channels associated with the communication link such that the audio signal and commands are sent on separate channels. Note that steps of
In step 1008, the user selects the media source application 742 that is embedded within the virtual network media application 740. In step 1010, the user begins to play audio using the media source application 742. In step 1012, audio is streamed to the selected speaker over network interface 722A. In one embodiment, the audio is the audio portion of an audio-video signal. The video signal may be played on the media source device 102 (e.g., tablet computer). Note that the audio signal may be kept in sync with the video signal.
In step 1014, the user enters commands into a UI that is provided by the virtual network media application 740. These commands could be to add new speakers, control the volume, send commands such as “play,” “pause,” “rewind”, etc. In step 1016, the commands are sent to the speaker using network interface 722A. In one embodiment, this is the same communication link that was established in step 1006. However, another communication link could be established. There may be two channels associated with the communication link such that the audio signal and commands are sent on separate channels. Note that steps of
In one embodiment, all media nodes 104 synchronize to a virtual wall clock. The virtual wall clock may be used by the broadcaster 304 to timestamp the media stream with the intended render time. The virtual wall clock may be used by renderers 306 to precisely render the media samples at given time. The virtual wall clock ensures that all media nodes 104 have a common understanding of render time. In one embodiment, each rendering device 306 renders samples at the time specified in the media stream. Other information for the rendering of the stream may also be included in the stream format including sampling frequency, word size, number of channels, encoding format, etc.
In step 1104, the gateway media node 104 receives an audio signal from the media source device 102. In step 1106, the gateway media node 104 decodes the audio. The gateway may de-multiplex the audio signal prior to decoding.
In step 1108, the gateway media node 104 re-encodes the audio for broadcast to other media nodes 104. Note that the gateway may use a different encoding than the media source device used. For example, the audio signal may have been encoded at the media source device in a format that is compatible with Bluetooth. It may be re-encoded in a format that is compatible with Wi-Fi.
In step 1109, the gateway media node 104 encapsulates the audio signal. In one embodiment, the gateway media node 104 compresses the audio signal. As an example, in high quality networks, a light lossless compression technique such as Free Audio Lossless Codec (FLAC) can be used to cut bandwidth in half with minimal processing overhead. In low quality networks, a higher compression standard such as OGG or Advance Audio Coding (AAC) can be used to minimize network bandwidth at the expense of sound quality and processing overhead. Beyond the compression algorithm itself, the signal can resampled to a lower sampling rate, down-mixed to a mono stream, or down-sampled to a lower sample resolution. Encoding or transcoding the media stream to a compressed form can improve airtime reliability by using less network bandwidth at the expense of processing overhead. Supported codecs can include lossless and lossy compression techniques of various bitrates, sampling frequencies, channels, and sample sizes.
All media nodes 104 are cognizant of the supported encoding formats, in one embodiment. All broadcasters 304 are capable of encoding into the supported formats, in one embodiment. All renderers 306 are capable of decoding the supported formats, in one embodiment. The encoding format that is used for each stream may be determined among the media nodes 104 with feedback from network quality, available processing resources, the number of rendering zones being supported, the number of active streams being supported, and the maximum acceptable latency.
In optional step 1110, redundant packets are added. If the audio signal has been compressed, additional packets may be added. In one embodiment, a group of packets is interleaved with a group of redundant packets. For example, with a 2:1 compression ratio, two seconds of the original audio signal may be compressed to one second. As one example, one second worth of (compressed data) packets may be interleaved with one second of redundant packets. The number of packets in a group could be one or higher.
Broadcasting has two options in one embodiment. In option A, the gateway media node 104 broadcasts the audio signal to other media nodes 104 (step 1111). In option B, the gateway media node 104 sends the audio signal to a wireless access point 310 (step 11112). The wireless access point 310 broadcasts the audio signal to other media nodes in step 1114.
Broadcast media may be the largest consumer of network bandwidth. Typical uncompressed audio streams can exceed over 1.5 mbps. Transmission can consume 1.5 mbps per stream up to the access point 310 and an additional 1.5 mbps per stream down to the renderer 306 for a total of 3 mbps. For point-to-point simulcasting, the typical bandwidth may be 3 mbps times the number of simulcast streams. This has the potential for saturating the network.
Embodiments support multiple transmission protocols. In one embodiment, UDP over IP is used. Note that in one embodiment, the receiving media node is not required to acknowledge reception of packets. For example, UDP over IP may not require reception of packets. In one embodiment, the receiving media node requests the gateway to re-send a data packet that is not received. Note that this may occur in an embodiment that uses UDP over IP. As mentioned above, in one embodiment, redundant data packets are sent.
Network statistics may be maintained by media nodes 104. The elected broadcaster 304 or gateway is responsible for determining the best transmission methods to balance quality of service, latency, processor utilization, and network utilization, in one embodiment. For example if the network is of good quality, with high available bandwidth and strong connections to individual nodes 104, a guaranteed transmission protocol can be used. If the network is saturated or of lower quality, a multicasting technique may be preferable. Additional methods can help conserve bandwidth, and detect, correct or conceal transmission errors. In general, multicasting, simulcasting and point-to-point protocols are supported with the most suitable protocol determined at the time of stream construction with network quality, available processing power, and the number of streams being contributing factors in the decision process.
In step 1116 all of the media nodes 104 in the virtual media network synchronously play the audio signal. In one embodiment, a renderer 306 de-muxes and decodes the stream and renders at the time specified in the encapsulation. Note that the gateway device itself could save an already de-muxed version of the media signal such that it does not need to de-mux again. In one embodiment, the gateway node 104 sends the stream to itself in the form of a rendering thread.
In one embodiment, the audio is the audio portion of an audio-video signal. The video signal may be played on the media source device 102 (e.g., tablet computer). Note that the audio signal may be kept in sync with the video signal.
The media clock may be recovered through the media stream with reference to the wall clock and may be synchronized to media frames or groups of samples. The media clock drives the formation of the hardware frame clocks, word clocks and bit clocks. Synchronizing via the media stream guarantees accurate clocks can be generated at the media nodes 104 from a logical viewpoint. Slight variations in hardware, such as with crystals, can cause clock drift and other variances in clock timing. Constant measurement and comparison of the media clock and wall clock allows the system to detect drift. In one embodiment, a software-only media clock recovery mechanism involves adding or removing media samples to and from the media rendering buffers to re-sync media clocks across devices. In one embodiment, the rendering buffer manipulation is done in a way that does not cause the effects of obvious clicking or skipping. A hardware mechanism, using VCXOs, or voltage controlled oscillators, can be controlled from the processor based on drift measurements and push or pull the hardware oscillators into tighter synchronization.
Depending on the stream format, errors can occur. Sources of errors include lost packets, out-of-order packets, or packets that arrive after the time-stamped play time. Renderers 306, in conjunction with broadcasters 304, can provide different methods to conceal and/or prevent errors.
In multicast sessions, errors can be detected when a packet does not arrive by comparing the sequence numbers of arrived packets. If a packet is lost during a multicast transmission, a renderer 306 can send a negative acknowledgement to the broadcaster 304 and ask for retransmission of a given packet. If not enough time is available for a re-transmit (acceptable latency) or network bandwidth does not allow retransmission, the renderer 306 can conceal the error by muting the audio output during the affected render time, or re-forming the audio signal through signal processing techniques such as filtering.
If packets arrive out of order, the renderer 306 can re-order arrived packets prior to output to the audio device. This may be dependent on the pre-determined network latency.
If a particular broadcaster-renderer link is poor, the link has the potential of affecting the quality of all of the links in the network. Constant re-transmissions, and re-measurement of network performance consume bandwidth and may add unnecessary latency and processor burden. In bad network environments, guaranteed delivery links such as TCP/IP can be used to ease processor utilization at the expense of greater bandwidth utilization. These links essentially prevent error cases from happening in the airtime rendering subsystem. Note that TCP-IP is not required. Alternatively, where network bandwidth is plentiful, this method can be used as the standard broadcast method.
In some embodiments, a longer acceptable rendering latency can be negotiated between the media nodes 104 to deliver higher QoS. This latency can be changed mid-stream or at the start of stream construction. Latency improves QoS by allowing more time for correction or concealment mechanisms to take effect. In some cases, such as with audio synchronization with games or video, only lower latencies are tolerable even if a higher error rate results.
The network media driver 754, virtual network media driver 784, virtual network media application 740, or other O/S driver or application can transmit the media signal (e.g., audio) in many formats. In one embodiment, the media signal is transmitted from the media source node 102 using raw PCM. In one embodiment, e media signal is transcoded to a generic format such as FLAC. In one embodiment, the network media driver 754, virtual network media driver 784, virtual network media application 740, or other O/S driver or application intelligently elects to use the native source format. For example, if the source file is an MP3, the code on the media source node 102 can elect to send the MP3 as a stream to the gateway media node 104 and the gateway media node 104 can rebroadcast the MP3 to rendering media nodes (after the gateway instruments the signal timing).
In one embodiment, the network media driver 754, virtual network media driver 784, virtual network media application 740, or other O/S driver or application can instrument the native format and send it directly to rendering media nodes 104. This saves the transcoding that would otherwise happen in the gateway media node 104 or media source device 102, and will generally use less bandwidth.
In step 1326, the media source node 102 determines whether to send using native format or another format that is supported by the media node 104. If a media source device 102 supports the native format, then timing information is added by the media source device (step 1328) and the media source device 102 sends the media signal to the media nodes that support the native format 104 (step 1330). In OS architectures there may be media decode facilities (e.g., DirectShow, or OpenCore, or gStreamer) that the application pumps a stream or file data to. The functionality may be modified at this level to selectively transcode or bypass and transmit through the driver. If the format is not supported by a media node 104, raw PCM or transcoding to a supported format like FLAC could be done. This is depicted as steps 1322 and 1334.
In one embodiment, an audio signal that is played in the virtual media network is synchronized to a video signal. As one example, the media source device 102 provides the video portion of an audio-visual signal to a display. The audio portion of the signal is sent to the gateway media node 104, which broadcasts it to other media nodes 104 in the virtual media network.
The display could be any device. The display could be a part of the media source device 102. Alternatively, the video signal could be sent wirelessly or by wireline to a display or device having a display. The display may or may not be associated with a node in the virtual media network. As examples, the display could be a tablet computer, television, cellular telephone, etc.
In one embodiment, synchronizing audio to video includes having a render time for video and a render time for audio. The video render time is used to control when the video is rendered on the display. The media source device 102 may send the audio render time to the gateway media node. Therefore, the audio may be kept synchronized with the video. The audio render time may be used to allow multiple media nodes 104 to play the audio in synchronization with the video.
Portable storage medium drive 562 operates in conjunction with a portable non-volatile storage medium, such as a floppy disk, to input and output data and code to and from the computer system of
User input device(s) 560 provides a portion of a user interface. User input device(s) 560 may include an alpha-numeric keypad for inputting alpha-numeric and other information, or a pointing device, such as a mouse, a trackball, stylus, or cursor direction keys. In order to display textual and graphical information, the computer system of
The components contained in the computer system of
The technology described above can be implemented using hardware, software, or a combination of both hardware and software. The software is stored on one or more processor readable storage devices including hard disk drives, CD-ROMs, DVDs, optical disks, floppy disks, tape drives, RAM, ROM, flash memory, or other suitable storage devices. The software is used to program one or more processors to perform any of the processes described herein. In alternative embodiments, some or all of the software can be replaced by dedicated hardware including custom integrated circuits, gate arrays, FPGAs, PLDs, and special purpose computers.
One embodiment includes a method for distributing media, comprising the following. A media source device receives state information that describes media nodes that are potentially available to form a virtual media network. One or more selections of one or more media nodes that are to form a virtual media network are received. A first of the media nodes in the virtual media network that is selected as an output device in an operating system interface is instructed to forward a media signal that the first media node receives from the media source device to other media nodes in the virtual media network.
One embodiment includes a network device, comprising a first network interface for receiving a media signal from a media source device using a first network protocol, a second network interface for receiving a media signal from a media source device using a second network protocol, and a broadcaster for transmitting media signals received from both the first network interface and the second network interface to another device using the second network protocol.
One embodiment includes one or more processor readable storage devices having processor readable code embodied on said processor readable storage devices, said processor readable code for programming one or more processors to perform a method comprising the following steps. A media source device receives state information that describes media nodes that are potentially available to form a virtual media network. A first of the media nodes is established as a gateway media node. The gateway media node is requested to link to one or more of the media nodes that are to form a virtual media network with the gateway media node. The first media node serves an output device in an operating system interface while acting as the gateway media node.
One embodiment includes a method comprising the following. A first media signal is received at a first network media node from a media source device using a first network protocol. A command signal for the first media signal is received at the first network media node from the media source device using a second network protocol. The command signal specifies other network media nodes to receive the first media signal and commands for rendering the first media signal. The first media signal is broadcast to the other network media nodes using the second network protocol. The commands are sent to the other network media nodes using the second network protocol.
One embodiment includes a method comprising the following. Media is injected into a network from a media source device. The network including a plurality of media nodes. A first of the media nodes is selected to serve as a gateway for the network based on its status as an active output device for the media source device. Media distribution is controlled at the first media node, including re-broadcasting the media from the first media node to media nodes that are actively rendering the media, and maintaining precise timing synchronization of rendering the media at the media nodes.
The foregoing detailed description has been presented for purposes of illustration and description. It is not intended to be exhaustive or to limit embodiments to the precise form disclosed. Many modifications and variations are possible in light of the above teaching. The described embodiments were chosen in order to best explain the principles and practical applications to thereby enable others skilled in the art to best utilize the various embodiments and with various modifications as are suited to the particular use contemplated. It is intended that the scope be defined by the claims appended hereto.
This application claims the benefit of U.S. Provisional Application No. 61/405,835, entitled “Media Distribution Architecture,” by Lau et al., filed on Oct. 22, 2010, incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
61405835 | Oct 2010 | US |