Maintaining synchronization between live streaming videos and additional sources of contextually-related data (e.g., on a companion device) has a number of issues; (note that what is a “live” broadcast can vary from provider to provider, e.g., there are different delays experienced by viewers having different broadcasters). One issue arises in that the user may want to pause/stop/rewind/fast forward, yet maintain the timeline synchronization between video and data.
For example, consider a viewer watching the live video of a professional martial arts-type/ultimate fighting event on a display driven by a gaming console or other entertainment device. The console or other entertainment device is capable of presenting statistical data about the event, e.g., from an external source that is separate from the video feed. However, as the viewer pauses, stops, rewinds and fast forwards so that the video is no longer live, the statistical data needs to be presented to the viewer in such a way that it does not give away even a hint as to the outcome, as doing so may ruin/spoil the experience for the viewer.
It is also desirable to link and synchronize the display of interactive input. This needs to be done in a way that allows different types of users to effectively have the same experience, that is, the experience may be the same for users viewing the “head” of the feed as well as users who joined the stream late and as a result are watching a delayed live stream.
This Summary is provided to introduce a selection of representative concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify key features or essential features of the claimed subject matter, nor is it intended to be used in any way that would limit the scope of the claimed subject matter.
Briefly, various aspects of the subject matter described herein are directed towards outputting data in synchronization with a outputting a video stream, in which the video stream and data are received from external sources that are separate from one another. This may include determining a location within the video stream, sending a request based on the location for information corresponding to the data, and receiving the information comprising the data or a reference to the data.
In one or more implementations, a data service obtains event-related data that is generated in conjunction with streaming video of an event, stamps the event-related data with a synchronization marker, and outputs synchronization-marked data that corresponds to the event-related data for access. A data encoding service communicates with the data service to obtain information corresponding to the event-related data and maintain a manifest based upon the information. The data encoding service also returns at least part of the manifest in response to a request for manifest content, in which the manifest content provides information by which the synchronization-marked data is accessible for use in synchronizing with streamed video content based upon one or more synchronization markers in the synchronization-marked data.
One or more aspects are directed towards synchronizing event-related data from one source with streaming video from another source, including marking the event-related data with synchronization markers that correspond to locations in the streaming video. Metadata that identifies the event-related data is provided to a consumer, which facilitates access to the event-related data marked with synchronization markers based upon information in the metadata.
The present invention is illustrated by way of example and not limited in the accompanying figures in which like reference numerals indicate similar elements and in which:
Various aspects of the technology described herein are generally directed towards combining streaming video that may be received in real time/near real time with separate, external data (e.g., contextually related data) in a way that maintains synchronization between the video feed and the external data. In one or more aspects, this includes linking live events to a video timeline in real time in a way that allows the synchronization to be maintained regardless of where in the video stream a user is currently viewing the video in (non-real) time, e.g., as a result of each viewer's pause/stop/rewind/fast forward operations.
The external data such as in the form of text (although graphics, animations and/or additional video may be generated from the data, as well as active data such as obtained via interactive fan voting) may be superimposed over the video stream, or output alongside, underneath, in a picture-in-picture type output and so forth. Further, the external data may be presented on a companion device, such as a smartphone, table, personal computer and/or the like that a viewer possesses along with the video playing device (e.g., gaming console) that is presenting the streaming video feed. For purposes of explanation, a single companion device is generally described herein, although it is understood that the data may be output on multiple companion devices and/or on the same display that is showing the streaming video feed.
It should be understood that any of the examples herein are non-limiting. For instance, while a gaming console is used in some examples, any computing device, such as a tablet, handheld device, smartphone, laptop computer, desktop computer and so on may be used with the technology described herein. Indeed, the use of a companion device along with a primary device is facilitated in one or more examples. In one aspect, the technology described herein may be based around the functionality on a gaming console (e.g., Xbox One™) application. The application functionality may be supported by various external services and inputs as exemplified herein. There are also operating modes that are based upon the feature needs of an application, such as demonstrated by fan voting examples, and a companion device (e.g., based upon SmartGlass™-related technology) experience. As such, the present invention is not limited to any particular embodiments, aspects, concepts, structures, functionalities or examples described herein. Rather, any of the embodiments, aspects, concepts, structures, functionalities or examples described herein are non-limiting, and the present invention may be used various ways that provide benefits and advantages in computers and video and data output in general.
The video encoder 106 encodes the video that it receives from the broadcast satellite (or other source) into an appropriate video format (e.g., one such format is based upon Smooth Streaming). This process also includes the start time (a reference time controlled by the system) of when the live video has begun encoding, as part of the video stream that the client receives (via a video manifest 326,
On the data side, certain events such as sports events, which in one example is a professional fight, have external data generated therewith such as statistics related to the event. For example, in a fight scenario, such external data may be a tally of the number of punches thrown in a round and/or in total thus far by each contestant. In general, this data is produced by an external provider, e.g., experts viewing the event in real time, typically assisted by computer programs or the like. Such data may be entered sporadically, such as every few seconds while the event is ongoing; during timeouts when no action is occurring there may not be any data, or there may be compilations of older data pushed, such as totals thus far, for example.
The provider thus pushes data updates 110 as they are entered in real time into a data store 112, which may be consumed by a component (of cloud services 114 in this example). In one or more implementations, a cloud services component referred to in
The data service 116 component also sends a message to a data encoding service 120 with the relevant details of the data, (e.g., blob URL, event ID, fight ID in a fight scenario, internal timing, and so forth). At this point, any time transformations (
With respect to time transformation, there may be a time lag in transmission/uploading the data feed relative to the video stream, such as due to transmission delays, and/or database writing and reading. Time transformations may be used to account for such differences. Note that because of satellite transmission delay, encoding and decoding delays and the like, a viewer of the streaming video watching the video “live” actually sees the video with some time lag, on the order of twenty or more seconds. Thus, any small delay in event data entry and uploading/processing/downloading is easily compensated for, whereby a user sees the event without any undesirable difference between the data (e.g., statistics) and the actual video.
As described herein, a video event handler component 226, such as part of feature logic 332 (
By way of an example, consider synchronized fight statistics for a live “fight” event currently taking place somewhere in the world. At the event there is a broadcasting crew that is filming/capturing the event for live feeds to be played over television, the internet, and other video-capable devices. At this event there is also a group of people, typically not part of the same organization that provides the streams, who provide detailed information about what is taking place (in a fight example, data reflecting punches thrown, takedowns, and so forth). This information is entered into a system as the event data 110 (
To be highly desirable to users, the technology described herein (e.g., for providing synchronized fight statistics) allows a user to join a live video and receive the statistics or other data up to that point in the video, even data the user has not previously seen. Similarly, a user may change the location in a video by rewinding, fast-forwarding, and so forth, and view the data corresponding to that time. For example, if ten punches are thrown at time X in the fight, and the user starts at time X or fast-forwards/rewinds to time X, the accompanying statistics will show ten punches thrown at that time.
To accomplish the synchronization, the broadcast-related data is accompanied by a manifest for that video (or set of videos). The system maintains a current state of the statistical information in each new chunk of the data manifest for that video (or videos). By way of example, in one or more implementations, every twenty seconds of video may be associated with a chunk in the manifest. Notwithstanding, it is understood that the chunk size/duration is arbitrary, and technically it is possible to return the entire manifest if desired. However at present, due to practical limitations on hardware (e.g., CPU) and service levels (e.g., bandwidth) in many environments, it is typically advantageous to break the data up into smaller pieces, that is, the chunks.
Turning to
In general, when the feature logic 332 has decided that a video event has occurred, the feature logic 332 sends a notification through the rest of the application, which gets handled by specific components depending on what type of event it is and what data it contains. When the feature logic 332 has detected that the video timeline has entered into a new time window (as defined by the synchronization-related technology described herein), represented in
As generally represented in
In this way, when a user joins the video, a first manifest chunk (or alternatively set of chunks) that the user receives has the latest statistics (or a reference thereto) relative to that point. As used herein, this mode of operation is referred to as continuous mode. Note that the entire manifest may be received rather than chunks thereof, in which event receiving and processing separate portions of the manifest are not needed; (however in many scenarios chunks are used, and thus chunks generally will be described hereinafter). To this end, on a video player (e.g., in the gaming console application), enough information is provided to use the reference time plus the duration of where the user is in the video to know which chunk of manifest content to request from the cloud. Once the data has been downloaded, the data contains a list of events for the current chunk as well as any current state events (e.g., continuous messages), with each event containing a time code, e.g., with millisecond granularity. In this way, although a chunk corresponds to a relatively large chunk of time, e.g., twenty seconds, each event may be synchronized to within a millisecond of the video to which it is associated.
Turning to additional details of one example implementation, a data encoding service (
The message format may be the same for all types of video events, with bits that change depending on the type of content being broadcast, such as the “Encoder” parameter, and the “Data” parameter. The application decides how to process them.
Once this time has passed, the event is triggered and processed by the gaming console client. Depending on the type of message (e.g., a fight data event), the gaming console notifies any attached companion devices with the URL(s) to the blobs that are relevant to the current video time. For example, one type of companion device that provides contextually relevant information to live streaming video being played on a primary device may be based upon SmartGlass™ technology; e.g., play-by-play information on SmartGlass™ may be synchronized to the gaming console video.
On the video player (e.g., in the gaming console application), enough information is provided to use the reference time plus the duration of where the user is in the video to know which chunk of data to request from the cloud. Once the data has been downloaded, the data contains a list of events for the current chunk as well as any current state events (e.g., continuous messages), with each event containing a time code, e.g., with millisecond granularity. Once this time has passed, the event is triggered and processed by the gaming console client. Depending on the type of message (e.g., a fight data event), the gaming console notifies any attached companion devices with the URL(s) to the blobs that are relevant to the current video time. The companion devices then fetch this data, process it, and display it on the device screen for the user to see.
In one or more implementations, the data in the data store 112 may be maintained separately from the video stream, instead of permanently interleaving the data into the video stream, (which is one feasible alternative). Maintaining the data provides a number of benefits, including that the data may be modifiable, selectable, interactive and so forth.
For example, consider that the data initially contained an error. The data may be modified to correct that error, whereby any download of the data thereafter no longer contains the error. Thus, data errors such as misspellings, data changes due to rulings (e.g., a referee determines later that a takedown did not count) and so forth may be corrected. The data remains synchronized, so later users get the benefit of such error corrections. Further, the saved data may be processed and presented to viewers in new ways, e.g., a new statistic or total may be generated from original data that was not available at the initial viewing time. Note that while in general it is desirable to present the data to delayed viewers as if they were watching the event live, such enhancements/corrections may be even more desirable in certain circumstances.
In addition to data provided by statisticians or the like viewing the event live, other data may be provided, including by viewers. For example, online social networking and microblogging services allow users to provide comments and the like as an event is ongoing. Such data may be maintained in conjunction with the video, synchronized with the corresponding time, for reading by viewers, for example. Such data may be interactive. For example, a viewer may interactively view whatever comments he or she desires, including with filtering by sender name or topic; (in contrast, such comments shown during live television are generally selected by a television editor or the like and simply scroll below the main display).
By way of summary,
Step 508 sends the notification to the data encoding service, which updates the manifest. The process repeats via step 510 until the video feed is done, e.g., at the end of the broadcast event.
At step 608, the data corresponding to the current chunk may be obtained (e.g., the blob accessed). Step 610 matches the data via the timestamp to the current video location. This may be on the video player device and/or the companion device(s). If the data has changed (step 612), then it is updated at step 614, e.g., presented to the user in conjunction with the current video location. Otherwise the previous data, if any, may remain presented for viewing (although it may be cleared after a time duration if unchanged too long).
Via step 616, the process continues as long as the user remains in play mode, e.g., the user does not stop/pause the video, the video has not ended, the location is not changed via fast forward (FF) or rewind (RW), and so on. If continuing in play mode, step 618 represents advancing the video/timeline.
Step 620 represents checking whether the video timeline has entered into a new time window, e.g., the next chunk in the manifest. If so, a new data chunk is requested from the data encoding service manifest, e.g., by returning to step 604. If not, the process returns to step 608 to see if updated data is associated with the new video location/timeline.
Note that if the video exits play mode at step 616, the steps of
Turning to another aspect, namely a fan voting example (referred to herein as occurring as part of “discreet mode”), consider that similar to the synchronized fight statistics example, there is a video that is being encoded from a live broadcast. However, instead of synchronizing live statistical data, the technology overlays a voting screen onto the broadcast/outputs voting to a companion device that may be in response to what has just happened in the video, or what is about to happen (e.g., a poll taken before a fight starts). A general goal is to avoid a scenario where some other important information is covered up, or that users are bothered with a vote that has nothing to do with what they are now watching.
Unlike the synchronized fight statistics example, in most instances a voting question only is asked occasionally. Thus, the data service is configured to differentiate between continuous mode data that is output for access to synchronize with video played at any time and discreet mode data that is output only occasionally. Data entry with respect to the discreet mode encodes data only at the time that has been specified; if a user joins a live video after that time, the user does not receive any previous discreet mode data. However, in one or more implementations, if the user rewinds, for example, any previous discreet mode data is received at the same place in the video as if the user was watching the event live/in real time or near real time. Similarly, if the user pauses and then later starts playing the video, any discreet mode data is received at the appropriate place in the video.
As can be seen, one or more aspects are directed towards outputting data in synchronization with a outputting a video stream, in which the video stream and data are received from external sources that are separate from one another. This may include determining a location within the video stream, sending a request based on the location for information corresponding to the data, and receiving the information comprising the data or a reference to the data. In one or more implementations, outputting the data comprises displaying a representation of the data on at least one companion device, and/or displaying a representation of the data on a device that also plays video corresponding to the video stream.
The information may be received as a chunk of a manifest, in which the chunk is based upon the location. The data for the current location may be based upon information in the chunk, e.g., accessing the data may comprise retrieving a data blob via a URL of identified in the chunk; (note that instead of a reference to the data, the actual data itself may be provided). The data may be output based upon a synchronization marker associated with the data and the location within the video stream, e.g., the data may be timestamped with the synchronization marker.
In one or more implementations, a data service obtains event-related data that is generated in conjunction with streaming video of an event, stamps the event-related data with a synchronization marker, and outputs synchronization-marked data that corresponds to the event-related data for access. A data encoding service communicates with the data service to obtain information corresponding to the event-related data and maintain a manifest based upon the information. The data encoding service also returns at least part of the manifest in response to a request for manifest content, in which the manifest content provides information by which the synchronization-marked data is accessible for use in synchronizing with streamed video content based upon one or more synchronization markers in the synchronization-marked data. The synchronization-marked data may be contained within one or more data blobs, with at least one data blob accessible via a URL identified in the manifest content.
A video playing device may communicate with the data encoding service to request the manifest content in conjunction with streamed video played by the video playing device. A companion device may communicate with the video playing device, including to receive information by which the synchronization-marked data is accessible, whereby the companion device downloads the synchronization-marked data and displays a representation of the synchronization-marked data.
One or more aspects are directed towards synchronizing event-related data from one source with streaming video from another source, including marking the event-related data with synchronization markers that correspond to locations in the streaming video. Metadata that identifies the event-related data is provided to a consumer, which facilitates access to the event-related data marked with synchronization markers based upon information in the metadata. Some event-related data may be output for synchronizing with a substantially live video stream, but not configured for synchronizing with a video stream that is not substantially live.
It can be readily appreciated that the above-described implementation and its alternatives may be implemented on any suitable computing device, including a gaming system, personal computer, tablet, DVR, set-top box, smart television, smartphone and/or the like. Combinations of such devices are also feasible when multiple such devices are linked together. For purposes of description, a gaming (including media) system is described as one exemplary operating environment hereinafter.
The CPU 702, the memory controller 703, and various memory devices are interconnected via one or more buses (not shown). The details of the bus that is used in this implementation are not particularly relevant to understanding the subject matter of interest being discussed herein. However, it will be understood that such a bus may include one or more of serial and parallel buses, a memory bus, a peripheral bus, and a processor or local bus, using any of a variety of bus architectures. By way of example, such architectures can include an Industry Standard Architecture (ISA) bus, a Micro Channel Architecture (MCA) bus, an Enhanced ISA (EISA) bus, a Video Electronics Standards Association (VESA) local bus, and a Peripheral Component Interconnects (PCI) bus also known as a Mezzanine bus.
In one implementation, the CPU 702, the memory controller 703, the ROM 704, and the RAM 706 are integrated onto a common module 714. In this implementation, the ROM 704 is configured as a flash ROM that is connected to the memory controller 703 via a Peripheral Component Interconnect (PCI) bus or the like and a ROM bus or the like (neither of which are shown). The RAM 706 may be configured as multiple Double Data Rate Synchronous Dynamic RAM (DDR SDRAM) modules that are independently controlled by the memory controller 703 via separate buses (not shown). The hard disk drive 708 and the portable media drive 709 are shown connected to the memory controller 703 via the PCI bus and an AT Attachment (ATA) bus 716. However, in other implementations, dedicated data bus structures of different types can also be applied in the alternative.
A three-dimensional graphics processing unit 720 and a video encoder 722 form a video processing pipeline for high speed and high resolution (e.g., High Definition) graphics processing. Data are carried from the graphics processing unit 720 to the video encoder 722 via a digital video bus (not shown). An audio processing unit 724 and an audio codec (coder/decoder) 726 form a corresponding audio processing pipeline for multi-channel audio processing of various digital audio formats. Audio data are carried between the audio processing unit 724 and the audio codec 726 via a communication link (not shown). The video and audio processing pipelines output data to an A/V (audio/video) port 728 for transmission to a television or other display/speakers. In the illustrated implementation, the video and audio processing components 720, 722, 724, 726 and 728 are mounted on the module 714.
In the example implementation depicted in
Memory units (MUs) 750(1) and 750(2) are illustrated as being connectable to MU ports “A” 752(1) and “B” 752(2), respectively. Each MU 750 offers additional storage on which games, game parameters, and other data may be stored. In some implementations, the other data can include one or more of a digital game component, an executable gaming application, an instruction set for expanding a gaming application, and a media file. When inserted into the console 701, each MU 750 can be accessed by the memory controller 703.
A system power supply module 754 provides power to the components of the gaming system 700. A fan 756 cools the circuitry within the console 701.
An application 760 comprising machine instructions is typically stored on the hard disk drive 708. When the console 701 is powered on, various portions of the application 760 are loaded into the RAM 706, and/or the caches 710 and 712, for execution on the CPU 702. In general, the application 760 can include one or more program modules for performing various display functions, such as controlling dialog screens for presentation on a display (e.g., high definition monitor), controlling transactions based on user inputs and controlling data transmission and reception between the console 701 and externally connected devices.
The gaming system 700 may be operated as a standalone system by connecting the system to high definition monitor, a television, a video projector, or other display device. In this standalone mode, the gaming system 700 enables one or more players to play games, or enjoy digital media, e.g., by watching movies, or listening to music. However, with the integration of broadband connectivity made available through the network interface 732, gaming system 700 may further be operated as a participating component in a larger network gaming community or system.
While the invention is susceptible to various modifications and alternative constructions, certain illustrated embodiments thereof are shown in the drawings and have been described above in detail. It should be understood, however, that there is no intention to limit the invention to the specific forms disclosed, but on the contrary, the intention is to cover all modifications, alternative constructions, and equivalents falling within the spirit and scope of the invention.
This application is a continuation of and claims priority to U.S. application Ser. No. 14/170,586, entitled “SYNCHRONIZING EXTERNAL DATA TO VIDEO PLAYBACK,” filed Feb. 1, 2014, which claims priority to U.S. provisional patent application Ser. No. 61/816,672, filed Apr. 26, 2013, which are incorporated herein in their entirety.
Number | Date | Country | |
---|---|---|---|
61816672 | Apr 2013 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 14170586 | Feb 2014 | US |
Child | 15488335 | US |