Field
The following relates, in one aspect, to systems and methods for transferring metadata from one device to another over standard audio/video (AV) interfaces.
Related Art
Transmission of AV data from a source device to a sink device is typically implemented according to a pre-defined standard. Providing a standard for communication of such data allows source and sink devices to be built to the standard by different entities and have an expectation that those devices will be able to be used together.
For example, High Definition Multimedia Interface (HDMI) provides a standard that is widely adopted and used in devices for AV data communication. HDMI implements EIA/CEA standard video formats and waveforms. Digital video signals can be broken into three main parts: the vertical blanking region, the horizontal blanking region, and the active picture region. The vertical blanking region contains the vertical sync pulse that signifies a new field or frame (field and frame are used interchangeably herein, unless explicitly indicated otherwise). There are multiple lines of video data in each field.
For example, a Full High-Definition (HD) 1080p/60 Hz video stream has a vertical blanking interval every 16.67 ms (60 Hz) to signify a new frame and a horizontal blanking interval every 14.8 μs to signify a new horizontal line. Within the 14.8 μs horizontal line duration there are 2,200 pixels, which include the horizontal blanking period and the active picture. Generating the video frame composed of the three regions therefore requires a pixel clock of 148.5 MHz (2,200 pixels divided by the 14.8 μs horizontal line duration).
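For illustration, the timing arithmetic above can be reproduced with a short calculation (a minimal sketch; the totals of 2,200 pixels per line and 1,125 lines per frame are the standard totals for this format):

```python
# Illustrative sketch of the 1080p/60 timing arithmetic described above.
# Totals for this format: 2,200 pixels per line, 1,125 lines per frame.

FRAME_RATE_HZ = 60
TOTAL_PIXELS_PER_LINE = 2200   # active picture + horizontal blanking
TOTAL_LINES_PER_FRAME = 1125   # active picture + vertical blanking

frame_period_s = 1 / FRAME_RATE_HZ                      # ~16.67 ms between vertical blanking intervals
line_period_s = frame_period_s / TOTAL_LINES_PER_FRAME  # ~14.8 us between horizontal blanking intervals
pixel_clock_hz = TOTAL_PIXELS_PER_LINE / line_period_s  # 2,200 pixels / 14.8 us = 148.5 MHz

print(f"frame period : {frame_period_s * 1e3:.2f} ms")
print(f"line period  : {line_period_s * 1e6:.2f} us")
print(f"pixel clock  : {pixel_clock_hz / 1e6:.1f} MHz")
```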
The active picture region is the location of the pixels that a user would see on the display. The pixel color value is stored in three channels that are typically RGB- or YUV (YPbPr)-encoded. The color levels can be encoded in a defined number of bits per channel, such as 8 to 16 bits per channel, meaning that each pixel has a 24- to 48-bit color value. A digital monitor communicates Extended Display Identification Data (EDID) to a source in order to describe which video formats and timings the monitor supports.
Information about a current video format is communicated from a source to a sink. In HDMI, a video format is indicated as a standard VIC code in an InfoFrame. An InfoFrame is a packet of data that is provided in a vertical blanking interval (i.e., in a period where the video signaling indicates that there is no active video data). Resolution can also be communicated via other mechanisms, such as a DCDB record (as in SuperMHL).
For example,
In one aspect, implementations of the disclosure provide a robust, end-to-end mechanism for signaling metadata from one device to another. Metadata is data about the main payload or other information that is used in formatting, processing, or interacting with the main payload. For example, metadata may include sampling formats.
In one implementation, the active video area is extended into regions that are not indicated as being active video data, such as the blanking area. In HDMI or MHL, a video data preamble is used to distinguish video data from other data (a portion of a frame transmission period during which data is encoded as video is called a “video period”). A video data preamble contains character sequence(s) that are not permitted in a video data period. Implementations of this disclosure encode non-video data into the video data period; at the physical layer, this non-video data is encoded and transmitted indistinguishably from video data.
In one implementation, character sequences indicative of a start of a video period may be provided earlier and/or later than would be required purely based on the timing of the video itself. These extra lines appear to HDMI/MHL receivers and transmitters as video data, and thus no hardware changes are needed in receivers (sinks), transmitters (sources), or intermediate devices such as bridge and switch nodes.
A difference between the standard timing (expected/determined according to the video mode) and the modified timing can be signaled using an existing source-to-sink communication method. In such an approach, to determine the timing of the actual video data, a sink has to know the VIC code and information about where and how many metadata lines are added.
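The following is a hypothetical sketch of how a sink might separate metadata lines from actual video lines once it knows the VIC-derived active line count and the signaled metadata line counts; the function and variable names are illustrative only:

```python
# Hypothetical sink-side sketch: given the VIC-derived active line count and
# the signaled counts of metadata lines added before and after the real
# picture, classify the lines received during the extended Video Data Period.

def classify_received_lines(active_lines_from_vic: int,
                            metadata_lines_before: int,
                            metadata_lines_after: int):
    """Return (metadata_line_indices, video_line_indices) within the extended
    Video Data Period seen by the receiver."""
    total = metadata_lines_before + active_lines_from_vic + metadata_lines_after
    video_start = metadata_lines_before
    video_end = metadata_lines_before + active_lines_from_vic
    metadata = list(range(0, video_start)) + list(range(video_end, total))
    video = list(range(video_start, video_end))
    return metadata, video

# Example: 1080 active lines per the VIC, with 2 metadata lines prepended.
meta, video = classify_received_lines(1080, 2, 0)
assert len(video) == 1080 and meta == [0, 1]
```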
In an aspect, forward error correction (FEC) can be added for the metadata only. In such an implementation, a CPU can perform the FEC processing in a software-implemented stack. FEC could instead be provided for all active lines, in which case FEC would likely be implemented with hardware acceleration. Other approaches to signaling a change to an active video period can be provided for different audio/video data transmission systems and approaches.
As shown in
With reference to
HDMI sink or repeater 150 contains a receiver 152 that is capable of operating the physical layer of HDMI and extracting the video and metadata. Extracted video 153 is transferred over a parallel bus 155 to a video processor (if a sink) or to an HDMI transmitter (if a repeater) 154. Metadata 160 is received by a CPU 156 and then transferred to the video processor/HDMI transmitter 154. As shown, transfer of metadata can involve passing through CPUs and interfaces, such as an Inter-Integrated Circuit (I2C) bus, which makes it difficult to pass metadata synchronously with video. In some cases, one or more repeaters may exist between a source and a sink. Each repeater between the source and the sink can increase the problem of de-synchronization. In
One advantage of implementations according to
Within a receiver, a Data Enable (DE) signal can be generated in response to detection of the character sequences indicative of the Video Data Period. A level of the DE signal can indicate whether data being transmitted is or is not video data. Thus, in one approach, the Video Data Period begins earlier and/or continues longer (and consequently, a DE signal would be brought high earlier and/or maintained high later while preserving other video timing) in order to extend what HDMI receiver 152 would identify as video data. This approach to metadata transmission does not need to take the place of data islands. Metadata could be transferred in data islands also, or data islands could be used for encoding or transmitting other data.
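As a rough illustration of extending the Video Data Period, the sketch below generates a per-line DE schedule in which DE is asserted earlier than the nominal active region; the specific line counts are examples and do not reflect any particular device or register model:

```python
# Illustrative transmitter-side sketch: build a per-line DE (Data Enable)
# schedule in which DE is asserted a few lines earlier than the nominal active
# region, so that metadata lines are carried as if they were video.

def de_schedule(total_lines: int, nominal_active_start: int,
                nominal_active_lines: int, extra_lines_before: int = 0,
                extra_lines_after: int = 0):
    """Return a list of booleans, one per line: True where DE is high."""
    start = nominal_active_start - extra_lines_before
    end = nominal_active_start + nominal_active_lines + extra_lines_after
    return [start <= line < end for line in range(total_lines)]

# Example: 1,125 total lines, active video nominally starting at line 41,
# 1,080 active lines, and 2 metadata lines prepended.
schedule = de_schedule(1125, 41, 1080, extra_lines_before=2)
assert sum(schedule) == 1082   # DE is now high for 1,082 lines instead of 1,080
```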
Particulars of how metadata can be encoded into the extended active video area can vary among implementations, although a standardized approach would allow increased inter-operability. In one approach, metadata can be encoded directly into the extended area (also called the “overscan” area, in that the extended “video” area is not intended for actual display), into Least Significant Bits, or at fixed locations, such as particular lines or fixed positions within lines in the extended area. An implementation can use a structure similar to data islands. A generic packet structure can be defined to carry various metadata payloads. To be clear, the approach to encoding metadata into the extended active video area is an implementation decision, in that transmitters and receivers will treat the physical transmission and reception of such metadata simply as video data.
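A minimal sketch of one such implementation choice follows, packing a generic packet (type, length, payload, checksum) into the least significant bits of pixels in the extended area; the packet layout and function names are invented for illustration and are not a standardized format:

```python
# Hypothetical packing sketch: place metadata bytes into the least significant
# bits of 24-bit RGB pixel values in the extended "overscan" lines.

import struct
import zlib

def make_metadata_packet(packet_type: int, payload: bytes) -> bytes:
    header = struct.pack(">BH", packet_type, len(payload))   # type + length
    crc = struct.pack(">I", zlib.crc32(header + payload))    # simple integrity check
    return header + payload + crc

def embed_in_lsbs(packet: bytes, pixels: list, bits_per_pixel: int = 2) -> list:
    """Spread the packet bits across the low bits of 24-bit pixel values."""
    bits = "".join(f"{byte:08b}" for byte in packet)
    assert len(bits) <= len(pixels) * bits_per_pixel, "not enough pixels in the line"
    out = list(pixels)
    clear_mask = 0xFFFFFF ^ ((1 << bits_per_pixel) - 1)
    for i in range(0, len(bits), bits_per_pixel):
        idx = i // bits_per_pixel
        out[idx] = (out[idx] & clear_mask) | int(bits[i:i + bits_per_pixel], 2)
    return out

# Example: embed a small packet into the first pixels of an extended line.
line = [0x808080] * 1920
modified = embed_in_lsbs(make_metadata_packet(0x01, b"\x00\x10"), line)
```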
In one implementation, one HDMI/MHL field in a sequence will be encoded with metadata for a subsequent-in-display-order (e.g., a next) field. Extracted metadata is kept in a buffer until a VSYNC pulse. On the pulse, the stored metadata is applied to configure processing for the next field or frame to be displayed (e.g., to determine resolution, color encoding, and so on).
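A minimal sketch of this buffer-until-VSYNC behavior, with illustrative class and method names, could look like the following:

```python
# Minimal sketch of the "apply on VSYNC" behavior described above: metadata
# extracted during one field is held in a buffer and only applied to the
# processing configuration when the next VSYNC pulse arrives.

class MetadataApplier:
    def __init__(self):
        self._pending = None     # metadata extracted from the current field
        self._active = None      # configuration in effect for the displayed field

    def on_metadata_extracted(self, metadata: dict):
        self._pending = metadata

    def on_vsync(self):
        # The buffered metadata configures processing for the next field/frame
        # (e.g., resolution, color encoding, HDR parameters).
        if self._pending is not None:
            self._active = self._pending
            self._pending = None
        return self._active
```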
Some existing repeaters may need a software or firmware update to allow the capabilities to propagate from sink to source and to tolerate the additional “video” lines.
In one approach, a standard Video Identification Code (VIC), provided in an AVI InfoFrame (for HDMI) or in a Device Connection Database (DCDB) (for MHL), indicates video resolution (i.e., the active video area), which indicates the number of video data lines being transmitted for each field. A Vendor Specific InfoFrame (VSIF) (for HDMI) or the DCDB (for MHL) is used to indicate modified video timing due to encoding of metadata as active video data.
A sink can indicate capability to support metadata encoding according to the disclosure to a source by transmitting a capability indication through a control channel, for example, via the Status and Control Data Channel (SCDC) over the display data channel for HDMI, or via the DCDB over the control bus for MHL. In one approach, the sink's support for receiving metadata as active video, as well as its support for each separate capability, such as Dynamic HDR, is indicated separately.
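Purely for illustration, and assuming hypothetical register offsets and bit assignments that are not defined by any current specification, a capability byte could be composed as follows:

```python
# Hypothetical sketch of a sink advertising support for metadata-as-video and
# for individual sub-capabilities. The offset and bit assignments below are
# invented for illustration; actual SCDC/DCDB fields would be defined by the
# relevant specification.

METADATA_CAP_OFFSET = 0xE0          # hypothetical offset in a capability block
BIT_METADATA_AS_VIDEO = 1 << 0      # sink can receive metadata encoded as active video
BIT_DYNAMIC_HDR       = 1 << 1      # sink supports the Dynamic HDR sub-structure
BIT_DSC_PPS           = 1 << 2      # sink supports DSC PPS metadata

def capability_byte(metadata_as_video: bool, dynamic_hdr: bool, dsc_pps: bool) -> int:
    value = 0
    if metadata_as_video:
        value |= BIT_METADATA_AS_VIDEO
    if dynamic_hdr:
        value |= BIT_DYNAMIC_HDR
    if dsc_pps:
        value |= BIT_DSC_PPS
    return value

# A source reads this byte over the control channel and only extends the video
# period with metadata when BIT_METADATA_AS_VIDEO is set.
```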
An example encoding format for metadata is a structure that allows embedding different sub-structures that communicate different kinds of metadata. Examples of metadata that can be provided from a source include metadata about High Dynamic Range (HDR) and a Display Stream Compression (DSC) Picture Parameter Set (PPS). The capability indications (SCDC for HDMI and DCDB for MHL) need to have a flag indicating the sink's ability to receive the metadata in video lines. Each sub-structure, within the general structure, can have a separate capability indication flag; for example, a sub-structure for Dynamic HDR can have a separate flag to indicate support for that capability.
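One possible (non-standardized) realization of such a container, with invented type codes for the Dynamic HDR and DSC PPS sub-structures, is sketched below:

```python
# Illustrative sketch of a container format that embeds multiple typed
# sub-structures (e.g., Dynamic HDR, DSC PPS) in a single metadata block.
# The type codes and layout are invented for illustration.

import struct

SUBTYPE_DYNAMIC_HDR = 0x01
SUBTYPE_DSC_PPS     = 0x02

def pack_substructures(subs: list) -> bytes:
    """subs: list of (subtype, payload_bytes). Each entry is length-prefixed."""
    out = bytearray()
    for subtype, payload in subs:
        out += struct.pack(">BH", subtype, len(payload)) + payload
    return bytes(out)

def unpack_substructures(blob: bytes) -> list:
    subs, offset = [], 0
    while offset < len(blob):
        subtype, length = struct.unpack_from(">BH", blob, offset)
        offset += 3
        subs.append((subtype, blob[offset:offset + length]))
        offset += length
    return subs
```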
An example data line structure can allow all pixels in a line to be used for metadata encoding, so the number of bits of metadata that can be stored per pixel and per line varies based on resolution and color depth. In each video field, the information in each metadata line can be a continuation of the information of the previous metadata line. In one approach, metadata encoding is not continued across boundaries of fields, but in another implementation, metadata can be continued from one field to another.
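The capacity implied by this approach can be estimated as in the following sketch (the formula simply multiplies pixels per line by the bits per pixel used for metadata):

```python
# Sketch of the capacity calculation implied above: the number of metadata
# bytes that fit in one extended "video" line depends on resolution (active
# pixels per line) and color depth (bits actually used for metadata).

def metadata_bytes_per_line(active_pixels_per_line: int,
                            bits_per_channel: int,
                            channels: int = 3) -> int:
    bits_per_pixel = bits_per_channel * channels
    return (active_pixels_per_line * bits_per_pixel) // 8

# 1080p with 8-bit RGB: 1,920 pixels * 24 bits = 5,760 bytes of metadata per line.
assert metadata_bytes_per_line(1920, 8) == 5760
```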
Forward Error Correction (FEC) can be used to provide error detection and/or error correction capability for the metadata. FEC could be implemented in software on a CPU for only the metadata. If FEC were used for the entire active video area (including metadata), then hardware support would be desirable.
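As a toy illustration of metadata-only FEC in software, the sketch below uses a 3× repetition code with a per-byte majority vote; a real implementation would use a stronger code such as Reed-Solomon, but the principle of CPU-side processing is the same:

```python
# Toy software FEC sketch for metadata only: a 3x repetition code with a
# bitwise majority vote across the three copies.

def fec_encode(metadata: bytes) -> bytes:
    return metadata * 3                      # transmit three copies

def fec_decode(received: bytes) -> bytes:
    n = len(received) // 3
    copies = (received[:n], received[n:2 * n], received[2 * n:3 * n])
    decoded = bytearray()
    for a, b, c in zip(*copies):
        # Majority vote per bit position across the three copies.
        decoded.append((a & b) | (a & c) | (b & c))
    return bytes(decoded)

# A single corrupted copy is corrected:
encoded = bytearray(fec_encode(b"\x42\x10"))
encoded[0] ^= 0xFF                            # corrupt one byte of the first copy
assert fec_decode(bytes(encoded)) == b"\x42\x10"
```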
The following figures present various examples of how embodiments according to the disclosure can be implemented.
This application claims priority from U.S. Provisional App. No. 62/075,554, filed on Nov. 5, 2014, which is incorporated by reference in its entirety for all purposes.
Number | Date | Country | |
---|---|---|---|
62075554 | Nov 2014 | US |