Embodiments of the invention generally relate to the field of multimedia processing and, more particularly, conversion and processing of deep color video in a single clock domain.
In the processing and presentation of video data, there are numerous standards providing varying levels of color accuracy. High-definition video provides for greater density of colors and enhanced color accuracy. For example, 24-bit color is referred to as “truecolor”, and provides 16.7 million colors. “Deep color” refers to a gamut comprising more than 16.7 million colors, and is generally 30-bit or greater (normally 30, 36, and 48-bit color).
However, the native format of deep color video data may be difficult to process directly. Therefore, color depth conversion for deep color is commonly performed before and after processing deep color video. Conventional color depth conversion methods need to generate a local clock domain, referred to as a “pixel clock”, by using a phase locked loop (PLL). The use of a phase loop creates certain manufacturing and development costs, such as chip area requirements, power consumption, and circuit design/verification efforts.
Embodiments of the invention are illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings in which like reference numerals refer to similar elements.
Embodiments of the invention are generally directed to conversion and processing of deep color video in a single clock domain.
In a first aspect of the invention, a method includes receiving one or more video data streams, the one or more video data streams including a first video data stream, the first video data stream having a first color depth and being clocked at a frequency of a link clock signal. The method further includes converting the first video data stream into a converted video data stream having a modified data format, wherein the modified data format includes transfer of a single pixel of data in one cycle of the link clock signal and the insertion of null data to fill empty cycles of the converted video data stream, and generation of a valid data signal to distinguish between valid video data and the null data in the converted video data stream. The method further includes processing the converted video data stream according to the frequency of the link clock signal to generate a processed data stream from the converted video data stream, wherein processing includes using the valid data signal to identify valid video data.
In a second aspect of the invention, an apparatus includes a port for reception of a first video data stream, the first video data stream having a first color depth and being clocked at a link clock frequency. The apparatus further includes a conversion element, the conversion element to convert the first video data stream into a converted video data stream having a modified data format, wherein the modified data format includes transfer of a single pixel of data in one cycle of the link clock signal and the insertion of null data to fill empty cycles of the converted video data stream, and wherein the conversion element generates a valid data signal to distinguish between valid video data and the null data. The apparatus further includes a processing element to generate a processed data stream from the converted data stream, the processing element to process the converted video data stream according to the frequency of the link clock signal.
Embodiments of the invention are generally directed to conversion and processing of deep color video in a single clock domain.
In some embodiments, a method, apparatus, or system provides for the processing of deep color video in a single link clock domain, without generation of a local clock, or pixel clock, domain. In some embodiments, a method, apparatus, or system operates without requiring use of phase lock loop circuitry to generate a pixel clock.
There are several different color representations varying in the required bit width (or color depth) to store the color data of a pixel. In the 24-bit per pixel (bpp) representation of true color, color values for each pixel are encoded in a 24-bit per pixel fashion, where an 8-bit unsigned integer (with values 0 through 255) represents each of the intensities of red, green, and blue. This representation is the most common color interchange format in image file and video formats.
In contrast, deep color is a term that refers to the more enhanced representation of color than 24-bit true color representation. Deep color expands the colors on the display from millions to billions, which provides more vividness and color accuracy. For deep color, there are commonly used 30-, 36-, and 48-bit per pixel (bpp) deep color representations. In the 30-bit color representation, colors are stored in three 10-bits channels, resulting in 30 bits of color data per pixel. In 48-bit color representation, high-precision colors are stored in three 16-bit channels, resulting in 48 bits of color data per pixel.
In a conventional system, color depth conversion is commonly performed before and after processing deep color video, the local clock, or pixel clock, domain being generated using phase locked loop circuitry. In some embodiments, the conversion and processing of deep color video is accomplished in a single clock domain, utilizing a link clock domain. In some embodiments, conversion to and from deep color video and the processing of video data is accomplished in a link clock domain without requiring use of phase lock circuitry to generate a pixel clock domain. In some embodiments, a method, apparatus, or system converts received video data (which may be referred to herein as “dense video data” to indicate that such data contains video data without insertion of null data) to a modified “sparse video data” format, where sparse video data is video data that has been converted such that a pixel is transferred in one cycle of a link clock signal and such that null data is inserted to fill empty cycles of the link clock signal.
In some embodiments, a method, apparatus, or system is provided in a multimedia system such as an HDMI™ (High-Definition Multimedia Link) or MHL™ (Mobile High-Definition Link) system. However, embodiments are not limited to these link formats.
In some embodiments, the apparatus or system includes other elements for the handling of the video data, including a receiver 110 for the reception of data, a memory 115 to buffer data as needed for processing and display, and a display element 120 for the display of processed video data.
For example, in the case of 36 bpp 210, the link clock frequency is 1.5 times higher than that of 24 bpp. For the video data path, the first 8-bit data of pixel 0 is transferred at the first link clock cycle and then the remaining 4-bit data of pixel 0 and the first 4-bit data of pixel 1 are packed together and transferred at the second link clock cycle.
For video data manipulation, there may be difficulties in providing an interface because the boundary between pixels in the data channel varies according to the time of sampling and the mode of deep color. In order to address this issue, conventional video processors convert a deep color interface (which is synchronized with a link clock signal) into a pixel clock domain in order to simplify next-stage video processing by a video processing core. The function of a video processing core stage depends on the main function of the system and may be any video processing task, such as picture in picture (PiP) processing, image enhancement, on-screen display (OSD), and others. After finishing the video processing, the output interface is conventionally converted back to the original link clock domain.
A PLL module 325 including phase lock loop circuitry is used to decrease the frequency of link clock signal 320 and generate the pixel clock signal 328, where the pixel clock rate is defined by the ratio of the pixel size to 24 bits. In this illustration, deep color video data source side video data bus 330 (illustrated as having three 8-bit data lines) is converted to provide video data to a video processing core 310 in a format to simplify video processing.
After completion of video processing by the video processing core 310, the processed data is transferred via video data bus 340 to a color depth conversion (pixel to link) module 315, which operates to pack the pixel-clock-domain deep color video and generate a link-clock-domain interface on a sink side video data bus 345 to provide compatibility with a sink device interface.
Phase locked loop (PLL) circuitry is a circuitry that generates an output clock whose phase is related to the phase of an input reference clock signal. PLL is also used to synthesize a local clock with lower or higher frequency than the input reference clock. For conventional color depth conversion, PLL circuitry is used to generate a pixel clock signal with the desired frequency rate in relation to the input link clock signal.
However, PLL blocks pose design and verification challenges on most high-speed chips. Additionally, the cost of implementation of a PLL is significant. PLL blocks require large on-chip area and consume large amounts of power.
In some embodiments, a method, apparatus, or system provides for color conversion of deep color video data using a single clock domain, the link clock domain 350, and thus eliminates the need for the PLL module in generating clocking for the pixel clock domain 355.
In this illustration, video data is received at a port on a source video data bus 530 from a source device, together with a link clock signal 520 and sync and control signals 522, the sync and control signals being transmitted between modules. In some embodiments, rather than generating a pixel clock signal, sparse video data is introduced on the data bus 535 by a color depth conversion module or element 505 in order to maintain the bandwidth of the deep color video data from a source. In some embodiments, a color depth conversion (dense to sparse) module 505 unpacks a link-clock-domain deep color video data stream, and generates a sparse video data interface in which pixels are transferred at the rate of one pixel per link clock cycle.
In some embodiments, a video processing core module or element 510 receives the sparse video data on the data bus 535 without modification of the clock frequency. In some embodiments, the video processing core module 510 receives the link clock signal 520, even though the data bit width has been increased. Therefore, the total data bandwidth of a sparse video data bus 535 is greater than the bandwidth of the source video data bus 530 receiving the video data. In some embodiments, null data is stuffed onto the sparse video data bus 535 according to the color depth conversion ratio of the color depth conversion module 505, the conversion ratio being the ratio between the pixel size of the video data and the bit width of the received video data. In some embodiments, a valid data signal 560 is turned off by the color depth conversion module 505 during periods when the video data has an interval with null data to identify video data and inserted null data.
In some embodiments, the video processing core module 510 utilizes the valid data signal 560 to distinguish between video data and inserted null data, and processes only the valid data. In some embodiments, the video processing core module 510 provides the processed video data via a sparse video data bus 540, together with a valid data signal 562 to identify processed video data and inserted null data.
In some embodiments, an additional color depth conversion (sparse to dense) module or element 515 receives the processed sparse video data and, utilizing the valid data signal 562 to distinguish between valid and null data, converts the processed sparse video data to dense video data to present on a sink side dense video data bus 545 in a format compatible with a sink device, such as a television or other presentation device.
In some embodiments, the video processing core module 510 includes control logic to detect a valid data signal, and utilizes such signal to sample only the valid portions of the sparse video data. In some embodiments, the overhead in providing such logic small when it is compared to PLL development and manufacturing costs such as chip area, power consumption, circuit design, and verification effort.
After completing video processing, the video processing core module 510 provides the converted video data via sparse video bus 540 to the color depth conversion (sparse to dense) module 515, which packs the sparse video data for transfer via the sink side dense video data bus 545, with the timing then returning to the format of the received data, as shown in the video data timing for dense video data (sink side) 685.
Thus, for input ports, 8-bit video data 750 is received in every link clock cycle and a total of 24 bits of data is received for three link clock cycles. For output ports, 24-bit sparse video data is transmitted via a 12-bit sparse video data output bus 710 for two link clock cycles (0 and 1) and null 12-bit data 752 is transmitted for the other cycle (phase 2). In some embodiments, the 0 and 1 phases (i.e., phases having a value that is less than 2) are detected by an element 732 that generates a valid data signal 714, such that the valid data signal 714 is disabled when the null data is presented on the sparse data output bus 710.
In some embodiments, valid data is received in phases 0 and 1, where latches 820 (holding 11 bits of a signal for a clock cycle) and 822 (to provide 8 bits of a current signal in phase 0, four bits of a delayed signal and four bits of a current signal in phase 1, and 8 bits of a current signal in phase 2). At phase 2, null data is received at the sparse video data port, but the data stored at latch 820 is used to generate the video data output in the phase. Thus, the null data contained in the sparse video data 810 is eliminated and is not included in the video data output 850, and the data is returned to dense video data form.
As shown, the main video is provided to video mixing 1050 in a main video clock domain 1070. In order to mix the main video with the sub video, the sub video will be required to be in the same clock domain. In this illustration, the sub video is received in the sub video clock domain 1072. The sub video data is received by an upper color depth converter 1030, which receives color depth information for the sub video. In a conventional apparatus or system, the upper color depth converter 1030 converts the format of the sub video into a pixel clock domain 1074 for ease of processing, such as down sampling and buffering 1032 in this example. A PLL module 1036 is used to generate a pixel clock signal from a link clock signal received with the sub video.
After completion of down sampling and buffering 1032, a lower color depth converter 1034, which has received color depth information for the main video, converts the format of the sub video into the same format as the main video for compatibility before merging with the main video by the video mixing 1050. The resulting video output 1060 is a PiP display composed of the main video and the sub video superimposed on top of the main video.
However, the chip size and power overhead required for PLL circuitry in a conventional apparatus or system creates cost and added complexity in manufacture. In addition, the PiP processing system requires three clock domains, the main clock domain 1070, the sub video link clock domain 1072, and the sub video pixel clock domain 1074, within the system. The use of multiple clock domains generally creates difficult logic design and verification issues. For simplicity in illustration,
In some embodiments, processing of PiP data may instead be provided utilizing a single domain channel for the processing of video data, where an apparatus or system may operate without requiring use of a PLL for the generation of a local pixel clock.
In some embodiments, the upper color depth converter 1130 converts the format of the sub video into sparse video format, as shown in, for example,
In some embodiments, the sparse video data and the valid data signal is received at a video processing core or element 1208, where the valid data is separated and processed 1210, where the separation of the valid video data is based on the received valid data signal. In some embodiments, the video processing core or element outputs processed sparse video data and the valid data signal 1212.
In some embodiments, the processed sparse video data is converted to dense video data, including use of the valid data signal to distinguish and eliminate the null data 1214, and the converted video data is presented as an output 1216. In some embodiments, the depth of the resulting processed video data is the same as the input data, and in other embodiments, the depth of the processed video data is different from the depth of the input data, such as when the processed video data needs to match the depth of another video signal.
In some embodiments, multiple video inputs are received 1302, where the video inputs may include varying color depths. A first video input is selected as a main video is selected as a sub video 1304. For simplicity of explanation, only a single sub video is described, but embodiments are not limited to the conversion and processing of any particular number of sub video data streams. In this example, the main video may have a first color depth and the second video may have a second color depth that may be different from the first color depth. In some embodiments, the main video is received in a main video clock domain and the second video is received in a sub video link clock domain 1306.
In some embodiments, the sub video is converted to a sparse video data format for the processing of the sub video data, where the conversion includes insertion of null data into the sub video data stream 1308. The video data timing may be, for example, as illustrated in
In some embodiments, the sparse video data and valid data signal are received at a video processing core or element 1312. The valid video data is separated from the sparse video data stream based on the valid data signal, and the valid video data is processed, including, for example, down sampling and buffering of the sub video 1314. In some embodiments, the processed sparse video data and valid video data signal are output from the video processing core or element 1316.
In some embodiments, the processed sparse video data is converted to dense video data, where the conversion includes use of the valid data signal to eliminate the null data, and where the conversion converts the video data to match the format of the main video 1318. The main video and the sub video are mixed 1320, resulting in the output of a PiP display 1322 containing the main video and the sub video in an inset window superimposed above the main video.
In the description above, for the purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the present invention. It will be apparent, however, to one skilled in the art that the present invention may be practiced without some of these specific details. In other instances, well-known structures and devices are shown in block diagram form. There may be intermediate structure between illustrated components. The components described or illustrated herein may have additional inputs or outputs that are not illustrated or described. The illustrated elements or components may also be arranged in different arrangements or orders, including the reordering of any fields or the modification of field sizes.
The present invention may include various processes. The processes of the present invention may be performed by hardware components or may be embodied in computer-readable instructions, which may be used to cause a general purpose or special purpose processor or logic circuits programmed with the instructions to perform the processes. Alternatively, the processes may be performed by a combination of hardware and software.
Portions of the present invention may be provided as a computer program product, which may include a computer-readable storage medium having stored thereon computer program instructions, which may be used to program a computer (or other electronic devices) to perform a process according to the present invention. The computer-readable storage medium may include, but is not limited to, floppy diskettes, optical disks, CD-ROMs (compact disk read-only memory), and magneto-optical disks, ROMs (read-only memory), RAMs (random access memory), EPROMs (erasable programmable read-only memory), EEPROMs (electrically erasable programmable read-only memory), magnet or optical cards, flash memory, or other type of media/computer-readable medium suitable for storing electronic instructions. Moreover, the present invention may also be downloaded as a computer program product, wherein the program may be transferred from a remote computer to a requesting computer.
Many of the methods are described in their most basic form, but processes may be added to or deleted from any of the methods and information may be added or subtracted from any of the described messages without departing from the basic scope of the present invention. It will be apparent to those skilled in the art that many further modifications and adaptations may be made. The particular embodiments are not provided to limit the invention but to illustrate it.
If it is said that an element “A” is coupled to or with element “B,” element A may be directly coupled to element B or be indirectly coupled through, for example, element C. When the specification states that a component, feature, structure, process, or characteristic A “causes” a component, feature, structure, process, or characteristic B, it means that “A” is at least a partial cause of “B” but that there may also be at least one other component, feature, structure, process, or characteristic that assists in causing “B.” If the specification indicates that a component, feature, structure, process, or characteristic “may”, “might”, or “could” be included, that particular component, feature, structure, process, or characteristic is not required to be included. If the specification refers to “a” or “an” element, this does not mean there is only one of the described elements.
An embodiment is an implementation or example of the invention. Reference in the specification to “an embodiment,” “one embodiment,” “some embodiments,” or “other embodiments” means that a particular feature, structure, or characteristic described in connection with the embodiments is included in at least some embodiments, but not necessarily all embodiments. The various appearances of “an embodiment,” “one embodiment,” or “some embodiments” are not necessarily all referring to the same embodiments. It should be appreciated that in the foregoing description of exemplary embodiments of the invention, various features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof for the purpose of streamlining the disclosure and aiding in the understanding of one or more of the various inventive aspects.
This application is related to and claims priority to U.S. Provisional Patent Application No. 61/436,019, filed Jan. 25, 2011, and such application is incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
61436019 | Jan 2011 | US |