The field of invention relates generally to signal processing; and, more specifically, digital video stream routing and format conversion unit with audio delay.
FIGS. 2a through 2c relate to differing aspects of the manner in which a video sample stream may be formatted. Specifically,
FIG. 2a shows three different video sample encoding formats 201, 202, 203. Each of the video sample streams 201, 202, 203 shows its corresponding encoding approach across seven samples 200 (S1 through S7). Video sample stream 201 corresponds to a “trichromatic” encoding approach. Both of video sample streams 202 and 203 correspond to “color difference” encoding approaches.
According to a “trichromatic” encoding approach, each video sample contains three color components where the mixing of any two color components cannot produce the third color component. Video sample stream 201 provides an example of the most commonly used trichromatic approach, “RGB”. RGB breaks down each sample into red (R), green (G) and blue (B) components. Thus, the first sample S1 in video sample stream 201 corresponds to the red, green and blue components (R1G1B1) of the first sample's visual information, the second sample S2 in video sample stream 201 corresponds to the red, green and blue components (R2G2B2) of the second sample's visual information, etc.
Both of video sample streams 202 and 203 employ a “color difference” encoding scheme. Color difference schemes use a luminance related quantity called luma (e.g., Y) and a pair of color difference components called chroma (e.g., Cb,Cr). Color difference schemes are often characterized by the sample rate relationship as between the luma and chroma components. According to a first type, referred to as 4:4:4, the luma and chroma components have the same sample rate. Thus, as seen in the 4:4:4 color difference sample stream 202, both luma and chroma components exist for each sample in the sample stream. According to a second type, referred to as 4:2:2, the chroma components have half the sample rate as the luma component. Thus, as seen in the 4:2:2 color difference sample stream 203, each sample has a luma component but only every other sample has chroma components.
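For illustration only, the following minimal C sketch converts one line of 4:4:4 samples into 4:2:2 in the manner described above, by sharing one chroma pair across each pair of pixels. The interleaved byte layouts and the simple two-tap chroma average are assumptions made for the example, not requirements of the formats themselves.

```c
#include <stdint.h>

/* Convert one line of 4:4:4 YCbCr (Y,Cb,Cr per pixel) into 4:2:2
 * (Y per pixel; Cb,Cr shared by each horizontal pixel pair).
 * 'width' is assumed even. A real converter would typically use a
 * longer chroma filter than this simple average. */
void line_444_to_422(const uint8_t *in, uint8_t *out, int width)
{
    for (int x = 0; x < width; x += 2) {
        const uint8_t *p0 = in + 3 * x;        /* sample S(x)   */
        const uint8_t *p1 = in + 3 * (x + 1);  /* sample S(x+1) */
        out[0] = p0[0];                              /* Y0        */
        out[1] = (uint8_t)((p0[1] + p1[1] + 1) / 2); /* shared Cb */
        out[2] = p1[0];                              /* Y1        */
        out[3] = (uint8_t)((p0[2] + p1[2] + 1) / 2); /* shared Cr */
        out += 4;
    }
}
```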
FIG. 2b relates to different ways in which the video samples may be ordered within the video sample stream. A first is referred to as “progressive” and a second is referred to as “interlaced”. According to a “progressive” approach, the samples within a video sample stream are ordered so that each of the frames of the video, when presented on the display, corresponds to a complete “full screen” picture.
By contrast, according to an “interlaced” approach, the samples within a video stream are ordered so that each frame of the video is presented on the display as a pair of consecutive “fields”, where each neighboring horizontal “scan” line in a field corresponds to every other horizontal “scan” line on the display.
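As a rough sketch of the ordering difference (assuming 8-bit samples, an even number of lines, and a simple byte-planar layout, none of which is dictated by the text above), a progressive frame separates into its two interlaced fields as follows:

```c
#include <stdint.h>
#include <string.h>

/* Split a progressive frame into two fields: the "top" field receives
 * the even-numbered lines, the "bottom" field the odd-numbered lines,
 * so each field holds every other scan line of the display. 'stride'
 * is the number of bytes per line. */
void frame_to_fields(const uint8_t *frame, uint8_t *top, uint8_t *bottom,
                     int lines, int stride)
{
    for (int y = 0; y < lines; y++) {
        uint8_t *dst = (y & 1) ? bottom : top;
        memcpy(dst + (y / 2) * stride, frame + y * stride, stride);
    }
}
```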
FIG. 2c demonstrates that different video sample streams may have different numbers of video samples per rendered horizontal line of displayed imagery and per rendered vertical line of displayed imagery. Here, the video sample stream should be formatted so that its samples correspond to (or at least can be rendered upon) the display upon which they are to be shown. Many video display types are designed to render imagery across a rectangular arrangement of picture elements, commonly referred to as “pixels”. The arrangement of pixels is typically characterized as the number of pixels that extend horizontally across the display and the number of pixels that extend vertically across the display.
c shows a representation of the pixel layout for two different video displays 218, 219. The first video display 218 has a pixel layout of 480 vertical pixels by 640 horizontal pixels. The second video display 219 has a pixel layout of 1080 vertical pixels by 1920 horizontal pixels. Note that different aspect ratios can be achieved based upon the pixel layout. That is, the first video display has a 4:3 horizontal to vertical aspect ratio (i.e., 640/480=4/3) that is typical of standard television; and, the second video display has a 16:9 horizontal to vertical aspect ratio (i.e., 1920/1080=16/9) that is more typical of standard cinema.
Because a display's pixel layout can be viewed as an array, a display can be viewed as having Y number of horizontal lines that each have X pixels; where Y is the vertical pixel count and X is the horizontal pixel count (i.e., display 218 can be viewed as having 480 horizontal lines that each have 640 pixels; and display 219 can be viewed as having 1080 horizontal lines that each have 1920 pixels). Here, referring briefly back to FIG. 2b, each field of an interlaced stream contains only every other one of these horizontal lines.
Because of the various ways in which a video sample stream may be formatted, specific signal processing blocks have been used to convert from one type of formatting to another type of formatting.
FIG. 3a shows a color space converter (CSC) 301. The color space converter 301 is responsible for converting from one type of sample encoding to another type of sample encoding. For example, color space converter 301 might convert trichromatic encoded samples into color difference encoded samples; or, color space converter 301 could convert color difference encoded samples into trichromatic encoded samples.
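As a concrete illustration of what a color space converter computes per sample, the sketch below uses the common 8-bit fixed-point approximation of the BT.601 studio-range equations; the choice of BT.601 coefficients is an assumption for the example (other standards, e.g. BT.709, use different constants):

```c
#include <stdint.h>

/* RGB -> YCbCr for one sample (BT.601 studio range, 8-bit fixed point).
 * The +128 terms provide rounding; the 16 and 128 output offsets are
 * folded into the sums so every intermediate value stays non-negative. */
void rgb_to_ycbcr(uint8_t r, uint8_t g, uint8_t b,
                  uint8_t *y, uint8_t *cb, uint8_t *cr)
{
    *y  = (uint8_t)(( 16 * 256 +  66 * r + 129 * g +  25 * b + 128) >> 8);
    *cb = (uint8_t)((128 * 256 -  38 * r -  74 * g + 112 * b + 128) >> 8);
    *cr = (uint8_t)((128 * 256 + 112 * r -  94 * g -  18 * b + 128) >> 8);
}
```

The opposite direction of conversion applies the inverse matrix in the same fixed-point style.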
FIG. 3c shows a deinterlacer 304. Deinterlacer 304 converts an interlaced video sample stream into a progressive video sample stream. Various types of deinterlacing schemes exist. A basic approach involves simply packing the content of a pair of consecutive fields into a single progressive frame. However, this approach can produce noticeable defects for fast motion imagery if neighboring fields correspond to different moments in time (i.e., a fast moving object will blur). As such, “motion adaptive” deinterlacers have been developed that intelligently calculate the missing content between neighboring field lines in order to produce a complete progressive frame.
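A minimal sketch of the motion-adaptive idea, decided per missing pixel, is shown below; the single-pixel motion measure and fixed threshold are illustrative assumptions, as practical deinterlacers examine larger neighborhoods and filter the motion estimate:

```c
#include <stdint.h>

/* For a pixel on a line the current field is missing: keep the pixel
 * from the opposite field when the scene is static ("weave"), or
 * rebuild it from the lines above and below when motion is detected
 * ("bob"/interpolate). 'prev' is the co-sited pixel one frame earlier. */
uint8_t adaptive_pixel(uint8_t above, uint8_t below,
                       uint8_t opposite_field, uint8_t prev, int threshold)
{
    int motion = opposite_field - prev;
    if (motion < 0)
        motion = -motion;
    if (motion <= threshold)
        return opposite_field;                  /* static: weave       */
    return (uint8_t)((above + below + 1) / 2);  /* moving: interpolate */
}
```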
Another type of deinterlacing technique, referred to as “inverse 2-3 pulldown”, effectively reverses a “2-3 pulldown” interlacing process. The 2-3 pulldown process converts frames from a 24 frame per second stream into a 2-3-2-3- . . . field pattern (e.g., a first frame is used to create a pair of fields, the next frame is used to create a trio of fields, etc.) so as to create a 60 field per second stream. Inverse 2-3 pulldown combines fields from the same original film frame to form a complete video frame. Because fields are combined by this process so as to create frames, “inverse 2-3 pulldown” is viewed as a deinterlacing technique. Another type of deinterlacing technique, referred to as “inverse 2-2 pulldown”, effectively reverses a “2-2 pulldown” technique that uses each frame to create a pair of fields.
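The 2-3 cadence itself reduces to simple arithmetic. The helper below (an illustration, not circuitry from the figures) maps a 60 Hz field index back to its 24 Hz source film frame; inverse 2-3 pulldown then weaves together the fields that this mapping assigns to the same film frame:

```c
/* 2-3 pulldown cadence: every group of 5 output fields is sourced from
 * 2 film frames (2 fields from the first, 3 from the second), which is
 * how 24 frames per second become 60 fields per second. Returns the
 * film frame index that sources video field number 'field'. */
int pulldown23_source_frame(int field)
{
    int group = field / 5;   /* 5 fields per pair of film frames    */
    int pos   = field % 5;   /* 0,1 -> first frame; 2,3,4 -> second */
    return 2 * group + (pos >= 2);
}
```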
FIG. 3d shows a frame rate converter. A frame rate converter changes the formatting of a sample stream from being compatible with a first frame rate (e.g., 60 frames per second (fps)) to being compatible with a second frame rate (e.g., 30 fps). Frame rate converters are typically constructed to provide an output stream that, when viewed, exhibits rendered images having the same speed of motion that the input stream exhibits at its corresponding frame rate. That is, typically, the frame rate conversion process is not supposed to change the speed at which moving images move when displayed. A frame rate converter typically accepts input information that informs the frame rate converter of the specific conversion that is to take place (e.g., a specific rate upconversion or rate downconversion).
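The time mapping behind the simplest (drop/repeat) style of frame rate conversion can be sketched as follows; motion-compensated converters replace the pick with an interpolation, but the timestamp bookkeeping is the same. This helper is an illustration, not the converter of FIG. 3d:

```c
#include <stdint.h>

/* Index of the input frame presented at output frame 'n' when
 * converting from in_rate fps to out_rate fps: both frames sit at the
 * same instant, so n/out_rate seconds is re-expressed in input frames.
 * E.g., 60 fps -> 30 fps yields 2*n (every other frame is dropped);
 * 30 fps -> 60 fps yields n/2 (every frame is repeated). */
int64_t source_frame(int64_t n, int64_t in_rate, int64_t out_rate)
{
    return (n * in_rate) / out_rate;
}
```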
FIG. 3e shows a scaler 306. A scaler converts a video sample stream that is formatted for a first pixel array into a sample stream that is formatted for a second pixel array. For example, referring also to FIG. 2c, scaler 306 could convert a sample stream formatted for the 640 by 480 pixel array of display 218 into a sample stream formatted for the 1920 by 1080 pixel array of display 219.
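For illustration, a nearest-neighbor scaler exposes the coordinate mapping involved; a production scaler would typically apply multi-tap polyphase filtering instead, and the single-component frame layout here is an assumption:

```c
#include <stdint.h>

/* Resample a single-component sw x sh image to dw x dh by mapping each
 * output pixel back to the nearest input pixel. */
void scale_nearest(const uint8_t *src, int sw, int sh,
                   uint8_t *dst, int dw, int dh)
{
    for (int y = 0; y < dh; y++) {
        int sy = (int)(((int64_t)y * sh) / dh);      /* source line   */
        for (int x = 0; x < dw; x++) {
            int sx = (int)(((int64_t)x * sw) / dw);  /* source column */
            dst[(int64_t)y * dw + x] = src[(int64_t)sy * sw + sx];
        }
    }
}
```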
Note that any of the signal processing blocks of FIGS. 3a through 3e may be used in combination in order to fully reformat a particular video sample stream.
The present invention is illustrated by way of example and not limitation in the figures of the accompanying drawings.
FIG. 2a (prior art) shows different video sample encoding schemes;
FIG. 2b (prior art) shows progressive and interlaced video streams;
FIG. 2c (prior art) shows displays having differing pixel arrays;
FIG. 3a (prior art) shows a color space converter;
FIG. 3b (prior art) shows a pair of color difference encoding scheme converters;
FIG. 3c (prior art) shows a deinterlacer;
FIG. 3d (prior art) shows a frame rate converter;
FIG. 3e (prior art) shows a scaler;
FIGS. 7a through 7e show different video sample stream processing channels that can be effected by the sample stream conversion unit depicted in FIG. 6;
FIG. 8a shows an embodiment of an audio sample delay unit;
FIG. 8b shows an embodiment of an audio clock frequency change circuit.
According to the embodiment of FIG. 4, a conversion unit 401 accepts a video sample stream at a video input and reformats it for the display upon which it is to be rendered.
The conversion unit 401 also accepts a video sample stream's corresponding audio sample stream at another input 403; and, provides the audio sample stream to the speaker driving circuitry 407 after applying some delay that accounts for the delay imposed upon the video sample stream by the conversion unit's video sample reformatting activity.
The video processing side 501b includes a multi-state routing network 509 that is able to accept differently formatted digital video sample streams across inputs 504_1 through 504_N and provides an output digital video stream at output 506. The multi-state routing network 509 also sends/receives video sample streams to/from a number of channel processing segments 510_1 through 510_N. Each channel processing segment performs one or more specific signal processing tasks upon a video sample stream so as to contribute to its reformatting. Because different video sample stream formats can be accepted and reformatted by the conversion unit 501, it is expected that not all of the channel processing segments 510_1 through 510_N would be used for every video sample stream format that could be received. Said another way, a first type of received video sample stream format might need only one of the channel processing segments in order to be properly reformatted, a second type might need two of the channel processing segments, a third type might need three, etc. The multi-state routing network 509 has routed wiring to interconnect: 1) the video sample stream input nodes 504, 505 to appropriate channel processing segment input nodes; 2) appropriate channel processing segment output nodes to appropriate channel processing segment input nodes; and 3) appropriate channel processing segment output nodes to the conversion unit video sample stream output 506.
The term “appropriate” as used above can be construed to mean “whatever might be needed to properly reformat a received video sample stream”. Said another way, the video processing side 501b can be viewed as being capable of establishing a number of different signal processing channels for each video sample stream input 504, 505. The multi-state routing network 509 is capable of establishing any of these signal processing channels by establishing the correct connections as between a video sample stream input, the one or more channel processing segments, and the conversion unit video sample output 506. Each set of connections that corresponds to a specific reformatting channel for a particular input can be referred to as a unique “state” that the multi-state routing network 509 is capable of being configured into. A specific embodiment of a multi-state routing network is discussed further below with respect to FIG. 6.
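Purely as an illustration of the “state” concept (the segment count, multiplexer fields and table entries below are hypothetical and are not taken from the figures), each state can be modeled as one row of a table of multiplexer selections that the control function applies in response to the input format:

```c
/* Hypothetical model of a multi-state routing network: each supported
 * reformatting channel ("state") is a tuple of multiplexer selections. */
enum mux_sel { SEL_A, SEL_B, SEL_C, SEL_BYPASS };

struct routing_state {
    enum mux_sel seg1_in;  /* what feeds the first segment           */
    enum mux_sel seg2_in;  /* what feeds the second segment          */
    enum mux_sel out_mux;  /* which segment output drives the output */
};

static const struct routing_state state_table[] = {
    { SEL_A, SEL_BYPASS, SEL_A },  /* e.g., color conversion only    */
    { SEL_A, SEL_A,      SEL_B },  /* e.g., conversion + deinterlace */
    { SEL_A, SEL_A,      SEL_C },  /* e.g., + frame rate conv./scale */
};

/* The "mode input": configure the network into one pre-defined state.
 * 'mode' must index a valid table entry. */
struct routing_state set_mode(unsigned mode)
{
    return state_table[mode];
}
```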
The conversion unit 501 also includes some form of control function 513 that understands the specific type of video sample stream formatting that is to be applied to the input video sample stream; and, in response, signals the multi-state routing network 509 to place it into the appropriate state.
The control function 513 may be implemented with a processor or microcontroller that executes software routines in order to set the multi-state routing network's state. Alternatively, a state machine circuit may be used. The manner in which the mode input 511 may be provided from the control function 513 to the multi-state routing network 509 may take on various forms such as dedicated control lines; or, a bus that is capable of transporting other transactions besides mode input 511 information. Note that one or more channel processing segments may also need to be provided with control information that signifies what type of processing is to be performed. For example, a deinterlacer function within a channel processing segment may be enabled or disabled with respect to the segment's overall processing in light of the digital video sample stream to be formatted.
Also, one or more components within the channel processing segments may be designed to “detect” the type of formatting that a received input video sample stream possesses and report this information to the control function (e.g., so that the control function can determine what type of multi-state routing network 509 state is to be effected). Reporting control line 514 can be viewed as an embodiment of a line for reporting such information to the control function 513.
The multi-state routing network 609 includes routed wiring for making the appropriate interconnections as a function of the state of mode input 611. The multi-state routing network 609 may be implemented with logic circuitry. The precise functioning of the multi-state routing network 609 of FIG. 6 is described in more detail just below with respect to FIGS. 7a through 7e.
FIGS. 7a through 7e show the different signal processing channels that the multi-state routing network 609 of FIG. 6 is capable of establishing.
FIG. 7b corresponds to a channel that uses channel processing segments 610_1, 610_2 and 610_3 in series with each other. The circuitry and channel formations observed in FIGS. 7a through 7e assume a deinterlacer 619, frame rate converter 620 and scaler 621 that operate upon 4:2:2 color difference encoded data.
As such, in order to deinterlace, frame rate convert and scale an RGB encoded input sample stream, the RGB encoded input sample stream must first be converted to a 4:2:2 color difference format. The circuitry and channel formations observed in FIG. 7b correspond to a channel that is used to provide deinterlacing for an RGB encoded sample stream. Here, the 4:2:2 encoded sample stream from the output of channel processing segment 610_1 is directed to the deinterlacer 619 by the selection of channel A for multiplexer 615. Because the output 606 is to provide an output sample stream in RGB encoded form, the 4:2:2 color difference encoded output from the deinterlacer 619 needs to be converted back from 4:2:2 color difference encoding to RGB encoding. Channel processing segment 610_3 helps perform this function by way of its including a 4:2:2 to 4:4:4 color difference encoding converter 622 and a YCbCr color difference to RGB color space converter 623. An output stream from the deinterlacer 619 is routed to the input of channel segment 610_3 by selecting channel B of multiplexer 616.
The circuitry and channel formations observed in FIGS. 7c through 7e correspond to channels that further involve the frame rate converter 620 and/or the scaler 621. The channel of FIG. 7c involves frame rate conversion of the sample stream, and the channels of FIGS. 7d and 7e involve scaling.
The conversion input 624 is used to set the frame rate conversion that is performed by the frame rate converter 620. The scale factor input 613 is used to set the scaling that is performed by the scaler 621. In an embodiment, the control function tells the frame rate converter 620 and/or the scaler 621 what frame rate conversion and/or scaling is to be performed. In a further embodiment, an end user having knowledge of the display upon which the output sample stream is to be displayed informs the control function of the frame rate and/or pixel array dimensions of the display. From this information, appropriate input parameters can be provided to the frame rate converter 620 and/or scaler 621.
It is important to point out that, at the expense of multi-state routing network complexity, alternate embodiments may be readily constructed having more channel processing segments with fewer signal processing blocks apiece for purposes of supporting an even greater collection of possible channels between the conversion unit's video input(s) and output. For example, the 4:2:2 to 4:4:4 converter 622 and the YCbCr to RGB color space converter 623 could be associated with a fourth channel processing segment so as to leave channel processing segment 610_3 with only a frame rate converter 620 and scaler 621. In this case, by routing the output of scaler 621 into another input channel of multiplexer 614 and by keeping the output of color space converter 623 routed to an input channel of multiplexer 614, the output 606 would be capable of providing not only RGB encoded outputs but also YCbCr encoded outputs. As such, the conversion unit would be capable of providing deinterlacing, frame rate conversion and/or scaling for: 1) an RGB to YCbCr input/output conversion process; or, 2) a YCbCr to RGB input/output conversion process. In this case, the multi-state routing network 609 would need to be further redesigned (beyond the changes to the input of multiplexer 614 described just above) to further allow for the output of the scaler 621 to be routed to the input of the 4:2:2 to 4:4:4 converter 622. Further still, the multi-state routing network 609 could be designed to route those input streams requiring neither frame rate conversion nor scaling so as to bypass channel segment 610_3 altogether.
Referring back to FIG. 4, recall that the conversion unit delays the audio sample stream so as to account for the delay imposed upon the video sample stream by the conversion unit's reformatting activity.
According to the approach of FIG. 8a, the audio samples are delayed by temporarily storing them within a ring buffer memory 801 that is managed by logic circuitry 802.
The audio delay function works by writing data into the ring buffer 801. There are read and write pointers into the buffer, and the difference between these pointers determines the delay (e.g., as specified at the “delay select” input). A specific amount of delay will produce a fixed difference between the read and write pointers for a given audio format. However, if the audio format changes such that its storage requirements change (e.g., a different amount of memory is required to store 10 ms worth of data, for a 10 ms delay, for a 16 bit 48 kHz audio sample stream than for a 24 bit 96 kHz audio sample stream), then the difference between the read and write pointers must change as well.
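A software model of the pointer arithmetic just described is sketched below; the buffer capacity, the byte-at-a-time interface and the bytes-per-second computation are assumptions made for the example:

```c
#include <stdint.h>
#include <string.h>

#define RING_BYTES 65536u            /* hypothetical capacity */

struct audio_delay {
    uint8_t  buf[RING_BYTES];
    uint32_t rd, wr;                 /* read and write pointers */
};

/* Set the pointer gap to hold delay_ms worth of data for the current
 * format, e.g. 192000 B/s for 16-bit stereo at 48 kHz versus 576000 B/s
 * for 24-bit stereo at 96 kHz; a format change changes the gap. */
void delay_set(struct audio_delay *d, uint32_t delay_ms, uint32_t bytes_per_sec)
{
    memset(d->buf, 0, sizeof d->buf);   /* emit silence until primed */
    d->rd = 0;
    d->wr = (uint32_t)(((uint64_t)delay_ms * bytes_per_sec / 1000) % RING_BYTES);
}

/* Move one byte through the delay line. */
uint8_t delay_step(struct audio_delay *d, uint8_t in)
{
    uint8_t out = d->buf[d->rd];
    d->buf[d->wr] = in;
    d->rd = (d->rd + 1) % RING_BYTES;
    d->wr = (d->wr + 1) % RING_BYTES;
    return out;
}
```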
FIG. 8b shows an embodiment of an audio sample rate change detection function that can be designed into the logic circuitry 802 of FIG. 8a.
The framer and control unit 804 provides “start” and “stop” commands to the frequency counter 803 for every fixed number of passing audio clock cycles (e.g., 16). The frequency counter therefore “measures” the time length (in units of the reference clock cycle time) of a fixed number of audio samples. The save and comparison circuitry 805 saves consecutive measurement results as determined and provided by the frequency counter 803. In an embodiment where the reference clock frequency is 30 MHz, the framer 804 provides start and stop signals across 16 audio clock cycles, and the audio clock rate can be any of (in MHz) 1.0240, 1.2800, 1.4112, 1.5360, 1.7640, 1.9200, 2.0480, 2.1168, 2.3040, 2.8224, 3.0720, 3.5280, 3.8400, 4.2336, 4.6080, 5.6448, and 6.1440; an audio clock frequency change is deemed to have occurred anytime consecutive counter measurements differ by more than three (i.e., the absolute value of the difference between two subsequent measurements is greater than 3). As a consequence, a reset control signal is issued to reinitialize (set to zero) the contents of the ring buffer memory and restart the delay process.
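The save-and-compare step reduces to a few lines, sketched below with an assumed software interface. With a 30 MHz reference and 16 audio clock cycles per measurement, a 6.1440 MHz audio clock yields counts near 78 (16 x 30 / 6.144 = 78.125) while a 1.0240 MHz clock yields counts near 469, so any two distinct rates in the list above are separated by far more than the 3-count jitter allowance:

```c
#include <stdlib.h>

/* Each count is the number of 30 MHz reference cycles observed across
 * 16 audio clock cycles. Returns nonzero when the audio clock rate has
 * changed, i.e., when consecutive counts differ by more than 3; the
 * caller then resets the ring buffer. Initialize *saved_count to -1. */
int check_count(int *saved_count, int new_count)
{
    int changed = (*saved_count >= 0) && (abs(new_count - *saved_count) > 3);
    *saved_count = new_count;
    return changed;
}
```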
Note also that embodiments of the present description may be implemented not only within a semiconductor chip but also within machine readable media. For example, the designs discussed above may be stored upon and/or embedded within machine readable media associated with a design tool used for designing semiconductor devices. Examples include a circuit description formatted in the VHSIC Hardware Description Language (VHDL), Verilog or SPICE. Some circuit description examples include: a behavioral level description, a register transfer level (RTL) description, a gate level netlist and a transistor level netlist. Machine readable media may also include media having layout information such as a GDS-II file. Furthermore, netlist files or other machine readable media for semiconductor chip design may be used in a simulation environment to perform the methods of the teachings described above.
Thus, it is also to be understood that embodiments of this invention may be used as or to support a software program executed upon some form of processing core (such as the Central Processing Unit (CPU) of a computer) or otherwise implemented or realized upon or within a machine readable medium. A machine readable medium includes any statutory mechanism for storing information in a form readable by a machine (e.g., a computer). For example, a machine readable medium includes read only memory (ROM); random access memory (RAM); magnetic disk storage media; optical storage media; flash memory devices.
In the foregoing specification, the invention has been described with reference to specific exemplary embodiments thereof. It will, however, be evident that various modifications and changes may be made thereto without departing from the broader spirit and scope of the invention as set forth in the appended claims. The specification and drawings are, accordingly, to be regarded in an illustrative rather than a restrictive sense.