The present invention relates generally to digital video compression systems.
In the commonly-owned patent application Ser. No. 10/260,534 ('534 application), published as US Publication No. US2005-0069034 one common inventor described a new technology for encoding digital video that exhibited particular success in the computer video arts. The contents of that publication will be presumed to be of knowledge to the reader, and are incorporated herein by reference.
In the typical computer video scenario, digital pixel information is prepared by a server 7 (
The video compressor 17 can be a local hardware component near or in the server 7 (anywhere, such as on a daughter card, a hang-off device, an external dongle, on the motherboard, etc.), a software component (anywhere, such as in a local CPU, a video processor, loaded in the motherboard, etc.), or an external pod communicating with the server via a communication link, network, wireless, or other coupling protocol.
Inside the video compressor 17, one of the frame buffers 11 and 12 receives the serial pixels from the source video 10 and loads them into the frame buffer to (typically) mimic the local frame buffer 6. A switch ahead of the frame buffers 11 and 12 loads a current (or “new”) frame into one of the frame buffers 11 or 12 while the other of the frame buffers 11 or 12 retains the previous (or “old”) frame that the switch had just previously directed to it. In that way, at any given time, one of the frame buffers 11/12 retains a complete old frame and the other of the frame buffers 11/12 is being fed a new frame. The frame buffers then alternative, frame-by-frame, storing/loading the old/new frames.
The old and new frames are used by the video compressor 17 to determine relationships between pixels in the current frame compared to the previous frame. An encoder 13 within the video compressor 17 determines those relationships between the pixels in the current frame (drawn from the new frame buffer 11/12) and pixels in the prior frame (drawn from the old frame buffer 11/12). The encoder 17 may also determine relationships between pixels each located with the current frame. In each case, the relationships can include run-length relationships or series relationships.
Run-length relationships identify runs of pixels in the serial pixel stream (from the source video 10) that have pixel values related to already known pixel values. By identifying the relationship, the decoder is instructed to “copy” the known pixel(s) for the identified run-length, rather than writing the independently identified pixel values. The run-length relationships can include any relationship determined between pixels of the current frame or between pixels of the current and previous frames. They may include the so-called (1) “copy old,” (2) “copy left,” (3) “copy above,” or other locational relationship commands. The “copy old” (CO) command is particularly appropriate for the present disclosure. In it, the pixel values for pixel locations in the current run-length of the current frame are determined to be the same as those pixel values of the previous frame in the same pixel locations. The CO command simply tells the decoder copy the same pixels for a run of X number of pixels that are identical to the pixels in the same run location of the previous frame. Similarly, the “copy left” (CL) command and “copy above” (CA) command indicate that the present run of pixels are the same as the pixels on the left of the current pixels (in the case of the CL command), or the pixels are the same as the pixels above the current pixels (in the case of the CA command). Of course, other kinds of locational relationships (other can “old,” “left,” and “above”) can be and are envisioned as well.
In the preferred run-length cases, the format for the encoding can include (using eight bit bytes by way of example only):
(1) For a first byte in the encoding, the byte can begin with a number of first bits identifying a code indicative of the run-length type (CO, CL, CA, etc.) followed by a remaining number of bits identifying the run length itself. For example, an eight bit byte can employ the first three bits for code indication followed by the next five bits indicating in a binary word the run length (up to a 25 pixel run length).
(2) Another following byte of encoding if the run length exceeds 25 pixels, where the first bit is a code indicating that the byte continues the previous run, followed by seven more bits in the binary word (which when strung with the previous 5 bits of the previous word will make a 12 bit word indicative of up to a 212 pixel run length).
(3) A number of additional following bytes like those in (2) where the run length exceeds the 212 pixel run length of the string of previous bits.
“Series” commands are a little different from the run-length commands and can contribute remarkable efficiency to the video compression. They are described in more detail in the '534 application, so only a brief description will be provided here. In essence, the series commands instruct the decoder to write a run of pixels using just two prior-known colors. In the preferred series cases, the format for the encoding can include (using eight bit bytes by way of example only):
If neither run-length nor series encoding is available or plausible, then the encoder will resort to higher overhead single-pixel color commands (usually requiring three bytes per pixel color for five bit color, and more for higher quality color) to instruct the decoder on a particular pixel value.
As shown in
The video compressor 17 communicates with a client 19, typically by a network connection via a standard network interface (not shown) such as an Ethernet or other suitable network communication system. Of course, the video compressor 17 and client 19 could also communicate by another other communication means such as a hard wire, wireless, etc. The system of
At the client 19, the decoder 18 is usually an application or script function in the local processing system 21 already in the client 19. If the client 19 is a computer workstation, for example, the decoder 18 is an application that runs on the local CPU employing some local memory 22. Also, client 19 usually contains a frame buffer 20 (sometimes on a separate video processing board) that receives the pixel information for a frame from the decoder 18. In practice, the objective is to move the information from the frame buffer 6 in the server 7 to the frame buffer 20 in the client 19 through the frame buffers 11/12 in the video compressor 17. In the midst, the video compressor reduces the size of the frame of information by the run-length, series, and pixel encoding and the decoder 18 restores the size of the frame by decoding it.
Presently, the cost of the frame buffer chip 16 is driving the cost of the video compressor 17. As the price of FPGAs for FPGA chip 14 (or alternatively ASICs, etc.) is falling, the price of the frame buffer chip has come to dominate the parts cost. We have developed a way to eliminate the frame buffer chip 16 without altering the kinds of code used and thus advantageously not altering the decoder function 18 in any way. Instead of storing all pixels in a video frame buffer, the disclosed encoder only stores check values for groups of pixels. Grouping of pixels can be done in any fashion, for example; a group can be all pixels on a single video line, a portion of a line, multiple lines or screen rectangles containing portions of multiple lines. For purposes of example only, the embodiment described below defines all pixels on each single video line as a group of pixels, therefore a video screen of 1024 by 768 pixels would have 1024 groups of pixels and 1024 check values stored in memory.
When the encoder finishes encoding the first line of the frame according to the run-length, series, and pixel commands described above, it then runs a check value on the encoding and stores that check value for that line in the local buffer 15 of the FPGA. It then sends the encoding to the decoder 18, which decodes the information in its normal manner and loads the resultant pixel values for that line in the frame buffer 20, as usual. The encoder then continues with the next line of the frame until each line of the frame is encoded, a corresponding check value is stored in the local buffer 15, and the encoding is sent to the decoder 18.
When the first line of the next frame arrives, it too is encoded by the encoder and its check value determined. If the check value is the same as the check value stored for the prior frame, then the encoding is discarded and the encoder re-codes the line as a “copy old” command using the entire line as the run length. The stored check value remains the same in the local buffer 15 and will be used again for the same line when the next frame arrives. The decoder, receiving the “copy old” command operates on them as it normally would: it copies the old pixels from the prior known frame for the entire line.
If the check value for a line is different from that stored in the local buffer 15, then the encoder overwrites the new check value for that line in the local buffer 15 and then sends the new encoded line to decoder 18. Decoder 18 again decodes the line as it normally would.
If the network throughput is insufficient for an encoded line to be sent, the encoder leaves the check value for that group unchanged and sends a command instructing the decoder not to change those pixels (even though they did change). This defers the updating of that line until the next frame. (This form of flow control would not be required if a frame buffer were used to hold all pixels from all lines until the network throughput was sufficient to resume sending).
Whenever a line is not updated and its updating is deferred until the next frame (as described above), the Copy Above (CA) command can not be used during the encoding of the line immediately following the deferred line, however all other encoding commands can be used.
As can been seen, the decoder 18 has no ability to realize when the encoder has chosen to encode the line based on the normal encoding procedure versus the mandated “copy old for a line run-length” procedure. It simply writes pixels as it's told by the same kinds of run-length, series, and pixel commands normally sent to it. The encoder sends the normal run-length, series, and pixel commands line-by-line unless it determines for a particular line that a check value is the same, in which case it mandates the “copy old for a line run-length” command.
In the end, the encoder no longer has to store entire frames of information, so the frame buffer chip can be eliminated. All of its encoding can be accomplished by receiving and encoding just a line or so at a time using just “copy left,” “copy above,” “make series,” and “draw pixel” commands until the check value determination reveals that a “copy old” is appropriate. In that instance, the encoder does not even have to know (and could not find out anyway) what the “old” pixels were—only that whatever the decoder has stored as the “old” pixels are ones that it should copy. The encoder then stores only a line (or few lines), which is a small enough amount of data (compared to one or two entire frames, for example) that it can be stored in the local buffer 15 and the frame buffer chip can be eliminated from the video compressor.
The present invention, in order to be easily understood and practiced, is set out in the following non-limiting examples shown in the accompanying drawings, in which:
Reference is made to
In an example video compression system, a client 19 with a workstation monitor is expected to receive video signals for display on the monitor from a distant server 7. Video signals are notoriously high-volume signals. A single screen of video at a common resolution of 1024 by 768 can be around one million pixels. Each pixel has a defined color, and each color has a defined red component, blue component, and green component (other color schemes are also known and can be used, but the so-called RGB system will be used herein by way of illustration and not limitation). Each red, blue and green color component is defined by a numeric value written as a binary word, sometimes five bits long (providing 25=32 possible color values for each red, green, and blue) but as long as the system can reasonably accommodate without limit. With five bit component values, a minimum of 15 bits are required to define each single pixel color, which are usually embodied in two eight-bit bytes. The one-million pixels for a video screen thus require two-million bytes to define the colors. A screen usually refreshes every 1/60th of a second, so transportation of 120 megabytes per second would be required to deliver streaming video without compression. And, that assumes a relatively low 5-bit color scheme, where many users would prefer a higher quality of color composition. Some communication links may accommodate such large volumes of constantly streaming data—but not many—especially if there are multiple simultaneous users employing the same communication link.
To alleviate the video data volume, a video compressor 17 receives the video from a source video 10 and reduces it. Each frame of the video is alternatively loaded into a new/old frame buffer 11/12 where it is retained for use by the encoder 13 programmed into an FPGA chip 14. According to one example, the encoder 13 encodes the video by a hierarchical choice of run-length encoding, series encoding, or as a last resort individual pixel encoding. The run-length encoding essentially identifies a run of pixels the color of which can be identified on the basis of pixel colors that are already known. Thus, a CO command will instruct the decoder for a current pixel to copy the color of the pixel at the same pixel location as the current pixel but from the previous frame. A CA command will instruct the decoder for a current pixel to copy the color of the pixel immediately above the current pixel in the same frame. A CL command will instruct the decoder for a current pixel to copy the color of the pixel immediately to the left of the current pixel in the same frame. Assuming again, by way of example only, a five-bit color scheme in which each pixel would require three eight-bit bytes to identify its individual color, that same pixel may be identifiable as a Copy command in only a single byte. Further, if a continuous run of pixels can be identified as all comporting to a common command condition (such as each pixel in a run of 100 pixels is the same as their corresponding old, above or left pixels), then a code can be written to tell the decoder in a byte or two that a Copy command applies to a run of 100 pixels. In such a case, a run of 100 pixels that could require 300 bytes of coding to individually identify each pixel color could be accurately encoded with only a byte or two.
Example formats for copy encoding can be found in the '534 application. One such example is described below for purpose of convenience to the reader. In it, an eight-bit byte is assumed, although the encoding can be used in any byte size of any number of bits. For copy commands, each byte is in the format: CCCRRRRR, where the first three C-bits identify the command type according to the following key:
000=Copy Old Command
001=Copy Left Command
010=Copy Above Command
The next five R-bits identify the run length. If the current run is determined by the encoder 13 to be less than 25 (i.e., 32 continuous pixels), then the five R-bits of-one byte will encode the run length. If the run length is more than 25 then a following eight-bit byte is encoded with the same three command bits followed by another five bits that are combined with the five bits of the preceding byte to make a ten-bit word accommodating a 210 run length (i.e., 1024 continuous pixels). A third byte can be added to encode a run of 215 run length (i.e., 32,768 continuous pixels) and a fourth byte can accommodate an entire screen as a continuous run of 220 run length (i.e., 1,048,576 continuous pixels, which is more than the pixels in one full screen of 760×1024).
Thus, a continuous run amounting to an entire screen that would have taken around 120 million bytes to write the individual colors can be written in just four bytes of encoding.
The series command is used whenever a run of pixels of just two colors is found. In it, the two colors are first encoded using one of the copy of pixel-draw commands so the decoder knows the actual value of the two possible colors (in essence, the decoder knows that the two colors immediately preceding the series command are the two colors to be used in the series, with the first color assigned the “0” value and the second color assigned the “1” value). The first byte in the series bytes has the following format: CCCXDDD, where C is the command key identifying the series command, X is the multiple byte indicator, and the D-bits indicate which of the two possible colors the next three consecutive pixels are. In this case, the CCC key for the series command is 011 (a code unique compared to the copy command keys). The X bit is set to “0” if the run of two-color series is just three pixels (corresponding to the three D-bits) long, and to “1” if the next byte continues the two-color series. Each subsequent series byte then takes the form of: XDDDDDDD, where the X-bit again indicates whether the next byte continues the series (by indicating “1” until the last byte in the series is reached in which case it is set to “0” to indicate the next byte is the last one) and each D-bit again indicates the “0” or “1” color for the next seven pixels in the continuous series.
If neither a copy command nor a series command will effectively encode the next pixel(s), the encoder resorts to encoding the next pixel as a make pixel command. In this command, the pixel color is communicated using the traditional Red, Green, Blue color values. For five bit color, the two byte pixel command takes the form of: CRRRRRGG GGGBBBBB, where C is a key, for example “1,” that identifies the make-pixel command (note that none of the other commands began with a “1”), RRRRR is five-bit red color value, GGGGG is the five-bit green color value, and BBBBB is the five-bit blue color value. The encoder 13 tries to encode pixels using other, more efficient encoding before it finally resorts to as few make-pixel encodings as possible and can return again to a copy or series command.
The above descriptions, especially with respect to the copy old command, require the video compressor 17 to store a previous frame as it receives the current frame from the source video 10 in order to compare the current pixel values with the pixel values in the same locations of the previous frame. Thus,
On the client side, a decoder 18 receives the copy, series and make-pixel commands and re-writes the pixel colors based on those commands into a frame buffer 20. The decoder 18 can be a script function or an application written in the existing local processor 21 of the client 19.
The video compressor 17 can be employed as a hang-on device to the server 7, or it can be included in server 7 as a daughter card, as a built-in to the video processor, as an embedded process in the mother board, or any other hardware or software accommodation. In any event, it is advantageous to reduce the cost of the components in the video compressor 17, including the frame buffer chip 16. The embodiment of
In
The decoder 18 operates the same as the decoder did with the frame buffers present. In other words, it does not know or care whether the coding commands were produced by standard encoding or by check value replacement encoding. It simply decodes the run-length, series, and pixel commands exactly as it would have done otherwise.
The encoding steps are shown schematically in
The check value operation can be any kind of determinative operation. The simplest may be a check sum in which the bit values of the encoded bytes are summed. Any other kind of determinative operations could also be employed. Check value algorithms are widely-known and vary widely. Any of the known check-sum, cyclic redundancy check or other determinative algorithms may be employed, and check values or determinative algorithms designed in the future may be employable as well. Whichever check value algorithm is chosen, it should in ideal situation yield a value that is uniquely associated with that object, like a fingerprint. There is, however, a trade-off between degree of distinction and size/complexity of the check value. A check value that is long and complex may be virtually guaranteed to uniquely correspond to a particular line encoding, but it may also be so long and complex that its determination or storage impedes the desirable results of encoding the video quickly and storing the check value locally. That is, a check value that takes too long to compute will hold up the delivery of the video line to the decoder (10's of thousands of lines may be moving each second). Also, a check value that is itself too long may fill the local buffer 15 of the FPGA will check sum values, leaving no further buffer space available for general FPGA processing.
With any check value algorithm it is possible that the same value could be inadvertently created for two different screens so the preferred check value algorithm would be one that minimized this probability. Check value algorithms that include pixel position in the calculation are preferred because they minimize inadvertently creating the same value for different video screens that typically occur sequentially, such as a cursor moving horizontally and relocating along a video line. A method of periodically updating the decoder's video screen without relying on the check value could also be included since the chance of inadvertently creating the same value for two different screens can be minimized but never completely eliminated. A 16-bit check value has a 1 in 65,536 chance of inadvertently creating the same value for two different screens.
The trade-offs for selecting pixel group size interact with the trade-offs for selecting the length of the check value. Larger groups require fewer check values per frame but are more wasteful when network throughput is insufficient. The amount of buffer memory available and the expected network throughput are key factors in selecting the optimal value for both of these values.
Once an appropriate check value algorithm is chosen and employed on the encoded result of line 31 of
Referring to
If during the comparison of
If the check value matches the stored check value, then the current bytes of codes are discarded at step 66 and a copy command counter (in the FPGA code) is incremented for the N pixels in line (i.e., the line length), at step 67. That counter is optionally used to increase the efficiency of the encoding to add subsequent line runs if the next lines can also be encoded by the “copy old” command, meaning that the check values for the subsequent lines match their corresponding stored values as well. As will be seen, once a check value of a next line does not match the corresponding stored value, then the counter will be reset and the counter value will be used as the run length for the single “copy old” command sent by the encoder 23. That is, the “copy old” command will announce a run length in excess of a single line length.
If the check value does not match, at step 65, the calculated check value for the current line is overwritten into the local buffer 15. That indicates that the current encoding is different from the corresponding line of the prior frame and must be sent to the decoder. But first, the encoder 23 determines whether any prior lines are encoded and unsent, at step 68. That occurs when, as described above, a “copy old” command of one or more line run-length has been accumulating in the counter (from step 67, for example). If the counter indicates that a “copy old” line(s) is waiting to be sent, at step 69, it is compiled and sent. If no such delayed commands exist, then the encoded current line is sent at step 72, after (optionally) determining at step 70 whether the last pixel in the current line is run-length encoded—in which case the first one or more pixels of the next line may be includable in the same run for efficiency. If so, at step 71, the portion of the current line that is ready for sending is sent and the end-portion of the line that has been preliminarily run-length encoded is held over to the next line to determine whether some starting pixels of the next line can be included in the same run. When the run is completed, it will be sent as a whole during an appropriate next step 68/69.
It should be noted that it may turn out that the check value for a current line is only coincidently the same as for a previous line that is really substantively different; but, such situations are unlikely and in the exceedingly rare event that they do happen, they will be quickly overwritten by the next corresponding frame line. The check value methodology and sophistication can be chosen to correspond to the level of false matches that can be tolerated.
The encoder 23 does not immediately send the “copy old” command when the check values match, but instead increments a running counter that counts the number of pixels in the previous and current lines that qualify for the “copy old” command. This improves the efficiency of the compression by increasing the run lengths where appropriate. That process of incrementing the counter and waiting for the next line is shown at step 85.
If the encoder finds no match between the current check value and the stored value, it sends the encoded line 80 to the decoder 18, after sending any “copy old” runs that have been delayed by step 85.
Then, the next line 32 arrives at the encoder 23. It is encoded into code stream 81, and its associated check value is determined and compared with the previously stored value of 0×67. Again, if the check value matches, at step 86, then the code stream 81 is discarded at step 89 and the “copy old” command is again presumed. At step 90, the pixel run is again incremented and held until a next line no longer-qualifies for the “copy old” command. If a match does not occur, at step 87, then any unsent “copy old” commands (from, for example, step 85) are compiled and sent based on a run equal to the counter value, followed by the current code stream 81. That mis-match check value from step 87 is then over-written into the local buffer 15 at the location corresponding to the line (in this case, Check Value AB). The process then continues from line-to-line and frame-to-frame, indefinitely.
Although the disclosure describes and illustrates various embodiments of the invention, it is to be understood that the invention is not limited to these particular embodiments. Many variations and modifications will now occur to those skilled in the art of backup communication. For full definition of the scope of the invention, reference is to be made to the appended claims.
This is a continuation-in-part of U.S. patent application Ser. No. 10/260,534 to Dambrackas, entitled “Video Compression System,” filed Oct. 1, 2002 (Dambrackas Video Compression).
Number | Date | Country | |
---|---|---|---|
Parent | 10260534 | Oct 2002 | US |
Child | 11282688 | Nov 2005 | US |