Method and circuit of high performance variable length coding and decoding for image compression

Description

BACKGROUND OF THE INVENTION

1. Field of Invention

The present invention relates to the variable length coding and decoding method, more specifically to a compression and decompression method and circuit which result in shorter code length of representing a data stream and short time in compression and decompression.

2. Description of Related Art

Efficient image compression coding plays important role in lower cost in storage and higher speed in data transmission accessing either. Another advantage of an efficient image coding is the lower power consumption in storage and data accessing due to the less data rate after compression.

There are some still image compression standard like JPEG which is a popular lossy compression algorithm with wide application like digital still camera, DSC and scanner. JPEG is a lossy compression requiring high computing power for compression and which compares to the original image data, there will some pixels become not the same values before compression which in some applications are not acceptable.

There are also other image compression algorithms like ADPCM which is also lossy algorithm with high amount of pixel values are not the same with original pixels.

This invention is to overcome the issues of high computing power of image compression as well as maintaining top quality compare to the original data with reasonable compression rate.

SUMMARY OF THE INVENTION

The present invention of high performance variable length coding for image compression reduces data rate with high throughput of compressed image and decompressing the pixel data in a short time.

- The present invention of high performance image compression applies a variable length coding method to represent the differential value between adjacent pixels.
- According to an embodiment of this invention, the differential value is divided by a predicted divider, the value of Quotient and Remainder are coded by assigning number of “0s” to represent the values of the Quotient, and binary code for representing the Remainder.
- According to an embodiment of this invention, the predicted divider is represented by an integer number of the power of “2” which helps in reducing the bit number and results in high performance in encoding and decoding.
- According to an embodiment of this invention, within the same clock cycle during encoding, the Quotient and Remainder of each color component of a pixel are calculated in parallel with predicted divider.
- According to an embodiment of this invention, the codes of Quotient and Remainder are separated by assigned a marker bit of “1” if the Quotient value is coded by assigning “0”, and if the Quotient value is coded by assigning “1”, then, the market bit is “0” as a separator.
- According to an embodiment of this invention, the code of Quotient is followed by the code of Remainder and can be revised with Remainder in front followed by Quotient.
- According to an embodiment of this invention, the differential values of pixel components of a pixel are separately encoded, if the input is in series, compression is done in series, if input of pixel component is in parallel, compression is done in parallel.
- According to an embodiment of this invention, the value of the divider is calculated by weighted factor of latest differential value of adjacent pixel and the previously divider.
- According to an embodiment of this invention, the three pixel components, Y, U and V of the same pixel are compressed separately and are packed together.
- According to an embodiment of this invention, two levels of decoding the Quotient is applied to reduce the latency time.

It is to be understood that both the foregoing general description and the following detailed description are by examples, and are intended to provide further explanation of the invention as claimed.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows a prior art of JPEG, a still image compression algorithm.

FIG. 2A depicts the process of a differential coding following by a lossless variable length encoding of the difference of adjacent pixels.

FIG. 2B depicts the process of an image decompression

FIG. 3 illustrates a variable length which is included in this invention.

FIG. 4 illustrates the procedure of predicting the value of divider.

FIG. 5 depicts procedure of the VLC coding for the Diff. of adjacent pixels of a group of pixels.

FIG. 6 illustrates the process of coding the image pixel with three pixel components (YUV/pixel) and the structure of the compressed pixel data.

FIG. 7 illustrates the input waveforms of pixel data to be compressed and the output of the compressed pixels.

FIG. 8 depicts the flowchart of high performance lossless data decoding of the compressed pixel by decoding the Quotient and Remainder with calculation of divider.

FIG. 9 depicts the block diagram of implantation of the high performance lossless data decoding of the compressed pixel by decoding the Quotient and Remainder with calculation of divider of each pixel component.

DESCRIPTION OF THE PREFERRED EMBODIMENTS

The present invention relates specifically to the image compression for data reduction while still maintaining good quality. The present invention significantly reduces the amount of data of image and stored in a storage device, and correspondingly reduces the density, bandwidth requirement, power consumption and cost of storage devices for storing streaming data.

There are some prior arts of image compression methods of reducing image data. FIG. 1 depicts a popular compression algorithm is the still image compression standard named JPEG. JPEG compression includes some procedures in coding data stream. The color space conversion is to separate the luminance (brightness) from chrominance (color) and to take advantage of human being's vision less sensitive to chrominance than to luminance and can reduce more chrominance element without being noticed. In JPEG compression, an image is partitioned into many units of so named “Block” 11, 12 of 8×8 pixels to run the JPEG compression. A color space conversion mechanism transfers each 8×8 block pixels of the R(Red), G(Green), B(Blue) components into Y(Luminance), U(Chrominance), V(Chrominance) and further shifts them to Y, Cb and Cr. JPEG compresses 8×8 block of Y, Cb, Cr by the following compression procedures:

Step 1: Discrete Cosine Transform (DCT)
Step 2: Quantization
Step 3: Zig-Zag scanning
Step 4: Run-Length pair packing and
Step 5: Variable length coding (VLC).

DCT 13 converts the time domain pixel values into frequency domain. After transform, the DCT “Coefficients” with a total of 64 sub-bands of frequency represent the block image data, no long represent single pixel. The 8×8 DCT coefficients form the 2-dimention array with lower frequency accumulated in the left top corner, the farer away from the left top, the higher frequency will be. Further on, the closer to the left top, the more DC frequency which dominates the more information. The more right bottom coefficient represents the higher frequency which less important in dominance of the information. Like filtering, quantization 14 of the DCT coefficient is to divide the 8×8 DCT coefficients and to round to predetermined values. Most commonly used quantization table will have larger steps for right bottom DCT coefficients and smaller steps for coefficients in more left top corner. Quantization is the only step in JPEG compression causing data loss. The larger the quantization step, the higher the compression and the more distortion the image will be.

After quantization, most DCT coefficient in the right bottom direction will be rounded to “0s” and only a few in the left top corner are still left non-zero which allows another step of said “Zig-Zag” scanning and Run-Length packing 15 which starts left top DC coefficient and following the zig-zag direction of scanning higher frequency coefficients. The Run-Length pair means the number of “Runs of continuous 0s”, and value of the following non-zero coefficient. The Run-Length pair is sent to the so called “Variable Length Coding” (VLC) 16 which is an entropy coding method. The entropy coding is a statistical coding which uses shorter bits to represent more frequent happen patter and longer code to represent the less frequent happened pattern. JPEG standard adopts “Huffman” coding algorithm as the entropy coding. VLC is a step of lossless compression. JPEG is a lossy compression algorithm. JPEG compression shown in FIG. 1 is a reversible procedure which means, a compressed JPEG image can be decompressed and reconstructed to be original pixel of Y, U, V forms through the reverse rout of compression by the following reverse procedure: Variable Length Decoding, VLD, Run-Length Unpacking, Dequantization, and inverse DCT.

The JPEG picture with less than 1OX compression rate has acceptable good image quality, 20× compression will have more or less noticeable quality degradation. The JPEG image data stream coding costs relatively high computing power. For example, in software solution with a single CPU of 16 its data, it requires about 40 MIPS to encode a picture of 1M pixels of data within 1 second. The time distribution for encoding an JPEG image with 1M pixels is as the following: The total block number: 23,400, 1024 Macs of each block, So, DCT requires a total of 24M Macs (or 24 MIPS), quantization requires about ⅕ of that of DCT (or 5 MIPS), others dominates about another ⅕ of DCT computing time (or 5 MIPS). That comes out of ˜40 MIPS.

This invention of efficient image compression applies a new method and circuit of the VLC coding to achieve data reduction with much less computation compared to JPEG. FIG. 2A illustrates the flowchart of basic concept of the compression comprising two procedures: firstly taking the differential value 22 of adjacent pixels 21, secondly applies a variable length coding method 24 to represent the differential value, Dn 23. The difference is then sent to a VLC coding to further reducing the data rate. Decompression procedure reverses the image compression procedure shown in FIG. 2B. The compressed pixels 25 will firstly go through variable length decoding 26 to reconstruct the differential value of adjacent pixels, and the differential value will be added 27 to the previous pixel value to recover the value of present pixel 28. The recovered pixels will be temporarily saved in a register for reference in the next pixel.

The method of new variable length coding method of image compression accords to the following equation:

Diff.=Q×M+R (Q: Quotient, M: divider and R: Remainder)

This method of efficiently variable length coding is to code the “Quotient, Q”, divider” and “Remainder, R” with the M implicitly predicted which costs no bit in the bit stream. The VLC coding in this invention of efficient lossless data stream coding includes the following procedures:

- Calculating the quotient. 26
- Calculating the remainder. 26
- Implicitly calculating the divider, M or for more efficiently in calculating K, the value of 2^Kof the divider, M without assigning a code to represent it.
  
  For saving code, the N is predicted by examining the previous N and the Diff value, the difference of adjacent pixels. Based on the principle of high continuity of either adjacent image pixels, the divider, M of current sample can be predicted and needs no individual code to represent it. The equation in the VLC coding of predicting the value of M is illustrated by the means of predicting the value of M.

M
_n=(M_n−1+D_n)/2 (Eq. 2)

For example: Diff.=11=1×8+3, in the VLC coding of this invention, the quotient, Q=1 and Remainder, R=3 are the only two parameters needed to be coded with the M=8 (N=3) implicitly predicted by an average of weighted factors times M of previous pixels. In speeding the calculation and saving hardware in implementation, an expedition of rounding the M to be the power of 2 is adopted and M equals to the closest value between 2^K-1and 2^K

FIG. 3 shows the table of an example of the variable length coding of the Diff. differential value 31 of adjacent pixels with a predicted divider of 8 (K=3, 2^K) Diff. of range of 0-7 has Quotient of “0” which needs no bit, since the K=3, the Remainder are all 3 bits wide. The “1” is market to separate the Quotient and Remainder. From 8-15, the Quotient value is 1 which uses bit=“0” to represent it, and the Remainder are all 3 bits wide binary code. Ex. 0=“1000”, 1=“1001”, 2=“1030”, . . . 8=“01000”, 9=“01001”, 15=“01111” . . . .

As shown in equation, M_n=(M_n−1+D_n)/2, the D_nof the closest previous sample has highest weight of ½, the next sample will have a factor of ¼, . . . etc. the farer the samples, the lower value the weighted factors and less influence to the present sample in predicting the divider, M.

In the edge of a new pattern or object in a picture, the differential value, Dn changes sharply and the equation, M_n=(M_n−1+D_n)/2 can not update the divider M or K which causes higher bit rate in coding the Quotient and Remainder. As shown in FIG. 4, for reducing the bit rate of pixel in the edge of new pattern or new object, when the D_nvalue 41 is larger than a threshold 42, for example, TH1, the divider is set to a predetermined value 43 like “128”, when the D_nvalue is larger than a threshold 44, for example, TH2, the divider is set to a predetermined value 45 like “64”, when the D_nvalue is larger than a threshold 46, for example, TH3, the divider is set to a predetermined value 47 like “32”, others value of D_n, will the divider uses prediction equation 48: M_n=(M_n−1+D_n)/2. If the current divider does not adopt this prediction equation, then, no matter the differential value of the next Dn (or said D_n+1), the previous divider, M_n−1is assigned to be the next divider (M_n−1=M_n+1)instead of the one of the three predetermined values, for example, the “128”, “64”, “32”.

For efficiency and cost consideration, in implementing this invention, an image is partitioned to be thousands, hundreds of thousands or even millions of “Segments” with each segment having pixel number ranging from 8 to 1024 pixels with the default of 32 or 64 pixels. FIG. 5 shows the flowchart of segment by segment image compression. The differential value of adjacent pixels or the adjusted differential value which might be for example, converted from negative to positive is accumulated to for a segment, each new segment of pixels 51 will be assigned an initial divider, the K 52. The difference, Dn, between the input pixel component, ex. Y or U/Cb or V/Cr and previous pixel will be calculated 53. The Dn will be input to update the divider 55, K for coding the next pixel. The D_nwill also be divided by 2^K, the divider, and the “Quotient” and the “Remainder” 54 can therefore be calculated.

For instance, if the previous K=3, a pixel component of Y equals to 46 and previous Y equals to 13, the Dn will become 23 (46−13=23) (binary code=10111) which is divided by 8 (K=3) results in Quotient of “2” (binary code=“10”) and Remainder of “7” (binary code=111). In realization of the coding, the Remainder is an easy work by just assigning the LSB 3 bits of the Dn to be the Remainder, while the Quotient of the MSB 2 bits needed to be converted from “binary code=“10” to be “2” which in coding will be two “0s”. With a marker bit of “1” separating the Quotient and Remainder, the resulted code of “001111” of the first two bits of Quotient and the last 3 bits Remainder.

FIG. 6 shows the implementation of encoding the pixel with three components. The difference between input pixel components, Y 61, U/Cb and V/Cr and their corresponding previous pixel which are temporarily saved in the registers 62 are calculated 63 and are coded by the variable length coding method 64. The encoded pixel components of Y, U and V will then be concatenated 65 pixel by pixel. The encoded pixel component of Quotient and Remainder of each component 66, 67, 68 will be put together and followed by the next pixel components 601, 602. The order of putting Quotient in front of Remain or Remainder in front of Quotient 603 makes a little difference in decoder with the later having a little faster since the Remainders are known since the K, divider are know before decoding procedure of each pixel begins and decoding the Remainder can be done in parallel with the decoding the Quotient.

In some applications, the input component of a pixel for each clock cycle might have one Y, the Luma and one C, Chroma (U or V) with one U in the first cycle, and V in another cycle. In this case, the difference and the VLC hardware can be shared and the encoded data stream will have one Y followed by one C in a cycle and another Y with another C in the next clock cycle. The performance of compressing an image is depending on the algorithm as well as the hardware cost. The more hardware in parallel, the higher throughput can be generated.

FIG. 7 shows three kinds of input waveforms, the first one is an 8-bit input pixel bus with each clock having one of the Y, U and V component 70, 71, 72, 73 with Y interlacing between the U and V. The 2^ndwaveform has 16 bits of pixel component with each clock cycle having one Y and one U 74 or one V 75. Both the first and second input waveforms are so call 4:2:2 format which means each Y has a U or a V interlaced. In a 4:4:4 format which has higher number of color component might input 3 pixel components 76 in the same clock cycle with a 24-bit pixel data input at the same clock cycle. Depending on the pixel number of each segment, the first output of the compressed pixels will have delay time of for instance 32 clock cycles with the rest of pixels with pipelining output.

In reconstructing the compressed pixels, in gaining high throughput and reconstructing three (or two in 4:2:2 format) pixel components, Y, U, V in a clock cycle. This invention decodes three pixel components in one clock cycle with an example decoding procedure as shown in FIG. 8 with Quotient in front of Remainder in pixel stream. A group of compressed pixels are loaded 81 to the decompression temporary buffer with mixed compressed code of Quotients and Remainder of pixel components, Y, U and V together in the data stream. The first step is to decode the Quotient of the 1^stpixel component 83, for example, the Y. Then, the Remainder can be extracted and decoded. Since the remainder is coded by binary code which number of bit is the value of K, the power of the divider. The decoded value of Quotient and Remainder plus the divider can recover the differential value 85 of the adjacent pixel and the differential value is used to calculate the next divider, the K 82. After the Quotient and Remainder of theist pixel component is decoded, the next step is to decode the 2^ndpixel component by calculating the Quotient 86 and Remainder 87 which procedure is the same like that in decompressing the 1^stpixel component. The calculated value of the 2^ndpixel component is applied to determine the next divider, K value. For some application with 3^rdpixel component in the same clock cycle, the same decompressing procedure like the 1^stand 2^ndpixel components is applied to decode the Quotient 89 and Remainder 801. And the calculated value of the 3^rdpixel component is used to decide the next divider 801, K value. Those dividers of all three (or two) pixel components are updated 82 each clock cycle after each differential value is calculated.

Since the dividers of a pixel are known before decoding procedure begins, the Remainders of a pixel, are known as well. Should Remainder is placed in front of Quotient, decoding the Remainder can be done in parallel with decoding the Quotient which gains a little speed by reducing the additional delay time of decoding the Remainders of a pixel components. FIG. 8 is just an example of decoding the compressed pixel with the Quotient in front of Remainder, actually, the procedure of decoding the Quotient 83 and Remainder 84 (and 86, 87 in decoding U and 89, 801 in decoding V) can be swapped and gains more speed.

For achieving higher performance with shorter delay time in decoding the differential values of pixel component, the bottleneck is the decoding of the quotient f each pixel component. In the worst case, the Quotient might be as long as 16 bits or even longer, should it is decoded by one look up table mapping it cost long cascaded delay time. The one look up table is shown like the following:

Code=” 1xxx”
Q_Value=”0”

Code=” 01xxx”
Q_Value=”1”

Code=” 001xxx”
Q_Value=”2”

....
....

Code=”0000000000001xxx”
Q_Value=”12”

Code=”00000000000001xxx”
Q_Value=”13”

Code=”000000000000001xxx”
Q_Value=”14”

Code=”0000000000000001xxx”
Q_Value=”15”

Code=”00000000000000001xxx”
Q_Value=”16”

The longest delay time of decoding the Quotient by above table will be a gate with series input of 16 bits of number. In this invention, one of the key of speeding up is to break the above one large table into four smaller table with 2 levels of decoding. The following is a brief conceptual description of the new 2 levels of decoding the Quotient value:

Code=”0000”

then, T4=”1”

Code=”000”

then, T3=”1”

Code=”00”

then, T2=”1”

Code=”0”

then, T1=”1”

If

Code=”0000”
and
T4=”1”,
then T8=”1”

Code=”000”
and
T4=”1”,
then T7=”1”

Code=”00”
and
T4=”1”,
then T6=”1”

Code=”0”
and
T4=”1”,
then T5=”1”

If

Code=”0000”
and
T8=”1”,
then T12=”1”

Code=”000”
and
T8=”1”,
then T11=”1”

Code=”00”
and
T8=”1”,
then T10=”1”

Code=”0”
and
T8=”1”,
then T9 =”1”

If

Code=”0000”
and
T12=”1”,
then T16=”1”

Code=”000”
and
T12=”1”,
then T15=”1”

Code=”00”
and
T12=”1”,
then T14=”1”

Code=”0”
and
T12=”1”,
then T13=”1”

In realizing this high efficiency variable length codec of the image compression, a group of compressed pixels 91 fills the register with a predetermined depth with the control of loader as shown in FIG. 9. The 1^stdecoder 93 for the 1^stpixel component, said Y, calculates the Quotient and Remainder and the results are sent to the calculator to reconstruct the pixel component value 94. Once the Quotient and Remain are determined, a shifter 99 aligns the rest of code to go through the 2^nddecoder 95 of decoding the Quotient and Remainder of the 2^ndpixel component with similar mechanism as the 1^stdecoder. The reconstructed Quotient and Remainder are used to calculate the next divider 96 of the 2^ndpixel component. Same to the circuit of decoding the 1^stpixel component, after decoding the 2^ndpixel component, the compressed code is shifted and is fed to the 3^rddecoder 97 for reconstruct the Quotient and Remainder, and afterward, the Quotient and Remainder are used to update the divider 98 for next pixel.

A pixel comprising Red, Green and Blue (R, G, B) color component is applicable to this invention. Replacing Y, U, V pixel component by R, G, B can simply apply the R, G, B component into this invention of the high performance image compression.

It will be apparent to those skills in the art that various modifications and variations can be made to the structure of the present invention without departing from the scope or the spirit of the invention. In the view of the foregoing, it is intended that the present invention cover modifications and variations of this invention provided they fall within the scope of the following claims and their equivalents.

Claims

1. A method of compressing the pixel components, comprising: calculating the differential values between adjacent pixels of a group of pixel within an image frame;calculating and coding the value of “Quotient” by dividing the differential value by a predicted divider;calculating and coding the value of “Remainder” by dividing the differential value by a predicted divider; andcalculating the value of the next divider to represent the divider of the next pixel components.
2. The method of claim 1, wherein the difference of adjacent samples is divided by a predetermined divider to obtain the Quotient and the Remainder which divider is the average of the last divider value and the previous differential value of adjacent pixel.
3. The method of claim 1, wherein the code of the Quotient and the marker bit using to separate the Quotient and Remainder are using different polarity of digital bit.
4. The method of claim 1, wherein the divider is represented by an integer number representing the power of 2.
5. The method of claim 1, wherein the first predetermined value is assigned to be the divider of a pixel if the differential value of adjacent pixel is greater than the first threshold, the second predetermined value is assigned to be the divider of a pixel if the differential value of adjacent pixel is greater than the second threshold, and the third predetermined value is assigned to be the divider of a pixel if the differential value of adjacent pixel is greater than the third threshold.
6. The method of claim 5, wherein the Quotient of a pixel component can be coded first, followed by the Remainder of the same pixel component, or the Remainder can be coded first, followed by the Quotient.
7. The method of claim 1, wherein the divider of the next pixel with the previous pixel having differential value larger than predetermined threshold is the value of the previous divider.
8. The method of claim 1, wherein the Remainder is a binary code with the same bit number of the divider which is an integer number of the power of 2.
9. The method of claim 1, wherein a pixel is comprised of three components of Y, U, V or Red, Green and Blue.
10. A method of efficiently decompressing the pixel components, comprising: fetching the compressed pixel components which are stored in an image buffer;applying the variable length decoding procedure including but not limited to the following steps:calculating the remainder of the first pixel component by referring to the current divider and the calculated first quotient;calculating the quotient of the first pixel component by referring the current divider and remainder; andapplying the decoded remainder and quotient to calculate the differential value of the adjacent pixel component and to determine the divider value for the next pixel component;re-aligning the compressed pixel component bit position and repeat the above variable length decoding procedure for recovering the second pixel component; andif there is a third pixel component within the compressed pixels, then, realigning the compressed pixel component bit position and repeating the variable length decoding procedure for recovering the third pixel component.
11. The method of claim 10, wherein the value of the predicted divider of the power of “2” is assigned to represent the number of bits of the remainder.
12. The method of claim 10, wherein Remainder and Quotient of the first pixel component are decoded in parallel, followed by the Remainder and Quotient of the second pixel component which are also decoded in parallel and if available, followed by the Remainder and Quotient of the third pixel component are decoded in parallel.
13. The method of claim 10, wherein the Quotient of the first pixel component is decoded firstly followed by the Remainder of the first pixel component, then, the Quotient of the second pixel component is decoded and followed by the Remainder of the second pixel component, and if the third component is available, the Quotient of the first pixel component is decoded, then followed by the Remainder of the third pixel component.
14. The method of claim 10, wherein the recovered quotient and the previous divider are used to calculate the divider of the next pixel.
15. The method of claim 10, wherein decoding the Quotient and the Remainder of the pixel components and updating the dividers of the next pixel components are completed in a fixed clock cycle time.
16. The method of claim 10, wherein at least two levels of decoding procedure is applied to decode the Quotient value of each pixel component.
17. An apparatus for efficiently decompressing the differential value of adjacent pixels, comprising: a loader accessing and storing the compressed pixel data into a temporary storage device and fetching the compressed data for decoding;at least a circuit for decompressing the first pixel component comprising the following circuits: the first VLD decoder used to decode the Quotient and another VLD decoder to decode the Remainder of the differential value of the first pixel component with these two decoders decoding the Quotient and Remainder in parallel;a calculator with input of the decoded Quotient and Remainder from the first VLD decoder to calculate the value of the differential value of the first pixel component and then to calculate the divider of the next pixel; anda shifter used to shift out the decoded Quotient, market and Remainder bits and feeding the shifted bits into the next circuit for decoding the next pixel component;the second decoding circuit with the input from the shifted bits of the compressed pixel component, if the second pixel component is available in the same clock cycle with the first pixel component, and going through the same decoding procedure as the first pixel component; andthe third decoding circuit with the input from the shifted bits of the compressed pixel component, if the third pixel component is available in the same clock cycle with the first pixel component, and going through the same decoding procedure as the first pixel component.
18. The method of claim 17, wherein the Remainder and Quotient are decoded in parallel, and the results are used to recover the differential value of adjacent pixels which is used to calculate the divider for the next pixel.
19. The method of claim 17, wherein the reconstructed first pixel component is used as a reference to decode the second pixel component, and the decoded second pixel component is used as a reference to decode the third pixel component.
20. The method of claim 17, wherein a quotient decoder with two levels of logic gates of decoding is applied to calculate the Quotient of the differential value.

Method and circuit of high performance variable length coding and decoding for image compression

Information

Publication Number

Date Filed

Date Published

Inventors

CPC

US Classifications

International Classifications

Abstract

Description

Claims