1. Field of the Invention
The present invention relates to data compression, and in particular relates to methods for encoding image data.
2. Description of the Related Art
The rapid growth of digital imaging applications, including desktop publishing, multimedia, teleconferencing, and high-definition television (HDTV) has increased the need for efficient and standardized image compression techniques. Without image compression, the transmission of images would require an unacceptable bandwidth for many applications. As a result, methods of compressing images have been the subject of numerous research publications. Image compression schemes convert an image consisting of a two-dimensional array of pixels into a sequence of bits which are transmitted over a communication link. Each pixel represents the intensity of the image at a particular location therein. The transmission link may be an ordinary telephone line.
Consider an image comprising a gray-scale representation of a photograph at a resolution of 1000×1000 lines. Each pixel typically consists of 8 bits which are used to encode 256 possible intensity levels at the corresponding point on the photograph. Hence, without compression, transmission of the photograph requires that 8 million bits be sent over the communication link. A typical telephone line is capable of transmitting about 9600 bits per second; hence the picture transmission would require more than 10 minutes. Transmission times of this magnitude are unacceptable.
As a result, image compression systems are needed to reduce transmission time. It is also apparent to those skilled in the art that image compression systems may also be advantageously employed in image storage systems to reduce the amount of memory needed to store one or more images.
Transform coding is a type of image compression. Usually, image compression by transform coding requires three steps. In the first step, the image is transformed to a new representation in which the correlation between adjacent pixels is reduced. This transformation is usually completely reversible, that is, no information is lost at this stage. The number of bits of data needed to represent the transformed image is at least as large as that needed to represent the original image. The purpose of this transformation is to provide an image representation which is more ideally suited to known compression methods.
In the second step, referred to as quantization, each pixel in the transformed image is replaced by a value which is represented in fewer bits, on average, than the original pixel value. In general, the original gray scale is replaced by a new scale which has coarser steps and hence can be represented in fewer bits. The new gray scale typically has levels in which the different steps are of different sizes. The new gray scale is calculated from the statistical distribution of the pixel values in the transformed image.
In the third step, the quantized values are encoded by entropy encoding. Huffman coding and arithmetic coding are two of the most common entropy encoding techniques used. However, not all of the values in high frequencies are necessarily encoded in the entropy encoding step, compression rate determines whether all of the values in high frequencies are encoded or not. Usually, image data is encoded block by block. Each block comprises the quantized values in different frequencies. A following block is encoded after a block is finished being encoded. For encoding, the amount of bits after encoding is limited according to a predetermined compression rate. Therefore, the values in the blocks which are earlier encoded may be encoded completely in the beginning. On the other hand, after the encoded blocks exceed a maximum amount of bits available for encoding, the values in the blocks which are latter encoded may not be encoded completely. In the blocks which are latter encoded, the values in high frequencies are surrendered and only the values in low frequencies may be encoded. Accordingly, it may cause distortion of the compressed image data.
A detailed description is given in the following embodiments with reference to the accompanying drawings.
An encoding method for encoding image data comprising a plurality of pixels is disclosed. Each pixel corresponds to a pixel value. First coefficients in different frequencies which are transformed and quantized from a first set of pixel values are provided. Second coefficients in different frequencies which are transformed and quantized from a second set of pixel values are provided. A bitstream by encoding the first coefficients with interleaving the second coefficients according to a predetermined order is generated. The predetermined order to encode the first coefficients with interleaving the second coefficients is from the first coefficients and the second coefficients in the lowest frequency to the first coefficients and the second coefficients in the highest frequency. a1, a2, . . . , and am are the first coefficients from the lowest frequency to the highest frequency, and b1, b2, . . . , and bm are the second coefficients from the lowest frequency to t the highest frequency, the order to encode the first coefficients with interleaving the second coefficients is a1, b1, a2, b2, a3, b3, . . . am, bm.
An encoding method to encode image data of a plurality of pixels is disclosed. The image data is divided into a plurality of blocks, each block comprising a set of the pixels. The blocks are transformed to generate a plurality of sets of coefficients corresponding to the blocks in different frequencies according to a predetermined transform. Each of a first set and a second set is one of sets of coefficients, a1, a2, . . . , and am are the coefficients from the lowest frequency to the highest frequency in the first set of coefficients, b1, b2, . . . , and bm are the coefficients from the lowest frequency to the highest frequency in the second set of coefficients The sets of coefficients are quantized. A bitstream is generated by encoding one of the sets of coefficients with interleaving another one of the sets of coefficients according to a predetermined order. The predetermined order to encode the one of the sets of coefficients with interleaving another one of the sets of coefficients is from the coefficients in the lowest frequency to the coefficients in the highest frequency. Thus, the order to encode the first set of coefficients with interleaving the second set of coefficients is a1, b1, a2, b2, a3, b3, . . . am, bm.
An encoding device to encode image data of a plurality of pixels is disclosed. The encoding device comprises a transformation operator, a quantizer and a encoding operator. The transformation operator divides the image data into a plurality of blocks, each block comprises a set of the pixels, and transforms the blocks to generate a plurality of sets of coefficients corresponding to the blocks in different frequencies according to a predetermined transform. Each of a first set and a second set is one of sets of coefficients, a1, a2, . . . , and am are the coefficients from the lowest frequency to the highest frequency in the first set of coefficients, b1, b2, . . . , and bm are the coefficients from the lowest frequency to the highest frequency in the second set of coefficients. The quantizer quantizes the sets of coefficients. The encoding operator generates a bitstream by encoding one of the sets of coefficients with interleaving another one of the sets of coefficients according to a predetermined order. The predetermined order to encode the one of the sets of coefficients with interleaving another one of the sets of coefficients is from the coefficients in the lowest frequency to the coefficients in the highest frequency. Thus, the order to encode the one of the first set of coefficients with interleaving the second set of coefficients is a1, b1, a2, b2, a3, b3, . . . am, bm.
The present invention can be more fully understood by reading the subsequent detailed description and examples with references made to the accompanying drawings, wherein:
The following description is of the best-contemplated mode of carrying out the invention. This description is made for the purpose of illustrating the general principles of the invention and should not be taken in a limiting sense. The scope of the invention is best determined by reference to the appended claims.
Image data IMG comprises a plurality of pixels. Each pixel corresponds to a pixel value where the pixels are arranged as a two dimensional array. Image data IMG is divided into a plurality of blocks. Each block comprises 16 pixel values corresponding to the pixels in image data IMG. After transforming and quantizing the pixel values in blocks by a predetermined transform method, such as discrete cosine transform (DCT) or discrete wavelet transform (DWT), the coefficients corresponding to the pixel values in blocks are respectively obtained. For example, pixel values in block U and block V, respectively correspond to the pixels in two adjacent blocks in image data IMG. Referring to
In another embodiment, pixel values in block U and block V, correspond to the pixels in the same block in image data IMG. For example, when the pixels are presented by RGB color model, the pixel values in block U are R values corresponding to the pixels, and the pixel values in block V are B values corresponding to the pixels.
First, transformation operator 103 divides image data IMG into a plurality of blocks, each block comprises a set of pixels in IMG. Each pixel corresponds to a pixel value. Transformation operator 103 performs a predetermined transformation, such as DCT, block by block, and generates a plurality of sets of coefficients corresponding to the pixel values of the sets of pixels. Then, quantizer 105 performs quantization on the sets of coefficients.
Next, encoding operator 107 generates the bitstream as the result data by encoding one of the sets of the quantized coefficients with interleaving another one of the sets of the quantized coefficients starting at the coefficient in the lowest frequency to the coefficient in the highest frequency as the aforementioned method. Finally, the bitstream is generated by block interleave.
An image data IMG0 comprises 16 pixels arranged as a 4×4 two dimensional array. IMG0 is divided into four blocks, block A, block B, block C and block D. Each block comprises 4 pixels arranged as a 2×2 two dimensional array. The coefficients transformed and quantized from the blocks in image data IMG are as shown in
As the above encoding method discloses, all the blocks in image data are equally encoded into the bitstream. Under the limitation of compression rate, the disclosed encoding method encodes the coefficients in low frequencies first for each block. And then the coefficients in high frequencies are encoded. Therefore, the distortion of the encoded result data may be decreased.
While the invention has been described by way of example and in terms of the preferred embodiments, it is to be understood that the invention is not limited to the disclosed embodiments. To the contrary, it is intended to cover various modifications and similar arrangements (as would be apparent to those skilled in the art). Therefore, the scope of the appended claims should be accorded the broadest interpretation so as to encompass all such modifications and similar arrangements.