This application is based upon and claims the benefit of priority from Japanese Patent Application No. 2012-194599, filed on Sep. 4, 2012; the entire contents of which are incorporated herein by reference.
Embodiments described herein relate generally to an apparatus and a method for coding an image.
When natural image data is coded, by using JPEG for a static image, or by using MPEG for a moving image, effective data compression can be performed. On the other hand, as for CG image data such as a PC screen, in comparison with above method, by using a color palette-image coding method (each pixel is represented by a plurality of typical colors (palettes)), the data compression can be more effectively performed.
In the color palette-image coding method, following decisions are important, i.e., the target image is represented by which typical colors, and each pixel therein is represented by which typical color.
For example, by calculating a histogram of each color in an image block, conventional technique to determine a typical color of the image block based on the histogram is well known. In this technique, the histogram is calculated whenever one typical color is generated. Accordingly, great many calculation costs thereof are necessary. Furthermore, the histogram is calculated for all colors in the image block. Accordingly, in order to calculate the histogram, a memory having a very large capacity is necessary. As a result, for example, hardware-packaging of the memory (such as LSI) is difficult.
According to one embodiment, an image coding apparatus includes an acquisition unit, a comparison unit, a decision unit, and a coding unit. The acquisition unit is configured to acquire a pixel block having a predetermined size from image data to be coded. The comparison unit is configured to calculate a distance between a color of a target pixel in the pixel block and each of a plurality of typical colors, and to select a typical color of which the distance is a minimum distance from the plurality of typical colors. The decision unit is configured to assign a first index representing the selected typical color to the target pixel if the minimum distance is smaller than a first threshold, to assign a second index representing a new typical color to the target pixel if the minimum distance is larger than the first threshold, and to add the color of the target pixel as the new typical color when the second index is assigned. The coding unit is configured to code the selected typical color, the new typical color, the first index and the second index.
Various embodiments will be described hereinafter with reference to the accompanying drawings.
In an image coding apparatus 1 of the first embodiment, as to a target pixel to be coded, an index representing a typical color to represent the target pixel is assigned thereto, and the typical color and the index are coded. When the typical color (palette) is generated, a histogram of each color included in a pixel block is not calculated. Accordingly, a memory capacity necessary for processing can be reduced. Color information of the typical color and the target pixel may be represented by a color system such as RGB, YUV, YCbCr, or HSV. In the first embodiment, processing (scan) of the pixel block and the target pixel are performed in order of raster scan. However, this processing may be performed in order of another scan.
The acquisition unit 101 acquires a pixel block having a predetermined size from image data to be coded. Furthermore, the acquisition unit 101 calculates a distance between each typical color (previously determined) and a color of a target pixel in the pixel block, and selects a typical color of which distance is the shortest (minimum distance) from the each typical color.
If the minimum distance is smaller than a threshold A, the decision unit 103 assigns an index representing a typical color having the minimum distance, to the target pixel. If the minimum distance is larger than the threshold A, the decision unit 103 adds a pixel value of the target pixel as a new typical color, and assigns an index representing the new typical color to the target pixel. The storage unit 104 stores the typical color. Moreover, in the first embodiment, the index representing a typical color is numbered as a value in order of adding the typical color. The index of the first typical color is numbered as “0”, and numbered “1”, “2”, . . . in order. The coding unit 105 codes the typical color (i.e., palette) and the index assigned to the target pixel.
The acquisition unit 101 acquires a pixel block having a predetermined size from image data to be coded (S101).
The comparison unit 102 compares each typical color and a color of a target pixel in the pixel block (S102). The comparison unit 102 checks whether distances between all typical colors (stored in the storage unit 103) and the color of the target pixel are already calculated (S103). If the comparison unit 102 confirms that the distances are already calculated (Yes at S103), the comparison unit 102 sends a typical color having the minimum distance among the distances to the decision unit 103, and processing is forwarded to S104.
The decision unit 103 compares the minimum distance with a threshold A (S104). If the minimum distance is smaller than the threshold A (Yes at S104), processing is forwarded to S107. If the minimum distance is larger than the threshold A, processing is forwarded to S105. The decision unit 103 determines that a color of the target pixel is represented by the typical color having the minimum distance (S107).
The decision unit 103 checks whether the number of typical colors (stored in the storage unit 104) is larger than (or equal to) a predetermined upper limit (S105). If the number of typical colors is smaller than (or equal to) the upper limit (No at S105), processing is forwarded to S106. If the number of typical colors is larger than the upper limit (Yes at S105), processing is forwarded to S107 because a new typical color cannot be further added.
The decision unit 103 adds a color of the target pixel as a new typical color to the storage unit 104, and makes the storage unit 104 store the color (S106). The decision unit 103 checks whether an index of typical color is already assigned to all target pixels in the pixel block (S108). If the index is not assigned to at least one target pixel (No at S108), processing is returned to S102. If the index is already assigned to all target pixels (Yes at S108), processing is forwarded to S109.
The coding unit 105 codes typical colors (i.e., palette), indexes corresponding to the typical colors (stored in the storage unit 104), and indexes assigned to target pixels in the pixel block, and outputs coded data thereof (S109). The coding unit 105 checks whether coding is already executed to all pixel blocks in image data to be coded (S110). If the coding is not executed at least one pixel block yet (No at S110), processing is returned to S101. If the coding is already executed to all pixel blocks (Yes at S110), the coding is completed.
Hereinafter, detail processing of each step will be explained. A shape of the pixel block (acquired by the acquisition unit 101 at S101) may be any of a square having N×N pixels, a rectangle having M×N pixels, and a line having M×1 pixels (M, N: natural number).
The comparison unit 102 calculates a distance between a color of the target pixel and each typical color stored in the storage unit 104 (S102).
In order to calculate a color of the target pixel (in the pixel block) and a typical color, following equation is used.
In the equation (1), α is a weight coefficient of R, β is a weight coefficient of G, and γ is a weight coefficient of B. In the first embodiment, an example of “α=β=γ=1” is also explained. However, by considering Bayer arrangement, G may be preceded such as “α=γ=1, β=2”. Furthermore, in case of YUV color system, the distance may be calculated in the same way as the equation (1). In this case, for example, a weight coefficient of Y is α, a weight coefficient of U is β, and a weight coefficient of V is γ. In order not to occur the color distortion, UV may be preceded such as “α=1, β=γ=2”. Furthermore, in case of raising PSNR as objective image quality, Y may be preceded such as “α=2, β=γ=1”.
In above-mentioned example, the distance is calculated with eight bits accuracy. However, as a modification thereof, the distance may be calculated by dropping several low order bits of each color. As processing to drop the several low order bits, for example, arithmetic right shift is used. After the color information is compressed from eight bits to seven bits by dropping the least significant bit, the distance is calculated. In this case, even if positive and negative information of one bit (increased when the difference is calculated) is included, the processing can be performed with original eight bits. As a result, the number of bits of the distance cannot be increased.
In the first embodiment, a distance between a color of the target pixel (As to a left upper pixel in the pixel block of
Moreover, among typical colors stored in the storage unit 104, the comparison unit 102 counts the number of non-N/A (the number of typical colors newly added), and calculates a distance between each typical color (not N/A) and a color of the target pixel. In this case, unnecessary comparison with N/A is deleted. As a result, in case of hardware-packaging, the number of steps (S103) to decide whether comparison with each typical color is completed is reduced, and processing load thereof can be reduced. Moreover, in case of software-packaging, processing load to calculate the distance can be reduced. If the number of non-N/A is not counted, the distance with all typical colors (In above example, eight colors) is calculated. In the first embodiment, on assumption that the number of typical colors newly added is counted, following explanation will be done.
In case of software-packaging, when a color of the target pixel is replaced with a typical color, the decision unit 103 checks the distortion amount by using a threshold A. If the distortion amount is larger than the threshold A (No at S104), by setting the color of the target pixel to the typical color (S106), the distortion amount is controlled not to be over the threshold A. Various methods for determining the threshold A can be applied. For example, among a dynamic range of eight bits of each color (R, G, B), if a distortion amount of two low order bits is allowed, the distortion amount 3 (=maximum of two bits) occurs at each of RGB, and the threshold “A(=3+3+3)=9” is determined. Moreover, in the first embodiment, in case of this threshold A (=9), following explanation will be done.
If the threshold A is smaller, the distortion amount is fewer, and the image quality becomes higher. On the other hand, by adding many typical colors, an amount of coded data increases. Conversely, if the threshold A is larger, the distortion amount is larger, and the image quality falls. On the other hand, the amount of coded data decreases because the number of typical colors is few. Accordingly, the decision unit 103 may try the threshold A by repeat processing so that the amount of coded data is equal to a desired amount. As the repeat processing, by gradually changing the threshold A, the threshold A is calculated so that a relationship between an amount of the typical color and the distortion amount represents the desired amount.
Furthermore, the threshold A may be adjusted based on feedback control. As the feedback control, a correlative relationship between a data amount and the threshold A is estimated based on a relationship between a previous data amount and the threshold A, and the threshold A to be operated next is determined. For example, in case of the threshold A0, when the coding amount becomes too large, next processing is executed with the thresholds “A=A0×2”. In case of the threshold “A=A0×2”, when the coding amount becomes too small, by setting the threshold A to “(A0×2+A0)/2”, the threshold A is gradually converged. Furthermore, by executing a plurality of thresholds A in parallel, one threshold A with which the amount of coded data is nearly equal to a desired amount can be selected from the plurality of thresholds A. This modification will be explained afterwards.
Processing for a first target pixel in the pixel block is explained. All typical colors stored in the storage unit 104 are N/A. A distance between a color of the first target pixel and each typical color is a maximum (=255×3=765) larger than the threshold A(=9) (No at S104). Furthermore, as to an upper limit of the typical color “8”, the number of counts that new typical color is added is “0” (No at S105). Accordingly, processing is forwarded to S106. The decision unit 103 adds color information of RGB of the first target pixel to a typical color of palette 0 (index 0) in the storage unit 104 (S106).
Next, processing for the second target pixel in the pixel block is explained. The comparison unit 102 calculates a distance between a color of the second target pixel and each typical color stored in
This minimum distance (=128) is larger than the threshold A (=9) (No at 5104), and processing is forwarded to S105. The number of counts (=1) that new typical color is added is smaller than (or equal to) the upper limit (=1) (No at 5105), and processing is forwarded to S106. The decision unit 103 adds color information of RGB of the second target pixel to a typical color of palette 1 (index 1) in the storage unit 104 (S106).
After processing for all target pixels in the pixel block shown in
The coding unit 105 codes typical colors (i.e., palette), indexes corresponding to the typical colors, and indexes assigned to target pixels in the pixel block, and outputs coded data thereof (S109).
In above explanation, all information is coded with fixed length. For example, when indexes are coded, variable length coding method (Huffman coding, arithmetic coding) to vary the coding length based on the occurrence frequency may be used. Alternatively, Run Length Encoding to collectively code continuous indexes may be used.
In above explanation, coding processing of the pixel block shown in
In the first embodiment, as mentioned-above, the storage unit 104 stores palettes of which upper limit is “8”. Accordingly, after coding all target pixels, a distortion amount of the coded target pixel is smaller than an allowable distortion amount (threshold A). As a result, coded data is lossless compressed data without distortion. Hereinafter, in case that the upper limit of the number of palettes is “4”, operation for the pixel block shown in
A distance between a thirteenth target pixel (R=0, G=0, B=0) and each typical color has the minimum “256 (=128+64+64)” at palettes 1 and 3. This minimum distance is larger than the threshold A (No at S104). Accordingly, processing is forwarded to S105. The decision unit 103 checks whether the number of typical colors is larger than the upper limit (S105). Here, the number of typical colors newly added is already equal to the upper limit (=4) (Yes at S105). Accordingly, processing is forwarded to S107. As to the thirteenth target pixel, the distance has the minimum at a plurality of typical colors (palettes 1 and 3). In this case, for example, a smaller index among indexes corresponding to the typical colors may be selected. Furthermore, an index already used by adjacent target pixel in the pixel block may be selected. In the first embodiment, a smaller index is preferentially assigned. The decision unit 103 assigns an index 1 to the thirteenth target pixel. Hereinafter, in the same way, the index 1 is assigned to a fourteenth target pixel and a fifteenth target pixel. Last, as to a sixteenth target pixel, a distance has the minimum (=64) at a typical color of palette 0. Accordingly, the decision unit 103 assigns an index 0 to the sixteenth target pixel.
Continually, in case that the threshold A is a larger value, operation thereof will be explained. Hereinafter, on assumption that the threshold A is “96” and the upper limit is “4”, the operation is explained. Until a fourth target pixel among target pixels in the pixel block, the same processing as above-mentioned case of “A=9” is repeated.
Next, a distance between a ninth target pixel (R=64, G=128, B=64) and each typical color has the minimum “128 (=64+64)” at palettes 0 and 1. This minimum distance is larger than the threshold A (No at S104). Accordingly, processing is forwarded to S105. The decision unit 103 checks whether the number of typical colors stored in the storage unit 104 is larger than the upper limit (S105). Here, the number of typical colors stored in the storage unit 104 is two, and smaller than the upper limit (=4) (No at S105). Accordingly, the decision unit 103 adds a color of the eight target pixel to a palette 2 (stored in the storage unit 104) as a new typical color (S106).
In above-mentioned first embodiment, the comparison unit 102 calculates the distance a based on the equation (1). However, in a first modification, the comparison unit 102 calculates distances b˜d based on a plurality of equations. The decision unit 103 may decide whether to add a new typical color based on thresholds for respective equations.
For example, hereinafter, the case that following distances b˜d are newly added will be explained.
Distance b=|(R of the target pixel)−(R of the typical color)|
Distance c=|(G of the target pixel)−(G of the typical color)|
Distance d=|(B of the target pixel)−(B of the typical color)| (2)
For example, the comparison unit 102 calculates a distance a, and determines a typical color having the minimum distance a. If this minimum distance is larger than a threshold α, the decision unit 103 adds a color of the target pixel as a new typical color. If the minimum distance is smaller than (or equal to) the threshold a, the comparison unit 102 calculates respective distances b˜d between the typical color (having the minimum distance a) and color of the target pixel. The decision unit 103 compares respective distances b˜d with a threshold β. If all distances b˜d are smaller than (or equal to) the threshold β, the decision unit 103 assigns an index of the typical color (having the minimum distance a) to the target pixel. If any of distances b˜d is larger than the threshold β, the decision unit 103 adds a color of the target pixel as a new typical color.
As mentioned-above, in the first modification, typical colors are decided based on a plurality of distances a-d and two thresholds α and β. Accordingly, a large distortion of a specific color is suppressed. As a result, subjective image quality can be raised.
As a second modification of the first embodiment, when the number of typical colors newly added is equal to the upper limit of typical colors storable into the storage unit 104, the processing step different from S105 in
In the system of
As a fourth modification of the first embodiment, an initial state of typical colors stored in the storage unit 104 may be not N/A, and, for example, palette 0 may be a color of the first target pixel in the pixel block.
As a fifth modification of the first embodiment, by quantizing typical colors by the coding unit 105 at S109, image data can be coded with higher compression ratio.
According to the first embodiment and various modifications, color palette image coding can be realized without calculation of histogram for color of all pixels. Accordingly, the memory capacity necessary for this processing can be greatly reduced. Furthermore, processing is executed while the typical color is determined in order. Accordingly, image scan need not be repeated. As a result, this image coding is especially effective for hardware-packaging.
For example, by installing this image coding apparatus into a device or a system to process CG image data such as PC screen, the transmission velocity can be raised, and memory capacity such as DRAM to store CG image data can be reduced. However, usage of the first embodiment is not limited to above-mentioned usage.
In the image coding apparatus 2 of the second embodiment, a histogram for each typical color is calculated, and the threshold A is adjusted based on a distribution of the histogram. This feature is different from the image coding apparatus 1 of the first embodiment.
If all thresholds An (n=0˜7) are “96”, after the image coding apparatus 2 executes same processing (S301˜S309) as the image processing apparatus 1, as shown in
In the image coding apparatus 2, different from the first embodiment, the histogram calculation unit 206 calculates a histogram by accumulating the number of pixels (in the pixel block) assigned to each typical color (S206).
The decision unit 203 checks whether typical colors are already assigned to all target pixels in the image block (S309). Based on the histogram, the adjustment unit 207 checks whether to adjust the threshold An (S310). In order to represent a typical color (palette x) assigned to the largest number of target pixels by typical colors having higher accuracy, the adjustment unit 207 reduces (makes) the threshold x (be smaller) (S311).
For example, a value that pixels are averagely assigned to each typical color, i.e., ((the number of target pixels in the pixel block)/(the number of typical colors)), is a threshold C. If target pixels of which the number is larger than the threshold C are assigned to a specific typical color, adjustment of the threshold is decided to be necessary (No at S310). In
The adjustment unit 207 adjusts a value of the threshold Ax to be smaller than the present value. As to the adjustment amount, any method may be used. For example, the threshold may be simply reduced to a half of the present value. Furthermore, based on the number of typical colors able to be further added (In the second embodiment, 4 (=8−4) typical colors), the adjustment amount may be adjusted. In this case, for example, the threshold Ax is set to (Ax/(1+the number of typical colors able to be added)). As a result, if the number of typical colors able to be added is larger, the threshold is smaller than Ax. In the second embodiment, the threshold is simply reduced to a half of the present value (A0=96→48). However, according to the number of repeat processing (S310), the threshold A0 is automatically approximated to a suitable value. As a result, sufficient effect is obtained by simple adjustment method.
By setting thresholds to “A0=48 and An (n=1˜7)=96”, the comparison unit 202 calculates each typical color and the target pixel again (S302). In this case, as typical colors stored in the storage unit 204, new typical colors already added are used. In case of operating at the second time, until the fifteenth target pixel in the pixel block, the same assignment result of indexes as the first time is obtained. As to the sixteenth target pixel in the pixel block, a distance between a color thereof and each typical color has the minimum (64) at palette 0, and this minimum distance is larger than the threshold A0 (No at S304). The decision unit 203 adds the color of the sixteenth target pixel to palette 4 as a new typical color (S305, S306).
Moreover, in the image coding apparatus 1, as the threshold A, same value is commonly used for all typical colors. On the other hand, in the image coding apparatus 2, as the threshold A, different value can be set to each typical color. For example, a threshold A0 is set to palette 0, and another threshold A1 is set to palette 1.
As to two thresholds α and β explained in the first modification of the first embodiment, different threshold can be set to each typical color. For example, two thresholds α0 and β0 are set to palette 0, and two thresholds α1 and β1 are set to palette 1.
Furthermore, in the second modification that a plurality of image coding apparatuses 1 executes processing in parallel, each image coding apparatus 1 can use different threshold respectively. For example, a first image coding apparatus 1 uses a threshold A0 as palette 1 and a threshold A1 as palette 1. A second image coding apparatus 1 uses a threshold A′0 as palette 1 and a threshold A′1 as palette 1.
According to the second embodiments, the threshold An is automatically controlled while the memory capacity necessary to calculate the histogram is reduced. As a result, suitable typical colors can be generated.
In the disclosed embodiments, the processing can be performed by a computer program stored in a computer-readable medium.
In the embodiments, the computer readable medium may be, for example, a magnetic disk, a flexible disk, a hard disk, an optical disk (e.g., CD-ROM, CD-R, DVD), an optical magnetic disk (e.g., MD). However, any computer readable medium, which is configured to store a computer program for causing a computer to perform the processing described above, may be used.
Furthermore, based on an indication of the program installed from the memory device to the computer, OS (operating system) operating on the computer, or MW (middle ware software), such as database management software or network, may execute one part of each processing to realize the embodiments.
Furthermore, the memory device is not limited to a device independent from the computer. By downloading a program transmitted through a LAN or the Internet, a memory device in which the program is stored is included. Furthermore, the memory device is not limited to one. In the case that the processing of the embodiments is executed by a plurality of memory devices, a plurality of memory devices may be included in the memory device.
A computer may execute each processing stage of the embodiments according to the program stored in the memory device. The computer may be one apparatus such as a personal computer or a system in which a plurality of processing apparatuses are connected through a network. Furthermore, the computer is not limited to a personal computer. Those skilled in the art will appreciate that a computer includes a processing unit in an information processor, a microcomputer, and so on. In short, the equipment and the apparatus that can execute the functions in embodiments using the program are generally called the computer.
While certain embodiments have been described, these embodiments have been presented by way of examples only, and are not intended to limit the scope of the inventions. Indeed, the novel embodiments described herein may be embodied in a variety of other forms; furthermore, various omissions, substitutions and changes in the form of the embodiments described herein may be made without departing from the spirit of the inventions. The accompanying claims and their equivalents are intended to cover such forms or modifications as would fall within the scope and spirit of the inventions.
Number | Date | Country | Kind |
---|---|---|---|
2012-194599 | Sep 2012 | JP | national |