Image encoding/decoding method, apparatus thereof and recording medium in which program therefor is recorded

Information

  • Patent Application
  • 20010051005
  • Publication Number
    20010051005
  • Date Filed
    December 08, 2000
    24 years ago
  • Date Published
    December 13, 2001
    23 years ago
Abstract
The invention relates to an image encoding/decoding method, apparatus thereof and a recording medium in which a program therefor is recorded, whereby the encoding/decoding can be obtained with high image quality at high speed. In an image encoding method which comprises producing a DC image composed of each block mean value by dividing an image data per B pixel into a block, making a part of said DC image a DC nest, and where the differential vector which is obtained by separating the DC value DCT from the pixel block to be encoded is over an allowable value Z, calculating one or more orthogonal basis (αk), to which the differential vector is approximated, by the adaptive orthogonal transform (AOT) using the DC nest, each of the lowest n (n=log2 B) bits of base extraction blocks which are down-sampled from the DC nest is set to 0. Further, base extraction vectors are produced by separating a block mean value ai from the base extraction blocks .
Description


BACKGROUND OF THE INVENTION

[0001] 1. Field of the Invention


[0002] The present invention relates to an image encoding/decoding method, an apparatus thereof, and a recording medium in which a program therefor is recorded, and more particularly, relates to an image encoding/decoding method, an apparatus thereof, and a recording medium in which a program therefor is recorded, according to Hybrid Vector Quantization (HVQ) system.


[0003] 2. Description of Related Art


[0004] According to JPEG (Joint Photographic Expert Group) system, 8 times 8 pixel blocks are converted to DC (direct current) value and each coefficient value of from base to 63 times frequency by two dimensional DCT (discrete cosine transform), and information amount is reduced by quantizing the coefficient value in a different quantization width within no reduction of image quality utilizing frequency components of natural images which are gathered in a low frequency range, and then Huffman encoding is carried out.


[0005] According to HVQ system, which is a kind of mean value separation type block encoding same as JPEG, adaptive orthogonal transform (AOT) which is an intermediate system between a vector quantization and orthogonal transform encoding is used as a compression principle. AOT is a system in which the minimum number of non-orthogonal basis is selected from nests of the basis corresponding to a code book of vector quantization and the objective blocks become close to the desired and allowable error “Z”. According to the HVQ system, decoding is quickly carried because a decoding operation can be done in the form of integer. Natural images and artificial images (animation images, CG images) can be compressed in high image quality, because there are not mosquito and block noise, which are particularly generated in JPEG, and false contour, which is particularly generated in GIF. The invention relates to a method for further improving the image quality and for carrying out the coding operation at a higher speed in the HVQ system.


[0006] The applicants of the invention have proposed an image encoding/decoding method in accordance with the HVQ system utilizing self-similarity of images in Japanese Patent Application No. 189239/98. The contents of such proposal will be explained as follows. In the disclosure, a sign <a> means vector “a” or block “a”, a sign ∥a∥ means norm of the vector “a”, and a sign <a·b> means inner product of vectors a and b. Further, vectors and blocks in drawings and [numbers] are represented by block letters.


[0007]
FIG. 1 is a block diagram showing a conventional image encoder. In FIG. 1, 11 is an original image memory for storing an original image data, 12 is a DC value production unit for seeking a block average (DC) value per each pixel block (4 times 4 pixel) of the original image data, 13 is a differential PCM encoding unit (DPCM) for carrying out a differential predict encoding per each DC value, 14 is inverse DPCM encoding unit for decoding each DC value from the differential PCM encoding, 15 is a DC image memory for storing a decoded DC image, 16 is a DC nest production unit for cutting off the DC nest of a desired size from a part of the DC image, and 17 is a DC nest memory for storing the DC nest.


[0008] Further, 18 is a subtractor for separating a corresponding decoding DC value “DCJ” from a target image block <RJ> to be encoded, 19 is a differential vector buffer for storing a differential vector <dJ> which is DC separated, 20 is an extracted block buffer for storing a base extraction block <Ui> of 4 times 4 pixels which is down-sampled from the DC nest, 21 is an equilibrator for seeking a block mean value ai of the base extraction block <Ui>, 22 is a subtractor for separating the block means value ai from the base extraction block <Ui>, 23 is an extracted vector buffer for storing the base extraction block <Ui> which is separated by the mean value, 24 is an adaptive orthogonal transform (AOT) processing unit for producing an orthogonal basis αk <uk′> (k=1˜m) to search the DC nest to make the differential vector <dj> closer to the allowable error Z, where a square norm ∥dj2 of the differential vector is over the allowable error Z, 25 is a coefficient transform unit for seeking an expanding square coefficient βk which is multiplied by a non-orthogonal basis vector <uk> (k=1˜m) per the produced orthogonal basis αk <uk′> (k=1˜m) to produce an equivalent non-orthogonal basis βk <uk> (k=1˜m) , and 26 is an encoding unit by Huffman coding, run length coding or fixed length coding system for the compression encoding of information such as DPCM encoding of the DC value or the non-orthogonal basis βk <uk>.


[0009] In the DC value production unit 12, the block mean value of 4 times 4 pixels is provided in which the first decimal place is rounded off or down. In the DPCM 13, where the DC value of row J and column T is shown by the DCJ, I, a predictive value DCJ, I′ of the DCJ,I is provided by the formula, DCJ, I′=(DCJ,I−1+DCJ−1, I)/2, and its predictive error (Δ DCJ, I=DCJ, I−DCJ, I′) is linear-quantized byaquantization coefficient Q(Z) and is output. The quantization coefficient Q(Z) corresponds to the allowable error Z and is variable within the range of 1 to 8 according to the allowable error Z.


[0010] In the DC nest production unit 16, the DC nest is prepared by copying the range of vertical 39×horizontal 71 from the DC image. It is preferred that the DC nest includes more alternating current components because it is used as a codebook. Therefore, it is prepared by copying such the range that the sum of absolute values of difference between the DC values adjacent to each other in a plurality of the extracted ranges become maximum.


[0011] In making down-samples of the base extraction block <Ui>, a vertex per one DC value in vertical and horizontal section is set to (px, py) ε [0, 63]×[0, 31] and a distance of its sub-samples is set to 4 kinds of (sx, sy) ε {(1, 1), (1, 2), (2, 1), (2, 2) }. Accordingly, the total numbers of the base extraction blocks <Ui> are N (=8192) and are referred by an index counter “i” from the AOT 24. Behavior of conventional adaptive orthogonal transform processing unit 24 will be explained below.


[0012]
FIG. 2 is a flow chart of conventional adaptive orthogonal transform processing and FIG. 3 is an image drawing of the processing. In FIG. 2, it is input in the processing that the square norm ∥dj2 of the differential vector is more than Z. In step S121, the square norm ∥dj2 of the differential vector is set in a resister E. A basis number counter is initialized to k=1. In step S122, much value ( e.g. 100,000) is set in a minimum value holding resister E′. In step S123, an index counter of the base extraction block <Ui> is initialized to i=0. By these steps, the initial address and distance of sub-samples in the DC nest are set to (px, py)=(0, 0) and (sx, sy)=(1, 1), respectively.


[0013] In step S124, the base extraction vector <ui> is produced by separating the block mean value ai from the base extraction blocks <Ui>. Since the operation or calculation is carried out under the accuracy of integer level, any value of first decimal place in the block mean value ai is rounded off or down. In step S125, the base extraction vector <ui> is subjected to orthogonal transform processing to be converted to the orthogonal basis vector <uk′>, if necessary (k>1).


[0014]
FIG. 3 (A) and (B) are image drawings of the orthogonal transform processing. In FIG. 3 (A), the first base extraction vector <u1> can be the first basis vector <u1′> as it is.


[0015] Then, the second base extraction vector <u2> is subjected to orthogonal transform processing to be converted to the second basis vector <u2′> in accordance with the following method. That is, a shadow of the second base extraction vector <u2> projected on the first basis vector <u1′> is represented by the formula (1).


[0016] [Numeral 1]
1&LeftDoubleBracketingBar;u2&RightDoubleBracketingBar;cosθ=u1·u2&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;u1·u2=&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;&LeftDoubleBracketingBar;u2&RightDoubleBracketingBar;cosθ(1)


[0017] Accordingly, the second orthogonal vector <u2′> is obtained by subtracting the vector of the projected shadow from the second base extraction vector <u2>.


[0018] [Numeral 2]
2u2=u2-u1·u2&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;u1&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;(2)


[0019] In FIG. 3(B) , the third base extraction vector <u3> is subjected to orthogonal transform processing to the first basis vector <u1′> and the second basis vector <u2′>.


[0020]
FIG. 3 is three-dimensionally drawn. The third base extraction vector <u3> is subjected to orthogonal transform processing to the first basis vector <u1′> to obtain an intermediate orthogonal vector <u3″>.


[0021] [Numeral 3]
3u3=u3-u1·u3&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;u1(3)


[0022] Further, the intermediate orthogonal vector <u3″> is subjected to orthogonal transform processing to the second basis vector <u2′> to obtain the third basis vector <u3′>.


[0023] [Numeral 4]
4u3=u3-u2·u3&LeftDoubleBracketingBar;u2&RightDoubleBracketingBar;u2=(u3-u1·u3&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;2u1)-(u3-u1·u3&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;2u1)·u2&LeftDoubleBracketingBar;u2&RightDoubleBracketingBar;2u2=u1-u1·u3&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;2u1-u2·u3&LeftDoubleBracketingBar;u2&RightDoubleBracketingBar;2u2(4)


[0024] Turning to FIG. 2, in step S126, a scalar coefficient αi is calculated using the orthogonal vector <ui′> so that a distance with the differential vector <dk> (at first <dj>) becomes minimum.


[0025]
FIG. 3(C) is an image drawing of the orthogonal transform processing. In FIG. 3(C), where a differential vector represented by <dk> is subjected to approximation, a square norm thereof (ei=∥<dk>−αi<ui′>∥2 ) is minimum when the product of the orthogonal vector <ui′> and the scalar coefficient αi is diagonal with the differential vector {<dk>−αi <ui′>} as shown in FIG. 3(C) (inner product=0). Accordingly, the scalar coefficient αi is obtained by the formula (5).


[0026] [Numeral 5]
5αiui·(dk–αiu1)=0αiui·dk-αi2ui·ui=0(5-1)αi=dk·ui&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2(5-2)


[0027] It is shown in the drawing that the differential vector <dk> (k=0) is subjected to approximation to other first base extraction vector <uj′>. The first base extraction vector <uj′> is shown by the image drawing because it can take optional directions.


[0028] Turning to FIG. 2, in step S127, a square norm (ei) of error vector is obtained by the formula (6) after the differential vector <dk>(k=0) is subjected to approximation to the base extraction vector αi<uj′>.


[0029] [Numeral 6]
6ei=&LeftDoubleBracketingBar;dk-αiui&RightDoubleBracketingBar;2=&LeftDoubleBracketingBar;dk&RightDoubleBracketingBar;2-2αidk·ui+αi2&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2=&LeftDoubleBracketingBar;dk&RightDoubleBracketingBar;2-2dk·ui2&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2+dk·ui2&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;4&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2=&LeftDoubleBracketingBar;dk&RightDoubleBracketingBar;2-dk·ui2&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2=E-dk·ui2&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2(4)


[0030] In step S128 of FIG. 2, it is judged whether ei is less than E′ or not. If ei is less than E′, content of E′ is renewal in step S129 and the information regarding αi, <ui′>, <ui>, etc. at the time is held in an arrangement [αk], [uk′], [uk], etc. If ei is not less than E′, the processing in step S129 is skipped.


[0031] In step S130, one (1) is added to the counter i, and in step S131, it is judged whether i is not less than N (=8192) or not. If i is less than N. turning to step 124 and the same processing is carried out with respect to next base extraction vector <ui>.


[0032] The processing is repeated and when it is judged in step S131 that i is not less than N, all base extraction vectors <ui> have been completely tried. At the time, the register E′ holds the minimum square norm ei.


[0033] It is judged in step S132 whether E′ is not more than Z or not. If E′ is more than Z, it is treated as E=E′ in step S133. That is, the square norm of the differential vector is renewal. In step S134, one (1) is added to the counter k, turning to step S122. If E′ is not more than Z, this processing is skipped. Thus, the orthogonal basis αk<uk′>(k=1˜m) to approximate the difference of the first differential vector <dj> to the allowable error Z is obtained.


[0034] However, the block mean value ai of the base extraction block <Ui> has been rounded off or down in the conventional methods and therefore, improvement of image quality is limited. Why the conventional methods are inconvenient will be explained according to FIG. 4.


[0035]
FIG. 4 is an image drawing of mean value separation processing. A relationship of base extraction block <Ui> (vertical axis) with the pixel value of certain row (horizontal axis) is shown in FIG. 4(a). An actual pixel value is a block mean value of 16 pixels, but the block mean value of 4 pixels will be used to simplify the explanation herein. In FIG. 14(a), each pixel value is 5, 2, 4, and 3 and its mean value ai is 3.5. when the first decimal place is round down, the block mean value ai of the base extraction block <ui> is 0.5 as shown in FIG. 4(b). In FIG. 4(c), if the basis vector βk <uk> is added to the DC value DCJ of the decoded block, the DC component (ai=0.5) is overlapped on the target block <Rj>. In case that the number of basis is plural, the DC value is overlapped on the DCJ by various values in the range of 0<ai<1, and as a result, certain noise is overlapped per each block in the decoded image, whereby image quality is not improved. This disadvantage also occurs in case that the first decimal place is rounded off or up.


[0036] According to the conventional AOT processing, much operations and much time are required, because all of the base extraction vectors <ui> must be subjected to orthogonal processing to the preceding base vectors <uk′>.



SUMMARY OF THE INVENTION

[0037] It is therefore an object of the invention to provide an image encoding/decoding method, which provides high image quality at high speed, an apparatus thereof and a recording medium in which such program therefor is recorded.


[0038] The above object of the invention can be solved by the construction, for example, as shown in FIG. 5. That is, the image encoding method of the invention (1) comprises producing a DC image composed of each block mean value by dividing an image data per B pixel into a block, making a part of said DC image a DC nest, and where the differential vector <dj> which is obtained by separating the DC value DCJ from the pixel block to be encoded is over an allowable value Z, calculating one or more orthogonal basis (e.g. αk<vk>) , to which the differential vector <dj> is approximated, by the adaptive orthogonal transform (AOT) using the DC nest, wherein the lowest n (n=log2 B) bits of the DC pixel in each sample being set to 0, where the base extraction block is down-sampled from the DC nest and the block mean value ai of it is calculated using the samples.


[0039] Accordingly, any fraction less than 1 does not occurs in the block mean value ai and the block mean value ai with integer level precision is obtained at high speed.


[0040] In a preferred embodiment of the invention (1) that is the invention (2), the lowest n bits of the DC pixel is set to 0 or is masked, where the DC nest is produced from the DC image.


[0041] Accordingly, the DC nest, of which the lowest n bits of the DC pixel is set to 0 or is masked, is efficiently obtained by one processing.


[0042] In a preferred embodiment of the invention (1) or (2) that is the invention (3), a base extraction vector <ui> is produced to which the differential vector <dj> approximates by separating the block mean value ai from the base extraction block <Ui> in which the lowest n bits of the DC pixel is set to 0.


[0043] According to the invention (3), the sum (the block mean value) of all elements in such base extraction vectors <ui> is always 0 and the DC component is completely separated. Therefore, even if the base vectors <uk> are piled up on each other in the decoding side, unnecessary DC component (noise) does not cause. The image quality in the HVQ system is more improved by the invention (3).


[0044] In a preferred embodiment of the invention (3) that is the invention (4), optional elements (e.g. u16) of base extraction vectors <ui> are replaced by linear bond of the remainder elements and the inner product of the base extraction vectors <ui> and the other optional vectors <w> are calculated by the formula.


<w·ui>=(w1−w16) ui+(w2−w16) u2+. . . +(w15−w16) u15


[0045] In the invention (4), the sum of all elements in the base extraction vectors <ui> is always 0 and hence, the optional elements (e.g. u16) are represented by the linear bond of the remainder elements. Accordingly, the inner product calculation <w·ui> with the other optional vectors (w) can be expanded to the product-sum calculation as shown by the above formula, whereby a single round of such complicated calculation can be omitted. Since much inner product calculation of the vectors is conducted in the image encoding method according to the HVQ system, such single round omission of the calculation contributes to high speed encoding processing.


[0046] In a preferred embodiment of the invention (3) or (4) that is the invention (5), a first basis is searched so that hi may be maximum in the following formula,




h


i


=[<d·u


i
]2/∥ui2



[0047] wherein <d> is the differential vectors and <ui> is the base extraction vectors.


[0048] According to the invention (5), such condition that square norm ∥<d>−<αi ui>∥2 of the difference with the differential vectors <d> is minimum can be searched by the above simple calculation. Hence, the AOT processing can be carried out at high speed.


[0049] In the invention (6), a second basis is searched so that hi may be maximum in the following formula,




h


i


={<d·u


i
>−(<d·ui><u1·ui>/∥u12)2/{∥ui2−(<u1·ui>)/∥u12}



[0050] wherein <d> is the differential vectors, <u1> is the base extraction vectors corresponding to the first basis, and <ui> is the base extraction vectors for searching the second basis in the invention (3) or (4).


[0051] According to the invention (6), the AOT processing can be done more efficiently and at higher speed in addition to the advantages of the invention (5), because the calculation result which has been obtained in the first basis search can be used with respect to <d·u1> and ∥u1∥ of the numerator, and ∥ui2 and ∥u1∥ of the denominator.


[0052] In a preferred embodiment of the invention (3) or (4) that is the invention (7), a third basis is searched so that hi may be maximum in the following formula,




h


i
=(<d·ui>−<d·v1><v1·ui>−<d·v2><v2·u1>)2/{∥ui2−<v1·ui>2−<v2·ui>2}



[0053] wherein <d> is the differential vectors, <v1> is the first orthonormal base vectors, <v2> is the second orthonormal base vectors, and <ui> is the base extraction vectors for searching the third basis.


[0054] According to the invention (7), the AOT processing can be done more efficiently and at higher speed in addition to the advantages of the invention (5) and (6), because the calculation result which has been obtained in the first and second basis search can be used with respect to (<d·ui>−<d·v1><v1·ui>) of the numerator, and (∥ui2−<v1·ui>2) of the denominator.


[0055] In a preferred embodiment of the invention (6) or (7) that is the invention (8), the base extraction vectors <ui> which match with search conditions are subjected to orthogonal transform with one or more preceding orthonormal basis.


[0056] That is, one orthonormal processing per each base extraction vector <ui>, which is adopted as the basis after the search termination at each stage is carried out, whereby the AOT processing can be done more efficiently and at higher speed.


[0057] In the image encoding method of the invention (9), the norm of each scalar expansion coefficient β1˜βm is rearranged in decreasing order, a difference (including 0) between norms adjacent to each other is calculated, and Huffman coding is applied to the obtained difference. In the method, the basis is represented by βk<uk>, wherein k=1˜m.


[0058] In general, the norm of each scalar expansion coefficient β1˜βm can take various value. When the value is rearranged in ascending or descending order and the difference (including 0) between norms adjacent to each other is calculated, each difference is often similar to or same as each other. The more encoding compression is possible by applying the Huffman coding to the difference value.


[0059] In the image encoding method of the invention (10), image data <Rj> of coding objective block is encoded instead of the coding of the basis, where the basis is more than certain number. Accordingly, the decoded image quality is improved. In practical, it does not affect the coding compression ratio because such situation is little.


[0060] The above object of the invention can be resolved by the construction, for example, as shown in FIG. 14. That is, the image decoding method of the invention (11) comprises reproducing a DC image corresponding to each block mean value per B pixel from encoding data with respect to the HVQ system, making a part of said DC image a DC nest, reproducing image data <Rj> of target block by synthesizing, to DC value DCJ of target block, one or more basis vectors βk<uk> which is selected from DC nests based on the encoding data, and the lowest n (n=log2 B) bits of the DC pixel in each sample is set to 0, where the selected block is down-sampled from the DC nest and the block mean value of it is calculated using the samples.


[0061] Accordingly, any fraction less than 1 does not occurs in the block mean value and the block mean value with integer level precision is obtained at high speed.


[0062] According to the image decoding method in the invention (12), where the decoded basis is information with respect to βk<uk> (k=1˜m), the lowest n (n=log2 B) bits of the DC pixel per each selected block (Uk) to be read out from the DC nest are set to 0, product-sum calculation of basis βk<uk> (k=1˜m) is carried out, and the calculated result is divided by the number B of block pixels.


[0063] In the invention (12), the lowest n bits of each selected block (Uk) are set to 0, and hence, even if these are accumulated and added, the addition result becomes multiple of integer of the block size B (e.g. 16). An expansion coefficient βk is an integer precision. Accordingly, if the cumulative addition result is divided by the number B of the block pixels, block mean value Aj is efficiently obtained by one processing. Therefore, such calculation that the basis vectors βk<uk> (k=1˜m) are overlapped can be effectively carried out.


[0064] In a preferred embodiment of the invention (11) or (12) that is the invention (13), the lowest n bits of each DC pixel is set to 0, where DC nests are produced from the DC image, whereby processing is effectively carried out.


[0065] The image encoding apparatus of the invention (14) comprises producing a DC image composed of each block mean value by dividing an image data per B pixel into a block, making a part of said DC image a DC nest, and where a differential vector <dj> which is obtained by separating the DC value DCJ from the pixel block to be encoded is over an allowable value Z, calculating one or more orthogonal basis ( e.g. αk<vk>) , to which the differential vector <dj> is approximated, by the adaptive orthogonal transform (AOT) using the DC nest, and providing a memory 17 to store the DC nest in which the lowest n (n=log2 B) bits of the DC pixel are set to 0.


[0066] The image decoding apparatus of the invention (15) comprises reproducing a DC image corresponding to each block mean value per B pixel from encoding data with respect to the HVQ system, making a part of said DC image a DC nest, reproducing image data<Rj> of target block by synthesizing, to the DC value DC, of target block, one or more basis vectors βk<uk> which is selected from DC nests based on the encoding data, and providing a memory 49 to store the DC nest in which the lowest n (n=log2 B) bits of the DC pixel are set to 0.


[0067] The recording medium of the invention (16) comprises a computer readable recording medium storing a program to make a computer to implement the processing described in one of the invention (1) to (13).







BRIEF DESCRIPTION OF THE DRAWINGS

[0068]
FIG. 1 is a block diagram showing a conventional image encoder;


[0069]
FIG. 2 is a flow chart of a conventional adaptive orthogonal transform processing;


[0070]
FIG. 3 is an image drawing of the conventional adaptive orthogonal transform processing;


[0071]
FIG. 4 is an image drawing of a conventional mean value separation processing;


[0072]
FIG. 5 is an explanatory drawing of the principle of the invention;


[0073]
FIG. 6 is a block diagram showing an image encoder, which is an embodiment of the invention;


[0074]
FIG. 7 is a flow chart showing a main image encoding processing which is an embodiment of the invention;


[0075]
FIG. 8 is a flow chart (1) showing an adaptive orthogonal transform processing which is an embodiment of the invention;


[0076]
FIG. 9 is a flow chart (2) showing an adaptive orthogonal transform processing which is an embodiment of the invention;


[0077]
FIG. 10 is a flow chart (3) showing an adaptive orthogonal transform processing which is an embodiment of the invention;


[0078]
FIG. 11 is an explanatory drawing (1) of a DC nest, which is an embodiment of the invention;


[0079]
FIG. 12 is an explanatory drawing (2) of a DC nest, which is an embodiment of the invention;


[0080]
FIG. 13 is an image drawing of a compression encoding processing of the expansion coefficient;


[0081]
FIG. 14 is ablock diagram showing an image decoder, which is an embodiment of the invention;


[0082]
FIG. 15 is a flow chart showing an image decoding processing which is an embodiment of the invention; and


[0083]
FIG. 16 is an image drawing of an alternating current component prediction, which is an embodiment of the invention.







DETAILED DESCRIPTION OF THE INVENTION

[0084] Referring to the drawings, suitable embodiments of the invention will be explained in detail. The same sign indicates same or corresponding part through whole drawings.


[0085] In FIG. 6 which is a block diagram showing an embodiment of image encoding apparatus in the invention, 31 is a DC nest production unit which produces the DC nest from a decoding DC image according to the invention, 17 is a DC nest memory which stores the produced DC nest, 32 is an adaptive orthogonal transform (AOT) processing unit which effectively implements AOT processing at high speed, 33 is a coefficient transform unit, and 34 is an encoding unit which can make an expanding coefficient βk higher compression. The other construction is same as in FIG. 1. The feature of each unit will be apparent from the following explanation of behavior.


[0086] In FIG. 7, which is a flow chart showing a main image encoding processing, which is an embodiment of the invention, an original image data is input in a original image memory 11 at step S1. For example, an objective image of R.G.B. is converted to an image of Y.U.V., which is input in the memory 11. Y is a brightness data, U and V are color difference data. U and V are down-sampled using a brightness mean of 2 pixels in a horizontal direction. As an example, the brightness data Y is composed of vertical 960×horizontal 1280 pixels and, for example, 8 bits are allotted to each pixel. The processing of the brightness data Y will be mainly explained in the following but U and V are similarly processed.


[0087] A block mean (DC) value of every 4×4 pixels with respect to all image data is calculated at step S2. The first decimal place is round off at the time. All DC values are encoded by conventional two-dimensional DPCM method, etc. and are output at step S3. At step S4, all DPCM outputs are decoded by IDPCM method to reproduce the DC images, which are stored in a DC image memory 15. This is done to equalize AOT processing conditions in the encoding side with that in the decoding side. At step S5, the DC nest is reproduced from the DC images in the DC nest production unit 31, which is stored in the DC nest memory 17. A range from which the DC nest is cut can be selected by the same manner as conventional one.


[0088] In FIG. 11(a), the lowest 4 bits of each DC pixel DCJ cut from the DC image memory 15 are masked (are set to 0) which are stored in a nest pixel NJ of the DC nest memory 17. The lowest 4 bits are in relation with 24=B (B=block size 16) or 4=log2 B. As such result that the lowest 4 bits are masked, the sum of base extraction block <Ui> is always multiple of integer and a block mean value ai which is {fraction (1/16)} of the sum is always an integer. Accordingly, the base extraction vectors <ui> which are obtained by separating the block mean value ai from the base extraction block <Ui> are always 0.


[0089] In FIG. 11(a) and (b), graphs of concrete values are shown as example, in which mean of 4 pixels is used for simplifying the explanation. In FIG. 11(c), even if a plurality of basis vectors βk<uk> are cumulatively added to the DC pixel DCJ of a decoding block <Rj>, a noise is not overlapped as usual because the block mean value of each basis vectors βk <uk> is always 0, whereby image quality can be much improved.


[0090] The examples of the value in FIG. 11 are shown in FIG. 12(a). The sum of the DC pixels A to D is 251 and its mean is 251/4=62.75 (non-integer). The lowest 4 bits are masked when the DC pixels A to D are transmitted to the nest pixels A to D, whereby the sum of the nest pixels A to D is 224 and its mean value AV is 224/4=56 (integer). Each element a to d of the base extraction vectors <ui> which is obtained by separating the mean value 56 of the nest pixels from the nest pixels becomes 24, −24, 8 and −8, respectively. The sum of these elements is 0 (complete mean value separation).


[0091] The same value as in FIG. 12(a) is shown in FIG. 12(b) except that the DC pixels A to D are copied into the nest pixels A to D and the lowest 4 bits are masked from the sum of the nest pixels A to D. According to the method, the sum is the multiple of 16 and the block mean value is 60 (integer). However, according to the method, each element a to d of the base extraction vectors <ui> which is obtained by separating the mean value 60 of the nest pixels from the nest pixels A to D becomes 33, −25, 13 and −10, respectively. The sum of these elements is not 0 (complete mean value separation).


[0092] As shown in FIG. 12(b), after a part of the DC images is copied into the nest pixels A to D, the lowest 4 bits may be masked from each pixel when the base extraction block <Ui> is down-sampled from the DC nest.


[0093] Turning to FIG. 7, each index counter j, J to the original image memory 11 and the DC image memory 15 is initialized to 0 at step S6, wherein j indicates an index counter of the target block <Rj> which is encoding object, and J indicates an index counter of the DC pixel. At step S7, the differential vector <dj> is obtained by separating a corresponding decoding DC value DCf from the target block <Rj>. At step S8, it is judged whether the square norm ∥dj2 of the differential vector is more than the allowable error Z or not. In case that ∥djμ2 is not more than Z, 0 is output as the number of the basis at step S17. In this case, the target block <Rj> is decoded by alternating current component prediction method as described hereinafter. In case that ∥dj2 is more than Z, the adaptive orthogonal transform (AOT) processing method as described hereinafter is carried out at step S9.


[0094] At step S10, it is judged whether the number of the basis k produced by the adaptive orthogonal transform is more than 4 or not. According to the actual measurement, statistic result of k=1 to 3 has been obtained in most cases. Therefore, in case that k is more than 4, “5” is code-output as the number of the basis at step S18 and each pixel value of the target block <Rj> is output. In case that k is not more than 4, conversion to expanding coefficient βk is carried out as described hereinafter at step S11. At step S12, the basis number m, the expanding coefficient βk and the index information i of non-orthogonal basis vector <ui> each is code-output at step S12.


[0095] At step S13, “1” is added to the counters j and J, respectively. In the step, an addition of 1 to the counter j means renewal of onepixel block. It is judged at step S14 whether j is not less than M (the number of all image blocks) or not. In case that j is less than M, turning to step S7 and same encoding processing is repeated with respect to a next target block <Rj>, followed by same steps. It is judged at step S14 that j is not less than M, then encoding processing, for example, by Huffman method is carried out at step S15 as described hereinafter. Thus, encoding processing of one pixel is terminated.


[0096] In FIGS. 8 to 10, each of which is a flow chart (1), (2) or (3) of the adaptive orthogonal transform processing, it is shown that the minimum necessary number of orthogonal basis αk<vk> (k=1˜m) is effectively obtained at high speed. In the following explanation, the initial differential vector <dj> obtained at the step S7 is represented by <d> and the differential vector to be renewed later is represented by <dk> (k=1˜m).


[0097] A search processing of first basis is shown in FIG. 8. Before explanation of the processing, an idea on calculation for high speed processing will be explained. That is, the first basis is usually obtained as base extraction vector <uj>, which makes a square norm ei of the difference between the first basis and a differential vector <d> minimum, and is represented by the formula (7).


[0098] [Numeral 7]
7ei=&LeftDoubleBracketingBar;d-d·ui&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2ui&RightDoubleBracketingBar;2=&LeftDoubleBracketingBar;d&RightDoubleBracketingBar;2-2d·ui2&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2+d·ui2&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;4&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2=&LeftDoubleBracketingBar;d&RightDoubleBracketingBar;2-d·ui2&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2ui=ui(7)


[0099] The first item ∥d∥2 of the right side in the formula (7) which is more than 0 is independent of an extracted basis and hence, <ui> that makes the second item of the right side in the formula (7) maximum can be the first basis. The second item hi of the right side is represented by the formula (8).


[0100] [Numeral 8]
8hi=d·ui2&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2(8)


[0101] A processing for searching and judging the first basis αk <vk> which makes hi maximum is explained. At step S22, the fifteen dimensional vector <d′> is obtained by subtracting the sixteenth component of <d> from the remaining components as a preprocessing to inner product calculation <d·ui> as described hereinafter. At step S22, the inner product <d′·u1> of hi numerator is obtained with respect to i=0˜(N−1) and is stored in an arrangement [Pi] {i=0˜(N−1)}.


[0102] More concretely, <ui> is sixteen dimensional vector, but its sixteenth component u16 can be represented by linear bond of the remaining fifteen components because the block mean value (sum of all elements) is 0.


[0103] [Numeral 9]




u


1


=[u


1


, u


2


, u


3


, . . . , u


16


]u


1


u


2


. . . +u


16
=0 u16=−(u1+u2+. . . +u15)  (9)



[0104] Accordingly, the inner product <d·ui> of hi numerator can be calculated from <d′·ui> equivalent thereto, whereby one product/sum calculation can be omitted which corresponds to 8192 calculations with respect to total of i.


[0105] [Numeral 10]
9d·ui=d1u1+d2u2++d15u15-d16(u1+u2++u15)=(d1-d16)u1+(d2+d16)u2++(d15-d16)u15=d·ui(10-1)
d′=[(d1−d16), (d2−d16), . . . , (d15−d16)]  (10-2)


[0106] At step S23, the square norm ∥ui2 of hi denominator is obtained with respect to i=0˜(N−1) and is stored in an arrangement [Li] {i=0˜(N−1)}.


[0107] [Numeral 11]


ui2=u12+u22+. . . +u162  (11)


[0108] The arrangement [Li] is repeatedly used. At step S24, a register E=0 storing a maximum value of hi, an index counter i=0 of the base extraction vector <ui> and a basis number counter k=1 are initialized, respectively.


[0109] At step S25, a value for hi=Pi2/Li is calculated. Step S26, it is judged whether hi is more than E or not. In case that hi is more than E, E is renewed by hi at step S27 and i is held in an arrangement [Ik] (k=1). In case that hi is not more than E, the processing at Step S27 is skipped.


[0110] At step S28, 1 is added to i and at step S29, it is judged whether i is not less than N (total extraction numbers) or not. In case that i is less than N, turning to step S25 and maximum value search processing is carried out with respect to next hi similar to above.


[0111] The same processing is repeated and the search of all nest blocks is terminated when i is not less than N. At the time, the index value i of the first basis vector <ui> which makes hi maximum is held in an arrangement [Ik].


[0112] At step S30, the first basis vector <ui> is normalized to be a normalized basis vector <vi> which is stored in an arrangement [Vk] (k=1). And, a scalar coefficient α1 (projection shadow of <d> on <vi> is calculated and is stored in an arrangement [Ak] (k=1).


[0113] At step S31, the differential vector <d> is approximated to the first basis and is renewed by the differential vector <d1>=<d>−α1<v1>. At step S32, a square norm e=∥u12 of new differential vector is calculated and at step S33, it is judged whether e is not more than Z or not. In case that e is not more than Z, the AOT processing is terminated at the step. In case that e is more than Z, the search processing of the second basis is carried out.


[0114] A search processing of the second basis is shown in FIG. 9. Before explanation of the processing, an idea on efficient calculation will be explained. That is, the second basis is usually obtained as orthogonal vector <uj′> which makes a square norm ei of the difference between the second basis and a differential vector <d1> minimum, and is represented by the formula (12).


[0115] [Numeral 13]
10ei=&LeftDoubleBracketingBar;d1-d1·ui&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2ui&RightDoubleBracketingBar;2=&LeftDoubleBracketingBar;d1&RightDoubleBracketingBar;2-2d1·ui2&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2+di·ui2&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;4&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2=&LeftDoubleBracketingBar;d1&RightDoubleBracketingBar;2-d1·ui2&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2(12)


[0116] The orthogonal vector <ui′> is obtained by orthogonal transform of the second base extraction vector <ui> to the first normalized basis vector <v1>.


[0117] [Numeral 13]
11ui=ui-ui·v1&LeftDoubleBracketingBar;v1&RightDoubleBracketingBar;2vi=ui-ui·v1v1(13)


[0118] The first item ∥d∥2 of the right side in the formula (12) which is more than 0 is independent of an extracted basis and hence, <ui′> that makes the second item of the right side in the formula (12) maximum can be the second basis. The second item hi of the right side is represented by the formula (14).


[0119] [Numeral 14]
12hi=d1·ui2&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2(14)


[0120] According to the formula (14), hi can be calculated but the denominator of the formula (14) may be transformed in order to effectively utilize the calculation result in FIG. 8. That is, if the orthogonal vector <ui′> of the hi numerator is represented by the base extraction vector <ui>, the hi numerator can be represented by the formula (15).


[0121] [Numeral 15]
13d1·ui2=d1·(ui-ui·v1)v1)2=(d1·ui-d1·ui·v1v1)2=d1·ui2d1·v1=0(15)


[0122] Further, if the differential vector <d1> of the formula (15) is represented by the first differential vector <d>, the hi numerator can be represented by the formula (16).


[0123] [Numeral 16]
14d1·ui2=(d-d·v1v1)·ui2=(d·ui-d·v1v1·ui)2=(d·ui-d·u1&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;u1·ui&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;)2(16)


[0124] Accordingly, the calculation result <d·u1> which is obtained in the search of the first basis can be used for calculation of the hi numerator. Also, when the hi denominator is transformed, it can be represented by the formula (17).


[0125] [Numeral 17]
15&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;=&LeftDoubleBracketingBar;ui-ui·v1v1&RightDoubleBracketingBar;2=&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2-2ui·v12+ui·v12&LeftDoubleBracketingBar;v1&RightDoubleBracketingBar;2=&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2-ui·v12=&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2-(ui·u1&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;)2(17)


[0126] Accordingly, The calculation result ∥ui2, ∥u1∥ which is obtained in the first basis search can be used in the calculation of the hi numerator. When hi is placed in the formula (14), it can be represented by the formula (18-1) and finally by the formula (18-2).


[0127] [Numeral 18]
16hi=(d·ui-d·u1&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;u1·ui&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;)2&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2-((ui·u1)&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;))2(18-1)=(P-PkIkuk·uiLk)2Li-((uk·ui)Lk)2(18-2)


[0128] A calculating result of the arrangement [Pi], [Li] can be used for Pi=<d1·ui>, Li=∥u12, respectively and the preceding result can be used for Pk=Pi=<d·u1>, {square root}{square root over ( )}Lk={square root}{square root over ( )}L1=∥u1∥. Accordingly, it is in a part of <uk·ui>=<u1·ui> that a calculation is newly required.


[0129] Based on the background as above, a search of the second basis is carried out by the following calculation. That is, at step S41, P1=<d·u1> and L1=∥u12 are held as k=1. The result obtained in steps S22 and S23 can be used. The numeral “1” of P1 means the first basis <u1> in the index counter i and is held in an arrangement [Ik] at step S27. At step S42, a calculation is carried out by the formula (19) and a result is stored in a register η, κ.


[0130] [Numeral 19]
17η=1Lkκ=Pkη(19)


[0131] At step S43, the fifteen dimensional vector <w1> is obtained by subtracting the sixteenth component of <u1> from the remaining components as the preprocessing of inner product calculation <u1·ui> as described below. At step S44, an inner product <wk·ui> is calculated with respect to i=0˜(N−1) and is stored in an arrangement [Qi]. At step S45, (Pi−κQi) is calculated with respect to i=0˜(N−1) and is stored in and written in over an arrangement [Pi]. The calculation result at step S45 is stored in (written in over) an arrangement [Pi], whereby contents of the arrangement [Pi] are gradually renewed on the past calculation result. Further, at step S46, (Li−Qi2) is calculated with respect to i=0˜(N−1) and is stored in and written in over an arrangement [Li]. Li in the right side is a result of the calculation at step S23. The calculation result at step S46 is stored in and written in over an arrangement [Li] at step S23, whereby contents of the arrangement [Li] are gradually renewed on the past calculation result. The repeated calculation of hi is finally represented by the formula (20).


[0132] [Numeral 20]
18hi=(Pi-κQi)2Li-Qi2=Pi2Li(20)


[0133] At step S24, a register E=0 holding a maximum value of hi and an index counter i=0 of the base extraction vector <ui> are initialized, respectively and “1” is added to a basis number counter k to be k=2.


[0134] At step S48, hi=Pi2/Li is calculated. At step S49, it is judged whether hi is more than E or not In case that hi is more than E, E is renewed by hi at step S50 and i is stored in an arrangement [Ik] (k=2). In case that hi is not more than E, the processing at step S50 is skipped.


[0135] At step S51, “1” is added to i and at step S52, it is judged whether i is not less than N or not. In case that i is less than N, turning to step S42 and the maximum value search processing is carried out with respect to subsequent hi. When the same procedure was proceeded and i is not less than N, the search of the all nest blocks are terminated. At the time, the index value of the second basis vector <ui> to make hi maximum is held in an arrangement [Ik] (k=2).


[0136] At step S53, the second basis vector <u2> is subjected to orthonormal with <v1> to be a normalized basis vector <v2> which is stored in an arrangement [Vk] (k=2). A scalar coefficient α2 which is a shadow of <d1> projected to <v2> is calculated and is stored in an arrangement [Ak] (k=2). The ortho normalization of the basis vector <u2> and the calculation of the scalar coefficient α2 are carried out at one time with respect to the above search result, whereby the AOT processing is much simplified at high speed.


[0137] At step S54, the differential vector <d1> is closed to the second basis and is renewed by the differential vector <d2>=<d1>−α2 <v2>. At step S55, a square norm e=∥u22 of new differential vector is calculated and at step S56, it is judged whether e is not more than Z or not. In case that e is not more than Z, the AOT processing is terminated at the step. In case that e is more than Z, the search processing of the third basis is carried out.


[0138] A search processing of the second basis is shown in FIG. 6. Before explanation of the processing, an idea on efficient calculation will be explained. That is, the third basis is usually obtained as orthogonal vector <uj′> which makes a square norm ei of the difference between the second basis and a differential vector <d2> minimum, and is represented by the formula (21).


[0139] [Numeral 21]
19ei=&LeftDoubleBracketingBar;d2-d2·ui&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2ui&RightDoubleBracketingBar;2=&LeftDoubleBracketingBar;d2&RightDoubleBracketingBar;2-2d2·ui&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2+d2·ui&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;4&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;=&LeftDoubleBracketingBar;d2&RightDoubleBracketingBar;2-d2·ui&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2(21)


[0140] The orthogonal vector <ui′> is obtained by orthogonalization of the third base extraction vector <ui> to the first normalized basis vector <v1> and the second normalized basis vector <v2>.


[0141] [Numeral 22]




u


i


1


=u


1


−<u


1


·v


1


>v


1


−<u


i


·v


2


>v


2
  (22)



[0142] The first item ∥d22 of the right side in the formula (21) which is more than 0 is independent of an extracted basis and hence, <ui′> that makes the second item of the right side in the formula (21) maximum becomes the third basis. The second item hi of the right side is represented by the formula (23).


[0143] [Numeral 23]
20hi=d2·ui2&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2(23)


[0144] If the orthogonal vector <ui′> of the hi numerator is represented by the base extraction vector <ui>, the hi numerator can be represented by the formula (24).


[0145] [Numeral 24]
21d2·ui2=d2·(ui-ui·v1v1-u1·v2v2)2=(d2·ui-d2·v1ui·v1-d2·v2ui·v2)2=d2·ui2d2·v1=0d2·v2=0(24)


[0146] Further, if the differential vector <d2> of the formula (24) is represented by the first differential vector <d>, the hi numerator can be represented by the formula (25).


[0147] [Numeral 25]
22d2·ui2=(d-d·v1)v1-d·v2v2)·ui2=(d2·ui-d·v1v1·ui-d·v2v2·ui)2=(d·ui-d·u1&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;u1·ui&LeftDoubleBracketingBar;u1&RightDoubleBracketingBar;-d·u2&LeftDoubleBracketingBar;u2&RightDoubleBracketingBar;u2·ui&LeftDoubleBracketingBar;u2&RightDoubleBracketingBar;)2(25)


[0148] Also, when the hi denominator is transformed, it can be represented by the formula (26).


[0149] [Numeral 26]
23&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2=&LeftDoubleBracketingBar;ui-ui·v1v1-ui·v2v2&RightDoubleBracketingBar;&LeftDoubleBracketingBar;ui-ui·v1v1-ui·v2v2&RightDoubleBracketingBar;=&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2-ui·v12-ui·v22(26)


[0150] When hi is placed in the formula (23), it can be represented by the formula (27).


[0151] [Numeral 27]
24hi=(d·ui-d·v1(v1·ui)-d·v2v2·ui)2&LeftDoubleBracketingBar;ui&RightDoubleBracketingBar;2-ui·v12-ui·v22


[0152] The second item of numerator and denominator in the formula (27) has been already calculated and is represented by the formula (28).




P


1


=<d·u


1


>−<d·v


1


><v


1


·u


1
>  (28-1)





I


1


=∥u


1
2−<u1·v1>2  (28-2)



[0153] Accordingly, hi is represented by the formula (29) in the same manner as in the formula (18-2).


[0154] [Numeral 29]
25hi=(Pi-PkLkvk·uiLk)2Li-(vk·uiLk)2(29)


[0155] The formula (29) is same form as the formula (18-2) except that the inner product <uk·ui> is changed to <vk·ui>. Accordingly, each basis hereinafter can be effectively obtained by repeating the same operation as in FIG. 5.


[0156] Based on the above processing, search of the third and following basis is calculated as follows. That is, P2=<d1·u2> and L2=∥u22 are hold by k=2 at stet S61. At step S62, calculation is carried out according to the formula (30) and a result is stored in the registers η and κ.


[0157] [Numeral 30]
26η=1Lkκ=Pkη(30)


[0158] At step S63, the fifteen dimensional vector <w2> is obtained by subtracting the sixteenth component of <v2> from the remaining components as the preprocessing of inner product calculation <v2·ui> as described below. Since each component of <v2> is not an integer, it is necessary that an inner product is calculated in the form of real number. In order to avoid the calculation in the form of real number, each component of <v2> (i.e.<w2>) is multiplied by a constant “a” to make it an integer, in advance.


[0159] At step S64, an inner product <w2·ui>η/a is calculated with respect to i=0˜(N−1) and is stored in (written in over) an arrangement [Qi]. At the time, each calculation result is divided by the constant a to put the position of a figure to the former position.


[0160] At step S65, (Pi−κQi) is calculated with respect to i=0˜(N−1) and is stored in (written in over) an arrangement [Pi]. At step S46, (Li−Qi2) is calculated with respect to i=0˜(N−1) and is stored in (written in over) an arrangement [Li]. The calculation of the formula (29) is represented by the formula (31).


[0161] [Numeral 31]
27hi=(Pi-κQi)2Li-Qi2=Pi2Li(31)


[0162] At step S67, a register E=0 holding a maximum value of hi and an index counter i=0 of the base extraction vector <ui> are initialized, respectively and “1” is added to a basis number counter k to be k=3.


[0163] At step S68, hi=Pi2/Li is calculated. At step S69, it is judged whether hi is more than E or not. In case that hi is more than E, E is renewed by hi at step S70 and i is held in an arrangement [Ik] (k=3). In case that hi is not more than E, the processing at step S70 is skipped.


[0164] At step S71, “1” is added to i and at step S72, it is judged whether i is not less than N or not. In case that i is less than N, turning to step S68 and the maximum value search processing is carried out with respect to subsequent hi. When the same procedure was proceeded and i is not less than N, the search of the all nest blocks are terminated. At the time, the index value of the third basis vector <u3> to make hi maximum is held in an arrangement [Ik] (k=3).


[0165] At step S73, the third basis vector <u3> is subjected to orthonormal transform with <v1> and <v2> to be a normalized basis vector <v3> which is stored in an arrangement [Vk]. A scalar coefficient α3 which is a shadow of <d2> projected to <v3> is calculated and is stored in an arrangement [Ak].


[0166] At step S74, the differential vector <d2> is approximated to the third basis and is renewed by the differential vector <d3>=<d2>−α3<v3>. At step S75, a square norm e=∥d32 of new differential vector is calculated and at step S76, it is judged whether e is not more than Z or not. In case that e is not more than Z, the AOT processing is terminated at the step. In case that e is more than Z, turning to the step S61 and the preprocessing and search processing of the fourth and following basis are carried out. It is preferred that the processing to judge whether k is not less than 4 or not is provided (not shown) after the step S76, whereby the AOT processing can be skipped in case that k is not less than 4.


[0167] The AOT processing can be much simplified can be carried out at high speed by the above processing or operation. The actual calculation time is reduced to ⅓ to {fraction (1/10)} in comparison with conventional methods.


[0168] Referring to FIG. 6, a group of αk<vk>(k=1˜m) is obtained from AOT 32 and the differential vector <dj> is approximated within the allowable error Z by the linear bond.


[0169] Further, in the coefficient transform unit 33, the expansion coefficient βk is obtained to transform the group of αk<vk>(k=1˜m) to βk<uk>(k=1˜m) by the following method. That is, when each matrix of the base extraction vector <uk>, the expansion coefficient βk, the orthonormal basis vector <vk> and the scalar coefficient αk is represented by the formula (32)


[0170] [Numeral 32]
28U=[u1,u2,um]B=[β1β2βm]V=[v1,v2,vm]A=[α1α2αm](32)


[0171] a relationship of the matrix is represented by the formula (33).


[0172] [Numeral 33]


UB=VA  (33)


[0173] In order to solve the formula with respect to the matrix B, both sides of the formula (33) is multiplied from the left by a transposed matrix UT of the matrix U as shown by the formula (34).


[0174] [Numeral 34]


UT UB=UT VA  (34)


[0175] The matrix (UT U) is expanded to be the formula (35).


[0176] [Numeral 35]
29UTU=[u1u2unk][u1,u2,unk]=[u1·u1u1·u2u1·unku2·u1u2·u2u2·unkunk·u1unk·u2unk·unk](35)


[0177] In the formula (35), <ui·uj> means an inner product, and a square matrix which is a symmetrical to a diagonal element is obtained because <ui·uj> is equal to <uj·ui>, and an inverse matrix exists because <ui> is different from <uj>. Therefore, the inverse matrix (UT U)−1 of the matrix (UT U) is multiplied from the left of both sides of the formula to obtain the formula (36) and βk is calculated.


[0178] [Numeral 36]


(UT U)−1 UT UB=B=(UT U)−1 UT VA  (36)


[0179] As explained above, it is unnecessary by transforming the group of the orthonormal basis αk <vk> (k=1˜m) to the non-orthonormal basis βk<uk> (k=1˜m) that each base extraction vector <uk> is subjected to orthogonal transform in decoding side every time, and the differential vector <dj> can approximate by adding a multiplied value of them and βk. Thus, the decoding processing can be simply carried out at high speed.


[0180] A compression encoding processing of the expansion coefficient βk will be explained.


[0181]
FIG. 13 is an image drawing of a compression encoding processing of the expansion coefficient. In FIG. 13(a), a norm is extracted from the produced β1˜β4. In FIG. 13(b), a norm is arranged, for example, in ascending order (β3, β2, β4, β1) and a difference (Δβ3, Δβ2, Δβ4, Δβ1) is calculated. In FIG. 13(c), the upper bits are separated by removing the lowest two bits from all bits in the difference of coefficient (Δβ3, Δβ2, Δβ4, Δβ1) and are subjected to Hoffman encoding.


[0182] In the example, two groups of Δβ3 and (Δβ2, Δβ4, Δβ1) exist with respect to the value, and according to Huffman encoding, a code sign of less bit numbers is allotted to (Δβ2, Δβ4, Δβ1) of which generating frequency is more and a code sign of more bit numbers is allotted to Δβ3 of which generating frequency is less. Accordingly, the compression encoding of expansion coefficient βk is possible. Also, fractions of the lowest bits are omitted by Huffman encoding of the upper bits in difference Δβk of the coefficient, whereby possibility of Δβ2=Δβ4=Δβ1 is high in the upper bits as shown in FIG. 13(c).


[0183] The lowest two bits of difference Δβk is packed with positive and negative code sign bits and an index information (13 bits=0˜8191) of the basis vectors <uk> corresponding to the sign bits in a code sign area of 2 bites fixed length and is output as the fixed length code sign. The output is carried out in the order of Δβ3, Δβ2, Δβ4 and Δβ1 (i.e. u3, u2, u4, u1).


[0184] In FIG. 13(d), each code sign is input in the order of u3, u2, u4 and u1 in decoding side, from which each of the coefficient Δβ3, Δβ2, Δβ4 and Δβ1 is separated. Further, β3 is decoded from the first Δβ3, β2 is decoded by adding Δβ2 to the decoded β3, β4 is decoded by adding Δβ4 to the decoded β2, and then β1 is decoded by adding Δβ1 to the decoded β4. The decoding order is not important because βk<uk> is functioned based on the sum (linear bond) of these values.


[0185] The difference can be calculated by arranging the norm in descending order instead of the ascending order.


[0186] The coding processing by the encoding unit 34 will be explained. A prediction difference ΔDCJ, I of DPCM is quantized by a quantization coefficient Q (Z) , and only in case that ΔDCJ, I is 0, run length coding is considered, and the prediction difference ΔDCJ,I and the run length coding each is independently subjected to Huffman coding. Only in case that the basis number k is 0, the run length coding is considered, and the basis number k and the run length each is independently subjected to Huffman coding. The coefficient difference Δβk is quantized by a constant number Q (e.g. 8) to obtain its quotient, which is subjected to Huffman coding. The code signbits of the expansion coefficient βk and the lowest two bits of the coefficient difference Δβk are incorporated in the code information i (=13 bits) of the basis vector <uk> to make the fixed length coding sign of 16 bits, which are incorporated in the coefficient difference Δβk in ascending (or descending) order and is transmitted. As a whole, row of the coding sign is constituted by incorporating these in appearing order per unit of pixel block. If necessary, a sign EOB is input to show change of pixel blocks.


[0187]
FIG. 14 is a block diagram showing an image decoder, which is an embodiment of the invention, and corresponds to the image encoder as shown in FIG. 6. In FIG. 14, 41 is a decoding unit by Huffman, etc., 42 is an alternating current component prediction unit for predicting target blocks <Rj> containing the alternating current component from the surrounding DC values DCJ′ containing the noticeable pixels DCJ, 43 is the differential vector reproduction unit for reproducing an approximate differential vector <dj> based on the decoding basis βk <uk> (k=1˜m), 44 is the Rj reproduction unit for reproducing target blocks <Rj> based on the decoding blocks <Rj>, 45 is the reproduced image memory, 46 is the IDPCM unit for IDPCM decoding the decoded DC value, 47 is the DC image memory for storing the DC nest, 48 is the DC nest production unit which is same as in FIG. 2, 49 is the DC nest memory for storing the DC nest, 50 is the selected block buffer for holing the selected blocks <Uk> which are down-sampled from the DC nest, 51 is a multiplier for multiplying <Uk> by βk, 52 and 53 are the cumulative addition unit of βk<uk> (k=1˜m), 54 is a means for obtaining a block mean value Aj of cumulative addition values, 55 is a subtractor for separating the block mean value Aj of cumulative addition values, 56 is an approximate vector buffer for holding reproduced approximate differential vector <dj>, and 57 is a means for adding the reproduced approximate differential vector <dj>.


[0188] In FIG. 15, which is a flow chart showing an image decoding processing of an embodiment of the invention, the image coding data is input at step S101. At step S102, each DC value in Y, U and V is decoded by IDPCM method similar to FIG. 6 and DC images are reproduced. At step S103, DC nest is produced from the DC value of Y component. At the time, as shown in FIG. 7, the lowest four bits of each DC pixel value DCJ are masked to be each DC nest pixel value NJ. The information such as cut position of the DC images is separately received. At step S104, the index counters j and J to the original image memory 45 and DC image memory 47 are initialized to 0.


[0189] At step S105, coding data of one block image is input. At step S106, it is judged that k is 0 or not. In case that k is 0, the target blocks <Rj> are reproduced by alternating current prediction method as described hereinafter. In case that k is not 0, it is judged at step S107 whether k is not less than 1 and not more than 4 or not.


[0190] In case k is not less than 1 and not more than 4, the differential vector <dj> is inversely quantized at step S112. Since the lowest four bits of the DC nest are previously masked in the embodiment of the invention, the differential vector <dj> is obtained at once by cumulatively adding the product of the selected block <Uk> and βk and by separating the block mean value Aj from the cumulative addition result, whereby the decoding processing is carried out at high speed. At step S113, the DC value DCJ corresponding to thus obtained differential vector <dj> is added.


[0191] In case k is less than 1 and more than 4, the target blocks <Rj> are directly produced from the decoding data of the target blocks <Rj> at step S108. Thus, the target blocks <Rj> of 4 times 4 pixels are reproduced by any methods as above. The reproduced target blocks <Rj> are stored in the reproduced image memory 45 at step S109.


[0192] At step S110, “1” is added to the counters j and J, respectively, and at step S111, it is judged whether i is not less than M (all pixel block numbers) or not. In case that i is less than M, turning to step S105 and the decoding and reproducing processing is carried out with respect to subsequent the block image coding data. When the same procedure was proceeded and j is not less than M in the judge at step S111, the decoding processing per one image is terminated.


[0193]
FIG. 16 is an image drawing of an alternating current component prediction, which is an embodiment of the invention and is applicable for conventional prediction methods.


[0194]
FIG. 16(A) is a stepwise alternating current component prediction method as described hereinafter. At first stage, each sub-block S1˜S4 is predicted from each DC value of the 4 blocks (U, R, B, L) surrounding the S1˜S4.


[0195] S1=S+(U+L−B−R)/8


[0196] S2=S+(U+R−B−L)/8


[0197] S3=S+(B+L−U−R)/8


[0198] S4=S+(B+R−U−L)/8


[0199] Similarly, U1˜U4, L1˜L4, R1˜R4 and B1˜B4 are predicted at the first stage. At second stage, 4 pixels P1˜P4 on S1 are predicted by using the above method repeatedly.


[0200] P1=S1+(U3+L2−S3−S2)/8


[0201] P2=S1+(U3+S2−S3−L2)/8


[0202] P3=S1+(S3+L2−U3−-S2)/8


[0203] P4=S1+(S3+S2−U3−L2)/8


[0204] Each 4 pixels P1˜P4 on S2˜S4 are predicted in the same manner. The target blocks <Rj> are reproduced by such two stage processing.


[0205]
FIG. 16(B) is a non-stepwise alternating current component prediction method, which the applicant has been already proposed. In FIG. 16(B), each of the 4 pixels P1˜P4 on each of the sub-block S1˜S4 is predicted from each DC value of 4 blocks surrounding the noticeable block S at a stroke. At first, each approximation of S2≈S3≈S, U3≈U2 and L2≈L is carried out to obtain each4 pixels P1˜P4 on S1. The approximation is applied to P1˜P4 on S1 to obtain the formula,




P


1


=S


1
+(U3+L2−S3−S2)/8=S1+(U+L−S−S)/8



[0206] The above formula, S1=S+(U+L−B−R)/8, is substituted for the formula, P1=S1+(U+L−S−S)/8, P1 on S1 is finally represented by the formula,




P


1


=S+
(2U+2L−2S−B−R)/8



[0207] And, P2 on S1 is represented by the formula,


[0208]

P


2


=S


1
+(U3+S2=S3−L2)/8=S1+(U+S−S−L)/8


[0209] The above formula, S1=S+(U+L−B−R)/8, is substituted for the formula, P2=S1+(U+S−S−L)/8, P2 on S1 is finally represented by the formula,




P


2


=S
+(2U−B−R)/8



[0210] Also, P3 on S1 is represented by the formula,




P


3


=S


1
+(S3+L2−U3−S2)/8=S1+(S+L−U−S)/8



[0211] The above formula, S1=S+(U+L−B−R)/8, is substituted for the formula, P3=S1+(S+L−U−S)/8, P3 on S1 is finally represented by the formula,




P


3


=S
+(2L−B−R)/8



[0212] Further, P4 on S1 is represented by the formula,




P


4


=S


1
+(S3+S2−U3−L2)/8=S1+(S+S−U−L)/8



[0213] The above formula, S1=S+(U+L−B−R)/8, is substituted for the formula, P4=S1+(S+S−U−L)/8, P4 on S1 is finally represented by the formula,




P


4


=S
+(2S−B−R)/8



[0214] Accordingly, 4 pixels P1˜P4 on S1 can be non-stepwise obtained by the formulae at a stroke.




P


1


=S
+(2U+2L−2S−B−R)/8





P


2


=S
+(2U−B−R)/8





P


3


=S
+(2L−B−R)/8





P


4


=S
+(2S−B−R)/8



[0215] Each 4 pixels P1˜P4 on S2˜S4 is obtained in the same manner.


[0216] The embodiments of the invention are explained by using the several examples, but it is apparent that the invention should not be limited thereto. It is to be appreciated that those skilled in the art can change or modify the embodiments in such point as construction, control, processing or a combination thereof without departing from the scope and spirit of the invention.


[0217] According to the invention, high image quality can be obtained by the improvement of the DC nest and high speed encoding can be achieved by the means for the AOT calculation. Therefore, the method of the invention much contributes to the attainment of high image quality and high speed encoding in the HVQ system.


Claims
  • 1. In an image encoding method which comprises producing a DC image composed of each block mean value by dividing an image data per B pixel into a block, making a part of said DC image a DC nest, and where the differential vector which is obtained by separating the DC value from the pixel block to be encoded is over an allowable value, calculating one or more orthogonal basis, to which the differential vector is approximated, by the adaptive orthogonal transform using the DC nest, the improvement which comprises setting the lowest n (n=log2 B) bits of the DC pixels in each sample to 0, where the base extraction blocks are down-sampled from the DC nest and the block mean value thereof is calculated using the samples.
  • 2. The method according to claim 1, wherein the lowest n bits of each DC pixels are set to 0, where the DC nest is produced from the DC image.
  • 3. The method according to claims 1 and 2, wherein a base extraction vector is produced to which the differential vector approximates by separating the block mean value from the base extraction block in which n bits of the DC pixels are set to 0.
  • 4. The method according to claim 3, optional elements of base extraction vectors <ui> are replaced by linear bond of the remainder elements and the inner product of the base extraction vectors and the other optional vectors <w> are calculated by the formula.
  • 5. The method according to claims 3 and 4, wherein a first basis is searched so that hi may be maximum in the following formula,
  • 6. The method according to claims 3 and 4, wherein a second basis is searched so thathi may be maximum in the following formula,
  • 7. The method according to claims 3 and 4, wherein a third basis is searched so that hi may be maximum in the following formula,
  • 8. The method according to claims 6 and 7, wherein the base extraction vectors <ui> which match with search conditions are subjected to orthonormal transform with one or more preceding orthonormal basis.
  • 9. In an image encoding method which comprises producing a DC image composed of each block mean value by dividing an image data per B pixel into a block, making a part of said DC image a DC nest, and where the differential vector which is obtained by separating the DC value from the pixel block to be encoded is over an allowable value, calculating one or more orthogonal basis, to which the differential vector is approximated, by the adaptive orthogonal transform using the DC nest, the improvement which comprises rearranging the norms of each scalar expansion coefficient β1˜βm in ascending or descending order, calculating a difference (including 0) between the norms adjacent to each other, and applying Huffman coding to the obtained difference, wherein the basis is represented by βk<uk>(k=1˜m).
  • 10. In an image encoding method which comprises producing a DC image composed of each block mean value by dividing an image data per B pixel into a block, making a part of said DC image a DC nest, and where the differential vector which is obtained by separating the DC value from the pixel block to be encoded is over an allowable value, calculating one or more orthogonal basis, to which the differential vector is approximated, by the adaptive orthogonal transform using the DC nest, the improvement which comprises encoding an image data of coding objective blocks instead of the coding of the basis, where the basis is more than certain number.
  • 11. In an image decoding method which comprises reproducing a DC image corresponding to each block mean value per B pixel from encoding data with respect to the HVQ system, making a part of said DC image a DC nest, and reproducing image data of target blockby synthesizing, to DC value of target block, one or more basis vectors which is selected from DC nests based on the encoding data, the improvement which comprises setting the lowest n (n=log2 B) bits of the DC pixels in each sample to 0, where the selected block is down-sampled from the DC nest and the block mean value of it is calculated using the samples.
  • 12. In an image decoding method which comprises reproducing a DC image corresponding to each block mean value per B pixel from encoding data with respect to the HVQ system, making a part of said DC image a DC nest, and reproducing image data of target block by synthesizing, to DC value of target block, one or more basis vectors which is selected from DC nests based on the encoding data, the improvement which comprise, where the decoded basis is information with respect to βk<uk> (k=1˜m), setting the lowest n (n=log2 B) bits of the DC pixels per each selected block (Uk) read out from the DC nest to 0, calculating a product-sum of basis βk<uk> (k=1˜m), and then dividing the calculated result by the block pixel number B.
  • 13. The method according to claims 11 and 12, wherein the lowest n bits of each DC pixel is made 0, where DC nests are produced from the DC image.
  • 14. In an image encoding apparatus which comprises producing a DC image composed of each block mean value by dividing an image data per B pixel into a block, making a part of said DC image a DC nest, and where a differential vector which is obtained by separating the DC value from the pixel block to be encoded is over an allowable value, calculating one or more orthogonal basis, to which the differential vector is approximated, by the adaptive orthogonal transform using the DC nest, the improvement comprising a memory to store the DC nest in which the lowest n (n=log2 B) bits of the DC nest pixels are set to 0.
  • 15. In an image decoding apparatus which comprises reproducing a DC image corresponding to each block mean value per B pixel from encoding data with respect to the HVQ system, making a part of said DC image a DC nest, and reproducing image data of target block by synthesizing, to the DC value of target block, one or more basis vectors which is selected from DC nests based on the encoding data, the improvement comprising a memory to store the DC nest in which the lowest n (n=log2 B) bits of the DC nest pixels are set to 0.
  • 16. A computer readable recording medium storing a program to make a computer to implement the processing according to claims 1 to 13.
Priority Claims (1)
Number Date Country Kind
2000-141675 May 2000 JP