Image encoding/decoding method, apparatus thereof and recording medium in which program therefor is recorded

Information

  • Patent Grant
  • 6714687
  • Patent Number
    6,714,687
  • Date Filed
    Friday, December 8, 2000
    23 years ago
  • Date Issued
    Tuesday, March 30, 2004
    20 years ago
Abstract
The invention relates to an image encoding/decoding method, apparatus thereof and a recording medium in which a program therefor is recorded, whereby the encoding/decoding can be obtained with high image quality at high speed. In an image encoding method which comprises producing a DC image composed of each block mean value by dividing an image data per B pixel into a block, making a part of said DC image a DC nest, and where the differential vector which is obtained by separating the DC value DCJ from the pixel block to be encoded is over an allowable value Z, calculating one or more orthogonal basis (αk), to which the differential vector is approximated, by the adaptive orthogonal transform (AOT) using the DC nest, each of the lowest n (n=log2 B) bits of base extraction blocks which are down-sampled from the DC nest is set to 0. Further, base extraction vectors are produced by separating a block mean value ai from the base extraction blocks .
Description




BACKGROUND OF THE INVENTION




1. Field of the Invention




The present invention relates to an image encoding/decoding method, an apparatus thereof, and a recording medium in which a program therefor is recorded, and more particularly, relates to an image encoding/decoding method, an apparatus thereof, and a recording medium in which a program therefor is recorded, according to Hybrid Vector Quantization (HVQ) system.




2. Description of Related Art




According to JPEG (Joint Photographic Expert Group) system, 8 times 8 pixel blocks are converted to DC (direct current) value and each coefficient value of from base to 63 times frequency by two dimensional DCT (discrete cosine transform), and information amount is reduced by quantizing the coefficient value in a different quantization width within no reduction of image quality utilizing frequency components of natural images which are gathered in a low frequency range, and then Huffman encoding is carried out.




According to HVQ system, which is a kind of mean value separation type block encoding same as JPEG, adaptive orthogonal transform (AOT) which is an intermediate system between a vector quantization and orthogonal transform encoding is used as a compression principle. AOT is a system in which the minimum number of non-orthogonal basis is selected from nests of the basis corresponding to a code book of vector quantization and the objective blocks become close to the desired and allowable error “Z”. According to the HVQ system, decoding is quickly carried because a decoding operation can be done in the form of integer. Natural images and artificial images (animation images, CG images) can be compressed in high image quality, because there are not mosquito and block noise, which are particularly generated in JPEG, and false contour, which is particularly generated in GIF. The invention relates to a method for further improving the image quality and for carrying out the coding operation at a higher speed in the HVQ system.




The applicants of the invention have proposed an image encoding/decoding method in accordance with the HVQ system utilizing self-similarity of images in Japanese Patent Application No. 189239/98. The contents of such proposal will be explained as follows. In the disclosure, a sign <a> means vector “a” or block “a”, a sign ∥a∥ means norm of the vector “a”, and a sign <a·b> means inner product of vectors a and b. Further, vectors and blocks in drawings and [numbers] are represented by block letters.





FIG. 1

is a block diagram showing a conventional image encoder. In

FIG. 1

,


11


is an original image memory for storing an original image data,


12


is a DC value production unit for seeking a block average (DC) value per each pixel block (4 times 4 pixel) of the original image data,


13


is a differential PCM encoding unit (DPCM) for carrying out a differential predict encoding per each DC value,


14


is inverse DPCM encoding unit for decoding each DC value from the differential PCM encoding,


15


is a DC image memory for storing a decoded DC image,


16


is a DC nest production unit for cutting off the DC nest of a desired size from a part of the DC image, and


17


is a DC nest memory for storing the DC nest.




Further,


18


is a subtractor for separating a corresponding decoding DC value “DC


J


” from a target image block <R


J


> to be encoded,


19


is a differential vector buffer for storing a differential vector <d


J


> which is DC separated,


20


is an extracted block buffer for storing a base extraction block <U


i


> of 4 times 4 pixels which is down-sampled from the DC nest,


21


is an equilibrator for seeking a block mean value a


i


of the base extraction block <U


i


>,


22


is a subtractor for separating the block means value a


i


from the base extraction block <U


i


>,


23


is an extracted vector buffer for storing the base extraction block <U


i


> which is separated by the mean value,


24


is an adaptive orthogonal transform (AOT) processing unit for producing an orthogonal basis α


k


<u


k


′> (k=1˜m) to search the DC nest to make the differential vector <d


j


> closer to the allowable error Z, where a square norm ∥d


j





2


of the differential vector is over the allowable error Z,


25


is a coefficient transform unit for seeking an expanding square coefficient β


k


which is multiplied by a non-orthogonal basis vector <u


k


> (k=1˜m) per the produced orthogonal basis α


k


<u


k


′> (k=1˜m) to produce an equivalent non-orthogonal basis β


k


<u


k


> (k=1˜m, and


26


is an encoding unit by Huffman coding, run length coding or fixed length coding system for the compression encoding of information such as DPCM encoding of the DC value or the non-orthogonal basis β


k


<u


k


>.




In the DC value production unit


12


, the block mean value of 4 times 4 pixels is provided in which the first decimal place is rounded off or down. In the DPCM


13


, where the DC value of row J and column T is shown by the DC


J, I


, a predictive value DC


J, I


′ of the DC


J,I


is provided by the formula, DC


J, I


′=(DC


J, I−1


+DC


J−1, I


)/2, and its predictive error (Δ DC


J, I


=DC


J, I


−DC


J, I


′) is linear-quantized by a quantization coefficient Q(Z) and is output. The quantization coefficient Q(Z) corresponds to the allowable error Z and is variable within the range of 1 to 8 according to the allowable error Z.




In the DC nest production unit


16


, the DC nest is prepared by copying the range of vertical


39


×horizontal


71


from the DC image. It is preferred that the DC nest includes more alternating current components because it is used as a codebook. Therefore, it is prepared by copying such the range that the sum of absolute values of difference between the DC values adjacent to each other in a plurality of the extracted ranges become maximum.




In making down-samples of the base extraction block <U


i


>, a vertex per one DC value in vertical and horizontal section is set to (p


x


, p


y


) ε [0, 63]×[0, 31] and a distance of its sub-samples is set to 4 kinds of (s


x


, s


y


) ε {(1, 1), (1, 2), (2, 1), (2, 2) }. Accordingly, the total numbers of the base extraction blocks <U


i


> are N (=8192) and are referred by an index counter “i” from the AOT


24


. Behavior of conventional adaptive orthogonal transform processing unit


24


will be explained below.





FIG. 2

is a flow chart of conventional adaptive orthogonal transform processing and

FIG. 3

is an image drawing of the processing. In

FIG. 2

, it is input in the processing that the square norm ∥d


j





2


of the differential vector is more than Z. In step S


121


, the square norm ∥d


j





2


of the differential vector is set in a resister E. A basis number counter is initialized to k=1. In step S


122


, much value ( e.g. 100,000) is set in a minimum value holding resister E′. In step S


123


, an index counter of the base extraction block <U


i


> is initialized to i=0. By these steps, the initial address and distance of sub-samples in the DC nest are set to (p


x


, p


y


)=(0, 0) and (s


x


, s


y


)=(1, 1), respectively.




In step S


124


, the base extraction vector <u


i


> is produced by separating the block mean value a


i


from the base extraction blocks <U


i


>. Since the operation or calculation is carried out under the accuracy of integer level, any value of first decimal place in the block mean value a


i


is rounded off or down. In step S


125


, the base extraction vector <u


i


> is subjected to orthogonal transform processing to be converted to the orthogonal basis vector <u


k


′>, if necessary (k>1).





FIG. 3

(A) and (B) are image drawings of the orthogonal transform processing. In

FIG. 3

(A), the first base extraction vector <u


1


> can be the first basis vector <u


1


′> as it is.




Then, the second base extraction vector <u


2


> is subjected to orthogonal transform processing to be converted to the second basis vector <u


2


′> in accordance with the following method. That is, a shadow of the second base extraction vector <u


2


> projected on the first basis vector <u


1


′> is represented by the formula (1).




[Numeral 1]











&LeftDoubleBracketingBar;

u
2

&RightDoubleBracketingBar;


cos





θ

=







u
1


·

u
2





&LeftDoubleBracketingBar;

u
1


&RightDoubleBracketingBar;











u
1


·

u
2





=


&LeftDoubleBracketingBar;

u
1


&RightDoubleBracketingBar;







&LeftDoubleBracketingBar;

u
2

&RightDoubleBracketingBar;


cos





θ






(
1
)













Accordingly, the second orthogonal vector <u


2


′> is obtained by subtracting the vector of the projected shadow from the second base extraction vector <u


2


>.




[Numeral 2]










u
2


=


u
2

-






u
1


·

u
2





&LeftDoubleBracketingBar;

u
1


&RightDoubleBracketingBar;









u
1



&LeftDoubleBracketingBar;

u
1


&RightDoubleBracketingBar;








(
2
)













In FIG.


3


(B), the third base extraction vector <u


3


> is subjected to orthogonal transform processing to the first basis vector <u


1


′> and the second basis vector <u


2


′>.





FIG. 3

is three-dimensionally drawn. The third base extraction vector <u


3


> is subjected to orthogonal transform processing to the first basis vector <u


1


′> to obtain an intermediate orthogonal vector <u


3


″>.




[Numeral 3]










u
3


=


u
3

-






u
1


·

u
3






&LeftDoubleBracketingBar;

u
1


&RightDoubleBracketingBar;

2




u
1








(
3
)













Further, the intermediate orthogonal vector <u


3


″> is subjected to orthogonal transform processing to the second basis vector <u


2


′> to obtain the third basis vector <u


3


′>.




[Numeral 4]













u
3


=


u
3


-






u
2


·

u
3







&LeftDoubleBracketingBar;

u
2


&RightDoubleBracketingBar;

2




u
2










=


(


u
3

-






u
1


·

u
3






&LeftDoubleBracketingBar;

u
1


&RightDoubleBracketingBar;

2




u
1




)

-






(


u
3

-






u
1


·

u
3






&LeftDoubleBracketingBar;

u
1


&RightDoubleBracketingBar;

2




u
1




)

·

u
2







&LeftDoubleBracketingBar;

u
2


&RightDoubleBracketingBar;

2




u
2










=


u
1

-






u
1


·

u
3






&LeftDoubleBracketingBar;

u
1


&RightDoubleBracketingBar;

2




u
1



-






u
2


·

u
3






&LeftDoubleBracketingBar;

u
2


&RightDoubleBracketingBar;

2




u
2











(
4
)













Turning to

FIG. 2

, in step S


126


, a scalar coefficient α


i


is calculated using the orthogonal vector <u


i


′> so that a distance with the differential vector <d


k


> (at first <d


j


>) becomes minimum.




FIG.


3


(C) is an image drawing of the orthogonal transform processing. In FIG.


3


(C), where a differential vector represented by <d


k


> is subjected to approximation, a square norm thereof (e


i


=∥<d


k


>−α


i


<u


i


′>∥


2


) is minimum when the product of the orthogonal vector <u


i


′> and the scalar coefficient α


i


is diagonal with the differential vector {<d


k


>−α


i


<u


i


′>} as shown in FIG.


3


(C) (inner product=0). Accordingly, the scalar coefficient α


i


is obtained by the formula (5).




[Numeral 5]














α
i




u
i


·

(


d
k



–α
i



u
1



)





=
0










α
i






u
i


·

d
k





-


α
i
2






u
i


·

u
i







=
0





(5-1)







α
i

=





d
k

·

u
i







&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

2






(5-2)













It is shown in the drawing that the differential vector <d


k


> (k=0) is subjected to approximation to other first base extraction vector <u


j


′>. The first base extraction vector <u


j


′> is shown by the image drawing because it can take optional directions.




Turning to

FIG. 2

, in step S


127


, a square norm (e


i


) of error vector is obtained by the formula (6) after the differential vector <d


k


>(k=0) is subjected to approximation to the base extraction vector α


i


<u


j


′>.




[Numeral 6]













e
i

=


&LeftDoubleBracketingBar;


d
k

-


α
i



u
i




&RightDoubleBracketingBar;

2







=



&LeftDoubleBracketingBar;

d
k

&RightDoubleBracketingBar;

2

-

2


α
i






d
k

·

u
i






+


α
i
2




&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

2









=



&LeftDoubleBracketingBar;

d
k

&RightDoubleBracketingBar;

2

-

2







d
k

·

u
i





2



&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

2



+







d
k

·

u
i





2



&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

4





&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

2









=



&LeftDoubleBracketingBar;

d
k

&RightDoubleBracketingBar;

2

-






d
k

·

u
i





2



&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

2









=

E
-






d
k

·

u
i





2



&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

2










(
4
)













In step S


128


of

FIG. 2

, it is judged whether e


i


is less than E′ or not. If e


i


is less than E′, content of E′ is renewal in step S


129


and the information regarding α


i


, <u


i


′>, <u


i


>, etc. at the time is held in an arrangement [α


k


], [u


k


′], [u


k


], etc. If e


i


is not less than E′, the processing in step S


129


is skipped.




In step S


130


, one (1) is added to the counter i, and in step S


131


, it is judged whether i is not less than N (=8192) or not. If i is less than N. turning to step


124


and the same processing is carried out with respect to next base extraction vector <u


i


>.




The processing is repeated and when it is judged in step S


131


that i is not less than N, all base extraction vectors <u


i


> have been completely tried. At the time, the register E′ holds the minimum square norm e


i


.




It is judged in step S


132


whether E′ is not more than Z or not. If E′ is more than Z, it is treated as E=E′ in step S


133


. That is, the square norm of the differential vector is renewal. In step S


134


, one (1) is added to the counter k, turning to step S


122


. If E′ is not more than Z, this processing is skipped. Thus, the orthogonal basis α


k


<u


k


′>(k=1˜m) to approximate the difference of the first differential vector <d


j


> to the allowable error Z is obtained.




However, the block mean value a


i


of the base extraction block <U


i


> has been rounded off or down in the conventional methods and therefore, improvement of image quality is limited. Why the conventional methods are inconvenient will be explained according to FIG.


4


.





FIG. 4

is an image drawing of mean value separation processing. A relationship of base extraction block <U


i


> (vertical axis) with the pixel value of certain row (horizontal axis) is shown in FIG.


4


(


a


). An actual pixel value is a block mean value of 16 pixels, but the block mean value of 4 pixels will be used to simplify the explanation herein. In FIG.


14


(


a


), each pixel value is 5, 2, 4, and 3 and its mean value a


i


is 3.5, when the first decimal place is round down, the block mean value a


i


of the base extraction block <u


i


> is 0.5 as shown in FIG.


4


(


b


). In FIG.


4


(


c


), if the basis vector β


k


<u


k


> is added to the DC value DC


J


of the decoded block, the DC component (a


i


=0.5) is overlapped on the target block <R


j


>. In case that the number of basis is plural, the DC value is overlapped on the DC


J


by various values in the range of 0<a


i


<1, and as a result, certain noise is overlapped per each block in the decoded image, whereby image quality is not improved. This disadvantage also occurs in case that the first decimal place is rounded off or up.




According to the conventional AOT processing, much operations and much time are required, because all of the base extraction vectors <u


i


> must be subjected to orthogonal processing to the preceding base vectors <u


k


′>.




SUMMARY OF THE INVENTION




It is therefore an object of the invention to provide an image encoding/decoding method, which provides high image quality at high speed, an apparatus thereof and a recording medium in which such program therefor is recorded.




The above object of the invention can be solved by the construction, for example, as shown in FIG.


5


. That is, the image encoding method of the invention (


1


) comprises producing a DC image composed of each block mean value by dividing an image data per B pixel into a block, making a part of said DC image a DC nest, and where the differential vector <d


j


> which is obtained by separating the DC value DC


J


from the pixel block to be encoded is over an allowable value Z, calculating one or more orthogonal basis (e.g. α


k


<v


k


>), to which the differential vector <d


j


> is approximated, by the adaptive orthogonal transform (AOT) using the DC nest, wherein the lowest n (n=log


2


B) bits of the DC pixel in each sample being set to 0, where the base extraction block is down-sampled from the DC nest and the block mean value a


i


of it is calculated using the samples.




Accordingly, any fraction less than 1 does not occurs in the block mean value a


i


and the block mean value a


i


with integer level precision is obtained at high speed.




In a preferred embodiment of the invention (


1


) that is the invention (


2


), the lowest n bits of the DC pixel is set to 0 or is masked, where the DC nest is produced from the DC image.




Accordingly, the DC nest, of which the lowest n bits of the DC pixel is set to 0 or is masked, is efficiently obtained by one processing.




In a preferred embodiment of the invention (


1


) or (


2


) that is the invention (


3


), a base extraction vector <u


i


> is produced to which the differential vector <d


j


> approximates by separating the block mean value a


i


from the base extraction block <U


i


> in which the lowest n bits of the DC pixel is set to 0.




According to the invention (


3


), the sum (the block mean value) of all elements in such base extraction vectors <u


i


> is always 0 and the DC component is completely separated. Therefore, even if the base vectors <u


k


> are piled up on each other in the decoding side, unnecessary DC component (noise) does not cause. The image quality in the HVQ system is more improved by the invention (


3


).




In a preferred embodiment of the invention (


3


) that is the invention (


4


), optional elements (e.g. u


16


) of base extraction vectors <u


i


> are replaced by linear bond of the remainder elements and the inner product of the base extraction vectors <u


i


> and the other optional vectors <w> are calculated by the formula.






<


w·u




i


>=(


w




1




−w




16


)


u




i


+(


w




2




−w




16


)


u




2


+ . . . +(


w




15




−w




16


)


u




15








In the invention (


4


), the sum of all elements in the base extraction vectors <u


i


> is always 0 and hence, the optional elements (e.g. u


16


) are represented by the linear bond of the remainder elements. Accordingly, the inner product calculation <w·u


i


> with the other optional vectors (w) can be expanded to the product-sum calculation as shown by the above formula, whereby a single round of such complicated calculation can be omitted. Since much inner product calculation of the vectors is conducted in the image encoding method according to the HVQ system, such single round omission of the calculation contributes to high speed encoding processing.




In a preferred embodiment of the invention (


3


) or (


4


) that is the invention (


5


), a first basis is searched so that h


i


may be maximum in the following formula,








h




i




=<d·u




i


>


2




/∥u




i





2








wherein <d> is the differential vectors and <u


i


> is the base extraction vectors.




According to the invention (


5


), such condition that square norm ∥<d>−<α


i


u


i


>∥


2


of the difference with the differential vectors <d> is minimum can be searched by the above simple calculation. Hence, the AOT processing can be carried out at high speed.




In the invention (


6


), a second basis is searched so that h


i


may be maximum in the following formula,








h




i




={<d·u




i


>−(<


d·u




1




><u




1




·u




i




>/∥u




1





2


)


2




/{∥u




i





2


−(<


u




1




·u




i


>)/∥


u




1





2


}






wherein <d> is the differential vectors, <u


1


> is the base extraction vectors corresponding to the first basis, and <u


i


> is the base extraction vectors for searching the second basis in the invention (


3


) or (


4


).




According to the invention (


6


), the AOT processing can be done more efficiently and at higher speed in addition to the advantages of the invention (


5


), because the calculation result which has been obtained in the first basis search can be used with respect to <d·u


1


> and ∥u


1


∥ of the numerator, and ∥u


i





2


and ∥u


1


∥ of the denominator.




In a preferred embodiment of the invention (


3


) or (


4


) that is the invention (


7


), a third basis is searched so that h


i


may be maximum in the following formula,








h




i


=(<


d·u




i




>−<d·v




1




><v




1




·u




i




>−<d·v




2




><v




2




·u




1


>)


2




/{∥u




i





2




−<v




1




·u




i


>


2




−<v




2




·u




i


>


2


}






wherein <d> is the differential vectors, <v


1


> is the first orthonormal base vectors, <v


2


> is the second orthonormal base vectors, and <u


i


> is the base extraction vectors for searching the third basis.




According to the invention (


7


), the AOT processing can be done more efficiently and at higher speed in addition to the advantages of the invention (


5


) and (


6


), because the calculation result which has been obtained in the first and second basis search can be used with respect to (<d·u


i


>−<d·v


1


><v


1


·u


i


>) of the numerator, and (∥u


i





2


−<v


1


·u


i


>


2


) of the denominator.




In a preferred embodiment of the invention (


6


) or (


7


) that is the invention (


8


), the base extraction vectors <u


i


> which match with search conditions are subjected to orthogonal transform with one or more preceding orthonormal basis.




That is, one orthonormal processing per each base extraction vector <u


i


>, which is adopted as the basis after the search termination at each stage is carried out, whereby the AOT processing can be done more efficiently and at higher speed.




In the image encoding method of the invention (


9


), the norm of each scalar expansion coefficient β


1


˜β


m


is rearranged in decreasing order, a difference (including 0) between norms adjacent to each other is calculated, and Huffman coding is applied to the obtained difference. In the method, the basis is represented by β


k


<u


k


>, wherein k=1˜m.




In general, the norm of each scalar expansion coefficient β


1


˜β


m


can take various value. When the value is rearranged in ascending or descending order and the difference (including 0) between norms adjacent to each other is calculated, each difference is often similar to or same as each other. The more encoding compression is possible by applying the Huffman coding to the difference value.




In the image encoding method of the invention (


10


), image data <R


j


> of coding objective block is encoded instead of the coding of the basis, where the basis is more than certain number. Accordingly, the decoded image quality is improved. In practical, it does not affect the coding compression ratio because such situation is little.




The above object of the invention can be resolved by the construction, for example, as shown in FIG.


14


. That is, the image decoding method of the invention (


11


) comprises reproducing a DC image corresponding to each block mean value per B pixel from encoding data with respect to the HVQ system, making a part of said DC image a DC nest, reproducing image data <R


j


> of target block by synthesizing, to DC value DC


J


of target block, one or more basis vectors β


k


<u


k


> which is selected from DC nests based on the encoding data, and the lowest n (n=log


2


B) bits of the DC pixel in each sample is set to 0, where the selected block is down-sampled from the DC nest and the block mean value of it is calculated using the samples.




Accordingly, any fraction less than 1 does not occurs in the block mean value and the block mean value with integer level precision is obtained at high speed.




According to the image decoding method in the invention (


12


), where the decoded basis is information with respect to β


k


<u


k


> (k=1˜m), the lowest n (n=log


2


B) bits of the DC pixel per each selected block (U


k


) to be read out from the DC nest are set to 0, product-sum calculation of basis β


k


<u


k


> (k=1˜m) is carried out, and the calculated result is divided by the number B of block pixels.




In the invention (


12


), the lowest n bits of each selected block (U


k


) are set to 0, and hence, even if these are accumulated and added, the addition result becomes multiple of integer of the block size B (e.g. 16). An expansion coefficient β


k


is an integer precision. Accordingly, if the cumulative addition result is divided by the number B of the block pixels, block mean value A


j


is efficiently obtained by one processing. Therefore, such calculation that the basis vectors β


k


<u


k


> (k=1˜m) are overlapped can be effectively carried out.




In a preferred embodiment of the invention (


11


) or (


12


) that is the invention (


13


), the lowest n bits of each DC pixel is set to 0, where DC nests are produced from the DC image, whereby processing is effectively carried out.




The image encoding apparatus of the invention (


14


) comprises producing a DC image composed of each block mean value by dividing an image data per B pixel into a block, making a part of said DC image a DC nest, and where a differential vector <d


j


> which is obtained by separating the DC value DC


J


from the pixel block to be encoded is over an allowable value Z, calculating one or more orthogonal basis ( e.g. α


k


<v


k


>), to which the differential vector <d


j


> is approximated, by the adaptive orthogonal transform (AOT) using the DC nest, and providing a memory


17


to store the DC nest in which the lowest n (n=log


2


B) bits of the DC pixel are set to 0.




The image decoding apparatus of the invention (


15


) comprises reproducing a DC image corresponding to each block mean value per B pixel from encoding data with respect to the HVQ system, making a part of said DC image a DC nest, reproducing image data<R


j


> of target block by synthesizing, to the DC value DC, of target block, one or more basis vectors β


k


<u


k


> which is selected from DC nests based on the encoding data, and providing a memory


49


to store the DC nest in which the lowest n (n=log


2


B) bits of the DC pixel are set to 0.




The recording medium of the invention (


16


) comprises a computer readable recording medium storing a program to make a computer to implement the processing described in one of the invention (


1


) to (


13


).











BRIEF DESCRIPTION OF THE DRAWINGS





FIG. 1

is a block diagram showing a conventional image encoder;





FIG. 2

is a flow chart of a conventional adaptive orthogonal transform processing;





FIG. 3

is an image drawing of the conventional adaptive orthogonal transform processing;





FIG. 4

is an image drawing of a conventional mean value separation processing;





FIG. 5

is an explanatory drawing of the principle of the invention;





FIG. 6

is a block diagram showing an image encoder, which is an embodiment of the invention;





FIG. 7

is a flow chart showing a main image encoding processing which is an embodiment of the invention;





FIG. 8

is a flow chart (


1


) showing an adaptive orthogonal transform processing which is an embodiment of the invention;





FIG. 9

is a flow chart (


2


) showing an adaptive orthogonal transform processing which is an embodiment of the invention;





FIG. 10

is a flow chart (


3


) showing an adaptive orthogonal transform processing which is an embodiment of the invention;





FIG. 11

is an explanatory drawing (


1


) of a DC nest, which is an embodiment of the invention;





FIG. 12

is an explanatory drawing (


2


) of a DC nest, which is an embodiment of the invention;





FIG. 13

is an image drawing of a compression encoding processing of the expansion coefficient;





FIG. 14

is ablock diagram showing an image decoder, which is an embodiment of the invention;





FIG. 15

is a flow chart showing an image decoding processing which is an embodiment of the invention; and





FIG. 16

is an image drawing of an alternating current component prediction, which is an embodiment of the invention.











DETAILED DESCRIPTION OF THE INVENTION




Referring to the drawings, suitable embodiments of the invention will be explained in detail. The same sign indicates same or corresponding part through whole drawings.




In

FIG. 6

which is a block diagram showing an embodiment of image encoding apparatus in the invention,


31


is a DC nest production unit which produces the DC nest from a decoding DC image according to the invention,


17


is a DC nest memory which stores the produced DC nest,


32


is an adaptive orthogonal transform (AOT) processing unit which effectively implements AOT processing at high speed,


33


is a coefficient transform unit, and


34


is an encoding unit which can make an expanding coefficient β


k


higher compression. The other construction is same as in FIG.


1


. The feature of each unit will be apparent from the following explanation of behavior.




In

FIG. 7

, which is a flow chart showing a main image encoding processing, which is an embodiment of the invention, an original image data is input in a original image memory


11


at step S


1


. For example, an objective image of R.G.B. is converted to an image of Y.U.V., which is input in the memory


11


. Y is a brightness data, U and V are color difference data. U and V are down-sampled using a brightness mean of 2 pixels in a horizontal direction. As an example, the brightness data Y is composed of vertical 960×horizontal 1280 pixels and, for example, 8 bits are allotted to each pixel. The processing of the brightness data Y will be mainly explained in the following but U and V are similarly processed.




A block mean (DC) value of every 4×4 pixels with respect to all image data is calculated at step S


2


. The first decimal place is round off at the time. All DC values are encoded by conventional two-dimensional DPCM method, etc. and are output at step S


3


. At step S


4


, all DPCM outputs are decoded by IDPCM method to reproduce the DC images, which are stored in a DC image memory


15


. This is done to equalize AOT processing conditions in the encoding side with that in the decoding side. At step S


5


, the DC nest is reproduced from the DC images in the DC nest production unit


31


, which is stored in the DC nest memory


17


. A range from which the DC nest is cut can be selected by the same manner as conventional one.




In FIG.


11


(


a


), the lowest 4 bits of each DC pixel DC


J


cut from the DC image memory


15


are masked (are set to 0) which are stored in a nest pixel N


J


of the DC nest memory


17


. The lowest 4 bits are in relation with 2


4


=B (B=block size 16) or 4=log


2


B. As such result that the lowest 4 bits are masked, the sum of base extraction block <U


i


> is always multiple of integer and a block mean value a


i


which is {fraction (1/16)} of the sum is always an integer. Accordingly, the base extraction vectors <u


i


> which are obtained by separating the block mean value a


i


from the base extraction block <U


i


> are always 0.




In FIGS.


11


(


a


) and (


b


), graphs of concrete values are shown as example, in which mean of 4 pixels is used for simplifying the explanation. In FIG.


11


(


c


), even if a plurality of basis vectors β


k


<u


k


> are cumulatively added to the DC pixel DC


J


of a decoding block <R


j


>, a noise is not overlapped as usual because the block mean value of each basis vectors β


k


<u


k


> is always 0, whereby image quality can be much improved.




The examples of the value in

FIG. 11

are shown in FIG.


12


(


a


). The sum of the DC pixels A to D is 251 and its mean is 251/4=62.75 (non-integer). The lowest 4 bits are masked when the DC pixels A to D are transmitted to the nest pixels A to D, whereby the sum of the nest pixels A to D is 224 and its mean value AV is 224/4=56 (integer). Each element a to d of the base extraction vectors <u


i


> which is obtained by separating the mean value 56 of the nest pixels from the nest pixels becomes 24, −24, 8 and −8, respectively. The sum of these elements is 0 (complete mean value separation).




The same value as in FIG.


12


(


a


) is shown in FIG.


12


(


b


) except that the DC pixels A to D are copied into the nest pixels A to D and the lowest 4 bits are masked from the sum of the nest pixels A to D. According to the method, the sum is the multiple of 16 and the block mean value is 60 (integer). However, according to the method, each element a to d of the base extraction vectors <u


i


> which is obtained by separating the mean value 60 of the nest pixels from the nest pixels A to D becomes 33, −25, 13 and −10, respectively. The sum of these elements is not 0 (complete mean value separation).




As shown in FIG.


12


(


b


), after a part of the DC images is copied into the nest pixels A to D, the lowest 4 bits may be masked from each pixel when the base extraction block <U


i


> is down-sampled from the DC nest.




Turning to

FIG. 7

, each index counter j, J to the original image memory


11


and the DC image memory


15


is initialized to 0 at step S


6


, wherein j indicates an index counter of the target block <R


j


> which is encoding object, and J indicates an index counter of the DC pixel. At step S


7


, the differential vector <d


j


> is obtained by separating a corresponding decoding DC value DC


f


from the target block <R


j


>. At step S


8


, it is judged whether the square norm ∥d


j





2


of the differential vector is more than the allowable error Z or not. In case that ∥d


j


μ


2


is not more than Z, 0 is output as the number of the basis at step S


17


. In this case, the target block <R


j


> is decoded by alternating current component prediction method as described hereinafter. In case that ∥d


j





2


is more than Z, the adaptive orthogonal transform (AOT) processing method as described hereinafter is carried out at step S


9


.




At step S


10


, it is judged whether the number of the basis k produced by the adaptive orthogonal transform is more than 4 or not. According to the actual measurement, statistic result of k=1 to 3 has been obtained in most cases. Therefore, in case that k is more than 4, “5” is code-output as the number of the basis at step S


18


and each pixel value of the target block <R


j


> is output. In case that k is not more than 4, conversion to expanding coefficient β


k


is carried out as described hereinafter at step S


11


. At step S


12


, the basis number m, the expanding coefficient β


k


and the index information i of non-orthogonal basis vector <u


i


> each is code-output at step S


12


.




At step S


13


, “1” is added to the counters j and J, respectively. In the step, an addition of 1 to the counter j means renewal of onepixel block. It is judged at step S


14


whether j is not less than M (the number of all image blocks) or not. In case that j is less than M, turning to step S


7


and same encoding processing is repeated with respect to a next target block <R


j


>, followed by same steps. It is judged at step S


14


that j is not less than M, then encoding processing, for example, by Huffman method is carried out at step S


15


as described hereinafter. Thus, encoding processing of one pixel is terminated.




In

FIGS. 8

to


10


, each of which is a flow chart (


1


), (


2


) or (


3


) of the adaptive orthogonal transform processing, it is shown that the minimum necessary number of orthogonal basis α


k


<v


k> (k=


1˜m) is effectively obtained at high speed. In the following explanation, the initial differential vector <d


j


> obtained at the step S


7


is represented by <d> and the differential vector to be renewed later is represented by <d


k


> (k=1˜m).




A search processing of first basis is shown in FIG.


8


. Before explanation of the processing, an idea on calculation for high speed processing will be explained. That is, the first basis is usually obtained as base extraction vector <u


j


>, which makes a square norm e


i


of the difference between the first basis and a differential vector <d> minimum, and is represented by the formula (7).




[Numeral 7]













e
i

=


&LeftDoubleBracketingBar;

d
-





d
·

u
i






&LeftDoubleBracketingBar;

u
i

&RightDoubleBracketingBar;

2




u
i



&RightDoubleBracketingBar;

2







=



&LeftDoubleBracketingBar;
d
&RightDoubleBracketingBar;

2

-

2






d
·

u
i




2



&LeftDoubleBracketingBar;

u
i

&RightDoubleBracketingBar;

2



+






d
·

u
i




2



&LeftDoubleBracketingBar;

u
i

&RightDoubleBracketingBar;

4





&LeftDoubleBracketingBar;

u
i

&RightDoubleBracketingBar;

2









=




&LeftDoubleBracketingBar;
d
&RightDoubleBracketingBar;

2

-






d
·

u
i




2



&LeftDoubleBracketingBar;

u
i

&RightDoubleBracketingBar;

2








u
i



=

u
i










(
7
)













The first item ∥d∥


2


of the right side in the formula (7) which is more than 0 is independent of an extracted basis and hence, <u


i


> that makes the second item of the right side in the formula (7) maximum can be the first basis. The second item h


i


of the right side is represented by the formula (8).




[Numeral 8]










h
i

=





d
·

u
i




2



&LeftDoubleBracketingBar;

u
i

&RightDoubleBracketingBar;

2






(
8
)













A processing for searching and judging the first basis α


k


<v


k


> which makes h


i


maximum is explained. At step S


22


, the fifteen dimensional vector <d′> is obtained by subtracting the sixteenth component of <d> from the remaining components as a preprocessing to inner product calculation <d·u


i


> as described hereinafter. At step S


22


, the inner product <d′·u


1


> of h


i


numerator is obtained with respect to i=0˜(N−1) and is stored in an arrangement [P


i


] {i=0˜(N−1)}.




More concretely, <u


i


> is sixteen dimensional vector, but its sixteenth component u


16


can be represented by linear bond of the remaining fifteen components because the block mean value (sum of all elements) is 0.




[Numeral 9]








u




1




=[u




1




, u




2




, u




3




, . . . , u




16




]u




1




u




2




+ . . . +u




16


=0


u




16


=−(


u




1




+u




2




+ . . . +u




15


)  (9)






Accordingly, the inner product <d·u


i


> of h


i


numerator can be calculated from <d′·u


i


> equivalent thereto, whereby one product/sum calculation can be omitted which corresponds to 8192 calculations with respect to total of i.




[Numeral 10]















d
·

u
i




=



d
1



u
1


+


d
2



u
2


+





+


d
15



u
15


-


d
16



(


u
1

+

u
2

+





+

u
15


)









=



(


d
1

-

d
16


)



u
1


+


(


d
2

-

d
16


)



u
2


+





+


(


d
15

-

d
16


)



u
15









=




d




·

u
i












(10-1)












d′


=[(


d




1




−d




16


), (


d




2




−d




16


), . . . , (


d




15




−d




16


)]  (10-2)




At step S


23


, the square norm ∥u


i





2


of h


i


denominator is obtained with respect to i=0˜(N−1) and is stored in an arrangement [L


i


] {i=0˜(N−1)}.




[Numeral 11]









u




i





2




=u




1




2




+u




2




2




+ . . . +u




16




2


  (11)






The arrangement [L


i


] is repeatedly used. At step S


24


, a register E=0 storing a maximum value of h


i


, an index counter i=0 of the base extraction vector <u


i


> and a basis number counter k=1 are initialized, respectively.




At step S


25


, a value for h


i


=P


i




2


/L


i


is calculated. Step S


26


, it is judged whether h


i


is more than E or not. In case that h


i


is more than E, E is renewed by h


i


at step S


27


and i is held in an arrangement [I


k


] (k=1). In case that h


i


is not more than E, the processing at Step S


27


is skipped.




At step S


28


, 1 is added to i and at step S


29


, it is judged whether i is not less than N (total extraction numbers) or not. In case that i is less than N, turning to step S


25


and maximum value search processing is carried out with respect to next h


i


similar to above.




The same processing is repeated and the search of all nest blocks is terminated when i is not less than N. At the time, the index value i of the first basis vector <u


i


> which makes h


i


maximum is held in an arrangement [I


k


].




At step S


30


, the first basis vector <u


i


> is normalized to be a normalized basis vector <v


i


> which is stored in an arrangement [V


k


] (k=1). And, a scalar coefficient α


1


(projection shadow of <d> on <v


i


> is calculated and is stored in an arrangement [A


k


] (k=1).




At step S


31


, the differential vector <d> is approximated to the first basis and is renewed by the differential vector <d


1


>=<d>−α


1


<v


1


>. At step S


32


, a square norm e=∥u


1





2


of new differential vector is calculated and at step S


33


, it is judged whether e is not more than Z or not. In case that e is not more than Z, the AOT processing is terminated at the step. In case that e is more than Z, the search processing of the second basis is carried out.




A search processing of the second basis is shown in FIG.


9


. Before explanation of the processing, an idea on efficient calculation will be explained. That is, the second basis is usually obtained as orthogonal vector <u


j


′> which makes a square norm e


i


of the difference between the second basis and a differential vector <d


1


> minimum, and is represented by the formula (12).




[Numeral 12]













e
i

=


&LeftDoubleBracketingBar;


d
1

-






d
1

·

u
i







&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

2




u
i




&RightDoubleBracketingBar;

2







=



&LeftDoubleBracketingBar;

d
1

&RightDoubleBracketingBar;

2

-

2







d
1

·

u
i





2



&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

2



+







d
i

·

u
i





2



&LeftDoubleBracketingBar;

u
1


&RightDoubleBracketingBar;

4





&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

2









=



&LeftDoubleBracketingBar;

d
1

&RightDoubleBracketingBar;

2

-






d
1

·

u
i





2



&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

2










(
12
)













The orthogonal vector <u


i


′> is obtained by orthogonal transform of the second base extraction vector <u


i


> to the first normalized basis vector <v


1


>.




[Numeral 13]










u
i


=



u
i

-






u
i

·

v
1






&LeftDoubleBracketingBar;

v
1

&RightDoubleBracketingBar;

2




v
i



=


u
i

-





u
i

·

v
1






v
1








(
13
)













The first item ∥d∥


2


of the right side in the formula (12) which is more than 0 is independent of an extracted basis and hence, <u


i


′> that makes the second item of the right side in the formula (12) maximum can be the second basis. The second item h


i


of the right side is represented by the formula (14).




[Numeral 14]










h
i

=






d
1

·

u
i





2



&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

2






(
14
)













According to the formula (14), h


i


can be calculated but the denominator of the formula (14) may be transformed in order to effectively utilize the calculation result in FIG.


8


. That is, if the orthogonal vector <u


i


′> of the h


i


numerator is represented by the base extraction vector <u


i


>, the h


i


numerator can be represented by the formula (15).




[Numeral 15]

















d
1

·

u
i





2

=





d
1

·




(


u
i

-




u
i

·

v
1





)



v
1


)




2







=


(





d
1

·

u
i




-



d
1

·




u
i

·

v
1







v
1



)

2







=







d
1

·

u
i




2










d
1

·

v
1





=
0








(
15
)













Further, if the differential vector <d


1


> of the formula (15) is represented by the first differential vector <d>, the h


i


numerator can be represented by the formula (16).




[Numeral 16]

















d
1

·

u
i




2

=





(

d
-




d
·

v
1






v
1



)

·

u
i




2







=


(




d
·

u
i




-




d
·

v
1









v
1

·

u
i






)

2







=


(




d
·

u
i




-





d
·

u
1





&LeftDoubleBracketingBar;

u
1

&RightDoubleBracketingBar;








u
1

·

u
i





&LeftDoubleBracketingBar;

u
1

&RightDoubleBracketingBar;




)

2








(
16
)













Accordingly, the calculation result <d·u


1


> which is obtained in the search of the first basis can be used for calculation of the h


i


numerator. Also, when the h


i


denominator is transformed, it can be represented by the formula (17).




[Numeral 17]













&LeftDoubleBracketingBar;

u
1


&RightDoubleBracketingBar;

=


&LeftDoubleBracketingBar;


u
i

-





u
i

·

v
1






v
1



&RightDoubleBracketingBar;

2







=



&LeftDoubleBracketingBar;

u
i

&RightDoubleBracketingBar;

2

-

2






u
i

·

v
1




2


+






u
i

·

v
1




2




&LeftDoubleBracketingBar;

v
1

&RightDoubleBracketingBar;

2









=



&LeftDoubleBracketingBar;

u
i

&RightDoubleBracketingBar;

2

-





u
i

·

v
1




2








=



&LeftDoubleBracketingBar;

u
i

&RightDoubleBracketingBar;

2

-


(





u
i



·

u
1






&LeftDoubleBracketingBar;

u
1

&RightDoubleBracketingBar;


)

2









(
17
)













Accordingly, The calculation result ∥u


i





2


, ∥u


1


∥ which is obtained in the first basis search can be used in the calculation of the h


i


numerator. When h


i


is placed in the formula (14), it can be represented by the formula (18-1) and finally by the formula (18-2).




[Numeral 18]










h
i

=



(




d
·

u
i




-





d
·

u
1





&LeftDoubleBracketingBar;

u
1

&RightDoubleBracketingBar;












u
1

·

u
i





&LeftDoubleBracketingBar;

u
1

&RightDoubleBracketingBar;




)

2




&LeftDoubleBracketingBar;

u
i

&RightDoubleBracketingBar;

2

-


(





u
i

·

u
1





&LeftDoubleBracketingBar;

u
1

&RightDoubleBracketingBar;


)

2







(18-1)











=



(

P
-



P
k



I
k













u
k

·

u
i






L
k





)

2



L
i

-


(





u
k

·

u
i






L
k



)

2












(18-2)













A calculating result of the arrangement [P


i


], [L


i


] can be used for P


i


=<d


1


·u


i


>, L


i


=∥u


1





2


, respectively and the preceding result can be used for P


k


=P


i


=<d·u


1


>, {square root over ( )}L


k


={square root over ( )}L


1


=∥u


1


∥. Accordingly, it is in a part of <u


k


·u


i


>=<u


1


·u


i


> that a calculation is newly required.




Based on the background as above, a search of the second basis is carried out by the following calculation. That is, at step S


41


, P


1


=<d·u


1


> and L


1


=∥u


1





2


are held as k=1. The result obtained in steps S


22


and S


23


can be used. The numeral “1” of P


1


means the first basis <u


1


> in the index counter i and is held in an arrangement [I


k


] at step S


27


. At step S


42


, a calculation is carried out by the formula (19) and a result is stored in a register η, κ.




[Numeral 19]









η
=



1


L
k








κ

=


P
k


η






(
19
)













At step S


43


, the fifteen dimensional vector <w


1


> is obtained by subtracting the sixteenth component of <u


1


> from the remaining components as the preprocessing of inner product calculation <u


1


·u


i


> as described below. At step S


44


, an inner product <w


k


·u


i


> is calculated with respect to i=0˜(N−1) and is stored in an arrangement [Q


i


]. At step S


45


, (P


i


−κQ


i


) is calculated with respect to i=0˜(N−1) and is stored in and written in over an arrangement [P


i


]. The calculation result at step S


45


is stored in (written in over) an arrangement [P


i


], whereby contents of the arrangement [P


i


] are gradually renewed on the past calculation result. Further, at step S


46


, (L


i


−Q


i




2


) is calculated with respect to i=0˜(N−1) and is stored in and written in over an arrangement [L


i


]. L


i


in the right side is a result of the calculation at step S


23


. The calculation result at step S


46


is stored in and written in over an arrangement [L


i


] at step S


23


, whereby contents of the arrangement [L


i


] are gradually renewed on the past calculation result. The repeated calculation of h


i


is finally represented by the formula (20).




[Numeral 20]










h
i

=




(


P
i

-

κ






Q
i



)

2



L
i

-

Q
i
2



=


P
i
2


L
i







(
20
)













At step S


24


, a register E=0 holding a maximum value of h


i


and an index counter i=0 of the base extraction vector <u


i


> are initialized, respectively and “1” is added to a basis number counter k to be k=2.




At step S


48


, h


i


=P


i




2


/L


i


is calculated. At step S


49


, it is judged whether h


i


is more than E or not In case that h


i


is more than E, E is renewed by h


i


at step S


50


and i is stored in an arrangement [I


k


] (k=2). In case that h


i


is not more than E, the processing at step S


50


is skipped.




At step S


51


, “1” is added to i and at step S


52


, it is judged whether i is not less than N or not. In case that i is less than N, turning to step S


42


and the maximum value search processing is carried out with respect to subsequent h


i


. When the same procedure was proceeded and i is not less than N, the search of the all nest blocks are terminated. At the time, the index value of the second basis vector <u


i


> to make h


i


maximum is held in an arrangement [I


k


] (k=2).




At step S


53


, the second basis vector <u


2


> is subjected to orthonormal with <v


1


> to be a normalized basis vector <v


2


> which is stored in an arrangement [V


k


] (k=2). A scalar coefficient α


2


which is a shadow of <d


1


> projected to <v


2


> is calculated and is stored in an arrangement [A


k


] (k=2). The ortho normalization of the basis vector <u


2


> and the calculation of the scalar coefficient α


2


are carried out at one time with respect to the above search result, whereby the AOT processing is much simplified at high speed.




At step S


54


, the differential vector <d


1


> is closed to the second basis and is renewed by the differential vector <d


2


>=<d


1


>−α


2


<v


2


>. At step S


55


, a square norm e=∥u


2





2


of new differential vector is calculated and at step S


56


, it is judged whether e is not more than Z or not. In case that e is not more than Z, the AOT processing is terminated at the step. In case that e is more than Z, the search processing of the third basis is carried out.




A search processing of the second basis is shown in FIG.


6


. Before explanation of the processing, an idea on efficient calculation will be explained. That is, the third basis is usually obtained as orthogonal vector <u


j


′> which makes a square norm e


i


of the difference between the second basis and a differential vector <d


2


> minimum, and is represented by the formula (21).




[Numeral 21]













e
i

=


&LeftDoubleBracketingBar;


d
2

-






d
2

·

u
i







&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

2




u
i




&RightDoubleBracketingBar;

2







=



&LeftDoubleBracketingBar;

d
2

&RightDoubleBracketingBar;

2

-

2






d
2

·

u
i







&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

2



+






d
2

·

u
i







&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

4





&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

2









=



&LeftDoubleBracketingBar;

d
2

&RightDoubleBracketingBar;

2

-






d
2

·

u
i





2



&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

2










(
21
)













The orthogonal vector <u


i


′> is obtained by orthogonalization of the third base extraction vector <u


i


> to the first normalized basis vector <v


1


> and the second normalized basis vector <v


2


>.




[Numeral 22]








u




i




′=u




j




−<u




i




·v




1




>v




1




−<u




i




·v




2




>v




2


  (22)






The first item ∥d


2





2


of the right side in the formula (21) which is more than 0 is independent of an extracted basis and hence, <u


i


′> that makes the second item of the right side in the formula (21) maximum becomes the third basis. The second item h


i


of the right side is represented by the formula (23).




[Numeral 23]










h
i

=






d
2

·

u
i





2



&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

2






(
23
)













If the orthogonal vector <u


i


′> of the h


i


numerator is represented by the base extraction vector <u


i


>, the h


i


numerator can be represented by the formula (24).




[Numeral 24]

















d
2

·

u
i





2

=





d
2

·

(


u
i

-





u
i

·

v
1






v
1


-





u
1

·

v
2






v
2



)




2







=


(





d
2

·

u
i




-





d
2

·

v
1









u
i

·

v
1





-





d
2

·

v
2









u
i

·

v
2






)

2







=







d
2



·

u
i





2










d
2

·

v
1





=


0









d
2

·

v
2





=
0









(
24
)













Further, if the differential vector <d


2


> of the formula (24) is represented by the first differential vector <d>, the h


i


numerator can be represented by the formula (25).




[Numeral 25]

















d
2

·

u
i




2

=









(

d
-



d
·

v
1





)



v
1


-




d
·

v
2






v
2



)

·

u
i




2







=


(





d
2

·

u
i




-




d
·

v
1









v
1

·

u
i





-




d
·

v
2









v
2

·

u
i






)

2







=


(




d


·

u
i





-





d
·

u
1





&LeftDoubleBracketingBar;

u
1

&RightDoubleBracketingBar;












u
1

·

u
i





&LeftDoubleBracketingBar;

u
1

&RightDoubleBracketingBar;



-





d
·

u
2






&LeftDoubleBracketingBar;

u
2


&RightDoubleBracketingBar;












u
2


·

u
i





&LeftDoubleBracketingBar;

u
2


&RightDoubleBracketingBar;




)

2








(
25
)













Also, when the h


i


denominator is transformed, it can be represented by the formula (26).




[Numeral 26]














&LeftDoubleBracketingBar;

u
i


&RightDoubleBracketingBar;

2

=


&LeftDoubleBracketingBar;


u
i

-





u
i

·

v
1






v
1


-





u
i

·

v
2






v
2



&RightDoubleBracketingBar;







&LeftDoubleBracketingBar;


u
i

-





u
i

·

v
1






v
1


-





u
i

·

v
2






v
2



&RightDoubleBracketingBar;








=



&LeftDoubleBracketingBar;

u
i

&RightDoubleBracketingBar;

2

-





u
i

·

v
1




2

-





u
i

·

v
2




2









(
26
)













When h


i


is placed in the formula (23), it can be represented by the formula (27).




[Numeral 27]







h
i

=



(




d
·

u
i




-




d
·

v
1










(


v
1

·

u
i


)


-




d
·

v
2









v
2

·

u
i






)

2




&LeftDoubleBracketingBar;

u
i

&RightDoubleBracketingBar;

2

-





u
i

·

v
1




2

-





u
i

·

v
2




2













The second item of numerator and denominator in the formula (27) has been already calculated and is represented by the formula (28).








P




i




=<d·u




i




>−<d·v




1




><v




1




·u




i


>  (28-1)










I




1




=∥u




i





2




−<u




i




·v




1


>


2


  (28-2)






Accordingly, h


i


is represented by the formula (29) in the same manner as in the formula (18-2).




[Numeral 29]










h
i

=



(


P
i

-



P
k



L
k









v
k

·

u
i






L
k





)

2



L
i

-


(





v
k

·

u
i






L
k



)

2







(
29
)













The formula (29) is same form as the formula (18-2) except that the inner product <u


k


·u


i


> is changed to <v


k


·u


i


>. Accordingly, each basis hereinafter can be effectively obtained by repeating the same operation as in FIG.


5


.




Based on the above processing, search of the third and following basis is calculated as follows. That is, P


2


=<d


1


·u


2


> and L


2


=∥u


2





2


are hold by k=2 at stet S


61


. At step S


62


, calculation is carried out according to the formula (30) and a result is stored in the registers η and κ.




[Numeral 30]









η
=



1


L
k








κ

=


P
k


η






(
30
)













At step S


63


, the fifteen dimensional vector <w


2


> is obtained by subtracting the sixteenth component of <v


2


> from the remaining components as the preprocessing of inner product calculation <v


2


·u


i


> as described below. Since each component of <v


2


> is not an integer, it is necessary that an inner product is calculated in the form of real number. In order to avoid the calculation in the form of real number, each component of <v


2


> (i.e.<w


2


>) is multiplied by a constant “a” to make it an integer, in advance.




At step S


64


, an inner product <w


2


·u


i


>η/a is calculated with respect to i=0˜(N−1) and is stored in (written in over) an arrangement [Q


i


]. At the time, each calculation result is divided by the constant a to put the position of a figure to the former position.




At step S


65


, (P


i


−κQ


i


) is calculated with respect to i=0˜(N−1) and is stored in (written in over) an arrangement [P


i


]. At step S


46


, (L


i


−Q


i




2


) is calculated with respect to i=0˜(N−1) and is stored in (written in over) an arrangement [L


i


]. The calculation of the formula (29) is represented by the formula (31).




[Numeral 31]










h
i

=




(


P
i

-

κ






Q
i



)

2



L
i

-

Q
i
2



=


P
i
2


L
i







(
31
)













At step S


67


, a register E=0 holding a maximum value of h


i


and an index counter i=0 of the base extraction vector <u


i


> are initialized, respectively and “1” is added to a basis number counter k to be k=3.




At step S


68


, h


i


=P


i




2


/L


i


is calculated. At step S


69


, it is judged whether h


i


is more than E or not. In case that h


i


is more than E, E is renewed by h


i


at step S


70


and i is held in an arrangement [I


k


] (k=3). In case that h


i


is not more than E, the processing at step S


70


is skipped.




At step S


71


, “1” is added to i and at step S


72


, it is judged whether i is not less than N or not. In case that i is less than N, turning to step S


68


and the maximum value search processing is carried out with respect to subsequent h


i


. When the same procedure was proceeded and i is not less than N, the search of the all nest blocks are terminated. At the time, the index value of the third basis vector <u


3


> to make h


i


maximum is held in an arrangement [I


k


] (k=3).




At step S


73


, the third basis vector <u


3


> is subjected to orthonormal transform with <v


1


> and <v


2


> to be a normalized basis vector <v


3


> which is stored in an arrangement [V


k


]. A scalar coefficient α


3


which is a shadow of <d


2


> projected to <v


3


> is calculated and is stored in an arrangement [A


k


].




At step S


74


, the differential vector <d


2


> is approximated to the third basis and is renewed by the differential vector <d


3


>=<d


2


>−α


3


<v


3


>. At step S


75


, a square norm e=∥d


3





2


of new differential vector is calculated and at step S


76


, it is judged whether e is not more than Z or not. In case that e is not more than Z, the AOT processing is terminated at the step. In case that e is more than Z, turning to the step S


61


and the preprocessing and search processing of the fourth and following basis are carried out. It is preferred that the processing to judge whether k is not less than 4 or not is provided (not shown) after the step S


76


, whereby the AOT processing can be skipped in case that k is not less than 4.




The AOT processing can be much simplified can be carried out at high speed by the above processing or operation. The actual calculation time is reduced to ⅓ to {fraction (1/10)} in comparison with conventional methods.




Referring to

FIG. 6

, a group of α


k


<v


k


>(k=1˜m) is obtained from AOT


32


and the differential vector <d


j


> is approximated within the allowable error Z by the linear bond.




Further, in the coefficient transform unit


33


, the expansion coefficient β


k


is obtained to transform the group of α


k


<v


k


>(k=1˜m) to β


k


<u


k


>(k=1˜m) by the following method. That is, when each matrix of the base extraction vector <u


k


>, the expansion coefficient β


k


, the orthonormal basis vector <v


k


> and the scalar coefficient α


k


is represented by the formula (32)




[Numeral 32]










U
=



[


u
1

,

u
2

,








u
m



]






B

=

[




β
1






β
2











β
m




]









V
=



[


v
1

,

v
2

,








v
m



]






A

=

[




α
1






α
2











α
m




]







(
32
)













a relationship of the matrix is represented by the formula (33).




[Numeral 33]






UB=VA  (33)






In order to solve the formula with respect to the matrix B, both sides of the formula (33) is multiplied from the left by a transposed matrix U


T


of the matrix U as shown by the formula (34).




[Numeral 34]






U


T


UB=U


T


VA  (34)






The matrix (U


T


U) is expanded to be the formula (35).




[Numeral 35]














U
T


U

=


[




u
1






u
2











u
nk




]





[


u
1

,

u
2

,








u
nk



]







=

[





u
1

·

u
1






u
1

·

u
2









u
1

·

u
nk








u
2

·

u
1






u
2

·

u
2









u
2

·

u
nk






















u
nk

·

u
1






u
nk

·

u
2









u
nk

·

u
nk





]








(
35
)













In the formula (35), <u


i


·u


j


> means an inner product, and a square matrix which is a symmetrical to a diagonal element is obtained because <u


i


·u


j


> is equal to <u


j


·u


i


>, and an inverse matrix exists because <u


i


> is different from <u


j


>. Therefore, the inverse matrix (U


T


U)


−1


of the matrix (U


T


U) is multiplied from the left of both sides of the formula to obtain the formula (36) and β


k


is calculated.




[Numeral 36]






(U


T


U)


−1


U


T


UB=B=(U


T


U)


−1


U


T


VA  (36)






As explained above, it is unnecessary by transforming the group of the orthonormal basis α


k


<v


k


> (k=1˜m) to the non-orthonormal basis β


k


<u


k


> (k=1˜m) that each base extraction vector <u


k


> is subjected to orthogonal transform in decoding side every time, and the differential vector <d


j


> can approximate by adding a multiplied value of them and β


k


. Thus, the decoding processing can be simply carried out at high speed.




A compression encoding processing of the expansion coefficient β


k


will be explained.





FIG. 13

is an image drawing of a compression encoding processing of the expansion coefficient. In FIG.


13


(


a


), a norm is extracted from the produced β


1


˜β


4


. In FIG.


13


(


b


), a norm is arranged, for example, in ascending order (β


3


, β


2


, β


4


, β


1


) and a difference (Δβ


3


, Δβ


2


, Δβ


4


, Δβ


1


) is calculated. In FIG.


13


(


c


), the upper bits are separated by removing the lowest two bits from all bits in the difference of coefficient (Δβ


3


, Δβ


2


, Δβ


4


, Δβ


1


) and are subjected to Hoffman encoding.




In the example, two groups of Δβ


3


and (Δβ


2


, Δβ


4


, Δβ


1


) exist with respect to the value, and according to Huffman encoding, a code sign of less bit numbers is allotted to (Δβ


2


, Δβ


4


, Δβ


1


) of which generating frequency is more and a code sign of more bit numbers is allotted to Δβ


3


of which generating frequency is less. Accordingly, the compression encoding of expansion coefficient β


k


is possible. Also, fractions of the lowest bits are omitted by Huffman encoding of the upper bits in difference Δβ


k


of the coefficient, whereby possibility of Δβ


2


=Δβ


4


=Δβ


1


is high in the upper bits as shown in FIG.


13


(


c


).




The lowest two bits of difference Δβ


k


is packed with positive and negative code sign bits and an index information (13 bits=0˜8191) of the basis vectors <u


k


> corresponding to the sign bits in a code sign area of 2 bites fixed length and is output as the fixed length code sign. The output is carried out in the order of Δβ


3


, Δβ


2


, Δβ


4


and Δβ


1


(i.e. u


3


, u


2


, u


4


, u


1


).




In FIG.


13


(


d


), each code sign is input in the order of u


3


, u


2


, u


4


and u


1


in decoding side, from which each of the coefficient Δβ


3


, Δβ


2


, Δβ


4


and Δβ


1


is separated. Further, β


3


is decoded from the first Δβ


3


, β


2


is decoded by adding Δβ


2


to the decoded β


3


, β


4


is decoded by adding Δβ


4


to the decoded β


2


, and then β


1


is decoded by adding Δβ


1


to the decoded β


4


. The decoding order is not important because β


k


<u


k


> is functioned based on the sum (linear bond) of these values.




The difference can be calculated by arranging the norm in descending order instead of the ascending order.




The coding processing by the encoding unit


34


will be explained. A prediction difference Δ


DC






J, I




of DPCM is quantized by a quantization coefficient Q (Z), and only in case that Δ


DC






J, I




is 0, run length coding is considered, and the prediction difference Δ


DC






J,I




and the run length coding each is independently subjected to Huffman coding. Only in case that the basis number k is 0, the run length coding is considered, and the basis number k and the run length each is independently subjected to Huffman coding. The coefficient difference Δβ


k


is quantized by a constant number Q (e.g. 8) to obtain its quotient, which is subjected to Huffman coding. The code signbits of the expansion coefficient β


k


and the lowest two bits of the coefficient difference Δβ


k


are incorporated in the code information i (=13 bits) of the basis vector <u


k


> to make the fixed length coding sign of 16 bits, which are incorporated in the coefficient difference Δβ


k


in ascending (or descending) order and is transmitted. As a whole, row of the coding sign is constituted by incorporating these in appearing order per unit of pixel block. If necessary, a sign EOB is input to show change of pixel blocks.





FIG. 14

is a block diagram showing an image decoder, which is an embodiment of the invention, and corresponds to the image encoder as shown in FIG.


6


. In

FIG. 14

,


41


is a decoding unit by Huffman, etc.,


42


is an alternating current component prediction unit for predicting target blocks <R


j


> containing the alternating current component from the surrounding DC values DC


J


′ containing the noticeable pixels DC


J


,


43


is the differential vector reproduction unit for reproducing an approximate differential vector <d


j


> based on the decoding basis β


k


<u


k


> (k=1˜m),


44


is the R


j


reproduction unit for reproducing target blocks <R


j


> based on the decoding blocks <R


j


>,


45


is the reproduced image memory,


46


is the IDPCM unit for IDPCM decoding the decoded DC value,


47


is the DC image memory for storing the DC nest,


48


is the DC nest production unit which is same as in

FIG. 2

,


49


is the DC nest memory for storing the DC nest,


50


is the selected block buffer for holing the selected blocks <U


k


> which are down-sampled from the DC nest,


51


is a multiplier for multiplying <U


k


> by β


k


,


52


and


53


are the cumulative addition unit of β


k


<u


k


> (k=1˜m),


54


is a means for obtaining a block mean value A


j


of cumulative addition values,


55


is a subtractor for separating the block mean value A


j


of cumulative addition values,


56


is an approximate vector buffer for holding reproduced approximate differential vector <d


j


>, and


57


is a means for adding the reproduced approximate differential vector <d


j


>.




In

FIG. 15

, which is a flow chart showing an image decoding processing of an embodiment of the invention, the image coding data is input at step S


101


. At step S


102


, each DC value in Y, U and V is decoded by IDPCM method similar to FIG.


6


and DC images are reproduced. At step S


103


, DC nest is produced from the DC value of Y component. At the time, as shown in

FIG. 7

, the lowest four bits of each DC pixel value DC


J


are masked to be each DC nest pixel value N


J


. The information such as cut position of the DC images is separately received. At step S


104


, the index counters j and J to the original image memory


45


and DC image memory


47


are initialized to 0.




At step S


105


, coding data of one block image is input. At step S


106


, it is judged that k is 0 or not. In case that k is 0, the target blocks <R


j


> are reproduced by alternating current prediction method as described hereinafter. In case that k is not 0, it is judged at step S


107


whether k is not less than 1 and not more than 4 or not.




In case k is not less than 1 and not more than 4, the differential vector <d


j


> is inversely quantized at step S


112


. Since the lowest four bits of the DC nest are previously masked in the embodiment of the invention, the differential vector <d


j


> is obtained at once by cumulatively adding the product of the selected block <U


k


> and β


k


and by separating the block mean value A


j


from the cumulative addition result, whereby the decoding processing is carried out at high speed. At step S


113


, the DC value DC


J


corresponding to thus obtained differential vector <d


j


> is added.




In case k is less than 1 and more than 4, the target blocks <R


j


> are directly produced from the decoding data of the target blocks <R


j


> at step S


108


. Thus, the target blocks <R


j


> of 4 times 4 pixels are reproduced by any methods as above. The reproduced target blocks <R


j


> are stored in the reproduced image memory


45


at step S


109


.




At step S


110


, “1” is added to the counters j and J, respectively, and at step S


111


, it is judged whether i is not less than M (all pixel block numbers) or not. In case that i is less than M, turning to step S


105


and the decoding and reproducing processing is carried out with respect to subsequent the block image coding data. When the same procedure was proceeded and j is not less than M in the judge at step S


111


, the decoding processing per one image is terminated.





FIG. 16

is an image drawing of an alternating current component prediction, which is an embodiment of the invention and is applicable for conventional prediction methods.




FIG.


16


(A) is a stepwise alternating current component prediction method as described hereinafter. At first stage, each sub-block S


1


˜S


4


is predicted from each DC value of the 4 blocks (U, R, B, L) surrounding the S


1


˜S


4


.




S


1


=S+(U+L−B−R)/8




S


2


=S+(U+R−B−L)/8




S


3


=S+(B+L−U−R)/8




S


4


=S+(B+R−U−L)/8




Similarly, U


1


˜U


4


, L


1


˜L


4


, R


1


˜R


4


and B


1


˜B


4


are predicted at the first stage. At second stage, 4 pixels P


1


˜P


4


on S


1


are predicted by using the above method repeatedly.




P


1


=S


1


+(U


3


+L


2


−S


3


−S


2


)/8




P


2


=S


1


+(U


3


+S


2


−S


3


−L


2


)/8




P


3


=S


1


+(S


3


+L


2


−U


3


−S


2


)/8




P


4


=S


1


+(S


3


+S


2


−U


3


−L


2


)/8




Each 4 pixels P


1


˜P


4


on S


2


˜S


4


are predicted in the same manner. The target blocks <R


j


> are reproduced by such two stage processing.




FIG.


16


(B) is a non-stepwise alternating current component prediction method, which the applicant has been already proposed. In FIG.


16


(B), each of the 4 pixels P


1


˜P


4


on each of the sub-block S


1


˜S


4


is predicted from each DC value of 4 blocks surrounding the noticeable block S at a stroke. At first, each approximation of S


2


≈S


3


≈S, U


3


≈U


2


and L


2


≈L is carried out to obtain each4 pixels P


1


˜P


4


on S


1


. The approximation is applied to P


1


˜P


4


on S


1


to obtain the formula,








P




1




=S




1


+(


U




3




+L




2




−S




3




−S




2


)/8=


S




1


+(


U+L−S−S


)/8






The above formula, S


1


=S+(U+L−B−R)/8, is substituted for the formula, P


1


=S


1


+(U+L−S−S)/8, P


1


on S


1


is finally represented by the formula,








P




1




=S+


(2


U+


2


L−


2


S−B−R


)/8






And, P


2


on S


1


is represented by the formula,








P




2




=S




1


+(


U




3




+S




2




=S




3




−L




2


)/8=


S




1


+(


U+S−S−L


)/8






The above formula, S


1


=S+(U+L−B−R)/8, is substituted for the formula, P


2


=S


1


+(U+S−S−L)/8, P


2


on S


1


is finally represented by the formula,








P




2




=S


+(2


U−B−R


)/8






Also, P


3


on S


1


is represented by the formula,








P




3




=S




1


+(


S




3




+L




2




−U




3




−S




2


)/8=


S




1


+(


S+L−U−S


)/8






The above formula, S


1


=S+(U+L−B−R)/8, is substituted for the formula, P


3


=S


1


+(S+L−U−S)/8, P


3


on S


1


is finally represented by the formula,








P




3




=S


+(2


L−B−R


)/8






Further, P


4


on S


1


is represented by the formula,








P




4




=S




1


+(


S




3




+S




2




−U




3




−L




2


)/8=


S




1


+(


S+S−U−L)/


8






The above formula, S


1


=S+(U+L−B−R)/8, is substituted for the formula, P


4


=S


1


+(S+S−U−L)/8, P


4


on S


1


is finally represented by the formula,








P




4




=S


+(2


S−B−R


)/8






Accordingly, 4 pixels P


1


˜P


4


on S


1


can be non-stepwise obtained by the formulae at a stroke.








P




1




=S


+(2


U+


2


L−


2


S−B−R


)/8










P




2




=S


+(2


U−B−R


)/8










P




3




=S


+(2


L−B−R


)/8










P




4




=S


+(2


S−B−R


)/8






Each 4 pixels P


1


˜P


4


on S


2


˜S


4


is obtained in the same manner.




The embodiments of the invention are explained by using the several examples, but it is apparent that the invention should not be limited thereto. It is to be appreciated that those skilled in the art can change or modify the embodiments in such point as construction, control, processing or a combination thereof without departing from the scope and spirit of the invention.




According to the invention, high image quality can be obtained by the improvement of the DC nest and high speed encoding can be achieved by the means for the AOT calculation. Therefore, the method of the invention much contributes to the attainment of high image quality and high speed encoding in the HVQ system.



Claims
  • 1. In an image encoding method which comprises producing a DC image composed of each block mean value by dividing an image data per B pixel into a block, making a part of said DC image a DC nest, and where the differential vector which is obtained by separating the DC value from the pixel block to be encoded is over an allowable value, calculating one or more orthogonal basis, to which the differential vector is approximated, by the adaptive orthogonal transform using the DC nest, the improvement which comprises setting the lowest n (n=log2 B) bits of the DC pixels in each sample to 0, where the base extraction blocks are down-sampled from the DC nest and the block mean value thereof is calculated using the samples.
  • 2. The method according to claim 1, wherein the lowest n bits of each DC pixels are set to 0, where the DC nest is produced from the DC image.
  • 3. The method according to claim 2, wherein a base extraction vector is produced to which the differential vector approximates by separating the block mean value from the base extraction block in which n bits of the DC pixels are set to 0.
  • 4. The method according to claim 3, optional elements of base extraction vectors <ui> are replaced by linear bond of the remainder elements and the inner product of the base extraction vectors and the other optional vectors <w> are calculated by the formula,<w·ui>=(w1−w16) u1+(w2−w16)u2+ . . . +(w15−w16) u15.
  • 5. The method according to claim 4, wherein a first basis is searched so that hi may be maximum in the following formula,hi=<d·u1>2/∥ui∥2 wherein <d> is the differential vectors and <ui> is the base extraction vectors.
  • 6. The method according to claim 4, wherein a second basis is searched so that hi may be maximum in the following formula,hi={<d·ui>−(<d·u1><u1·ui>/∥u1∥2)2/{∥ui∥2−(<u1·ui>)/∥u1∥2}wherein <d> is the differential vectors, <u1> is the base extraction vectors corresponding to the first basis, and <ui> is the base extraction vectors for searching the second basis.
  • 7. The method according to claim 4, wherein a third basis is searched so that hi may be maximum in the following formula,hi=(<d·ui>−<d·v1><v1·ui>−<d·v2><v2·ii>)2/{∥ui∥2−<v1·ui>2−<v2·ui>}2 wherein <d> is the differential vectors, <v1> is the first orthonormal base vectors, <v2> is the second orthonormal base vectors, and <ui> is the base extraction vectors for searching the third basis.
  • 8. The method according to claim 6, wherein the base extraction vectors <ui> which match with search conditions are subjected to orthonormal transform with one or more preceding orthonormal basis.
  • 9. In an image encoding apparatus which comprises producing a DC image composed of each block mean value by dividing an image data per B pixel into a block, making a part of said DC image a DC nest, and where a differential vector which is obtained by separating the DC value from the pixel block to be encoded is over an allowable value, calculating one or more orthogonal basis, to which the differential vector is approximated, by the adaptive orthogonal transform using the DC nest, the improvement comprising a memory to store the DC nest in which the lowest n (n=log2 B) bits of the DC nest pixels are set to 0.
  • 10. A computer readable recording medium storing a program to make a computer to implement the processing comprising the processing recited in claims 1 to 8.
Priority Claims (1)
Number Date Country Kind
2000-141675 May 2000 JP
US Referenced Citations (4)
Number Name Date Kind
5757969 Kim May 1998 A
5852682 Kim Dec 1998 A
5937098 Abe Aug 1999 A
6477276 Inoue et al. Nov 2002 B1
Foreign Referenced Citations (2)
Number Date Country
1104199 May 2001 EP
WO 0002393 Jan 2000 WO
Non-Patent Literature Citations (4)
Entry
Patent Abstracts of Japan, Publ. No.: 07038760, Publ. Date: Feb. 7, 1995.
You H, et al., “Improving LIFS Image Coding Scheme Via Gram-Schmidt Orthogonalization”, IEICE Technical Report, 1998, pp. 37-42.
Takahashi T, et al., “A Fast Computation Algorithm for Predicting AC Components of Imagesusing Mean Values of Blocks”, vol. J81-D-II, No. 4, Apr. 1998, pp. 778-780.
Huang S-C, et al., “A Contrained Vector Quantization Scheme for Real-Time Codebook Retransmission”, vol. 4, No. 1, Feb. 1, 1994, pp. 1-7.