The present disclosure in one or more embodiments relates to an apparatus and method for configuring and indexing a reference image. More particularly, the present disclosure relates to an apparatus and method for configuring and indexing a reference image which may be interpolated in a variety of spatial resolutions or include such reference image as processed through illumination compensation, zoom-in and zoom-out, warping and the like so as to assign the index of reference frame according to the temporal position of the reference image.
The statements in this section merely provide background information related to the present disclosure and may not constitute prior art.
During inter coding, the existing video codec, such as JM or KTA, estimates motion vector by using multiple previous frames or multiple future frames. With such use of several sheets of reference images, it is more likely that blocks similar to a current block will be found, and motion can be accurately estimated when a scene is changed or a hidden object pops up.
In this manner, when motion vector is estimated by using multiple reference images, it is necessary to encode reference frame index indicating a reference image whenever blocks are coded. Currently, JM software utilizes a unary coding method to code index information of a reference image. The unary coding method is shown
Generally, an image temporally closest to a current frame is most similar to the current frame. Therefore, reference frame index generally assigns a shorter codeword to reference frame closer to the current frame, based on temporal order, and assigns a longer codeword to reference frame temporally farther from the current frame. Such a method may be used for bi-directional prediction as shown in
Meanwhile, the process of encoding motion vector estimated in video codec is as follows. A predicted motion vector (PMV) made from peripheral blocks of current block is first calculated, and a calculation is then performed on a differential vector between the PMV and the motion vector found with respect to the current block. The difference vector is encoded into variable length codeword arranged by the integer multiple of motion vector resolution considered in the codec.
The JM software, which is the existing video codec, estimates motion vector in integer pixel unit, half (½) pixel unit, and quarter (¼) pixel unit, and compresses signals by motion vector having resolution of the highest compression efficiency. Meanwhile, in order to estimate motion vector more accurately, the KTA software finds a more accurate motion by estimating motion vector from integer pixel unit down to ⅛ pixel unit. However, in the KTA codec, it is necessary to perform encoding in consideration of motion vectors of all resolutions, such as integer pixel unit, ½ pixel unit, ¼ pixel unit, and ⅛ pixel unit. Therefore, a long codeword needs to be used even when a small motion vector is encoded. This increases the amount of bits generated, resulting in degradation in compression efficiency. Specific examples of these problems will be described below.
Table 3 below shows examples of codewords assigned to differential motion vectors in the codec considering pixel units from integer pixel unit to ¼ pixel unit.
As can be seen from Table 3 above, if motion vectors of different resolutions are encoded in the typical compression standard, long codewords are also used for encoding small motion vectors. Consequently, the size of data generated in the process of encoding motion vectors is increased, which lowers compression efficiency. For example, when the differential motion vector is 3 and 2 in Table 3 above, a bit string ‘000011000’, whose code number is ‘23’, is used for encoding ‘3’, and a bit string ‘000010000’, whose code number is ‘15’, is used for encoding ‘2’. In this manner, long codewords are used for encoding small motion vectors because of using codewords for encoding motion vectors of ½ and ¼ pixel units along with codewords for encoding all motion vectors of integer pixel unit considered.
As described above, estimation of motion vectors of high resolutions is advantageous to find reference blocks which have very high correlation with current coding block. However, the compression efficiency may be lowered by the use of variable length codewords considering all resolution vectors from low-resolution motion vector values to high-resolution vector values. For example, in the case where most internal blocks of specific frame can be encoded by using only motion vectors of integer pixel unit or ½ pixel unit, if variable length codebook considering all resolutions from integer pixel unit to ⅛ pixel unit is used, variable length codewords of frequently used integer pixel and ½ pixel code vectors are lengthened by codewords considering unused ¼ pixels and ⅛ pixels. Consequently, the compression efficiency may be lowered. In this case, motion vectors of all resolutions are not required in some frames or some coding blocks, and only several specific types of resolution vectors may be used to have the encoding done efficiently.
On the contrary, the compression efficiency may be increased when using variable length codewords considering motion vectors of all resolutions from integer pixel unit to ⅛ pixel unit due to characteristics of internal pixel values in specific frame and block.
As another example, the compression efficiency may be increased when using a variable length codebook considering only several specific resolutions (for example, ½ pixel unit, ⅛ pixel unit) due to characteristics of internal pixel values in specific frame and block. In this case, the compression efficiency may be increased when the variable length codebook does not support codewords for motion vectors of integer pixel unit and ¼ pixel unit.
In order to solve the compression rate reduction problem related to resolutions of motion vectors, embodiments of the present disclosure are directed to provide an apparatus and method for configuring and indexing a reference image, in which a reference image is interpolated in different spatial resolutions, such that the reference image can be used as reference images having different reference frame indexes. In the apparatus and method for configuring and indexing the reference image according to one or more embodiments of the present disclosure, there is no limitation to interpolation methods or motion estimation methods.
In addition, reference images may be configured based on differentiated spatial resolutions by respective images. However, embodiments of the present disclosure are directed to providing an apparatus and method for configuring and indexing reference images which can use variously configured reference images, such as illumination compensated reference images, zoomed-in and zoomed-out reference images, warped reference images and the like.
According to a first embodiment of the present disclosure, an apparatus for configuring and indexing a reference image for estimating motion vector includes: a reference image configuring unit for processing the reference image in a variety of arbitrary different methods and arranging reference frames according to processing methods; and an index assignment unit for assigning reference frame indexes according to the method processed by the reference image configuring unit.
The reference frame may be one or more of non-interpolated frame, ½ pixel unit frame, ¼ pixel unit frame, ⅛ pixel unit frame, and frame combined with two or more of ½ pixel unit, ¼ pixel unit, and ⅛ pixel unit.
When motion vector estimated in the reference frame is encoded, the motion vector may be encoded by using different codebooks according to the spatial resolution of the reference frame.
According to a second embodiment of the present disclosure, an apparatus for configuring and indexing a reference image for estimating motion vector includes: a reference image configuring unit for zooming the reference image in a variety of ratios and arranging reference frames according to zooming ratios; and an index assigning unit for assigning reference frame indexes according to the zooming ratios of the reference frame.
The reference frame is one or more of non-zoomed frame, zoomed-in frame, and zoomed-out frame.
When motion vector is estimated in the reference frame, the motion vector may be encoded by using different codebooks according to the zooming ratios of the reference frame.
According to a third embodiment of the present disclosure, an apparatus for configuring and indexing a reference image for estimating motion vector includes: a reference image configuring unit for warping the reference image and arranging reference frames according to warping method; and an index assigning unit for assigning reference frame indexes according to the warping method of the reference frame.
The reference frame may be one of non-warped frame and warped frame.
When motion vector is estimated in the reference frame, if the reference frame is warped frame, the motion is encoded by estimating warping parameters and motion vector.
According to a fourth embodiment of the present invention, an apparatus for configuring and indexing a reference image for estimating motion vector includes: a reference image configuring unit for performing illumination compensation on the reference image and arranging reference frames according to illumination compensation method and levels; and an index assignment unit for assigning reference frame indexes according to the illumination compensation methods and levels of the reference frame.
The reference frame may be one of illumination uncompensated frame and illumination compensated frame.
When motion vector is estimated in the reference frame, if the reference frame is illumination compensated frame, the motion vector may be estimated with respect to the illumination compensated reference image data.
According to a fifth embodiment of the present invention, an apparatus for configuring and indexing a reference image for estimating motion vector includes: a reference image configuring unit for performing processing combined with two or more of interpolation in a variety of spatial resolutions, zooming in various ratios, warping, and illumination compensation with respect to the reference image, and arranging reference frames according to processing methods and processing levels; and an index assignment unit for assigning reference frame indexes according to the processing method and processing level of the reference frame.
In the apparatus for configuring and indexing the reference image according to the first embodiment, a method for configuring and indexing a reference image for estimating motion vector includes: processing the reference image in a variety of arbitrary different methods and arranging reference frames according to processing methods; and assigning reference frame indexes according to the processing methods.
In the apparatus for configuring and indexing the reference image according to the second embodiment, a method for configuring and indexing a reference image for estimating motion vector includes: interpolating the reference image in a variety of spatial resolutions and arranging reference frames according to the spatial resolutions; and assigning reference frame indexes according to the spatial resolution of the reference frame.
In the apparatus for configuring and indexing the reference image according to the third embodiment, a method for configuring and indexing a reference image for estimating motion vector includes: zooming the reference image in a variety of ratios and arranging reference frames according to zooming ratios; and assigning reference frame indexes according to the zooming ratios of the reference frame.
In the apparatus for configuring and indexing the reference image according to the fourth embodiment, a method for configuring and indexing a reference image for estimating motion vector includes: warping the reference image and arranging reference frames according to warping method; and assigning reference frame indexes according to the warping method of the reference frame.
In the apparatus for configuring and indexing the reference image according to the fifth embodiment, a method for configuring and indexing a reference image for estimating motion vector includes: processing illumination compensation on the reference image and arranging reference frames according to illumination compensation method and levels; and assigning reference frame indexes according to the illumination compensation methods and levels of the reference frame.
In addition, with a reference image indexing technique according to the present disclosure, forward and backward reference images may be referenced by continuous index numbers requiring no separate flags to be used to distinctively indicate the reference images.
The method of configuring the reference image described in the foregoing embodiments and detailed description may be delivered to a decoder by using separate information, when a sequence header or picture header or slice header may be used for delivering the information. Further, in the event of configuring the reference image between the encoder and decoder in an agreed method, no separate information is necessary for the decoder to configure the reference image into its decoding operation.
According to the present disclosure as described above, a reference image is interpolated in a variety of spatial resolutions, or reference image is processed through illumination compensation, zoom-in and zoom-out, warping and the like so as to assign the index of reference frame according to the processing method. Therefore, the coding efficiency of motion vector may be improved.
The reference image configuring unit 410 arranges reference frames according to a variety of spatial resolutions by performing interpolation on a reference image in the spatial resolutions. Alternatively, the reference image configuring unit 410 may arrange reference frames according to zooming ratios by performing zooming on a reference image in a variety of ratios. Alternatively, the reference image configuring unit 410 may arrange reference frames according to warping methods by performing warping on a reference image. Alternatively, the reference image configuring unit 410 may arrange reference frames according to illumination compensation and level by performing illumination compensation on a reference image.
The index assignment unit 420 assigns reference frame indexes according to processing methods and processing levels of reference frames arranged by the reference image configuring unit 410.
The reference image configuring unit 410 arranges reference frames according to processing methods and processing levels by performing one or more of interpolation in a variety of spatial resolutions, zooming in various ratios, warping, and illumination compensation with respect to the reference image (S510).
The index assignment unit 420 assigns reference frame indexes to the arranged reference frames according to processing methods and degrees by the reference image configuring unit 410 (S520).
In
When the reference image is configured by using the method of
When the motion vector estimated in the reference frame a1 is encoded, a codebook for encoding only the motion vector of ⅛ pixel unit is used as shown in Table 5.
In addition, when encoding the motion vector found in the reference frame a2, a codebook for encoding motion vectors of only ½ pixel unit and ¼ pixel unit is used as shown in Table 6 below.
Tables 4 to 6 are merely exemplary codebooks for representation of motion vectors according to resolution information, and the embodiment of the present disclosure does not limit the method of representing the motion vectors. In addition,
When the reference image is configured by using the method of
When the reference image is configured by using the method of
When the reference image is configured by using the method of
In the embodiments described above, the description has been given of the case where a specific reference image is used as additional reference images through ‘resolution conversion, ‘zooming’, ‘warping’, and ‘illumination conversion’. However, the present technology is not limited to the reference image conversion using the four methods, and a variety of additional conversion methods may also be used for accurately obtaining motion vector.
In addition, a decoder of the present disclosure may have an advance agreement of the method of configuring specific reference image with an encoder, and the encoder may provide the decoder with additional information on the method of configuring the reference image. There is no restriction to the method of coding the information to be delivered to the decoder.
Also, the technologies proposed in the embodiments of the present disclosure do not limit the combined use of images, modified by one or more methods, as reference images in the list of reference images. For example, among the reference images, ‘unmodified past reference image’, ‘illumination compensated reference image’, ‘warped reference image’, ‘zoomed reference image’, ‘reference image composed of only specific resolution information’, and ‘reference image modified by addition processes’ can be used together. In this case, the method of assigning indexes to the respective processed images is not limited to the embodiments of the present disclosure.
Using Table 1 for encoding the reference frame index in the above description of the present disclosure is merely exemplary, and there is no limitation to the method of encoding reference frame index.
Although exemplary aspects of the present disclosure have been described for illustrative purposes, those skilled in the art will appreciate that various modifications, additions and substitutions are possible, without departing from essential characteristics of the disclosure. Therefore, exemplary aspects of the present disclosure have not been described for limiting purposes. Accordingly, the scope of the disclosure is not to be limited by the above aspects but by the claims and the equivalents thereof.
As described above, the present disclosure is highly useful for application in the fields of reference image configuration and indexing. A reference image is interpolated in various spatial resolutions, or a reference image is variously configured by performing zooming in and out at various magnifications, warping, illumination compensation, and the like, and reference frame indexes are assigned according to the processing methods and processing levels, thereby improving the coding efficiency of motion vector.
If applicable, this application claims priorities under 35 U.S.C §119(a) of Patent Application No. 10-2010-0046178, filed on May 17, 2010 and Patent Application No. 10-2011-0045528, filed on May 16, 2011 in Korea, the entire contents of which are incorporated herein by reference. In addition, this non-provisional application claims priorities in countries, other than the U.S., with the same reason based on the Korean Patent Applications, the entire contents of which are hereby incorporated by reference.
| Filing Document | Filing Date | Country | Kind | 371c Date |
|---|---|---|---|---|
| PCT/KR2011/003628 | 5/17/2011 | WO | 00 | 11/21/2012 |