IMAGE PROCESSING DEVICE AND IMAGE PROCESSING METHOD

Information

  • Patent Application
  • 20230009580
  • Publication Number
    20230009580
  • Date Filed
    December 18, 2020
  • Date Published
    January 12, 2023
Abstract
The present technology relates to an image processing device and an image processing method that enable simplification of processing.
Description
TECHNICAL FIELD

The present technology relates to an image processing device and an image processing method, and particularly relates to, for example, an image processing device and an image processing method that enable simplification of processing.


BACKGROUND ART

In the Joint Video Experts Team (JVET), which is a joint standardization organization of ITU-T and ISO/IEC, standardization work of Versatile Video Coding (VVC), which is a next-generation image encoding system, is in progress for the purpose of further improving encoding efficiency over H.265/HEVC.


In the standardization work of VVC, it has been proposed to perform matrix-based intra prediction (MIP), which is intra prediction using matrix operation, on a prediction block (see, for example, Non Patent Document 1).


In the MIP, parameters of a matrix (weight matrix) obtained by parameter learning are defined, and an operation using (the parameters of) the matrix is performed.


CITATION LIST
Non Patent Document



  • Non Patent Document 1: Benjamin Bross, Jianle Chen, Shan Liu, Versatile Video Coding (Draft 7), JVET-P2001-v14 (version 14—date Nov. 14, 2019)



SUMMARY OF THE INVENTION
Problems to be Solved by the Invention

An MIP operation (operation performed by the MIP) is performed using an offset factor fO. The offset factor fO is changed according to MipSizeId representing the matrix size of the matrix and modeId representing the mode number of the MIP in order to increase bit accuracy.


As described above, in the MIP operation, since the offset factor fO is changed according to the MipSizeId and the modeId, it is necessary to set the offset factor fO for each combination of the MipSizeId and the modeId, and the process becomes complicated.


The present technology has been made in view of such a situation, and enables simplification of processing.


Solutions to Problems

A first image processing device of the present technology is an image processing device including an intra prediction unit that, when performing matrix-based intra prediction that is intra prediction using a matrix operation on a current prediction block to be encoded, performs the matrix-based intra prediction using a coefficient related to a sum of change amounts of pixel values and set to a fixed value, and generates a predicted image of the current prediction block, and an encoding unit that encodes the current prediction block using the predicted image generated by the intra prediction unit.


A first image processing method of the present technology is an image processing method including an intra prediction step of, when performing matrix-based intra prediction that is intra prediction using a matrix operation on a current prediction block to be encoded, performing the matrix-based intra prediction using a coefficient related to a sum of change amounts of pixel values and set to a fixed value, and generating a predicted image of the current prediction block, and an encoding step of encoding the current prediction block using the predicted image generated in the intra prediction step.


In the first image processing device and image processing method of the present technology, when performing matrix-based intra prediction that is intra prediction using a matrix operation on a current prediction block to be encoded, the matrix-based intra prediction is performed using a coefficient related to a sum of change amounts of pixel values and set to a fixed value, and a predicted image of the current prediction block is generated. Then, the current prediction block is encoded using the predicted image.


A second image processing device of the present technology is an image processing device including an intra prediction unit that, when performing matrix-based intra prediction that is intra prediction using a matrix operation on a current prediction block to be decoded, performs the matrix-based intra prediction using a coefficient related to a sum of change amounts of pixel values and set to a fixed value, and generates a predicted image of the current prediction block, and a decoding unit that decodes the current prediction block using the predicted image generated by the intra prediction unit.


A second image processing method of the present technology is an image processing method including an intra prediction step of, when performing matrix-based intra prediction that is intra prediction using a matrix operation on a current prediction block to be decoded, performing the matrix-based intra prediction using a coefficient related to a sum of change amounts of pixel values and set to a fixed value, and generating a predicted image of the current prediction block, and a decoding step of decoding the current prediction block using the predicted image generated in the intra prediction step.


In the second image processing device and image processing method of the present technology, when performing matrix-based intra prediction that is intra prediction using a matrix operation on a current prediction block to be decoded, the matrix-based intra prediction is performed using a coefficient related to a sum of change amounts of pixel values and set to a fixed value, and a predicted image of the current prediction block is generated. Then, the current prediction block is decoded using the predicted image.


Note that the image processing device may be an independent device or an internal block constituting one device.


Furthermore, the image processing device can be achieved by causing a computer to execute a program. The program can be provided by transmitting via a transmission medium or by recording on a recording medium.





BRIEF DESCRIPTION OF DRAWINGS


FIG. 1 is a diagram describing a first MIP method.



FIG. 2 is a diagram describing a second MIP method.



FIG. 3 is a diagram describing MIP in a case where 48 is employed as a fixed offset coefficient and five is employed as a fixed shift amount.



FIG. 4 is a diagram illustrating a weight matrix mWeight[i][j] of (M, m)=(0, 0) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 5 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 1) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 6 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 2) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 7 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 3) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 8 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 4) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 9 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 5) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 10 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 6) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 11 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 7) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 12 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 8) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 13 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 9) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 14 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 10) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 15 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 11) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 16 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 12) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 17 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 13) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 18 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 14) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 19 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 15) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 20 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 0) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 21 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 1) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 22 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 2) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 23 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 3) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 24 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 4) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 25 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 5) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 26 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 6) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 27 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 7) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 28 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 0) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 29 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 1) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 30 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 2) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 31 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 3) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 32 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 4) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 33 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 5) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 34 is a diagram describing MIP in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 35 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 0) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 36 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 1) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 37 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 2) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 38 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 3) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 39 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 4) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 40 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 5) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 41 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 6) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 42 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 7) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 43 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 8) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 44 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 9) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 45 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 10) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 46 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 11) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 47 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 12) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 48 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 13) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 49 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 14) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 50 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 15) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 51 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 0) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 52 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 1) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 53 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 2) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 54 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 3) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 55 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 4) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 56 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 5) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 57 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 6) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 58 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 7) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 59 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 0) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 60 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 1) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 61 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 2) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 62 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 3) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 63 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 4) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 64 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 5) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 65 is a block diagram illustrating a configuration example of an embodiment of an image processing system to which the present technology is applied.



FIG. 66 is a block diagram illustrating a configuration example of an encoder 11.



FIG. 67 is a flowchart describing an example of encoding processing of the encoder 11.



FIG. 68 is a block diagram illustrating a detailed configuration example of a decoder 51.



FIG. 69 is a flowchart describing an example of decoding processing of the decoder 51.



FIG. 70 is a block diagram illustrating a configuration example of an embodiment of a computer to which the present technology is applied.





MODE FOR CARRYING OUT THE INVENTION
References

The scope disclosed in the present description is not limited to the contents of the embodiment, and the contents of the following References REF 1 to REF 11 known at the time of filing are also incorporated herein by reference. That is, the contents described in the following References REF 1 to REF 11 are also the basis for determining the support requirement. Moreover, the documents referred to in References REF 1 to REF 11 are also grounds for determining the support requirement.


For example, a quad-tree block structure, a quad tree plus binary tree (QTBT) block structure, and a multi-type tree (MTT) block structure are within the scope of the present disclosure and satisfy the support requirements of the claims even in a case where they are not directly defined in the detailed description of the invention. Furthermore, for example, technical terms such as parsing, syntax, and semantics are similarly within the scope of the present disclosure and satisfy the support requirements of the claims even in a case where not directly defined in the detailed description of the invention.


REF 1: Recommendation ITU-T H.264 (April 2017) “Advanced video coding for generic audiovisual services”, April 2017


REF 2: Recommendation ITU-T H.265 (February 2018) “High efficiency video coding”, February 2018


REF 3: Benjamin Bross, Jianle Chen, Shan Liu, Versatile Video Coding (Draft 7), JVET-P2001-v14 (version 14—date Nov. 14, 2019)


REF 4: Jianle Chen, Yan Ye, Seung Hwan Kim, Algorithm description for Versatile Video Coding and Test Model 7 (VTM 7), JVET-P2002-v1 (version 1—date Nov. 10, 2019)


REF 5: JVET-N0217-v3: CE3: Affine linear weighted intra prediction (CE3-4.1, CE3-4.2) (version 7—date Jan. 17, 2019)


REF 6: JVET-M0043-v2: CE3: Affine linear weighted intra prediction (test 1.2.1, test 1.2.2) (version 2—date Jan. 9, 2019)


REF 7: JVET-O0408-v3: Non-CE3: On rounding shift of MIP (version 3—date Jul. 4, 2019)


REF 8: JVET-O0925-v3: Non-CE3: Simplifications of MIP (version 3—date Jul. 4, 2019)


REF 9: Kenji Kondo, Masaru Ikeda, Teruhiko Suzuki, Junyan Huo, Yanzhuo Ma, Fuzheng Yang, Shuai Wan, Yuanfang Yu, CE3-2: On rounding shift of MIP, JVET-P0056-v3 (version 3—date Oct. 3, 2019)


REF 10: Junyan Huo, Haixin Wang, Yu Sun, Yanzhuo Ma, Shuai Wan, Fuzheng Yang, Yuanfang Yu, Yang Liu, Non-CE3: MIP simplification, JVET-P0136-v2 (version 3—date Oct. 4, 2019)


REF 11: Thibaud Biatek, Adarsh K. Ramasubramonian, Geert Van der Auwera, Marta Karczewicz, Non-CE3: simplified MIP with power-of-two offset, JVET-P0625-v2 (version 2—date Oct. 2, 2019)


<Definitions>


To be adjacent includes, for a pixel, not only a case where the pixel is adjacent to the current pixel of interest by one pixel (one line) but also a case where the pixel is adjacent to the current pixel of interest by a plurality of pixels (a plurality of lines). Therefore, an adjacent pixel includes a pixel at a position corresponding to a plurality of pixels continuously adjacent to the current pixel in addition to a pixel at a position corresponding to one pixel directly adjacent to the current pixel. Furthermore, an adjacent block includes a block in a range corresponding to a plurality of blocks continuously adjacent to the current block in addition to a block in a range corresponding to one block directly adjacent to the current block of interest. Moreover, the adjacent block can include a block located in the vicinity of the current block as necessary.


A prediction block means a block (prediction unit (PU)) serving as a processing unit when intra prediction or inter prediction is performed, and also includes a sub-block in the prediction block. In a case where the prediction block, an orthogonal transform block (transform unit (TU)), and an encoding block (coding unit (CU)) are unified into the same block, the prediction block, the orthogonal transform block, and the coding block mean the same block. The orthogonal transform block is a block serving as a processing unit when orthogonal transform is performed, and the encoding block is a block serving as a processing unit when encoding is performed.


The intra prediction mode collectively means variables (parameters) referred to when deriving the intra prediction mode, such as a mode number, a mode number index, a block size of the prediction block, and a size of a sub-block serving as a processing unit in the prediction block when intra prediction is performed.


The matrix-based intra prediction mode (MIP mode) means variables (parameters) referred to when deriving the matrix-based intra prediction mode, such as an MIP mode number, a mode number index, a type of a matrix used when the MIP operation is performed, and a type of a matrix size of a matrix used when the MIP operation is performed.


The parameter is a generic term for data required for encoding or decoding, typically syntax of a bit stream, a parameter set, or the like. Moreover, the parameters include variables and the like used in a derivation process. For the MIP, various data used when the MIP operation is performed correspond to the parameters. For example, the offset factor fO, the shift amount sW, (the components of) the weight matrix mWeight[i][j], and the like described in REF 3 correspond to the parameters.


Changing means changing determined contents, for example, changing contents described in a publicly known document dated before the filing date of the present application. Therefore, for example, differing from the contents described in Reference REF 3 (values, arithmetic expressions, variables, and the like) corresponds to a change.


In the present technology, identification data for identifying a plurality of patterns can be set as syntax of a bit stream obtained by encoding an image. That is, the bit stream can include identification data for identifying the various patterns.


In a case where the identification data is included in the bit stream, a decoder that decodes the bit stream can perform processing more efficiently by parsing and referring to the identification data.


<First MIP Method>



FIG. 1 is a diagram describing a first MIP method.


The first MIP method is a method of generating a predicted image of the MIP proposed in Reference REF 3 (JVET-P2001-v14).


In the first MIP method, (the pixel values of) (part of) the pixels predMip[x][y] of a predicted image of a current prediction block, which is a prediction block to be encoded or decoded, are generated according to the following expression, described as Expression (258) in Reference REF 3.





predMip[x][y]=(((Σ mWeight[i][y*predSize+x]*p[i])+oW)>>sW)+pTemp[0]  (258)


In Expression (258), a variable oW is calculated according to the following expression described as Expression (257) in Reference REF 3.






oW=(1<<(sW−1))−fO*(Σp[i])   (257)


A<<B and A>>B denote shifting A leftward and rightward by B bits, respectively.


Each of the variables in Expressions (258) and (257) is described in Reference REF 3, and thus the description thereof is appropriately omitted.


The predMip[x][y] represents (the pixel value of) a pixel whose horizontal position is x and whose vertical position is y in the predicted image. The pixel of the predicted image will be also referred to as a predicted pixel.


According to Expressions (258) and (257), in the first MIP method, MIP is performed using the weight matrix mWeight[i][j], the shift amount sW, and the offset factor fO set according to the MipSizeId and the modeId, and the pixel predMip[x][y] of the predicted image is generated.


Here, in Expressions (258) and (257), p[i] represents a change amount of (the pixel value of) the pixel pTemp[i] in the current prediction block. For example, p[i] is a change amount of the pixel pTemp[i] with reference to an upper left pixel pTemp[0] of the current prediction block, and is expressed by, for example, an expression p[i]=pTemp[i+1]−pTemp[0] or an expression p[i]=pTemp[i]−pTemp[0].


In Expression (257), the offset factor fO is a coefficient relating to the sum Σp[i] of the change amounts p[i] of pixel values.


The shift amount sW is set according to MipSizeId and modeId in accordance with Table 23 described in Reference REF 3. The offset factor fO is set according to MipSizeId and modeId in accordance with Table 24 described in Reference REF 3.
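As a sketch, the MIP operation of Expressions (257) and (258) can be written as follows. The function name mip_predict and the argument layout are illustrative assumptions; the derivation of the weight matrix mWeight, the change amounts p, the reference pixels pTemp, the shift amount sW, the offset factor fO, and predSize follows Reference REF 3 and is taken as given here.

```python
def mip_predict(mWeight, p, pTemp, sW, fO, predSize):
    # Expression (257): the offset oW combines the rounding term
    # 1 << (sW - 1) with the offset factor fO multiplied by the sum
    # of the change amounts p[i].
    oW = (1 << (sW - 1)) - fO * sum(p)

    pred = [[0] * predSize for _ in range(predSize)]
    for y in range(predSize):
        for x in range(predSize):
            # Expression (258): weighted sum of change amounts plus
            # the offset, right-shifted by sW, plus the reference
            # pixel pTemp[0].
            acc = sum(mWeight[i][y * predSize + x] * p[i]
                      for i in range(len(p)))
            pred[x][y] = ((acc + oW) >> sW) + pTemp[0]
    return pred
```

Since fO and sW depend on MipSizeId and modeId in the first MIP method, a conforming implementation must select them per block from Tables 23 and 24 before each call, which is the complication the present technology removes.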


Note that although Reference REF 3 describes that “the variable sW is derived using mipSizeId and modeId as specified in Table 8-5)”, the “Table 8-5” is a clerical error of “Table 23”. Furthermore, although Reference REF 3 describes that “the variable fO is derived using mipSizeId and modeId as specified in Table 8-6”, the “Table 8-6” is a clerical error of “Table 24”.


Meanwhile, Reference REF 10 (JVET-P0136-v2) proposes deleting Table 24 of Reference REF 3, that is, not using the offset factor fO.


Furthermore, Reference REF 11 (JVET-P0625-v2) proposes to use a value represented by a power of two as the offset factor fO defined in Table 24 of Reference REF 3.


As proposed in Reference REF 10, by not using the offset factor fO, it is not necessary to set the offset factor fO according to the MipSizeId and the modeId, so that the MIP processing can be simplified.


However, in a case where the offset factor fO is not used, the range of the weight matrix mWeight[i][j] increases due to the influence of not using the offset factor fO. Consequently, the storage capacity for storing the weight matrix mWeight[i][j] increases.


As proposed in Reference REF 11, by using a value represented by a power of two as the offset factor fO defined in the Table 24, multiplication of the offset factor fO can be performed by shift operation, so that the MIP processing can be simplified.


However, even in a case where a value represented by a power of two is used as the offset factor fO defined in the Table 24, it is necessary to reset the offset factor fO according to the MipSizeId and the modeId similarly to a case where the current Table 24 is used, and the processing becomes complicated.


For example, in a case where the first MIP method is implemented by hardware, a selector for switching the offset factor fO is required, and the circuit scale increases. Furthermore, in a case where the first MIP method is implemented by software, it is necessary to prepare a table in which the offset factor fO represented by a power of two is defined and refer to the table, and the processing speed decreases.


Therefore, in the present technology, the MIP is performed using the offset factor fO set to a fixed value. That is, for example, operations of Expressions (258) and (257) are changed according to the offset factor fO set to the fixed value, and the predicted image of the MIP is generated according to the changed operation.


Hereinafter, the offset factor fO set to a fixed value will also be referred to as a fixed offset coefficient where appropriate. Since the fixed offset coefficient is the offset factor fO set to the fixed value, the fixed offset coefficient is a coefficient relating to the sum Σp[i] of the change amounts p[i] of the pixel values, similarly to the offset factor fO.


<Second MIP Method>



FIG. 2 is a diagram describing a second MIP method.


In the second MIP method, the offset factor fO of Expression (257) for generating the predicted image of the MIP proposed in Reference REF 3 is set to a fixed offset coefficient that is a fixed value.


By setting the offset factor fO to the fixed offset coefficient that is a fixed value, Table 24 of Reference REF 3 becomes unnecessary, and the MIP processing can be simplified.


As the fixed value used as the fixed offset coefficient, for example, a value that causes the range of the weight matrix mWeight[i][j] to be in a predetermined range can be employed.


That is, the weight matrix mWeight[i][j] changes from the value described in Reference REF 3 due to the influence of using the fixed offset coefficient. As the fixed offset coefficient, a value that causes the range of the weight matrix mWeight[i][j] after the change to fall within the predetermined range can be employed.


Moreover, as the fixed value used as the fixed offset coefficient, a value represented by a power of two or a value represented by a sum of powers of two can be employed.


In a case where the value represented by a power of two is employed as the fixed offset coefficient, multiplication of the sum Σp[i] of the change amounts p[i] of the pixel value at the time of calculating the variable oW in Expression (257) and the fixed offset coefficient can be performed only by shift operation. Therefore, the MIP processing can be simplified, that is, the calculation amount of the MIP processing can be reduced.


In a case where a value represented by the sum of powers of two is employed as the fixed offset coefficient, multiplication of the sum Σp[i] of the change amounts p[i] of the pixel value at the time of calculating the variable oW in Expression (257) and the fixed offset coefficient can be performed by shift operation and addition. Therefore, the MIP processing can be simplified, that is, the calculation amount of the MIP processing can be reduced.


Note that in a case where a value represented by a power of two is employed as the fixed offset coefficient, the degree of reduction in the calculation amount of the MIP processing is larger than that in a case where the value represented by the sum of powers of two is employed.


However, employing the value represented by the sum of powers of two as the fixed offset coefficient can enhance calculation accuracy, that is, can reduce the prediction error of the predicted image of the MIP more, than employing the value represented by a power of two.


As the value represented by a power of two, which is the fixed offset coefficient, for example, 32=2^5 or the like can be employed. As the value represented by the sum of powers of two as the fixed offset coefficient, for example, 48=2^5+2^4, 96=2^6+2^5, or the like can be employed.
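The shift-based multiplications described above can be illustrated concretely. The following sketch is a general property of the candidate values 32, 48, and 96 and is not specific to Reference REF 3:

```python
def mul32(s):
    # 32 = 2^5: a single left shift replaces the multiplication.
    return s << 5

def mul48(s):
    # 48 = 2^5 + 2^4: two shifts and one addition.
    return (s << 5) + (s << 4)

def mul96(s):
    # 96 = 2^6 + 2^5: two shifts and one addition.
    return (s << 6) + (s << 5)

# The shift forms agree with direct multiplication for any integer sum.
for s in range(-1024, 1025):
    assert mul32(s) == 32 * s
    assert mul48(s) == 48 * s
    assert mul96(s) == 96 * s
```

Here s stands for the sum Σp[i] of the change amounts; in Python, left shifts of negative integers are arithmetic, so the equivalence holds for negative sums as well.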


In the second MIP method, the shift amount sW of Expressions (258) and (257) for generating the predicted image of the MIP proposed in Reference REF 3 can be set to a fixed shift amount that is a fixed value.


By setting the shift amount sW to the fixed shift amount that is a fixed value, Table 23 of Reference REF 3 becomes unnecessary, and the MIP processing can be further simplified.


As the fixed shift amount, for example, one of 5, 6, and 7, which are the three shift amounts sW defined in Table 23 of Reference REF 3, can be employed.


In the second MIP method, the operation of Expression (258) is changed according to the fixed offset coefficient and the three shift amounts sW defined in Table 23 of Reference REF 3, or according to the fixed offset coefficient and the fixed shift amount, and the predicted image of the MIP is generated according to the changed operation.


That is, in the second MIP method, the weight matrix mWeight[i][j] is changed from the value described in Reference REF 3 according to the fixed offset coefficient and the three shift amounts sW defined in Table 23 of Reference REF 3, or according to the fixed offset coefficient and the fixed shift amount.


Hereinafter, in order to simplify the description, attention will be paid to a case where the fixed offset coefficient and the fixed shift amount are employed.


A predicted pixel predMip[x][y] obtained in a case where the offset factor fO and the shift amount sW of Reference REF 3 are employed, that is, a predicted pixel predMip[x][y] (of the predicted image of the MIP) obtained according to Reference REF 3, will also be referred to as a standard predicted pixel predMip[x][y].


In the second MIP method, for example, the weight matrix mWeight[i][j] described in Reference REF 3 is changed according to the fixed offset coefficient and the fixed shift amount so that a predicted pixel obtained in a case where the fixed offset coefficient and the fixed shift amount are employed (hereinafter also referred to as a fixed predicted pixel), that is, the predicted pixel predMip[x][y] obtained according to expressions in which the offset factor fO and the shift amount sW in Expressions (258) and (257) are replaced with the fixed offset coefficient and the fixed shift amount, respectively, has a value approximate to the standard predicted pixel predMip[x][y]. Alternatively, the weight matrix mWeight[i][j] described in Reference REF 3 is changed according to the fixed offset coefficient and the fixed shift amount so that the prediction error of the fixed predicted pixel predMip[x][y] becomes close to the prediction error of the standard predicted pixel predMip[x][y].


For example, 32, which is a value represented by a power of two, can be employed as the fixed offset coefficient, and six can be employed as the fixed shift amount. In this case, the changed weight matrix mWeight[i][j] can fall within the seven-bit range from zero to 127.


In the second MIP method, a predicted image of the MIP is generated according to the operation using the weight matrix mWeight[i][j] after the change.


Therefore, in the second MIP method, the offset factor fO of Expressions (258) and (257) and further the shift amount sW are fixed regardless of the combination of MipSizeId and modeId, and thus the MIP processing can be simplified. Consequently, it is not necessary to define Table 24 and further Table 23 of Reference REF 3 by the standard, and the standard can be simplified.


Furthermore, for example, in a case where the second MIP method is implemented by hardware, a selector for switching the offset factor fO and the shift amount sW becomes unnecessary, and increase in the circuit scale can be suppressed. Moreover, in a case where the second MIP method is implemented by software, it is not necessary to refer to Table 24 or Table 23, and decrease in processing speed can be suppressed as compared with a case where Table 24 or Table 23 is referred to.


Note that, as the fixed offset coefficient, a value represented by a power of two other than 32 or a value represented by the sum of powers of two other than 48 and 96 can be employed. Furthermore, as the fixed shift amount, a value other than five, six, and seven can be employed.


<MIP in Case Where 48 is Employed as Fixed Offset Coefficient and Five is Employed as Fixed Shift Amount>



FIG. 3 is a diagram describing the MIP in a case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.


In this case, in the MIP, as illustrated in FIG. 3, an operation is performed in which the offset factor fO and the shift amount sW in Expressions (258) and (257) are replaced with 48 that is the fixed offset coefficient and five that is the fixed shift amount, respectively.


A of FIG. 3 illustrates an operation performed by the MIP in a case where multiplication 48*(Σp[i]) of the sum Σp[i] of the change amounts p[i] of the pixel values and 48 that is the fixed offset coefficient is performed as it is as the multiplication in the operation in which the offset factor fO and the shift amount sW in Expression (257) are replaced with 48 that is the fixed offset coefficient and five that is the fixed shift amount, respectively.


B of FIG. 3 illustrates an operation performed by the MIP in a case where the multiplication 48*(Σp[i]) of the sum Σp[i] of the change amounts p[i] of the pixel values and 48 that is the fixed offset coefficient is performed by the shift operation and the addition in the operation in which the offset factor fO and the shift amount sW in Expression (257) are replaced with 48 that is the fixed offset coefficient and five that is the fixed shift amount, respectively.


In the case where the multiplication 48*(Σp[i]) of the sum Σp[i] of the change amounts p[i] of the pixel values and 48 that is the fixed offset coefficient is performed as it is as the multiplication in the operation in which the offset factor fO and the shift amount sW in Expression (257) are replaced with 48 that is the fixed offset coefficient and five that is the fixed shift amount, respectively, the operation illustrated in A of FIG. 3 is performed.


That is, an operation of an expression in which the offset factor fO and the shift amount sW in Expressions (258) and (257) are replaced with 48 that is the fixed offset coefficient and five that is the fixed shift amount, respectively, is performed.


In the case where the multiplication 48*(Σp[i]) of the sum Σp[i] of the change amounts p[i] of the pixel values and 48 that is the fixed offset coefficient is performed by the shift operation and the addition in the operation in which the offset factor fO and the shift amount sW in Expression (257) are replaced with 48 that is the fixed offset coefficient and five that is the fixed shift amount, respectively, the operation illustrated in B of FIG. 3 is performed.


That is, the sum sum=Σp[i] of the change amounts p[i] of the pixel values is calculated. Then, the multiplication 48*(Σp[i]) of the sum Σp[i] of the change amounts p[i] of the pixel values by 48, which is the fixed offset coefficient, is performed by the shift operations (sum<<5) and (sum<<4) of the sum and the addition of the results of the shift operations (sum<<5) and (sum<<4). Thus, the variable oW of Expression (257) is calculated.


Thereafter, in both cases of A and B of FIG. 3, the operation of the expression in which the shift amount sW in Expression (258) is replaced with five that is the fixed shift amount is performed using the variable oW.
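The overall operation described above can be sketched as follows. The sketch assumes that Expression (257) computes oW = (1 << (sW − 1)) − fO*Σp[i] and that Expression (258) computes predMip[x][y] = ((Σ mWeight[i][j]*p[i] + oW) >> sW) + pTemp[0], consistent with the structure described in the text, with fO and sW replaced by the fixed values 48 and 5; the weight values and change amounts used in the test are hypothetical, not values from Reference REF 3:

```python
def fixed_mip_pixel(weights, p, p_temp0):
    """One predicted pixel of the second MIP method with the fixed offset
    coefficient 48 and the fixed shift amount 5 (sketch; expression forms
    assumed from Reference REF 3)."""
    total = sum(p)                                   # sum = sigma p[i]
    # Variable oW of Expression (257): 48 * sum done as (sum<<5) + (sum<<4).
    o_w = (1 << (5 - 1)) - ((total << 5) + (total << 4))
    acc = sum(w * pi for w, pi in zip(weights, p))   # sigma mWeight[i][j] * p[i]
    # Expression (258) with sW replaced by the fixed shift amount 5.
    return ((acc + o_w) >> 5) + p_temp0
```

For instance, with the hypothetical inputs weights=[10, 20, 30, 40], p=[1, -2, 3, -4], and pTemp[0]=100, the sum is −2, oW is 16 + 96 = 112, the weighted sum is −100, and the predicted pixel is ((−100 + 112) >> 5) + 100 = 100.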


Here, a combination of the MipSizeId and the modeId is represented as (M, m). M represents the MipSizeId, and m represents the modeId. Furthermore, the weight matrix mWeight[i][j] in a case where the fixed offset coefficient and the fixed shift amount are employed will also be referred to as a fixed weight matrix mWeight[i][j].



FIG. 4 is a diagram illustrating an example of the weight matrix mWeight[i][j] of (M, m)=(0, 0) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.


In FIG. 4, the (i+1)th value from the left and the (j+1)th value from the top represent the fixed weight matrix mWeight[i][j]. This similarly applies to the following drawings.


Note that the fixed weight matrix mWeight[i][j] of (M, m)=(0, 0) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount is not limited to the values illustrated in FIG. 4.


The fixed weight matrix mWeight[i][j] of (M, m)=(0, 0) can be appropriately changed within a range in which the operation is performed using the fixed offset coefficient and the fixed shift amount and the corresponding technical effects are exhibited. Furthermore, since the range in which the technical effects are exhibited varies depending on the approximation level to be set, the fixed weight matrix mWeight[i][j] of (M, m)=(0, 0) can be changed as appropriate within that range. For example, the change can be made within a range of ±1, or within a range of ±3. Moreover, not only can all the values be changed uniformly, but also only some of the values can be changed. Furthermore, it is also possible to individually set a range of values to be changed with respect to the existing values.


That is, for example, the fixed weight matrix mWeight[i][j] of (M, m)=(0, 0) can be appropriately changed within a range in which a technical effect such as ensuring predetermined prediction accuracy or more for the predicted image is appropriately exhibited when the operations of Expressions (258) and (257) are performed using the fixed offset coefficient and the fixed shift amount.


Furthermore, the range (degree) in which the technical effect is exhibited changes depending on the approximation level that is set.


The approximation level means the degree to which the fixed predicted pixel predMip[x][y] is approximated to the true value of the fixed predicted pixel predMip[x][y] or the standard predicted pixel predMip[x][y]. The fixed predicted pixel predMip[x][y] means (a pixel value of) a predicted pixel obtained by MIP using the fixed offset coefficient and the fixed shift amount. The standard predicted pixel predMip[x][y] means a predicted pixel obtained by MIP using the offset factor fO and the shift amount sW described in Reference REF 3.


The fixed weight matrix mWeight[i][j] of (M, m)=(0, 0) can be appropriately changed within a range in which the set approximation level can be maintained. For example, the fixed weight matrix mWeight[i][j] of (M, m)=(0, 0) can be changed within a range of ±1 or within a range of ±3 with reference to the value illustrated in FIG. 4.
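The permitted per-value perturbation described above can be expressed as a simple range check. The reference and changed weight values below are hypothetical, chosen only to illustrate the ±1 and ±3 ranges, and are not values from FIG. 4:

```python
def within_range(changed, reference, delta):
    """Check that every changed weight stays within +/-delta of its reference value."""
    return all(abs(c - r) <= delta
               for row_c, row_r in zip(changed, reference)
               for c, r in zip(row_c, row_r))

ref = [[24, 25], [30, 31]]   # hypothetical reference fixed weight matrix
new = [[25, 24], [33, 30]]   # hypothetical changed matrix (max deviation 3)
assert within_range(new, ref, 3)
assert not within_range(new, ref, 1)
```

A uniform delta corresponds to the uniform range mentioned above; replacing delta with a per-element matrix of bounds would correspond to the individually set ranges.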


The change of the fixed weight matrix mWeight[i][j] of (M, m)=(0, 0) can be performed for all of the fixed weight matrices mWeight[i][j] of (M, m)=(0, 0), and can be performed for only a part of the fixed weight matrix mWeight[i][j] of (M, m)=(0, 0).


Moreover, as the range of values for changing the fixed weight matrix mWeight[i][j] of (M, m)=(0, 0), a uniform range can be employed for all of the fixed weight matrices mWeight[i][j] of (M, m)=(0, 0), or an individual range can be employed for each fixed weight matrix mWeight[i][j] of (M, m)=(0, 0).


For example, the range of values for changing the fixed weight matrix mWeight[i][j] of (M, m)=(0, 0) can be individually set for each value of the corresponding weight matrix mWeight[i][j].


The above point similarly applies to the fixed weight matrix mWeight[i][j] other than (M, m)=(0, 0).



FIG. 5 is a diagram illustrating an example of the weight matrix mWeight[i][j] of (M, m)=(0, 1) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 6 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 2) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 7 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 3) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 8 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 4) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 9 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 5) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 10 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 6) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 11 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 7) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 12 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 8) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 13 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 9) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 14 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 10) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 15 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 11) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 16 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 12) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 17 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 13) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 18 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 14) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 19 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 15) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 20 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 0) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 21 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 1) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 22 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 2) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 23 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 3) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 24 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 4) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 25 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 5) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 26 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 6) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 27 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 7) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 28 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 0) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 29 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 1) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 30 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 2) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 31 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 3) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 32 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 4) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.



FIG. 33 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 5) in the case where 48 is employed as the fixed offset coefficient and five is employed as the fixed shift amount.


<MIP in Case Where 96 is Employed as Fixed Offset Coefficient and Six is Employed as Fixed Shift Amount>



FIG. 34 is a diagram describing the MIP in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.


In this case, in the MIP, as illustrated in FIG. 34, an operation is performed in which the offset factor fO and the shift amount sW in Expressions (258) and (257) are replaced with 96 as the fixed offset coefficient and six as the fixed shift amount, respectively.


A of FIG. 34 illustrates an operation performed by the MIP in a case where multiplication 96*(Σp[i]) of the sum Σp[i] of the change amounts p[i] of the pixel values and 96 that is the fixed offset coefficient is performed as it is as the multiplication in the operation in which the offset factor fO and the shift amount sW in Expression (257) are replaced with 96 that is the fixed offset coefficient and six that is the fixed shift amount, respectively.


B of FIG. 34 illustrates an operation performed by the MIP in a case where the multiplication 96*(Σp[i]) of the sum Σp[i] of the change amounts p[i] of the pixel values and 96 that is the fixed offset coefficient is performed by the shift operation and the addition in the operation in which the offset factor fO and the shift amount sW in Expression (257) are replaced with 96 that is the fixed offset coefficient and six that is the fixed shift amount, respectively.


In a case where the multiplication 96*(Σp[i]) of the sum Σp[i] of the change amounts p[i] of the pixel values and 96 that is the fixed offset coefficient is performed as it is as the multiplication in the operation in which the offset factor fO and the shift amount sW in Expression (257) are replaced with 96 that is the fixed offset coefficient and six that is the fixed shift amount, respectively, the operation illustrated in A of FIG. 34 is performed.


That is, the calculation of the expressions in which the offset factor fO and the shift amount sW in Expressions (258) and (257) are replaced with 96 that is the fixed offset coefficient and six that is the fixed shift amount, respectively, is performed.


In a case where the multiplication 96*(Σp[i]) of the sum Σp[i] of the change amounts p[i] of the pixel values and 96 that is the fixed offset coefficient is performed by the shift operation and the addition in the operation in which the offset factor fO and the shift amount sW in Expression (257) are replaced with 96 that is the fixed offset coefficient and six that is the fixed shift amount, respectively, the operation illustrated in B of FIG. 34 is performed.


That is, the sum sum=Σp[i] of the change amounts p[i] of the pixel values is calculated. Then, the multiplication 96*(Σp[i]) of the sum Σp[i] of the change amounts p[i] of the pixel values by 96, which is the fixed offset coefficient, is performed by the shift operations (sum<<6) and (sum<<5) of the sum and the addition of the results of the shift operations (sum<<6) and (sum<<5). Thus, the variable oW of Expression (257) is calculated.


Thereafter, in both cases of A and B of FIG. 34, the operation of the expression in which the shift amount sW in Expression (258) is replaced with six that is the fixed shift amount is performed using the variable oW.
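The oW computation for this case can be sketched in parallel with the 48/5 case. The form of Expression (257) is assumed from Reference REF 3, with fO and sW replaced by the fixed values 96 and 6; the change amounts in the test are hypothetical:

```python
def o_w_96(p):
    """Variable oW of Expression (257) with fixed offset coefficient 96 and
    fixed shift amount 6: 96 * sum is computed as (sum << 6) + (sum << 5)."""
    total = sum(p)                         # sum = sigma p[i]
    return (1 << (6 - 1)) - ((total << 6) + (total << 5))

# The shift-and-add form agrees with direct multiplication by 96.
assert all(o_w_96([s]) == 32 - 96 * s for s in range(-512, 513))
```

For example, for change amounts p = [1, 2, 3] the sum is 6 and oW = 32 − 576 = −544.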



FIG. 35 is a diagram illustrating an example of the weight matrix mWeight[i][j] of (M, m)=(0, 0) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 36 is a diagram illustrating an example of the weight matrix mWeight[i][j] of (M, m)=(0, 1) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 37 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 2) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 38 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 3) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 39 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 4) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 40 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 5) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 41 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 6) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 42 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 7) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 43 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 8) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 44 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 9) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 45 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 10) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 46 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 11) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 47 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 12) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 48 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 13) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 49 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 14) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 50 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(0, 15) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 51 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 0) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 52 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 1) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 53 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 2) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 54 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 3) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 55 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 4) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 56 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 5) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 57 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 6) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 58 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(1, 7) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 59 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 0) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 60 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 1) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 61 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 2) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 62 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 3) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 63 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 4) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.



FIG. 64 is a diagram illustrating the weight matrix mWeight[i][j] of (M, m)=(2, 5) in a case where 96 is employed as the fixed offset coefficient and six is employed as the fixed shift amount.


<Image Processing System to Which Present Technology is Applied>



FIG. 65 is a block diagram illustrating a configuration example of an embodiment of an image processing system to which the present technology is applied.


The image processing system 10 includes an image processing device as an encoder 11 and an image processing device as a decoder 51.


The encoder 11 encodes an original image that is an encoding target supplied thereto, and outputs a coded bit stream obtained by the encoding. The coded bit stream is supplied to the decoder 51 via a recording medium or a transmission medium that is not illustrated.


The decoder 51 decodes the coded bit stream supplied thereto, and outputs a decoded image obtained by the decoding.


<Configuration Example of Encoder 11>



FIG. 66 is a block diagram illustrating a configuration example of the encoder 11 in FIG. 65.


Note that in the block diagram described below, a description of a line that supplies information (data) required for processing of each block is omitted as appropriate in order to avoid complication of the diagram.


In FIG. 66, the encoder 11 includes an A/D conversion unit 21, a rearrangement buffer 22, an operation unit 23, an orthogonal transform unit 24, a quantization unit 25, a reversible encoding unit 26, and an accumulation buffer 27. Moreover, the encoder 11 includes an inverse quantization unit 28, an inverse orthogonal transform unit 29, an operation unit 30, a frame memory 32, a selection unit 33, an intra prediction unit 34, a motion prediction compensation unit 35, a predicted image selection unit 36, and a rate control unit 37. Furthermore, the encoder 11 includes a deblocking filter 31a, an adaptive offset filter 41, and an adaptive loop filter (ALF) 42.


The A/D conversion unit 21 A/D-converts an original image (encoding target) of an analog signal into an original image of a digital signal, and supplies the original image to the rearrangement buffer 22 for storage. Note that in a case where the original image of the digital signal is supplied to the encoder 11, the encoder 11 can be configured without the A/D conversion unit 21.


The rearrangement buffer 22 rearranges the frames of the original image from a display order to an encoding (decoding) order according to a group of pictures (GOP), and supplies the original image to the operation unit 23, the intra prediction unit 34, and the motion prediction compensation unit 35.


The operation unit 23 subtracts the predicted image supplied from the intra prediction unit 34 or the motion prediction compensation unit 35 via the predicted image selection unit 36 from the original image from the rearrangement buffer 22, and supplies a residual (prediction residual) obtained by the subtraction to the orthogonal transform unit 24.


The orthogonal transform unit 24 performs orthogonal transform such as discrete cosine transform or Karhunen-Loeve transform on the residual supplied from the operation unit 23, and supplies an orthogonal transform coefficient obtained by the orthogonal transform to the quantization unit 25.


The quantization unit 25 quantizes the orthogonal transform coefficient supplied from the orthogonal transform unit 24. The quantization unit 25 sets a quantization parameter on the basis of the target value (code amount target value) of the code amount supplied from the rate control unit 37 and quantizes the orthogonal transform coefficient. The quantization unit 25 supplies the coded data, which is the quantized orthogonal transform coefficient, to the reversible encoding unit 26.


The reversible encoding unit 26 encodes the quantized orthogonal transform coefficient as the coded data from the quantization unit 25 by a predetermined reversible encoding method.


Furthermore, the reversible encoding unit 26 acquires, from each block, encoding information necessary for decoding by the decoder 51 out of the encoding information related to predictive encoding by the encoder 11.


Here, examples of the encoding information include prediction modes of intra prediction and inter prediction, motion information such as a motion vector, a code amount target value, a quantization parameter, a picture type (I, P, B), filter parameters of the deblocking filter 31a and the adaptive offset filter 41, and the like.


The prediction mode can be acquired from the intra prediction unit 34 or the motion prediction compensation unit 35. The motion information can be acquired from the motion prediction compensation unit 35. The filter parameters of the deblocking filter 31a and the adaptive offset filter 41 can be acquired from the deblocking filter 31a and the adaptive offset filter 41, respectively.


The reversible encoding unit 26 encodes the encoding information by, for example, variable-length encoding or arithmetic encoding such as context-adaptive variable length coding (CAVLC) or context-adaptive binary arithmetic coding (CABAC), or any other reversible encoding method, generates (multiplexes) a coded bit stream including the encoded encoding information and the coded data from the quantization unit 25, and supplies the coded bit stream to the accumulation buffer 27.


Here, the operation unit 23 to the reversible encoding unit 26 described above constitute an encoding unit that encodes an image, and processing (step) performed by the encoding unit is an encoding step.


The accumulation buffer 27 temporarily accumulates the coded bit stream supplied from the reversible encoding unit 26. The coded bit stream accumulated in the accumulation buffer 27 is read and transmitted at a predetermined timing.


The coded data that is the orthogonal transform coefficient quantized in the quantization unit 25 is supplied to the reversible encoding unit 26 and also supplied to the inverse quantization unit 28. The inverse quantization unit 28 inversely quantizes the quantized orthogonal transform coefficient by a method corresponding to the quantization by the quantization unit 25, and supplies the orthogonal transform coefficient obtained by the inverse quantization to the inverse orthogonal transform unit 29.


The inverse orthogonal transform unit 29 inversely orthogonally transforms the orthogonal transform coefficient supplied from the inverse quantization unit 28 by a method corresponding to orthogonal transform processing by the orthogonal transform unit 24, and supplies the residual obtained as a result of the inverse orthogonal transform to the operation unit 30.


The operation unit 30 adds the predicted image supplied from the intra prediction unit 34 or the motion prediction compensation unit 35 via the predicted image selection unit 36 to the residual supplied from the inverse orthogonal transform unit 29, thereby obtaining and outputting (a part of) the decoded image obtained by decoding the original image.
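The data flow from the operation unit 23 through the quantization unit 25, together with the local decoding path from the inverse quantization unit 28 to the operation unit 30, can be sketched as follows. This is a minimal illustration using a one-dimensional orthonormal DCT-II and a uniform scalar quantizer; the function names, block size, and quantization step are assumptions for illustration only and do not correspond to the actual transform and quantization of the encoder 11.

```python
import math

def dct(x):
    # Orthonormal 1-D DCT-II, a stand-in for the orthogonal transform unit 24.
    n = len(x)
    out = []
    for k in range(n):
        s = math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n)
        out.append(s * sum(x[i] * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                           for i in range(n)))
    return out

def idct(c):
    # Inverse (DCT-III), matching the inverse orthogonal transform unit 29.
    n = len(c)
    return [sum((math.sqrt(1.0 / n) if k == 0 else math.sqrt(2.0 / n))
                * c[k] * math.cos(math.pi * (2 * i + 1) * k / (2 * n))
                for k in range(n))
            for i in range(n)]

def encode_block(original, predicted, step):
    # Operation unit 23: prediction residual (original minus predicted image).
    residual = [o - p for o, p in zip(original, predicted)]
    # Orthogonal transform unit 24 and quantization unit 25.
    return [round(c / step) for c in dct(residual)]

def reconstruct_block(quantized, predicted, step):
    # Inverse quantization unit 28 and inverse orthogonal transform unit 29.
    residual = idct([q * step for q in quantized])
    # Operation unit 30: add the predicted image back to the residual.
    return [p + r for p, r in zip(predicted, residual)]

original  = [52, 55, 61, 66, 70, 61, 64, 73]
predicted = [50, 50, 60, 60, 70, 70, 60, 60]
q = encode_block(original, predicted, step=2.0)
recon = reconstruct_block(q, predicted, step=2.0)
```

Because the transform is orthonormal, the per-sample reconstruction error is bounded by (step/2) times the square root of the block size, which is about 2.8 in this example.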


The decoded image output from the operation unit 30 is supplied to the deblocking filter 31a or the frame memory 32.


The frame memory 32 temporarily stores the decoded image supplied from the operation unit 30 and the decoded image (filtered image) supplied from the ALF 42, to which the deblocking filter 31a, the adaptive offset filter 41, and the ALF 42 have been applied. The decoded image stored in the frame memory 32 is supplied to the selection unit 33 as a reference image to be used for generating a predicted image at a necessary timing.


The selection unit 33 selects a supply destination of the reference image supplied from the frame memory 32. In a case where intra prediction is performed in the intra prediction unit 34, the selection unit 33 supplies the reference image supplied from the frame memory 32 to the intra prediction unit 34. In a case where inter prediction is performed in the motion prediction compensation unit 35, the selection unit 33 supplies the reference image supplied from the frame memory 32 to the motion prediction compensation unit 35.


The intra prediction unit 34 performs intra prediction (intra screen prediction) using the original image supplied from the rearrangement buffer 22 and the reference image supplied from the frame memory 32 via the selection unit 33. The intra prediction unit 34 selects an optimum prediction mode of intra prediction on the basis of a predetermined cost function, and supplies a predicted image generated from the reference image in the optimum prediction mode of intra prediction to the predicted image selection unit 36. Furthermore, the intra prediction unit 34 appropriately supplies the prediction mode of intra prediction selected on the basis of the cost function to the reversible encoding unit 26 and the like.


The motion prediction compensation unit 35 performs motion prediction using the original image supplied from the rearrangement buffer 22 and the reference image supplied from the frame memory 32 via the selection unit 33. Moreover, the motion prediction compensation unit 35 performs motion compensation according to a motion vector detected by motion prediction, and generates a predicted image. The motion prediction compensation unit 35 performs inter prediction in a plurality of prediction modes of inter prediction prepared in advance, and generates a predicted image from the reference image.


The motion prediction compensation unit 35 selects an optimum prediction mode of inter prediction from among a plurality of prediction modes of inter prediction on the basis of a predetermined cost function. Moreover, the motion prediction compensation unit 35 supplies the predicted image generated in the optimum prediction mode of inter prediction to the predicted image selection unit 36.


Furthermore, the motion prediction compensation unit 35 supplies the optimum prediction mode of inter prediction selected on the basis of the cost function, the motion information such as the motion vector necessary for decoding the coded data that is coded in the prediction mode of inter prediction, and the like to the reversible encoding unit 26.


The predicted image selection unit 36 selects a supply source of the predicted image to be supplied to the operation unit 23 and the operation unit 30 from among the intra prediction unit 34 and the motion prediction compensation unit 35, and supplies the predicted image supplied from the selected supply source to the operation unit 23 and the operation unit 30.


The rate control unit 37 controls the rate of the quantization operation of the quantization unit 25 on the basis of the code amount of the coded bit stream accumulated in the accumulation buffer 27 so that overflow or underflow does not occur. That is, the rate control unit 37 sets a target code amount of the coded bit stream so that overflow and underflow of the accumulation buffer 27 do not occur, and supplies the target code amount to the quantization unit 25.
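A buffer-based rate control of this kind can be sketched as follows. The thresholds and adjustment factors below are assumed values for illustration and are not the control law actually used by the rate control unit 37.

```python
def update_target(buffer_fullness, capacity, base_target):
    """Set a target code amount so that the accumulation buffer neither
    overflows nor underflows (simplified stand-in for rate control unit 37)."""
    occupancy = buffer_fullness / capacity
    if occupancy > 0.8:   # buffer nearly full: spend fewer bits per picture
        return int(base_target * 0.5)
    if occupancy < 0.2:   # buffer nearly empty: allow more bits per picture
        return int(base_target * 1.5)
    return base_target

# Example: a nearly full buffer lowers the target code amount.
target = update_target(90, 100, 1000)
```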


The deblocking filter 31a applies a deblocking filter to the decoded image from the operation unit 30 as necessary, and supplies a decoded image (filtered image) to which the deblocking filter is applied or a decoded image to which the deblocking filter is not applied to the adaptive offset filter 41.


The adaptive offset filter 41 applies an adaptive offset filter to the decoded image from the deblocking filter 31a as necessary, and supplies the decoded image (filtered image) to which the adaptive offset filter is applied or the decoded image to which the adaptive offset filter is not applied to the ALF 42.


The ALF 42 applies the ALF to the decoded image from the adaptive offset filter 41 as necessary, and supplies the decoded image to which the ALF is applied or the decoded image to which the ALF is not applied to the frame memory 32.
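The in-loop filter chain described above (deblocking filter 31a, adaptive offset filter 41, ALF 42), in which each stage is applied "as necessary", can be sketched as a conditional pipeline. The filter functions here are hypothetical placeholders; real in-loop filters operate on pixel neighborhoods according to filter parameters.

```python
def apply_filter_chain(decoded, deblock=None, sao=None, alf=None):
    """Conditional in-loop filter pipeline: each stage is applied only when
    a filter function is supplied; otherwise the image passes through,
    mirroring the "as necessary" behavior of units 31a, 41, and 42."""
    image = decoded
    for stage in (deblock, sao, alf):
        if stage is not None:
            image = stage(image)
    return image

# Example with a hypothetical stand-in filter that brightens each sample.
filtered = apply_filter_chain([1, 2, 3], deblock=lambda im: [x + 1 for x in im])
```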


<Encoding Processing>



FIG. 67 is a flowchart describing an example of encoding processing of the encoder 11 in FIG. 66.


Note that the order of the respective steps of the encoding processing illustrated in FIG. 67 is an order for convenience of description, and the steps of the actual encoding processing are performed in a necessary order, in parallel as appropriate. This similarly applies to processing described later.


In the encoder 11, in step S11, the A/D conversion unit 21 A/D-converts the original image and supplies the original image to the rearrangement buffer 22, and the processing proceeds to step S12.


In step S12, the rearrangement buffer 22 stores the original image from the A/D conversion unit 21, rearranges the original image in the encoding order, and outputs the original image, and the processing proceeds to step S13.


In step S13, the intra prediction unit 34 performs intra prediction (intra prediction step), and the processing proceeds to step S14. In step S14, the motion prediction compensation unit 35 performs inter prediction for performing motion prediction or motion compensation, and the processing proceeds to step S15.


In the intra prediction of the intra prediction unit 34 and the inter prediction of the motion prediction compensation unit 35, cost functions of various prediction modes are calculated, and a predicted image is generated.


In step S15, the predicted image selection unit 36 determines the optimum prediction mode on the basis of each cost function obtained by the intra prediction unit 34 and the motion prediction compensation unit 35. Then, the predicted image selection unit 36 selects and outputs a predicted image in the optimum prediction mode from the predicted image generated by the intra prediction unit 34 and the predicted image generated by the motion prediction compensation unit 35, and the processing proceeds from step S15 to step S16.
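The selection "on the basis of a predetermined cost function" in steps S13 to S15 can be sketched as follows, assuming a rate-distortion cost of the common form J = D + lambda * R. This assumed form, the candidate modes, and the estimated bit counts are illustrative only; the actual cost function of the encoder 11 is not specified here.

```python
def select_mode(original, candidates, lam):
    """Pick the candidate predicted image minimizing J = D + lam * R,
    where D is the sum of squared errors and R is an estimated bit cost."""
    best = None
    for mode, (predicted, est_bits) in candidates.items():
        d = sum((o - p) ** 2 for o, p in zip(original, predicted))
        j = d + lam * est_bits
        if best is None or j < best[1]:
            best = (mode, j)
    return best[0]

original = [10, 12, 14, 16]
candidates = {
    "intra_dc": ([13, 13, 13, 13], 2),  # cheap to signal but less accurate
    "inter":    ([10, 12, 14, 15], 8),  # accurate but costs more bits
}
chosen = select_mode(original, candidates, lam=1.0)
```

With a small lambda the accurate inter candidate wins; raising lambda makes the cheaper intra candidate preferable, which is the trade-off the cost function expresses.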


In step S16, the operation unit 23 calculates the residual between the encoding target image that is the original image output from the rearrangement buffer 22 and the predicted image output from the predicted image selection unit 36, and supplies the residual to the orthogonal transform unit 24, and the processing proceeds to step S17.


In step S17, the orthogonal transform unit 24 orthogonally transforms the residual from the operation unit 23 and supplies an orthogonal transform coefficient obtained as a result to the quantization unit 25, and the processing proceeds to step S18.


In step S18, the quantization unit 25 quantizes the orthogonal transform coefficient from the orthogonal transform unit 24 and supplies a quantization coefficient obtained by the quantization to the reversible encoding unit 26 and the inverse quantization unit 28, and the processing proceeds to step S19.


In step S19, the inverse quantization unit 28 inversely quantizes the quantization coefficient from the quantization unit 25 and supplies the orthogonal transform coefficient obtained as a result to the inverse orthogonal transform unit 29, and the processing proceeds to step S20. In step S20, the inverse orthogonal transform unit 29 inversely orthogonally transforms the orthogonal transform coefficient from the inverse quantization unit 28 and supplies the residual obtained as a result to the operation unit 30, and the processing proceeds to step S21.


In step S21, the operation unit 30 adds the residual from the inverse orthogonal transform unit 29 and the predicted image output by the predicted image selection unit 36, and generates a decoded image corresponding to the original image that is the target of operation of the residual in the operation unit 23. The operation unit 30 supplies the decoded image to the deblocking filter 31a, and the processing proceeds from step S21 to step S22.


In step S22, the deblocking filter 31a applies the deblocking filter to the decoded image from the operation unit 30 and supplies the filtered image obtained as a result to the adaptive offset filter 41, and the processing proceeds to step S23.


In step S23, the adaptive offset filter 41 applies the adaptive offset filter to the filtered image from the deblocking filter 31a and supplies the filtered image obtained as a result to the ALF 42, and the processing proceeds to step S24.


In step S24, the ALF 42 applies the ALF to the filtered image from the adaptive offset filter 41 and supplies the filtered image obtained as a result to the frame memory 32, and the processing proceeds to step S25.


In step S25, the frame memory 32 stores the filtered image supplied from the ALF 42, and the processing proceeds to step S26. The filtered image stored in the frame memory 32 is used as a reference image from which the predicted image is generated in steps S13 and S14.


In step S26, the reversible encoding unit 26 encodes the coded data that is the quantization coefficient from the quantization unit 25, and generates a coded bit stream including the coded data. Moreover, the reversible encoding unit 26 encodes, as necessary, encoding information such as the quantization parameter used for quantization by the quantization unit 25, the prediction mode obtained by intra prediction by the intra prediction unit 34, the prediction mode and motion information obtained by inter prediction by the motion prediction compensation unit 35, and filter parameters of the deblocking filter 31a and the adaptive offset filter 41, and includes the encoding information in the coded bit stream.


Then, the reversible encoding unit 26 supplies the coded bit stream to the accumulation buffer 27, and the processing proceeds from step S26 to step S27.


In step S27, the accumulation buffer 27 accumulates the coded bit stream from the reversible encoding unit 26, and the processing proceeds to step S28. The coded bit stream accumulated in the accumulation buffer 27 is appropriately read and transmitted.


In step S28, the rate control unit 37 controls the rate of the quantization operation of the quantization unit 25 so that overflow or underflow does not occur on the basis of the code amount (generated code amount) of the coded bit stream accumulated in the accumulation buffer 27, and the encoding processing ends.


<Configuration Example of Decoder 51>



FIG. 68 is a block diagram illustrating a detailed configuration example of the decoder 51 in FIG. 65.


In FIG. 68, the decoder 51 includes an accumulation buffer 61, a reversible decoding unit 62, an inverse quantization unit 63, an inverse orthogonal transform unit 64, an operation unit 65, a rearrangement buffer 67, and a D/A conversion unit 68. Moreover, the decoder 51 includes a frame memory 69, a selection unit 70, an intra prediction unit 71, a motion prediction compensation unit 72, and a selection unit 73. Furthermore, the decoder 51 includes a deblocking filter 31b, an adaptive offset filter 81, and an ALF 82.


The accumulation buffer 61 temporarily accumulates the coded bit stream transmitted from the encoder 11, and supplies the coded bit stream to the reversible decoding unit 62 at a predetermined timing.


The reversible decoding unit 62 receives the coded bit stream from the accumulation buffer 61, and decodes the coded bit stream by a method corresponding to the encoding method of the reversible encoding unit 26 in FIG. 66.


Then, the reversible decoding unit 62 supplies the quantization coefficient as the coded data included in the decoding result of the coded bit stream to the inverse quantization unit 63.


Furthermore, the reversible decoding unit 62 has a function of performing parsing. The reversible decoding unit 62 parses necessary encoding information included in the decoding result of the coded bit stream, and supplies the encoding information to the intra prediction unit 71, the motion prediction compensation unit 72, the deblocking filter 31b, the adaptive offset filter 81, and other necessary blocks.


The inverse quantization unit 63 inversely quantizes the quantization coefficient as the coded data from the reversible decoding unit 62 by a method corresponding to the quantization method of the quantization unit 25 in FIG. 66, and supplies an orthogonal transform coefficient obtained by the inverse quantization to the inverse orthogonal transform unit 64.


The inverse orthogonal transform unit 64 inversely orthogonally transforms the orthogonal transform coefficient supplied from the inverse quantization unit 63 by a method corresponding to the orthogonal transform method of the orthogonal transform unit 24 in FIG. 66, and supplies the residual obtained as a result to the operation unit 65.


The operation unit 65 is supplied with the residual from the inverse orthogonal transform unit 64, and is supplied with the predicted image from the intra prediction unit 71 or the motion prediction compensation unit 72 via the selection unit 73.


The operation unit 65 adds the residual from the inverse orthogonal transform unit 64 and the predicted image from the selection unit 73, generates a decoded image, and supplies the decoded image to the deblocking filter 31b.
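The decoder-side reconstruction (inverse quantization unit 63, inverse orthogonal transform unit 64, and operation unit 65) can be sketched as follows, with an explicit clip to the valid sample range. The identity inverse transform used in the example and the 8-bit sample assumption are stand-ins for illustration; they are not the actual inverse orthogonal transform or bit depth of the decoder 51.

```python
def decode_block(quantized, predicted, step, inverse_transform, bit_depth=8):
    """Decoder-side reconstruction: inverse quantization, inverse transform,
    and addition of the predicted image, followed by a clip to the sample range."""
    # Inverse quantization unit 63 and inverse orthogonal transform unit 64.
    residual = inverse_transform([q * step for q in quantized])
    # Operation unit 65: add prediction; clip to the assumed sample range.
    max_val = (1 << bit_depth) - 1
    return [min(max(round(p + r), 0), max_val)
            for p, r in zip(predicted, residual)]

# Identity inverse transform as an assumed stand-in.
recon = decode_block([5, -3], [100, 200], 2.0, lambda c: c)
```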


Here, the reversible decoding unit 62 to the operation unit 65 described above constitute a decoding unit that decodes an image, and processing (step) performed by the decoding unit is a decoding step.


The rearrangement buffer 67 temporarily stores the decoded image supplied from the ALF 82, rearranges the frames (pictures) of the decoded image from the encoding (decoding) order into the display order, and supplies the decoded image to the D/A conversion unit 68.


The D/A conversion unit 68 D/A converts the decoded image supplied from the rearrangement buffer 67, outputs the decoded image to a display (not illustrated), and causes the display to display the decoded image. Note that in a case where a device connected to the decoder 51 receives an image of a digital signal, the decoder 51 can be configured without the D/A conversion unit 68.


The frame memory 69 temporarily stores the decoded image supplied from the ALF 82. Moreover, the frame memory 69 supplies the decoded image to the selection unit 70 as the reference image to be used for generation of the predicted image at a predetermined timing or on the basis of an external request of the intra prediction unit 71, the motion prediction compensation unit 72, or the like.


The selection unit 70 selects a supply destination of the reference image supplied from the frame memory 69. In a case of decoding an image encoded by intra prediction, the selection unit 70 supplies the reference image supplied from the frame memory 69 to the intra prediction unit 71. Furthermore, in a case of decoding an image encoded by inter prediction, the selection unit 70 supplies the reference image supplied from the frame memory 69 to the motion prediction compensation unit 72.


The intra prediction unit 71 performs intra prediction similar to that of the intra prediction unit 34 in FIG. 66 by using the reference image supplied from the frame memory 69 via the selection unit 70 in accordance with the prediction mode included in the encoding information supplied from the reversible decoding unit 62. Then, the intra prediction unit 71 supplies the predicted image obtained by the intra prediction to the selection unit 73.


The motion prediction compensation unit 72 performs inter prediction using the reference image supplied from the frame memory 69 via the selection unit 70, similarly to the motion prediction compensation unit 35 in FIG. 66, in accordance with the prediction mode included in the encoding information supplied from the reversible decoding unit 62. The inter prediction is performed using motion information or the like included in the encoding information supplied from the reversible decoding unit 62 as necessary.


The motion prediction compensation unit 72 supplies the predicted image obtained by the inter prediction to the selection unit 73.


The selection unit 73 selects the predicted image supplied from the intra prediction unit 71 or the predicted image supplied from the motion prediction compensation unit 72, and supplies the selected predicted image to the operation unit 65.


The deblocking filter 31b applies the deblocking filter to the decoded image from the operation unit 65 according to the filter parameter included in the encoding information supplied from the reversible decoding unit 62. The deblocking filter 31b supplies the decoded image (filtered image) to which the deblocking filter is applied or the decoded image to which the deblocking filter is not applied to the adaptive offset filter 81.


The adaptive offset filter 81 applies the adaptive offset filter to the decoded image from the deblocking filter 31b as necessary according to the filter parameter included in the encoding information supplied from the reversible decoding unit 62. The adaptive offset filter 81 supplies the ALF 82 with the decoded image (filtered image) to which the adaptive offset filter is applied or the decoded image to which the adaptive offset filter is not applied.


The ALF 82 applies the ALF to the decoded image from the adaptive offset filter 81 as necessary, and supplies the decoded image to which the ALF is applied or the decoded image to which the ALF is not applied to the rearrangement buffer 67 and the frame memory 69.


<Decoding Processing>



FIG. 69 is a flowchart describing an example of decoding processing of the decoder 51 in FIG. 68.


In the decoding processing, in step S51, the accumulation buffer 61 temporarily accumulates the coded bit stream transmitted from the encoder 11 and supplies the coded bit stream to the reversible decoding unit 62 as appropriate, and the processing proceeds to step S52.


In step S52, the reversible decoding unit 62 receives and decodes the coded bit stream supplied from the accumulation buffer 61, and supplies the quantization coefficient as the coded data included in the decoding result of the coded bit stream to the inverse quantization unit 63.


Furthermore, the reversible decoding unit 62 parses the encoding information included in the decoding result of the coded bit stream. Then, the reversible decoding unit 62 supplies necessary encoding information to the intra prediction unit 71, the motion prediction compensation unit 72, the deblocking filter 31b, the adaptive offset filter 81, and other necessary blocks.


Then, the processing proceeds from step S52 to step S53, and the intra prediction unit 71 or the motion prediction compensation unit 72 performs intra prediction or inter prediction for generating a predicted image according to the reference image supplied from the frame memory 69 via the selection unit 70 and the encoding information supplied from the reversible decoding unit 62 (intra prediction step or inter prediction step). Then, the intra prediction unit 71 or the motion prediction compensation unit 72 supplies the predicted image obtained by the intra prediction or the inter prediction to the selection unit 73, and the processing proceeds from step S53 to step S54.


In step S54, the selection unit 73 selects the predicted image supplied from the intra prediction unit 71 or the motion prediction compensation unit 72 and supplies the predicted image to the operation unit 65, and the processing proceeds to step S55.


In step S55, the inverse quantization unit 63 inversely quantizes the quantization coefficient from the reversible decoding unit 62 and supplies the orthogonal transform coefficient obtained as a result to the inverse orthogonal transform unit 64, and the processing proceeds to step S56.


In step S56, the inverse orthogonal transform unit 64 inversely orthogonally transforms the orthogonal transform coefficient from the inverse quantization unit 63 and supplies the residual obtained as a result to the operation unit 65, and the processing proceeds to step S57.


In step S57, the operation unit 65 generates the decoded image by adding the residual from the inverse orthogonal transform unit 64 and the predicted image from the selection unit 73. Then, the operation unit 65 supplies the decoded image to the deblocking filter 31b, and the processing proceeds from step S57 to step S58.


In step S58, the deblocking filter 31b applies the deblocking filter to the decoded image from the operation unit 65 according to the filter parameter included in the encoding information supplied from the reversible decoding unit 62. The deblocking filter 31b supplies the filtered image obtained as a result of applying the deblocking filter to the adaptive offset filter 81, and the processing proceeds from step S58 to step S59.


In step S59, the adaptive offset filter 81 applies the adaptive offset filter to the filtered image from the deblocking filter 31b according to the filter parameter included in the encoding information supplied from the reversible decoding unit 62. The adaptive offset filter 81 supplies the filtered image obtained as a result of the application of the adaptive offset filter to the ALF 82, and the processing proceeds from step S59 to step S60.


In step S60, the ALF 82 applies the ALF to the filtered image from the adaptive offset filter 81 and supplies the filtered image obtained as a result to the rearrangement buffer 67 and the frame memory 69, and the processing proceeds to step S61.


In step S61, the frame memory 69 temporarily stores the filtered image supplied from the ALF 82, and the processing proceeds to step S62. The filtered image (decoded image) stored in the frame memory 69 is used as the reference image that is the source for generating the predicted image in the intra prediction or the inter prediction in step S53.


In step S62, the rearrangement buffer 67 rearranges the filtered images supplied from the ALF 82 in the display order and supplies the rearranged filtered images to the D/A conversion unit 68, and the processing proceeds to step S63.


In step S63, the D/A conversion unit 68 D/A-converts the filtered image from the rearrangement buffer 67, and the decoding processing ends. The filtered image (decoded image) after the D/A conversion is output to and displayed on a display (not illustrated).


The intra prediction performed by the intra prediction unit 34 of FIG. 66 and the intra prediction unit 71 of FIG. 68 includes MIP. In the intra prediction units 34 and 71, the predicted image of the MIP can be generated by the second MIP method.


<Others>


The present technology can be applied to any image encoding and decoding method. That is, as long as they do not contradict the present technology described above, the specifications of various processes related to image encoding and decoding, such as transform (inverse transform), quantization (inverse quantization), encoding (decoding), and prediction, are arbitrary, and are not limited to the above-described examples. Furthermore, some of these processes may be omitted as long as they do not contradict the present technology described above.


Furthermore, in the present description, a “block” (not a block indicating a processing unit) used in the description as a partial area of an image (picture) or a processing unit indicates an arbitrary partial area in the picture unless otherwise specified, and its size, shape, characteristics, and the like are not limited. For example, the “block” includes an arbitrary partial area (processing unit) such as a transform block (TB), a transform unit (TU), a prediction block (PB), a prediction unit (PU), a smallest coding unit (SCU), a coding unit (CU), a largest coding unit (LCU), a coding tree block (CTB), a coding tree unit (CTU), a conversion block, a sub-block, a macroblock, a tile, or a slice described in References REF 1 to REF 3 above.


The data units in which the various information described above is set and the data units targeted by the various processes are arbitrary and are not limited to the above-described examples. For example, these pieces of information and processes may be set in every transform unit (TU), transform block (TB), prediction unit (PU), prediction block (PB), coding unit (CU), largest coding unit (LCU), sub-block, block, tile, slice, picture, sequence, or component, or data in those data units may be targeted. Of course, this data unit can be set for every piece of information or process, and it is not necessary that the data units of all the pieces of information or processes are unified. Note that the storage location of these pieces of information is arbitrary, and may be stored in a header, parameter set, or the like of the above-described data units. Furthermore, it may be stored in a plurality of places.


The control information related to the present technology described above may be transmitted from the encoding side to the decoding side. For example, control information (for example, enabled flag) for controlling whether or not to permit (or prohibit) the application of the present technology described above may be transmitted. Furthermore, for example, control information indicating a target to which the above-described present technology is applied (or a target to which the present technology is not applied) may be transmitted. For example, control information specifying a block size (upper limit or lower limit or both), a frame, a component, a layer, or the like to which the present technology is applied (or permit or prohibit the application) may be transmitted.


When specifying the size of a block to which the present technology is applied, the block size may be specified not only directly but also indirectly. For example, the block size may be designated using identification data for identifying the size. Furthermore, for example, the block size may be designated by a ratio or a difference from the size of a reference block (for example, an LCU, an SCU, or the like). For example, in a case where information for designating a block size is transmitted as a syntax element or the like, information for indirectly designating the size as described above may be used. In this manner, the amount of information required can be reduced, which may improve encoding efficiency. Furthermore, the specification of the block size also includes the specification of a range of block sizes (for example, the specification of the range of allowable block sizes, or the like).
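An indirect designation of a block size, for example as a log2 difference from the size of a reference block, can be sketched as follows. The identifier scheme here is an illustrative assumption, not the syntax of any particular codec; both sizes are assumed to be powers of two.

```python
import math

def size_to_id(size, reference_size):
    """Signal a block size indirectly as the log2 difference from a
    reference size (e.g. an SCU), so only a small identifier is coded."""
    return int(math.log2(size)) - int(math.log2(reference_size))

def id_to_size(size_id, reference_size):
    # Recover the block size from the identifier and the reference size.
    return reference_size << size_id

# Signaling 64 relative to a reference of 8 needs only the small identifier 3.
size_id = size_to_id(64, 8)
```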


Note that in the present description, “identification data” is information for identifying a plurality of states, and includes a “flag” and information under other names. Furthermore, the “identification data” includes not only information used to identify the two states of true (1) and false (0) but also information capable of identifying three or more states. Therefore, the value that the “identification data” can take may be, for example, binary (1 or 0) or ternary or more. That is, the number of bits constituting the “identification data” is arbitrary, and may be one bit or a plurality of bits. Furthermore, the identification data is assumed to take a form in which not the identification data itself but difference information of the identification data with respect to certain reference information is included in the bit stream; therefore, in the present description, the “identification data” includes not only that information itself but also the difference information with respect to the reference information.


Furthermore, various types of information (metadata and the like) related to the coded data (bit stream) may be transmitted or recorded in any form as long as the information is associated with the coded data. Here, the term “associate” means, for example, that one piece of data can be used (linked) when the other piece of data is processed. That is, the pieces of data associated with each other may be combined as one piece of data or may be individual pieces of data. For example, information associated with the coded data (image) may be transmitted on a transmission path different from that of the coded data (image). Furthermore, for example, the information associated with the coded data (image) may be recorded in a recording medium different from that of the coded data (image) (or in another recording area of the same recording medium). Note that this “association” may be for a part of the data instead of the entire data. For example, an image and information corresponding to the image may be associated with each other in an arbitrary unit such as a plurality of frames, one frame, or a part in a frame.


Note that in the present description, terms such as “combine”, “multiplex”, “add”, “integrate”, “include”, “store”, “put in”, “plug in”, and “insert” mean to combine a plurality of items into one, for example, such as combining coded data and metadata into one piece of data, and mean one method of the above-described “association”.


The present technology can also be implemented as any configuration constituting a device or a system, for example, a processor as a system large scale integration (LSI) or the like, a module using a plurality of processors or the like, a unit using a plurality of modules or the like, a set obtained by further adding other functions to a unit, or the like (that is, a configuration of a part of the device).


<Description of Computer to Which Present Technology is Applied>


Next, all or part of the above-described series of processes can be performed by hardware or software. In a case where all or a part of the series of processes is performed by software, a program constituting the software is installed in a general-purpose computer or the like.



FIG. 70 is a block diagram illustrating a configuration example of an embodiment of a computer in which a program for executing all or a part of the series of processes described above is installed.


The program can be pre-recorded on a hard disk 905 or ROM 903 as a recording medium incorporated in the computer.


Alternatively, the program can be stored (recorded) in a removable recording medium 911 driven by a drive 909. Such a removable recording medium 911 can be provided as what is called package software. Here, examples of the removable recording medium 911 include a flexible disk, a compact disc read only memory (CD-ROM), a magneto optical (MO) disk, a digital versatile disc (DVD), a magnetic disk, and a semiconductor memory.


Note that in addition to installing the program on the computer from the removable recording medium 911 as described above, the program can be downloaded to the computer via a communication network or a broadcasting network and installed on the incorporated hard disk 905. That is, for example, the program can be transferred to the computer wirelessly from a download site via an artificial satellite for digital satellite broadcasting, or transferred to the computer by wire via a network such as a local area network (LAN) or the Internet.


The computer has an incorporated central processing unit (CPU) 902, and an input-output interface 910 is connected to the CPU 902 via a bus 901.


When a user inputs a command by operating an input unit 907 or the like via the input-output interface 910, the CPU 902 executes the program stored in the read only memory (ROM) 903 accordingly. Alternatively, the CPU 902 loads the program stored in the hard disk 905 into a random access memory (RAM) 904 and executes the program.


Thus, the CPU 902 performs the processing according to the above-described flowcharts or the processing performed by the above-described configurations of the block diagrams. Then, via the input-output interface 910 as necessary, the CPU 902 outputs the processing result from an output unit 906, transmits the processing result from a communication unit 908, or records the processing result on the hard disk 905, for example.


Note that the input unit 907 includes a keyboard, a mouse, a microphone, and the like. Furthermore, the output unit 906 includes a liquid crystal display (LCD), a speaker, and the like.


Here, in the present description, the processing performed by the computer according to the program does not necessarily have to be performed in time series in the order described in the flowcharts. That is, the processing performed by the computer according to the program also includes processing that is executed in parallel or individually (for example, parallel processing or processing by objects).


Furthermore, the program may be processed by one computer (processor) or may be processed in a distributed manner by a plurality of computers. Moreover, the program may be transferred to a distant computer and executed.


Moreover, in the present description, a system means a set of a plurality of components (devices, modules (parts), and the like), and it does not matter whether or not all components are in the same housing. Therefore, both of a plurality of devices housed in separate housings and connected via a network and a single device in which a plurality of modules is housed in one housing are systems.


Note that the embodiments of the present technology are not limited to the above-described embodiments, and various modifications are possible without departing from the gist of the present technology. For example, the present technology can employ a configuration of cloud computing in which one function is shared by a plurality of devices via a network and processed jointly.


Furthermore, each step described in the above-described flowcharts can be executed by one device, or can be executed in a shared manner by a plurality of devices.


Moreover, in a case where a plurality of processes is included in one step, the plurality of processes included in the one step can be executed in a shared manner by a plurality of devices in addition to being executed by one device.


Furthermore, the effects described in the present description are merely examples and are not limitative, and other effects may be provided.


REFERENCE SIGNS LIST




  • 10 Image processing system


  • 11 Encoder


  • 21 A/D conversion unit


  • 22 Rearrangement buffer


  • 23 Operation unit


  • 24 Orthogonal transform unit


  • 25 Quantization unit


  • 26 Reversible encoding unit


  • 27 Accumulation buffer


  • 28 Inverse quantization unit


  • 29 Inverse orthogonal transform unit


  • 30 Operation unit


  • 31a, 31b Deblocking filter


  • 32 Frame memory


  • 33 Selection unit


  • 34 Intra prediction unit


  • 35 Motion prediction compensation unit


  • 36 Predicted image selection unit


  • 37 Rate control unit


  • 41 Adaptive offset filter


  • 42 ALF


  • 51 Decoder


  • 61 Accumulation buffer


  • 62 Reversible decoding unit


  • 63 Inverse quantization unit


  • 64 Inverse orthogonal transform unit


  • 65 Operation unit


  • 67 Rearrangement buffer


  • 68 D/A conversion unit


  • 69 Frame memory


  • 70 Selection unit


  • 71 Intra prediction unit


  • 72 Motion prediction compensation unit


  • 73 Selection unit


  • 81 Adaptive offset filter


  • 82 ALF


  • 901 Bus


  • 902 CPU


  • 903 ROM


  • 904 RAM


  • 905 Hard disk


  • 906 Output unit


  • 907 Input unit


  • 908 Communication unit


  • 909 Drive


  • 910 Input-output interface


  • 911 Removable recording medium


Claims
  • 1. An image processing device comprising: an intra prediction unit that, when performing matrix-based intra prediction that is intra prediction using a matrix operation on a current prediction block to be encoded, performs the matrix-based intra prediction using a coefficient related to a sum of change amounts of pixel values and set to a fixed value, and generates a predicted image of the current prediction block; and an encoding unit that encodes the current prediction block using the predicted image generated by the intra prediction unit.
  • 2. The image processing device according to claim 1, wherein the coefficient is a value represented by a power of two.
  • 3. The image processing device according to claim 2, wherein the coefficient is 32.
  • 4. The image processing device according to claim 1, wherein the intra prediction unit performs the matrix-based intra prediction using the coefficient and a shift amount set to a fixed value.
  • 5. The image processing device according to claim 4, wherein the shift amount is six.
  • 6. The image processing device according to claim 4, wherein the intra prediction unit performs the matrix-based intra prediction according to an operation including a weight matrix that is changed according to the coefficient and the shift amount that are set to fixed values.
  • 7. The image processing device according to claim 1, wherein the coefficient is a value represented by a sum of powers of two.
  • 8. An image processing method comprising: an intra prediction step of, when performing matrix-based intra prediction that is intra prediction using a matrix operation on a current prediction block to be encoded, performing the matrix-based intra prediction using a coefficient related to a sum of change amounts of pixel values and set to a fixed value, and generating a predicted image of the current prediction block; and an encoding step of encoding the current prediction block using the predicted image generated in the intra prediction step.
  • 9. An image processing device comprising: an intra prediction unit that, when performing matrix-based intra prediction that is intra prediction using a matrix operation on a current prediction block to be decoded, performs the matrix-based intra prediction using a coefficient related to a sum of change amounts of pixel values and set to a fixed value, and generates a predicted image of the current prediction block; and a decoding unit that decodes the current prediction block using the predicted image generated by the intra prediction unit.
  • 10. An image processing method comprising: an intra prediction step of, when performing matrix-based intra prediction that is intra prediction using a matrix operation on a current prediction block to be decoded, performing the matrix-based intra prediction using a coefficient related to a sum of change amounts of pixel values and set to a fixed value, and generating a predicted image of the current prediction block; and a decoding step of decoding the current prediction block using the predicted image generated in the intra prediction step.
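As a purely illustrative sketch (not the normative VVC text), the matrix-based intra prediction operation with the coefficient and shift amount set to fixed values, as recited in the claims above, can be pictured as a fixed-point multiply-accumulate followed by rounding with a fixed offset of 32 (a power of two, equal to 1 << (6 − 1)) and a fixed right shift of six. The weight matrix, boundary samples, and function name below are hypothetical; only the rounding structure reflects the claimed fixed values.

```python
# Hypothetical sketch of the MIP core operation with the offset
# coefficient and shift amount fixed, per the claims:
# fixed coefficient 32 (a power of two) and fixed shift amount 6.
# The weights and boundary samples are illustrative only.

F_O = 32   # fixed offset coefficient (claim 3)
S_W = 6    # fixed shift amount (claim 5); note F_O == 1 << (S_W - 1)

def mip_predict(weights, boundary):
    """Return one predicted sample per weight-matrix row.

    Each sample is (sum(w * p) + F_O) >> S_W: a fixed-point
    multiply-accumulate over the boundary samples, rounded with the
    fixed offset 32 and right-shifted by the fixed amount 6, so no
    per-mode or per-matrix-size offset lookup is needed.
    """
    preds = []
    for row in weights:
        acc = sum(w * p for w, p in zip(row, boundary))
        preds.append((acc + F_O) >> S_W)
    return preds

# Example with a tiny hypothetical 3x4 weight matrix and 4 boundary samples.
weights = [
    [64, 0, 0, 0],     # copies the first boundary sample
    [0, 64, 0, 0],     # copies the second boundary sample
    [16, 16, 16, 16],  # averages all four boundary samples
]
boundary = [100, 120, 130, 110]
print(mip_predict(weights, boundary))  # [100, 120, 115]
```

Because both the offset and the shift are constants, the rounding term never has to be recomputed per combination of matrix size and mode number, which is the simplification the claims are directed to.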
PCT Information
Filing Document Filing Date Country Kind
PCT/JP2020/047393 12/18/2020 WO
Provisional Applications (1)
Number Date Country
62951177 Dec 2019 US