The present application relates to the video compression and coding/decoding field, and in particular, to a chrominance prediction method and apparatus.
A digital video capability can be incorporated into various apparatuses, including a digital television set, a digital live broadcasting system, a radio broadcasting system, a personal digital assistant (PDA), a laptop or desktop computer, a tablet computer, an ebook reader, a digital camera, a digital recording apparatus, a digital media player, a video game apparatus, a video game console, a cellular or satellite radio telephone, a video conferencing apparatus, a video stream apparatus, and the like. A digital video apparatus implements video compression technologies, such as video compression technologies described in standards defined in Moving Picture Experts Group (MPEG)-2 (MPEG-2), MPEG-4, International Telecommunication Unit-Telecommunication Standardization Sector (ITU-TH) ITU-TH.263, Advanced Video Coding (AVC) in ITU-TH.264/MPEG-4 Part 10, and the High Efficiency Video Coding (HEVC) standard in ITU-TH.265, and extensions of the standards, to transmit and receive digital video information more efficiently. The video apparatus may transmit, receive, encode, decode, and/or store the digital video information more efficiently by implementing these video coding/decoding technologies.
In a television system, luminance-chrominance-chrominance (YUV) color encoding is mostly used for video compression and coding, and is a standard widely used in European television systems. A YUV color space includes one luminance signal Y and two color difference signals U and V, and the three components are independent of each other. A representation manner in which YUV color modes are separated from each other is more flexible, occupies less bandwidth for transmission, and has advantages over a conventional red-green-blue (RGB) color model. For example, a YUV 4:2:0 form indicates that each of two chrominance components U and V is only half of a luminance component Y both in a horizontal direction and in a vertical direction. There are four luminance components Y but only one chrominance component U and one chrominance component V in four sampled pixels. When indication is performed in this way, a data volume is further reduced. Video is compressed in this chrominance sampling manner using characteristics of physiological vision of human eyes.
Predictive coding is a commonly used technical means in a video compression technology, and data information of a previously encoded frame is used to predict a frame that is currently to be encoded. A predicted value is obtained through prediction. The predicted value is not completely equal to an actual value, and there is a residual value between the predicted value and the actual value. If prediction is more proper, the predicted value is closer to the actual value, and the residual value is smaller. In this way, a data volume can be greatly reduced by encoding the residual value. An initial image is restored and reconstructed by adding the predicted value and the residual value when decoding is performed on a decoder side. This is a basic idea and method for predictive coding. In mainstream coding standards, predictive coding is classified into two basic types, intra-frame prediction and inter-frame prediction. In intra-frame prediction, luminance prediction and chrominance prediction need to be performed on a luminance component and a chrominance component of a video sequence.
Embodiments of the present application provide a chrominance prediction method and apparatus. Based on a reconstructed luminance unit, an improved candidate chrominance prediction mode set is constructed, and a chrominance prediction mode is selected by performing encoding using an appropriate mode, thereby improving encoding efficiency.
A first aspect of the embodiments of the present application provides a chrominance prediction method, where a to-be-processed chrominance unit corresponds to one luminance processing unit, the luminance processing unit and the to-be-processed chrominance unit are respectively processing units of a luminance component and a chrominance component of a same image area, the luminance processing unit corresponds to one or more reconstructed luminance units, and the method includes determining one or more target luminance prediction modes of the luminance processing unit from preset candidate luminance prediction modes, where a difference between a predicted luminance value, of the luminance processing unit, corresponding to any target luminance prediction mode and a reconstructed luminance value of the luminance processing unit is less than or equal to a difference between a predicted luminance value, of the luminance processing unit, corresponding to each candidate luminance prediction mode that is not determined as the target luminance prediction mode and the reconstructed luminance value of the luminance processing unit, and obtaining a predicted chrominance value of the to-be-processed chrominance unit, where a candidate chrominance prediction mode set of the to-be-processed chrominance unit includes the target luminance prediction mode.
A beneficial effect of this embodiment of the present application is as follows. Based on a reconstructed luminance unit, an improved candidate chrominance prediction mode set is constructed, and a chrominance prediction mode is selected by performing encoding using an appropriate mode, thereby improving encoding efficiency.
According to the method in the first aspect, in a first feasible implementation of the first aspect, the determining one or more target luminance prediction modes of the luminance processing unit from preset candidate luminance prediction modes includes determining a candidate luminance prediction mode subset from the preset candidate luminance prediction modes, selecting an initial prediction mode from the candidate luminance prediction mode subset, and when the initial prediction mode does not meet a preset condition, updating the preset candidate luminance prediction modes based on the initial prediction mode, redetermining the candidate luminance prediction mode subset from the updated preset candidate luminance prediction modes, and reselecting the initial prediction mode from the redetermined candidate luminance prediction mode subset, until the reselected initial prediction mode meets the preset condition, or when the initial prediction mode meets a preset condition, using the initial prediction mode as the target luminance prediction mode.
A beneficial effect of this embodiment of the present application is as follows. Through iterative searches, an operation amount needed for finding an optimal prediction mode is reduced, processing efficiency is improved, and a processing time is reduced.
According to the method in the first feasible implementation of the first aspect, in a second feasible implementation of the first aspect, the preset candidate luminance prediction modes include at least one of a prediction mode included in directional prediction modes and a prediction mode included in non-directional prediction modes, the directional prediction modes include a prediction mode that is at an angle of N degrees with a horizontal direction of a two-dimensional plane, N is a non-negative number less than 360, the non-directional prediction modes include a direct current (DC) prediction mode and a planar prediction mode, and the determining a candidate luminance prediction mode subset from the preset candidate luminance prediction modes includes determining that the candidate luminance prediction mode subset is the same as the preset candidate luminance prediction modes, or determining that the candidate luminance prediction mode subset includes prediction modes selected from the directional prediction modes at a preset angle interval, or determining that the candidate luminance prediction mode subset includes prediction modes selected from the directional prediction modes at a preset angle interval and the non-directional prediction mode.
According to the method in the first or the second feasible implementation of the first aspect, in a third feasible implementation of the first aspect, the selecting an initial prediction mode from the candidate luminance prediction mode subset includes, when the candidate luminance prediction mode subset includes only one candidate luminance prediction mode, determining that the candidate luminance prediction mode is the initial prediction mode, or when the candidate luminance prediction mode subset includes at least two candidate luminance prediction modes, calculating a difference between the reconstructed luminance value and each of candidate predicted luminance values corresponding to the candidate luminance prediction modes in the candidate luminance prediction mode subset, and determining the initial prediction mode based on the difference, where a difference between a predicted luminance value, of the luminance processing unit, corresponding to any initial prediction mode and the reconstructed luminance value of the luminance processing unit is less than or equal to a difference between a predicted luminance value, of the luminance processing unit, corresponding to each candidate luminance prediction mode that is in the candidate luminance prediction mode subset and that is not determined as the initial prediction mode and the reconstructed luminance value of the luminance processing unit.
According to the method in the third feasible implementation of the first aspect, in a fourth feasible implementation of the first aspect, the candidate predicted luminance value is a candidate predicted luminance value matrix, the reconstructed luminance value is a reconstructed luminance value matrix, and the calculating a difference between the reconstructed luminance value and each of candidate predicted luminance values corresponding to the candidate luminance prediction modes in the candidate luminance prediction mode subset includes separately calculating a difference between an element at a corresponding location in the candidate predicted luminance value matrix and an element at a corresponding location in the reconstructed luminance value matrix, to obtain a difference matrix, and determining the difference based on the difference matrix.
According to the method in the fourth feasible implementation of the first aspect, in a fifth feasible implementation of the first aspect, the determining the difference based on the difference matrix includes accumulating absolute values of all elements in the difference matrix as the difference, or transforming the difference matrix to obtain a transformed difference matrix, and accumulating absolute values of all elements in the transformed difference matrix as the difference, or sequentially transforming, quantizing, dequantizing, and inversely transforming the difference matrix to obtain a reconstructed difference matrix, and accumulating absolute values of all elements in the reconstructed difference matrix as the difference.
A beneficial effect of this embodiment of the present application is that different calculation manners may be selected based on different complexity and performance requirements, and the different calculation manners are suitable for different application scenarios.
According to the method in the fifth feasible implementation of the first aspect, in a sixth feasible implementation of the first aspect, the transform includes Hadamard transform, Haar transform, discrete cosine transform (DCT), or discrete sine transform (DST), and correspondingly, the inverse transform includes inverse Hadamard transform corresponding to the Hadamard transform, inverse Haar transform corresponding to the Haar transform, inverse discrete cosine transform corresponding to the discrete cosine transform, or inverse discrete sine transform corresponding to the discrete sine transform.
A beneficial effect of this embodiment of the present application is that different transform manners may be selected based on different complexity and performance requirements, and the different transform manners are suitable for different application scenarios.
According to the method in any one of the first to the sixth feasible implementations of the first aspect, in a seventh feasible implementation of the first aspect, the redetermining the candidate luminance prediction mode subset from the updated preset candidate luminance prediction modes includes determining that the redetermined candidate luminance prediction mode subset includes the initial prediction mode and prediction modes that have a preset angle difference from the initial prediction mode.
According to the method in the seventh feasible implementation of the first aspect, in an eighth feasible implementation of the first aspect, the prediction modes that have the preset angle difference from the initial prediction mode include M prediction modes that are adjacent to the initial prediction mode, where M is a positive number.
According to the method in any one of the first to the eighth feasible implementations of the first aspect, in a ninth feasible implementation of the first aspect, the preset condition includes the initial prediction mode is the non-directional prediction mode, or each prediction mode that has the preset angle difference from the initial prediction mode exists in the candidate luminance prediction mode subset that is determined from the preset candidate luminance prediction modes or that is redetermined from the updated preset candidate luminance prediction modes, or a quantity of reselection times of the initial prediction mode reaches a preset quantity of times, or a difference corresponding to the initial prediction mode is less than a preset threshold.
A beneficial effect of this embodiment of the present application is that different iteration termination conditions may be selected based on different complexity and performance requirements, and the different iteration termination conditions are suitable for different application scenarios.
According to the method in any one of the first aspect, or the first to the ninth feasible implementations of the first aspect, in a tenth feasible implementation of the first aspect, the obtaining a predicted chrominance value of the to-be-processed chrominance unit includes determining a candidate chrominance prediction mode set of the to-be-processed chrominance unit based on the target luminance prediction mode, selecting a chrominance prediction mode of the to-be-processed chrominance unit from the candidate chrominance prediction mode set, and determining the predicted chrominance value of the to-be-processed chrominance unit based on the chrominance prediction mode.
According to the method in the tenth feasible implementation of the first aspect, in an eleventh feasible implementation of the first aspect, the determining a candidate chrominance prediction mode set of the to-be-processed chrominance unit based on the target luminance prediction mode includes determining that the candidate chrominance prediction mode set includes only the target luminance prediction mode.
A beneficial effect of this embodiment of the present application is that the candidate chrominance prediction mode includes only the target luminance prediction mode, thereby reducing a code rate of an encoding mode.
According to the method in the tenth feasible implementation of the first aspect, in a twelfth feasible implementation of the first aspect, the determining a candidate chrominance prediction mode set of the to-be-processed chrominance unit based on the target luminance prediction mode includes determining that the candidate chrominance prediction mode set includes the target luminance prediction mode and one or more preset candidate chrominance prediction modes.
A beneficial effect of this embodiment of the present application is that another preset prediction mode is added to the candidate chrominance prediction modes, thereby avoiding impact exerted on encoding performance when the target luminance prediction mode is misjudged.
According to the method in the twelfth feasible implementation of the first aspect, in a thirteenth feasible implementation of the first aspect, the preset candidate chrominance prediction modes include at least one of a horizontal prediction mode, a vertical prediction mode, the direct current prediction mode, the planar prediction mode, and a direct prediction mode (DM).
According to the method in the twelfth or the thirteenth feasible implementation of the first aspect, in a fourteenth feasible implementation of the first aspect, the preset candidate chrominance prediction modes further include the directional prediction mode in a non-horizontal or non-vertical direction, or a linear prediction mode (LM).
According to the method in any one of the tenth to the fourteenth feasible implementations of the first aspect, in a fifteenth feasible implementation of the first aspect, after the determining a candidate chrominance prediction mode set of the to-be-processed chrominance unit, the method further includes determining a codeword of a candidate chrominance prediction mode in the candidate chrominance prediction mode set.
A beneficial effect of this embodiment of the present application is that the codeword of the candidate chrominance prediction mode is adjusted based on a probability that each candidate chrominance prediction mode in the candidate chrominance prediction mode set is selected, so that encoding performance can be further improved.
According to the method in the fifteenth feasible implementation of the first aspect, in a sixteenth feasible implementation of the first aspect, the determining a codeword of a candidate chrominance prediction mode in the candidate chrominance prediction mode set includes determining the codeword of the candidate chrominance prediction mode in the candidate chrominance prediction mode set using a variable-length code.
According to the method in the sixteenth feasible implementation of the first aspect, in a seventeenth feasible implementation of the first aspect, a prediction mode corresponding to a candidate predicted luminance value that is of the luminance processing unit and that has a smallest difference from the reconstructed luminance value of the luminance processing unit is used as a first target luminance prediction mode, and the determining the codeword of the candidate chrominance prediction mode in the candidate chrominance prediction mode set using a variable-length code includes, when the candidate chrominance prediction mode set includes the first target luminance prediction mode, the linear prediction mode, and the direct prediction mode, assigning a smallest codeword to the linear prediction mode, assigning, to the first target luminance prediction mode, a smallest codeword other than the codeword used to represent the linear prediction mode, and assigning, to the direct prediction mode, a smallest codeword other than the codewords used to represent the linear prediction mode and the first target luminance prediction mode, or when the candidate chrominance prediction mode set includes the first target luminance prediction mode, the linear prediction mode, and the direct prediction mode, assigning a smallest codeword to the first target luminance prediction mode, assigning, to the linear prediction mode, a smallest codeword other than the codeword used to represent the first target luminance prediction mode, and assigning, to the direct prediction mode, a smallest codeword other than the codewords used to represent the linear prediction mode and the first target luminance prediction mode, or when the candidate chrominance prediction mode set includes the first target luminance prediction mode and the direct prediction mode, assigning a smallest codeword to the first target luminance prediction mode, and assigning, to the direct prediction mode, a smallest codeword other than the codeword used to represent the first target luminance prediction mode.
A beneficial effect of this embodiment of the present application is that a relatively short codeword is assigned to a candidate chrominance prediction mode with a high probability of being selected from candidate chrominance prediction mode set is selected, so that encoding performance can be further improved.
According to the method in the sixteenth or the seventeenth feasible implementation of the first aspect, in an eighteenth feasible implementation of the first aspect, the determining the codeword of the candidate chrominance prediction mode in the candidate chrominance prediction mode set using a variable-length code further includes determining a length of the variable-length code based on a quantity of the candidate chrominance prediction modes, and when the quantity of the candidate chrominance prediction modes changes, increasing or decreasing the length of the variable-length code by one or more bits.
According to the method in any one of the first aspect, or the first to the eighteenth feasible implementations of the first aspect, in a nineteenth feasible implementation of the first aspect, before the determining one or more target luminance prediction modes of the luminance processing unit from preset candidate luminance prediction modes, the method further includes determining that the preset candidate luminance prediction modes include a candidate luminance prediction mode set, where the candidate luminance prediction mode set includes the directional prediction mode and the non-directional prediction mode, or determining that the preset candidate luminance prediction modes include luminance prediction modes of the one or more reconstructed luminance units corresponding to the luminance processing unit, or determining that the preset candidate luminance prediction modes include chrominance prediction modes of reconstructed chrominance units in a neighborhood of the to-be-processed chrominance unit.
A beneficial effect of this embodiment of the present application is that prediction modes with relatively strong correlation are selected as the candidate luminance prediction modes to participate in searches, thereby improving search efficiency and increasing a search speed.
According to the method in the nineteenth feasible implementation of the first aspect, in a twentieth feasible implementation of the first aspect, the determining that the preset candidate luminance prediction modes include luminance prediction modes of the one or more reconstructed luminance units corresponding to the luminance processing unit further includes determining that the preset candidate luminance prediction modes include luminance prediction modes that are correlated to the luminance prediction modes of the one or more reconstructed luminance units.
According to the method in the twentieth feasible implementation of the first aspect, in a twenty-first feasible implementation of the first aspect, for the luminance prediction modes that are correlated to the luminance prediction modes of the one or more reconstructed luminance units, when the luminance prediction mode is the directional prediction mode, the correlated luminance prediction modes include P prediction modes adjacent to the luminance prediction modes of the one or more reconstructed luminance units, where P is a positive number, or when the luminance prediction mode is the directional prediction mode, the correlated luminance prediction modes include Q prediction modes adjacent to the luminance prediction modes of the one or more reconstructed luminance units and include the non-directional prediction mode, where Q is a positive number, or when the luminance prediction mode is the non-directional prediction mode, the correlated luminance prediction modes include the preset directional prediction mode.
According to the method in any one of the nineteenth to the twenty-first feasible implementations of the first aspect, in a twenty-second feasible implementation of the first aspect, the chrominance prediction modes of the reconstructed chrominance units in the neighborhood of the to-be-processed chrominance unit include chrominance prediction modes of reconstructed chrominance units that are adjacent to the top, the left, the upper left, the upper right, and the lower left of the to-be-processed chrominance unit.
According to the method in any one of the first aspect, or the first to the twenty-second feasible implementations of the first aspect, in a twenty-third feasible implementation of the first aspect, before the determining one or more target luminance prediction modes of the luminance processing unit from preset candidate luminance prediction modes, the method further includes performing downsampling on the luminance processing unit, and correspondingly, the determining one or more target luminance prediction modes of the luminance processing unit from preset candidate luminance prediction modes includes determining one or more target luminance prediction modes of the downsampled luminance processing unit from the preset candidate luminance prediction modes.
A beneficial effect of this embodiment of the present application is that because downsampling is performed on the luminance processing unit, operation complexity is reduced, and the downsampled luminance processing unit can more closely reflect a prediction direction of a chrominance unit.
According to the method in the twenty-third feasible implementation of the first aspect, in a twenty-fourth feasible implementation of the first aspect, the performing downsampling on the luminance processing unit includes performing downsampling on the luminance processing unit using a filter whose filtering coefficient is
or performing downsampling on the luminance processing unit using a filter whose filtering coefficient is
or performing downsampling on the luminance processing unit using a filter whose filtering coefficient is
or performing downsampling on the luminance processing unit using a filter whose filtering coefficient is
A beneficial effect of this embodiment of the present application is that different filters may be selected based on different complexity and performance requirements, and the different filters are suitable for different application scenarios.
According to the method in any one of the tenth to the twenty-fourth feasible implementations of the first aspect, in a twenty-fifth feasible implementation of the first aspect, the method is used to encode the to-be-processed chrominance unit, and the selecting a chrominance prediction mode of the to-be-processed chrominance unit from the candidate chrominance prediction mode set includes traversing candidate chrominance prediction modes in the candidate chrominance prediction mode set to obtain corresponding candidate predicted chrominance values, calculating encoding costs of each candidate chrominance prediction mode based on an original value of the to-be-processed chrominance unit and the candidate predicted chrominance values obtained through the traversing, determining a candidate chrominance prediction mode with smallest encoding costs as the chrominance prediction mode of the to-be-processed chrominance unit, and encoding an index of the chrominance prediction mode in the candidate chrominance prediction mode set.
According to the method in the twenty-fifth feasible implementation of the first aspect, in a twenty-sixth feasible implementation of the first aspect, the encoding an index of the chrominance prediction mode in the candidate chrominance prediction mode set includes encoding the index based on the determined codeword of the candidate chrominance prediction mode in the candidate chrominance prediction mode set.
According to the method in any one of the tenth to the twenty-fourth feasible implementations of the first aspect, in a twenty-seventh feasible implementation of the first aspect, the method is used to decode the to-be-processed chrominance unit, and the selecting a chrominance prediction mode of the to-be-processed chrominance unit from the candidate chrominance prediction mode set includes decoding an index of the chrominance prediction mode in the candidate chrominance prediction mode set from a bitstream, and determining the chrominance prediction mode from the candidate chrominance prediction mode set based on the index.
According to the method in the twenty-seventh feasible implementation of the first aspect, in a twenty-eighth feasible implementation of the first aspect, the decoding an index of the chrominance prediction mode in the candidate chrominance prediction mode set from a bitstream includes decoding the index based on the determined codeword of the candidate chrominance prediction mode in the candidate chrominance prediction mode set.
A beneficial effect of this embodiment of the present application is as follows. Based on a reconstructed luminance unit, an improved candidate chrominance prediction mode set is constructed, and a chrominance prediction mode is selected by performing encoding using an appropriate mode, thereby improving encoding efficiency.
A second aspect of the embodiments of the present application provides a chrominance prediction apparatus, where a to-be-processed chrominance unit corresponds to one luminance processing unit, the luminance processing unit and the to-be-processed chrominance unit are respectively processing units of a luminance component and a chrominance component of a same image area, the luminance processing unit corresponds to one or more reconstructed luminance units, and the apparatus includes a first determining module, configured to determine one or more target luminance prediction modes of the luminance processing unit from preset candidate luminance prediction modes, where a difference between a predicted luminance value, of the luminance processing unit, corresponding to any target luminance prediction mode and a reconstructed luminance value of the luminance processing unit is less than or equal to a difference between a predicted luminance value, of the luminance processing unit, corresponding to each candidate luminance prediction mode that is not determined as the target luminance prediction mode and the reconstructed luminance value of the luminance processing unit, and a first construction module, configured to obtain a predicted chrominance value of the to-be-processed chrominance unit, where a candidate chrominance prediction mode set of the to-be-processed chrominance unit includes the target luminance prediction mode.
According to the apparatus in the second aspect, in a first feasible implementation of the second aspect, the first determining module includes a second determining module, configured to determine a candidate luminance prediction mode subset from the preset candidate luminance prediction modes, a first selection module, configured to select an initial prediction mode from the candidate luminance prediction mode subset, and an update module, configured to, when the initial prediction mode does not meet a preset condition, update the preset candidate luminance prediction modes based on the initial prediction mode, redetermine the candidate luminance prediction mode subset from the updated preset candidate luminance prediction modes, and reselect the initial prediction mode from the redetermined candidate luminance prediction mode subset, until the reselected initial prediction mode meets the preset condition, or when the initial prediction mode meets a preset condition, use the initial prediction mode as the target luminance prediction mode.
According to the apparatus in the first feasible implementation of the second aspect, in a second feasible implementation of the second aspect, the preset candidate luminance prediction modes include at least one of a prediction mode included in directional prediction modes and a prediction mode included in non-directional prediction modes, the directional prediction modes include a prediction mode that is at an angle of N degrees with a horizontal direction of a two-dimensional plane, N is a non-negative number less than 360, the non-directional prediction modes include a direct current prediction mode and a planar prediction mode, and the second determining module is configured to determine that the candidate luminance prediction mode subset is the same as the preset candidate luminance prediction modes, or determine that the candidate luminance prediction mode subset includes prediction modes selected from the directional prediction modes at a preset angle interval, or determine that the candidate luminance prediction mode subset includes prediction modes selected from the directional prediction modes at a preset angle interval and the non-directional prediction mode.
According to the apparatus in the first or the second feasible implementation of the second aspect, in a third feasible implementation of the second aspect, when the candidate luminance prediction mode subset includes only one candidate luminance prediction mode, the first selection module is configured to determine that the candidate luminance prediction mode is the initial prediction mode, or when the candidate luminance prediction mode subset includes at least two candidate luminance prediction modes, the first selection module includes a first calculation module, configured to calculate a difference between the reconstructed luminance value and each of candidate predicted luminance values corresponding to the candidate luminance prediction modes in the candidate luminance prediction mode subset, and a comparison module, configured to determine the initial prediction mode based on the difference, where a difference between a predicted luminance value, of the luminance processing unit, corresponding to any initial prediction mode and the reconstructed luminance value of the luminance processing unit is less than or equal to a difference between a predicted luminance value, of the luminance processing unit, corresponding to each candidate luminance prediction mode that is in the candidate luminance prediction mode subset and that is not determined as the initial prediction mode and the reconstructed luminance value of the luminance processing unit.
According to the apparatus in the third feasible implementation of the second aspect, in a fourth feasible implementation of the second aspect, the candidate predicted luminance value is a candidate predicted luminance value matrix, the reconstructed luminance value is a reconstructed luminance value matrix, and the first calculation module includes a second calculation module, configured to separately calculate a difference between an element at a corresponding location in the candidate predicted luminance value matrix and an element at a corresponding location in the reconstructed luminance value matrix, to obtain a difference matrix, and a third determining module, configured to determine the difference based on the difference matrix.
According to the apparatus in the third feasible implementation of the second aspect, in a fifth feasible implementation of the second aspect, the third determining module is configured to accumulate absolute values of all elements in the difference matrix as the difference, or transform the difference matrix to obtain a transformed difference matrix, and accumulate absolute values of all elements in the transformed difference matrix as the difference, or sequentially transform, quantize, dequantize, and inversely transform the difference matrix to obtain a reconstructed difference matrix, and accumulate absolute values of all elements in the reconstructed difference matrix as the difference.
According to the apparatus in the fifth feasible implementation of the second aspect, in a sixth feasible implementation of the second aspect, the transform includes Hadamard transform, Haar transform, discrete cosine transform, or discrete sine transform, and correspondingly, the inverse transform includes inverse Hadamard transform corresponding to the Hadamard transform, inverse Haar transform corresponding to the Haar transform, inverse discrete cosine transform corresponding to the discrete cosine transform, or inverse discrete sine transform corresponding to the discrete sine transform.
According to the apparatus in any one of the first to the sixth feasible implementations of the second aspect, in a seventh feasible implementation of the second aspect, the update module is configured to determine that the updated candidate luminance prediction mode subset includes the initial prediction mode and prediction modes that have a preset angle difference from the initial prediction mode.
According to the apparatus in the seventh feasible implementation of the second aspect, in an eighth feasible implementation of the second aspect, the prediction modes that have the preset angle difference from the initial prediction mode include M prediction modes that are adjacent to the initial prediction mode, where M is a positive number.
According to the apparatus in any one of the second aspect, or the first to the eighth feasible implementations of the second aspect, in a ninth feasible implementation of the second aspect, the preset condition includes the initial prediction mode is the non-directional prediction mode, or each prediction mode that has the preset angle difference from the initial prediction mode exists in the candidate luminance prediction mode subset that is determined from the preset candidate luminance prediction modes or that is redetermined from the updated preset candidate luminance prediction modes, or a quantity of reselection times of the initial prediction mode reaches a preset quantity of times, or a difference corresponding to the initial prediction mode is less than a preset threshold.
According to the apparatus in any one of the second aspect, or the first to the ninth feasible implementations of the second aspect, in a tenth feasible implementation of the second aspect, the first construction module includes a fourth determining module, configured to determine a candidate chrominance prediction mode set of the to-be-processed chrominance unit based on the target luminance prediction mode, a second selection module, configured to select a chrominance prediction mode of the to-be-processed chrominance unit from the candidate chrominance prediction mode set, and a fifth determining module, configured to determine the predicted chrominance value of the to-be-processed chrominance unit based on the chrominance prediction mode.
According to the apparatus in the tenth feasible implementation of the second aspect, in an eleventh feasible implementation of the second aspect, the fourth determining module is configured to determine that the candidate chrominance prediction mode set includes only the target luminance prediction mode.
According to the apparatus in the tenth feasible implementation of the second aspect, in a twelfth feasible implementation of the second aspect, the fourth determining module is configured to determine that the candidate chrominance prediction mode set includes the target luminance prediction mode and one or more preset candidate chrominance prediction modes.
According to the apparatus in the twelfth feasible implementation of the second aspect, in a thirteenth feasible implementation of the second aspect, the preset candidate chrominance prediction modes include at least one of a horizontal prediction mode, a vertical prediction mode, the direct current prediction mode, the planar prediction mode, and a direct prediction mode.
According to the apparatus in the twelfth or the thirteenth feasible implementation of the second aspect, in a fourteenth feasible implementation of the second aspect, the preset candidate chrominance prediction modes further include the directional prediction mode in a non-horizontal or non-vertical direction, or a linear prediction mode.
According to the apparatus in any one of the tenth to the fourteenth feasible implementations of the second aspect, in a fifteenth feasible implementation of the second aspect, the first construction module further includes a sixth determining module, configured to determine a codeword of a candidate chrominance prediction mode in the candidate chrominance prediction mode set.
According to the apparatus in the fifteenth feasible implementation of the second aspect, in a sixteenth feasible implementation of the second aspect, the sixth determining module is configured to determine the codeword of the candidate chrominance prediction mode in the candidate chrominance prediction mode set using a variable-length code.
According to the apparatus in the sixteenth feasible implementation of the second aspect, in a seventeenth feasible implementation of the second aspect, a prediction mode corresponding to a candidate predicted luminance value that is of the luminance processing unit and that has a smallest difference from the reconstructed luminance value of the luminance processing unit is used as a first target luminance prediction mode, and the sixth determining module is configured to, when the candidate chrominance prediction mode set includes the first target luminance prediction mode, the linear prediction mode, and the direct prediction mode, assign a smallest codeword to the linear prediction mode, assign, to the first target luminance prediction mode, a smallest codeword other than the codeword used to represent the linear prediction mode, and assign, to the direct prediction mode, a smallest codeword other than the codewords used to represent the linear prediction mode and the first target luminance prediction mode, or when the candidate chrominance prediction mode set includes the first target luminance prediction mode, the linear prediction mode, and the direct prediction mode, assign a smallest codeword to the first target luminance prediction mode, assign, to the linear prediction mode, a smallest codeword other than the codeword used to represent the first target luminance prediction mode, and assign, to the direct prediction mode, a smallest codeword other than the codewords used to represent the linear prediction mode and the first target luminance prediction mode, or when the candidate chrominance prediction mode set includes the first target luminance prediction mode and the direct prediction mode, assign a smallest codeword to the first target luminance prediction mode, and assign, to the direct prediction mode, a smallest codeword other than the codeword used to represent the first target luminance prediction mode.
According to the apparatus in the sixteenth or the seventeenth feasible implementation of the second aspect, in an eighteenth feasible implementation of the second aspect, the sixth determining module further includes a seventh determining module, configured to determine a length of the variable-length code based on a quantity of the candidate chrominance prediction modes, and an operation module, configured to, when the quantity of the candidate chrominance prediction modes changes, increase or decrease the length of the variable-length code by one or more bits.
According to the apparatus in any one of the second aspect, or the first to the eighteenth feasible implementations of the second aspect, in a nineteenth feasible implementation of the second aspect, the apparatus further includes an eighth determining module, configured to determine that the preset candidate luminance prediction modes include a candidate luminance prediction mode set, where the candidate luminance prediction mode set includes the directional prediction mode and the non-directional prediction mode, or determine that the preset candidate luminance prediction modes include luminance prediction modes of the one or more reconstructed luminance units corresponding to the luminance processing unit, or determine that the preset candidate luminance prediction modes include chrominance prediction modes of reconstructed chrominance units in a neighborhood of the to-be-processed chrominance unit.
According to the apparatus in the nineteenth feasible implementation of the second aspect, in a twentieth feasible implementation of the second aspect, the eighth determining module is configured to determine that the preset candidate luminance prediction modes include luminance prediction modes that are correlated to the luminance prediction modes of the one or more reconstructed luminance units.
According to the apparatus in the twentieth feasible implementation of the second aspect, in a twenty-first feasible implementation of the second aspect, for the luminance prediction modes that are correlated to the luminance prediction modes of the one or more reconstructed luminance units, when the luminance prediction mode is the directional prediction mode, the correlated luminance prediction modes include P prediction modes adjacent to the luminance prediction modes of the one or more reconstructed luminance units, where P is a positive number, or when the luminance prediction mode is the directional prediction mode, the correlated luminance prediction modes include Q prediction modes adjacent to the luminance prediction modes of the one or more reconstructed luminance units and include the non-directional prediction mode, where Q is a positive number, or when the luminance prediction mode is the non-directional prediction mode, the correlated luminance prediction modes include the preset directional prediction mode.
According to the apparatus in any one of the nineteenth to the twenty-first feasible implementations of the second aspect, in a twenty-second feasible implementation of the second aspect, the chrominance prediction modes of the reconstructed chrominance units in the neighborhood of the to-be-processed chrominance unit include chrominance prediction modes of reconstructed chrominance units that are adjacent to the top, the left, the upper left, the upper right, and the lower left of the to-be-processed chrominance unit.
According to the apparatus in any one of the second aspect, or the first to the twenty-second feasible implementations of the second aspect, in a twenty-third feasible implementation of the second aspect, the apparatus further includes a downsampling module, configured to perform downsampling on the luminance processing unit, and correspondingly, the first determining module is configured to determine one or more target luminance prediction modes of the downsampled luminance processing unit from the preset candidate luminance prediction modes.
According to the apparatus in the twenty-third feasible implementation of the second aspect, in a twenty-fourth feasible implementation of the second aspect, the downsampling module is configured to perform downsampling on the luminance processing unit using a filter whose filtering coefficient is
or perform downsampling on the luminance processing unit using a filter whose filtering coefficient is
or perform downsampling on the luminance processing unit using a filter whose filtering coefficient is
or perform downsampling on the luminance processing unit using a filter whose filtering coefficient is
According to the apparatus in any one of the tenth to the twenty-fourth feasible implementations of the second aspect, in a twenty-fifth feasible implementation of the second aspect, the apparatus is configured to encode the to-be-processed chrominance unit, and the second selection module includes a ninth determining module, configured to traverse candidate chrominance prediction modes in the candidate chrominance prediction mode set to obtain corresponding candidate predicted chrominance values, a third calculation module, configured to calculate encoding costs of each candidate chrominance prediction mode based on an original value of the to-be-processed chrominance unit and the candidate predicted chrominance values obtained through the traversing, a tenth determining module, configured to determine a candidate chrominance prediction mode with smallest encoding costs as the chrominance prediction mode of the to-be-processed chrominance unit, and an encoding module, configured to encode an index of the chrominance prediction mode in the candidate chrominance prediction mode set.
According to the apparatus in the twenty-fifth feasible implementation of the second aspect, in a twenty-sixth feasible implementation of the second aspect, the encoding module is configured to encode the index based on the determined codeword of the candidate chrominance prediction mode in the candidate chrominance prediction mode set.
According to the apparatus in any one of the tenth to the twenty-fourth feasible implementations of the second aspect, in a twenty-seventh feasible implementation of the second aspect, the apparatus is configured to decode the to-be-processed chrominance unit, and the second selection module includes a decoding module, configured to decode an index of the chrominance prediction mode in the candidate chrominance prediction mode set from a bitstream, and an eleventh determining module, configured to determine the chrominance prediction mode from the candidate chrominance prediction mode set based on the index.
According to the apparatus in the twenty-seventh feasible implementation of the second aspect, in a twenty-eighth feasible implementation of the second aspect, the decoding module is configured to decode the index based on the determined codeword of the candidate chrominance prediction mode in the candidate chrominance prediction mode set.
A third aspect of the embodiments of the present application provides a computer storage medium, configured to store a computer software instruction used to implement the method according to the first aspect, where the instruction includes a program used to perform the chrominance prediction method designed in the first aspect.
A fourth aspect of the embodiments of the present application provides an apparatus, including a memory and a processor coupled to the memory, where the processor is configured to perform the chrominance prediction method designed in the first aspect.
A fifth aspect of the embodiments of the present application provides a first-chrominance prediction method, where a to-be-processed first-chrominance unit corresponds to one second-chrominance processing unit, the second-chrominance processing unit and the to-be-processed first-chrominance unit are respectively processing units of a second-chrominance component and a first-chrominance component of a same image area, the second-chrominance processing unit corresponds to one or more reconstructed second-chrominance units, and the method includes determining one or more target second-chrominance prediction modes of the second-chrominance processing unit from preset candidate second-chrominance prediction modes, where a difference between a predicted second-chrominance value, of the second-chrominance processing unit, corresponding to any one of the target second-chrominance prediction modes and a reconstructed second-chrominance value of the second-chrominance processing unit is less than or equal to a difference between a predicted second-chrominance value, of the second-chrominance processing unit, corresponding to each candidate second-chrominance prediction mode that is not determined as the target second-chrominance prediction mode and the reconstructed second-chrominance value of the second-chrominance processing unit, and obtaining a predicted first-chrominance value of the to-be-processed first-chrominance unit, where a candidate first-chrominance prediction mode set of the to-be-processed first-chrominance unit includes the target second-chrominance prediction mode.
According to the method in the fifth aspect, in a first feasible implementation of the fifth aspect, the determining one or more target second-chrominance prediction modes of the second-chrominance processing unit from preset candidate second-chrominance prediction modes includes determining a candidate second-chrominance prediction mode subset from the preset candidate second-chrominance prediction modes, selecting an initial prediction mode from the candidate second-chrominance prediction mode subset, and when the initial prediction mode does not meet a preset condition, updating the preset candidate second-chrominance prediction modes based on the initial prediction mode, redetermining the candidate second-chrominance prediction mode subset from the updated preset candidate second-chrominance prediction modes, and reselecting the initial prediction mode from the redetermined candidate second-chrominance prediction mode subset, until the reselected initial prediction mode meets the preset condition, or when the initial prediction mode meets a preset condition, using the initial prediction mode as the target second-chrominance prediction mode.
A beneficial effect of this embodiment of the present application is as follows. Through iterative searches, an operation amount needed for finding an optimal prediction mode is reduced, processing efficiency is improved, and a processing time is reduced.
According to the method in the first feasible implementation of the fifth aspect, in a second feasible implementation of the fifth aspect, the preset candidate second-chrominance prediction modes include at least one of a prediction mode included in directional prediction modes and a prediction mode included in non-directional prediction modes, the directional prediction modes include a prediction mode that is at an angle of N degrees with a horizontal direction of a two-dimensional plane, N is a non-negative number less than 360, the non-directional prediction modes include a direct current prediction mode, a planar prediction mode, and a linear prediction mode, and the determining a candidate second-chrominance prediction mode subset from the preset candidate second-chrominance prediction modes includes determining that the candidate second-chrominance prediction mode subset is the same as the preset candidate second-chrominance prediction modes, or determining that the candidate second-chrominance prediction mode subset includes prediction modes selected from the directional prediction modes at a preset angle interval, or determining that the candidate second-chrominance prediction mode subset includes prediction modes selected from the directional prediction modes at a preset angle interval and the non-directional prediction mode.
According to the method in the first or the second feasible implementation of the fifth aspect, in a third feasible implementation of the fifth aspect, the selecting an initial prediction mode from the candidate second-chrominance prediction mode subset includes, when the candidate second-chrominance prediction mode subset includes only one candidate second-chrominance prediction mode, determining that the candidate second-chrominance prediction mode is the initial prediction mode, or when the candidate second-chrominance prediction mode subset includes at least two candidate second-chrominance prediction modes, calculating a difference between the reconstructed second-chrominance value and each of candidate predicted second-chrominance values corresponding to the candidate second-chrominance prediction modes in the candidate second-chrominance prediction mode subset, and determining the initial prediction mode based on the difference, where a difference between a predicted second-chrominance value, of the second-chrominance processing unit, corresponding to any initial prediction mode and the reconstructed second-chrominance value of the second-chrominance processing unit is less than or equal to a difference between a predicted second-chrominance value, of the second-chrominance processing unit, corresponding to each candidate second-chrominance prediction mode that is in the candidate second-chrominance prediction mode subset and that is not determined as the initial prediction mode and the reconstructed second-chrominance value of the second-chrominance processing unit.
According to the method in the third feasible implementation of the fifth aspect, in a fourth feasible implementation of the fifth aspect, the candidate predicted second-chrominance value is a candidate predicted second-chrominance value matrix, the reconstructed second-chrominance value is a reconstructed second-chrominance value matrix, and the calculating a difference between the reconstructed second-chrominance value and each of candidate predicted second-chrominance values corresponding to the candidate second-chrominance prediction modes in the candidate second-chrominance prediction mode subset includes separately calculating a difference between an element at a corresponding location in the candidate predicted second-chrominance value matrix and an element at a corresponding location in the reconstructed second-chrominance value matrix, to obtain a difference matrix, and determining the difference based on the difference matrix.
According to the method in the fourth feasible implementation of the fifth aspect, in a fifth feasible implementation of the fifth aspect, the determining the difference based on the difference matrix includes accumulating absolute values of all elements in the difference matrix as the difference, or transforming the difference matrix to obtain a transformed difference matrix, and accumulating absolute values of all elements in the transformed difference matrix as the difference, or sequentially transforming, quantizing, dequantizing, and inversely transforming the difference matrix to obtain a reconstructed difference matrix, and accumulating absolute values of all elements in the reconstructed difference matrix as the difference.
A beneficial effect of this embodiment of the present application is that different calculation manners may be selected based on different complexity and performance requirements, and the different calculation manners are suitable for different application scenarios.
According to the method in the fifth feasible implementation of the fifth aspect, in a sixth feasible implementation of the fifth aspect, the transform includes Hadamard transform, Haar transform, discrete cosine transform, or discrete sine transform, and correspondingly, the inverse transform includes inverse Hadamard transform corresponding to the Hadamard transform, inverse Haar transform corresponding to the Haar transform, inverse discrete cosine transform corresponding to the discrete cosine transform, or inverse discrete sine transform corresponding to the discrete sine transform.
A beneficial effect of this embodiment of the present application is that different transform manners may be selected based on different complexity and performance requirements, and the different transform manners are suitable for different application scenarios.
According to the method in any one of the first to the sixth feasible implementations of the fifth aspect, in a seventh feasible implementation of the fifth aspect, the updating the candidate second-chrominance prediction mode subset based on the initial prediction mode includes determining that the updated candidate second-chrominance prediction mode subset includes the initial prediction mode and prediction modes that have a preset angle difference from the initial prediction mode.
According to the method in the seventh feasible implementation of the fifth aspect, in an eighth feasible implementation of the fifth aspect, the prediction modes that have the preset angle difference from the initial prediction mode include M prediction modes that are adjacent to the initial prediction mode, where M is a positive number.
According to the method in any one of the first to the eighth feasible implementations of the fifth aspect, in a ninth feasible implementation of the fifth aspect, the preset condition includes the initial prediction mode is the non-directional prediction mode, or each prediction mode that has the preset angle difference from the initial prediction mode exists in the candidate second-chrominance prediction mode subset that is determined from the preset candidate second-chrominance prediction modes or that is redetermined from the updated preset candidate second-chrominance prediction modes, or a quantity of reselection times of the initial prediction mode reaches a preset quantity of times, or a difference corresponding to the initial prediction mode is less than a preset threshold.
A beneficial effect of this embodiment of the present application is that different iteration termination conditions may be selected based on different complexity and performance requirements, and the different iteration termination conditions are suitable for different application scenarios.
According to the method in any one of the fifth aspect, or the first to the ninth feasible implementations of the fifth aspect, in a tenth feasible implementation of the fifth aspect, the obtaining a predicted first-chrominance value of the to-be-processed first-chrominance unit includes determining a candidate first-chrominance prediction mode set of the to-be-processed first-chrominance unit based on the target second-chrominance prediction mode, selecting a first-chrominance prediction mode of the to-be-processed first-chrominance unit from the candidate first-chrominance prediction mode set, and determining the predicted first-chrominance value of the to-be-processed first-chrominance unit based on the first-chrominance prediction mode.
According to the method in the tenth feasible implementation of the fifth aspect, in an eleventh feasible implementation of the fifth aspect, the determining a candidate first-chrominance prediction mode set of the to-be-processed first-chrominance unit based on the target second-chrominance prediction mode includes determining that the candidate first-chrominance prediction mode set includes only the target second-chrominance prediction mode.
A beneficial effect of this embodiment of the present application is that the candidate first-chrominance prediction mode includes only the target second-chrominance prediction mode, thereby reducing a code rate of an encoding mode.
According to the method in the tenth feasible implementation of the fifth aspect, in a twelfth feasible implementation of the fifth aspect, the determining a candidate first-chrominance prediction mode set of the to-be-processed first-chrominance unit based on the target second-chrominance prediction mode includes determining that the candidate first-chrominance prediction mode set includes the target second-chrominance prediction mode and one or more preset candidate first-chrominance prediction modes.
A beneficial effect of this embodiment of the present application is that another preset prediction mode is added to the candidate first-chrominance prediction mode, thereby avoiding impact exerted on encoding performance when the target second-chrominance prediction mode is misjudged.
According to the method in the twelfth feasible implementation of the fifth aspect, in a thirteenth feasible implementation of the fifth aspect, the preset candidate first-chrominance prediction modes include at least one of a horizontal prediction mode, a vertical prediction mode, the direct current prediction mode, the planar prediction mode, and a direct prediction mode.
According to the method in the twelfth or the thirteenth feasible implementation of the fifth aspect, in a fourteenth feasible implementation of the fifth aspect, the candidate first-chrominance prediction modes further include the directional prediction mode in a non-horizontal or non-vertical direction, or a linear prediction mode.
According to the method in any one of the tenth to the fourteenth feasible implementations of the fifth aspect, in a fifteenth feasible implementation of the fifth aspect, after the determining a candidate first-chrominance prediction mode set of the to-be-processed first-chrominance unit, the method further includes determining a codeword of a candidate first-chrominance prediction mode in the candidate first-chrominance prediction mode set.
A beneficial effect of this embodiment of the present application is that the codeword of the candidate first-chrominance prediction mode is adjusted based on a probability that each candidate first-chrominance prediction mode in the candidate first-chrominance prediction mode set is selected, so that encoding performance can be further improved.
According to the method in the fifteenth feasible implementation of the fifth aspect, in a sixteenth feasible implementation of the fifth aspect, the determining a codeword of a candidate first-chrominance prediction mode in the candidate first-chrominance prediction mode set includes determining the codeword of the candidate first-chrominance prediction mode in the candidate first-chrominance prediction mode set using a variable-length code.
According to the method in the sixteenth feasible implementation of the fifth aspect, in a seventeenth feasible implementation of the fifth aspect, a prediction mode corresponding to a candidate predicted second-chrominance value that is of the second-chrominance processing unit and that has a smallest difference from the reconstructed second-chrominance value of the second-chrominance processing unit is used as a first target second-chrominance prediction mode, and the determining the codeword of the candidate first-chrominance prediction mode in the candidate first-chrominance prediction mode set using a variable-length code includes, when the candidate first-chrominance prediction mode set includes the first target second-chrominance prediction mode, the linear prediction mode, and the direct prediction mode, assigning a smallest codeword to the linear prediction mode, assigning, to the first target second-chrominance prediction mode, a smallest codeword other than the codeword used to represent the linear prediction mode, and assigning, to the direct prediction mode, a smallest codeword other than the codewords used to represent the linear prediction mode and the first target second-chrominance prediction mode, or when the candidate first-chrominance prediction mode set includes the first target second-chrominance prediction mode, the linear prediction mode, and the direct prediction mode, assigning a smallest codeword to the first target second-chrominance prediction mode, assigning, to the linear prediction mode, a smallest codeword other than the codeword used to represent the first target second-chrominance prediction mode, and assigning, to the direct prediction mode, a smallest codeword other than the codewords used to represent the linear prediction mode and the first target second-chrominance prediction mode, or when the candidate first-chrominance prediction mode set includes the first target second-chrominance prediction mode and the direct prediction mode, assigning a smallest codeword to the first target second-chrominance prediction mode, and assigning, to the direct prediction mode, a smallest codeword other than the codeword used to represent the first target second-chrominance prediction mode.
A beneficial effect of this embodiment of the present application is that a relatively short codeword is assigned to a candidate first-chrominance prediction mode with a high probability that each candidate first-chrominance prediction mode in the candidate first-chrominance prediction mode set is selected, so that encoding performance can be further improved.
According to the method in the sixteenth or seventeenth feasible implementation of the fifth aspect, in an eighteenth feasible implementation of the fifth aspect, the determining the codeword of the candidate first-chrominance prediction mode in the candidate first-chrominance prediction mode set using a variable-length code further includes determining a length of the variable-length code based on a quantity of the candidate first-chrominance prediction modes, and when the quantity of the candidate first-chrominance prediction modes changes, increasing or decreasing the length of the variable-length code by one or more bits.
According to the method in any one of the fifth aspect, or the first to the eighteenth feasible implementations of the fifth aspect, in a nineteenth feasible implementation of the fifth aspect, before the determining one or more target second-chrominance prediction modes of the second-chrominance processing unit from preset candidate second-chrominance prediction modes, the method further includes determining that the preset candidate second-chrominance prediction modes include a candidate second-chrominance prediction mode set, where the candidate second-chrominance prediction mode set includes the directional prediction mode and the non-directional prediction mode, or determining that the preset candidate second-chrominance prediction modes include second-chrominance prediction modes of the one or more reconstructed second-chrominance units corresponding to the second-chrominance processing unit, or determining that the preset candidate second-chrominance prediction modes include first-chrominance prediction modes of reconstructed first-chrominance units in a neighborhood of the to-be-processed first-chrominance unit.
A beneficial effect of this embodiment of the present application is that prediction modes with relatively strong correlation are selected as the candidate second-chrominance prediction modes to participate in searches, thereby improving search efficiency and increasing a search speed.
According to the method in the nineteenth feasible implementation of the fifth aspect, in a twentieth feasible implementation of the fifth aspect, the determining that the preset candidate second-chrominance prediction modes include second-chrominance prediction modes of the one or more reconstructed second-chrominance units corresponding to the second-chrominance processing unit further includes determining that the preset candidate second-chrominance prediction modes include second-chrominance prediction modes that are correlated to the second-chrominance prediction modes of the one or more reconstructed second-chrominance units.
According to the method in the twentieth feasible implementation of the fifth aspect, in a twenty-first feasible implementation of the fifth aspect, for the second-chrominance prediction modes that are correlated to the second-chrominance prediction modes of the one or more reconstructed second-chrominance units, when the second-chrominance prediction mode is the directional prediction mode, the correlated second-chrominance prediction modes include P prediction modes adjacent to the second-chrominance prediction modes of the one or more reconstructed second-chrominance units, where P is a positive number, or when the second-chrominance prediction mode is the directional prediction mode, the correlated second-chrominance prediction modes include Q prediction modes adjacent to the second-chrominance prediction modes of the one or more reconstructed second-chrominance units and include the non-directional prediction mode, where Q is a positive number, or when the second-chrominance prediction mode is the non-directional prediction mode, the correlated second-chrominance prediction modes include the preset directional prediction mode.
According to the method in any one of the nineteenth to the twenty-first feasible implementations of the fifth aspect, in a twenty-second feasible implementation of the fifth aspect, the first-chrominance prediction modes of the reconstructed first-chrominance units in the neighborhood of the to-be-processed first-chrominance unit include first-chrominance prediction modes of reconstructed first-chrominance units that are adjacent to the top, the left, the upper left, the upper right, and the lower left of the to-be-processed first-chrominance unit.
According to the method in any one of the fifth aspect, or the first to the twenty-second feasible implementations of the fifth aspect, in a twenty-third feasible implementation of the fifth aspect, before the determining one or more target second-chrominance prediction modes of the second-chrominance processing unit from preset candidate second-chrominance prediction modes, the method further includes performing downsampling on the second-chrominance processing unit, and correspondingly, the determining one or more target second-chrominance prediction modes of the second-chrominance processing unit from preset candidate second-chrominance prediction modes includes determining one or more target second-chrominance prediction modes of the downsampled second-chrominance processing unit from the preset candidate second-chrominance prediction modes.
A beneficial effect of this embodiment of the present application is that because downsampling is performed on the second-chrominance processing unit, operation complexity is reduced, and the downsampled second-chrominance processing unit can more closely reflect a prediction direction of a first-chrominance unit.
According to the method in the twenty-third feasible implementation of the fifth aspect, in a twenty-fourth feasible implementation of the fifth aspect, the performing downsampling on the second-chrominance processing unit includes performing downsampling on the second-chrominance processing unit using a filter whose filtering coefficient is
or performing downsampling on the second-chrominance processing unit using a filter whose filtering coefficient is
or performing downsampling on the second-chrominance processing unit using a filter whose filtering coefficient is
or performing downsampling on the second-chrominance processing unit using a filter whose filtering coefficient is
A beneficial effect of this embodiment of the present application is that different filters may be selected based on different complexity and performance requirements, and the different filters are suitable for different application scenarios.
According to the method in any one of the tenth to the twenty-fourth feasible implementations of the fifth aspect, in a twenty-fifth feasible implementation of the fifth aspect, the method is used to encode the to-be-processed first-chrominance unit, and the selecting a first-chrominance prediction mode of the to-be-processed first-chrominance unit from the candidate first-chrominance prediction mode set includes traversing candidate first-chrominance prediction modes in the candidate first-chrominance prediction mode set to obtain corresponding candidate predicted second-chrominance values, calculating encoding costs of each candidate first-chrominance prediction mode based on an original value of the to-be-processed first-chrominance unit and the candidate predicted second-chrominance values obtained through the traversing, determining a candidate first-chrominance prediction mode with smallest encoding costs as the first-chrominance prediction mode of the to-be-processed first-chrominance unit, and encoding an index of the first-chrominance prediction mode in the candidate first-chrominance prediction mode set.
According to the method in the twenty-fifth feasible implementation of the fifth aspect, in a twenty-sixth feasible implementation of the fifth aspect, the encoding an index of the first-chrominance prediction mode in the candidate first-chrominance prediction mode set includes encoding the index based on the determined codeword of the candidate first-chrominance prediction mode in the candidate first-chrominance prediction mode set.
According to the method in any one of the tenth to the twenty-fourth feasible implementations of the fifth aspect, in a twenty-seventh feasible implementation of the fifth aspect, the method is used to decode the to-be-processed first-chrominance unit, and the selecting a first-chrominance prediction mode of the to-be-processed first-chrominance unit from the candidate first-chrominance prediction mode set includes decoding an index of the first-chrominance prediction mode in the candidate first-chrominance prediction mode set from a bitstream, and determining the first-chrominance prediction mode from the candidate first-chrominance prediction mode set based on the index.
According to the method in the twenty-seventh feasible implementation of the fifth aspect, in a twenty-eighth feasible implementation of the fifth aspect, the decoding an index of the first-chrominance prediction mode in the candidate first-chrominance prediction mode set from a bitstream includes decoding the index based on the determined codeword of the candidate first-chrominance prediction mode in the candidate first-chrominance prediction mode set.
A sixth aspect of the embodiments of the present application provides a first-chrominance prediction method, where a to-be-processed first-chrominance unit corresponds to one second-chrominance processing unit, the second-chrominance processing unit and the to-be-processed first-chrominance unit are respectively processing units of a second-chrominance component and a first-chrominance component of a same image area, the second-chrominance processing unit corresponds to one or more reconstructed second-chrominance units, and the method includes determining a candidate predicted second-chrominance value that is of the second-chrominance processing unit and that has a smallest difference from a reconstructed second-chrominance value of the second-chrominance processing unit, where any candidate predicted second-chrominance value corresponds to one candidate second-chrominance prediction mode, and obtaining a predicted first-chrominance value of the to-be-processed first-chrominance unit based on a target second-chrominance prediction mode, where the target second-chrominance prediction mode includes a candidate second-chrominance prediction mode corresponding to the candidate predicted second-chrominance value.
According to the method in the sixth aspect, in a first feasible implementation of the sixth aspect, the determining a candidate predicted second-chrominance value that is of the second-chrominance processing unit and that has a smallest difference from a reconstructed second-chrominance value of the second-chrominance processing unit includes iteratively determining an initial prediction mode from preset candidate second-chrominance prediction modes until the initial prediction mode meets a preset condition, where the initial prediction mode that meets the preset condition corresponds to the candidate predicted second-chrominance value.
According to the method in the first feasible implementation of the sixth aspect, in a second feasible implementation of the sixth aspect, the iteratively determining an initial prediction mode includes determining a candidate second-chrominance prediction mode subset from the preset candidate second-chrominance prediction modes, selecting the initial prediction mode from the candidate second-chrominance prediction mode subset, and when the initial prediction mode does not meet the preset condition, updating the preset candidate second-chrominance prediction modes based on the initial prediction mode.
According to the method in the second feasible implementation of the sixth aspect, in a third feasible implementation of the sixth aspect, the preset candidate second-chrominance prediction modes include at least one of a prediction mode included in directional prediction modes and a prediction mode included in non-directional prediction modes, the directional prediction modes include a prediction mode that is at an angle of N degrees with a horizontal direction of a two-dimensional plane, N is a non-negative number less than 360, the non-directional prediction modes include a direct current prediction mode, a planar prediction mode, and a linear prediction mode, and the determining a candidate second-chrominance prediction mode subset from the preset candidate second-chrominance prediction modes includes determining that the candidate second-chrominance prediction mode subset is the same as the preset candidate second-chrominance prediction modes, or determining that the candidate second-chrominance prediction mode subset includes prediction modes selected from the directional prediction modes at a preset angle interval, or determining that the candidate second-chrominance prediction mode subset includes prediction modes selected from the directional prediction modes at a preset angle interval and the non-directional prediction mode.
According to the method in the third feasible implementation of the sixth aspect, in a fourth feasible implementation of the sixth aspect, the selecting the initial prediction mode from the candidate second-chrominance prediction mode subset includes calculating at least two differences between the reconstructed second-chrominance value and candidate predicted second-chrominance values corresponding to at least two candidate second-chrominance prediction modes in the candidate second-chrominance prediction mode subset, and comparing the at least two differences to determine that the initial prediction mode includes a candidate second-chrominance prediction mode corresponding to a smallest difference.
According to the method in the fourth feasible implementation of the sixth aspect, in a fifth feasible implementation of the sixth aspect, the candidate predicted second-chrominance value is a candidate predicted second-chrominance value matrix, the reconstructed second-chrominance value is a reconstructed second-chrominance value matrix, and the calculating at least two differences between the reconstructed second-chrominance value and candidate predicted second-chrominance values corresponding to at least two candidate second-chrominance prediction modes in the candidate second-chrominance prediction mode subset includes separately calculating a difference between an element at a corresponding location in each of at least two candidate predicted second-chrominance value matrices and an element at a corresponding location in the reconstructed second-chrominance value matrix, to obtain difference matrices, and determining the differences based on the difference matrices.
According to the method in the fifth feasible implementation of the sixth aspect, in a sixth feasible implementation of the sixth aspect, the determining the differences based on the difference matrices includes accumulating absolute values of all elements in the difference matrices as the differences, or transforming the difference matrices to obtain transformed difference matrices, and accumulating absolute values of all elements in the transformed difference matrices as the differences, or sequentially transforming, quantizing, dequantizing, and inversely transforming the difference matrices to obtain reconstructed difference matrices, and accumulating absolute values of all elements in the reconstructed difference matrices as the differences.
According to the method in the sixth feasible implementation of the sixth aspect, in a seventh feasible implementation of the sixth aspect, the transform includes Hadamard transform, Haar transform, discrete cosine transform, or discrete sine transform, and correspondingly, the inverse transform includes inverse Hadamard transform corresponding to the Hadamard transform, inverse Haar transform corresponding to the Haar transform, inverse discrete cosine transform corresponding to the discrete cosine transform, or inverse discrete sine transform corresponding to the discrete sine transform.
According to the method in any one of the second to the seventh feasible implementations of the sixth aspect, in an eighth feasible implementation of the sixth aspect, the updating the candidate second-chrominance prediction mode subset based on the initial prediction mode includes determining that the updated candidate second-chrominance prediction mode subset includes the initial prediction mode and prediction modes that have a preset angle difference from the initial prediction mode.
According to the method in the eighth feasible implementation of the sixth aspect, in a ninth feasible implementation of the sixth aspect, the prediction modes that have the preset angle difference from the initial prediction mode include M prediction modes that are adjacent to the initial prediction mode, where M is a positive number.
According to the method in any one of the first to the ninth feasible implementations of the sixth aspect, in a tenth feasible implementation of the sixth aspect, the preset condition includes the initial prediction mode is the non-directional prediction mode, or each prediction mode that has the preset angle difference from the initial prediction mode exists in any candidate second-chrominance prediction mode subset in the iteration, or a quantity of reselection times of the initial prediction mode reaches a preset quantity of times, or a difference corresponding to the initial prediction mode is less than a preset threshold.
According to the method in any one of the sixth aspect, or the first to the tenth feasible implementations of the sixth aspect, in an eleventh feasible implementation of the sixth aspect, the obtaining a predicted first-chrominance value of the to-be-processed first-chrominance unit based on a target second-chrominance prediction mode includes using the target second-chrominance prediction mode as a first-chrominance prediction mode of the to-be-processed first-chrominance unit, and determining the predicted first-chrominance value of the to-be-processed first-chrominance unit based on the first-chrominance prediction mode.
According to the method in any one of the sixth aspect, or the first to the eleventh feasible implementations of the sixth aspect, in a twelfth feasible implementation of the sixth aspect, before the determining a candidate predicted second-chrominance value that is of the second-chrominance processing unit and that has a smallest difference from a reconstructed second-chrominance value of the second-chrominance processing unit, the method further includes determining that the preset candidate second-chrominance prediction modes include a candidate second-chrominance prediction mode set, where the candidate second-chrominance prediction mode set includes the directional prediction mode and the non-directional prediction mode, or determining that the preset candidate second-chrominance prediction modes include second-chrominance prediction modes of the one or more reconstructed second-chrominance units corresponding to the second-chrominance processing unit, or determining that the preset candidate second-chrominance prediction modes include first-chrominance prediction modes of reconstructed first-chrominance units in a neighborhood of the to-be-processed first-chrominance unit.
According to the method in the twelfth feasible implementation of the sixth aspect, in a thirteenth feasible implementation of the sixth aspect, the determining that the preset candidate second-chrominance prediction modes include second-chrominance prediction modes of the one or more reconstructed second-chrominance units corresponding to the second-chrominance processing unit further includes determining that the preset candidate second-chrominance prediction modes include second-chrominance prediction modes that are correlated to the second-chrominance prediction modes of the one or more reconstructed second-chrominance units.
According to the method in the thirteenth feasible implementation of the sixth aspect, in a fourteenth feasible implementation of the sixth aspect, for the second-chrominance prediction modes that are correlated to the second-chrominance prediction modes of the one or more reconstructed second-chrominance units when the second-chrominance prediction mode is the directional prediction mode, the correlated second-chrominance prediction modes include P prediction modes adjacent to the second-chrominance prediction modes of the one or more reconstructed second-chrominance units, where P is a positive number, or when the second-chrominance prediction mode is the directional prediction mode, the correlated second-chrominance prediction modes include Q prediction modes adjacent to the second-chrominance prediction modes of the one or more reconstructed second-chrominance units and include the non-directional prediction mode, where Q is a positive number, or when the second-chrominance prediction mode is the non-directional prediction mode, the correlated second-chrominance prediction modes include the preset directional prediction mode.
According to the method in any one of the twelfth to the fourteenth feasible implementations of the sixth aspect, in a fifteenth feasible implementation of the sixth aspect, the first-chrominance prediction modes of the reconstructed first-chrominance units in the neighborhood of the to-be-processed first-chrominance unit include first-chrominance prediction modes of reconstructed first-chrominance units that are adjacent to the top, the left, the upper left, the upper right, and the lower left of the to-be-processed first-chrominance unit.
According to the method in any one of the sixth aspect, or the first to the fifteenth feasible implementations of the sixth aspect, in a sixteenth feasible implementation of the sixth aspect, before the determining a candidate predicted second-chrominance value that is of the second-chrominance processing unit and that has a smallest difference from a reconstructed second-chrominance value of the second-chrominance processing unit, the method further includes performing downsampling on the second-chrominance processing unit, and correspondingly, the determining a candidate predicted second-chrominance value that is of the second-chrominance processing unit and that has a smallest difference from a reconstructed second-chrominance value of the second-chrominance processing unit includes determining one or more candidate predicted second-chrominance values of the downsampled second-chrominance processing unit that have a smallest difference from a reconstructed second-chrominance value of the downsampled second-chrominance processing unit.
According to the method in the sixteenth feasible implementation of the sixth aspect, in a seventeenth feasible implementation of the sixth aspect, the performing downsampling on the second-chrominance processing unit includes performing downsampling on the second-chrominance processing unit using a filter whose filtering coefficient is
or performing downsampling on the second-chrominance processing unit using a filter whose filtering coefficient is
or performing downsampling on the second-chrominance processing unit using a filter whose filtering coefficient is
or performing downsampling on the second-chrominance processing unit using a filter whose filtering coefficient is
A seventh aspect of the embodiments of the present application provides a computer storage medium, configured to store a computer software instruction used to implement the method according to the fifth aspect or the sixth aspect, where the instruction includes a program used to perform the chrominance prediction method designed in the fifth aspect or the sixth aspect.
An eighth aspect of the embodiments of the present application provides an apparatus, including a memory and a processor coupled to the memory, where the processor is configured to perform the chrominance prediction method designed in the fifth aspect or the sixth aspect.
It should be understood that technical solutions in the second to the eighth aspects of the embodiments of the present application are consistent with those in the first aspect of the embodiments of the present application, and beneficial effects thereof are similar. Details are not described again.
It can be learned from the foregoing technical solutions that the embodiments of the present application provide the chrominance prediction method and apparatus. Based on a reconstructed luminance unit, an improved candidate chrominance prediction mode set is constructed, and a chrominance prediction mode is selected by performing encoding using an appropriate mode, thereby improving encoding efficiency.
To describe the technical solutions in the embodiments of the present application more clearly, the following briefly describes the accompanying drawings needed for describing the embodiments. Apparently, the accompanying drawings in the following descriptions show merely some embodiments of the present application, and a person of ordinary skill in the art may derive other drawings from these accompanying drawings without creative efforts.
To make the objectives, technical solutions, and advantages of the present application clearer, the following further describes the present application in detail with reference to the accompanying drawings. Apparently, the described embodiments are merely some rather than all of the embodiments of the present application. All other embodiments obtained by a person of ordinary skill in the art based on the embodiments of the present application without creative efforts shall fall within the protection scope of the present application.
In the specification, claims, and accompanying drawings of the present application, the terms “first”, “second”, and the like are intended to distinguish between different objects, but do not indicate a particular order. In addition, the terms “including”, “having”, or any other variant thereof are intended to cover a non-exclusive inclusion. For example, a process, a method, a system, a product, or a device that includes a series of steps or units is not limited to the listed steps or units, but optionally further includes an unlisted step or unit, or optionally further includes another inherent step or unit of the process, the method, or the device.
The electronic device 50 may be, for example, a mobile terminal or user equipment in a wireless communications system. It should be understood that the embodiments of the present application may be implemented in any electronic device or apparatus that may need to encode and decode, or encode, or decode a video image.
The apparatus 50 may include a housing 30 for accommodating and protecting a device. The apparatus 50 may further include a display 32 that is in a form of a liquid crystal display. In other embodiments of the present application, the display may be any appropriate display technology suitable for displaying an image or a video. The apparatus 50 may further include a keypad 34. In other embodiments of the present application, any appropriate data or user interface mechanism may be used. For example, a user interface may be implemented as a virtual keyboard or data entry system as a part of a touch-sensitive display. The apparatus may include a microphone 36 or any appropriate audio input, and the audio input may be digital or analog signal input. The apparatus 50 may further include the following audio output device. In this embodiment of the present application, the audio output device may be any one of the following an earpiece 38, a speaker, or an analog audio output connection or a digital audio output connection. The apparatus 50 may further include a battery 40. In other embodiments of the present application, the device may be powered by any appropriate mobile energy device, such as a solar cell, a fuel cell, or a clock generator. The apparatus may further include an infrared port 42 for short-range line of sight communication with other devices. In another embodiment, the apparatus 50 may further include any appropriate short-range communication solution, such as a Bluetooth wireless connection or a USB/firewire wired connection.
The apparatus 50 may include a controller 56 or a processor for controlling the apparatus 50. The controller 56 may be connected to a memory 58. In this embodiment of the present application, the memory may store data in an image form and data in an audio form, and/or may further store an instruction to be executed on the controller 56. The controller 56 may also be connected to a codec circuit 54 suitable for implementing coding and decoding of audio and/or video data or assisting in coding and decoding implemented by the controller 56.
The apparatus 50 may further include a card reader 48 and a smart card 46, for example, a Universal Integrated Circuit Card (UICC) and a UICC reader, used for providing user information and suitable for providing authentication information for authentication and authorization of a user on a network.
The apparatus 50 may further include a radio interface circuit 52. The radio interface circuit is connected to the controller and is suitable for generating a wireless communications signal, for example, for communication with a cellular communications network, a wireless communications system, or a wireless local area network. The apparatus 50 may further include an antenna 44. The antenna is connected to the radio interface circuit 52 for transmitting a radio frequency signal generated at the radio interface circuit 52 to another apparatus (other apparatuses) and receive a radio frequency signal from the another apparatus (other apparatuses).
In some embodiments of the present application, the apparatus 50 includes a camera that is capable of recording or detecting individual frames, and the codec 54 or the controller receives and processes these single frames. In some embodiments of the present application, the apparatus may receive to-be-processed video image data from another device prior to transmission and/or storage. In some embodiments of the present application, the apparatus 50 may receive an image using a wireless or wired connection for coding/decoding.
The destination apparatus 14 may receive the encoded video data from the source apparatus 12 through a channel 16. The channel 16 may include one or more media and/or apparatuses capable of moving the encoded video data from the source apparatus 12 to the destination apparatus 14. In one instance, the channel 16 may include one or more communications media that enable the source apparatus 12 to directly transmit the encoded video data to the destination apparatus 14 in real time. In this instance, the source apparatus 12 may modulate the encoded video data according to a communications standard (for example, a wireless communications protocol), and may transmit the modulated video data to the destination apparatus 14. The one or more communications media may include wireless and/or wired communications media, such as a radio frequency (RF) spectrum or one or more physical transmission lines. The one or more communications media may form a part of a packet-based network (for example, a local area network, a wide area network, or a global network (for example, the Internet)). The one or more communications media may include a router, a switch, a base station, or another device that facilitates communication from the source apparatus 12 to the destination apparatus 14.
In another instance, the channel 16 may include a storage medium that stores the encoded video data generated by the source apparatus 12. In this instance, the destination apparatus 14 may access the storage medium through disk access or card access. The storage medium may include a variety of locally accessible data storage media, such as a BLU-RAY disc, a Digital Versatile Disc (DVD), a Compact Disc Read Only Memory (CD-ROM), a flash memory, or another suitable digital storage medium used for storing the encoded video data.
In another instance, the channel 16 may include a file server or another intermediate storage apparatus that stores the encoded video data generated by the source apparatus 12. In this instance, the destination apparatus 14 may access, through streaming transmission or downloading, the encoded video data stored on the file server or the another intermediate storage apparatus. The file server may be a type of server that can store the encoded video data and transmit the encoded video data to the destination apparatus 14. An instance file server includes a web server (for example, used for a website), a File Transfer Protocol (FTP) server, a network attached storage (NAS) apparatus, and a local disk drive.
The destination apparatus 14 may access the encoded video data using a standard data connection (for example, an Internet connection). An instance type of the data connection includes a wireless channel (for example, a wireless fidelity (Wi-Fi) connection) or a wired connection (for example, a digital subscriber line (DSL) or a cable modem) that is suitable for accessing the encoded video data stored on the file server, or a combination of a wireless channel and a wired connection. Transmission of the encoded video data from the file server may be streaming transmission, downloading transmission, or a combination of streaming transmission and downloading transmission.
The technology of the present application is not limited to a wireless application scenario. For example, the technology may be used to support video coding/decoding of a plurality of multimedia applications such as the following applications over-the-air television broadcasting, cable television transmission, satellite television transmission, streaming video transmission (for example, through the Internet), encoding of video data stored on a data storage medium, decoding of video data stored on a data storage medium, or other applications. In some instances, the video coding/decoding system 10 may be configured to support unidirectional or bidirectional video transmission, to support applications such as streaming video transmission, video play, video broadcasting, and/or video telephony.
In the instance in
The video encoder 20 may encode video data from the video source 18. In some instances, the source apparatus 12 directly transmits the encoded video data to the destination apparatus 14 through the output interface 22. The encoded video data may also be stored on the storage medium or the file server, so that the destination apparatus 14 can access the encoded video data subsequently for decoding and/or playing.
In the instance in
The video encoder 20 and the video decoder 30 may operate according to a video compression standard (for example, the HEVC H.265 standard), and may comply with an HEVC test model (HM). Text descriptions ITU-T H.265 (V3) (April 2015) of the H.265 standard was released on Apr. 29, 2015 and can be downloaded from https://handle.itu.int/11.1002/1000/12455. All content of the file is incorporated herein by reference.
Alternatively, the video encoder 20 and the video decoder 30 may be designed based on a JEM test model, and reference software for the Joint Exploration Model (JEM) test model may be downloaded from https://jvet.hhi.fraunhofer.de/svn/svn_HMJEMSoftware. All content of the file is incorporated herein by reference.
Alternatively, the video encoder 20 and the video decoder 30 may operate according to other proprietary or industry standards. The standards include ITU-TH.261, ISO/IECMPEG-1 Visual, ITU-TH.262 or ISO/IECMPEG-2 Visual, ITU-TH.263, ISO/IECMPEG-4 Visual, ITU-TH.264 (also referred to as ISO/IECMPEG-4 AVC), and include Scalable Video Coding (SVC) and Multi-view Video Coding (MVC) extensions. It should be understood that the technology of the present application is not limited to any specific coding/decoding standard or technology.
In addition,
Each of the video encoder 20 and the video decoder 30 may be implemented as any one of a plurality of appropriate circuits, for example, one or more microprocessors, digital signal processing (DSP), application specific integrated circuit (ASIC), field programmable gate array (FPGA), discrete logic circuit, hardware, or any combination thereof. If the technology is partially or completely implemented by software, the apparatus may store an instruction of the software in an appropriate non-transitory computer readable storage medium, and may execute, using one or more processors, an instruction in the hardware to execute the technology in the present application. Any one of the foregoing items (including the hardware, the software, a combination of the hardware and the software, and the like) may be considered as one or more processors. Each of the video encoder 20 and the video decoder 30 may be included in one or more encoders or decoders, and either of the video encoder and the video decoder may be integrated as a part of a combined encoder/decoder (codec (CODEC)) in another apparatus.
The present application may generally indicate that a video encoder 20 “signals” one piece of information to another apparatus (for example, a video decoder 30). The term “signals” may generally mean transfer of a syntax element and/or represent transfer of encoded video data. The transfer may occur in real time or approximately in real time. Alternatively, such communication may occur over a time span, for example, may occur during encoding when a syntax element is stored in a computer-readable storage medium using binary data obtained through encoding. The syntax element can be retrieved by the decoding apparatus at any time after being stored in the medium.
Each of the encoder-side prediction module 201 and the decoder-side prediction module 206 includes a prediction processing unit, and the prediction processing unit may segment a pixel block of a coding unit (CU) in one or more prediction units (PU) of the CU. The video encoder 20 and the video decoder 30 may support various PU sizes. It is assumed that a size of a specific CU is 2N×2N, the video encoder 20 and the video decoder 30 may support a PU size of 2N×2N or N×N for intra-frame prediction, and may further support symmetric PUs of 2N×N, N×2N or a similar size, and may further support asymmetric PUs of 2N×nU, 2N×nD, nL×2N, and nR×2N. This is not limited.
For intra-frame prediction, the prediction processing unit may generate predictive data of the PU by performing intra-frame prediction on the PU. The predictive data of the PU may include a predictive pixel block of the PU and various syntax elements. The prediction processing unit may perform intra-frame prediction on PUs in a strip I, a strip P, and a strip B.
To perform intra-frame prediction on the PU, the prediction processing unit may use a plurality of intra-frame prediction modes to generate a plurality of sets of predictive data of the PU. To generate the set of predictive data of the PU using the intra-frame prediction mode, the prediction processing unit may extend sampling of sampling blocks from adjacent PUs across a sample block of the PU in a direction associated with the intra-frame prediction mode. It is assumed that an encoding order from left to right and from top to bottom is used for encoding the PU, the CU and a CTB. Adjacent PUs may be adjacent to the top, the upper right, the upper left, or the left of the PU. Prediction processing units may use different quantities of intra-frame prediction modes. In some instances, the quantity of intra-frame prediction modes may depend on a pixel block size of the PU.
In some instances, the prediction processing unit selects predictive data of the PU of the CU based on a code rate/distortion measure of the set of predictive data. For example, the prediction processing unit performs selection between an encoding mode and a parameter value of the encoding mode, such as an intra-frame prediction direction, using a Lagrangian cost function. In this type of cost function, a weighting factor lambda is used to associate actual or estimated image distortion caused by a lossy encoding method with an actual or estimated information amount needed to represent a pixel value in an image area: C=D+lambda×R, where C is to-be-minimized Lagrangian costs, D is image distortion having a mode and a parameter of the mode, such as a mean square error, and R is a quantity of bits needed to reconstruct an image block in a decoder, for example, includes a data amount used to represent an intra-frame prediction direction. Usually, an encoding mode with minimum costs is selected as an actual encoding mode.
The H.265 standard is used as an example below to describe an intra-frame prediction technology in detail. It should be understood that intra-frame prediction technologies in other video compression coding/decoding standards are based on a same technical principle, and can achieve similar technical effects. Details are not described again.
Intra-frame prediction is to predict a current block using a reconstructed image of an encoded pixel block in a current frame. A total of 35 intra-frame prediction modes are defined in H.265, including a planar prediction mode, a DC prediction mode, and 33 directional intra-frame prediction modes, and the 35 intra-frame prediction modes are used for intra-frame prediction of a luminance component. Definitions of the 33 directions are shown in
In H.265, pixels in a left column, pixels in an upper row, an upper-left pixel, a lower-left pixel, or an upper-right pixel of a current pixel block may be used as a reference pixel. As shown in
The planar prediction mode and the direct current prediction mode in H.265 are also referred to as non-directional prediction modes in this specification. For the planar prediction mode, as shown in
For chrominance prediction, five chrominance prediction modes are specified in H.265 a planar prediction mode, a horizontal prediction mode, a vertical prediction mode, a direct current prediction mode, and a DM. The direct prediction mode indicates that a same prediction mode is used for chrominance prediction and luminance prediction of a corresponding pixel block. When a luminance prediction mode represented by the direct prediction mode is the same as any one of the first four modes, a luminance prediction mode whose index number is 34 is used as a candidate chrominance prediction mode, so that a total quantity of the candidate chrominance prediction modes is still 5.
In a process of forming the H.265 standard, there is further a LM. The LM prediction mode is a mode in which a reconstructed luminance pixel is used to continue to perform linear prediction on a chrominance pixel of a current pixel block, and in the LM prediction mode, a least square method is used to fit a linear relationship between a reconstructed pixel around a downsampled luminance component and a reconstructed pixel around a chrominance component. “WD3: Working Draft 3 of High-efficiency Video Coding” of JCTVC-E603 may be downloaded from http://phenix.int-evry.fr/jct/, and the LM prediction mode is described in detail. All content of this reference is incorporated herein by reference.
After the H.265 standard is formulated, a video compression coding/decoding technology continues to develop. “Algorithm Description of Joint Exploration Test Model 4” of JVE-D1001 may be downloaded from https://phenix.int-evry.fr/jct/, and a related technology is described in detail. All content of this reference is incorporated herein by reference.
Embodiments of the present application provide a chrominance prediction method and apparatus. Based on a reconstructed luminance unit, an improved candidate chrominance prediction mode set is constructed, and a chrominance prediction mode is selected by performing encoding using an appropriate mode, thereby improving encoding efficiency.
In a feasible implementation, a chrominance pixel block on which chrominance prediction is currently to be performed may be referred to as a to-be-processed chrominance unit or a to-be-processed chrominance block. In some video compression coding/decoding technologies, division of a chrominance unit and division of a luminance unit are correlated. A chrominance unit and a corresponding luminance unit are used to encode a chrominance component and a luminance component of a same image area, and a size of the chrominance unit is less than a size of the corresponding luminance unit. For example, for a video sequence in a YUV 420 format, for an image area corresponding to a 4×4 pixel matrix, a luminance unit with a 4×4 pixel matrix, a U chrominance unit with a 2×2 pixel matrix, and a V chrominance unit with a 2×2 pixel matrix are used to encode a luminance component and chrominance components of the image area. It should be understood that one U chrominance unit or one V chrominance unit corresponds to one luminance unit. In some other video compression coding/decoding technologies, division of a chrominance unit and division of a luminance unit are not correlated. A chrominance unit and one or more corresponding luminance units are used to encode a chrominance component and a luminance component of a same image area, and a size of the chrominance unit is less than a size of the corresponding luminance unit. For example, for a video sequence in a YUV 420 format, for an image area corresponding to an 8×8 pixel matrix, one luminance unit with an 8×8 pixel matrix, or four luminance units with a 4×4 pixel matrix, or a U chrominance unit with a quarter of a 16×16 pixel matrix and one 4×4 pixel matrix and a V chrominance unit with one 4×4 pixel matrix are used to encode a luminance component and chrominance components of the image area. It should be understood that each of one U chrominance unit and one V chrominance unit corresponds to one or more luminance units. When a chrominance unit corresponds to some luminance units, it is also referred to as that the chrominance unit corresponds to one luminance unit. In a feasible implementation, the foregoing some or one or more luminance units that are used to encode a same image area together with one chrominance unit may be referred to as a luminance processing unit or a luminance processing block. When chrominance prediction is being performed, the luminance unit has been reconstructed, in other words, a reconstructed luminance unit or a reconstructed luminance block is generated. It may be considered that one luminance processing unit corresponds to one or more reconstructed luminance units. It should be understood that the U chrominance unit and the V chrominance unit share a prediction mode for chrominance prediction, and perform intra-frame prediction processing including same steps. Unless otherwise specified, physical quantities such as a chrominance unit and a chrominance component in the following descriptions are a U chrominance unit or a V chrominance unit and a U chrominance component or a V chrominance component, or the like, and details are not described.
S1001. Perform Downsampling on a Luminance Processing Unit Corresponding to a to-be-Processed Chrominance Unit.
A downsampling process is also a process of low-pass filtering. Because details of a chrominance component are not as rich as details of a luminance component, low-pass filtering is performed on the luminance component, so that a property of the luminance component is closer to that of the chrominance component, and a subsequently obtained candidate luminance prediction mode is more suitable for chrominance prediction.
In a feasible implementation, after the downsampling, the luminance processing unit and the to-be-processed chrominance unit have a same size. For a video sequence in a YUV 420 format, downsampling may be performed using a filter whose filtering coefficient is
It should be understood that downsampling may be alternatively performed on the luminance processing unit using a filter whose filtering coefficient is
to obtain the downsampled luminance processing unit. Another form of filter used for a downsampling operation may be alternatively used. This is not limited.
A beneficial effect of this embodiment of the present application is that different filters may be selected based on different complexity and performance requirements, and the different filters are suitable for different application scenarios.
It should be understood that step S1001 is an optional step. When step S1001 is performed, a subsequent luminance processing unit is a downsampled luminance processing unit, and when step S1001 is not performed, a subsequent luminance processing unit is a luminance processing unit on which downsampling is not performed.
S1002. Determine Preset Candidate Luminance Prediction Modes.
The preset candidate luminance prediction modes include at least one of a prediction mode included in directional prediction modes and a prediction mode included in non-directional prediction modes. The directional prediction modes include a prediction mode that is at an angle of N degrees with a horizontal direction of a two-dimensional plane, N is a non-negative number less than 360, and the non-directional prediction modes include a direct current prediction mode and a planar prediction mode. In other words, one or more prediction modes are preset in the directional prediction modes and the non-directional prediction modes as the preset candidate luminance prediction modes. However, descriptions of a prediction mode in this embodiment of the present application are not limited to the Background. For directional prediction, a value range of N is wider. Non-directional prediction may further include various variant prediction manners of the direct current prediction mode and the planar prediction mode. This is not limited.
In a feasible implementation, the preset candidate luminance prediction modes include all directional prediction modes and all non-directional prediction modes in a candidate luminance prediction mode set, or the preset candidate luminance prediction modes include a subset including any combination of all directional prediction modes and all non-directional prediction modes in a candidate luminance prediction mode set.
In a feasible implementation, the preset candidate luminance prediction modes include luminance prediction modes of the one or more reconstructed luminance units corresponding to the luminance processing unit.
In a feasible implementation, the preset candidate luminance prediction modes include luminance prediction modes that are correlated to the luminance prediction modes of the one or more reconstructed luminance units. For example, when the luminance prediction mode is the directional prediction mode, the preset candidate luminance prediction modes include P prediction modes adjacent to the luminance prediction modes of the one or more reconstructed luminance units, where P is a positive number, or the preset candidate luminance prediction modes further include the non-directional prediction mode. When the luminance prediction mode is the non-directional prediction mode, the preset candidate luminance prediction modes include the preset directional prediction mode. For example, when the luminance prediction mode is a horizontal prediction mode (a mode number is 10, as shown in
In a feasible implementation, the preset candidate luminance prediction modes include chrominance prediction modes of reconstructed chrominance units in a neighborhood of the to-be-processed chrominance unit, for example, chrominance prediction modes of reconstructed chrominance units that are adjacent to the top, the left, the upper left, the upper right, and the lower left of the to-be-processed chrominance unit. For example, as shown in
It should be understood that the foregoing implementations of the preset candidate luminance prediction modes may be performed independently, or may be performed in combination. This is not limited.
A beneficial effect of this embodiment of the present application is that prediction modes with relatively strong correlation are selected as the candidate luminance prediction modes to participate in searches, thereby improving search efficiency and increasing a search speed.
It should be understood that step S1002 is a process of presetting candidate luminance prediction modes. In some feasible implementations, the preset candidate luminance prediction modes have been built in an encoder and a decoder. Therefore, step S1002 is an optional step, and step S1002 may be performed before step S1001, or may be performed after step S1001. This is not limited.
S1003. Determine One or More Target Luminance Prediction Modes of the Luminance Processing Unit from the Preset Candidate Luminance Prediction Modes.
In a feasible implementation, before a chrominance unit is predicted, steps of prediction and reconstruction have been completed for a corresponding luminance processing unit. Therefore, a luminance prediction mode for actual encoding of one or more reconstructed luminance units corresponding to the luminance processing unit and a corresponding reconstructed luminance value have been determined. The luminance processing unit is used as an independent prediction unit to construct a candidate predicted luminance value based on the candidate luminance prediction mode, and any candidate predicted luminance value corresponds to one candidate luminance prediction mode.
In a feasible implementation, step S1003 is iteratively determining an initial prediction mode from the preset candidate luminance prediction modes until the initial prediction mode meets a preset condition.
One or more candidate predicted luminance values generated based on one or more initial prediction modes that are generated through iteration and that meet the preset condition are one or more candidate predicted luminance values of the luminance processing unit that are determined in this step and that have a smallest difference from a reconstructed luminance value of the luminance processing unit, that is, candidate predicted luminance values corresponding to the one or more target luminance prediction modes.
It should be understood that for the one or more candidate predicted luminance values of the luminance processing unit that have the smallest difference from the reconstructed luminance value of the luminance processing unit, a difference between a predicted luminance value, of the luminance processing unit, corresponding to any target luminance prediction mode and the reconstructed luminance value of the luminance processing unit is less than or equal to a difference between a predicted luminance value, of the luminance processing unit, corresponding to each candidate luminance prediction mode that is not determined as the target luminance prediction mode and the reconstructed luminance value of the luminance processing unit.
A beneficial effect of this embodiment of the present application is as follows. Through iterative searches, an operation amount needed for finding an optimal prediction mode is reduced, processing efficiency is improved, and a processing time is reduced.
In a feasible implementation, as shown in
S1301. Determine a Candidate Luminance Prediction Mode Subset from the Preset Candidate Luminance Prediction Modes.
In a feasible implementation, step S1301 is determining that the candidate luminance prediction mode subset is the same as the preset candidate luminance prediction modes, in other words, using all the preset candidate luminance prediction modes as the candidate luminance prediction mode subset.
In a feasible implementation, step S1301 is determining that the candidate luminance prediction mode subset includes prediction modes selected from the directional prediction modes at a preset angle interval, in other words, extracting, at the preset angle interval, prediction modes corresponding to a plurality of prediction directions from prediction directions to form the candidate luminance prediction mode subset. For example, when there are 32 directional prediction modes, one prediction mode may be selected at an interval of three prediction modes based on prediction directions represented by the 32 directional prediction modes, and the preset angle interval is an angle interval including three prediction modes, to form the candidate luminance prediction mode subset. It should be understood that the foregoing embodiment is merely an example for description, and a quantity of directional prediction modes and the preset angle interval are not limited.
In a feasible implementation, step S1301 is determining that the candidate luminance prediction mode subset includes prediction modes selected from the directional prediction modes at a preset angle interval and the non-directional prediction mode. In other words, in this embodiment, in addition to the selected directional prediction mode similar to that in the previous embodiment, the candidate luminance prediction mode subset further includes the non-directional prediction mode, for example, at least one of the direct current prediction mode and the planar prediction mode. The prediction modes selected from the directional prediction mode at the preset angle interval and the non-directional prediction mode jointly form the candidate luminance prediction mode subset.
S1302. Select an Initial Prediction Mode from the Candidate Luminance Prediction Mode Subset.
In a feasible implementation, step S1302 may include the following steps.
S1321. Calculate at Least Two Differences Between a Reconstructed Luminance Value and Candidate Predicted Luminance Values Corresponding to at Least Two Candidate Luminance Prediction Modes in the Candidate Luminance Prediction Mode Subset.
It should be understood that, in a specific implementation, the prediction unit and the reconstructed unit usually participate in calculation in a form of a pixel matrix. Therefore, it may be assumed that the candidate predicted luminance value is a candidate predicted luminance value matrix, and the reconstructed luminance value is a reconstructed luminance value matrix. This step is calculating a difference between an element at a corresponding location in each of at least two candidate predicted luminance value matrices and an element at a corresponding location in the reconstructed luminance value matrix, to obtain difference matrices, and determining the differences based on the difference matrices.
It should be understood that, when there is one candidate luminance prediction mode in the candidate luminance prediction mode subset, this calculation step and a subsequent comparison step are not needed, and the candidate luminance prediction mode is the initial prediction mode.
It may be assumed that a candidate predicted luminance value matrix Pred is
and that a reconstructed luminance value matrix Recon is
In this case, a difference matrix Res is
In a feasible implementation, absolute values of all elements in the difference matrix are accumulated as the difference. For example, the difference may be calculated as
In a feasible implementation, the difference matrix is transformed to obtain a transformed difference matrix, and absolute values of all elements in the transformed difference matrix are accumulated as the difference. For example, it may be assumed that a transform operation is T, a transformed difference matrix
and the difference may be calculated as
In a feasible implementation, the difference matrix is sequentially transformed, quantized, dequantized, and inversely transformed to obtain a reconstructed difference matrix, and absolute values of all elements in the reconstructed difference matrix are accumulated as the difference.
A beneficial effect of this embodiment of the present application is that different calculation manners may be selected based on different complexity and performance requirements, and the different calculation manners are suitable for different application scenarios.
The transform in the foregoing plurality of embodiments may be of a plurality of different types, such as Hadamard transform, Haar transform, DCT, and DST. This is not limited. Correspondingly, the inverse transform in the embodiments includes inverse Hadamard transform corresponding to the Hadamard transform, inverse Haar transform corresponding to the Haar transform, inverse discrete cosine transform corresponding to the discrete cosine transform, or inverse discrete sine transform corresponding to the discrete sine transform. For example, when the Hadamard transform is used to complete the transform operation, it may be assumed that a transform base matrix of the Hadamard transform is
In this case,
A beneficial effect of this embodiment of the present application is that different transform manners may be selected based on different complexity and performance requirements, and the different transform manners are suitable for different application scenarios.
S1322. Compare the at Least Two Differences to Determine that the Initial Prediction Mode Includes Candidate Luminance Prediction Modes Corresponding to One or More Smallest Differences.
The at least two differences obtained in step S1321 are compared. In a feasible implementation, a candidate luminance prediction mode corresponding to the smallest difference may be selected as the initial prediction mode. In a feasible implementation, a plurality of smallest differences, that is, a preset quantity of differences that are smaller than other differences in difference comparison, may be selected as initial selection prediction modes. A difference between a predicted luminance value, of the luminance processing unit, corresponding to any initial prediction mode and the reconstructed luminance value of the luminance processing unit is less than or equal to a difference between a predicted luminance value, of the luminance processing unit, corresponding to each candidate luminance prediction mode that is in the candidate luminance prediction mode subset and that is not determined as the initial prediction mode and the reconstructed luminance value of the luminance processing unit.
S1303. Update the Preset Candidate Luminance Prediction Modes Based on the Initial Prediction Mode when the Initial Prediction Mode does not Meet a Preset Condition.
Step S1303 is a condition for determining whether to perform a next iteration. It is determined whether the initial prediction mode meets the preset condition.
In a feasible implementation, the preset condition includes the initial prediction mode is the non-directional prediction mode, and when the initial prediction mode is the non-directional prediction mode, iteration is terminated.
In a feasible implementation, the preset condition includes each prediction mode that has a preset angle difference from the initial prediction mode exists in any candidate luminance prediction mode subset in the iteration, and when each prediction mode that has the preset angle difference from the initial prediction mode has been used for difference comparison in each iteration, iteration is terminated.
In a feasible implementation, the preset condition includes a quantity of iteration times of the initial prediction mode reaches a preset quantity of times.
In a feasible implementation, the preset condition includes a difference corresponding to the initial prediction mode is less than a preset threshold.
It should be understood that the foregoing embodiments of a plurality of preset conditions may be considered separately, or may be considered comprehensively in an “and” or “or” relationship. This is not limited.
A beneficial effect of this embodiment of the present application is that different iteration termination conditions may be selected based on different complexity and performance requirements, and the different iteration termination conditions are suitable for different application scenarios.
In addition, when it is determined to enter a next iteration, in step S1303, the preset candidate luminance prediction modes are updated based on the initial prediction mode.
It should be understood that a process of the foregoing iterative calculation may be alternatively described as follows determining a candidate luminance prediction mode subset from the preset candidate luminance prediction modes, selecting an initial prediction mode from the candidate luminance prediction mode subset, and when the initial prediction mode does not meet a preset condition, updating the preset candidate luminance prediction modes based on the initial prediction mode, redetermining the candidate luminance prediction mode subset from the updated preset candidate luminance prediction modes, and reselecting the initial prediction mode from the redetermined candidate luminance prediction mode subset, until the reselected initial prediction mode meets the preset condition, or when the initial prediction mode meets a preset condition, using the initial prediction mode as the target luminance prediction mode. The preset condition includes the initial prediction mode is the non-directional prediction mode, or each prediction mode that has a preset angle difference from the initial prediction mode exists in the candidate luminance prediction mode subset that is determined from the preset candidate luminance prediction modes or that is redetermined from the updated preset candidate luminance prediction modes, or a quantity of reselection times of the initial prediction mode reaches a preset quantity of times, or a difference corresponding to the initial prediction mode is less than a preset threshold.
In a feasible implementation, the updated preset candidate luminance prediction modes include an initial selection prediction mode selected in a previous iteration and prediction modes that have a preset angle difference from the initial prediction mode. In a feasible implementation, the prediction modes that have the preset angle difference from the initial prediction mode include M prediction modes that are adjacent to the initial prediction mode, where M is a positive number. When the initial prediction mode is the horizontal prediction mode (the mode number is 10, as shown in
It should be understood that, in the previous iteration, a difference corresponding to the initial prediction mode has been obtained. Therefore, when a difference calculation manner remains unchanged, in a current iteration, the difference corresponding to the initial prediction mode may be not calculated again, and the difference corresponding to the initial prediction mode is recorded after the previous iteration and is used for comparison with a difference corresponding to another prediction mode that has the preset angle difference from the initial prediction mode in the current iteration.
After the preset candidate luminance prediction modes are updated, step S1301 is performed to enter the next iteration.
It should be understood that, for step S1003, in a feasible implementation, iteration is performed once. As shown in
S1311. Determine a Candidate Luminance Prediction Mode Subset from the Preset Candidate Luminance Prediction Modes.
S1312. Calculate at Least Two Differences Between a Reconstructed Luminance Value and Candidate Predicted Luminance Values Corresponding to at Least Two Candidate Luminance Prediction Modes in the Candidate Luminance Prediction Mode Subset.
S1313. Compare the at Least Two Differences to Determine Candidate Luminance Prediction Modes Corresponding to One or More Smallest Differences.
For a specific implementation, refer to related steps in the foregoing embodiment. Details are not described again.
S1004. Use the Target Luminance Prediction Mode as a Candidate Chrominance Prediction Mode, to Obtain a Predicted Chrominance Value of the to-be-Processed Chrominance Unit.
In a feasible implementation, as shown in
S1401. Determine a Candidate Chrominance Prediction Mode Set of the to-be-Processed Chrominance Unit Based on the Target Luminance Prediction Mode.
In a feasible implementation, it is determined that the candidate chrominance prediction mode set includes only the target luminance prediction mode. It should be understood that, in a feasible implementation, when there is only one target luminance prediction mode, the target luminance prediction mode is the chrominance prediction mode of the to-be-processed chrominance unit, and no subsequent step needs to be performed.
A beneficial effect of this embodiment of the present application is that the candidate chrominance prediction mode includes only the target luminance prediction mode, thereby reducing a code rate of an encoding mode.
In a feasible implementation, it is determined that the candidate chrominance prediction mode set includes the target luminance prediction mode and one or more preset candidate chrominance prediction modes. In a feasible implementation, the preset candidate chrominance prediction modes include at least one of a horizontal prediction mode, a vertical prediction mode, a direct current prediction mode, a planar prediction mode, and a DM. In a feasible implementation, the preset candidate chrominance prediction modes further include a directional prediction mode in a non-horizontal direction or a non-vertical direction, or a LM. For example, it may be assumed that the target luminance prediction mode is the horizontal prediction mode, and the candidate chrominance prediction mode set includes the target luminance prediction mode (the horizontal prediction mode), the vertical prediction mode, the planar prediction mode, and the direct prediction mode (a prediction mode whose mode number is 8, as shown in
A beneficial effect of this embodiment of the present application is that another preset prediction mode is added to the candidate chrominance prediction modes, thereby avoiding impact exerted on encoding performance when the target luminance prediction mode is misjudged.
S1402. Determine a Codeword of the Candidate Chrominance Prediction Mode in the Candidate Chrominance Prediction Mode Set.
In different chrominance prediction schemes, mode information of a current prediction unit is binarized using different encoding manners, and entropy encoding is then performed on a binary figure obtained through binarization using a CAB AC (context-adaptive binary arithmetic coding) technology. For example, in a chrominance prediction solution specified in the H.265 standard, a binarization solution shown in Table 1 is used.
In a feasible implementation, the codeword of the candidate chrominance prediction mode in the candidate chrominance prediction mode set is determined using a variable-length code. A rule of determining the codeword is that when probabilities that the candidate chrominance prediction modes are selected are close, higher efficiency is achieved if encoding is performed using a fixed-length code, and when the probabilities that the candidate chrominance prediction modes are selected have a relatively large difference, higher efficiency is achieved if encoding is performed using the variable-length code, and a chrominance mode that is more likely to be selected needs to be represented by a shorter codeword.
It may be assumed that one of the target luminance prediction modes that is corresponding to the candidate predicted luminance value that is of the luminance processing unit and that has a smallest difference from the reconstructed luminance value of the luminance processing unit is the first target luminance prediction mode, and when there is one target luminance prediction mode, the target luminance prediction mode is the first target luminance prediction mode.
In a feasible implementation, if the candidate chrominance prediction mode set includes at least the first target luminance prediction mode, the linear prediction mode, and the direct prediction mode, codewords are respectively assigned to the first target luminance prediction mode, the linear prediction mode, and the direct prediction mode in ascending order of length. A shortest codeword for the candidate chrominance prediction mode in the candidate chrominance prediction mode set may be assigned to the first target luminance prediction mode.
In a feasible implementation, if the candidate chrominance prediction mode set includes at least the first target luminance prediction mode, the linear prediction mode, and the direct prediction mode, codewords are respectively assigned to the linear prediction mode, the first target luminance prediction mode, and the direct prediction mode in ascending order of length. A shortest codeword for the candidate chrominance prediction mode in the candidate chrominance prediction mode set may be assigned to the linear prediction mode.
In a feasible implementation, if the candidate chrominance prediction mode set includes at least the first target luminance prediction mode and the direct prediction mode, codewords are respectively assigned to the first target luminance prediction mode and the direct prediction mode in ascending order of length. A shortest codeword for the candidate chrominance prediction mode in the candidate chrominance prediction mode set may be assigned to the first target luminance prediction mode.
In a feasible implementation, when a quantity of the candidate chrominance prediction modes in the candidate chrominance prediction mode set changes, a length of the variable-length code changes accordingly, for example, one or more bits are added or removed. For example, when the first target luminance prediction mode is added to a candidate chrominance prediction mode set in an H.265 chrominance prediction solution, a binary representation solution shown in Table 2 is used, and it can be seen that each of binary representations corresponding to indexes 0 to 4 is increased by one bit.
It should be understood that step S1402 is a process of determining the codeword of the candidate chrominance prediction mode. In some feasible implementations, the codeword of the candidate chrominance prediction mode has been built in the encoder and the decoder. Therefore, step S1402 is an optional step. In some feasible implementations, when it is determined that there is only one candidate chrominance prediction mode, it is specified only in a codec side protocol that when the first target luminance prediction mode is used as a chrominance prediction mode, the chrominance prediction mode does not need to be encoded or decoded. Therefore, this step may not be needed either.
A beneficial effect of this embodiment of the present application is that the codeword of the candidate chrominance prediction mode is adjusted based on a probability that each candidate chrominance prediction mode in the candidate chrominance prediction mode set is selected, so that encoding performance can be further improved.
S1403. Select a Chrominance Prediction Mode of the to-be-Processed Chrominance Unit from the Candidate Chrominance Prediction Mode Set.
In a feasible implementation, the method is used to encode the to-be-processed chrominance unit, and step S1403 is traversing candidate chrominance prediction modes in the candidate chrominance prediction mode set to obtain corresponding candidate predicted chrominance values, calculating encoding costs of each candidate chrominance prediction mode based on an original value of the to-be-processed chrominance unit and the candidate predicted chrominance values obtained through the traversing, determining a candidate chrominance prediction mode with smallest encoding costs as the chrominance prediction mode of the to-be-processed chrominance unit, and encoding an index of the chrominance prediction mode in the candidate chrominance prediction mode set. It may be assumed that the candidate chrominance prediction mode set includes a prediction mode PredM 1 and a prediction mode PredM 2. Predicted values Pred 1 and Pred 2 corresponding to the prediction modes are obtained. Then, distortion values Dis 1=SumAbs (Pred 1−Org) and Dis 2=SumAbs (Pred 2−Org) are respectively calculated, where Org is the original value of the to-be-processed chrominance unit, and SumAbs( ) represents calculation of an absolute sum. The distortion values are compared to obtain code rates for encoding the distortion values. For example, the distortion values may be encoded, a code rate Rate 1=T(Dis 1), a code rate Rate 2=T(Dis 2), and T( ) represents an encoding process. Encoding costs Cost 1=f(Dis 1, Rate 1) and Cost 2=f(Dis 2, Rate 2) are calculated based on at least one of the code rate and the distortion, where f( ) represents calculation of the encoding costs. A prediction mode corresponding to smallest encoding costs is used as the chrominance prediction mode, when Cost 1<Cost 2, PredM 1 is selected as the chrominance prediction mode, and a mode index of PredM 1 is encoded into a bitstream.
In a feasible implementation, the selected chrominance prediction mode is encoded based on the codeword determined in the foregoing step.
In a feasible implementation, the method is used to decode the to-be-processed chrominance unit, and step S1403 is decoding an index of the chrominance prediction mode in the candidate chrominance prediction mode set from a bitstream, and determining the chrominance prediction mode from the candidate chrominance prediction mode set based on the index, to obtain a corresponding predicted chrominance value.
In a feasible implementation, the selected chrominance prediction mode is decoded based on the codeword determined in the foregoing step.
S1404. Determine a Predicted Chrominance Value of the to-be-Processed Chrominance Unit Based on the Chrominance Prediction Mode.
According to the chrominance prediction method provided in this embodiment of the present application, based on a reconstructed luminance unit, an improved candidate chrominance prediction mode set is constructed, and a chrominance prediction mode is selected by performing encoding using an appropriate mode, thereby improving encoding efficiency.
As described above, for video images in some formats, chrominance components may be further classified into a first-chrominance component and a second-chrominance component. For example, for a video image in a YUV 420 format, chrominance components may be further classified into a U component and a V component. In the embodiment shown in
In a feasible implementation, a first-chrominance component and a second-chrominance component of a same chrominance processing unit may have different prediction modes.
S2001. Determine a Candidate Predicted Second-Chrominance Value that is of a Second-Chrominance Processing Unit and that has a Smallest Difference from a Reconstructed Second-Chrominance Value of a Second-Chrominance Processing Unit.
The second-chrominance processing unit corresponds to one or more reconstructed second-chrominance units. Because human eyes are equally sensitive to the first-chrominance component and the second-chrominance component, generally, the second-chrominance processing unit includes only one reconstructed second-chrominance unit, and corresponds to one first-chrominance unit.
This step has a technical means similar to that of determining one or more target luminance prediction modes of the luminance processing unit from the preset candidate luminance prediction modes in step S1003. Reference may be made to step S1003 and various related feasible implementations. Details are not described again.
S2002. Use a Prediction Mode Corresponding to the Candidate Predicted Second-Chrominance Value as a First-Chrominance Prediction Mode, and Obtain a Predicted First-Chrominance Value of a to-be-Processed First-Chrominance Unit.
This step has a technical means similar to that of obtaining, when there is one target luminance prediction mode, a predicted chrominance value of the to-be-processed chrominance unit based on the target luminance prediction mode in step S1004. Reference may be made to step S1004 and various related feasible implementations. Details are not described again.
For example, it may be assumed that, for a chrominance unit in the YUV 420 format, a U component of the chrominance unit has been reconstructed based on a horizontal prediction mode. It may be assumed that candidate chrominance prediction modes include the horizontal prediction mode, a vertical prediction mode, a planar prediction mode, a direct prediction mode, a direct current prediction mode, and a linear prediction mode, and a luminance prediction mode corresponding to the direct prediction mode is different from other candidate chrominance prediction modes. In this case, six predicted values of the U component of the chrominance unit are still constructed based on the six candidate chrominance prediction modes, encoding costs corresponding to each candidate chrominance prediction modes are calculated based on a reconstructed value of the U component and the predicted values of the U component, for example, Hadamard transform is performed on differences between the reconstructed value of the U component and the predicted values, and energy of a transform coefficient is calculated as the encoding costs, a candidate chrominance prediction mode corresponding to smallest encoding costs is selected as a chrominance prediction mode of a V component, and a predicted value of the V component is obtained based on the chrominance prediction mode.
According to the chrominance prediction method provided in this embodiment of the present application, based on a reconstructed first-chrominance unit, an improved candidate chrominance prediction mode is constructed and used as a prediction mode of second-chrominance prediction, thereby improving encoding efficiency.
It should be understood that, on a decoder side, the implementations shown in
In a chrominance intra-frame prediction mode solution described in JVET-D1001, candidate chrominance intra-frame prediction modes include a direct prediction mode, a linear prediction mode, and four conventional modes (a planar prediction mode, a vertical prediction mode, a horizontal prediction mode, and a direct current prediction mode). When a luminance prediction mode represented by any one of the four conventional modes is the same as a luminance prediction mode represented by the direct prediction mode, a vertical/diagonal prediction mode (a mode number is 66) is used instead. For specific descriptions of the prediction modes, refer to JVET-D1001 and corresponding reference software HM16.6-JEM4.0. Details are not described.
Table 3 describes stipulations for binarization of mode information of a prediction mode in the chrominance intra-frame prediction mode solution.
In this embodiment, a solution described in the chrominance intra-frame prediction mode solution described in JVET-D1001 is improved. Corresponding to the direct prediction mode in the original solution, this embodiment provides a decoder-side direct mode (DDM). In this mode, based on a luminance pixel of a downsampled reconstructed luminance unit corresponding to a chrominance prediction unit, candidate chrominance intra-frame prediction modes are searched separately for optimal prediction modes on an encoder side and a decoder side. The optimal modes are determined by comparing a sum of absolute Hadamard transformed differences (SATD) of a transform residual that is obtained by the chrominance prediction unit in each candidate prediction mode.
S3001. Perform Downsampling on a Reconstructed Luminance Unit Corresponding to a Current Chrominance Prediction Unit Using a 2D 3×2 {{1 2 1}, {1 2 1}} Filter.
The reconstructed luminance unit corresponding to the current chrominance prediction unit is a reconstructed luminance unit that covers a same image area as the current chrominance prediction unit, and is also a reconstructed luminance unit that is in a same code block (CB) in JVET-D1001. For specific execution of step S3001, refer to step S1001. Details are not described again. The downsampling operation reduces search complexity of a subsequent operation.
S3002. Search for an Optimal Prediction Mode Using a SATD Criterion.
In consideration of balance between complexity and performance, an iterative search manner is used to search for the optimal prediction mode. Initial candidate chrominance prediction modes include a planar prediction mode, a direct current prediction mode, and directional prediction modes extracted, using four prediction directions as a sampling interval, from 65 directional prediction modes stipulated in JVET-D1001. A SATD value of each of the foregoing initial candidate chrominance prediction modes is calculated, and a candidate chrominance prediction mode corresponding to a minimum SATD value is used as a start prediction mode of a next iterative search.
If the foregoing start prediction mode is a planar prediction mode or a direct current prediction mode, the start prediction mode is used as the optimal prediction mode, and an iterative search process is terminated. Otherwise, two prediction modes that are respectively before and after the start prediction mode in a prediction direction of the start prediction mode and whose sampling interval is 2 are used as candidate chrominance prediction modes of a second search, and a prediction mode corresponding to a minimum SATD value is selected from the three prediction modes including the two prediction modes and the initial prediction mode, as a start prediction mode of a next iterative search.
If the foregoing start prediction mode is a planar prediction mode or a direct current prediction mode, the start prediction mode is used as the optimal prediction mode, and an iterative search process is terminated. Otherwise, two prediction modes that are most adjacent (that is, a sampling interval is 1) to the start prediction mode in a prediction direction of the start prediction mode are used as candidate chrominance prediction modes of a third search, and a prediction mode corresponding to a minimum SATD value is selected from the three prediction modes including the two prediction modes and the initial prediction mode, as the optimal prediction mode, that is, a prediction mode represented by a DDM.
S3003. Adjust a Codeword of a Chrominance Intra-Frame Prediction Mode Based on a DDM Mode.
If the DDM mode is added to a chrominance prediction mode list of the chrominance intra-frame prediction mode solution described in JVET-D1001, candidate chrominance intra-frame prediction modes include the direct prediction mode, the linear prediction mode, the DDM mode, and three conventional modes (the planar prediction mode, the vertical prediction mode, and the horizontal prediction mode). When a luminance prediction mode represented by any one of the three conventional modes is the same as the luminance prediction mode represented by the direct prediction mode, the direct current prediction mode is used instead. Codewords may be re-allocated, as shown in Table 4, where 0 to 2 represent the three conventional modes.
In addition, for intra-frame prediction, a scanning order of transform coefficients depends on a prediction mode. The DDM mode is obtained based on a luminance reconstruction pixel, and therefore, obtaining of the DDM mode and decoding of a transform coefficient depend on each other. To resolve this problem, when a scanning order of chrominance prediction modes is being determined, a scanning manner corresponding to the direct prediction mode is used as a scanning manner corresponding to the DDM mode.
Table 5 shows beneficial effects of this embodiment in comparison with the chrominance intra-frame prediction mode solution described in JVET-D1001. This test result complies with general test conditions formulated by the JVET standard meeting organization.
It can be seen that, for all images with different resolution (different classes in the table), a bit rate is reduced for same video quality. Generally, gains of 0.29%, 0.16%, and 0.25% can be obtained for Y, U, and V components on average.
This embodiment of the present application provides the chrominance prediction apparatus 40000, where a to-be-processed chrominance unit corresponds to one luminance processing unit, the luminance processing unit and the to-be-processed chrominance unit are respectively processing units of a luminance component and a chrominance component of a same image area, the luminance processing unit corresponds to one or more reconstructed luminance units, and the apparatus includes a first determining module 40001, configured to determine one or more target luminance prediction modes of the luminance processing unit from preset candidate luminance prediction modes, where a difference between a predicted luminance value, of the luminance processing unit, corresponding to any target luminance prediction mode and a reconstructed luminance value of the luminance processing unit is less than or equal to a difference between a predicted luminance value, of the luminance processing unit, corresponding to each candidate luminance prediction mode that is not determined as the target luminance prediction mode and the reconstructed luminance value of the luminance processing unit, and a first construction module 40002, configured to obtain a predicted chrominance value of the to-be-processed chrominance unit, where a candidate chrominance prediction mode set of the to-be-processed chrominance unit includes the target luminance prediction mode.
In a feasible implementation, the first determining module 40001 is configured to iteratively determine an initial prediction mode from the preset candidate luminance prediction modes until the initial prediction mode meets a preset condition, where the initial prediction mode that meets the preset condition corresponds to one or more candidate predicted luminance values.
In a feasible implementation, the first determining module 40001 includes a second determining module 41001, configured to determine a candidate luminance prediction mode subset from the preset candidate luminance prediction modes, a first selection module 41002, configured to select an initial prediction mode from the candidate luminance prediction mode subset, and an update module 41003, configured to when the initial prediction mode does not meet a preset condition, update the preset candidate luminance prediction modes based on the initial prediction mode, redetermine the candidate luminance prediction mode subset from the updated preset candidate luminance prediction modes, and reselect the initial prediction mode from the redetermined candidate luminance prediction mode subset, until the reselected initial prediction mode meets the preset condition, or when the initial prediction mode meets a preset condition, use the initial prediction mode as the target luminance prediction mode.
In a feasible implementation, the preset candidate luminance prediction modes include at least one of a prediction mode included in directional prediction modes and a prediction mode included in non-directional prediction modes, the directional prediction modes include a prediction mode that is at an angle of N degrees with a horizontal direction of a two-dimensional plane, N is a non-negative number less than 360, the non-directional prediction modes include a direct current prediction mode and a planar prediction mode, and the second determining module 41001 is configured to determine that the candidate luminance prediction mode subset is the same as the preset candidate luminance prediction modes, or determine that the candidate luminance prediction mode subset includes prediction modes selected from the directional prediction modes at a preset angle interval, or determine that the candidate luminance prediction mode subset includes prediction modes selected from the directional prediction modes at a preset angle interval and the non-directional prediction mode.
In a feasible implementation, when the candidate luminance prediction mode subset includes at least two candidate luminance prediction modes, the first selection module 41002 includes a first calculation module 41201, configured to calculate a difference between the reconstructed luminance value and each of candidate predicted luminance values corresponding to the candidate luminance prediction modes in the candidate luminance prediction mode subset, and a comparison module 41202, configured to compare a plurality of differences, and determine that the initial prediction mode includes candidate luminance prediction modes corresponding to one or more smallest differences, where for the candidate luminance prediction modes corresponding to the one or more smallest differences, a difference between a predicted luminance value, of the luminance processing unit, corresponding to any initial prediction mode and the reconstructed luminance value of the luminance processing unit is less than or equal to a difference between a predicted luminance value, of the luminance processing unit, corresponding to each candidate luminance prediction mode that is in the candidate luminance prediction mode subset and that is not determined as the initial prediction mode and the reconstructed luminance value of the luminance processing unit.
When the candidate luminance prediction mode subset includes only one candidate luminance prediction mode, the first selection module 41002 is configured to determine that the candidate luminance prediction mode is the initial prediction mode.
In a feasible implementation, the first calculation module 41201 includes a second calculation module 41211, configured to separately calculate a difference between an element at a corresponding location in a candidate predicted luminance value matrix and an element at a corresponding location in a reconstructed luminance value matrix, to obtain a difference matrix, and a third determining module 41212, configured to determine the difference based on the difference matrix.
In a feasible implementation, the third determining module 41212 is configured to accumulate absolute values of all elements in the difference matrix as the difference, or transform the difference matrix to obtain a transformed difference matrix, and accumulate absolute values of all elements in the transformed difference matrix as the difference, or sequentially transform, quantize, dequantize, and inversely transform the difference matrix to obtain a reconstructed difference matrix, and accumulate absolute values of all elements in the reconstructed difference matrix as the difference.
In a feasible implementation, the transform includes Hadamard transform, Haar transform, discrete cosine transform, or discrete sine transform, and correspondingly, the inverse transform includes inverse Hadamard transform corresponding to the Hadamard transform, inverse Haar transform corresponding to the Haar transform, inverse discrete cosine transform corresponding to the discrete cosine transform, or inverse discrete sine transform corresponding to the discrete sine transform.
In a feasible implementation, the update module 41003 is configured to determine that the updated candidate luminance prediction mode subset includes the initial prediction mode and prediction modes that have a preset angle difference from the initial prediction mode.
In a feasible implementation, the prediction modes that have the preset angle difference from the initial prediction mode include M prediction modes that are adjacent to the initial prediction mode, where M is a positive number.
In some feasible implementations, the preset condition includes the initial prediction mode is the non-directional prediction mode, or each prediction mode that has the preset angle difference from the initial prediction mode exists in any candidate luminance prediction mode subset in the iteration, that is, exists in the candidate luminance prediction mode subset that is determined from the preset candidate luminance prediction modes or that is redetermined from the updated preset candidate luminance prediction modes, or a quantity of reselection times, that is, a quantity of iteration times, of the initial prediction mode reaches a preset quantity of times, or a difference corresponding to the initial prediction mode is less than a preset threshold.
In a feasible implementation, the first construction module 40002 includes a fourth determining module 42001, configured to determine a candidate chrominance prediction mode set of the to-be-processed chrominance unit based on the target luminance prediction mode, a second selection module 42002, configured to select a chrominance prediction mode of the to-be-processed chrominance unit from the candidate chrominance prediction mode set, and a fifth determining module 42003, configured to determine the predicted chrominance value of the to-be-processed chrominance unit based on the chrominance prediction mode.
In a feasible implementation, the fourth determining module 42001 is configured to determine that the candidate chrominance prediction mode set includes only the target luminance prediction mode.
In a feasible implementation, the fourth determining module 42001 is configured to determine that the candidate chrominance prediction mode set includes the target luminance prediction mode and one or more preset candidate chrominance prediction modes.
In a feasible implementation, the preset candidate chrominance prediction modes include at least one of a horizontal prediction mode, a vertical prediction mode, a direct current prediction mode, a planar prediction mode, and a direct prediction mode.
In a feasible implementation, the preset candidate chrominance prediction modes further include a directional prediction mode in a non-horizontal direction or a non-vertical direction, or a linear prediction mode.
In a feasible implementation, the first construction module 40002 further includes a sixth determining module 42004, configured to determine a codeword of a candidate chrominance prediction mode in the candidate chrominance prediction mode set.
In a feasible implementation, the sixth determining module 42004 is configured to determine the codeword of the candidate chrominance prediction mode in the candidate chrominance prediction mode set using a variable-length code.
In a feasible implementation, a prediction mode corresponding to a candidate predicted luminance value that is of the luminance processing unit and that has a smallest difference from the reconstructed luminance value of the luminance processing unit is used as a first target luminance prediction mode, and the sixth determining module 42004 is configured to when the candidate chrominance prediction mode set includes the first target luminance prediction mode, the linear prediction mode, and the direct prediction mode, assign a smallest codeword to the linear prediction mode, assign, to the first target luminance prediction mode, a smallest codeword other than the codeword used to represent the linear prediction mode, and assign, to the direct prediction mode, a smallest codeword other than the codewords used to represent the linear prediction mode and the first target luminance prediction mode, or when the candidate chrominance prediction mode set includes the first target luminance prediction mode, the linear prediction mode, and the direct prediction mode, assign a smallest codeword to the first target luminance prediction mode, assign, to the linear prediction mode, a smallest codeword other than the codeword used to represent the first target luminance prediction mode, and assign, to the direct prediction mode, a smallest codeword other than the codewords used to represent the linear prediction mode and the first target luminance prediction mode, or when the candidate chrominance prediction mode set includes the first target luminance prediction mode and the direct prediction mode, assign a smallest codeword to the first target luminance prediction mode, and assign, to the direct prediction mode, a smallest codeword other than the codeword used to represent the first target luminance prediction mode.
In a feasible implementation, the sixth determining module 42004 further includes a seventh determining module 42401, configured to determine a length of the variable-length code based on a quantity of the candidate chrominance prediction modes, and an operation module 42402, configured to when the quantity of the candidate chrominance prediction modes changes, increase or decrease the length of the variable-length code by one or more bits.
In a feasible implementation, the apparatus further includes an eighth determining module 40003, configured to determine that the preset candidate luminance prediction modes include a candidate luminance prediction mode set, where the candidate luminance prediction mode set includes the directional prediction mode and the non-directional prediction mode, or determine that the preset candidate luminance prediction modes include luminance prediction modes of the one or more reconstructed luminance units corresponding to the luminance processing unit, or determine that the preset candidate luminance prediction modes include chrominance prediction modes of reconstructed chrominance units in a neighborhood of the to-be-processed chrominance unit.
In a feasible implementation, the eighth determining module 40003 is configured to determine that the preset candidate luminance prediction modes include luminance prediction modes that are correlated to the luminance prediction modes of the one or more reconstructed luminance units.
In a feasible implementation, for the luminance prediction modes that are correlated to the luminance prediction modes of the one or more reconstructed luminance units when the luminance prediction mode is the directional prediction mode, the correlated luminance prediction modes include P prediction modes adjacent to the luminance prediction modes of the one or more reconstructed luminance units, where P is a positive number, or when the luminance prediction mode is the directional prediction mode, the correlated luminance prediction modes include Q prediction modes adjacent to the luminance prediction modes of the one or more reconstructed luminance units and include the non-directional prediction mode, where Q is a positive number, or when the luminance prediction mode is the non-directional prediction mode, the correlated luminance prediction modes include the preset directional prediction mode.
In a feasible implementation, the chrominance prediction modes of the reconstructed chrominance units in the neighborhood of the to-be-processed chrominance unit include chrominance prediction modes of reconstructed chrominance units that are adjacent to the top, the left, the upper left, the upper right, and the lower left of the to-be-processed chrominance unit.
In a feasible implementation, the apparatus further includes a downsampling module 40004, configured to perform downsampling on the luminance processing unit, and correspondingly, the first determining module 40001 is configured to determine one or more target luminance prediction modes of the downsampled luminance processing unit from the preset candidate luminance prediction modes.
In a feasible implementation, the downsampling module 40004 is configured to perform downsampling on the luminance processing unit using a filter whose filtering coefficient is
or perform downsampling on the luminance processing unit using a filter whose filtering coefficient is
or perform downsampling on the luminance processing unit using a filter whose filtering coefficient is
or perform downsampling on the luminance processing unit using a filter whose filtering coefficient is
In a feasible implementation, the apparatus is configured to encode the to-be-processed chrominance unit, and the second selection module 42002 includes a ninth determining module 42201, configured to traverse candidate chrominance prediction modes in the candidate chrominance prediction mode set to obtain corresponding candidate predicted chrominance values, a third calculation module 42202, configured to calculate encoding costs of each candidate chrominance prediction mode based on an original value of the to-be-processed chrominance unit and the candidate predicted chrominance values obtained through the traversing, a tenth determining module 42203, configured to determine a candidate chrominance prediction mode with smallest encoding costs as the chrominance prediction mode of the to-be-processed chrominance unit, and an encoding module 42204, configured to encode an index of the chrominance prediction mode in the candidate chrominance prediction mode set.
In a feasible implementation, the encoding module 42204 is configured to encode the index based on the determined codeword of the candidate chrominance prediction mode in the candidate chrominance prediction mode set.
In a feasible implementation, the apparatus is configured to decode the to-be-processed chrominance unit, and the second selection module 42002 includes: a decoding module 42205, configured to decode an index of the chrominance prediction mode in the candidate chrominance prediction mode set from a bitstream, and an eleventh determining module 42206, configured to determine the chrominance prediction mode from the candidate chrominance prediction mode set based on the index.
In a feasible implementation, the decoding module 42205 is configured to decode the index based on the determined codeword of the candidate chrominance prediction mode in the candidate chrominance prediction mode set.
According to the chrominance prediction apparatus provided in this embodiment of the present application, based on a reconstructed luminance unit, an improved candidate chrominance prediction mode set is constructed, and a chrominance prediction mode is selected by performing encoding using an appropriate mode, thereby improving encoding efficiency.
This embodiment of the present application provides the chrominance prediction apparatus 500, where the apparatus includes a memory 501 and a processor 502 coupled to the memory. The memory is configured to store code and an instruction. The processor is configured to perform the following operations based on the code and the instruction determining one or more target luminance prediction modes of a luminance processing unit from preset candidate luminance prediction modes, where a difference between a predicted luminance value, of the luminance processing unit, corresponding to any target luminance prediction mode and a reconstructed luminance value of the luminance processing unit is less than or equal to a difference between a predicted luminance value, of the luminance processing unit, corresponding to each candidate luminance prediction mode that is not determined as the target luminance prediction mode and the reconstructed luminance value of the luminance processing unit, and obtaining a predicted chrominance value of a to-be-processed chrominance unit, where a candidate chrominance prediction mode set of the to-be-processed chrominance unit includes the target luminance prediction mode, where the to-be-processed chrominance unit corresponds to one luminance processing unit, the luminance processing unit and the to-be-processed chrominance unit are respectively processing units of a luminance component and a chrominance component of a same image area, and the luminance processing unit corresponds to one or more reconstructed luminance units. The processor 502 may further specifically execute various feasible implementations of the chrominance prediction method 1000, and details are not described again.
According to the chrominance prediction apparatus provided in this embodiment of the present application, based on a reconstructed luminance unit, an improved candidate chrominance prediction mode set is constructed, and a chrominance prediction mode is selected by performing encoding using an appropriate mode, thereby improving encoding efficiency.
It should be noted that, the apparatus division is merely logical function division, but the present application is not limited to the foregoing division, provided that corresponding functions can be implemented. In addition, specific names of the functional units are merely provided for the purpose of distinguishing the units from one another, but are not intended to limit the protection scope of the present application.
In the several embodiments provided in this application, it should be understood that the disclosed system, apparatus, and method may be implemented in other manners. For example, the described apparatus embodiment is merely an example. For example, the module or unit division is merely logical function division and may be other division in actual implementation. For example, a plurality of units or components may be combined or may be integrated into another system, or some features may be ignored or not performed. In addition, the displayed or discussed mutual couplings or direct couplings or communication connections may be implemented through some interfaces, or indirect couplings or communication connections between the apparatuses or units.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one position, or may be distributed on a plurality of network units. Some or all of the units may be selected based on actual requirements to achieve the objectives of the solutions of the embodiments.
In addition, functional units in the embodiments of the present application may be integrated into one processing unit, or each of the units may exist alone physically, or two or more units may be integrated into one unit. The integrated unit may be implemented in a form of a software function unit.
When the integrated unit is implemented in the form of a software function unit and sold or used as an independent product, the integrated unit may be stored in a computer-readable storage medium. Based on such an understanding, all or some of the technical solutions may be implemented in a form of a software product. The computer software product is stored in a storage medium, and includes several instructions for instructing a computer device (which may be a personal computer, a server, a network device, or the like) to perform all or some of the steps of the methods described in the embodiments of the present application. The storage medium is a non-transitory medium, and includes any medium that can store program code, such as a flash memory, a removable hard disk, a read-only memory, a random access memory, a magnetic disk, or an optical disc.
The foregoing descriptions are merely specific implementations of the present application, but are not intended to limit the protection scope of the present application. Any variation or replacement readily figured out by a person skilled in the art within the technical scope disclosed in the present application shall fall within the protection scope of the present application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.
This application is a continuation of International Application No. PCT/CN2016/112628, filed on Dec. 28, 2016, the disclosure of which is hereby incorporated by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2016/112628 | Dec 2016 | US |
Child | 16448358 | US |