The present invention relates to video coding. In particular, the present invention relates to coding techniques associated with the intra luma prediction mode.
Motion compensated inter-frame coding has been widely adopted in various coding standards, such as MPEG-1/2/4 and H.261/H.263/H.264/AVC. While motion-compensated inter-frame coding can effectively reduce bitrate for compressed video, intra mode coding is still required to process an initial picture or to periodically insert I-pictures (or Intra-coded picture, I-picture), and the I-pictures are inserted periodically to allow quick access to the compressed video data or to alleviate error propagation. Intra prediction exploits the spatial correlation within a picture or within a picture region. In order to further improve coding efficiency, the High-Efficiency Video Coding (HEVC) standard being developed exploits block-based spatial prediction extensively. In HEVC, multiple intra prediction modes are used to exploit spatial features and the number of intra prediction modes depends on the block size of a Prediction Unit (PU). For some PU block sizes (currently larger than 4×4), the number of intra luma prediction modes may be as many as 34. The intra luma prediction mode selected for each block has to be transmitted to the decoder side for proper decoding. The side information associated with the intra luma prediction mode may be substantial and therefore, it is desirable to develop schemes to improve the efficiency for coding intra luma prediction mode.
The intra luma prediction modes among neighboring blocks may be highly correlated. Therefore, the intra luma prediction modes from neighboring blocks that have been reconstructed may be used as a predictor for a current block. In HEVC, a coding scheme for the intra luma prediction mode is being considered, where the intra prediction mode of a current block is compared with the minimum of two intra luma prediction modes corresponding to two neighboring blocks. If the intra prediction mode of the current block is the same as the minimum of the two neighboring intra luma prediction modes, a single bit is transmitted to indicate the case. Otherwise, a single bit is transmitted along with information for the current intra prediction mode represented in fixed-length format. While the coding scheme for the intra luma prediction mode being considered may not fully exploit the correlations of the neighboring intra luma prediction modes, it is desirable to further improve the performance. Accordingly, a coding scheme that further exploits the dependency of the neighboring intra luma prediction mode is disclosed.
A method of video coding using intra prediction for a block of video data is disclosed. In one embodiment according to the present invention, the intra prediction modes are ranked according to a priority order associated with the block size. One or more tables are used, where the tables ranks the intra prediction modes according to a first priority order for the block having a first block size and ranks the intra prediction modes according to a second priority order for the block having a second block size. Two or more neighboring intra prediction modes corresponding to two or more neighboring blocks are received, where each neighboring block has a neighboring block size corresponding to the first block size or the second block size. A highest-priority mode among said two or more neighboring intra prediction modes is selected as the most probable mode. The current mode is then encoded or decoded using the most probable mode as a predictor. The block may correspond to prediction unit (PU).
The neighboring blocks may correspond to an above block located above the current block and a left block located at a left side of the current block. The first block may correspond to a 4×4 or 8×8 block. In this case, Vertical intra prediction mode with mode index 0 has a highest priority in said one or more tables. The second block may correspond to a 16×16, 32×32 or 64×64 block. In this case, DC intra prediction mode with mode index 2 has a highest priority in said one or more tables. Planner intra prediction mode may have different priorities for different block sizes in the group consisting of a 16×16, 32×32 and 64×64 blocks.
Motion compensated inter-frame coding has been widely adopted in various coding standards, such as MPEG-1/2/4 and H.261/H.263/H.264/AVC. While motion-compensated inter-frame coding can effectively reduce bitrate for compressed video, intra coding is still required to process an intra picture (or I-picture), or intra blocks (or I-blocks) for quick access to the compressed video data or to alleviate error propagation. Intra prediction exploits the spatial correlation within a picture or within a picture region. In order to further improve coding efficiency, the High-Efficiency Video Coding (HEVC) standard being developed exploits block-based spatial prediction extensively. In HEVC, more intra prediction modes are used to exploit spatial features and the number of intra prediction modes depends on the block size of a Prediction Unit (PU). For example, HEVC being developed allows intra prediction of luma block (i.e., PU) at sizes of 64×64, 32×32, 16×16, 8×8, and 4×4. For each block size, multiple intra prediction modes are used as shown in Table 1. For block sizes 32×32, 16×16, and 8×8, there are 34 modes including 33 directional prediction modes and a DC mode as shown in
The use of multiple intra prediction modes can improve quality of prediction by resulting in smaller intra prediction residues. However, the intra luma prediction mode selected for each block has to be transmitted to the decoder side for proper decoding. The side information associated with the intra luma prediction mode may be substantial. In HEVC, a coding scheme for the intra luma prediction mode is being considered, where the intra prediction mode of a current block is compared with the minimum of two neighboring intra luma prediction modes. The configuration of the two neighboring blocks is showing in
In the above predictive coding for intra luma prediction mode, the intra mode predictor PredMode is derived to provide a single candidate, i.e., Min(ModeA, ModeB). When the current intra prediction mode ModeC is equal to PredMode, only 1 bit, i.e., MostProbableModeFlag=1 is required to indicate this case. When the current intra prediction mode ModeC is not equal to PredMode, MostProbableModeFlag=0 and RemPredMode have to be transmitted, which will require more bits. Consequently, the cost in terms of side information is much higher for the case of predictor miss than that for the case of predictor hit. Therefore, an embodiment according to the present invention extends the intra mode predictor PredMode to include more than one candidate in order to improve the probability of ModeC being equal to PredMode, i.e., to improve the chance for predictor hit. Instead of providing a single intra prediction mode candidate as described in the conventional HEVC scheme, the present invention provides multiple intra prediction mode candidates based on neighboring intra prediction modes so that there is a better chance for predictor hit. When one of the neighboring mode candidates is not available, the mode candidate can be set to DC or one of other modes to improve the probability of providing a matched predictor. When one of the neighboring modes is not a candidate intra prediction mode of ModeC (e.g., a neighboring PU and the current PU have different PU sizes and different numbers of luma modes, as illustrated in Table 1), it is first mapped to a candidate intra prediction mode of ModeC. After these processes, if ModeA is equal to ModeB, the conventional scheme is used; otherwise, multiple intra prediction mode candidates based on neighboring intra prediction modes can be introduced to the intra mode coding of the current mode. An example to derive two intra prediction mode candidates based on two neighboring intra prediction modes is illustrated in the exemplary flow chart 400 as shown in
The flow chart of
In current HEVC, only one coding table is used for representing information associated with RemPredMode. An embodiment according to the present invention may use a single coding table or multiple coding tables to represent information associated with RemPredMode or IntraPredMode. In the case of multiple coding tables, the table selection can be based on the modes of neighboring blocks to avoid the need of side information. For example, in case of two coding tables used, the first table may be chosen if the neighboring modes are the same and the second table is chosen otherwise, as shown in examples of Table 5 and Table 6. Table 5 lists the codewords (as well as “depth”, namely the length of each codeword) with 17 candidate intra modes for a specific PU size (e.g., 4×4), while Table 6 lists the codewords (as well as “depth”, namely the length of each codeword) with 34 candidate intra modes for a specific PU size (e.g., 8×8, 16×16 and 32×32). In both Table 5 and Table 6, CodingTableOne is used when two neighboring modes are the same while CodingTableTwo is used when two neighboring modes are different. If ModeC is equal to either ModeA or ModeB, MPM_CODEWORD is transmitted followed by MostProbableModeIndex; otherwise, the codeword is selected according to the RemPredMode. Furthermore, the coding tables may also be selected according to different block (i.e., PU) sizes.
The plane or planar intra prediction mode has been used in H.264/AVC. However, this mode is not used in HEVC being considered. In order to improve the predictor performance, the plane or planar mode can also be included as a mode candidate in the prediction mode set. In the conventional HEVC system, a subset of intra luma prediction modes for block sizes 64×64 and 4×4 are derived from the 34 intra luma prediction modes for other block sizes. For example, currently there are 3 intra luma prediction modes corresponding to Vertical mode, Horizontal mode and DC mode are used for block size 64×64. Nevertheless, 3 different modes may be selected for 64×64 blocks and 17 different modes may be selected for 4×4 blocks.
The method of intra luma prediction mode coding for different block size can be different. One example of using a different coding method is to reorder the priority of possible modes. In a conventional scheme, the priority of prediction modes is always arranged according to the mode index. A smaller mode index always receives higher priority. The minimum of two neighboring luma prediction modes ModeA and ModeB is used as predMode. However, an embodiment according to the present invention allows re-ordering of the mode indices to improve performance. An example of mode index re-ordering is shown in Table 7, where 0 represents the highest priority. As shown in Table 7, for block sizes 16×16, 32×32 and 64×64, intra prediction mode 2 (i.e., DC mode) is assigned the highest priority while for block sizes 4×4 and 8×8, intra prediction mode 0 (i.e., Vertical mode) is assigned the highest priority. Also, intra prediction mode 34 (i.e., Plane or Planar mode) are assigned different priority in different block sizes as shown in Table 7. When the prediction mode for a PU with block size N×N is coded, predMode can be chosen to be ModeA or ModeB depending on which one possesses a higher priority. PredMode can be derived based on more neighboring blocks beyond the two neighboring blocks block A and block B.
The method of improved multiple intra prediction mode coding along with the syntax disclosed herein enables a decoder embodying the present invention to properly recover the current intra prediction mode from received bitstream corresponding to the compressed video. Since the derivation of mode predictor is based on neighboring blocks that have been processed and reconstructed, the decoder can derive the mode predictor accordingly. When one of the neighboring mode candidates is not available, the mode candidate can be set to DC or one of other modes to improve the probability of providing a matched predictor. When one of the neighboring modes is not a candidate intra prediction mode of ModeC (e.g., a neighboring PU and the current PU have different PU sizes and/or different numbers of luma modes, as illustrated in Table 1), it is first mapped to a candidate intra prediction mode of ModeC. After these processes, if ModeA is equal to ModeB, the conventional scheme is used; otherwise, the current intra mode is derived from multiple prediction mode candidates based on neighboring blocks. The bitstream contains information to indicate whether the current intra prediction mode is equal to any of the prediction mode candidates. If the information indicates that the current intra prediction mode is equal to one of the prediction mode candidates, the bitstream contains further information to indicate which of the prediction mode candidates is equal to the current prediction mode. In case that the information indicates that the current intra prediction mode is not equal to any of the prediction mode candidates, the bitstream will contain coded presentation for the current intra prediction mode for recovering the current intra prediction mode based on a coding table. Accordingly, the current intra prediction mode can be recovered at the decoder side properly.
An exemplary flow chart 600 for decoding process is illustrated in
Embodiment of video systems incorporating encoding or decoding of intra luma prediction mode according to the present invention as described above may be implemented in various hardware, software codes, or a combination of both. For example, an embodiment of the present invention can be a circuit integrated into a video compression chip or program codes integrated into video compression software to perform the processing described herein. An embodiment of the present invention may also be program codes to be executed on a Digital Signal Processor (DSP) to perform the processing described herein. The invention may also involve a number of functions to be performed by a computer processor, a digital signal processor, a microprocessor, or field programmable gate array (FPGA). These processors can be configured to perform particular tasks according to the invention, by executing machine-readable software code or firmware code that defines the particular methods embodied by the invention. The software code or firmware codes may be developed in different programming languages and different format or style. The software code may also be compiled for different target platform. However, different code formats, styles and languages of software codes and other means of configuring code to perform the tasks in accordance with the invention will not depart from the spirit and scope of the invention.
The invention may be embodied in other specific forms without departing from its spirit or essential characteristics. The described examples are to be considered in all respects only as illustrative and not restrictive. The scope of the invention is, therefore, indicated by the appended claims rather than by the foregoing description. All changes which come within the meaning and range of equivalency of the claims are to be embraced within their scope.
The present application is a divisional of U.S. Non-provisional patent application Ser. No. 13/198,697, filed on Aug. 4, 2011, which claims priority to U.S. Provisional Patent Application Ser. No. 61/430,701, filed on Jan. 7, 2011, entitled “Improved Intra Prediction Mode Coding Method”, U.S. Provisional Patent Application Ser. No. 61/437,910, filed on Jan. 31, 2011, entitled “Improved Intra Prediction Mode Coding Method”, and PCT Patent Application, Serial No. PCT/CN2011/076870, filed on Jul. 5, 2011, entitled “Method and Apparatus of Improved Intra Luma Prediction Mode Coding”. The U.S. Non-provisional Patent Application and U.S. Provisional Patent Applications and the PCT Patent Application are hereby incorporated by reference in their entireties.
Number | Name | Date | Kind |
---|---|---|---|
8526495 | Liu et al. | Sep 2013 | B2 |
9154796 | Seregin | Oct 2015 | B2 |
20030206594 | Zhou | Nov 2003 | A1 |
20040028282 | Kato | Feb 2004 | A1 |
20040234144 | Sugimoto | Nov 2004 | A1 |
20060104354 | Han et al. | May 2006 | A1 |
20080013629 | Karczewicz | Jan 2008 | A1 |
20080123947 | Moriya et al. | May 2008 | A1 |
20080175319 | Sun et al. | Jul 2008 | A1 |
20080267287 | Hannuksela | Oct 2008 | A1 |
20090034632 | Chono | Feb 2009 | A1 |
20090175334 | Ye et al. | Jul 2009 | A1 |
20110038414 | Song | Feb 2011 | A1 |
20110047155 | Sohn | Feb 2011 | A1 |
20110243227 | Yoneji et al. | Oct 2011 | A1 |
20110286520 | Xu | Nov 2011 | A1 |
20110292994 | Lim | Dec 2011 | A1 |
20120082222 | Wang | Apr 2012 | A1 |
20120082223 | Karczewicz et al. | Apr 2012 | A1 |
20120177118 | Karczewicz et al. | Jul 2012 | A1 |
20130034157 | Helle | Feb 2013 | A1 |
20150163512 | Cai | Jun 2015 | A1 |
Number | Date | Country |
---|---|---|
101500161 | Aug 2009 | CN |
101668202 | Mar 2010 | CN |
20060045314 | May 2006 | KR |
WO 2009128653 | Oct 2009 | WO |
WO 2010067529 | Jun 2010 | WO |
Entry |
---|
Lee, J., et al.; “Intra-mixture Prediction Mode and Enhanced Most Probable Mode Estimation for Intra Coding in H.264AVC;” 2009 Fifth International Joint Conference on INC, IMS, and IDC; 2009; pp. 1619-1622. |
Ur-Rehman, J., et al.; “Fast Intra Prediction Mode Decision by Adaptively Selecting Fewer Number of Modes;” Proceedings of the Sixth International Conference on Machine Learning and Cybernetics; Aug. 2007; pp. 2385-2389. |
“The H.264 Advanced Video Compression Standard;” 2010; pp. 137-177. |
Wiegand, T., et al.; “WD1 Working Draft 1 of High-Efficiency Coding;” Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 WP3 and ISO/IEC JTC1/SC29/WG11; Oct. 2010; pp. 1-137. |
Guo, M., et al.; “Improved Intra Mode Coding;” Joint Collaborative Team on Video Coding (JCT-VC) of ITU-T SG16 VP3 and ISO/IEC JTC1/SC29/WG11; Jan. 2011; pp. 1-7. |
Number | Date | Country | |
---|---|---|---|
20150131722 A1 | May 2015 | US |
Number | Date | Country | |
---|---|---|---|
61430701 | Jan 2011 | US | |
61437910 | Jan 2011 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 13198697 | Aug 2011 | US |
Child | 14594145 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2011/076870 | Jul 2011 | US |
Child | 13198697 | US |