The present invention relates to an image processing apparatus for coding an image in accordance with a predetermined coding system and a transcoder and a translator both for converting a coding system of images.
An encoder or a transcoder codes images in accordance with various coding systems. The coding systems include MPEG (Moving Picture Experts Group) 2, H.264, and other various systems. In the coding according to these image coding systems, since images are coded by blocks such as macroblocks, there arises variation in the image quality at a block boundary in a coded image. Such a noise is termed a “block noise”.
The block noise is marked especially in a flat portion and a gradation portion of an image. Therefore, in environments where high image quality is required, the noises at block boundaries are reduced by performing a deblocking process.
An encoder disclosed in Patent Document 1 comprises a deblocking filter to reduce the block noises. The encoder of Patent Document 1 performs a deblocking process on images decoded by a local decoder.
In conventional encoders including the encoder of Patent Document 1, a strength evaluation circuit which determines the filter strength of the deblocking filter is incorporated in a coding circuit block. For this reason, acquisition of the image feature value and evaluation of the filter strength are performed simultaneously with coding, and the process therefore becomes complicated and may cause a delay. Further, since the filter strength instantaneously changes in accordance with the image feature value, when a great change of the image feature value is caused by, for example, a scene change or the like, there arises variation in filter strength among pictures and this disadvantageously causes unnatural instability in moving images.
A coding apparatus disclosed in Patent Document 2 uses an average value of filter strengths calculated on a plurality of pictures.
The present invention is intended for an image processing apparatus. According to an aspect of the present invention, the image processing apparatus comprises a strength evaluation circuit for calculating a filter strength from an image feature value of an input image, and a coding circuit for coding the input image and outputting a coded stream, and in the image processing apparatus of the present invention, the coding circuit includes a filter part for performing a deblocking filtering process on an image generated in a coding process on the basis of the filter strength, and a process of calculating the filter strength performed by the strength evaluation circuit and the coding process performed by the coding circuit are performed concurrently.
By using the image processing apparatus of the present invention, it is possible to increase the processing speed of the coding process.
According to a preferred embodiment of the present invention, the filter part includes a filter strength correction part for correcting the filter strength to be applied to a current picture by using an arithmetic expression with the filter strength calculated for the current picture and the filter strength calculated for a past picture used as an input parameter.
It is thereby possible to prevent variation in the filter strength following an abrupt change of the image feature value.
The present invention is also intended for an image conversion apparatus. According to another aspect of the present invention, the image conversion apparatus comprises a decoding circuit for decoding an inputted first coded stream and outputting a decoded image, a strength evaluation circuit for calculating a filter strength from an image feature value of the decoded image, and a coding circuit for coding the decoded image and outputting a second coded stream, and in the image conversion apparatus of the present invention, the coding circuit includes a filter part for performing a deblocking filtering process on an image generated in a coding process on the basis of the filter strength, and a process of calculating the filter strength performed by the strength evaluation circuit and the coding process performed by the coding circuit are performed concurrently.
Therefore, it is an object of the present invention to provide a technique for eliminating the unnaturalness in a generated moving image while achieving high speed processing in an image processing apparatus comprising a deblocking filter.
These and other objects, features, aspects and advantages of the present invention will become more apparent from the following detailed description of the present invention when taken in conjunction with the accompanying drawings.
Hereinafter, with reference to Figures, the preferred embodiment of the present invention will be discussed.
The MPEG2 decoder 2 receives an MPEG2 stream 101 coded in accordance with an MPEG2 coding system and decodes the MPEG2 stream 101. Then, the MPEG2 decoder 2 stores a decoded image 102 into the DRAM 5.
The MPEG2 decoder 2 further acquires the image feature value of the decoded image 102 from the MPEG2 stream 101 and stores an image feature value parameter 103 into the DRAM 5.
The image feature value parameter 103 stored into the DRAM 5 by the MPEG2 decoder 2 includes a picture activity value act_pic and a picture motion evaluation value sad_pic.
<Image Feature Value Parameter>
First, discussion will be made on a method of calculating the picture activity value act_pic. As expressed by Eq. (1), the MPEG2 decoder 2 first calculates a decoded image “recon”.
In Eq. (1), for an intra macroblock (intra MB), the decoded image “recon” represents a pixel value of the decoded image. For an inter macroblock (inter MB), the decoded image “recon” is obtained by adding a prediction error value “diff” to a pixel value “pred” of a prediction pixel block. The pixel value “pred” of the prediction pixel block is obtained by applying motion compensation to a pixel value of a reference pixel block.
Next, from Eq. (2), an average value of the decoded images “recon” is calculated for each block “blk”, to thereby obtain an average decoded image value AVE_recon. The block “blk” consists of 8×8 pixels, and a macroblock consists of four blocks “blk0” to “blk3” as shown in
Further, as expressed by Eq. (3), a sum of absolute differences (SAD) of the decoded images “recon” and the average decoded image value AVE_recon is calculated for each block “blk”. Then, the sums of absolute differences calculated for the blocks “blk0” to “blk3” are added together, to thereby obtain a macroblock activity value act_mb.
Then, as expressed by Eq. (4), the macroblock activity values act_mb for all the macroblocks in a frame are added together, to thereby obtain the picture activity value act_pic.
Subsequently, discussion will be made on a method of calculating the picture motion evaluation value sad_pic. As expressed by Eq. (5), the MPEG2 decoder 2 calculates a sum of absolute values of the prediction error values “diff” for each block “blk” in the inter macroblock (inter MB). Further, the sums of absolute values calculated for the blocks “blk0” to “blk3” are added together, to thereby obtain a macroblock motion evaluation value sad_mb.
Then, as expressed by Eq. (6), the macroblock motion evaluation value sad_mb for all the macroblocks in the frame are added together, to thereby obtain the picture motion evaluation value sad_pic.
<Calculation of Filter Strength>
Referring again to
Hereinafter, discussion will be made on a method of calculating the filter strength parameter 104 performed by the strength evaluation circuit 3. The strength evaluation circuit 3 holds various parameter values such as a threshold value, as expressed by Eq. (7), in a not-shown register and uses theses parameters to perform the following process.
The strength evaluation circuit 3 performs a process shown in
When the picture type is I picture, the strength evaluation circuit 3 compares the picture activity value act_pic with an activity threshold value act_pic_thr to determine which is larger (Step S2). Herein, as expressed by Eq. (7), act_pic_thr=1800.
When act_pic<act_pic_thr (“YES” in Step S2), a maximum evaluation value dbf_value_max is set as a deblocking filter evaluation value dbf_value (Step S3). In this case, as expressed by Eq. (7), dbf_value_max=4. The deblocking filter evaluation value dbf_value takes an integer value ranging from −7 to 6. As the deblocking filter evaluation value dbf_value becomes larger, the filter strength of the deblocking filter is set greater.
When act_pic≧act_pic_thr (“NO” in Step S2), a default evaluation value dbf_value_def is set as the deblocking filter evaluation value dbf_value (Step S4). In this case, as expressed by Eq. (7), dbf_value_def=0.
Thus, when the picture activity value act_pic is smaller than the activity threshold value act_pic_thr, it is assumed that the picture to be coded is an image required to represent delicate changes, such as a flat image or a gradation image. Then, the maximum evaluation value dbf_value_max is assigned as the deblocking filter evaluation value dbf_value to improve the image quality. On the other hand, when the picture activity value act_pic is not smaller than the activity threshold value act_pic_thr, since it is judged that the image is complicate to some degree, the default evaluation value dbf_value_def is assigned as the deblocking filter evaluation value dbf_value. Though the maximum evaluation value dbf_value_max is set to be 4 herein, this value may be changed as appropriate. Further, the default evaluation value dbf_value_def may take a value other than 0.
In Step S1, when it is judged that the picture type is P picture or B picture, the process goes to the flowchart of
When sad_pic<sad_pic_thr (“YES” in Step S5), a minimum evaluation value dbf_value_min is set as the deblocking filter evaluation value dbf_value (Step S6). In this case, as expressed by Eq. (7), dbf_value_min=−7.
Thus, when the picture motion evaluation value sad_pic is smaller than the motion threshold value sad_pic_thr, since it is judged that the picture to be coded has a small prediction error and causes less degradation in the image quality or that the picture has relatively less motion, the minimum evaluation value dbf_value_min is assigned as the deblocking filter evaluation value dbf_value. It is thereby possible to prevent the fineness of the image from being damaged by the function of the deblocking filter which is stronger than necessary.
When sad_pic≧sad_pic_thr (“NO” in Step S5), the strength evaluation circuit 3 compares the picture activity value act_pic with the activity threshold value act_pic_thr to determine which is larger (Step S7). Herein, as expressed by Eq. (7), act_pic_thr=1800.
When act_pic<act_pic_thr (“YES” in Step S7), the maximum evaluation value dbf_value_max is set as the deblocking filter evaluation value dbf_value (Step S8). In this case, as expressed by Eq. (7), dbf_value_max=4.
When act_pic≧act_pic_thr (“NO” in Step S7), the default evaluation value dbf_value_def is set as the deblocking filter evaluation value dbf_value (Step S9). In this case, as expressed by Eq. (7), dbf_value_def=0.
Thus, when the picture activity value act_pic is smaller than the activity threshold value act_pic_thr, it is assumed that the picture to be coded is an image required to represent delicate changes, such as a flat image or a gradation image. Then, the maximum evaluation value dbf_value_max is assigned as the deblocking filter evaluation value dbf_value to improve the image quality. On the other hand, when the picture activity value act_pic is not smaller than the activity threshold value act_pic_thr, since it is judged that the image is complicate to some degree, the default evaluation value dbf_value_def is assigned as the deblocking filter evaluation value dbf_value.
After the deblocking filter evaluation value dbf_value is set in Step S3, Step S4, Step S6, Step S8, or Step S9, the process goes to the flowchart of
When the picture type is B picture, the strength evaluation circuit 3 sets a B picture filter evaluation value dbf_value_prev_B as a previous filter evaluation value dbf_value_prev (Step S11).
The strength evaluation circuit 3 holds the B picture filter evaluation value dbf_value_prev_B in a not-shown register. In Step S11, the B picture filter evaluation value dbf_value_prev_B stored in the register is read out and set as the previous filter evaluation value dbf_value_prev.
When the picture to be coded is I picture or P picture, the strength evaluation circuit 3 sets an IP picture filter evaluation value dbf_value_prev_IP as the previous filter evaluation value dbf_value_prev (Step S12).
The strength evaluation circuit 3 holds the IP picture filter evaluation value dbf_value_prev_IP in a not-shown register. In Step S12, the IP picture filter evaluation value dbf_value_prev_IP stored in the register is read out and set as the previous filter evaluation value dbf_value_prev.
After the previous filter evaluation value dbf_value_prev is set in Step S11 or Step S12, weighting addition of the deblocking filter evaluation value dbf_value and the previous filter evaluation value dbf_value_prev is performed, to thereby correct the deblocking filter evaluation value dbf_value (Step S13). Specifically, weighting addition of the filter evaluation value calculated for the picture to be coded and the filter evaluation value calculated for the picture which has been coded last time is performed.
For B picture, however, the previous filter evaluation value dbf_value_prev is the evaluation value calculated for the B picture which has been coded immediately before. On the other hand, for I picture or P picture, the previous filter evaluation value dbf_value_prev is the evaluation value calculated for the I picture or the P picture which has been coded immediately before.
Thus, for B picture and the P picture, individual (different) values are stored as the previous filter evaluation value dbf_value_prev. This is because the picture motion evaluation value sad_pic for P picture is assumed to be, for example, about twice as large as the picture motion evaluation value sad_pic for B picture and the filter strength is optimized by using different evaluation values. Further, though the same evaluation value is stored as the previous filter evaluation value dbf_value_prev for I picture and P picture in this preferred embodiment, individual (different) evaluation values may be stored also for I picture and P picture.
Specifically, as shown in
Subsequently, the strength evaluation circuit 3 determines the picture type of the frame to be coded (Step S14). When the picture to be coded is B picture, the previous filter evaluation value dbf_value_prev_B is updated by using the deblocking filter evaluation value dbf_value corrected by the weighting addition (Step S15). When the picture to be coded is I picture or P picture, the previous filter evaluation value dbf_value_prev_IP is updated by using the deblocking filter evaluation value dbf_value corrected by the weighting addition (Step S16). Through the above process, the calculation of the deblocking filter evaluation value dbf_value for the current picture to be coded is completed.
Referring again to
The H.264 encoder 4 codes the decoded image 102 in accordance with the H.264 coding system and outputs an H.264 stream 105. The H.264 encoder 4 performs a deblocking filtering process in the process of coding the decoded image 102. Specifically, the H.264 encoder 4 performs the deblocking filtering process on the image decoded by the local decoder. At that time, as filter parameters for the deblocking process, used are the following three parameters P1 to P3.
P1=disable_deblocking_filter_idc
P2=slice_alpha_c0_offset_div2
P3=slice_beta_offset_div2
The parameter P1 is used for determining whether to apply the deblocking filter. When the deblocking filter evaluation value dbf_value=−7, the parameter P1 is set to be 1 and application of the deblocking filter is set to be “OFF”. When the deblocking filter evaluation value dbf_value ranges from −6 to 6, the parameter P1 is set to be 0 and application of the deblocking filter is set to be “ON”. As discussed above, the deblocking filter evaluation value dbf_value=−7 in Step S6 of
The parameters P2 and P3 are used for setting the filter strength of the deblocking filter, and it is assumed in this preferred embodiment that the parameters P2=P3=dbf_value. As discussed above, the deblocking filter evaluation value dbf_value=4 in Step S3 of
The local decoded image after being subjected to the deblocking filtering process is stored into the DRAM 5 to be used for subsequent coding process by the H.264 encoder 4. When it is found in advance that the current picture is not used as a reference image in the subsequent coding process, however, it is not necessary to perform the deblocking filtering process or store the image into the DRAM 5. Omission of these processes allows reduction in the computation throughput and the memory capacity.
<Process Sequence>
First, in Step S110, the MPEG2 decoder 2 performs the decoding process for I picture (I0). Subsequently, in Step S120, the strength evaluation circuit 3 calculates the filter strength parameter 104 on the basis of the image feature value parameter 103 for the I picture (I0). Further subsequently, in Step S230, the H.264 encoder 4 performs the coding process for the I picture (I0).
Concurrently with Step S230, the MPEG2 decoder 2 performs the decoding process for P picture (P1) (Step S210). Further, concurrently with Step S230, the strength evaluation circuit 3 calculates the filter strength parameter 104 for the P picture (P1) (Step S220).
Thus, since the strength evaluation circuit 3 and the H.264 encoder 4 are different circuit blocks in the transcoder 1 of the present preferred embodiment, the H.264 encoder 4 performs the coding process while the strength evaluation circuit 3 calculates the filter strength parameter 104 for the next picture concurrently.
Subsequently, in Step S330, the H.264 encoder 4 performs the coding process for the P picture (P1). Concurrently with Step S330, the MPEG2 decoder 2 performs the decoding process for B picture (B2) (Step S310). Further, concurrently with Step S330, the strength evaluation circuit 3 calculates the filter strength parameter 104 for the B picture (B2) (Step S320). Thus, concurrently with the coding process for P picture, the filter strength parameter 104 can be calculated from B picture.
After that, similarly, the coding process for the B picture (B2) is performed in Step S430 while Steps S410 and S420 can be executed to calculate the filter strength parameter 104 from B picture (B3).
Further, the coding processes for the B picture (B3), P picture (P4), and B picture (B5) are performed in Steps S530, S630, and S730 while the filter strength parameters 104 can be calculated from the P picture (P4), the B picture (B5), and B picture (B6) in Steps S520, S620, and S720.
As discussed above, since the strength evaluation circuit 3 for calculating the filter strength parameter 104 and the H.264 encoder 4 are different circuits blocks in the transcoder 1 of the present preferred embodiment, a pipeline processing can be performed, where the calculation of the filter strength parameter 104 and the coding of the H.264 stream 105 are performed concurrently, to thereby increase the processing speed. In other words, since the filter strength parameter 104 is calculated from the image feature value parameter 103 for one picture in advance and then the coding process starts, the procedure of the coding process becomes simpler and this causes an increase in the processing speed.
Further, the deblocking filter evaluation value dbf_value used as the filter strength parameter 104 is corrected by the weighting addition with the evaluation value calculated from the picture to be coded and the evaluation value calculated from the picture which has been coded last time. It is thereby possible to prevent an abrupt change in the filter strength following a great change of the image feature value parameter 103. Further, it is possible to prevent degradation in the image quality caused by the variations in the filter strength among pictures and obtain smoother moving images.
<Variations>
In the above-discussed preferred embodiment, the transcoder for converting the MPEG2 stream into the H.264 stream is taken as an example. As another exemplary case, the present invention may be applied to an H.264 translator. In this case, though a filter strength parameter of an input stream can be used in the encoder, an optimized strength parameter may be determined by integrated evaluation of the strength parameter acquired from the input stream and the filter strength obtained by the method of the present invention.
Further, though the case where the MPEG2 coding system is converted into the H.264 coding system is taken as an example in the above-discussed preferred embodiment, this is only one exemplary application of the present invention. The present invention may be applied to, for example, a case where a decoder decodes a stream coded in a coding system other than the MPEG2 coding system or a case where an encoder encodes a stream into a coded stream of MPEG2, VC-1 (Video Codec 1), or the like.
Furthermore, when one picture in an input stream consists of a plurality of slices, there may be a method in which an average value of the filter strengths of the slices in the picture is obtained and the average value is used as a base for the filter strength of the picture.
While the invention has been shown and described in detail, the foregoing description is in all aspects illustrative and not restrictive. It is therefore understood that numerous modifications and variations can be devised without departing from the scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
2009-009473 | Jan 2009 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2009/055830 | 3/24/2009 | WO | 00 | 6/27/2011 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2010/084628 | 7/29/2010 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
6483851 | Neogi | Nov 2002 | B1 |
6983079 | Kim | Jan 2006 | B2 |
20020047919 | Kondo et al. | Apr 2002 | A1 |
20030206587 | Gomila | Nov 2003 | A1 |
20040017852 | Garrido et al. | Jan 2004 | A1 |
20040228535 | Honda et al. | Nov 2004 | A1 |
20050062746 | Kataoka et al. | Mar 2005 | A1 |
20060002477 | Bae | Jan 2006 | A1 |
20060182356 | Lillevold | Aug 2006 | A1 |
20070217520 | Kim et al. | Sep 2007 | A1 |
20080101469 | Ishtiaq et al. | May 2008 | A1 |
20080199090 | Tasaka et al. | Aug 2008 | A1 |
20080253454 | Imamura et al. | Oct 2008 | A1 |
20080307198 | Kataoka et al. | Dec 2008 | A1 |
20090141814 | Yin et al. | Jun 2009 | A1 |
20090263032 | Tanaka et al. | Oct 2009 | A1 |
20090304085 | Avadhanam et al. | Dec 2009 | A1 |
20090304086 | Shi et al. | Dec 2009 | A1 |
Number | Date | Country |
---|---|---|
2001-245293 | Sep 2001 | JP |
2004 343451 | Dec 2004 | JP |
2005 70938 | Mar 2005 | JP |
2008 22404 | Jan 2008 | JP |
2008-205534 | Sep 2008 | JP |
2008 263529 | Oct 2008 | JP |
Entry |
---|
(Xu et al. (“An Adaptive De-blocking Method based on Measuring Flatness of Macroblock,”ISPACS 2006, pp. 1-4). |
Zhang et al. (“An Efficient Arithmetic for Deblocking Filter of H264AVC Video Coding,” 4th Int'l Conf. on Wireless Communications, Networking and Mobile Computing, Oct. 12-14, 2008, pp. 1-3). |
Raja et al. (“In-loop deblocking filter for JVT H264AVC,” 5th WSEAS Int'l Conf. Signal Processing, Robotics and Automation, 2006, pp. 235-240). |
International Search Report Issued Apr. 21, 2009 in PCT/JP09/055830 filed Mar. 24, 2009. |
Number | Date | Country | |
---|---|---|---|
20110268366 A1 | Nov 2011 | US |