The present invention relates to a method for measuring flicker, and particularly relates to a method that can accurately measure per-frame flickering artifacts.
Inter-frame flickering is a commonly seen artifact in encoded video signals, especially at periodic I-frames in low or medium bit rate coding, which greatly degrades the overall perceptual quality of the coded video. How to accurately measure the flickering artifact is an important issue in practice. Existing flicker metrics are typically rough solutions suitable only for measuring I-frame flickering of the whole sequence to provide a single measure of I-frame flicker that is descriptive of the whole sequence, and fail when measuring a per-frame flickering artifact (that is, they fail to provide a single measure for a given frame).
Inter-frame flickering is a commonly seen artifact in video coding, as well. Although they are visually identical in the original video signal, the same static background areas in two consecutive video frames may look different in the encoded/reconstructed video. This is because the areas are encoded through different encoding processes with different coding parameters, for example, different macro-block (MB), quantization parameters (QP), and/or different MB coding modes (such as the Inter or Intra mode). Consequently, flickering will be perceived when playing out the coded video.
Flickering is more easily perceived in static or low motion areas of video frames than in medium or high motion areas, because of the well-known motion masking effect of human visual systems (HVS). In addition, flickering is more prominent in low or medium bit rate coding and/or at periodic I-frames in a coded sequence. While the previous frame is coded as a P-frame using inter-frame prediction, the current frame is coded as an I-frame with all the MBs, using intra-frame prediction. Coarse quantization is often used due to limited bit rate. Accordingly, a large reconstruction difference may result for the corresponding static MBs collated in the two frames, and hence, a serious I-frame flickering or pulsing artifact may result. Flickering artifacts, especially I-frame flickering, greatly compromise the perceptual quality of coded video.
How to accurately measure the flickering artifact is an important issue in practice. A good method for measuring flickering would not only prove valuable to evaluate the performance of various de-flickering schemes, but also, perhaps more importantly, such a method can be exploited to develop various encoder optimization techniques to more effectively reduce flickering. For example, rate-distortion (RD) optimization is a commonly adopted framework in practice to make encoding decisions over a variety of coding options, for example, macro-block (MB), motion vector, and coding mode, etc. Conventional RD optimization only considers the distortion between the reconstructed video and the original video. To reduce flickering, and hence, improve perceptual video coding quality, one can further introduce the flickering distortion in RD optimizations.
A widely used method to measure flicker is the so-called S metric, which measures the overall I-frame flickering artifact of an entire coded video sequence. It is defined as follows.
Herein, the concerned frames involve all the non-scene-change I-frames in a sequence. That is, frame n denotes a non-scene-change I-frame and frame n−1 is the frame played out before frame n. Note that scene-change I-frames are excluded, as flickering typically only occurs for frames in the same scene. Additionally, fn,ij and {circumflex over (f)}n,ij denote the original and the reconstructed pixel j of MB i in frame n, respectively. M denotes the total number of pixels in a MB, and Nstatic
MB i in frame n is a static MB, if its mean-absolute-difference (MAD) is small, that is:
In practice, parameter ε could be, for example, 10. Note that instead of MAD, (1) and (2) can also be defined based on another measure, such as, for example, mean-square-difference (MSD) with parameter ε set to, for example, 500.
The S metric has been widely used to evaluate the performance of various I-frame de-flickering schemes. However, in practice, via experiments, the S metric is actually a fairly rough model. For example, it is only applicable to measure the overall I-frame flickering of an entire sequence, and fails when applied to generally measure flickering of each single frame. Note that the S metric is intended to be applied to a sequence, as evidenced by the summation in Equation 1 over a set of I frames. Removing that summation results in a metric that provides a result on a per-frame basis, but that result is not reliable as discussed below.
One fundamental defect of the metric is that using |{circumflex over (f)}n,ij−{circumflex over (f)}n-1,ij−(fn,ij−fn-1,ij)| to measure flicker at each pixel is only accurate if |fn,ij−fn-1,ij| is small (for example, less than 10). However, even with a small MAD for the entire MB, |fn,ij−fn-1,ij| of some particular pixels could be large (for example, around 100). In this case, small |{circumflex over (f)}n,ij−{circumflex over (f)}n-1,ij| pixels which actually render little flickering, will lead to large |{circumflex over (f)}n,ij−{circumflex over (f)}n-1,ij−(fn,ij−fn-1,ij)|, that is, large flickering distortion in the metric. This contradicts the actual viewing experience, and compromises the accuracy of the flickering model. As an example,
Another drawback of the existing metric is that the static MBs are MBs at exactly the same location in the two consecutive frames. That is, no motion compensation is involved. However, a flickering artifact may be perceivable not only at completely static areas, but also at low motion areas. Hence, low motion MBs should also be included in the flickering distortion calculation. However, without motion compensation, even very low motion MBs may have a large inter-frame MB difference. Thus, such low motion MBs are inappropriately excluded from the calculation, which also compromises the accuracy of the metric.
A method is provided that can accurately measure flicker of each single frame all the time, and outperforms the known techniques. The method can be generally applied for accurate perceptual quality evaluation and development of effective flicker reduction schemes. The method for measuring flicker first determines an original value based on a difference between pixel values in corresponding locations in a first picture and a second picture. Next, a reconstructed value is determined, based on a difference between pixel values in corresponding locations in a reconstruction from an encoding of the first picture and a reconstruction from an encoding of the second picture. Then, a determination is made to include a comparison value, which is determined on a difference between the reconstructed value and the original value, in a measurement of flicker between the first picture and the second picture.
The invention will be explained in greater detail in the following with reference to embodiments, referring to the appended drawings, in which:
The invention will now be described in greater detail. Reference will now be made in detail to the implementations of the present invention, which are illustrated in the accompanying drawings and equations.
In at least one implementation, a method to measure flicker is provided that accurately measures a per-frame flickering artifact, and renders better measuring performance than the existing S metric. Experiments show that the provided method accurately measures flickering of a frame, all the time. That is, the method accurately measures flickering on a frame basis (indeed, the method accurately measures flickering on a macro-block basis). The proposed reliable method can be generally applied for accurate perceptual quality evaluation or the development of effective flicker reduction techniques.
The proposed method, according to the invention and one implementation, is defined as follows. The flickering distortion of frame n is calculated as:
Herein, NMBS is the total number of static MBs in a given frame, my denotes the motion vector (MV) of the MB, and MB i+mv of frame n−1 represents the motion compensation reference for MB i of frame n.
Similar to the existing method, S metric, besides absolute difference, the proposed method can also be based on other distortion measures, for example, squared difference.
There are at least three changes in the proposed method from the existing method, S metric. A “max” function is used, and the proposed method calculates the pixel differences differently. Further, the proposed method includes motion compensation, and is performed all the time, on a frame-by-frame basis. The existing method, S metric, includes measurement over a series of frames.
Comparing the existing model, as detailed in Equation (1), with the proposed model disclosed in Equation (3), a fundamental and important change is that flickering of a pixel is measured by max(0, |{circumflex over (f)}n,ij−{circumflex over (f)}n-1,i+mvj|−|fn,ij−fn-1,i+mvj|), as show in Equation (3). To isolate this change, the flickering of a pixel is not measured by |{circumflex over (f)}n,ij−{circumflex over (f)}n-1,i+mvj|−(fn,ij−fn-1,i+mvj)|, as would be predicted if you extended the S metric to include motion information. For now, the change is described in the form of the pixel-based flickering measurement rather than the adjustment for motion. In the proposed method, |fn,ij−fn-1,ij| actually serves as a just-noticeable-difference (JND) threshold on flickering, below which it is assumed that no flickering will be perceived. It should be noted that motion information is included in the discussion for now, in order to isolate the pixel-based calculation.
It is evident that in the case of large |fn,ij−fn-1,ij|, the flickering masking JND threshold is also large. Thus, small |{circumflex over (f)}n,ij−{circumflex over (f)}n-1,ij| will be masked out, and make no contribution to the overall flickering distortion. This is well correlated with the actual viewing experience, where small |{circumflex over (f)}n,ij−{circumflex over (f)}n-1,ij| always represents smooth optical flow, and hence, little flickering artifact. This fundamental change allows the proposed method to provide superior in performance to the existing model, and hence, can be applied to accurately measure a frame-level flickering artifact of any individual frame, and not be limited to non-scene-change I-frames. Changes in the reconstructed images can be masked out of the distortion calculation if the underlying target image had at least as great a difference.
As an example,
In the proposed method, the definition of static MBs also involves a certain degree of motion compensation, to account for the flickering artifact at completely static or low motion MBs. This will lead to more accurate flickering modeling performance. A static MB is defined as a MB with:
Motion estimation is typically conducted over a limited search range (for example, within 3 pixels). Beyond that range, it is assumed that flickering will be all masked out by the high motion. In addition, in motion estimation, a sub-pixel MV search will generally yield much more accurate results than a full-pixel search, and hence, is generally more preferable in practice.
Superior performance of the proposed method for measuring flicker is also provided by using max(0, |{circumflex over (f)}n,ij−{circumflex over (f)}n-1,ij|−|fn,ij−fn-1,ij|) to calculate the per-pixel flickering artifact. Again, motion information has been removed to isolate the change in form. That is, even without including motion information, the proposed method provides superior performance to the existing method, the S metric. Using motion compensation further improves the modeling accuracy, although it does so generally at the price of increased computation complexity. Therefore, in practice, one can decide whether to use motion compensation or not based on the capability of the actual system. That is, one can determine for any given implementation whether or not the additional computation complexity is affordable and/or justified by the increased accuracy.
Due to the superior performance, the proposed method to measure flicker can be applied to develop effective flickering reduction techniques. For example, the method provides an accurate pixel-based metric calculation and so accurately measures flickering of each individual MB (as well as frame). Accordingly, the proposed method for measuring flicker can be integrated into the widely adopted RD optimization framework to optimize various coding parameters, such as, for example, MVs, quantization parameters, or MB coding modes, etc. In contrast, the existing flickering metric has been shown not to have enough modeling accuracy at a frame level, and so does have enough modeling accuracy at the MB level either., and therefore is not well-suited to integration into standard RD optimization frameworks.
One or more implementations are provided having particular features and aspects. However, features and aspects of described implementations may also be adapted for other implementations. For example, the described metric can be varied in different implementations in several ways. Some of these ways include, for example, adjusting the threshold in Equation 4, using a different metric in Equation 4 (such as, for example, one based on squared difference), or adjusting the “max” expression in Equation 3 to ignore differences that are not greater than a threshold. Although implementations described herein may be described in a particular context, such descriptions should in no way be taken as limiting the features and concepts to such implementations or contexts.
The implementations described herein may be implemented in, for example, a method or process, an apparatus, or a software program. Even if only discussed in the context of a single form of implementation (for example, discussed only as a method), the implementation or features discussed may also be implemented in other forms (for example, an apparatus or program). An apparatus may be implemented in, for example, appropriate hardware, software, and firmware. The methods may be implemented in, for example, an apparatus such as, for example, a computer or other processing device. Additionally, the methods may be implemented by instructions being performed by a processing device or other apparatus, and such instructions may be stored on a computer readable medium such as, for example, a CD, or other computer readable storage device, or an integrated circuit. Further, a computer readable medium may store the data values produced by an implementation.
As should be evident to one of skill in the art, implementations may also produce a signal formatted to carry information that may be, for example, stored or transmitted. The information may include, for example, instructions for performing a method, or data produced by one of the described implementations.
Additionally, many implementations may be implemented in one or more of an encoder, a pre-processor to an encoder, a decoder, or a post-processor to a decoder.
In an implementation (for example, a method or an apparatus or a set of instructions) of the invention, a method is provided that measures I-frame flicker that masks, from a distortion calculation, a difference in corresponding pixels in adjacent reconstructed pictures (or frames) if the difference is not greater than a corresponding difference in the underlying target images. Further, motion estimation is used to determine which pixels correspond to each other.
As should be evident to one of skill in the art, implementations may also produce a signal formatted for communicating the measurement of I-frame flicker.
Creating, assembling, storing, transmitting, receiving, and/or processing a measure of flicker according to one or more implementations described in this disclosure.
According to the invention, a device (such as, for example, an encoder, a decoder, a pre-processor, or a post-processor) has been considered that is capable of operating according to, or in communication with, one of the described implementations, as well as a device (such as, for example, a computer readable medium) for storing a measure of I-frame flicker according to an implementation described in this disclosure, or for storing a set of instructions for measuring I-frame flicker according to one or more of the implementations described in this disclosure.
Additionally, and according to the merits of the invention, a signal is considered that is formatted in such a way to include information relating to a measure of I-frame flicker, as described in this disclosure. The signal may be an electromagnetic wave or a baseband signal, wherein the information includes one or more of residue data, motion vector data, and reference indicator data.
Further, other implementations are contemplated by this disclosure. For example, additional implementations may be created by combining, deleting, modifying, or supplementing various features of the disclosed implementations. Additionally, the invention is intended to include a device or devices which can perform the methods disclosed above and which can apply the methods to reduce distortion.
This application claims the benefit, under 35 U.S.C. §365 of International Application PCT/US09/00060, filed Jan. 7, 2009, which was published in accordance with PCT Article 21(2) on Jul. 23, 2009 in English and which claims the benefit of U.S. provisional patent application No. 61/011,507, filed Jan. 17, 2008.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US2009/000060 | 1/7/2009 | WO | 00 | 7/14/2010 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO2009/091503 | 7/23/2009 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
6714191 | Wu et al. | Mar 2004 | B2 |
7218777 | Asano et al. | May 2007 | B2 |
20040165084 | Yamamoto et al. | Aug 2004 | A1 |
20050018920 | Ramamurthy | Jan 2005 | A1 |
20060061669 | Jang et al. | Mar 2006 | A1 |
20060158531 | Yanof | Jul 2006 | A1 |
20070025621 | Lee et al. | Feb 2007 | A1 |
20070036213 | Matsumura et al. | Feb 2007 | A1 |
20070046790 | Nakasuji et al. | Mar 2007 | A1 |
20080025397 | Zhao et al. | Jan 2008 | A1 |
20090046092 | Sato et al. | Feb 2009 | A1 |
Number | Date | Country |
---|---|---|
0936810 | Aug 2006 | EP |
2005348008 | Dec 2005 | JP |
200749629 | Feb 2007 | JP |
2007067469 | Mar 2007 | JP |
2007214785 | Aug 2007 | JP |
2009503999 | Jan 2009 | JP |
Entry |
---|
Becker et al., “Flicker Reduction in Intraframe Codecs”, IEEE Computer Society, Proceedings of the Data Compression Conference (DCC'04), 1068-0314/04, 2004, IEEE. |
Chun et al., “Flicker Reduction in Intra Coded Frames of H.264/AVC”, Korea University, 1-4244-0481-9/06, 2006 IEEE. |
Hara et al., “Flicker Reduction Technique in MPEG-2 Video by Post-Processing”, NHK Science & Technical Research Laboratories, 0-7803-883-0/05, Tokyo, Japan, 2005, IEEE. |
Pandel, “Measuring of Flickering Artifacts in Predictive Coded Video Sequences, Ninth International Workshop on Image Analysis for Multimedia Interactive Services”, 978-0/7695-3130-4/08, 2008 IEEE. |
Search Report Dated Feb. 20, 2009. |
Number | Date | Country | |
---|---|---|---|
20110052084 A1 | Mar 2011 | US |
Number | Date | Country | |
---|---|---|---|
61011507 | Jan 2008 | US |