Preferred embodiments of the present invention will be described hereinafter in detail with reference to the accompanying drawings.
As shown in
A configuration or function of each of the constituent elements of the automatic objective perceptual video quality evaluation apparatus according to the embodiment will be described in detail.
The feature amount extracting unit 1 extracts three video feature amounts necessary to derive a subjective video quality, that is, a block distortion degree P1, a PSNR overall temporal fluctuation degree P2, and a PSNR local temporal fluctuation degree P3. A method of deriving each of the video feature amounts will be described.
The block distortion degree calculating unit 11 calculates an intra-frame average dDC(f) of a DC difference between a pixel block 21 of an arbitrary size shown in
P
1=max{dDCRef(f)−dDCCod(f)}−min{dDCKRef(f)−dDCCod(f)}
In the equation, dDCRef(f) denotes the intra-frame average of the DC difference for the original video x, and dDCCod(f) denotes the intra-frame average of the DC difference for the evaluated video y. In the example shown in
The PSNR overall temporal fluctuation degree P2 is calculated using a maximum value, a minimum value, and an average value of an intra-sequence power error MSE (mean square error).
First, the maximum value, the minimum value, and the average value of the MSE between the original video x and the evaluated video y are defined. If the maximum value, the minimum value, and the average value of the MSE are denoted by emin, emax, and eave, respectively, they are defined as represented by the following Equation (2).
In the Equation (2), x(f, n) denotes a signal value of an nth pixel in the frame f, Np denotes the number of pixels in the frame, and Nf denotes the number of frames in a sequence. For example, if a video quality for ten seconds in which frames are updated 15 times per second is to be evaluated, the number of frames in the sequence is 150. If the sequence of the original video x and that of the evaluated video y differ in frame rate, then corresponding frames are detected by means such as frame matching means, and a PSNR between the corresponding frames is derived.
Next, the PSNR overall temporal fluctuation degree P2 based on the maximum value emax, the minimum value emin, and average value eave is calculated. As stated, the PSNR is significant information for estimating the subjective video quality. However, it is confirmed that the correlation between the objective video quality index and the subjective video quality tends to decrease if only the intra-sequence average value is used while the video quality has great temporal fluctuation in the sequence. Therefore, the PSNR overall temporal fluctuation degree P2 is defined as represented by the following Equation (3) according to deviations of the maximum value emax and the minimum value emin from the average value eave of the intra-sequence power error.
In the Equation (3), f(eave) denotes a scaling function for changing a value according to the average value eave of the intra-sequence average MSE. As to the scaling function f(eave), an arbitrary function monotonically increasing in all ranges of the average value eave (which are, however, substantially in an range eave>0 according to the definition of eave) is available. Examples of the scaling function f(eave) include following functions.
The linear characteristic function is defined as f(eave) eave. A linear characteristic thereof is that shown in
The sigmoid function has a characteristic of saturating in a high eave part and a low eave part. The sigmoid function is defined as represented by the following Equation (4).
The sigmoid function has a characteristic shown in
As can be seen from the property that the function f(eave) monotonically increases, the following effect can be produced according to a term of the function f(eave). If the average value eave is small, that is, the average MSE is small and the video quality of the evaluated video is high, the PSNR overall temporal fluctuation degree P2 is decreased. If the average value eave is large and the video quality of the evaluated video is low, the PSNR overall temporal fluctuation degree P2 is increased. Furthermore, if the sigmoid function is used as the scaling function f(eave), the property of saturating to certain values in regions on both ends shown in
The low rate coding intended at multimedia applications tends to generate temporally local degradations in PSNR resulting from key frame insertion, scene change, occurrence of a sudden motion or the like. Due to this, degradations in the subjective video quality caused by these local degradations are detected based on the PSNR local temporal fluctuation degree P3.
As shown in
P3=max{dPSNR(f)|f ε sequence} Equation (5)
The PSNR local temporal fluctuation degree P3 may be multiplied by a scaling function for changing a value according to the MSE of the frame f. As this scaling function, an arbitrary function that monotonically decreases according to the MSE is applicable.
An objective evaluation index Qobj is defined as represented by the following equation using a weighted sum of the above-stated objective evaluation measures P1, P2, and P3.
Q
obj
=αP
1
+βP
2
+γP
3
In the equation, symbols α, β, and γ denote weight parameters. The weight parameters α, β, and γ are selected so that an estimated error of the objective video quality from the subjective video quality becomes minimum when the objective evaluation index Qobj is subjected to conversion processings by the frame rate-specific correcting unit 4 and the objective evaluation index-subjective video quality mapping unit 5. For example, the weight parameters α, β, and γ can be respectively set to 0.2, 0.4, and 0.004 (α=0.2, ⊖=0.4, and γ=0.004) The weight parameters α, β, and γ may be negative numbers.
The frame rate detecting unit 3 analyzes a video signal of the evaluated video y and outputs its frame rate. According to the present invention, it is premised that frame rate of the original video x is equal to or higher than that of the evaluated video y. Due to this, the frame rate detecting unit 3 detects the frame rate of the evaluated video y, which is lower than the frame rate of the original video x.
The frame rate detecting unit 3 outputs the detected frame rate to the frame rate-specific correcting unit 4.
If a correlation between the objective evaluation index Qobj output from the weighted sum calculating unit 2 and the subjective video quality (DMOS) is obtained, the correlation often differs in characteristics among frame rates a, b, c, etc. as shown in
As shown in
DMOSb=c
0
×Q
a
+c
1
Q
a
=DMOSb/c
0
−c
1
Finally, if the relationship between the objective evaluation index Qobj and the subjective evaluation measure DMOS after the frame rate-specific correction is calculated using many samples, the relationship is shown in, for example,
However, if these pieces of data are classified according to the frame rates, it is understood that data sets are irregular among the frame rates. Therefore, as shown in
DMOS=−0.0035x3+0.1776x2−2.8234x+14.379 (where x=Qobj)
Therefore, this polynomial function is stored in the objective evaluation index-subjective video quality mapping unit (or the subjective video quality estimated value deriving unit) 5 in advance. The corrected objective video quality index Qobj is applied to the polynomial function, thereby deriving the subjective video quality estimated value. Namely, points on a solid-line curve shown in
As stated so far, according to the present invention, it is possible to estimate the objective video quality of the video at low resolution and low frame rate such as a multimedia video without relaying on subjective human judgment.
Needless to say, the methods of deriving the block distortion degree, the PSNR overall temporal fluctuation degree, and the PSNR local temporal fluctuation degree executed by the feature amount extracting unit 1, and the method of calculating the weighted sum executed by the weighted sum calculating unit 2 are given only for illustrative purposes. The other deriving methods and the other calculation method can be applied to the present invention.
Number | Date | Country | Kind |
---|---|---|---|
2006-208091 | Jul 2006 | JP | national |