This application claims the benefit of Korean Patent Application No. 10-2009-0125386, filed on Dec. 16, 2009, in the Korean Intellectual Property Office, the disclosure of which is incorporated herein by reference.
1. Field of the Invention
The present invention relates to a method of assessing a quality of media image generated, transferred, and played according to a multi-level media compression scheme including a Scalable Video Coding (SVC) scheme, which is a Moving Picture Experts Group (MPEG)-21 Part13 standard. More particularly, the present invention relates to a method that may assess an image quality of each of a base layer and an enhanced layer using a different scheme and assess a quality of a multi-level video based on an assessment result, and thereby may provide an excellent performance in aspects of a calculation complexity and a processing rate.
2. Description of the Related Art
A video consisting of a plurality of layers, that is, a multi-level video may use a one-source and multi-use function of providing the same content by dynamically combining the layers depending on a network situation and thus is generally used for a multimedia transmission service.
To provide a function of dynamically configuring an image having a quality most suitable for a performance of a terminal or a communication environment in the multimedia transmission service, there is a need to assess and assess in real time the quality of the image provided to the terminal via a network, and thereby to support a server to determine a layer combination of the video to be transmitted.
In the convention art, schemes using only a quality of an image has been used as a quality assessment scheme of the image.
However, when applying the quality assessment scheme to each of layers constituting the image, it may increase a calculation complexity.
When performing a quality assessment with respect to only a final video where all the layers are integrated, it may be impossible to recognize an error and a quality change that may variously occur for each layer.
Accordingly, there is a need for an image quality assessment method that may accurately detect a change in an image quality of each media layer based on a structural characteristic of a multi-level media and a limited calculation capability of a terminal receiving media, and may also be readily applicable to the terminal.
An aspect of the present invention provides a system of assessing a quality of a multi-level video that may decrease a calculation complexity and quickly detect a quality change differently occurring in each layer of a media stream by directly assessing an image quality of a base layer with respect to a reconstructed image, by assessing an image quality of an enhanced layer using a decoding parameter for image reconstruction, and by assessing a final video quality based on a quality assessment result of each layer.
According to an aspect of the present invention, there is provided a system of assessing a quality of a multi-level video, including: a bitstream extractor to separate the multi-level video into a base layer bitstream and an enhanced layer bitstream; a base layer quality assessment module to generate quality assessment result information of a base layer image by assessing an image quality of the base layer bitstream; an enhanced layer quality assessment module to generate quality assessment result information of an enhanced layer image by assessing an image quality of the enhanced layer bitstream; and a final video quality assessment module to assess a quality of experience (QoE) of the multi-level video based on quality assessment result information of the base layer image and quality assessment result information of the enhanced layer image.
The enhanced layer quality assessment module may include: an enhanced layer analysis module to separate the enhanced layer bitstream into the plurality of bitstream units, and to verify a type of bitstream units and a number of bitstream units used for reconstructing the multi-level video; a parameter extraction module to extract, from each of the bitstream units, at least one decoding parameter associated with the image quality of the enhanced layer bitstream; and a quality assessment module to generate quality assessment result information of the enhanced layer image by assessing a quality of the enhanced layer image corresponding to the enhanced layer bitstream based on the decoding parameter.
The final video quality assessment module may include: an adaptation function selection module to select an adaptation function to be used for an image quality assessment based on the type of bitstream units and the number of bitstream units; and a final video quality assessment output module to output a QoE assessment result of the multi-level video by applying the selected adaptation function to quality assessment result information of the base layer image and quality assessment result information of the enhanced layer image.
According to another aspect of the present invention, there is provided a method of assessing a quality of a multi-level video, including: separating the multi-level video into a base layer bitstream and an enhanced layer bitstream; generating quality assessment result information of a base layer image by assessing an image quality of the base layer bitstream; generating quality assessment result information of an enhanced layer image by assessing an image quality of the enhanced layer bitstream using a scheme different from the base layer bitstream; and assessing a QoE of the multi-level video based on quality assessment result information of the base layer image and quality assessment result information of the enhanced layer image.
According to embodiments of the present invention, there is provided a system of assessing a quality of a multi-level video that may decrease a calculation complexity and quickly detect a quality change differently occurring in each layer of a media stream by directly assessing an image quality of a base layer with respect to a reconstructed image, by assessing an image quality of an enhanced layer using a decoding parameter for image reconstruction, and by assessing a final video quality based on a quality assessment result of each layer.
These and/or other aspects, features, and advantages of the invention will become apparent and more readily appreciated from the following description of exemplary embodiments, taken in conjunction with the accompanying drawings of which:
Reference will now be made in detail to exemplary embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to the like elements throughout. Exemplary embodiments are described below to explain the present invention by referring to the figures.
Referring to
The bitstream extractor 110 may receive the multi-level video and separate the multi-level video into a base layer bitstream and an enhanced layer bitstream.
In this instance, as shown in
The base layer quality assessment module 120 may generate quality assessment result information of the base layer image by assessing an image quality of the base layer bitstream separated by the bitstream extractor 110.
A configuration and an operation of the base layer quality assessment module 120 will be further described with reference to
The enhanced layer quality assessment module 130 may generate quality assessment result information of the enhanced layer image by assessing an image quality of the enhanced layer bitstream separated by the bitstream extractor 110.
A configuration and an operation of the enhanced layer quality assessment module 130 will be further described with reference to
The final video quality assessment module 140 may assess a quality of experience (QoE) of the multi-level video based on quality assessment result information of the base layer image generated by the base layer quality assessment module 120 and quality assessment result information of the enhanced layer image generated by the enhanced layer quality assessment module 130.
A configuration and an operation of the final video quality assessment module 140 will be further described with reference to
As shown in
The base layer image reconstruction module 310 may reconstruct a base layer image corresponding to the base layer bitstream using the base layer bitstream separated by the bitstream extractor 110, and may transmit the reconstructed base layer image to the base layer image quality assessment module 320. In this instance, the base layer quality assessment module 120 may not include the base layer image reconstruction module 310 and thus may receive a reconstructed base layer image from an apparatus having an image reconstruction as a major function such as a media player and then transmit the received reconstructed base layer image to the base layer image quality assessment module 320.
The base layer image quality assessment module 320 may generate quality assessment result information of the base layer image by assessing a quality of the base layer image.
In this instance, the base layer image quality assessment module 320 may assess the quality of the base layer image using one of a full-reference quality assessment scheme capable of using all of source image information prior to compressing or transmitting a reconstructed image, a reduced-referenced quality assessment scheme capable of using partial characteristic information extracted from a source image, and a non-reference quality assessment scheme of not using the source image information.
As shown in
The enhanced layer analysis module 410 may separate the enhanced layer bitstream separated by the bitstream extractor 110 into a plurality of bitstream units, and transmit the separated bitstream units to the parameter extraction module 420.
Also, the enhanced layer analysis module 410 may verify a type of bitstream units and a number of bitstream units used for reconstructing the multi-level video, and transmit the verified type and number of bitstream units to an adaptation function selection module 510 of the final video quality assessment module 140.
The parameter extraction module 420 may extract, from each of the bitstream units received from the enhanced layer analysis module 410, at least one decoding parameter associated with the image quality of the enhanced layer bitstream.
For example, when the multi-level video corresponds to an H.264/Scalable Video Coding (SVC) video including, in the enhanced layer bitstream, a spatial enhanced bitstream unit, a quality enhanced bitstream unit, and a time enhanced bitstream unit, the parameter extraction module 420 may extract at least one decoding parameter from parameters such as a resolution of the enhanced layer image from the spatial enhanced bitstream unit, a reference level of image information of the base layer for reconstruction (Intra-BL Mode and MVD), and a residual energy of a differential signal of the enhanced layer generated based on a differential signal of the base layer.
Also, the parameter extraction module 420 may extract, as the decoding parameter, at least one of a magnitude difference between quantization parameters from the quality enhanced bitstream unit, and a magnitude of the differential signal of the enhanced layer generated based on the differential signal of the base layer.
The quality assessment module 430 may generate quality assessment result information of the enhanced layer image by assessing a quality of the enhanced layer image corresponding to the enhanced layer bitstream based on the decoding parameter extracted by the parameter extraction module 420.
For example, the quality assessment module 430 may generate quality assessment result information of the enhanced layer image by applying, to the decoding parameter extracted from the spatial enhanced bitstream unit, the following Equation 1:
Here, Intra_BL_R denotes a ratio of a block using image information of the base layer to predict a signal of the enhanced layer with respect to the entire image, and MVD_A denotes a differential value of a motion vector of the enhanced layer generated based on a motion vector of the base layer.
Also, RE_SP denotes the residual energy of the differential signal of the enhanced layer generated based on the differential signal of the base layer. In this instance, RE_SP may be calculated according to Equation 2:
Here, R(i, j) denotes a magnitude of a differential signal corresponding to a pixel (i, j), row denotes a vertical length of an image resolution, and col denotes a horizontal length of the image resolution.
Also, although Equation 2 calculates an RE_SP value in a spatial area, the same result may be obtained by calculating the RE_SP value in a frequency domain.
As another example, the quality assessment module 430 may generate quality assessment result information of the enhanced layer image by applying, to the decoding parameter extracted from the quality enhanced bitstream unit, the following Equation 3:
VQM_enhance=f(g(QP—R),h(RE—SN)),if_there_is_no_value→RE—SN,QP—R=0 [Equation 3]
Here, QP_R denotes a difference value between a quantization parameter of the base layer and a quantization parameter of the enhanced layer, and RN_SN denotes a residual energy of the differential signal of the enhanced layer generated based on the differential signal of the base layer. In this instance, RE_SN may be calculated using Equation 2.
As shown in
The adaption function selection module 510 may select an adaptation function to be used for an image quality assessment based on the type of bitstream units and the number of bitstream units.
The final video quality assessment output module 520 may output a QoE assessment result of the multi-level video by applying the selected adaptation function to quality assessment result information of the base layer image transmitted from the base layer image quality assessment module 320, and quality assessment result information of the enhanced layer image transmitted from the quality assessment module 510.
For example, the final video quality assessment output module 520 may calculate a QoE assessment result VQM by applying, to quality assessment result information of the base layer image VQM_base and quality assessment result information of the enhanced layer image VQM_enhance, the following Equation 4:
VQM=α×(VQM_base+β)×(VQM_enhance+δ)
Here, each of α, β, and δ denotes a constant, and may be variously applicable depending on which layer image quality result may be assigned with a weight.
In operation S610, the bitstream extractor 110 may receive the multi-level video and separate the multi-level video into a base layer bitstream and an enhanced layer bitstream.
In operation S620, the enhanced layer quality assessment module 130 may generate quality assessment result information of the enhanced layer image by assessing an image quality of the enhanced layer bitstream separated in operation S610.
In operation S630, the enhanced layer analysis module 410 may separate the separated enhanced layer bitstream into a plurality of bitstream units, and verify a type of bitstream units and a number of bitstream units used for reconstructing the multi-level video.
In operation S640, the parameter extraction module 420 may extract, from each of the bitstream units, at least one decoding parameter associated with the image quality of the enhanced layer bitstream.
In operation S650, the quality assessment module 430 may generate quality assessment result information of the enhanced layer image by assessing a quality of the enhanced layer image corresponding to the enhanced layer bitstream based on the extracted decoding parameter.
In this instance, operations S630, S640, and S650 may be performed with operation S620 in parallel or orders thereof may be changed and thereby be performed.
In operation S660, the adaption function selection module 510 may select an adaptation function to be used for an image quality assessment based on the type of bitstream units and the number of bitstream units.
In operation S670, the final video quality assessment output module 520 may output a QoE assessment result of the multi-level video by applying the selected adaptation function to quality assessment result information of the base layer image generated in operation S620, and the quality assessment result information of the enhanced layer image generated in operation S650.
According to embodiments of the present invention, a system of assessing a quality of a multi-level video may decrease a calculation complexity and quickly detect a quality change differently occurring in each layer of a media stream by directly assessing an image quality of a base layer with respect to a reconstructed image, by assessing an image quality of an enhanced layer using a decoding parameter for image reconstruction, and by assessing a final video quality based on a quality assessment result of each layer.
Although a few exemplary embodiments of the present invention have been shown and described, the present invention is not limited to the described exemplary embodiments. Instead, it would be appreciated by those skilled in the art that changes may be made to these exemplary embodiments without departing from the principles and spirit of the invention, the scope of which is defined by the claims and their equivalents.
Number | Date | Country | Kind |
---|---|---|---|
10-2009-0125386 | Dec 2009 | KR | national |