The present invention relates generally to image processing, and in particular to image scaling.
With the proliferation of video communications, it is becoming increasingly important to improve the quality of images displayed on large display screens at a high resolution. Typically, image/video post-processing functions are implemented to improve and enhance the image/video signals displayed.
Digital video image content is typically encoded by a variety of digital compression techniques such as JPEG and MPEG to meet data bandwidth limitations in communication networks. Compressed digital images contain varying degrees of artifacts that deteriorate the quality of displayed video images and scenes. Such artifacts are referred to herein as “compression noise or blocking artifacts”. As such, compression noise reduction is applied in post-processing to reduce noise. Compression noise reduction detects and removes JPEG/MPEG blocking artifacts from the digital videos before displaying on a screen.
In compression noise reduction, a process for removing blocking artifacts is performed by filtering along block boundaries which are caused by data compression, such as MPEG. However, as the size and resolution of the display screens are increasing, such display screens are larger than the original size of image/video signals transferred through a network. As such, the signals transferred are enlarged using a scaling ratio to fit the size of the larger displays. If the scaling ratio is known, a typical blocking artifact reducer may adjust the location of image filtering according to the scaling ratio. However, if the scaling ratio is not known (such as for an outsource input to a television), then such a blocking artifact reducer fails to remove the artifacts effectively.
The present invention provides a method and system for image scaling detection. One embodiment involves receiving a decoded scaled input image comprising a plurality of pixels, wherein the input image has a scaling ratio relative to an original image; detecting blocking boundary artifact pixels in the image; determining a sum of pixel values for each blocking boundary artifact; detecting the pixel distance value between each pair of neighboring block boundaries; and determining said scaling ratio based on a distance value and said sum of pixel values.
Determining the scaling ratio may further include determining pixel position difference values for each pair of neighboring sums of pixel values, determining a peak value among the pixel position difference values, and determining said scaling ratio based on said peak value and said pixel position difference value. The scaling ratio may be determined as a ratio of said peak value to said pixel position difference value.
These and other features, aspects and advantages of the present invention will become understood with reference to the following description, appended claims and accompanying figures.
The present invention provides a method and system for detecting the scaling ratio of scaled (enlarged/reduced) image/video signals for image processing such as enhancement, restoration, compression noise reduction, etc. One embodiment involves detecting a scale ratio of an enlarged digital image relative to an original digital image. The scaling ratio can guide the application of image processing techniques to proper locations in an enlarged (scaled) image.
An implementation of the present invention involves detecting an image scale ratio and applying appropriate image processing to the scaled image based on the detected image ratio. The present invention is also applicable to cases where an image size is reduced (scaled down by a ratio) compared to an original image. Utilizing the scaling ratio may improve the performance of blocking artifact reducers for enhancing the compressed image/video data that is then decompressed and enlarged for display, wherein blocking artifact reduction is applied to the decompressed and enlarged image.
Referring back to
Now referring to
Next, four commonly used directional Sobel operators are applied to the low-pass filtered image YL(x,y) to find image edges YH as:
wherein, abs(·) indicates an absolute value of (·), Mk(i,j) is the kth Sobel operator according to the relation set below, wherein each Sobel filter is used to obtain edge information:
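A minimal sketch of this edge-detection step follows. It assumes the standard 3×3 Sobel kernels for the two axis-aligned and two diagonal directions, and it assumes that the per-pixel edge strength YH is the maximum absolute response over the four kernels. Both the kernel values and the combination rule are assumptions made for illustration; the function name `sobel_edges` is hypothetical.

```python
# Four 3x3 directional Sobel kernels (assumed standard forms, not taken from the text).
SOBEL_KERNELS = [
    [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]],   # responds to vertical edges
    [[-1, -2, -1], [0, 0, 0], [1, 2, 1]],   # responds to horizontal edges
    [[0, 1, 2], [-1, 0, 1], [-2, -1, 0]],   # responds to one diagonal
    [[-2, -1, 0], [-1, 0, 1], [0, 1, 2]],   # responds to the other diagonal
]


def sobel_edges(yl):
    """Return an edge-strength map YH for a low-pass filtered image YL.

    yl is a list of rows of numbers. Border pixels are left at 0 because
    the 3x3 window does not fit there.
    """
    h, w = len(yl), len(yl[0])
    yh = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            responses = []
            for m in SOBEL_KERNELS:
                acc = 0
                for j in range(3):
                    for i in range(3):
                        acc += m[j][i] * yl[y + j - 1][x + i - 1]
                responses.append(abs(acc))
            # Assumed combination rule: keep the strongest directional response.
            yh[y][x] = max(responses)
    return yh
```

For an image with a single vertical step edge, the response is nonzero only in the two columns adjacent to the step, which is the behavior the later column-counting steps rely on.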
A binary map YB for edges/boundaries is then obtained, for example, using a threshold value τ as:
wherein YH represents a boundary as noted above.
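The thresholding step can be sketched as below, assuming that YB is 1 where YH exceeds τ and 0 elsewhere. The strict inequality and the function name `binary_edge_map` are assumptions for illustration.

```python
def binary_edge_map(yh, tau):
    """Return binary map YB: 1 where edge strength YH exceeds tau, else 0.

    The strict '>' comparison is an assumption; '>=' would be an equally
    plausible reading of the thresholding step.
    """
    return [[1 if value > tau else 0 for value in row] for row in yh]
```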
Next, a total sum 402 of occurrences in each image column c containing a boundary 306 is obtained (step 303). That is, for each column of the enlarged image, the pixels that the binary map marks as block edges are counted.
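As an illustration, the per-column count can be sketched as follows, assuming the binary map is stored as a list of rows. The function name `column_edge_counts` is hypothetical.

```python
def column_edge_counts(yb):
    """Return, for each column c, the number of pixels marked 1 in binary map YB."""
    # zip(*yb) iterates over columns of a row-major list of rows.
    return [sum(column) for column in zip(*yb)]
```

Columns that coincide with scaled block boundaries are expected to collect much larger counts than columns crossed only by ordinary image edges.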
Further, the sum 402 of occurrences in each of said columns c is then projected on a histogram 403 after thresholding (step 305). Bins of the histogram whose count exceeds a threshold are selected. Histogram 403 shows the result of the previous counts of occurrence in each column. The vertical axis of the histogram is the number of pixels belonging to the edge.
Thereafter, moving from left to right across the histogram 403, the column distance between the peaks in the projection histogram 403 is obtained (i.e., peak-to-peak column distance Δ1, Δ2, . . . ). Once the bins of the histogram greater than a certain amount (threshold) are selected, the distance from a selected bin to its nearest selected neighbor is the peak-to-peak distance.
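A minimal sketch of the peak selection and peak-to-peak measurement, assuming the column counts are held in a list indexed by column and that a bin is selected when its count is strictly greater than the threshold. The function name and the parameter `min_count` are hypothetical.

```python
def peak_to_peak_distances(counts, min_count):
    """Return distances between consecutive columns whose count exceeds min_count."""
    # Columns selected as peaks, in left-to-right order.
    peaks = [c for c, value in enumerate(counts) if value > min_count]
    # Distance from each selected column to the next selected column.
    return [right - left for left, right in zip(peaks, peaks[1:])]
```

Note that two adjacent columns above the threshold yield a distance of 1 in this sketch; a practical implementation might merge such neighbors before measuring distances.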
The peak-to-peak column distance is then projected on a histogram 405 of peak-to-peak column distance Δ (step 307). The vertical axis of histogram 405 is the number of times a peak-to-peak distance falls in each bin.
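The distance histogram can be sketched with a simple counter, assuming one bin per integer distance. The function name `distance_histogram` is hypothetical.

```python
from collections import Counter


def distance_histogram(distances):
    """Return a mapping from peak-to-peak distance to its number of occurrences."""
    return Counter(distances)
```

For example, `distance_histogram([12, 12, 13, 12, 24])` places three counts in bin 12 and one count each in bins 13 and 24, so bin 12 is the most populated.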
Finally, from the histogram 405, the scale ratio r can be determined according to relation (1) below, as:
wherein Δpeak is the highest peak in the peak-to-peak distance histogram (i.e., Δpeak=maximum(Δ1, Δ2, . . . )) and n is the number of pixels between the boundaries 306e in
Further, a computation may be performed with the following three cases assuming the highest peak is at the location Δpeak. First, when there is no neighboring peak, then the scale ratio r may be determined according to relation (2) below, as:
Second, when a second highest peak is in the left-hand side of Δpeak, then the scale ratio r may be determined according to relation (3) below, as:
where, f(Δ) is the number/value accumulated in the Δth bin of the given histogram.
The second highest peak is smaller than the highest peak. Based on this second highest peak, a more accurate ratio can be calculated statistically.
Third, when the second highest peak is in the right-hand side of Δpeak, then the scale ratio r may be determined according to relation (4) below, as:
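The three cases can be sketched as follows under stated assumptions. The sketch takes Δpeak as the most populated bin of the distance histogram, takes n as the block size of the original image (8 is assumed as a default, as is common for JPEG/MPEG blocks), and treats a "neighboring peak" as a nonzero count in the bin immediately to the left or right of Δpeak. With no neighbor, it returns Δpeak/n. With a neighbor, it returns the f(Δ)-weighted mean of the two bins divided by n; this weighted-mean form is one plausible reading of the statistical refinement and is an assumption, not a reproduction of relations (2) to (4).

```python
def estimate_scale_ratio(hist, n=8):
    """Estimate scale ratio r from a distance histogram.

    hist maps distance -> count f(distance). n is the assumed block size
    of the original image. The weighted-mean refinement is an assumed form.
    """
    if not hist:
        raise ValueError("empty histogram")
    d_peak = max(hist, key=hist.get)          # most populated bin
    f_peak = hist[d_peak]
    f_left = hist.get(d_peak - 1, 0)
    f_right = hist.get(d_peak + 1, 0)

    # Case 1: no neighboring peak.
    if f_left == 0 and f_right == 0:
        return d_peak / n

    # Cases 2 and 3: second peak on the left or on the right.
    if f_left >= f_right:
        d_other, f_other = d_peak - 1, f_left
    else:
        d_other, f_other = d_peak + 1, f_right
    mean_distance = (d_peak * f_peak + d_other * f_other) / (f_peak + f_other)
    return mean_distance / n
```

With all distances equal to 12 and n = 8, the sketch returns 1.5. A smaller count in bin 13 pulls the estimate slightly above 1.5, and a smaller count in bin 11 pulls it slightly below.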
The scaling ratio r may be used for image post-processing, such as in the image enhancer 102 that performs blocking artifact reduction using a scaling ratio determined by an image scale ratio detector 104, according to an embodiment of the present invention. Utilizing the scaling ratio may improve the performance of blocking artifact reducers for enhancing the compressed image 304 that is then decompressed and enlarged for display, wherein blocking artifact reduction is applied to the decompressed and enlarged image 302 for reducing blocking artifacts before display on a display screen. A similar process can be used for determining a horizontal scale ratio using horizontal block boundaries in the image 302. Further, the vertical scale ratio and the horizontal scale ratio value can be combined to determine an overall scaling ratio for the decoded (uncompressed) enlarged image 302, relative to the original decoded image 304.
The de-blocking process uses a filter for removing artifacts across the boundary. The strength of filtering is adjusted based on the difference across the boundary. Therefore, placing the filter on the correct position is important. In the enlarged image, the blocking boundary is usually changed, and adjusting the filtering location allows proper functioning of the process.
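As a sketch of how a detected ratio might guide filter placement, the following computes the expected boundary columns in the scaled image. It assumes the original block size n is 8, that the image was scaled from its origin without cropping (so boundaries fall near multiples of n·r), and that positions are rounded to the nearest pixel. The function name and the `offset` parameter are hypothetical.

```python
def scaled_boundary_positions(length, r, n=8, offset=0.0):
    """Return expected block-boundary positions in a scaled image.

    length is the image width (or height) in pixels, r the detected scale
    ratio, n the assumed original block size, and offset an assumed shift
    of the block grid. Python's round() rounds exact halves to even.
    """
    if r <= 0 or n <= 0:
        raise ValueError("r and n must be positive")
    positions = []
    k = 1
    while True:
        p = round(offset + k * n * r)
        if p >= length:
            break
        positions.append(p)
        k += 1
    return positions
```

For a 40-pixel-wide image enlarged by r = 1.5, the expected boundaries are at columns 12, 24, and 36 rather than at multiples of 8.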
As is known to those skilled in the art, the aforementioned example architectures described above, according to the present invention, can be implemented in many ways, such as program instructions for execution by a processor, as logic circuits, as an application specific integrated circuit, as firmware, etc. The present invention has been described in considerable detail with reference to certain preferred versions thereof; however, other versions are possible. Therefore, the spirit and scope of the appended claims should not be limited to the description of the preferred versions contained herein.
Number | Date | Country | |
---|---|---|---|
20090245755 A1 | Oct 2009 | US |