This disclosure relates to downsampling processes and more particularly to systems and methods for using iterative refinement techniques as part of the downsampling process.
In image and video processing, images may be represented at many different resolutions—typically denoted by the number of pixels (samples) used to represent the image (i.e., image width×image height). The process of converting a sampled image from one resolution to another is generally termed resampling, and the process of converting a sampled image to a lower resolution is termed downsampling, or subsampling. Downsampling inherently reduces the amount of data required to represent an image, and so may be used to reduce associated storage, transmission, processing, or display requirements. Downsampling also inherently reduces the detail and information content of an image, so a downsampled image will generally appear to be more blurry to a human viewer than the original higher-resolution image if both images are displayed at the same overall physical size (i.e., in which case the individual pixels of the displayed downsampled image would be larger than those of the original, and thus be unable to represent fine detail). For a particular resolution reduction, the nature of the downsampling method determines the quality, as perceived by the human visual system (HVS), of the rendered image.
Many methods exist for downsampling images. These methods have a wide variety of quality characteristics. A very simple but low-quality downsampling method is Nearest-Neighbor. Higher quality techniques are generally based on higher-order sampling/interpolation methods (bilinear, bicubic, Lanczos, etc.). Usually, downsampler selection is based on a balance between computational cost and the desired visual appeal of the rendered downsampled images. In some situations, other criteria exist. For the concepts discussed in the above-identified co-pending patent application SYSTEMS AND METHODS FOR HIGHLY EFFICIENT COMPRESSION OF VIDEO, the primary criteria are 1) the upsampled version of the downsampled image is very close to the original input image, and 2) the process must be computationally efficient for a high-volume application, such as video stream processing.
One known method that would achieve both goals is the sinc filter. While accurate (i.e., will yield high quality upon upsampling), this method is computationally very expensive and is an idealized filter which can only be approximated for finite image resolutions. It is generally considered to be impractical for real time applications such as video stream processing.
Advantage is taken of the concept of Newton iteration to iteratively generate error-corrected downsampled images such that when upsampled with a specified upsampler, the final result very closely matches the original full-resolution image.
An implementation of this method requires a target upsampling method for which results are to be optimized (for example, a bicubic upsampler); a downsampling method appropriate for the required downsampling ratio (for example, a bilinear downsampler); an error measure method for determining how closely an upscaled result image matches the original image; and a stopping criterion.
The foregoing has outlined rather broadly the features and technical advantages of the present invention in order that the detailed description of the invention that follows may be better understood. Additional features and advantages of the invention will be described hereinafter which form the subject of the claims of the invention. It should be appreciated by those skilled in the art that the conception and specific embodiment disclosed may be readily utilized as a basis for modifying or designing other structures for carrying out the same purposes of the present invention. It should also be realized by those skilled in the art that such equivalent constructions do not depart from the spirit and scope of the invention as set forth in the appended claims. The novel features which are believed to be characteristic of the invention, both as to its organization and method of operation, together with further objects and advantages will be better understood from the following description when considered in connection with the accompanying figures. It is to be expressly understood, however, that each of the figures is provided for the purpose of illustration and description only and is not intended as a definition of the limits of the present invention.
For a more complete understanding of the present invention, reference is now made to the following descriptions taken in conjunction with the accompanying drawing, in which:
Process 103 upsamples said first downsample of said image stream (estimated result) to the same resolution as the original image, using the target upsampling method. This results in a first upsample estimated result.
Process 104 subtracts the upsampled estimated result from the original image I to create an ‘error image’ indicating the per-pixel error
Process 105 calculates the ‘error measure’ from the error image, according to the specified error measure method. Typical error measure methods include maximum absolute difference, average absolute difference, or average squared difference. The error measure is a single scalar number representing the degree of difference between the upsampled estimated result and the original image.
Process 106 determines if the stopping criterion has been satisfied. The stopping criterion can be any process which, given the error measure and the current iteration number, will determine whether it is time to terminate the process. Examples of stop criterion are: if a certain iteration number, say 3, has been met; or if the error measurement is under a certain value, say 5; or if the error measurement is diverging instead of converging.
If the stopping criterion of process 106 is satisfied then process 107 provides the best estimated result obtained so far. Other stopping criteria might be, when a specified error measure has been met, or when the error measure increases over an iteration.
If the stopping criterion of process 106 is not satisfied, then process 109 downsamples the error image to the same resolution as the estimated result. This is accomplished by using a downsampler of sufficiently high order such that it will consider all the high-resolution source pixels that overlap the destination low-resolution pixel. The result is a “correction image”.
Process 109 subtracts the correction image from the estimated result, to produce a newly refined estimated result. Process 110 increments the iteration number and the New EstResult is then used in process 103.
Although the present invention and its advantages have been described in detail, it should be understood that various changes, substitutions and alterations can be made herein without departing from the spirit and scope of the invention as defined by the appended claims. Moreover, the scope of the present application is not intended to be limited to the particular embodiments of the process, machine, manufacture, composition of matter, means, methods and steps described in the specification. As one of ordinary skill in the art will readily appreciate from the disclosure of the present invention, processes, machines, manufacture, compositions of matter, means, methods, or steps, presently existing or later to be developed that perform substantially the same function or achieve substantially the same result as the corresponding embodiments described herein may be utilized according to the present invention. Accordingly, the appended claims are intended to include within their scope such processes, machines, manufacture, compositions of matter, means, methods, or steps.
This application is related to commonly owned patent application SYSTEMS AND METHODS FOR HIGHLY EFFICIENT VIDEO COMPRESSION USING SELECTIVE RETENTION OF RELEVANT VISUAL DETAIL, U.S. patent application Ser. No. 12/176,374, filed on Jul. 19, 2008, Attorney Docket No. 54729/P012US/10808779; SYSTEMS AND METHODS FOR DEBLOCKING SEQUENTIAL IMAGES BY DETERMINING PIXEL INTENSITIES BASED ON LOCAL STATISTICAL MEASURES, U.S. patent application Ser. No. 12/333,708, filed on Dec. 12, 2008, Attorney Docket No. 54729/P013US/10808780; VIDEO DECODER, U.S. patent application Ser. No. 12/638,703, filed on Dec. 15, 2009, Attorney Docket No. 54729/P015US/11000742 and concurrently filed, co-pending, commonly owned patent applications SYSTEMS AND METHODS FOR HIGHLY EFFICIENT COMPRESSION OF VIDEO, U.S. patent application Ser. No. ______, Attorney Docket No. 54729/P016US/11000746; DECODER FOR MULTIPLE INDEPENDENT VIDEO STREAM DECODING, U.S. patent application Ser. No. ______, Attorney Docket No. 54729/P018US/11000748; SYSTEMS AND METHODS FOR CONTROLLING THE TRANSMISSION OF INDEPENDENT BUT TEMPORALLY RELATED ELEMENTARY VIDEO STREAMS, U.S. patent application Ser. No. ______, Attorney Docket No. 54729/P019US/11000749; SYSTEMS AND METHODS FOR ADAPTING VIDEO DATA TRANSMISSIONS TO COMMUNICATION NETWORK BANDWIDTH VARIATIONS, U.S. patent application Ser. No. ______, Attorney Docket No. 54729/P020US/11000750; and SYSTEM AND METHOD FOR MASS DISTRIBUTION OF HIGH QUALITY VIDEO, U.S. patent application Ser. No. ______, Attorney Docket No. 54729/P021US/11000751 all of the above-referenced applications are hereby incorporated by reference herein.