The present invention relates to the field of image processing.
The increase in the consumption of digital video has correspondingly accelerated development efforts to make video compression more efficient while maintaining image quality. Many proposed improvements for increasing efficiency are based upon algorithmic changes not associated with the type and makeup of particular images. As an example, the H.264 video encoding algorithm generally uses an 8×8 pixel block size for its basic computational unit. The 8×8 computational unit used for H.264 was set over twenty-five years ago when CPU processing was several orders of magnitude lower than what it is today. More recent standards such as H.265 standard makes use of a 64×64 pixel block size thereby increasing efficiency regardless of the type of images fed into the video encoder.
There are additional video encoding optimizing techniques which take advantage of the type of material being encoded. Many discrete cosine transformation (DCT) based encoders evaluate the amount of motion between frames and correspondingly adjust their quantizer in an effort to obtain the best balance between pixel block fidelity and adequately displaying motion.
U.S. Pat. No. 6,901,164 “Method for Automated High Speed Improvement of Digital Color Images” that issued on May 31, 2005 to Moredechai Sheffer, which is incorporated herein by reference in its entirety, describes methods for optimizing an image making use of the simultaneous processing of light and dark compression using Weber curves.
A particularly vexing problem associated with video encoders is the processing of large areas of equal or near equal luminance within an image. Small changes in image luminance levels on defined boundaries are easily discerned by the human eye. If a group of pixels within an image is generally black but their levels vary slightly and randomly distributed throughout the group, the human eye will not necessarily see this as unnatural or offensive. However, if a small uniform change in black level occurs at a well-defined boundary, the human eye will readily discern the line and may cause the perception of lower quality. Unfortunately for many video encoders, the use of fixed blocks for pixel processing creates regular pixel boundaries within compressed video images.
As outlined in U.S. Pat. No. 6,901,164, pixel luminance levels may be adjusted to better match the perception of the human eye.
Using preprocessing to enhance image intelligibility is beneficial for a variety of purposes. The DRC process described in U.S. Pat. No. 6,901,164 creates images which can be viewed on displays which require less energy. These images may also be capable of being compressed to a higher level and therefore can provide transport bandwidth savings. However, when using preprocessing with a video encoder, the preprocessor may adjust luminance levels which cause the video encoder to generate bit streams with enhanced visual artifacts due to the algorithm of the video encoder. Alternately, the use of a preprocessor may adjust luminance levels which cause the video encoder to increase bit output bit rates.
Various embodiments make use of techniques for signaling between a dynamic range compressor and a video encoder to achieve optimal visual quality and bit rates for encoded bit streams.
A system for adjusting the compression of one or more digital images comprising an image luminance compressor with video encoder control outputs and a video encoder with control inputs coupled to the image luminance compressor control outputs.
The features and advantages described in the specification are not all inclusive and, in particular, many additional features and advantages will be apparent to one of ordinary skill in the art in view of the drawings, specification, and claims. Moreover, it should be noted that the language used in the specification has been principally selected for readability and instructional purposes, and may not have been selected to delineate or circumscribe the inventive subject matter.
Embodiments are illustrated by way of example, and not by way of limitation, in the figures of the accompanying drawings and in which like reference numerals refer to similar elements and in which:
The figures depict various embodiments for purposes of illustration only. One skilled in the art will readily recognize from the following discussion that alternative embodiments of the structures and methods illustrated herein may be employed without departing from the principles of the embodiments described herein.
Embodiments are now described with reference to the figures where like reference numbers indicate identical or functionally similar elements.
In the following description and in the accompanying drawings, specific terminology and drawing symbols are set forth to provide a thorough understanding of the embodiments. In some instances, the terminology and symbols may imply specific details that are not required to practice the embodiments. For example, a processing block may either be implemented in software or hardware. Digital representations of numerical quantities are not limited to specific number of bit accuracies. Computations required to implement the embodiments are not limited to fixed or floating point or any combination thereof. In the following description and block diagrams, reference is made to color components Cj, Cn, Cm. Representations of color spaces are not limited to three components. Representations of color space using luma (luminance) and chroma are ubiquitous and are anticipated. One common representation of images utilizes separate luminance (Y) and chroma (U & V) while RGB is another. In the following description, references made to Digital Images include both still and moving. In the following descriptions, operator inputs such as XU, KLU, KDU for the Dynamic Range Compressor are not limited to these and may include other controls.
Feedback 15 path within a Dynamic Range Compressor 13. The Feedback 15 information is comprised of parameters from the Video Encoder 14 which conveys to the Dynamic Range Compressor 13 the nature of the encoding process taking place within the Video Encoder. In one embodiment Feedback 15 information drives the DRC Setting Processor 20 logic of the Dynamic Range Compressor and is comprised of the amount of motion vectors needed by the Video Encoder 14 (
The processing and exchange of feedforward and feedback information between a Dynamic Range Compressor 13 (
Feedforward 15 information to the Video Encoder 14 for which it makes its best attempt at optimizing the encoding process. Feedback 15 may be available to the Dynamic Range Compressor 13 but is not used.
Intelligent arrangement of image luminance prior to compression provides multiple benefits. These include, but are not limited to, bandwidth savings, enhanced visual fidelity and potential display energy savings on the presentation device. By expanding the intelligence of the luminance arrangement to include participation by the compression block, further increases in benefits are achieved.
Reference in the specification to “one embodiment” or to “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiments is included in at least one embodiment. The appearances of the phrase “in one embodiment” or “an embodiment” in various places in the specification are not necessarily all referring to the same embodiment.
Some portions of the detailed description are presented in terms of algorithms and symbolic representations of operations on data bits within a computer memory. These algorithmic descriptions and representations are the means used by those skilled in the data processing arts to most effectively convey the substance of their work to others skilled in the art. An algorithm is here, and generally, conceived to be a self-consistent sequence of steps (instructions) leading to a desired result. The steps are those requiring physical manipulations of physical quantities. Usually, though not necessarily, these quantities take the form of electrical, magnetic or optical signals capable of being stored, transferred, combined, compared and otherwise manipulated. It is convenient at times, principally for reasons of common usage, to refer to these signals as bits, values, elements, symbols, characters, terms, numbers, or the like. Furthermore, it is also convenient at times, to refer to certain arrangements of steps requiring physical manipulations or transformation of physical quantities or representations of physical quantities as modules or code devices, without loss of generality.
However, all of these and similar terms are to be associated with the appropriate physical quantities and are merely convenient labels applied to these quantities. Unless specifically stated otherwise as apparent from the following discussion, it is appreciated that throughout the description, discussions utilizing terms such as “processing” or “computing” or “calculating” or “determining” or “displaying” or “determining” or the like, refer to the action and processes of a computer system, or similar electronic computing device (such as a specific computing machine), that manipulates and transforms data represented as physical (electronic) quantities within the computer system memories or registers or other such information storage, transmission or display devices.
Certain aspects of the embodiments include process steps and instructions described herein in the form of an algorithm. It should be noted that the process steps and instructions of the embodiments can be embodied in software, firmware or hardware, and when embodied in software, could be downloaded to reside on and be operated from different platforms used by a variety of operating systems. The embodiments can also be in a computer program product which can be executed on a computing system.
The embodiments also relate to an apparatus for performing the operations herein. This apparatus may be specially constructed for the purposes, e.g., a specific computer, or it may comprise a computer selectively activated or reconfigured by a computer program stored in the computer. Such a computer program may be stored in a computer readable storage medium, such as, but is not limited to, any type of disk including floppy disks, optical disks, CD-ROMs, magnetic-optical disks, read-only memories (ROMs), random access memories (RAMs), EPROMs, EEPROMs, magnetic or optical cards, application specific integrated circuits (ASICs), or any type of media suitable for storing electronic instructions, and each coupled to a computer system bus. Memory can include any of the above and/or other devices that can store information/data/programs and can be transient or non-transient medium, where a non-transient or non-transitory medium can include memory/storage that stores information for more than a minimal duration. Furthermore, the computers referred to in the specification may include a single processor or may be architectures employing multiple processor designs for increased computing capability.
The algorithms and displays presented herein are not inherently related to any particular computer or other apparatus. Various systems may also be used with programs in accordance with the teachings herein, or it may prove convenient to construct more specialized apparatus to perform the method steps. The structure for a variety of these systems will appear from the description herein. In addition, the embodiments are not described with reference to any particular programming language. It will be appreciated that a variety of programming languages may be used to implement the teachings of the embodiments as described herein, and any references herein to specific languages are provided for disclosure of enablement and best mode.
In addition, the language used in the specification has been principally selected for readability and instructional purposes, and may not have been selected to delineate or circumscribe the inventive subject matter. Accordingly, the disclosure of the embodiments is intended to be illustrative, but not limiting, of the scope of the embodiments, which is set forth in the claims.
While particular embodiments and applications have been illustrated and described herein, it is to be understood that the embodiments are not limited to the precise construction and components disclosed herein and that various modifications, changes, and variations may be made in the arrangement, operation, and details of the methods and apparatuses of the embodiments without departing from the spirit and scope of the embodiments as defined in the appended claims.
This application claims priority from U.S. provisional application No. 62/066,197 filed on Oct. 20, 2014 which is all incorporated by reference herein in its entirety.
Number | Date | Country | |
---|---|---|---|
62066197 | Oct 2014 | US |