This disclosure is in the technical field of visual image processing.
Visual image correction techniques (including de-ghosting) can be used with high dynamic range (HDR) images and other scenes generated from multiple visual images with different image sensor exposure times. The most common way to create an HDR scene is to capture (and then combine) multiple individual images depicting a visual scene which are created using different exposure times. Movement of an object in the scene occurring between different captures of an image (or movement of the camera sensor itself) may cause unwanted visual artifacts (such as ghosting or thin line) which should not be present but which nonetheless appear in the resulting visual image. Motion-related artifacts may also occur in de-noising algorithms (or other techniques) requiring a combination of multiple renderings of a scene to form a single visual image.
Motion compensation uses precise detection of the motion and consistent correction over the image when combining multiple visual images into a single resulting image. Detecting unwanted visual artifacts based solely on light radiation intensity differences may allow false negatives and/or false positives; e.g., motion-related artifacts with similar intensities might remain in the resulting image despite computer processing designed to remove them and/or newly-appearing objects (such as those created by a noisy camera sensor environment) might be detected incorrectly as motion.
Many techniques used for visual image correction utilize storing of the full image in computer buffer memory prior to removing such artifacts and can be performed only during post-image capture processing. Other real-time visual image processing techniques use special computer hardware to accomplish correction and sacrifice optimum image resolution. An alternate technique assigns to a visual image location (e.g., pixel) the light radiation intensity value(s) for which a majority of the individual image(s) are in agreement but this method uses a large number of images with at least some level of redundancy between them.
Exemplary embodiments of the invention as described herein generally provide for detecting and correcting unwanted sensor artifacts arising from processing techniques utilizing multiple visual images while possessing information relating to a localized area of the image that can be accomplished during scanning of the image and independent of the number of images or the computer processing method used to acquire them.
According to an exemplary aspect of the invention, a set of two or more visually inconsistent images are combined as an input from an image sensor and a single corrected image is output to eliminate any artifact(s) appearing in one (or more) of the input image(s) as follows:
According to a further exemplary aspect of the invention, the detection and correction steps are divided into sub-steps relating to discovery of a new object and/or simple motion. If a new object is detected then a new object correction sub-step can find a stable image to use while ignoring others. If simple motion is detected then a simple motion correction sub-step can extend use of at least one previous image selection decision to maintain consistency.
According a to further exemplary aspect of the invention, there is provided a computer device containing software program instruction(s) executable by the computer to perform at least the foregoing operation(s) for detecting and correcting unwanted artifact(s) arising from technique(s) utilizing multiple visual image(s) input from a sensor and processed into an output image depicting a scene.
The subject matter which is regarded as the invention is particularly pointed out and distinctly claimed in the concluding portion of the specification, as set forth in claims which are to be interpreted as broadly as the law permits to cover the full scope of the invention, including all equivalents thereto.
An image detection and correction method according to an exemplary embodiment disclosed herein assumes N>1 images S1 . . . SN taken from the same visual scene with different exposure times E1 . . . EN (respectively) and for an image sensor signal (S) there is assigned (as Sx,y) the light radiation intensity value of the pixel at location (x, y) in the subject image array. For simplicity in the examples below the following assumptions are made:
for some constant C>0.
In the examples below, the visual images 10 are digitally scanned in a row-by-row (e.g., raster) arrayed order where the steps of the scan use only a small localized exposure area (e.g., patch) of each image around the scanned pixel. The edge size of the patch (in pixels) is denoted as P (such that the whole pixel patch for example would be of array size P×P) where the reference pixel 30 around which the patch is defined is substantially in the center of the pixel array 20.
In the examples below, two steps are used to correct an unwanted artifact with the first being its detection and the second being correction of the visual image in the detected region as follows:
For example, the following HDR algorithm can optionally be used to minimize the output image signal-to-noise ratio (SNR) by examining as the output pixel at point (x,y) the value
where (K)=argmaxk{Sx,yk is not burnt} is the numeric index of an input image with a maximum exposure time (E) that yields a defined useful light radiation intensity value for (S) at pixel (x,y) and (i,j,k) are natural numeric counting value(s) defined within the range set by the equation and where the algorithm assumes depiction of the same visual scene in all input images without any motion artifacts occurring.
Flowcharts depicting a general process for detecting and correcting at least one artifact arising from processing multiple visual images depicting a scene according to the exemplary embodiment(s) described herein are shown in
Given two visually inconsistent images having light radiation intensity value(s) (S) and (R) a decision criterion for new object detection around reference pixel (i,j) within a pixel patch array of size (P×P) according to an exemplary embodiment includes:
In this equation To is a pre-defined light radiation intensity variance threshold and the noise reduction parameter w can be used to reduce the effect of camera noise for some for some 0≦w≦P (in experiments
within a pixel patch of array size (P×P); e.g., calculating the variance is based upon averaging light radiation intensity difference value(s) for the pixel(s) in the array patch surrounding reference pixel (i,j) when Do(i,j) is calculated over natural numeric counting value(s) (m, k, n, t) that are defined (in terms of w and P) within the range set by the equation. In extending the decision to more than two images, the value DoS,R (i,j) can be calculated for different combined pair(s) of images (S) and (R) and an aggregation function (such as max or average or weighted average value) can be performed over substantially all possible image combination(s).
Given two visually inconsistent images having light radiation intensity value(s) (S) and (R) a decision criterion for simple motion detection around reference pixel (i,j) within a pixel patch array of size (P×P) according to an exemplary embodiment includes:
In this equation TS is a pre-defined light radiation intensity aggregation threshold and the noise reduction parameter w can again be used to reduce the effect of camera noise for some 0≦w≦P (in experiments w=P) within a pixel patch of array size (P×P) when Ds(i,j) is calculated for reference pixel (i,j) over natural numeric counting value(s) (m, k, n, t) that are defined (in terms of w and P) within the range set by the equation. Arriving at a single value for Ds (i,j) in extending the decision to more than two images again involves use of any known aggregation function (such as max or average or weighted average value) that can be performed over substantially all possible image combination(s).
As mentioned above and as shown in
In the new object correction mode for an image (Si) a threshold (Ui) is set to indicate a maximum permissible normalized overall intensity value for Si; e.g., above this threshold the image is “burnt” (from lengthy exposure time) and cannot be chosen. A lower threshold (Li) is also set below which the image is too “noisy” (from insufficient exposure time) and cannot be chosen. Thus given a set of images S1 . . . SN at a reference pixel position (i,j) the value chosen for the output image according to an exemplary embodiment includes:
In these equation(s) neighborhoodij(S) is an averaging of the light radiation intensity value(s) of the pixel patch array surrounding reference pixel (i,j) which is done to reduce noise artifacts in sensor image signal (S).
The simple motion correction step can be used to handle a case where artifacts are suspected but not confirmed to exist. This step may be also referred to as a propagation step to extend use of at least one previous image selection decision from previously defined pixel(s); e.g., a decision on the light radiation intensity value of the pixel at position (x,y) can be determined by previously known value(s) of pixel(s) at position(s) (x−1,y),(x,y−1) and (x−1, y−1) for example.
To propagate a previous decision to a pixel (i,j) first note that simple motion detection for a pixel at position (x,y) such that x<i,y<j was already determined by some policy
for (N) examined image(s) having corresponding light radiation intensity value(s) (S1 . . . SN) and for some weighting function (W1 . . . WN) such that (Σk Wk=1) when (pxy) is calculated over natural numeric counting value (k) that is defined (in terms of N) within the range set by the equation. The propagation policy at the reference pixel (i,j) according to an exemplary embodiment then includes:
In these equation(s), parameter (q≧1) defines the number of previously examined pixel(s) at reference position (i, j) and can be set to a small value, e.g., 3 and (wxyij≧0) is another weighting function between the pixel at position (x, y) and position (i,j): for fast approximate results this value could be simply set to 1 whereas for more accurate results this value can represent the distance between the two pixels (smaller distance−higher value) and/or the light radiation intensity difference(s) (e.g., if |Sxy−Sij|≦|Sab−ij| then wxyij>wabij) for pixel indice(s) x, aα<i and y, b<j.
As shown with reference to
It will be understood by one skilled in the art that the present inventive concept(s) are only by way of example described and illustrated by reference to the foregoing description taken in conjunction with the accompanying drawings; and that the described feature(s), structure(s) and/or characteristic(s) may be combined and arranged and designed in different ways and that modification(s) and/or change(s) can be made to include device(s), system(s) and/or processe(s) consistent with the inventive concept(s) as embodied in the following claims, which are to be interpreted as broadly as the law permits to cover the full scope of the invention, including all equivalents thereto.
This application claims the benefit under 35 U.S.C. §119(e) to U.S. Provisional Patent Application No. 62/362,766, filed on Jul. 15, 2016 in the United States Patent & Trademark Office, the disclosure of which is incorporated by reference herein in its entirety.
Number | Date | Country | |
---|---|---|---|
62362766 | Jul 2016 | US |