Embodiments of the invention generally relate to video signal processing, and in particular to processing video signals to remove artifacts caused by low-light noise.
Low-light images are especially susceptible to corruption from noise caused by light-detecting sensors (i.e., low-light artifacts). For example, a video or still camera may capture undesirable grains or discolorations in low-light conditions. This noise may lead to uncorrelated pixels and, as a result, reduced compression efficiency for video coding algorithms (e.g., MPEG4 and H.264). Many applications, such as security cameras, capture low-light images and require a large amount of storage space for retaining those images, and any decrease in the required storage space may lead to a more cost-effective application, an increase in the number of images or frames of video stored, or reduced network traffic for transporting the images. Thus, efforts have been made to detect and eliminate low-light noise.
Previous efforts (such as transform-domain methods, DCT, wavelet, or other statistical methods), however, suffer from drawbacks. These methods are computationally intensive and require a significant amount of computing resources, which may not be available on low-power, portable, or other devices. Furthermore, these methods are not adjustable based on available resources or the complexity of the source image, further wasting resources on simple images or during high-load conditions in which the additional resources may not be necessary or available.
In general, various aspects of the systems and methods described herein use a Gaussian distribution and correlation technique to remove uncorrelated low-light noise from images taken from video or still cameras. The images may be split into luma and chroma components and filtered separately. Different filters may be used depending on the complexity of the images and the resources available. The filters may adapt to variations in the image by using edge-detection and dilation filters, thereby preserving high-frequency details at feature edges. Furthermore, the image may be divided into a plurality of sections, filtered separately, and re-combined.
In general, in one aspect, a system for removing noise from a low-light image includes a division circuit, a filter circuit, and a recombination circuit. The division circuit divides the image into a plurality of image regions. The filter circuit creates a plurality of filtered image regions by applying a first filter to luma components of each of the plurality of image regions. The recombination circuit combines the plurality of filtered image regions into a filtered image.
In various embodiments, the filter circuit applies the first filter to one image region at a time. Alternatively, the filter circuit may apply the first filter to more than one image region at a time. The image region may include a square tile, rectangular tile, row, or column. The first filter may be a low-pass averaging filter, median filter, and/or adaptive filter; the adaptive filter may include a morphology filter and/or a comparative filter. A second filter may filter a chroma component of each of the plurality of image regions, and the recombination circuit may combine the filtered luma component of each image region with a corresponding filtered chroma component of each image region. The recombination circuit may store history information related to an image block, image row, and/or image column.
In general, in another aspect, a method removes noise from a low-light image. The image is divided into a plurality of image regions. A first filter, applied to luma components of each of the plurality of image regions, creates a plurality of filtered image regions. The plurality of filtered image regions is combined into a filtered image.
In various embodiments, the first filter is applied to each image region in series. Alternatively, the first filter may be applied to the plurality of image regions in parallel. Applying the first filter may include filtering the image region, median filtering the image region, and/or adaptively filtering the image region (which may include comparing a pixel against neighboring pixels and optionally replacing the pixel). A chroma component of each of the plurality of image regions may be filtered. A filtered luma component of each image region may be combined with a corresponding filtered chroma component of each image region. History information related to an image block, image row, and/or image column may be stored.
These and other objects, along with advantages and features of the present invention herein disclosed, will become more apparent through reference to the following description, the accompanying drawings, and the claims. Furthermore, it is to be understood that the features of the various embodiments described herein are not mutually exclusive and may exist in various combinations and permutations.
In the drawings, like reference characters generally refer to the same parts throughout the different views. In the following description, various embodiments of the present invention are described with reference to the following drawings, in which:
A network of switches 108 selects one of three filters 110, 112, 114 for the brightness component 104 of the image 102. The system 100 may include any number of brightness-component filters, however, including a single filter, and the current invention is not limited to any particular number or type of filter. In one embodiment, a low-pass averaging filter 110 may be selected by the switches 108 if the source image 102 is simple, if only a small degree of filtering is required, and/or if system resources are limited. The low-pass averaging filter 110 attenuates high-frequency signals in the brightness component 104, while allowing low-frequency signals to pass. In one embodiment, the low-pass averaging filter 110 performs a blur function on the brightness component 104.
A median filter 112 may be used to filter the brightness component 104 for images of medium complexity, if a medium amount of filtering is desired, and/or if an average amount of system resources is available. As one of skill in the art will understand, the median filter 112 processes the brightness component 104 pixel by pixel and replaces each pixel with the median of it and surrounding pixels. For example, the median filter 112 may consider a 3×3 window of pixels surrounding a pixel of interest (i.e., nine total pixels). The median filter 112 sorts the nine pixels by their brightness values, selects the value in the middle (i.e., fifth) position, and replaces the pixel of interest with the selected value. In one embodiment, the filter 112 is a rank or rank-median filter, and may select a pixel in any position in the sorted list of pixels (e.g., the third or sixth position). In one embodiment, if the absolute difference between the selected value and the original value is larger than the threshold, the original value is kept; if the difference is smaller than or equal to the threshold, the ranked value is assigned.
An adaptive filter 114 may be used to filter the brightness component 104 for images of high complexity, if a large amount of filtering is desired, and/or if a large amount of system resources is available. The adaptive filter 114 selects a filtering technique based on the dynamically determined characteristics of the brightness component 104, as explained in greater detail below.
A low-pass averaging filter 116 (e.g., a 5×5 low-pass averaging filter) may be used to filter the color component 106. In one embodiment, the color component 106 is less complex than the brightness component and/or is less affected by low-light noise and thus requires less filtering. The filter 116 may be a temporal-averaging filter with sum-of-absolute-differences or any other type of similar filter. The system 100 may include more than one color-component filter 116, and one of the plurality of color-component filters 116 may be selected based on the complexity of the color component 106, the availability of system resources, and/or a desired level of filtering quality.
A dilation-based filter 304 modifies the output of the edge-difference filter 302 by distributing the results of the edge detection to neighboring pixels. The dilation-based filter may be modified to ease implementation on, for example, embedded and/or DSP platforms. For example, if four pixels in a row are dilated, the four pixels may be shifted, depending on the pixel location, to align with a word boundary. In various embodiments, the dilation-based filter 304 is a morphology filter, a 3×4 dilation filter, or a 4×3 dilation filter. The dilation-based filter 304 may expand, or dilate, regions of pixels designated as edge pixels to incorporate other, nearby pixels. For example, a pixel having an intensity different from its neighbors may be the result of low-light noise; but, if the location of the pixel is near a detected edge, the pixel may instead be the result of a real physical feature of the captured image. The dilation-based filter 304, by correlating such pixels occurring near detected edges to edge pixels, prevents their erroneous designation as noise-produced pixels.
Each non-edge pixel in the dilated luma component 104 is then analyzed against a neighboring region of pixels (e.g., a neighboring 3×3 block of pixels). Depending on the differences between the analyzed pixel and its neighbors, as computed by a Gaussian distribution engine 306, the pixel is assigned a new value according to assignment units 308-312 and output by an output unit 314.
In greater detail, the Gaussian distribution engine 306 computes a mean and a variance of the Gaussian distribution of the block or window surrounding the analyzed pixel. The deviation of the pixel from the mean of the block is computed and compared with the variance. If the difference between the pixel and the variance is much greater than the mean (e.g., greater than three times the standard deviation), the pixel is likely the result of low-light noise. In this case, the median block 308 replaces the pixel with the median of the block of pixels. If the difference between the pixel and the variance is near the mean, the low-pass filter 310 replaces the analyzed pixel with the result of low-pass filtering the block of pixels. If the difference between the pixel and the variance is less than the mean, the pixel block 213 passes the analyzed pixel to the output block 314 unchanged.
In general, the algorithm utilized by the assignment units 308-312 may be generalized by the following equations:
If{(Analyzed Pixel)−(Mean of Block of Pixels)}>N×(Variance of Block of Pixels):
Output=Median of Block of Pixels (1)
If{(Analyzed Pixel)−(Mean of Block of Pixels)}>M×(Variance of Block of Pixels):
Output=Result of Low-Pass Filter of Block of Pixels (2)
If{(Analyzed Pixel)−(Mean of Block of Pixels)}>P×(Variance of Block of Pixels):
Output=Original Analyzed Pixel (3)
wherein P≦M≦N. That is, the output 314 is assigned the median 308 for large differences, the low-pass filter 310 for medium differences, and the original pixel 312 for small differences. In one embodiment, the operations performed by the above equations (1)-(3) are executed by specially allocated hardware. In another embodiment, the median operation is performed by the median filter 112 and low-pass filtering is performed by the low-pass averaging filter 110, as shown in
In another example, another pixel 414 is analyzed and compared to its surrounding pixels 416. Here, because the difference between the analyzed pixel 414 and the mean of the block of pixels 412 is less than the first threshold N but greater than a second threshold M when compared to the variance of the block of pixels 412, the pixel 414 is replaced with the result of low-pass filtering the block 416. Finally, because the difference between a third analyzed pixel 418 and the mean of its surrounding block of pixels 420 is much less than a threshold P when compared to the variance of the block of pixels 420, the pixel 418 remains unchanged.
In one embodiment, the above-described system 300 analyzes every pixel in the luma component 104. In other embodiments, the system 300 analyzes only a subset of the total pixels in the luma component 104. For example, the system 300 may analyze only even-numbered pixels (e.g., every second pixel) in the luma component 104. The result of analyzing an even-numbered pixel may be applied not only to that pixel itself, but also to a neighboring odd-numbered pixel (e.g., a pixel adjacent to the analyzed even-numbered pixel in the same row). Because the two pixels are neighbors, the result computed for one pixel is likely to be similar to the uncomputed result of the neighboring pixel, and applying the analyzed pixel's result to both pixels may produce only a small error. Other subsets of pixels may be chosen for analysis, such as odd pixels, every Nth pixel, diagonal pixels, or rows/columns of pixels. The analyzed pixels may constitute 50% of the total pixels, as in the example above, or any other percentage of total pixels.
In one embodiment, the system 600 may be used to divide an image into a number of regions that corresponds to a number of available filter circuits 604. Each filter circuit 604 may include a system 100, as illustrated in
In another embodiment, only one filter circuit 604 is used to process each image region in series. In this embodiment, the size of the image region may be defined by an amount of memory or other storage space available and/or the capabilities of the filter circuit 604. The size of the region may be adjusted to consume more or fewer resources, depending on the constraints of a particular application. For example, an application having very limited memory may require a small region. History information for rows and columns of the regions or image may be stored and managed to ease data movement when switching and/or combining image regions.
Applying the first filter may include low-pass filtering the region, median filtering the region, and/or adaptively filtering the region, as described above with reference to
Embodiments of the present invention may be provided as hardware, software, and/or firmware. For example, the systems 100, 300, 600 may be implemented on an embedded device, such as an ASIC, FPGA, microcontroller, or other similar device, and included in a video or still camera. In other embodiments, elements of the systems 100, 300, 600 may be implemented in software and included on a desktop, notebook, netbook, or handheld computer. In these embodiments, a webcam, cellular-phone camera, or other similar device may capture images or video, and the systems 100, 300, 600 may remove low-light noise therefrom. The present invention may further be provided as one or more computer-readable programs embodied on or in one or more articles of manufacture. The article of manufacture may be any suitable hardware apparatus, such as, for example, a floppy disk, a hard disk, a CD ROM disk, DVD ROM disk, a Blu-Ray disk, a flash memory card, a PROM, a RAM, a ROM, or a magnetic tape. In general, the computer-readable programs may be implemented in any programming language. Some examples of languages that may be used include C, C++, or JAVA. The software programs may be further translated into machine language or virtual machine instructions and stored in a program file in that form. The program file may then be stored on or in one or more of the articles of manufacture.
Certain embodiments of the present invention were described above. It is, however, expressly noted that the present invention is not limited to those embodiments, but rather the intention is that additions and modifications to what was expressly described herein are also included within the scope of the invention. Moreover, it is to be understood that the features of the various embodiments described herein were not mutually exclusive and can exist in various combinations and permutations, even if such combinations or permutations were not made express herein, without departing from the spirit and scope of the invention. In fact, variations, modifications, and other implementations of what was described herein will occur to those of ordinary skill in the art without departing from the spirit and the scope of the invention. As such, the invention is not to be defined only by the preceding illustrative description.