This invention relates to determining noise characteristics of a video signal.
When attempting to filter out random noise from images, particularly video, it is advantageous to know the characteristics of the noise present. This knowledge allows a system to remove the noise more effectively while minimising any unwanted artefacts in the processed image. Noise may be characterised by its amplitude and a histogram of occurrences of amplitude ranges.
One method frequently used to characterise temporal noise is to subtract spatially co-located picture elements, hereinafter referred to as pels, in successive images to form a difference picture. However, this estimate of noise is greatly affected by objects in motion between successive pictures and produces an unreliable result which affects adversely any subsequent process that depends upon this measure of noise. A confusion of noise and movement is caused by their sharing a three dimensional spectral space occupied by a whole signal. An improvement can be effected by a noise estimation measurement method that avoids regions of the spectrum most likely to be occupied by signal spectral components caused by motion when seeking to reduce noise.
It is an object of the present invention at least to ameliorate the aforesaid shortcomings in the prior art
According to a first aspect of the present invention there is provided a method of determining noise characteristics of a video signal comprising the steps of: forming an array of luminance and/or chrominance differences between picture elements of successive frames; determining an absolute difference value between a luminance and/or chrominance difference of a picture element and an average of the luminance and/or chrominance differences of a plurality of neighbouring picture elements; and counting occurrences of predetermined ranges of the difference value.
Conveniently, the absolute difference value is one of the square of the means of the differences and the positive root of the square of the means of the differences.
Conveniently, the plurality of neighbouring picture elements comprises the eight most immediate neighbouring picture elements.
Advantageously, the step of counting comprises storing at an address in memory a number of occurrences of the difference value within a predetermined range.
Conveniently, the method further comprises forming a histogram of a number of occurrences in the predetermined ranges of the difference value.
Preferably, for detecting whether a picture element is at an edge of a frame, the method comprises the further initial steps of: subtracting luminance and/or chrominance values of each of the pels surrounding a central pel being processed from a luminance and/or chrominance value of the central pel to obtain luminance and/or chrominance value differences; summing the luminance and/or chrominance value differences to obtain a summation; and determining whether the summation exceeds a threshold and if not performing the steps of claim 1 and if so sorting the difference values to determine a subset of a predetermined size of the smallest differences from the luminance and/or chrominance of the central picture element and using the subset as the neighbouring picture elements.
Conveniently, the subset of a predetermined size comprises four luminance differences.
According to a second aspect of the invention, there is provided an apparatus arranged to determine noise characteristics of a video signal comprising: subtraction means arranged to form an array of luminance and/or chrominance differences between picture elements of successive frames; processing means arranged to determine an absolute difference value between a luminance and/or chrominance difference of a picture element and an average of the luminance and/or chrominance differences of a plurality of neighbouring picture elements; and counting means arranged to count occurrences of predetermined ranges of the difference value.
Conveniently, the absolute difference value is one of the square of the means of the differences and the positive root of the square of the means of the differences.
Conveniently, the plurality of neighbouring picture elements comprises the eight most immediate neighbouring picture elements.
Advantageously, the counting means comprises a memory arranged to store at an address a number of occurrences of the difference value within a predetermined range.
Conveniently, the apparatus further comprises plotting means arranged to form a histogram of a number of occurrences in the predetermined ranges of the difference value.
Preferably, to detect whether a picture element is at an edge of a frame, the apparatus further comprises: subtraction means arranged to subtract luminance and/or chrominance values of each of the pels surrounding a central pel being processed from a luminance and/or chrominance value of the central pel to obtain luminance and/or chrominance value differences; summing means arranged to sum the luminance and/or chrominance value differences to obtain a summation; and processing means arranged to determine whether the summation exceeds a threshold and if so sorting the difference values to determine a subset of a predetermined size of the smallest differences from the luminance and/or chrominance of the central picture element and using the subset as the neighbouring picture elements.
Conveniently, the subset of a predetermined size comprises four luminance differences.
According to a third aspect of the invention, there is provided a computer program comprising code means for performing all the steps of the method described above, when the program is run on one or more computers.
Other aspects and features of the present invention will become apparent to those ordinarily skilled in the art upon review of the following description of specific embodiments of the invention in conjunction with the accompanying figures.
Embodiments of the present invention will now be described, by way of example only, with reference to the accompanying drawings, in which:
Throughout the description, identical reference numerals are used to identify like parts.
Region 1 represents a region normally occupied by moving picture information, as it generally has lower spatial frequency with reduced occupancy at high temporal frequencies. Region 2 is a distinct area located at high spatial and low temporal frequencies. Region 3 is the space where both high spatial and temporal frequencies are found. Generally speaking, it is assumed that the noise is “white”, that is, it is evenly distributed throughout the Nyquist cube.
So, compared to the prior art, a more reliable estimate of the noise present in a signal can be obtained by looking in regions of the spectrum where the video signal energy is consistently reduced. This occurs at high frequencies in both the spatial and temporal dimensions where an accumulation of the signal energy present correlates more closely with the noise and not the image and motion present.
Processes, both natural and synthetic, within the capture and processing of a video channel mean that the spatial noise tends to be “pink”, that is, it is attenuated towards the high frequencies. However, this attenuation tends to be constant, and the high frequency components are only attenuated to a limited degree. It is straightforward to compensate for the attenuation by means of a simple scaling factor. The important feature of note is that in Region 3 there is always noise present and occasionally signal so that noise predominates.
The following method characterises the image noise from the high frequency Region 3 of
Noise array calculation
Referring to
dn(x,y)=fn(x,y)−fn-1(x,y) Equation 1
i.e. the difference between co-located pixels in the current and previous frames.
Assuming a difference between a picture element and its immediately surrounding picture elements represents noise, the absolute noise amplitude array An={an(x,y)} can be calculated as:
This equation sets an(x,y) equal to the absolute value of the mean of the difference between the sum of the 9 pels in the chosen array adjacent to the pel at position (x,y) and 9 times the value of that pel, there being 9 pels in the sum but only 8 to average. The region size of 9 pels is used for this example but other convenient array sizes may also be used.
However, alternative measures can be used instead, for example, the square of means of the differences between each of the 8 pels in the chosen array adjacent to the pel at position (x,y), as illustrated by equation 3. The positive root of the square of the means could be an equally appropriate alternative as in Equation 4. The size of the array in this example is 9 pels but could be any convenient value.
Calculation of Difference Distribution
The difference distribution P(v) is a histogram i.e. a count of the number of times each possible value v of an occurs within the noise amplitude array An.
where X and Y are the number of pels in each dimension of the image.
Note that if chrominance information is processed as per equations 1 & 2 a separate difference distribution would be accumulated for those results with the assumption that further downstream processing would act separately on these luminance and chrominance noise difference distributions.
In
At the end of processing an image, the memory contains data for a histogram, step 69, of the occurrences of values of the difference which may be used to provide an estimate of the high frequency noise. This may be read 41 and used in other algorithms and systems, not described herein, e.g. to control the noise reduction profile used in the Applicant's application 0610967.2 filed on 2 Jun. 2006.
It will be understood that the method described works in two stages. The first stage is to filter the temporal domain. By subtracting one image from the next a difference picture is obtained.
The second stage of the method discriminates spatial high frequencies from movement by relying on a limited bandwidth mandated in the specifications for sampling of digital video and the nature of most images.
As sampling to digital requires that the bandwidth be limited to below the Nyquist rate, it can be seen that for many points this crude interpolation gives a reasonable approximation of the pel between the examined samples.
When this example is carried on to two-dimensional images the pels in all directions from the pel under operation are used to form the estimated value. This means that edges from any relative direction will be accounted for in the predicted value. The calculation of this two-dimensional average can be thought of as producing a three-dimensional mesh or surface in which the fundamental limit to the bandwidth of the signal is used to form an estimated value to the central pel.
Noise is assumed to come into this process by randomly moving the value of samples with respect to their expected values.
The action of subtracting the expected value from the actual value can be assumed to give an estimate of the noise. The limitation on this method can be seen in the graph of
Two methods to improve the estimate are to do a better interpolation, possibly using an FIR filter with a number of taps in each direction or to discard samples with larger differences from the average process.
Further Extension
One impairment to accurately mapping the noise is that pels that are close to edges of objects within the image tend to produce a difference from the average signal related not to the noise but to the sampling phase related to the position of the edge.
A further enhancement to improve the accuracy of the constructed noise statistics is to detect which pels fall into this category and subtract them, not from the average of the surrounding 8 pels, but from the average of a subset of pels that have the closest corresponding luminance level. The decision as to which method to use can be made based on some convenient statistical measure within the target area.
Referring to
If the summated difference, or sum of squares which would be equally valid, falls below the threshold then the luminance levels of the eight surrounding pels are selected, step 65, and averaged 55 and switched 59 to be subtracted 56 from the luminance level of the pet being processed, as before. If the threshold is exceeded then the difference values are sorted 57, step 64, to find which subset matches most closely the level of the centre pet 512. The four, or other suitable number of, closest matches then have their absolute luminance levels averaged 58 and switched 59 to be subtracted 56 from the pel under operation.
Referring to
One suitable form for the spatial interpolation filter 35 would be a multi-tap FIR filter
Alternative embodiments of the invention can be implemented as a computer program product for use with a computer system, the computer program product being, for example, a series of computer instructions stored on a tangible data recording medium, such as a diskette, CD-ROM, ROM, or fixed disk, or embodied in a computer data signal, the signal being transmitted over a tangible medium or a wireless medium, for example microwave or infrared. The series of computer instructions can constitute all or part of the functionality described above, and can also be stored in any memory device, volatile or non-volatile, such as semiconductor, magnetic, optical or other memory device.
Although the present invention has been described with reference to preferred embodiments, workers skilled in the art will recognize that changes may be made in form and detail without departing from the spirit and scope of the invention.
Number | Date | Country | Kind |
---|---|---|---|
0611222.1 | Jun 2006 | GB | national |