Embodiments of the present invention relate to Optical Character Recognition (OCR), and particularly to a method and system for restoring a motion-blurred image.
Photographic (or other optically captured images) images may be blurred by relative motion between an imaging system or camera and an object of interest. An example of a motion-blurred image is illustrated in
The methods used to restore such images typically involve convolving the real image with a point spread function (PSF):
where
In the case of a blurred image, the PSF is a function of one argument, i.e. h(x,y)=h1D(x sin(α)+y cos(α)), where α is the angle of the blur.
In the Fourier space, the equation (1) becomes:
Since the function G(p,q) and the others are periodical G(p,q)=G(p−K, q)=G(p,q−M), it is assumed everywhere below that the p and q variables may have either positive or negative values.
H(p,q) is the Fourier transform of the PSF, often called optical transfer function (OTF). In the case of a blurred image, the OTF is a complex function of one argument H(p,q)=H1D(p·sin(α+π/2)+q·cos(α+π/2)).
Also, the Wiener filter may be used to restore images:
where
This filter minimizes the root mean square deviation of the restored image from the real image
provided that the average noise value is 0.
Therefore, in order to restore a blurred image, one needs to know:
U.S. Pat. No. 6,470,097 Oct. 22, 2002 describes an iteration method for finding a non-blurred image. At each step of this method, total variation regularization is performed to minimize the energy function with the image blur. The type of distortion is set as a parameter and is predefined. The length of the blur and its direction are also predefined. Additionally, the image is restored from a sequence of images rather than from one blurred image.
U.S. Pat. No. 6,859,564 Feb. 22, 2005 describes a method where the OTF is determined from the scaled αth power of the smoothed magnitude of the blurred image and noise.
Other methods for restoring blurred images are described, for example, in the review by D. Kundur and D. Hatzinakos, “Blind Image Deconvolution Revisited,” IEEE Signal Processing Magazine, vol. 13, no. 6, pp. 61-63, November 1996, and in other sources.
Embodiments of the present invention disclose a method and system for restoring a motion-blurred image. The method comprises determining parameters for a one-dimensional Optical Transfer Function (OTF) for the motion-blurred image in Fourier space; determining a signal-to-noise ratio for the motion-blurred image in the Fourier space; and correcting for motion blur based on the parameters of the OTF. Determining the parameters comprises calculating a function Φ(p,q) which is based on the square of the modulus of the Fourier transform |G(p,q)|2 of the motion-blurred image. The parameters include the absolute value of the one-dimensional OTF, and the phase and sign of the OTF.
In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a thorough understanding of the invention. It will be apparent, however, to one skilled in the art that the invention can be practiced without these specific details. In other instances, structures and devices are shown only in block diagram form in order to avoid obscuring the invention.
Reference in this specification to “one embodiment” or “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment is included in at least one embodiment of the invention. The appearance of the phrases “in one embodiment” in various places in the specification are not necessarily all referring to the same embodiment, nor are separate or alternative embodiments mutually exclusive of other embodiments. Moreover, various features are described which may be exhibited by some embodiments and not by others. Similarly, various requirements are described which may be requirements for some embodiments but not other embodiments.
Embodiments of the present invention disclose a method and system for restoring a motion-blurred image. Broadly, the method comprises determining parameters for a one-dimensional Optical Transfer Function (OTF) for the motion-blurred image in Fourier space; determining a signal-to-noise ratio for the motion-blurred image in the Fourier space; and correcting for motion blur based on the parameters of the OTF. The parameters include the absolute value of the one-dimensional OTF, and the phase and sign of the OTF. In one embodiment correcting for motion blur involves the uses of the Wiener filter.
Next, the image can be restored to obtain the image shown in
To determine the OTF (302) and the signal-to-noise ratio (303), the steps shown in the flowchart in
Segmenting the Image Into Windows (401)
If the image were not distorted by additive noise n(x,y), then, in accordance with equation (2), calculating the Fourier transform of the source image (function G(p,q)) would find the zeros of the OTF (function H(p,q)), and, if some additional assumptions are made, the absolute value of the function H(p,q). However, in reality, noise is an important factor which makes determining H(p,q) based on G(p,q) impossible. In different areas of the image, the function H(p,q) may be different (e.g. the direction of the blur may be different due to camera rotation at the moment of taking the picture).
To minimize the impact of the noise, the image is divided or segmented into windows, which are square areas of fixed dimensions. A window must be sufficiently large to include individual elements of the source image in their entirety (e.g. the height of the window must equal several rows of text) and sufficiently small to allow segmentation of the image into a considerable number of windows. In this embodiment of the invention, the size of the window is set to 256×256 pixels.
The image is segmented into windows in such a manner that the windows overlap at the edges. In each window, the components of each pixel are multiplied by the function of the window, which equals 1 in the middle of the window and gradually decrements to 0 towards its edges.
Segmenting the Image into Blocks (402) and Calculating the Mean Square of the Modulus of the Fourier Transform (403)
The windows are merged into blocks, the size of each block being approximately 5×5 windows. To determine the blur parameters, the function Φk(p,q) is defined, which is the square of the modulus of the Fourier transform, averaged over the windows and belonging to the kth block:
where A is the number of windows in the kth block.
Provided the average noise value in all the windows is 0,
Considering that
the equation may be rewritten as
Φk(p,q)=|H(p,q)|2
This averaging significantly reduces the role of the noise and allows extracting information about |H(p,q)|2.
Segmenting the image into blocks, rather than summing up square of the modulus of the Fourier transform |Gt(p,q)|2 over all the image windows, is performed in order to take into account the possible differences in the direction of the blur in different areas of the image. Once the direction of the blur is determined in each block, the functions Φk(p,q) are averaged over blocks, taking into account the directions of the blur in the blocks (
Deducting the Noise (404)
Generally, one can identify and deduct the noise
To find the parameters of the noise, the radial profile of the function ln(Φk(p,q)) 701 is created, i.e. for each absolute value of the impulse √{square root over (p2+q2)}, the minimum value ln(Φk(p,q)) is found. This profile is approximated by the logarithm of type (4) (702).
If the parameters of the noise are successfully found, the noise is deducted as follows:
Determining the Direction of the Blur (405)
Based on the function Φk(noiseless)(p,q), the direction of the blur is determined for the entire image 601. For this purpose, the function ln └Φk(noiseless)(p,q)┘, shown in
Finding the Absolute Value of the One-dimensional OTF (302)
Once the direction of the blur 601 is determined in each block and the absolute OTF value has been averaged over the blocks, the absolute value of the one-dimensional OTF is determined. This is done similarly to (5) for the block-averaged function:
Then, along the line that goes through zero (p=0, q=0) in the determined direction, the function H(p,q) is constant and is 1, because PSF is normalized. Since on low frequencies the signal-to-noise ratio is significantly above 1, the section ln(Φ(noiseless)(p,q)) that goes through zero in the direction of the blur will be close to ln
The signal-to-noise ratio can be successfully modeled by a Gauss function, and the logarithm can be modeled by a parabola. Therefore, at the first step, the section of the function ln(Φ(noiseless)(p,q)) 1002 is created which goes along the direction of the blur 601. The section goes through zero and is approximated by the parabola 1003.
Next, it is assumed that the signal-to-noise value is determined only by the absolute value of the impulse √{square root over (p2+q2)}. The one-dimensional OTF H1D can be restored inside the circle enclosed by the condition that requires that the parabola go through zero 1001, i.e. inside the circle in which the signal-to-noise ratio is close to or greater than 1. In this circle, the sections in the direction of the blur are created, which are located at different distances from zero and approximated by the function ln
is determined by the found parabola 1003, and H1D2 is an approximation parameter. Thus, H1D2 is determined for each distance from zero.
Determining the Signal-to-Noise Ratio (303)
It is assumed that the signal-to-noise ratio
depends only on the absolute value of the impulse √{square root over (p2+q2)}. For low frequencies, the logarithm ln
of the signal-to-noise ratio is determined by the parabola that approximates the section of the function ln(Φ(noiseless)(p,q)), which goes through zero in the direction of the blur. For higher frequencies, smoothed values of the same section are used. Additionally, a lower limit is introduced: the signal-to-noise ratio must be greater than 0.5.
Determining the Phase of the One-dimensional OTF Based on its Absolute Value (304)
OTF is a complex function defined by an absolute value and phase. (302) describes how the absolute value of the OTF is determined. The phase has to be determined separately. In a certain class of functions, namely minimum-phase functions, the absolute value ρw and the phase φw of their Fourier transform are linked by the Hilbert transform:
Here w,w′=0 . . . N−1. The minimum phase condition is a condition that requires that the function be invertible and that the function and the inverse function be causal functions. The phase of the one-dimensional OTF is determined by (7).
Selecting the Sign of the OTF Phase (305)
The phase of the OTF is determined based on the absolute value of the OTF using the formula (7), with a corresponding accuracy. In order to select the correct sign of the phase, both signs are used to restore the image in several windows which have the maximum gradient. Then the sign that produces the best result is selected. The best result is the one that has the smallest area of the restored segment filled with pixels with a high absolute value of the derivative in the direction of the blur, which means sharper boundaries on the image.
Restoring the Image (306)
In order to restore the image, the Wiener filter (3) is applied to each window. The blur direction of the image is defined as the blur direction of the block 501 to which it belongs. The pixels of the restored image are obtained by averaging over the pixels of the overlapping windows. The value of the window's function is used as the weight.
The system 1100 also typically receives a number of inputs and outputs for communicating information externally. For interface with a user or operator, the system 1100 may include one or more user input devices 1106 (e.g., a keyboard, a mouse, imaging device, etc.) and one or more output devices 1108 (e.g., a Liquid Crystal Display (LCD) panel, a sound playback device (speaker, etc)).
For additional storage, the system 1100 may also include one or more mass storage devices 1110, e.g., a floppy or other removable disk drive, a hard disk drive, a Direct Access Storage Device (DASD), an optical drive (e.g. a Compact Disk (CD) drive, a Digital Versatile Disk (DVD) drive, etc.) and/or a tape drive, among others. Furthermore, the system 1100 may include an interface with one or more networks 1112 (e.g., a local area network (LAN), a wide area network (WAN), a wireless network, and/or the Internet among others) to permit the communication of information with other computers coupled to the networks. It should be appreciated that the system 1100 typically includes suitable analog and/or digital interfaces between the processor 1102 and each of the components 1104, 1106, 1108, and 1112 as is well known in the art.
The system 1100 operates under the control of an operating system 1114, and executes various computer software applications, components, programs, objects, modules, etc. to implement the techniques described above. Moreover, various applications, components, programs, objects, etc., collectively indicated by reference 1116 in
In general, the routines executed to implement the embodiments of the invention may be implemented as part of an operating system or a specific application, component, program, object, module or sequence of instructions referred to as “computer programs.” The computer programs typically comprise one or more instructions set at various times in various memory and storage devices in a computer, and that, when read and executed by one or more processors in a computer, cause the computer to perform operations necessary to execute elements involving the various aspects of the invention. Moreover, while the invention has been described in the context of fully functioning computers and computer systems, those skilled in the art will appreciate that the various embodiments of the invention are capable of being distributed as a program product in a variety of forms, and that the invention applies equally regardless of the particular type of computer-readable media used to actually effect the distribution. Examples of computer-readable media include but are not limited to recordable type media such as volatile and non-volatile memory devices, floppy and other removable disks, hard disk drives, optical disks (e.g., Compact Disk Read-Only Memory (CD ROMS), Digital Versatile Disks, (DVDs), etc.
Although the present invention has been described with reference to specific example embodiments, it will be evident that various modifications and changes can be made to these embodiments without departing from the broader spirit of the invention. Accordingly, the specification and drawings are to be regarded in an illustrative sense rather than in a restrictive sense.
Number | Name | Date | Kind |
---|---|---|---|
5550935 | Erdem et al. | Aug 1996 | A |
5745597 | Agazzi et al. | Apr 1998 | A |
5778103 | Allan et al. | Jul 1998 | A |
6470097 | Lai et al. | Oct 2002 | B1 |
6798910 | Wilson | Sep 2004 | B1 |
6859564 | Caron | Feb 2005 | B2 |
7619656 | Ben-Ezra et al. | Nov 2009 | B2 |
7639289 | Agrawal et al. | Dec 2009 | B2 |
20010024534 | Gregory et al. | Sep 2001 | A1 |
20030182246 | Johnson et al. | Sep 2003 | A1 |
20070286514 | Brown et al. | Dec 2007 | A1 |
20080012955 | Johnson et al. | Jan 2008 | A1 |
Number | Date | Country | |
---|---|---|---|
20100142845 A1 | Jun 2010 | US |