The present invention relates to super resolution enhancement of sampled data.
Super-resolution refers to the enhancement of resolutions for images of scenery or objects acquired by an image capture device, such as a scanner or a CCD video camera, beyond the resolution inherent in the capture device, and beyond other limitations of the capture device. Super-resolution is used primarily to enhance the quality of captured images, and to increase the information content therein.
Many super-resolution techniques used with imaging sensors are based on micro scanning operations. When using micro-scanning operations, a scene or object being captured is sampled multiple times, each time with a sub pixel shift applied.
Prior art techniques for super-resolution do not achieve exact super resolution, but rather they perform over-sampling. Exact super resolution is limited by the fill factor, i.e., the ratio between the area of the region sensitive to radiation and the pitch. The pitch is the area between centers of adjacent detectors in a sensor panel—equivalently, the pitch is the area between adjacent pixels. For example, if the sensitivity to radiation only extends over half the distance separating detectors, then the fill factor is 0.5*0.5=25%. With a 25% fill factor, exact super-resolution using prior art methods can be obtained up to a factor of two in resolution in each dimension, since the regions of sensitivity to radiation are only half the size of the pitch, and can thus be shifted by half-pixels in each dimension without overlapping. Additional super-resolution, beyond a factor of two, has been achieved by prior art methods at the expense of decreased contrast.
For a fill factor of 80%, which is typical for image sensing devices, the limitation on exact super-resolution is
The experimental barrier is approximately at a factor of 1.5, with some decrease in contrast.
Prior art methods for obtaining super resolution from captured images are based on estimating signal distortion. These methods perform optimal estimation of signal distortion from the captured images, using Bayesian techniques and using criteria such as Maximum Entropy. These methods are advantageous when image acquisition is made over a large distance (such as satellite data acquired from outer space), since atmospheric transmission and turbulence have a major impact on limiting the resolution obtained.
One such method is described in I. Cheeseman, B. Kanefsky, R. Kraft, J. Stutz, R. Hanson, “Super-Resolved Surface Reconstruction from Multiple Images,” in Maximum Entropy and Bayesian Methods, G. R. Heidbreder (ed.), Kiuwer, the Netherlands, 1996, pgs. 293–308. This method is based on inverse graphics theory, and is used for ground modeling from outer space observations. An initial ground model is formed by letting each pixel “vote” on what the corresponding ground position should be, based upon the extent that the corresponding ground position contributes to that pixel. The initial ground model is then used to project what an image should be (i.e., predict each pixel value). The differences between the predicted pixel values and the observed pixel values are used to update the ground model until it cannot be further improved. This procedure produces an increase in both spatial resolution and gray-scale resolution.
Another such method is described in A. Zomet and S. Peleg, “Applying Super-Resolution to Panoramic Mosaics,” IEEE Workshop on Applications of Computer Vision, Princeton, October 1998. This method attains super resolution using an iterative method for mosaicing. Given a video sequence scanning a static scene, a panoramic image can be constructed whose field of view is the entire scene. Each region in the panorama is covered by many overlapping image segments, and this can be exploited to enhance its resolution.
Another such method is described in Pearson, T. I. and Readhead, A. C. S. “Image Formation by Self-Calibration in Radio Astronomy,” Ann. Rev. Astron. Astrophys. 22, 1984, pgs. 97–130. This method uses non linear methods, and is based on optimal spectrum estimation.
The present specification concerns data obtained by sampling a continuous signal, and describes methods and systems for enhancing the resolution of the data. For example, a barcode reader samples a barcode when scanning it, and the present invention can be used to enhance the resolution of the sampled data, thereby providing a better reconstruction of the barcode. For another example, a CCD camera samples an object or scene being viewed, to produce a digital image, and the present invention can be used to enhance the resolution of the digital image, thereby providing a better quality image.
The present invention enhances sampled data by effectively decreasing the sampling period. When used with digital images, the present invention provides sub-pixel accuracy. An original image quantized into pixel area elements can be enhanced using the present invention to a finer granularity quantization with sub-pixel area elements. For example, the enhanced image can have one-eighth pixel granularity.
The present invention overcomes the limitation of prior art methods to achieve exact resolution improvement by more than a factor of 1.5. Using the present invention, any desired resolution improvement may be obtained by attaching an appropriately designed mask to a sensor plane of a capture device.
There is provided in accordance with a preferred embodiment of the present invention a method for enhancing the resolution of an image sensing device, including the steps of attaching a mask to a panel of detectors in an image sensing device, generating multiple fields of view, the multiple fields of view being related to one another by sub-pixel shifts, acquiring multiple images with the image sensing device from the multiple fields of view, and combining the multiple images into an enhanced image of higher pixel resolution than the pixel resolutions of the multiple images.
There is further provided in accordance with a preferred embodiment of the present invention a system for enhancing the resolution of an image sensing device, including an image sensing device comprising a panel of detectors, a mask attached to said panel of detectors, a motion generator generating multiple fields of view, the multiple fields of view being related to one another by sub-pixel shifts, image acquisition circuitry housed within said image sensing device acquiring multiple images from the multiple fields of view, and a combiner combining the multiple images into an enhanced image of higher pixel resolution than the pixel resolutions of the multiple images.
There is still further provided in accordance with a preferred embodiment of the present invention a method for enhancing the resolution of an image sensing device, including the steps of creating replicas of fields of view using an optical element attached to an image sensing device, acquiring multiple images with the sensing device from the replicas of fields of view, and combining the multiple images into an enhanced image of higher pixel resolution than the pixel resolutions of the multiple images.
There is additionally provided in accordance with a preferred embodiment of the present invention a system for enhancing the resolution of an image sensing device, including an image sensing device, an optical element attached to the image sensing device, the optical element being such as to create replicas of fields of view, image acquisition circuitry housed within the image sensing device acquiring multiple images from the replicas of fields of view, and a combiner combining the multiple images into an enhanced image of higher pixel resolution than the pixel resolutions of the multiple images.
The present invention will be more fully understood and appreciated from the following detailed description, taken in conjunction with the drawings in which:
The present invention concerns digital images captured by digital image acquisition devices, such as digital cameras and scanners, and provides a method and system for obtaining super-resolution imagery. Super-resolution refers to the ability to obtain a resolution of a digital image that is greater than that native resolution of the image acquisition device. The present invention uses multiple low resolution captures of a picture to obtain a high resolution digital image. For example, if 64 images of an object are captured by a CCD camera having a spatial resolution of 32×32 pixels, the present invention can be used to integrate the captured data and provide a digital image of the object at a spatial resolution of 256×256 pixels. The present invention makes efficient use of captured image data, in that there is a direct one-to-one relationship between the number of images captured and the ratio of resolution enhancement. That is, if K images are captured then the present invention enhances resolution of the capture device by a factor of K.
Additionally, the present invention applies to acquisition devices, regardless of their dynamic color range. In the example described above, the CCD camera can have as low as 1 bit per pixel color depth, or as high as 8 bits per pixel and higher, and the present invention is applicable.
As used in the present specification in reference to digital images, the terms “low resolution” and “high resolution” are intended to be relative terms, used to indicate an advantage of the present invention in providing digital images of an input object at a higher resolution than those produced by an image acquisition device.
Reference is now made to
The array of detectors 110 in the sensing panel is two-dimensional, and the one-dimensional array of detectors illustrated in
Reference is now made to
The support of the two-dimensional pixel sensitivity function corresponds to the region that is sensitive to radiation. This region typically extends across an area that is contained within the area between neighboring pixel centers. The ratio between the area of the two-dimensional support of the pixel sensitivity function and the area between pixel centers containing it is referred to as the “fill factor.”
It is the averaging with g(x) that gives rise to complications in obtaining exact super-resolution in prior art methods.
Reference is now made to
Reference is now made to
Similarly,
When the present invention was applied to a CCD camera having a spatial resolution of 32×32 pixels and a color depth of 4 bits per pixel (not shown), the reconstructed image (not shown) was indistinguishable from the original.
Reference is now made to
The present specification describes the process for acquiring multiple low resolution images from a single picture, and the way to combine the multiple low resolution image data to generate a high resolution image.
Conventions and Notation
In the ensuing description it is assumed that a sensing device samples an object being acquired in lines, such as horizontal lines, using N pixels per line. The present invention is preferably applied separately in two dimensions, such as in a horizontal dimension and a vertical dimension, to increase the pixel resolutions in each of the dimensions. Since the applications of the present invention in each of the dimensions are similar, for the sake of clarity and definiteness the present specification describes a preferred embodiment of the present invention in a single dimension, such as a horizontal dimension. In particular, rather than use two coordinates (x, y) in the discussion below, a single x coordinate is used, and rather than work with double integrals and double sums, single integrals and since sums are used. Similarly, two-dimensional doubly periodic masking of a two-dimensional array of detectors in a sensing panel is described hereinbelow using a one-dimensional periodic masking function m(x).
When the sensing device samples a line, the detected energy of the n-th pixel is given by
where u(x) denotes the captured object along a specific line, Δx denotes the pixel width and g(x) denotes the pixel sensitivity (i.e. apodization).
Multiple Image Acquisition
In a preferred embodiment of the present invention, the sensing device samples a line K times, each time with a shift in the captured object relative to the capture device by an amount of Δx/K. Letting uk[n] denote the detected energy of the n-th pixel in the k-th sampling, Equation 1 generalizes to
Together, the various samples uk[n] provide KN samples given by
In an preferred embodiment of the present invention, the shifts in successive image acquisitions by Δx/K are implemented by using vibrations of the sensing device. For example, the sensing device can be placed on a vibrating platform. The parameters of the vibrations are estimated by vibration sensors or by use of appropriate algorithms. The vibration parameters are used to synchronize the sampling of the object so that the sampling occurs at times when the vibrated sensing device is located at appropriate sub-pixel shifts.
In an alternate embodiment of the present invention, an assembly of one or more rotated mirrors can be used to shift successive fields of view of a sensing device by sub-pixel shifts. The rotated mirrors reflect the scene to the sensing device.
In yet another embodiment, the shifts in successive image acquisitions by Δx/K can be implemented using an optical element instead of the imaging lens of the sensing device. The optical element is attached to the aperture of the sensing device, and serves like a grating to create replicated views of an object. Typically gratings result in replicas since the Fourier transform of a grating is an impulse train of delta functions. This embodiment using an optical element to create replicated views is particularly well suited for objects that are sufficiently small so that the replicas do not overlap.
The abovementioned optical element can be designed, for example, by a multi-facet lens. It can also be designed by an algorithm such as the one described in Z. Zalevsky, D. Mendlovic and A. W. Lohmann, Gerchberg-Saxton algorithm applied in the fractional Fourier or the Fresnel domain, Optical Letters 21, 1996, pages 842–844, the contents of which are hereby incorporated by reference.
It should be apparent to those skilled in the art that it is not necessary that the multiple acquired images be separated by identical sub-pixel shifts. The present invention applies to multiple acquired images separated by sub-pixel shifts of arbitrary sizes. Moreover, successive shifts between acquired images do not have to be sub-pixel. The shifts can be larger than one pixel shifts, as long as the multiple acquired images are all non-integral pixel shifts from one another. Specifically, the relative shifts of the multiple acquired images with reference to a fixed origin can be set to values r0Δx, r1Δx, . . . , rK-1Δx, as long as the differences rI−rj are non-integral for any distinct indices i and j. The case of equal sub-pixel shifts described above corresponds to rk=k/K.
Reference is now made to
Each row in
The samplings indicated by the second and third rows is often referred to as “sub-pixel” sampling, since these samplings are centered at fractional pixel locations. Specifically, in
of the object u(x).
The present invention uses the samples y[n] to approximately reconstruct the values of
Determining these reconstructed values achieves super-resolution with a resolution enhancement factor of K, since the sensing, device has a resolution of N pixels per line, and the reconstructed signal has KN pixels per line.
In order to describe the reconstruction of the values
from the samples y[n], a classical result from signal processing is used, which relates the discrete Fourier transform of a sampled analog signal to the continuous Fourier transform of the analog signal.
A Bit of History: Frequency Representation of Sampling
In the ensuing description, s(t) is used to denote a continuous time signal, and S(jΩ) is used to denote its continuous-time Fourier transform, defined by:
Periodic sampling of s(t) with a sampling period of T produces a discrete time signal s[n], defined by:
s[n]=s(nT) (5)
The discrete-time Fourier transform of s[n] is denoted by S(ejω), and is defined by:
An age-old result, referred to as the Poisson Summation Formula and which is one of the most prominent formulas used in discrete signal processing, provides a relationship between the continuous-time Fourier transform S(jΩ) of a continuous time signal and the discrete-time Fourier transform S(ejω) of the sampled signal. Specifically, the Poisson Summation Formula states that
In fact, more generally the Poisson Summation Formula establishes that
of which Equation 7 above is a special case corresponding to t=0.
A simple proof of Equation 8, as indicated on page 210 of Rudin, W., “Real and Complex Analysis: Second Edition,” McGraw-Hill, 1974, is as follows. The function f(t) defined by
is a periodic function with period T. As such, it has a standard Fourier series expansion of the form
where the Fourier coefficient ck is given by
Upon substituting Equation 9 into Equation 11 it can be seen that
from which Equation 8 follows when Equation 12 is substituted back into Equation 10.
If the function
is used in Equation 8, instead of the function s(t), one obtains the result
By setting t=0 in Equation 13 one arrives at the familiar sampling equation
which appears as Equation 3.20 in A. V. Oppenheim and R. W. Schafer, “Discrete-Time Signal Processing,” Prentice Hall, 1989. Equation 14 indicates that the discrete Fourier transform of a sampled analog signal is comprised of periodically repeated copies of the continuous Fourier transform of the analog signal. The copies are shifted by integer multiples of the sampling frequency and superimposed. Equation 14 is often expressed in terms of delta functions; namely, that the Fourier transform of an impulse train
is itself an impulse train
It is noted that Equation 14 can also be derived directly from Equation 7 by using
instead of s(t).
For values of ω that are small enough to avoid significant aliasing in Equation 14, say ω<Ωs, one can approximate the sums on the right hand sides of Equations 13 and 14 by the term with k=0. When this approximation is made, it in turn leads to the approximation
It is noted that as the sampling period, T, decreases, the approximation in Equation 15 becomes more accurate. The increase in accuracy is due to the fact that the aliasing, or overlap, between the copies on the right hand sides of Equations 13 and 14 decreases, since the shifts
between copies increase.
The present invention uses a version of Equation 15 to obtain super-resolution. Specifically, instead of sampling s[n] s(nt) as in Equation 5, the sampling is done through an integral
where g(t) represents the sensitivity, or apodization, of a pixel. Multiplying both sides of Equation 15 by g(t) and integrating, results in
which is the form used in the present invention.
Referring back to Equation 3, it follows from Equation 17, upon replacing T by
that
where U(ejω) is the discrete Fourier transform of the samples
sought to be reconstructed; namely,
The Fourier transform Y(ejω) is known from the samples y[n] in Equation 3, and the Fourier transform
is determined from knowledge of the sensitivity function g(x). It is noted that as K increases, the approximation in Equation 18 becomes more accurate.
Regarding the Fourier transform Y(ejω), the values of y[n] set forth in Equation3 above are only defined for indices n=0, 1, . . . , KN-1. For indices n outside of this range, y[n] is preferably defined to be zero. With such a convention, the Fourier transform Y(ejω) is given by the finite sum
In a preferred embodiment, the present invention uses Equation 18 to determine the Fourier transform U(ejω), and then reconstructs the values
n=0, 1, . . . , KN-1 from U(ejω) by using an inverse discrete Fourier transform. Preferably a Fast Fourier Transform (FFT), as is familiar to those skilled in the art, is used to invert the spectral values U(ejω). The reconstruction is approximate, since Equation 18 is itself only an approximation. Since it is necessary to reconstruct KN values of u(x), it is accordingly necessary to apply Equation 18 at KN frequencies that are spaced
apart one from the next, since the Fourier transform U(ejω) is periodic with period 2π. In a preferred embodiment of the present invention, Equation 18 is applied at the KN frequencies
When the Fourier transform U(ejω) is inverted it is possible that values of
are non-zero outside of the index range n=0, 1, . . . , KN-1. As it is only the values of
within the index range n=0, 1, . . . , KN-1 that are of interest, the values outside this index range can be ignored.
An apparent difficulty arises in implementing Equation 18 if there are frequencies ω as given in Equation 21, at which
since this would entail division by zero at such frequencies.
The function G(−jΩ) is typically a real-valued function, since the pixel sensitivity function g(x) is typically symmetric about x=0. Since a pixel is typically more sensitive to light in its central area and less sensitive in its outer regions, the pixel sensitivity function g(x) is typically “bell-shaped,” similar to Gaussian functions. In case g(x) happens to be a Gaussian function, its Fourier transform has no zeros, and there no divisions by zero are encountered in implementing Equation 18. However, for other sensitivity functions, such a difficulty can arise.
The present invention avoids such a difficulty by using specially constructed masks, as described hereinbelow.
Masking
In order to avoid division by zero in Equation 18, the present invention preferably employs an optical device to remove zeros of
that are less than the frequency ΩS. The frequency ΩS is the maximal frequency in the reconstructed data.
The present invention preferably attaches a mask in the form of a fine transmission grating with a period of Δx to the sensing device panel. As mentioned hereinabove, the mask under discussion is really a two-dimensional doubly periodic mask that covers a two-dimensional array of detectors. For the sake of clarity and definiteness it is being described as a single dimensional periodic mask, with the understanding that the one-dimensional analysis presented herein applies to each of the two dimensions of the sensing device panel.
Reference is now made to
Use of a mask decreases the effective size of each pixel so that the Fourier transform G(jΩ) of its sensitivity function has a wider band, and its zeros accordingly move to higher frequencies. The apodized pixel, determined by the transmission pattern of the grating, is preferably designed in an iterative manner, as described hereinbelow, so that the bandwidth of the Fourier transform G(jΩ) of its sensitivity function is wider than the maximal frequency to be reconstructed. Use of a transmission grating has the disadvantage of decreasing the energy sensed at each pixel, since it blocks out some of the illumination.
A mask is described mathematically by a spatial function m(x), preferably taking values 0 and 1. The effect of a mask is to modify the pixel sensitivity function from g(x) to p(x)=g(x)m(x). An objective of choosing a suitable mask is to obtain a function p(x) whose Fourier transform does not have zeroes at frequencies that are between 0 and ΩS, so that when the Fourier transform
of the modified pixel sensitivity function p(x) is used in Equation 18 instead of
there is no division by zero.
A rationale for using a mask to avoid zeroes of G is that typically an effect of a mask is to add frequencies that may have not been present beforehand.
To construct a mask as desired, the present invention preferably uses a grating in the form of a plurality of optically transparent slits.
where M is the total number of slits, x1, x2, xM are the locations of the slits and δxi is the width of the i-th slit. The function rect( ) is given by
and {circle around (c)} denotes convolution.
Reference is now made to
The sensitivity function of each masked pixel is given by
p(x)=g(x)m(x), (25)
and it can be seen that the Fourier transform of p(x) is given by
where sinc(z) denotes the function
Preferably, to determine an optimal mask, a predetermined value of M is selected, and a search is made for position values x1, x2, . . . , xM and widths δx1, δx2, . . . , δ&M for which the |P(j Ω)| is bounded away from zero as much as possible. Specifically, define
where ΩS denotes the frequency up to which no significant aliasing occurs, as defined above. Then optimal values for x1, x2, . . . , xM and δx1, δx2, δxM are preferably determined by
where arg-max denotes the arguments x1, x2, . . . , xM and δx1, δx2, . . . , xM which maximize the function σ.
The search range over which optimal values of x1, x2, . . . , xM and δx1, δx2, . . . , δxM are sought is such that the total δx1+δx2+ . . . +δxM of all the widths does not exceed the pixel width Δx. Additionally, each individual width δxi must not be less than 2λ, where λ is the wavelength of the incoming light—otherwise, the light will not be affected by the mask. Moreover, construction of a mask and attaching it to the detectors may further limit the widths δxi to exceed a minimum value of δxmin.
From energy considerations, it is clear that on the one hand, it is desirable that each width δxi be as small as possible, since δxi is proportional to the energy blocked by the mask. On the other hand, this must be traded off against the condition that |P(jΩ)| be bounded away from zero.
Specifically, if Et denotes the total light energy for a typical viewing target, when viewed without a mask, then the energy detected when a mask is present is given by
since the energy blocked by the slits of the mask is proportional to the total width of the slits combined. Thus for a given minimum energy Emin, required to activate the detectors, the widths δxi are constrained to satisfy
Condition (30), together with the condition that δxi≧δxmin, for i=1, 2, . . . , M, determine the search range for widths δx1, δx2, . . . , δxM over which a maximum of σ(x1, x2, . . . , xM; δx1, δx2, . . . , δxM) is sought.
When values of x1, x2, . . . , xM and δx1, δx2, . . . , δxM are determined according to Equation 28, Equation 22 is used to generate an optimal mask.
In and alternative embodiment of the present invention, rather than maximize σ(x1, x2, . . . , xM; δx1, δx2, . . . , δxM), the values of x1, x2, . . . , xM and δx1, δx2, . . . , δxM are determined by maximizing a weighted function a(x1, x2, . . . , xM; δx1, δx2, . . . , w(δx1, δx2, . . . , δxM), where w is an appropriate weight function. For example, w can put higher weight on smaller values of the widths δxi, so that preference is given to masks having slits with smaller widths.
Reference is now made to
Reference is now made to
When a mask is is/applied in accordance with a preferred embodiment of the present invention, the zeroes of
are removed, and the result obtained is the image shown in
Improved Approximation
As explained hereinabove, the approximation in Equation 18 is based on the assumption that ω is small enough to avoid aliasing in Equation 14. A more accurate approximation can be made by taking into consideration the aliasing between adjacent copies of S in the right-hand side of Equation 14. However, in order to obtain the increase in accuracy, it is necessary to increase the sampling rate.
It is appreciated by those skilled in the art that the assumption that aliasing between copies of S in Equation 14 is due to overlap between adjacent terms
and
is equivalent to the assumption that reduction of the sampling period from T to T/2 removes aliasing. Moreover, the information at frequency ω in the discrete Fourier transform of a signal sampled with sampling period T corresponds with the information at both frequencies ω/2 and ω/2+π in the discrete Fourier transform of the signal sampled with sampling period T/2. A precise formulation of this relationship appears in the above referenced Oppenheim and Schafer as Equation 3.76, which describes a frequency-domain relationship between the input and the output of a sampling rate compressor (“downsampling”).
These observations above are a basis for an improved approximation used in accordance with a preferred embodiment of the present invention, as described hereinbelow. Specifically, replace the sampling period, T, by T/2 in Equation 16:
Under the assumption that aliasing in Equation 14 is due to overlap between adjacent terms
and
the approximation to S(ejω) becomes
for positive values of ω, and
for negative values of ω.
Using similar analysis to that embodied in Equations 13–18, it can be shown that
for positive values of ω, and
for negative values of ω. Equations 34 and 35 determine U(ejω) more accurately than Equation 18, since they incorporate aliasing between adjacent copies of S in the sampling representation for S(ejω). However, they require sub-pixel shifts of
between successively captured images. This corresponds to a factor of 2K increase in data acquisition, whereas the resolution is only enhanced by a factor of K.
An alternative approach to improving the approximation in Equation 18 by taking into consideration the aliasing between adjacent copies of S in the right-hand side of Equation 14, uses an alternating (“checkerboard”) mask geometry. As mentioned hereinabove, the above discussion is based on use of a periodic mask, with a period of Δx. By using instead a mask with a period of 2Δx, one effectively generates two pixel sensitivity functions, g1(x) and g2(x), where g1(x) applies to even-numbered detectors and g2(x) applies to odd-numbered detectors. Specifically, g1(x) corresponds to g(x)m(x) and g2(x) corresponds to g(x)m(x+Δx), where g(x) is the pixel sensitivity function for the image sensing device and m(x) is the overall mask. Unlike the above case where the mask has a period of Δx, when the mask has a period of 2Δx there are effectively two masks operating (namely, the mask itself and the mask shifted by Δx), and consequently the even and odd-numbered pixels are processed differently.
Reference is now made to
In distinction to
When using such a mask, Equation 2 (which is one-dimensional) separates into two equations,
By capturing twice as many images as previously, namely, 2K images, each detector moves over a full period of the mask, namely, 2Δx. Consequently, two sequences of the form of Equation 3 are acquired, namely,
In turn, using the same analysis embodied in Equations 13–18, one arrives at the two conditions
for positive values of Ω, 0<Ω<π, and
for negative values of Ω, −π<Ω<0. These equations can be used to solve for the individual copies
and
and then to recover S(ejω) from Equations 32 and 33. The final result obtained, expressed in terms of U(ejω), is
for positive values of ω, 0<ω<π, and
for negative values of ω, −π<ω, <0. In Equations 44 and 45, the terms δΨG(−jΩ) denote the difference
δΨG(−jΩ)=G(−jΩ)−G(−j(Ω−Ψ). (46)
In distinction to the approach using Equations 34 and 35, whereby the improvement in accuracy stems from reducing the sub-pixel sampling shift in half, from
the approach using Equations 42 and 43 obtains improvement in accuracy by acquiring 2K sampled images, each shifted by
Both approaches require collection of sampled data that is 2K as much data as in a single image captured by a sensing device. They differ in the types of masks used, and in the relative shifts between the multiple images acquired.
When implemented with full two-dimensional processing, both approaches described hereinabove require 4K2 as much data as in a single image captured by a sensing device, in order to enhance image resolution by a factor of two in each pixel dimension.
Geometric Super Resolution vs. Diffractive Super Resolution
The multiple image acquisition described above and illustrated in
In distinction, multiple image acquisition can also be diffractive in nature. The resolution of a system limited by diffraction corresponds to the finest detail that can pass through the system without being distorted, and is proportional to the size of the aperture in the optical system. Thus the resolution of the human visual system is limited by the extent to which the eyes are open. When an observer squints his eyes, the resolution of a scene being viewed decreases, since the size of the opening is decreased.
For example, resolution enhancement can be implemented by an assembly of two moving rotated gratings, a first grating attached to the object being captured, and a second grating attached to a sensor of the sensing device, and moving in a direction opposite to that of the first grating. Such an assembly can be used to enhance the resolution of a diffractive system, since it effectively increases the size of the aperture in the optical system. The movement of the grating attached to the object encodes its spatial information by a Doppler-like effect, and allows the information to be transmitted through a restricted aperture of an imaging lens. The decoding of the information is obtained from the second grating attached to the sensing device, which moves in a direction opposite to that of the first grating. Such an assembly of two rotated gratings is practical for microscopic applications.
However, in many imaging systems the object is distant from the observer, and attachment of a grating to the object is impractical. When applied to scanning systems such as barcode readers, the present invention preferably simulates attachment of a moving grating to the object by illuminating the object with an illumination having a spatial structure of a grating. The illumination is shifted with time in order to simulate temporal motion of the grating. A specially designed diffractive optical element is attached to the illumination source so that a grating pattern appears in the object plane. The grating pattern is moved by phase modulating the light source, since linear phase modulations appear as shifts in a far field.
Additional Considerations
In reading the above description, persons skilled in the art will realize that there are many apparent variations that can be applied to the methods and systems described. For example, the integral in Equation 3 may be replaced by a discrete Gabor transform over u with window function g; namely,
Similarly, one may use instead a wavelet or Mellin transform, as is well known to persons skilled in the art.
Additionally, the shifts between the multiple images that are acquired by the sensing device are not required to be equal. Furthermore, the shifts between the multiple images are not required to be horizontally or vertically disposed, nor are they required to be disposed in a fixed direction. Wavelet transforms and Mellin transforms are particularly advantageous when dealing with unequal shifts between acquired images.
Additionally, the reconstruction algorithm used in the present invention, for generating a high resolution image from multiple low resolution images, need not be separable. That is, it need not be comprised of horizontal and vertical reconstructions. Rather, the reconstruction algorithm used in the present invention can be a genuine two-dimensional algorithm.
In the foregoing specification, the invention has been described with reference to specific exemplary embodiments thereof. It will, however, be evident that various modifications and changes may be made to the specific exemplary embodiments without departing from the broader spirit and scope of the invention as set forth in the appended claims. Accordingly, the specification and drawings are to be regarded in an illustrative rather than a restrictive sense.
Number | Date | Country | Kind |
---|---|---|---|
129258 | Mar 1999 | IL | national |
133243 | Nov 1999 | IL | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IL00/00201 | 3/30/2000 | WO | 00 | 1/4/2002 |
Publishing Document | Publishing Date | Country | Kind |
---|---|---|---|
WO00/59206 | 10/5/2000 | WO | A |
Number | Name | Date | Kind |
---|---|---|---|
4511929 | Maeda et al. | Apr 1985 | A |
4716414 | Luttrell et al. | Dec 1987 | A |
4748316 | Dickson | May 1988 | A |
4820911 | Arackellian et al. | Apr 1989 | A |
4978860 | Bayley et al. | Dec 1990 | A |
5218457 | Burkhardt et al. | Jun 1993 | A |
5315095 | Marom et al. | May 1994 | A |
5315411 | Blanding | May 1994 | A |
5572037 | Liu et al. | Nov 1996 | A |
5638211 | Shiraishi | Jun 1997 | A |
5657147 | Yoshikawa et al. | Aug 1997 | A |
5680253 | Hasegawa et al. | Oct 1997 | A |
5684620 | Schoon | Nov 1997 | A |
5909521 | Nakao et al. | Jun 1999 | A |
6073851 | Olmstead et al. | Jun 2000 | A |
6310967 | Heine et al. | Oct 2001 | B1 |
6675140 | Irino et al. | Jan 2004 | B1 |
Number | Date | Country |
---|---|---|
0 777 147 | Jun 1997 | EP |