Bayesian approach for sensor super-resolution

Information

  • Patent Grant
  • 7523078
  • Patent Number
    7,523,078
  • Date Filed
    Friday, September 30, 2005
    19 years ago
  • Date Issued
    Tuesday, April 21, 2009
    15 years ago
Abstract
Bayesian super-resolution techniques fuse multiple low resolution images (possibly from multiple bands) to infer a higher resolution image. The super-resolution and fusion concepts are portable to a wide variety of sensors and environmental models. The procedure is model-based inference of super-resolved information. In this approach, both the point spread function of the sub-sampling process and the multi-frame registration parameters are optimized simultaneously in order to infer an optimal estimate of the super-resolved imagery. The procedure involves a significant number of improvements, among them, more accurate likelihood estimates and a more accurate, efficient, and stable optimization procedure.
Description
TECHNICAL FIELD

This disclosure describes a method for combining multiple low-resolution images or signals into a single high-resolution image or signal. More particularly, this disclosure relates to a Bayesian technique for forming a high-resolution image or signal from a plurality of low-resolution images or signals.


BACKGROUND

There are a number of prior techniques for deriving a high-resolution image from a plurality of low-resolution images.


In one technique, each low resolution image is aligned to a reference image using an alignment algorithm such as Lucas-Kanade [1-5]. The aligned images are then combined using stacking (robust sum), Bayesian inference, or learned statistics. There are two primary problems with this approach. (1) It attempts to achieve sub-pixel alignment accuracy in aligning the low-resolution images using only the low-resolution image. (2) This approach is not model-based, so it cannot accommodate barrel/pincushion distortion, diffraction or other effects.


In another technique, both the super-resolved image and the alignment parameters are constructed through optimization of the likelihood of the measured data (y) given the alignment parameters (A) and hypothesized super-resolved image (x) That is, the algorithm maximizes P(y|A, x). Some of these algorithms can optionally use a prior on the alignment parameters or the hypothesis (maximizing either P(y, x|A) or P(y, x, A). However is difficult (and frequently unstable) to simultaneously align and resolve the images.


An advantage of the model based approaches (optimization, Tipping-Bishop [6] and our own) is that the formulation is very general. For example, the set of “alignment parameters” (A) may capture any number of transformation parameters for example, degree of pin-cushion distortion, degree of barrel distortion, shift, rotation, degree of blurring kernels including Gaussian or other diffraction kernels,


US Patent Application Publication US2004/0170340 A1 of Tipping and Bishop refers to a Bayesian technique for computing a high resolution image from multiple low resolution images. The algorithm in the Tipping-Bishop application marginalizes the super-resolved image out of P(y, x|A) allowing one to directly optimize the likelihood for the alignment parameters followed by a super-resolution step. That is, the algorithm allows direct computation of P(y|A), allowing an optimization algorithm to directly optimize the alignment parameters. In the Tipping-Bishop application, these alignment parameters included shift, rotation and width of the point spread function (A=<s, θ, γ>) for the optical system (degree of blur). The problem with the approach of the Tipping-Bishop application is that it is mathematically incorrect. In the derivation of the approach they made a major algebra or formulation mistake with the result that the resulting alignment likelihood P(y|A) is incorrect. In practice, the algorithm frequently diverges when optimizing some imaging parameters, particularly the point spread function.


SUMMARY

We have derived a corrected likelihood function that is more accurate, has a significantly simpler functional form and works extremely well, displaying none of the instability exhibited by the Tipping-Bishop approach. Multiple low-resolution images or signals are accurately aligned to sub-pixel resolution and stacked to form a single high-resolution image or signal. For example, ten-low resolution images are used to form a single high-resolution image with 4× to 9× resolution, that is, 4 to 9 times the pixels or 2 to 3 times the linear resolution of the low resolution images.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 shows examples of high resolution images derived from low resolution images using the disclosed invention.



FIG. 2 shows one example of the invention of this application.



FIG. 3 shows details of the processor shown in FIG. 2.



FIG. 4 shows the relationship between the low resolution images and the high resolution image derived in accordance with the invention.



FIG. 5 is a flow chart illustrating the generation of a maximum likelihood function in accordance with the invention.



FIG. 6A-6C are comparative results in video imaging obtained using the Tipping-Bishop algorithm and the improved procedures of this invention.



FIG. 7A-7B are sample results for 4× sensor super-resolution of in-flight LWIR imagery.



FIGS. 8A-8C are additional LWIR comparative results obtained indoors.





DETAILED DESCRIPTION


FIG. 1 illustrates examples of two high-resolution images 10 and 12 that have been derived from respective pluralities of low-resolution images 14 and 16. As is shown in FIG. 1, use of the invention of this application significantly increases the clarity and sharpness of the low resolution images 10 and 12.



FIG. 2 illustrates the general principles involved in transforming the low resolution images 14 and 16 into high resolution images 10 and 12 in FIG. 1. In particular, the apparatus of FIG. 2 combines multiple ‘low-resolution’ samples of a scene in order to reconstruct a ‘high-resolution’ estimate. Multiple low-resolution images are combined to infer the parameters that optimally counter the sub-sampling and between-image geometric transformations.


The approach is statistical in that the high-resolution image is obtained through an inference procedure. Inference is based on exploiting the knowledge derived from the low-resolution imagery and the models of the subsampling process (i.e. point-spread-function) and registration of the imagery. We present the details of the procedure and a correct derivation in the next section. The optimization is performed in one or more subsets or portions of the low resolution imagery due to the significant computational requirements of the procedure.



FIG. 2 shows a camera 18 that creates a plurality of low resolution images 20a, 20b, and 20c of a scene 22. The low resolution images 20a, 20b, and 20c from an imaging sensor in the camera 18 are sent to a processor 24 located in the camera 18 or outside the camera 18. The processor 24 generates a high resolution image 26 from the low resolution images 20a, 20b, and 20c produced by the camera 18. The functionality of the processor 24 may be implemented in any form of hardware or software. It may for example be implemented in a personal computer or a special purpose digital signal processor.


The processor 24 contains a block 28 which is responsive to the low resolution images 20a, 20b, and 20c to select one or more small characteristic regions or portions of the low resolution images 20a, 20b, and 20c for processing in accordance with this example of the invention. These selected regions constitute areas containing a great deal of detail or high frequency content in the low resolution images 20a, 20b, and 20c. Once these regions have been identified and selected, block 30 in the processor 24 optimizes both the point spread function and registration parameters for the selected regions. Block 32 then registers and deconvolves the full size image using the parameters from block 30 and generates the high resolution image 26.



FIG. 3 shows more details of the operation of the processor 24 in FIG. 2. Low resolution images 20a, 20b, and 20c from the camera 18 are input to the processor 24 in block 34. Block 36 in FIG. 3 selects one or more regions of interest in the low resolution images. These regions of interest are small subsets or portions of the low resolution images that are selected for the optimization process described in detail below. They may include the center region of the low resolution images and any other region or regions in the low-resolution images spaced from the centers of the images that exhibit a large amount of image information such as regions of great detail or high frequency content. Preferably, the regions of interest are a number of high information content regions spaced away from the centers of the low-resolution images. In one embodiment, we used a technique developed by Lucas and Kanade for identifying information rich areas via eigenvalue analysis. In any case, if the alignment regions are widely separated, we can compute the alignment likelihood for the parameters in that region independent of the alignment likelihoods for other regions, resulting in a significant computational savings.


After the regions of interest have been identified in block 36, block 38 estimates coarse registration parameters which in this application of the method can include translational and rotational orientation of the low resolution images with respect to a predetermined frame of reference. These translations and rotations are produced by slight motion of the camera 18 occurring in the time between capture of each successive low resolution image 20a, 20b, and 20c by the camera 18.


The registration parameters and the point spread function (PSF) parameters are optimized in block 40 using a marginal likelihood function specified by block 42. The details of the marginal likelihood function are described below. Block 40 produces a best estimate of the registration and PSF parameters for the selected regions of interest. These parameters are then used in block 44 to compute the mean of the posterior distribution for the full image which thereby defines the high resolution image.


To implement the Bayesian super-resolution approach in accordance with this invention, we must fuse K ‘low-resolution’ images each containing M-pixels in order to assemble a single super-resolved image with N pixels where N>M. Here, M is the product of the height and width of the k low-resolution image in pixels. N is the product of the height and width of the super resolved image. This patent application describes the alignment and super-resolution of rectangular images. In fact, the algorithm may be used for signals of arbitrary geometry, including 2D signals with non-rectangular sampling grids (such as a radar signal), 2D super-resolved images wrapped over a 3D surface, or recovery of a high-resolution 1D signal (such as audio) from multiple sources (microphones). Furthermore, notice that M can be constant across all samples (as when images are collected from a single camera) or different (i.e. M(k)) as when images are collected from different cameras and fused using the proposed method.


We must derive a model of the generation of the images (i.e. low-res samples) in order to obtain an observation model. In particular, we know that the camera's optics and sampling process cause a simultaneous sub-sampling and blurring of the scene. Hence, the observation model can be captured as:

y(k)=W(k)x+ε
εj=N(0,β−1)

Here, y(k) is the kth low-res image, x is the full-resolution scene, W(k) is a transform that captures the sub-sampling and blurring of the scene for the kth low-res image (i.e. sampling filter), and ε represents Gaussian noise in the process. Specifically, W(k) captures the focus, point spread function, and spatial transformation between the high resolution image and the kth low resolution image. This relationship between a low resolution image y and the high resolution image x is shown by the model of image generation 46 in FIG. 4.


As W(k) captures the map between the high-resolution image and each of the K low-resolution samples, its dimensions are M×N. Given a Gaussian model for the imaging process, the W(k) values must be normalized to the [0,1] range to conserve energy:










W
ji

(
k
)


=


w
ji

(
k
)






i





w

ji



(
k
)








(
A
)








where




w
ji

(
k
)


=

exp



{

-






v
i

-

u
j
k




2


γ
2



}

.






j



[

1
,
K
,
M

]






i



[

1
,
K
,
N

]








(
B
)








Here, ujk are the hypothesized centers for the sampling array, given the alignment parameters. The vi are the centers for the super-resolved image or signal. Both vi ujk are expressed in the same global coordinate system. In this example, the point spread function is a Gaussian blur kernel






exp


{

-






v
i

-

u
j
k




2


γ
2



}






with a point spread variance of y. In practice, we can use any linear transformations, for example, we can use the Biot-Savart Law to determine a super-resolved image of a current sheet given magnetic field measurements.










u
j

(
k
)


=



R


(

θ
k

)




(


v
j

-

v
_


)


+

v
_

+

s
k






(
C
)







where






R


(

θ
k

)



=

(




cos






θ
k





sin






θ
k








-
sin







θ
k





cos






θ
k





)





(
D
)








and sk is the translation.


Each of the vi is the Cartesian coordinates in the super-resolved image space of the center of the grid cell i (i.e. the center of pixel i). Each vj is the Cartesian coordinates of the center of super-resolved grid cell j for each low-resolution image. Each uj(k) is the location of each vj on the super-resolved image after rotation and shift. In practice, we can use other geometric transformations to determine uj(k), including general affine transformations, perspective transformations, etc.


Finally, the prior is defined by a covariance matrix Zx, which is N×N.











Z
X



(

i
,
j

)


=

A





exp



{

-






v
i

-

v
j




2


r
2



}

.






(
E
)








Given the shift sk and rotation θk for each image and the γ for the PSF, we can compute the marginal likelihood:










lg





P


{



y

(
1
)














y

(
k
)



|
A

}


=


-

1
2




(


KM






lg


(

2

π

)



+

KM





lg





β

+

lg



Z



-

lg



Σ



-


μ
T



Σ

-
1



μ

+

β








k




y


(
k
)

T




y

(
k
)






)













Σ
=


[


Z
x

-
1


+

β


(




k
=
1

K




W


(
k
)

T




W

(
k
)




)



]


-
1






(
1
)






μ
=

β






Σ


(




k
=
1

K




W


(
k
)

T




y

(
k
)




)







(
2
)








μ is the mean of the super-resolved image pixels and Σ is their covariance.


Let P=Σ−1. In order to find the registration and deblurring parameters, we perform an optimization procedure a subset f (skk,γ) of the marginal likelihood function (see appendix) to obtain:










f


(


s
k

,

θ
k

,
γ

)


=



-
lg




P



+


μ
T


P





μ






(
3
)







(


s
k

,

θ
k

,
γ

)

=



arg





max



s
k

,

θ
k

,
γ




f


(


s
k

,

θ
k

,
γ

)







(
4
)







The optimization only needs to be performed on one or more small regions of the image. For example, a 9×9 patch at the center of the image or spaced from the center of the image may be optimized. Once the optimization is performed, one can compute the full image μ using (1) and (2) above. Operation of the processor 24 in implementing Equation (2) to derive the high resolution image is illustrated by high resolution image computation block 48 in FIG. 4.


The processor 24 generates the improved marginal likelihood function ƒ of this invention as shown in FIGS. 5A and 5B. The function ƒ for selected regions of interest in the low resolution images is then optimized to ascertain the registration parameters that can be used to derive the entire high resolution image beyond the regions of interest.


The processor 24 includes registers or storage elements that hold system constants or inputs from previous stages that are used in the maximum likelihood calculation. Those storage elements include a storage element 50 containing the coordinates v, for example, the Cartesian coordinates, of the N pixels making up the high resolution image. They also include a storage element 52 containing the standard deviation r of the covariance matrix Z and a storage element 54 containing the variance β of the modeled noise for the imaging process.


The processor 24 also contains storage elements containing current values of optimization parameters. Storage element 56 contains two dimensional shift parameters sk for each of the K low resolution images being analyzed. Storage element 58 contains rotation parameters θk of the K low resolution images being analyzed. Storage element 60 contains the standard deviation γ for the distribution model of the point spread function (PSF). The processor 24 receives a series of inputs from the camera 18 composed of k low resolution images each of which is comprised of M pixels. This input is stored in storage element 62 in FIG. 5A.



FIG. 5A contains a processing block 64 that is responsive to the contents of storage elements 50, 56, 58, 60, and 62 to compute the transform W(k) for each of the K low resolution images. A transform operation 66 takes the contents of storage locations 50, 56, and 58 and produces the u vector described above which is stored in a storage element 68. Storage element 68 stores a u vector composed of M elements for each of the K low resolution images. A vector subtraction operation 70 takes the contents of the storage elements 50 and 68 and produces the difference value used in the numerator in equation (B) above. Operation 72 in FIG. 5A generates a Gaussian distribution based on the contents of the storage elements 50 and 60 by implementing the rest of equation (B) above. The results of the operation 72 are normalized in operation 74 in accordance with equation (A). The result of operation 72 is a transform matrix W(k) for each of the K low resolution images. Each matrix W(k) is an MxN matrix, where M is the number of pixels in the low resolution image being analyzed and N is the number of pixels in the high resolution image corresponding the low resolution image.


Transpose operation 78 is responsive to the contents of storage element 76 to produce the transpose of each of the K matrices in storage element 76. Matrix product operation 80 multiplies the transpose of the W(k) matrices from operation 78 by the low resolution image information from storage element 62. The result of the operation 80 is input to a vector sum operation 82 in FIG. 5B which implements the summation operation specified in equation (2) above. A matrix product operation 84 in block 64 multiplies each matrix W(k) in storage location 76 by its respective transpose resulting from operation 78. A matrix sum operation 86 in FIG. 5B takes the products from operation 84 and sums them to produce the summation in equation (1) above.


A multiplier 88 multiplies the summation produced by operation 86 by the content of storage element 54 to produce one of the bracketed terms in equation (1).The other term is produced as follows. A vector subtract operation 90 in FIG. 5A is responsive to the content of storage element 50 to produce the difference value in the numerator of equation (E). Operation 92 generates a Gaussian distribution based on the contents of storage element 52 and the result of the vector subtract operation 90 by implementing the rest of equation (E). The result is the covariance matrix Z and is stored in storage element 94. The inverse of the covariance matrix is computed in operation 96.


A matrix sum operation 98 in FIG. 5B is responsive to the result of operation 96 in FIG. 5A and the result of operation 88 in FIG. 5B to produce the precision matrix P [which is the inverse of the matrix Σ defined by equation (1)]. The matrix P is an NxN matrix and is stored in storage element 100. Determinant operation 102 calculates the determinant of the matrix in storage element 100 and logarithm computation element 104 produces the negative logarithm of the determinant derived in operation 102. The result of operation 104 is the first term on the right side of equation (3).


The other term on the right side of equation (3) is generated as follows. An inverse operation 106 in FIG. 5B produces the inverse of the matrix P stored in storage element 100. This produces the matrix Σwhich is stored in storage element 108. A matrix multiplier 110 multiplies the content of the storage element 108 and the result of the vector sum operation produced by vector summer 82. The output of the multiplier 110 is multiplied by the content of storage element 54 in multiplier 112 thereby producing the mean of the posterior distribution μ specified in equation (2). The output of the multiplier 112 is stored in storage element 114. The transpose of the matrix μ is produced by operation 116. The inner product of the precision matrix P from storage element 100, the matrix μ from storage element 114, and the transpose of the matrix μ from operation 116 is generated by a multiplier 118. The output of the multiplier 118 is the second term on the right side of equation (3).


The outputs of operation 104 and operation 118 in FIG. 5B are added together by adder 120 to produce the marginal likelihood functionƒ defined by equation (3) which is stored in block 122. As discussed above, an iteration process is undertaken to find the values of the registration parameters that optimize the value of the function ƒ When the registration parameters that optimize ƒ have been found, then the full high resolution image can be derived by the processor 24 using those registration parameters in equations (1) and (2).


DETAILED COMPARATIVE EXAMPLES
Example 1
Visible Imagery

A sample result of super-resolving the image utilizing Tipping and Bishop's algorithm and utilizing this invention is shown in FIGS. 6A, 6B, and 6C. FIG. 6A is the first frame of the original image sequence. FIG. 6B is the result of applying 9× resolution enhancement through the Tipping & Bishop algorithm. FIG. 6C is the result of 9× resolution enhancement with our method where three small regions spread across the image are used to identify the optimal registration and deblurring parameters.


The original images (10 frames) were captured using a handheld digital camera. Each frame was captured so that it underwent a small rotation either as a pan and/or roll of the camera. The resulting image sequence was used as the inputs to the algorithm described in the previous section. In particular, the color images were converted to grayscale and analyzed for information content. In the next step, three regions-of-interest were identified and used as inputs to the optimization step of the algorithm. Once the registration and PSF parameters are identified the entire set of frames are passed to the full-frame enhancement stage to produce the super-resolved image.


Example 2
In-Flight Long Wavelength Infrared (LWIR) Imagery

We obtained LWIR imagery from Rockwell Collins recorded during a flight in December 2004. In this platform, a thermal infrared camera was mounted in a dichroic-beam splitter configuration to simultaneously collect registered visible imagery. FIGS. 7A and 7B presents sample results obtained using a 4× resolution enhancement. FIG. 7A is the original 320×240 LWIR image of a runway with clouds on the horizon. FIG. 7B is a 4× resolution enhanced image.


Example 3
Indoor LWIR Imagery

To supplement the data made available to us by Rockwell Collins in Example 2, we collected indoor imagery utilizing our own FLIR Systems A20 camera. This camera is capable of imaging long-wave infrared at 320×240 resolution. FIG. 8A is an original 320×240 image. FIG. 8B shows the results of applying the techniques of this invention to 9 low-res frames (no size increase, N=M). This result demonstrates the effectiveness of the multi-frame fusion in improving overall contrast and reducing noise (i.e. information enhancement). FIG. 8C demonstrates 4× resolution enhancement. Note the significant enhancement of information and substantial reduction of aliasing across the image.


The example shown in FIGS. 8A, 8B, and 8C presents further evidence of the effectiveness of the Bayesian approach of this invention in obtaining up to 9× sensor resolution. In the first comparison, we present the first frame of the original sequence in FIG. 8A together with the results in FIG. 8B of fusing all 9 available frames using the Bayesian approach (no resolution enhancement was applied). These results highlight the effectiveness of the registration and deblurring artifacts.


The second comparison shown in FIG. 8C showcases an effective 4× (i.e. N=4*M) resolution enhancement of the original 320×240 imagery.


REFERENCES



  • [1] B. D. Lucas and T. Kanade. An Iterative Image Registration Technique with an Application to Stereo Vision (DARPA) Proceedings of the 1981 DARPA Image Understanding Workshop, April, 1981, pp. 121-130.

  • [2] B.D. Lucas and T. Kanade An Iterative Image Registration Technique with an Application to Stereo Vision (IJCAI) Proceedings of the 7th International Joint Conference on Artificial Intelligence (IJCAI '81), April, 1981, pp. 674-679.

  • [3] S. Baker and I. Matthews, Lucas-Kanade 20 Years On: A Unifying Framework: Part 1, tech. report CMU-RI-TR-02-16, Robotics Institute, Carnegie Mellon University, July, 2002.

  • [4] S. Baker, R. Gross, I. Matthews, and T. Ishikawa, Lucas-Kanade 20 Years On: A Unifying Framework: Part 2, tech. report CMU-RI-TR-03-01, Robotics Institute, Carnegie Mellon University, February, 2003.

  • [5] S. Baker, R. Gross, and I. Matthews, Lucas-Kanade 20 Years On: A Unifying Framework: Part 3, tech. report CMU-RI-TR-03-35, Robotics Institute, Carnegie Mellon University, November, 2003.

  • [6] Michael E. Tipping, Christopher M. Bishop: Bayesian Image Super-Resolution. NIPS 2002: 1279-1286.










APPENDIX 1





Table of Variables
















vi
Coordinate frame of reference for the high-resolution signal or



image and registration for low-resolution signals or images


vj
Coordinate frame of reference for the low-resolution signal or



image and registration for low-resolution signals or images


u
Coordinate, transformed frame of reference for low-resolution



image(s)



ν

Mid-point of the signal (in one embodiment, this could be the



center of the image)


y
Low-resolution set of signals or images


M
Dimensionality of the low-resolution signal (in one embodiment,



this could be the number of pixels (width × height) in the low-



resolution images)


N
Dimensionality of the high-resolution signal (in one embodiment,



this could be the number of pixels (width × height) in a high-



resolution image)


j
Index for referencing the M entries in the low-resolution data


i
Index for referencing the N entries in the high-resolution data


i1
Index for reference entry in the high-resolution data


i2
Index for entry (e.g. pixel) to be analyzed for covariance in the



high-resolution data


β
Variance of the modeled noise for the signal collection or imaging



process


K
Number of low-resolution signals or images


k
Index for low-resolution signals or images


s(k)
Shift (translation) required to register signal or image k to the



global frame of reference


θ(k)
Rotation required to register signal or image k to the global frame



of reference


γ
Standard deviation for distribution model of Point-Spread-Function



(PSF)


A
Generic variable that captures any number of transformation pa-



rameters for example, degree of pin-cushion distortion, degree of



barrel distortion, shift, rotation, degree of blurring kernels



including Gaussian or other diffraction kernels.


Z
Covariance matrix defining the prior of the high-resolution signal



or image


r
Standard deviation of Z


W(k)
Model for the mapping of the high-resolution signal or image onto



the low-resolution signal or image k


μ
Mean of the posterior distribution modeled for the high-resolution



signal or image (this is the estimate for the high-resolution



signal or image once the parameters have been optimized)


Σ
Variance attributed to the noise in the inferred high-resolution



signal or image


P
Precision of the posterior distribution


G
A Gaussian distribution operator (generates an appropriate



distribution based on the input parameters).


T
The Transpose-of-a-Matrix operator


f
The marginal likelihood measurement









APPENDIX 2
Derivation of Marginal Likelihood

The marginal likelihood can be derived from the joint on x (the hi-res image) and y (the low-res images) as follows (to simplify the exposition, all of the images y(1) , . . ., y(k) are represented using a single signal y that is the concatenation of the component images:


First, define the probability on x and the probability of y given x:







P


{
x
}


=


N


{

x
;
0
;
Z

}


=



(

2

π

)



-
n

/
2






Z




-
1

/
2








exp
[


-

1
2




x
T



Z

-
1



x

]










P


{

y
|
x

}


=



(

2

π

)



-
m

/
2







Σ
2





-
1

/
2




exp
[


-

1
2





(

y
-
Wx

)

T




Σ
2

-
1




(

y
-
Wx

)



]








where


:








Σ
2




β

-
1




[



1







0



















0







1



]







Now, the joint probability can be defined as the product of the two probabilities defined above:







P


{

x
,
y

}


=



(

2

π

)



-
m

/
2







Σ
2





-
1

/
2





(

2

π

)



-
n

/
2






Z




-
1

/
2




exp
[



-

1
2





(

y
-
Wx

)

T




Σ
2

-
1




(

y
-
Wx

)



-


1
2



x
T



Z

-
1



x


]






A number of simplifications can be performed:







P


{

x
,
y

}


=



(

2

π

)



-

(

n
+
m

)


/
2




β

m
/
2






Z




-
1

/
2




exp
[



-

1
2





β


(

y
-
Wx

)


T



(

y
-
Wx

)


-


1
2



x
T



Z

-
1



x


]









P


{

x
,
y

}


=



(

2

π

)



-

(

n
+
m

)


/
2




β

m
/
2






Z




-
1

/
2




exp
[


-

1
2




(




x
T



(


Z

-
1


+

β






W
T


W


)



x

-

2

β






y
T


Wx

+

β






y
T


y


)


]







Notice that Σ−1 ≡Z−1 +βWTW


Thus, by completing the square:

2 μT Σ−1 x =2 βyT Wx
Σ−1 μ=βWT y
μ=βΣWTy

We can substitute into our joint:







P


{

x
,
y

}


=



(

2

π

)



-

(

n
+
m

)


/
2




β

m
/
2






Z




-
1

/
2




exp
[


-

1
2




(






x
T



Σ

-
1



x

-

2


μ
T



Σ

-
1



x

+


μ
T



Σ

-
1



μ

+







2


μ
T



Σ

-
1



x

-


μ
T



Σ

-
1



μ

-

2

β






y
T


Wx

+

β






y
T


y





)


]






Then simplifying and collecting terms:







P


{

x
,
y

}


=



(

2

π

)



-

(

n
+
m

)


/
2




β

m
/
2






Z




-
1

/
2




exp
[


-

1
2




(







(

x
-
μ

)

T




Σ

-
1




(

x
-
μ

)



+







2



(

β





Σ






W
T


y

)

T



Σ

-
1



x

-


μ
T



Σ

-
1



μ

-







2

β






y
T


Wx

+

β






y
T


y





)


]









P


{

x
,
y

}


=



(

2

π

)



-

(

n
+
m

)


/
2




β

m
/
2






Z




-
1

/
2




exp
[


-

1
2




(







(

x
-
μ

)

T




Σ

-
1




(

x
-
μ

)



+







2

β






y
T


W






ΣΣ

-
1



x

-


μ
T



Σ

-
1



μ

-







2

β






y
T


Wx

+

β






y
T


y





)


]









P


{

x
,
y

}


=



(

2

π

)



-

(

n
+
m

)


/
2




β

m
/
2






Z




-
1

/
2




exp
[


-

1
2




(




(

x
-
μ

)

T




Σ

-
1




(

x
-
μ

)



+

2

β






y
T


Wx

-


μ
T



Σ

-
1



μ

-

2

β






y
T


Wx

+

β






y
T


y


)


]









P


{

x
,
y

}


=



(

2

π

)



-

(

n
+
m

)


/
2




β

m
/
2






Z




-
1

/
2




exp
[


-

1
2




(




(

x
-
μ

)

T




Σ

-
1




(

x
-
μ

)



-


μ
T



Σ

-
1



μ

+

β






y
T


y


)


]







Now, we apply the marginalization of x (i.e. sum over the terms dependent on x):







P


{
y
}


=



(

2

π

)



-

(

n
+
m

)


/
2




β

m
/
2






Z




-
1

/
2




exp


[


-

1
2




(



-

μ
T




Σ

-
1



μ

+

β






y
T


y


)


]




(




exp


[


-

1
2





(

x
-
μ

)

T




Σ

-
1




(

x
-
μ

)



]





x



)









P


{
y
}


=



(

2

π

)



-

(

n
+
m

)


/
2




β

m
/
2






Z




-
1

/
2




exp


[


-

1
2




(



-

μ
T




Σ

-
1



μ

+

β






y
T


y


)


]




(




Σ



1
/
2





(

2

π

)


n
/
2



)









P


{
y
}


=



(

2

π

)



-
m

/
2




β

m
/
2






Z




-
1

/
2






Σ



1
/
2




exp


[


-

1
2




(



-

μ
T




Σ

-
1



μ

+

β






y
T


y


)


]








This provides the simplified definition of the probability on y. Now, we obtain the marginal likelihood function by taking the log on both sides:










lg





P


{
y
}


=


-

1
2




(


m






lg


(

2

π

)



+

m





lg





β

+

lg



Z



-

lg



Σ



-


μ
T



Σ

-
1



μ

+

β






y
T


y


)















Allowing for K low resolution images, we obtain marginal likelihood function for our problem:







lg





P


{
y
}


=


-

1
2




(





KM






lg


(

2

π

)



+

KM





lg





β

+

lg



Z



-







lg



Σ



-


μ
T



Σ

-
1



μ

+

β






y
T


y





)







Because the marginal likelihood will be used as the optimization function, we can eliminate from it any terms that will remain constant as a function of the alignment parameters (Sk, θk y) that we seek to optimize. In particular the first, second, third, and sixth terms in the above equation are constant under changes to these parameters are thus eliminated from our target equation:







f


(
y
)


=


-

1
2




(



-
lg




Σ



-


μ
T



Σ

-
1



μ


)







Since precision is the inverse ot distribution (P=Σ−1 ), we can avoid performing the computation of the inverse by substituting P:







f


(
y
)


=



-

1
2




(


lg



P



-


μ
T


P





μ


)






-
lg




P



+


μ
T


P





μ




-

g


(
y
)









the result is a function that can readily be optimized by performing a minimization technique on the parameters of interest:









s
,
θ
,
γ



=



arg





max


s
,
θ
,
γ




(



-
lg




P



+


μ
T


P





μ


)






The Title, Technical Field, Background, Summary, Brief Description of the Drawings, Detailed Description, References, Appendices, and Abstract are meant to illustrate the preferred embodiments of the invention and are not in any way intended to limit the scope of the invention. The scope of the invention is solely defined and limited by the claims set forth below.

Claims
  • 1. A computer implemented method of deriving a high resolution image from a plurality of low resolution images, comprising the steps of: initializing one of more alignment parameters to one or more likely values;determining the marginal likelihood of the low resolution images using the one or more alignment parameters, in which the marginal likelihood is a function ƒ of the alignment parameters where: ƒ=−1g|P|+μTPμ,and P is the precision of the posterior distribution and μ is the mean of the distribution modeled for the high resolution image;adjusting the alignment parameters so as to optimize the marginal likelihood determination; anddetermining the high resolution image using the adjusted alignment parameters.
US Referenced Citations (1)
Number Name Date Kind
20040170340 Tipping et al. Sep 2004 A1
Related Publications (1)
Number Date Country
20070130095 A1 Jun 2007 US