The present disclosure relates generally to radiography and digital imaging, as for use in nondestructive testing and other applications. More particularly the disclosure relates to high energy imaging systems, such as fast neutron imaging systems, and to computer-implemented image processing techniques to improve clarity of the image.
This section provides background information related to the present disclosure which is not necessarily prior art.
Fast neutron imaging is an active area of research as it offers unique imaging modalities compared with traditional x-ray and thermal neutron imaging, such as the ability to nondestructively discern features in low-Z objects shielded by thick high-Z materials. In digital fast neutron imaging for example, fast neutrons are passed through a target onto a scintillator whose light is collected by a CCD camera. Scintillator thicknesses of multiple centimeters are required to detect MeV level neutrons with viable efficiencies. Collimated neutron beams are thus typically used because uncollimated and divergent beams will induce a cone beam effect in the resulting image due to both the finite thickness of the target and the scintillator itself. However, collimating a neutron source through increased source distance-to-target drastically reduces the neutron flux incident on the target and results in significantly longer imaging times.
This section provides a general summary of the disclosure, and is not a comprehensive disclosure of its full scope or all of its features.
This disclosure presents a solution that allows uncollimated neutron imaging to be used by removing the cone beam effect caused by the finite thickness of the scintillator via computer-implemented post-processing imaging reconstructions. The technique advantageously exploits a maximum entropy method (MEM) algorithm that minimizes artifacts by explicitly making the reconstructed image as noiseless as possible.
Adding a Bayesian prior to the reconstruction further improves quality because it changes the default image from uniformly flat to the specified prior. For a spherical source which does not impart uniform neutron flux over the entire scintillator, the introduction of the correct Bayesian prior reduces artifacts and improves contrast. This improvement is especially prominent in the peripherals of the image.
In this disclosure, we describe our MEM based computer-implemented technique and apparatus for removing the cone beam effect, and illustrate how to apply the algorithm to a simulated fast neutron radiography image as a proof of concept. The algorithm is able to remove the part of the cone beam effect in the simulated image caused by the thick scintillator. With the help of our disclosed algorithm, the reconstructed image appears as if taken with an infinitesimally thin scintillator (ITS), and thus with cone beam blurring being substantially removed.
In accordance with the disclosed techniques the system for uncollimated digital radiography employs a processor-implemented image deblurring apparatus to deblur and clarify an observed image produced by high energy radiation source that projects a substantially conical beam through a target and onto a scintillator, the scintillator when irradiated without target producing an observed background image. The image deblurring apparatus thus includes a processor programmed to generate a probability transfer matrix by employing a Monte-Carlo simulation based on the scintillator-source geometry. The apparatus further includes a processor programmed to process the observed background image to generate a Bayesian prior. In accordance with the disclosed technique a processor is programmed to use the probability transfer matrix and the Bayesian prior to perform a maximum entropy method reconstruction of the observed image to thereby generate a reconstructed image.
Further areas of applicability will become apparent from the description provided herein. The description and specific examples in this summary are intended for purposes of illustration only and are not intended to limit the scope of the present disclosure.
The drawings described herein are for illustrative purposes only of selected embodiments and not all possible implementations, and are not intended to limit the scope of the present disclosure.
a is a top down view of a Monte-Carlo Neutron Particle (MCNP) model;
b is an expanded view of a test object;
a is an MCNP image with a 2-cm thick scintillator;
b is an MCNP image with the ideal thin scintillator;
c illustrates the MCNP background image;
a and 9b are graphs showing a uniformly flat image before (9a) and after (9b) applying the disclosed linear scintillator model;
a is a histogram of an MCNP background image's neutron counts per pixel;
b is a histogram of a background image's neutron counts per pixel after residual boosted support vector regression (RB-SVR) processing;
c is a graph showing the difference in neutron counts per pixel between original and RB-SVR processing
a is a photographic depiction of an MCNP simulated image with finite thick scintillator, and also showing a square region in upper right corner containing pixels from which SNR values are estimated as shown in
b is a photographic depiction of a restored image with a probability transfer matrix (PTM) built with 2e6 neutrons per voxel;
c is a photographic depiction of a restored image with a PTM built with 2.5e7 neutrons per voxel;
d is an MCNP simulated image with an ideal thin scintillator.
Corresponding reference numerals indicate corresponding parts throughout the several views of the drawings.
Example embodiments will now be described more fully with reference to the accompanying drawings.
Before providing a detailed explanation of the system for uncollimated digital radiography, the following brief summary introduction may be helpful. Refer to
To begin, an object of interest 14 is placed within the conical beam 12 of a radiating energy source 10, and an image is thus generated in the scintillator 16. In real-world applications, the scintillator has a finite thickness and this thickness together with the conical shape of the beam and other effects causes blurring when the image is viewed by a suitable image capturing apparatus such as a CCD camera or CCD array 20. The scintillator 16 and camera 20 are disposed within a lightproof enclosure 22 with a mirror 18 positioned in the enclosure, allowing the camera to be placed off-axis from the scintillator 16. The energy source 10 may be a CF-252 point source, which has the advantage of continuously producing a neutron beam from a relatively small package. The enclosure 22 can have a nominal dimension d on the order of about 1 meter, making the entire instrument readily portable.
The CCD camera 20 is connected to a data input port of computer workstation 24, which includes a display for rendering the captured image after being processed according to the techniques discussed below to remove or substantially reduce the blurring effects by performing computer image processing. If desired a remote server computer 26 can supply pre-computed scintillator response model(s) 28 as will be more fully described below.
As noted, the observed image is seen by the CCD camera through a thick scintillator [TS]. The computer processes this observed image, rendering it as if it were seen through an infinitesimally thin scintillator. The computer applies a transform to the TS image by inverting a linear matrix, [A]. Essentially if y is the TS image and x is the ITS image, the linear equation is simply:
Ax=y.
In real world situations, there is also an additive noise component ε; thus the above relationship is more accurately stated:
Ax=y+ε where ε˜N(0,σIMN).
Note that noise is modeled in the computer to include a tuning parameter σ.
The above linear equation, while easy to express is ill posed and difficult to solve. Thus the preferred computer algorithm uses a maximum entropy method MEM technique for finding an approximate solution. The MEM technique replaces the Ax=y equation with one that is solvable, namely:
The MEM algorithm requires several inputs: A, y, σ and b, where A is the probability transfer matrix that will be discussed below, y is the observed image (from the thick scintillator), σ is a tuning factor to model the additive noise and b is the background image. The background image is the image from the scintillator when there is zero information (no target, but only light produced when the scintillator is hit with the neutron beam).
The probability transfer matrix A represents the geometry of the setup, where a cone-shaped neutron beam passes through the target object(s) and produces illumination in a thick scintillator in the near field of the beam (e.g., source detector distance is within one order of magnitude compared to the detector (scintillator) thickness). Entry Aij in A represents the probability a neutron incident in voxel i will contribute to light-output in voxel j.
The background image is generated from actual observed background image data and thus contains some information about the cone beam effect.
The MEM algorithm operates on the observed image data y (from the thick scintillator TS) represented in vector form. Essentially the two dimensional M by N array of the scintillator image is represented as a vector 1 . . . MN. The actual MEM algorithm is essentially doing two things simultaneously. It is maximizing the Shannon entropy, which has the effect of flattening the image (to remove the cone beam effect) and suppressing noise. Simultaneously it is enforcing the constraint that the resulting output, after transform by A, remains statistically similar to the observed image.
To achieve these concurrent goals, the MEM algorithm needs some further ingredients. It needs the Bayesian prior representation of the background image. It needs the Bayesian prior for each cell in the image. The MEM algorithm also needs to be given the linear scintillator model A.
The Bayesian prior b for each cell is determined by starting with the observed background image and by applying a smoothing algorithm to it. The preferred smoothing algorithm being Residual Boosted-Support Vector Regression RB-SVR. It would also be possible to synthetically generate the background image using Monte-Carlo simulation, if desired, but considerable computing time (e.g., days) may be required.
The linear scintillator model A is supplied as a probability transfer matrix that relates (voxel-by-voxel in the scintillator) neutron counts for each incident voxel with each terminal voxel. The probability transfer matrix is generated using Monte-Carlo simulation. The simulation uses data from a suitable data source, such as the Evaluated Nuclear Reaction Data library.
Thus the computer-implemented technique for reversing the cone beam effect uses three components:
MEM algorithm [needs as subroutine inputs Bayesian prior and Probability Transfer Matrix]
Monte-Carlo simulation of the Probability Transfer Matrix
Synthesized “ideal” noiseless background image expressed in the MEM algorithm as the Bayesian prior.
The MEM algorithm, with these other two components attached, then takes as input the observed image y and applies an inversion to generate the reconstructed image x. The reconstructed image x corresponds to the image that would result if an infinitesimally thin scintillator had been used, and thus has the cone beam effects removed.
With this introductory overview in mind, we turn to a detailed discussion of the disclosed technology, beginning with a discussion of the cone beam effect.
The cone beam effect (CBE) is a generalization of the concept of geometric unsharpness. The effect is dependent on the thickness of the radiation detector used, and the angle formed by the source to detector ray. When the source to detector distance is within an order of magnitude compared to the detector thickness, CBE becomes the prominent factor in image degradation.
As shown in
There are two ways to mitigate the CBE given a fixed d. The first is to reduce the source cross sectional area and for a feature focused at the surface of the detector, we can see this will at most reduce the blur from β1 to β2. The other approach is to reduce the thickness of the detector and from
Having now discussed the cone beam effect, we next present a detailed overview of the disclosed MEM algorithm, as adapted according to our technique.
Our strategy for removing the cone beam effect from the uncollimated images comprises of MEM in conjunction with two supporting subroutines. MEM requires an input of a zero information image for noise suppression purposes and we accomplish this by modeling the ideal background image. Also, MEM requires a linear operator linking the ITS image to the observed image and we compute this operator through a Monte Carlo simulation of the neutron scintillator. Finally, we input the thick scintillator image with both the ideal background image and the linear scintillator model to reconstruct the ITS image.
For validation, we tested the algorithm with a typical fast neutron radiography problem that involves a low Z material hidden behind a high Z shield. We used MCNP simulation to generate both the ITS image and the thick scintillator image since MCNP can predict scattering by the high Z shield, the low Z object, and also scattering within the scintillator itself. To allow for future validation, we used realistic geometries and materials which can be readily replicated in a lab setting,
The simulation setup consists of a Cf-252 source imaging a composite test object placed behind a 1 inch thick lead shield. The test object contained plastics of different densities as well as metal features,
We ran two MCNP simulations, one for the thick scintillator and one for the ITS. In both runs, we used the neutron distribution shown in Eqn. 1, and a 2 mm diameter Cf-252 source. For the 2 cm thick detector, a 500×500 mesh heating tally was imposed on a Bicron BC 400 scintillator with the assumption that the heating tally directly translates the light output of the scintillator. We used this MCNP run as the observed image input for our algorithm,
For the ITS, a 500×500 radiography tally with MCNP's hybrid point detector model, which returns the incident neutron energy flux at each pixel, was used. This resulted in an essentially noiseless image which is shown in
In this section, we describe the maximum entropy method along with its two subroutines in detail.
Maximum Entropy Method
The problem of going from an observed image to an ITS image can be posed as a linear inversion. First, we order the pixels in the observed image (size M×N) from 1 to MN and reformulate the image into a vector based on the ordering. Next, we assume there exists a linear operator relating x, the (MN×1) vector which represents the ITS image, and y, the observed image. We term this linear operator, A. Finally, we are left with a simple linear model relating the ITS image to the observed image, Eqn. 2.
Ax=y (2)
While Ax=y can be solved via Gaussian-Jordan elimination, in practice we are faced with a noised perturbed version of the original equation, Eqn. 3
Ax=y+ε where ε˜N(0,σ·IMN) (3)
Given the additive noise ε, Eqn. 3 is ill posed and requires regularization for a viable solution. Many regularization techniques, such as ridge regression of the L1 loss, have been proposed for this ill posed problem but most lack a sound theoretical basis. Out of these regularization techniques, the maximum entropy method (MEM) stands out because it allows the input of background image or Bayesian prior. MEM uses this Bayesian prior and the observed mage to select a reconstructed image, x, which is most similar to the Bayesian prior while remaining statistically alike to the observed image when transformed with A. To accomplish this, MEM requires these inputs: A, y, σ and b, the zero information image. In most imaging applications of MEM, b is assumed to be the background image and here we do the same. We define bi as the value of the background image for cell i. For numerical purposes, we normalized the background image such that Σi bi=1. Additionally, σ is a tuning parameter trading off noise suppression versus accuracy in the reconstructed image.
Once the inputs are defined MEM approximates a solution to Eqn. 3 by solving Eqn. 4, which is always guaranteed to be well posed.
Eqn. 4 states that the optimal reconstruction is found by maximizing the Shannon entropy of the reconstruction while making sure that the reconstructed image, when operated on by A, is still statistically similar to the observed image. Intuitively, maximizing the Shannon entropy flattens the reconstruction and grants MEM its noise suppression characteristics. Also, the statistical similarity criterion forces the reconstruction to approximately satisfy Eqn. 3 and is what enables MEM to invert A. A derivation of MEM, is given in Appendix B.
In an added note, the Eqn. 4 can be modified in two ways to facilitate operator tuning during real world operations. First, a σ matrix can be loaded into memory allowing each (yi−ΣiAi jxj) to be drawn from a Normal distribution of a different variance, N(0,σi). The change will alter the constraint in Eqn. 4 to below.
This allows the incorporation of information that some pixel counts have more variance than others (i.e. higher regional neutron flux). If less local fine tuning is required, a scalar factor, k, can be added multiplied to the right hand side of the constraint equation to control the amount of global noise suppression vs goodness of fit. The modification is shown below.
Finally, it should be noted that the second modification is a restriction of the first as setting σi=kσ, ∀i will accomplish the same effect as the second modification.
Bayesian Prior Subroutine
Since maximum entropy reconstruction depends heavily on the Bayesian prior to flatten the image, it is crucial that we have an intensity map of the scintillator response for when there is no object of interest. This map also needs to be as smooth as possible as any noise in the Bayesian prior will be magnified in the reconstructed image. We start by noting that the neutron flux is only quasi-radial symmetric because the CF-252 source is not spherical and is large enough to make point particle approximations sufficient.
While we can generate a noise-free background image through a Monte-Carlo routine, the amount of simulated particles required for smooth convergence will require many computer days. Also, this approach assumes that the simulation geometry and experimental geometry are one and the same. Any geometrical artifacts from incomplete calibration will result in errors propagating through the reconstruction. As we expect calibration errors to happen when we eventually deploy this technique, we searched for techniques based on smoothing an observed background image. The top two candidates were inverse square regression and residual boosted support vector regression (RB-SVR). Previous work with SVR in image processing showed great success but we found RB-SVR demonstrated better empirical performance,
Linear Scintillator Model
For our linear scintillator model, we opted for a full discrete treatment instead of any continuous approximation. First, we partitioned the scintillator into voxels, volumetric pixels. Next, we assumed that the distance to first interaction of a neutron in the scintillator is an exponential random variable with its mean equal to the mean free path of the neutron. Finally, we assumed that all neutrons give up all their energy on the first interaction so there is no scattering within the scintillator.
Working off the above assumptions, we defined the bijection between each pixel in the observed image and a voxel on the scintillator,
If any neutron interacts with the scintillator in voxel i, we assume pixel i increased its intensity count by a constant factor. The prior assumption is justified because each voxel interacts with enough neutrons to ensure central limit convergence, and the ratio of standard deviation of neutron count to mean neutron count is less than 0.05.
In the framework of the discrete voxel scintillator, we wish to solve the number of neutrons incident on each voxel given the number of neutrons terminating in each voxel. Neglecting the intensity variance, the number of incident neutrons is exactly the response of the ideal infinitesimally thin detector. Thus, removing the cone-beam effect is equivalent to solving for the number of incident neutrons.
Probability Transfer Matrix
Given our voxel scintillator model, we now clarify our linear model, Eqn. 2. We redefine x as the vector consisting of the incident counts for each voxel, and y as the vector consisting of the termination counts for each voxel. Now A becomes the probability transfer matrix (PTM) between incident voxel and termination voxel, such that [A]ij denotes the probability a neutron incident on voxel i will terminate in voxel j,
While A can be determined analytically for certain neutron emission distributions and scintillator compositions, a presently preferred approach obtains A through Monte Carlo simulation, whose pseudocode is listed below and also shown and discussed in connection with
For sampling the emission energy, en, we utilized Eqn. 1 and to calculate the mean free path given neutron energy, we utilized the Evaluated Nuclear Reaction Data library, and found the mean free path of neutrons with energy ranging from 1e-10 to 20 MEV in Bicron BC-400 through its molecular formula. We show the mean free path in
Finally, as a check, we applied A to a uniformly flat image. Based on the inverse square law, we expect to see a concave sink extending out from center of the image after transformation, confirmed in
To demonstrate the effectiveness of our computer-implemented image processing technique, we next present some numerical results obtained through computer simulation, as described below.
First, to estimate the Bayesian prior of the scintillator response with no object, we used MCNP to simulate a background image as discussed above.
Next, we generated two versions of the PTM in order to study the effects of Monte Carlo noise on the reconstructions. The first PTM has 2e6 neutrons per voxels and the second PTM has 2e7 neutrons per voxel. Given a desktop Intel I7 950, we were able to simulate 2e6 neutrons per second and build a PTM matrix with numNeutrons set at 2e7 in 140 CPU hours.
Finally, after combining the RB-SVR Bayesian prior and the observed image, we used MEM to reconstruct our estimate of the original image, shown in
For both reconstructions, we observed suppression of CBE and restoration of hard edges. Comparing
In this formulation, the reconstructed image is completely defined by the A, b, and σ. We would like to stress that the reconstruction is defined in terms of a global optimization problem and with A, b, and σ fixed, the reconstruction is also completely independent of any initial conditions to the MEM problem. In an effort to explore the robustness of the algorithm to noise, we added increasing levels of Gaussian white noise to
Next, as an objective reference, we estimated the signal to noise ratio SNR for the original image and its corrupted copies. For the estimation, we chose a 50×50 pixel background area in the image,
As shown in
Referring to
In addition to storing the probability transfer matrix, the processor also acquires and stores a noiseless background image, used as the Bayesian prior in the MEM algorithm. This may be done by the processor reading M×N pixels of background image data from the CCD camera 20 (step 110). The background image data are acquired without an object of interest being placed in the beam path. The processor then performs an RB-SVR noise removal process on the acquired background data (step 112) and the result is stored in memory at step 114 as the noiseless background image. If desired the processor used to generate the background data may be different from the processor used to generate the probability transfer matrix, and both of these processors may be different from the processor used to analyze the image data from an object of interest. In other words, the probability transfer matrix and background data may be generated by processors other than the processor used to perform the image analysis. These data would be generated ahead of time and downloaded into the memory associated with processor being used to perform the analysis of an object of interest.
To perform the analysis of an object of interest, processor 50 acquires image data from the CCD camera 20, which it stores in memory as at step 115. This data represents the TS observed image data y as described in connection with the discussion of the MEM algorithm above. To perform the MEM algorithm, the processor accesses memory to iteratively read the observed image data value y, the probability transfer matrix value Aij and the Bayesian prior value b, as depicted at step 116. The processor then performs the maximization algorithm (MEM) using these read values to generate a reconstructed image value x, which is stored in memory. The processor applies the MEM algorithm as discussed above, so that the optimal value is chosen for x, given the constraint that x, when operated upon by the probability transfer matrix [A] is statistically most similar to the Bayesian prior b.
Once all values of observed image data y have been processed to generate corresponding optimized values of x, the optimized values are displayed on the display of computer 24. These optimized values correspond to what the image of the object of interest would look like had an infinitesimally thin scintillator (ITS) been used.
The probability transfer matrix plays an important role in obtaining a the ITS image. In a preferred embodiment, the probability transfer matrix is generated by a processor that has been programmed to perform the Monte Carlo simulation steps depicted generally at 106 in
The process may be performed by processor 50 (or by another processor) by performing steps 202 through 220, iteratively until all MN pixels have been modeled. Thus at step 204, a value α is measured and stored in memory, corresponding to the point on the voxel surface where the neutron enters the scintillator. Next at step 206, a value β is measured and stored in memory, corresponding to the point from which the neutron was emitted from the source's surface. In this case, the neutron source may be modeled as a point source; or if greater accuracy is desired, the actual size and shape of the source may be taken into account.
For each neutron emitted from β and striking the scintillator surface at α, the energy en is obtained and stored. This energy value may be obtained by downloading from a suitable data source 210, such as a data source storing data from the Evaluated Nuclear Reaction Data Library. See A. Trkov, M. Herman, and D. Brown, Endf-6 formats manual, data formats and procedures for the evaluated nuclear data file, endf/b-vi and endf/b-vii,” Tech. Rep. BNL-90365-2009, Brookhaven NL, 2011.
Next the processor calculates the mean free path λn for the obtained energy value en at step 212 and the penetration distance pn is then calculated at step 214. Now, armed with the emission point β, the striking point α the processor performs a ray tracing operation (step 216) to find the termination point given the sample penetration distance pn. The processor then calculates (step 218) which voxel j contains as an element the neutron n and the probability transfer matrix [A]ij is written to storing a probability value [1/numNeutrons] corresponding to the voxel containing the neutron (step 220).
It has been shown by simulation that the cone beam effect can be practically removed from an uncollimated fast neutron image through a simple model of the scintillator response and source to scintillator geometry.
Residual boosted support vector regression was used to smooth the background intensity and large scale Monte Carlo simulation was used to generate a linear approximation of the scintillator response to a near field divergent neutron source. Finally, the maximum entropy method was used to invert the scintillator response from an MCNP simulated observed image.
Overall, the disclosed reconstruction techniques could reduce exposure times or required source intensity without undesirable object blurring on the image by both allowing closer source-to-detector distances to maximize incident radiation flux and the use of thicker scintillators with higher efficiencies. In addition to neutron imaging the technique should also be applicable, with the right probability transfer matrix PTM, for high energy gamma or x-ray radiography using thick scintillators.
The disclosed techniques may be further verified by calibrating an experimental setup which mimics our simulation geometry and applying the algorithm to an empirical image.
In addition,
As a further aid in understanding the techniques employed herein, the reader may wish to refer to the following Appendices, in which Appendix A describes support vector regression theory (SVR) and Appendix B describes the maximum entropy method (MEM).
Support vector regression (SVR) is a machine learning technique which can approximate nonlinear functions.
Given a training set, T:(xi,yi), . . . , (xN,yN)εRm×R, we wish to approximate a function ƒ(xi)≈yi s.t.
ƒ(x)=<θ(w),θ(xi)>+b (A.1)
Where <·,·> denotes the inner product and θ(xi) is a nonlinear mapping from Rm to a higher dimensional space. The parameters w, and b are solved via minimization of the following cost function, Rƒ.
Here, Remp measures empirical risk, |w|2 measures model complexity and C is a regularization parameter which balances model complexity and training set performance. We define Remp as
where |•|ε; termed as the E-insensitive loss is defined as
|yi−ƒ(xi)|ε=max{0,|yi−ƒ(xi)|−ε} (A.4)
Thus, as a result of Eqn. A.4, regression estimates which err by less than a certain amount do not factor into the cost function resulting in an insensitive tube around the regression estimates,
We wish to reformulate Eqn. A.2 as a quadratic programming problem for tractable computation so we introduce slack variables ξi+, and ξi−. The two slack variables, ξi+, and ξi−, measure the deviation of observation i above and below the surface of the ε tube respectively. This formulation is termed the ε-SVR.
y
i−ƒ(xi)≦ξi++ε
ƒ(xi)−yi≦ε+ξi−
ξi−, ξi+≧0, i=1, . . . , N (A.5)
While we now have a well-posed quadratic programming problem, we are required to set the parameter ε a priori. This is unsatisfactory because E is highly data dependent and can range over [0,∞]. To remove the burden of selecting ε, ν-SVR introduces a new parameter ν and redefines the optimization problem as follows.
y
i−ƒ(xi)≦ξi++ε
ƒ(xi)−yi≦ε+ξi−
ξi−, ξi+≧0, i={1, . . . , N} (A.6)
Now, ε is a variable featured in the optimization problem and is no longer a parameter. While we have substituted ν for ε, ν is bounded ε[0,1] and has an intuitive meaning as the maximum fraction of yi's allowed to err by more than ε.
Kernel Functions
Much of the SVR's advantage comes from its projection of data into higher dimensional space, φ(•). Let K(xi,xj)=<φ(xi),φ(xj)>. K(xi,xj) is called a kernel function and it provides the benefit of a high dimensional space without explicit computation. For example, the second order polynomial kernel KP2(xi, xj)=(xi·xj)2 is equivalent to <φ(xi),φ(xj)> with φ:R2→R3 s.t.
Kernels can also be chosen with a priori knowledge and since we know that the true background intensity map will be smooth, we choose a kernel function which favors smoothness.
K(xi,xj)=e(−γ∥xi−xj∥2) (A.7)
This kernel is known as the Gaussian radial basis function (RBF) and the θ associated with this kernel projects the data into an infinite dimension Hilbert space. However, this kernel also introduces an addition parameter γ which must be optimized during training.
Residual Boosting
While ν-SVRs have great native performance, they can be combined through boosting for even better results. In residual boosting, a particular form of ensemble learning, the regression target is iteratively simplified so the machine learner can capture higher order effects in successive iteration.
Residual boosting accomplishes this by defining ti,k, the regression target for observation i at iteration k, as
Thus, the kth iteration of the machine learner only tries to capture the residuals of the prior k−1 iterations. For the final estimate, we take the sum of all the regression functions, Eqn. A.9.
K, the maximum number of boosting iterations, is decided in advance and we found that regression accuracy converge for K>3.
We now present a detailed discussion of the MEM algorithm used to invert the linear operator [A]. This algorithm has the added bonus of reconstructing the flattest image possible given the observed data, reducing the number of post reconstruction artifacts.
To derive the MEM formulation, we assume that we have K balls (neutrons) and when thrown, each ball is independent and is equally likely to land in any of the MN buckets (boxels). We do not know the actual distribution of the balls among the buckets but the best guess would be the distribution with the highest probability. Since each particular distribution is a realization of a multinomial random variable, we can find the most probable distribution by maximizing the probability of a certain distribution happening.
Maximizing P(x1, . . . , xMN) is equivalent to maximizing any monotonic transform of P(x1, . . . , xMN), so we choose to maximize
Since K˜1e18, we can use Sterling's approximation, log(K!)≈K log(K)−K, on Eqn. B.3.
Looking at Eqn. B.4, we see that it is equivalent to the Shannon entropy of a multinomial distribution, Eqn. 18.
Thus, we see maximizing the image entropy is equivalent to finding the most probable image. However, we cannot blindly apply the balls and buckets model to our problem as each voxel possesses a different solid angle area, and thus receives different amounts of neutron flux; therefore we modify each pi in Eqn. B.5 with a Bayesian prior, bi, to correct for the neutron flux difference, Eqn. B.6.
Looking at Eqn. B.6, we see that unconstructed optimization will simply return the bis as the reconstructed image. Thus, we add the Chi-Square constraint, Eqn. B.7, to guarantee the reconstructed image is statistically similar to the observed image when linked through the linear operator.
Combining Eqn. B.6 and Eqn. B.7, we arrive at the full formulation of MEM.
To solve Eqn. B.8, a large scale convex optimization problem involving MN variables, iterative preconditioned gradient descent or quasi-Newton methods can be used. For brevity, we will not restate the various numerical algorithms.
The foregoing description of the embodiments has been provided for purposes of illustration and description. It is not intended to be exhaustive or to limit the disclosure. Individual elements or features of a particular embodiment are generally not limited to that particular embodiment, but, where applicable, are interchangeable and can be used in a selected embodiment, even if not specifically shown or described. The same may also be varied in many ways. Such variations are not to be regarded as a departure from the disclosure, and all such modifications are intended to be included within the scope of the disclosure.
This application claims the benefit of U.S. Provisional Application No. 61/674,223, filed on Jul. 20, 2012. The entire disclosure of the above application is incorporated herein by reference.
The United States Government has rights in this invention pursuant to Contract No. DE-AC52-07NA27344 between the United States Department of Energy and Lawrence Livermore National Security, LLC for the operation of Lawrence Livermore National Laboratory.
Number | Date | Country | |
---|---|---|---|
61674223 | Jul 2012 | US |