The present disclosure relates to the field of radiation detection machines and, more specifically, to a method and an apparatus for identifying photon line-of-responses.
Various types of radiation detection machines are used for a broad array of applications. For example, Positron Emission Tomography (PET) is a medical imaging modality that allows studying metabolic processes of cells or tissues such as glucose transformation in energy. PET uses the coincident detection of two co-linear 511 keV photons emitted as a result of positron annihilation to reconstruct the spatial distribution of positron-emitting radiolabelled molecules within the body. Current PET human scanners can achieve 4-6 mm resolution and the scanner ring is large enough to let the patient occupy a relatively small portion of the field of view. On the other hand, small animal PET scanners have a smaller ring diameter (˜15 cm) and achieve a higher resolution than their human counterpart (≦2 mm) through, for example, an increased detector pixel density. In addition, because of the small diameter ring and large aspect ratio of long (˜2 cm) versus small section (<4 mm2) detectors that are pointing toward the scanner center, error may occur on the position of detection of the annihilation photons (511 keV).
Avalanche PhotoDiodes (APD)-based detection systems, and pixelated detection systems, which allow individual coupling of scintillation crystal to independent Data AcQuisition (DAQ) chains, have been considered for PET scanners, for example for small animal applications. This approach however suffers from poor intrinsic detection efficiency due to the photon interaction processes and from electronic noise problems generated by the APD photodetectors themselves. That noise is a contributor to all measurements and significantly hinders signal processing of the detection.
PET detectors are usually arranged in ring fashion, to allow for optimal radial coverage, and a given scanner often has a stack of such rings to augment its axial field-of-view. The detectors still cover a limited solid angle around the patient or subject, and photons not emitted towards a detector remain undetected. Aside from that, the interaction with matter is probabilistic in nature, and a photon may not necessarily be detected even if emitted toward a detector. Finally, when interacting with matter, a photon can transfer all its energy at once, in which case the process is called a photoelectric absorption, or only part of it. In a partial energy absorption case, the photon undergoes what is then called Compton scattering, where remaining energy is re-emitted in the form of a scattered photon obeying the Compton law, according to equation (1):
where Escattered is the remaining re-emitted photon energy, Eincident is the incident photon energy and θ is the angle between the two photon trajectories.
To properly reconstruct the image, a virtual line is accurately traced on the line spanned by the annihilation photons trajectory. That trajectory is called Line-of-Response (LOR) 62. But because of scattering, probabilistic detection and limited solid angle coverage, the scenarios and combinations of photoelectric or scattered, detected or not detected photons are limitless. It has been shown that for detections involving any Compton scattering, one cannot compute the annihilation trajectory with a certainty level high enough for all scenarios to guarantee acceptable image quality with a sufficiently low computational burden to be practically feasible, and they are currently all rejected as unusable. Only detections involving two photoelectric 511-keV photons are kept, because they involve an unambiguous trajectory computation, but they typically account for less than 1% of all detected photons.
The scanner has consequently a low ratio of usable detections versus injected radioactive dose (known in PET as the sensitivity). That low sensitivity is becoming a critical issue, in terms either of acquisition time, image quality or injected dose, especially in small-animal research where doses can sometimes be considered therapeutically active, or where tracers can saturate neuro-receptors. Sensitivity is critical in small-animal PET, and including more of the discarded detections would increase it. However lowering the energy threshold compromises spatial resolution.
A few efforts have attempted to increase sensitivity by lowering the detection energy threshold and incorporating Compton-scattered photons in the image reconstruction. This has proven to be quite problematic, since recovering the correct photon trajectories and properly determining the sequence of interactions is rendered difficult by the quasi infinite number of scenarios potentially involved. It is difficult to recover the correct trajectory of the annihilation photons, or LOR, among the several possibilities of any given coincidence. In small-animal scanners based on avalanche photodiodes, the image resolution and contrast can be impaired by the relatively low success rate of even the most sophisticated methods.
While the foregoing problems have been described in relation to PET scanners, similar concerns also apply in other types of radiation detection machines capable of detecting photons. Non-limiting examples may comprise Compton cameras, photon calorimeters, scintillation calorimeters, Anger cameras, single positron emission computed tomography (SPECT) scanners, and the like.
Therefore, there is a need for a method and apparatus for identifying line-of-response of photons that compensates for losses of spatial resolution at high sensitivity levels.
Embodiments will be described by way of example only with reference to the accompanying drawings, in which:
The foregoing and other features will become more apparent upon reading of the following non-restrictive description of illustrative embodiments thereof, given by way of example only with reference to the accompanying drawings.
Various aspects of the present disclosure generally address one or more of the problems of identifying line-of-response of photons that compensates for losses of spatial resolution at high sensitivity levels.
The present disclosure introduces a method for use with a radiation detection machine, and an apparatus incorporating a radiation detecting machine, for identifying line-of-responses (LOR) of multiple photons. Photons are detected and measured in the radiation detection machine. The measurements are pre-processed according to known or expected properties of the photons. Pattern recognition is then used to mitigate LOR identification errors remaining in the pre-processed measurements.
In some embodiments, the method and apparatus are for use in positron emission tomography (PET). Discrimination may be made between scattered photons and photoelectric photons lying on the LORs. A PET scanner identifies a plurality of triplets, each triplet comprising a detected photoelectric photon whose energy level is within a range indicative of positron annihilation and two detected scattered photons whose energy sum is also within the positron annihilation energy range. A processor may align the triplets, first by rotation and translation, bringing the photoelectric photons on a same axis. The processor may also rotate further the triplets about the axis of the photoelectric photons, bringing the scattered photons in a same plane. A neural network may be used to mitigate LOR identification errors.
The following terminology is used throughout the present disclosure:
Although explicit analysis of the measurements may be made, mitigation of the LOR identification errors may rely on an implicit representation of the measurements used for pattern recognition. Pre-processing of the measurements of photons may involve geometrical processing, numerical processing and filtering. Such pre-processing facilitates pattern recognition by improving performance, reducing complexity, or both.
In an embodiment, the photons may be detected through photoelectric interaction within a detector. In the same or other embodiment, the photons may be subjected to Compton scattering within the detector. As an example, the radiation detection machine may be a positron emission tomography (PET) apparatus, or scanner, in which some of the detected photons are positron annihilation photons. Identification may be made, in the scanner, of a plurality of positron annihilation photons as photoelectric photons having an energy level within a range indicative of positron annihilation. On the other hand, positron annihilation pholon(s) may further be detected as one or more scattered photons, whose energy sum is within the positron annihilation energy range. The method may discriminate between photoelectric photons and scattered photon lying on the LOR and may further comprise identification of a plurality of photon groups, each photon group comprising a detected photoelectric photon and one or more detected scattered photons. Pre-processing the measurements of the photons then helps a determination of the LORs, based on geometries and numerical properties of a plurality of photoelectric photons and normalizing, within a predetermined range, energy measurements of the photoelectric photons.
In an embodiment, pattern recognition may be performed using algebraic classification methods.
In an embodiment, pattern recognition may be performed using an artificial intelligence technique, for example using a neural network. Mitigating LOR identification errors using pattern recognition of the pre-processed measurements then comprises a pattern recognition analysis of the normalized measurements, executed by the neural network. In some embodiments, the neural network may have, as a part of a pattern recognition process, a feedforward multilayer architecture, a hyperbolic tangent function as a non-linear activation function, and/or be trained using back-propagation of the error when compared to simulated Monte-Carlo data.
Before normalization, the photoelectric photon trajectories may be aligned by rotation and translation, in order to bring the trajectories on a same axis. After this step of aligning and before normalization, rotating further the photoelectric photons about their axis may bring the photon groups in a same plane. Of course, due to measurements impairments and to noise, it is expected that some of the photoelectric photon trajectories cannot be brought on the same axis and that some of the photon groups cannot be brought on the same plane. Pre-processing and pattern recognition applied to photon measurements nevertheless provides sufficient information for the identification of LORs.
In some embodiments, the first processor 404 may align trajectories of the detected photons by rotation and translation, such that the trajectories are brought on a same axis, The first processor 404 may also rotate further the photoelectric photons about their axis to bring the photons in a same plane. The first processor 404 may further normalize the measurements of photons within a predetermined range. In the same or other embodiments, the second processor 406 may comprise a neural network. The neural network may compute the LOR as an output range between −1 and 1. The neural network may further be trained using an optimization algorithm. The neural network may also statistically minimize the LOR identification errors arising from the measurements of photons.
Various embodiments of system for identifying line-of-response of annihilation photons, as disclosed herein, may be envisioned. One such embodiment involves a method and an apparatus for the analysis of photons, for example Compton-scattered photons, in radiation detection machines. The method and apparatus do not require explicit handling of any overly complex, non-linear and probabilistic representations of the Compton interaction scenarios, and are immune to scanner's energy, time and position measurement errors.
In an embodiment, with an energy threshold set as low as 50 keV, triple coincidences analyzed are simple inter-crystal Compton scatter scenarios where one photoelectric 511-keV detection coincides with two detections whose energy sum is also 511-keV. The value 511-keV, or alternately an energy range around the value 511-keV, represents an energy level of positron annihilation. Instead of traditional Compton interaction mathematical models, pattern recognition, which may be implemented as artificial intelligence analysis, for example using a neural network, is used to determine a proper Line-of-Response (LOR) for that coincidence. The following disclosure presents the method for the analysis of Compton-scattered photons and, in particular pre-processing operations used to simplify data fed to the neural network, pre-processing in order to significantly improve LOR computation. The disclosure then presents a Monte Carlo analysis of the method with various point and cylinder sources. A simulated scanner geometry is purposely made to encompass worst-case conditions seen in today's PET scanners, including small diameter, poor photoelectric fraction, and poor 35% Full Width at Half Maximum (FWHM) energy resolution. With the present method and apparatus, LOR identification error is low, in a range of 15 to 25% while sensitivity increases in a range of about 70 to 100%. Images, obtained with overall very good quality, are presented.
In an attempt to improve the efficiency ratio, it is worth recognizing which specific Compton scattering cases are certain enough and can be kept for image reconstruction. However, due to the distribution of the data and the particular operating conditions, that recognition is somewhat impractical using traditional logic, which would impose prohibitive computing power requirements.
Accordingly, a method and an apparatus, which do not require explicit handling of any overly complex, nonlinear and probabilistic representations of the Compton interaction scenarios, and which are immune to the scanner's energy, time and position measurement errors, are used. Artificial intelligence may be used for that purpose.
Block diagram 500 shows that measurements 501 obtained from a radiation detection device, for example radiation detector 402 of
Indeed, the method is an alternative to more “traditional” use of mathematics in other applications, especially when the problem is complex and noisy. Different pattern recognition algorithms have different inherent error mitigation capabilities. For instance, artificial intelligence processes and devices, such as for example neural networks, do not require any explicit representation of the problem and can be trained directly with noisy data. They act as universal approxirriators by way of learning. Simultaneous operation on the inputs, combined with no explicit representation of the problem at hand, gives neural networks good immunity to input noise.
The output of a single-layer neural network is a non-linear distortion of the linear combination of its inputs. In other words, the network forms a hyper-plane in a n-dimension hyper-space defined by the inputs and then performs a non-linear operation on that hyper-plane. In that sense, a neural network with several layers can be viewed as an elaborate non-linear pattern recognition engine, which can compute in which region of the input space a particular input combination lays.
If a large number of measurements pertinent to a given coincidence are fed as inputs to a neural network, then the network can be trained, using those measurements, to recognize the correct and incorrect LORs as separate regions of the input space.
This method is thus suited to resolve the Compton-scattering problem. The application and adaptation of the method to that problem are described hereinafter. Although the present description presents a proof of concept for the application of neural networks to the sensitivity problem in PET, applications of the method are not restricted to that particular case. Likewise, while the present description provides an illustration of a method and apparatus using a neural network, any method or system, such as for example those using algebraic processes or any artificial intelligence system capable of localizing a LOR for a Compton scatter following pre-processing, may substitute for the neural network. References to “neural networks” are presented as examples and should not be understood as limiting.
In an embodiment the method may analyze a highly prevalent Compton scattering scenario, when one 511-keV photon and two 511-keV-sum photons are detected in coincidence. This is a simplest case of Inter-Crystal Scatter (ICS).
The method disclosed herein operates in two phases. In a first phase, pre-processing prepares measurements for subsequent analysis by a pattern recognition process embodied as an artificial intelligence process, for example in a neural network. The neural network itself identifies the photon lying on the LOR in a second phase.
A pre-processing goal is to make the measurements separable into correct and incorrect LOR regions, and it does so in two phases: simplify measurements, and then order the measurements.
Separation is used because of the sheer number of possibilities, even for a simple scenario. Even in the mathematical space defined by all combined measurements available in a scanner, those measurements, when taken as is, overlap and do not directly provide separation between the correct and incorrect LORs.
Ordering forms another pre-processing phase. Photons are simply sorted from the highest energy (photon a) to the lowest (in this case, photon b) to remove another region superposition in the input space arising from random arrival of photon information at the coincidence processing engine.
Enhanced pre-processing can involve normalization of the coordinates and energy. Normalization scales the measurements to known values between ˜1 and 1 or 0 and 1, and produces the positive side-effect that the method is virtually machine-independent. Embodiments of enhanced pre-processing are described hereinbelow.
After preprocessing, the LOR is computed. However, because of measurement noise and imprecision, there still exists some overlap between the regions. The overlap is addressed within a decision as to which photon lies on the LOR. A neural network tackles both tasks. In practice, any technique not using explicit representation of the problem and which is able to abstract noise may alternatively be used.
Each neuron in a network can be described using the traditional representation of artificial neurons of equation (2):
where wn are the weights associated with each input and ƒ is an arbitrary function, often a non-linear function. Neurons can be organized in layers, where the outputs of the neurons in one layer constitute the inputs to the next layer.
In this example, the neural network is fed with simplified measurements pertaining to the ICS coincidence: the x,y coordinates and energy of the two remaining 511-keV-sum photons, for a total of 6 inputs. Table 1 shows information retained from the chosen Compton scenario, forming the 6 inputs, and fed to the neural network.
The network then computes which of photon a (high energy) or photon b (low energy) lies on the LOR, effectively making abstraction of the measurement noise. The following notation is used:
Photon a is a high energy photon before analysis;
Photon b is a low energy photon before analysis;
Photon 1 is one of photons a or b that lies on the LOR after analysis;
Photon 2 is the other one of photons a or b that does not lie on the LOR after analysis.
A neural network needs to be trained. Since there is no efficient method for computing with good certainty which photons are on the LOR, use of real-life data is not appropriate. Simulation data may then be used for training. In this example, the network is trained with data representative of the poorest characteristics obtained with current technology, to prove that the method has widespread application. Thus the energy resolution is chosen as 35% FWHM, the inner diameter of the scanner is set at 11 cm and the detector size is quantized at 2.7×20 mm (in 2D). In this example, the trained neural network has 7 neurons organized in two layers, with 6 neurons on the first layer and a single neuron on the second layer. The function ƒ is in this case a hyperbolic tangent, denoted tan h( ). Weights and bias are listed in Table 2, which shows input weights and input biases for the first layer, and in Table 3, which shows output weights and bias of the second layer.
Although demonstrated here in 2D, the method can be used in 3D. Either the 3D geometries can be brought back in a 2D plane through rotations and translations, or more inputs to the neural networks can be used to accommodate the extra information. Details are provided hereinbelow in the description of embodiments of enhanced pre-processing.
As versatile as the described method might be, all Compton-scattering cases might not be analyzed with a single physical realization of the method. Parallel physical realizations might be used. Also, a coincidence sorting engine may be used for recognizing which coincidence may be analyzed. That sorting engine may also use artificial intelligence techniques, such as for example fuzzy logic.
Since the present method directly computes the correct LOR, traditional mathematical or statistical correction methods 510 used to compensate for the inclusion of erroneous Compton-scattered photons, as shown in
The method described herein may be physically realized through different approaches as, for example and not limited to, offline software running on traditional computers, on Digital Signal Processors (DSPs), as real-time hardware in an integrated circuit or in a Field Programmable Gate Array (FPGA), or as any combination of those means.
The method and apparatus of the present disclosure comprise, amongst others, the following features: The method can analyze Compton-scattered photons. The method can compute, among detected photons resulting from a single disintegration, which ones resulted from the interaction of the original annihilation photons.
Proof of concept of the method has been made by its application in PET, but the method may also be applied to other radiation detection machines. The method does not use any explicit representation (neither certain nor probabilistic) of the phenomenons and scenarios analyzed. While correction is made necessary in ordinary systems by the inclusion of incorrectly analyzed Compton-scattered photons in the reconstruction data, the present method does not require traditional mathematical and/or statistical processing of inter-detector scatter prior to image reconstruction. The method can use measurements readily available in the machine, for example coordinates of detections and detected energy, or indirectly computed physical quantity from those measurements. The method can work on normalized quantities, be machine-independent and hence be ported easily to other machines.
The method uses two phases: A first phase, called pre-processing, simplifies subsequent analysis by reducing the total number of scenarios to be considered. The first phase, among other goals and/or effects, makes the problem separable. In this case, the problem is separable when, in the mathematical space defined by the measurements used, the decision as to which detection was from an original annihilation photon and which was not, that decision forms a neat or noisy boundary in that space, as shown for example in
The method can be assisted, either at the first or second phase, from external help. The external help can take the form, for example, of any sequential or parallel analysis, based on other decision and/or simplification criterions. The external help, for example, can consist in fuzzy classification of one coincidence into different scenarios to be considered for Compton analysis, as shown in
The above mentioned proof of concept shows that, potentially, one would not need explicit handling of the nonlinear and probabilistic representations of the interaction scenarios based on Compton kinematics, while still being somewhat immune to the scanner's energy, time and position measurement errors. It is expressed that correct and incorrect LORs may be recognized by identifying correct and incorrect LOR regions in a pre-processing phase.
In an embodiment, enhanced pre-processing further reduces LOR identification errors. The proposed method is indeed an alternative to more “traditional” mathematics. It does not require any explicit representation of the problem, namely the Compton kinematics law, the various probabilistic models of detection, the incoherent (Compton) scattering effective cross-section and/or the scattering differential cross-section as per the well-known Klein-Nishina formula. It uses learning through direct training with the noisy data. Simultaneous operation on available information, combined with no explicit representation of the problem at hand, gives the method good immunity to measurement impairments like poor energy resolution and detection localization accuracy.
In an embodiment, one inter-crystal Compton scatter scenario offers triple coincidences, where one photoelectric 511-keV detection coincides with detection of two scattered photons whose energy sum is also 511-keV. These triple coincidences, or triplets, may be used to identify a correct LOR. An embodiment of the method analyzes this highly prevalent Compton scattering scenario, where one 511-keV photon and two 511-keV-sum photons resulting from scattering are detected in a triple coincidence, forming a triplet. Alternately, triplets can be selected using a more relaxed criterion, in which the sum of all three detections' energy is 1022 keV. The method recovers the LOR from this simplest case of Inter-Crystal Scatter (ICS). Recitation of Compton scattering by reference to “triplets” is made solely in order to simplify the present description and should not be understood as limiting. The method is not limited to triple coincidences and may be extended to four (4) Compton scatters or more. The method and apparatus presented herein are therefore applicable to multiple Compton scatters. Moreover, the method is not limited to the simple Compton scenario described herein, in which one photon has energy indicative of positron annihilation while two more photons have an energy sum indicative of positron annihilation. The method and apparatus presented herein are therefore applicable to any scenario where it is desired to find a LOR within multiple photon measurements.
As expressed hereinabove, the method proceeds in two phases, comprising a first pre-processing, followed by artificial intelligence computation of the correct LOR, for example in a neural network.
Pre-processing as presented hereinabove can be further enhanced in terms of the method's performance, yielding a simpler neural network that can more readily discriminate the correct LOR. Pre-processing makes the neural network operate in a value-normalized and orientation-normalized coincidence plane rather than in the system-level coordinate reference. Another way to interpret pre-processing would be to express that it removes some or all symmetries and redundancies in the data, so that the multitude of possible triplets in a given scanner are superposed together and become one simple, generic case.
As described hereinbefore, detections are referenced globally, the x and y coordinates being in the transaxial plane, and z representing distance in the axial direction.
In an embodiment, enhanced pre-processing comprise several operations that may be expressed summarily as energy sorting inside a triplet, removal of data superposition in space arising from radial, longitudinal and quadrant symmetries of a scanner, removal of transaxial localization dependence, removal of axial localization dependence, and normalization. Those operations significantly reduce the dimensional complexity of the required neural network. However an embodiment may comprise a subset of the pre-processing operations.
1A. Energy sorting: The detected photons are presented to the network in order of decreasing energy. In this way, the photoelectric photon appears first, and thus its energy has a known value that does not need to be presented to the neural network. However this operation as is may introduce backscatter artifacts in the presence of poor energy resolution because the photoelectric 511-keV photon, intended to be presented to the network first, may sometimes be swapped with a high-energy scattered one. This may be enhanced by adding a geometry criterion to the sort. As shown on
1B, Geometry gating: Operation 1A introduces backscatter artifacts in the presence of poor energy resolution because the 511-keV detection, intended to be presented to the network first, can be involuntarily swapped with the high-energy scattered one. This backscatter artifact can be seen on
2A. Removal of detector symmetry around the scanner's center axial axis: A scanner usually has a high number of symmetries inside a given ring, which can be removed by rotating the whole triplet about the axial axis such that the 511-keV photon consistently ends up with the same coordinates.
2B. Depth-of-interaction (DOI) Processing for the photoelectric detection: Extending the 511-keV detection superposition rationale of operation 2A to radial-DCII-aware detections, the triplet may be translated in the x direction so that the coordinates of the 511-keV detections now lie on top of one another. The x and y coordinates of those photoelectric photons are now trivial and need not he presented to the network.
3. Ring symmetry: Many scanners comprise a plurality of rings, wherein the rings are generally identical. Ring symmetry may be removed by translation of the triplet along the axial axis such that the z coordinate of the photoelectric photon is consistently the same. That z coordinate likewise becomes trivial. At this point information about the photoelectric photon is trivial and can be omitted from the neural network's inputs.
4. Removal of transaxial quadrant symmetry and half-length symmetries: (A) In the transaxial plane, the scanner is symmetric with respect to an imaginary line, called a symmetry line, passing through the scanner center and through the photoelectric photon. That symmetry may be removed by mirroring the triplet about that line such that the y coordinate of the highest energy scattered photon has a positive sign. (B) Similarly, the scanner has an axial symmetry about a plane located at half its length, which may be removed by mirroring the triplet about that line such that the z coordinate of the highest energy scattered photon is consistently positive.
5. Alignment of the triplet axis: Up to this point, the photoelectric photons from the triplets are brought on a same axis and superposed by transformation, but the coincidence planes themselves are still randomly oriented. Defining the triplet axis as the line spanning between the photoelectric photon and the midpoint between the two scattered photons of a triplet, this may be corrected by up to three (3) rotations about the triplet axis. (A) A first rotation is in the transaxial plane, about an axis passing through the photoelectric photon and parallel to the scanner axial direction, by an amount such that the projection in the transaxial plane of the triplet axis coincides with the transaxial symmetry line described in operation 4A, (B) A second rotation is about an axis passing through the photoelectric photon, parallel to the transaxial plane and perpendicular to the scanner radius, by an amount such that the triplet axis itself now lies in the transaxial plane. (C) A third rotation is about the symmetry line described in operation 4(A) by an amount such that the vector between the two scattered photons is parallel to the transaxial plane. At this point, the scattered photons are brought on a same plane, and the z coordinate of the two scattered photons becomes trivial, and need not be presented to the neural network.
6. Scaling of triplet long axis: The triplet axes are now aligned, but the distance between the scattered photons' midpoint and the photoelectric photon is still random. This may be corrected by scaling the triplet along the symmetry line described in operation 4(A), such that the photoelectric photon stays stationary and the midpoints are now superposed. At this point, correct LORs tend to be superimposed on a single line regardless of the annihilation position within the scanner, with the limit that the correct LOR is still unknown and the superposition remains spread somewhat. At this point as well, the resulting trained neural network becomes universal, as the same network can be used with equivalent performance to discriminate the LOR of any dataset of a given scanner regardless of the data with which it was trained, effectively achieving source geometry independence.
7. Dynamic range maximization: Up to this point, the triplet triangle has been transformed to a fixed but arbitrary relationship to the referential origin. Since the 511-keV detection information has become trivial, only the scattered detections' transformed measurements remain pertinent for analysis. To maximize dynamic range utilization in the data presented to the neural network, the triplet may be translated along the x axis so that the scatter detections' midpoint coincides with the origin.
8. Normalization: Because the neural network used herein has a tan h( )activation function whose output ranges between −1 and 1, training converges more easily if the data also lies in that range. Measurements may thus be normalized to their respective maximum.
Computational complexity is a trade-off between pre-processing and the size of the neural networks. However, pre-processing can be performed at little extra cost, for example within a computer graphic display adapter chip, using its dedicated texture manipulation pipelines that are in fact transformation engines. As such, moving computational complexity into the pre-processing phase is not expensive. By opposition, feeding the raw data directly to the neural network would require that it fulfills a task equivalent to pre-processing by itself, requiring a much larger network.
When photon time-of-flight information is insufficiently accurate or unavailable, some theoretically undistinguishable cases arises where the Compton kinematics work both ways, in the sense that the geometry and the energy in the triplet fit such that both the forward scattering scenario and the backscattering scenario are plausible. Such undistinguishable cases in theory only occur in the 170 to 340 keV energy range, or, in terms of scattering angle, between 1.05 and pi radians (60 and 180 degrees).
However, in a real scanner, detector size is finite and, without DOI measurement or other positioning methods, the detection position is quantized, usually to the center of the detector. This increases the energy and angle range of the undistinguishable cases, since it is not possible to compute the scattering angle with sufficient accuracy, either from the measured energy or from the coincidence geometry.
After pre-processing, the neural network learns how to minimize both the identification error arising from the measurement impairment and undistinguishable cases distribution in the training data.
In an embodiment, an algebraic process may be used to mitigate LOR identification errors. The role of the neural network, algebraic analysis process, or other suitable artificial intelligence system, is, within the LOR decision process, to mitigate LOR identification errors due to measurement impairments and to minimize errors in the theoretically indistinguishable cases.
The neural network is fed with the simplified measurements still pertaining to the ICS coincidence: the x, y coordinates and energy of the non-trivial 511-keV-sum scattered photons, for a total of 6 inputs. It computes which of the 2 photons lies on the LOR. Though the foregoing has described enhanced pre-processing, the task of the neural network fundamentally remains as expressed hereinabove, though the neural network itself or other artificial intelligence system may be simplified when enhanced pre-processing is used. Following identification of the photons on the LOR, the original detection coordinates are subsequently backtracked and fed to an image reconstruction software.
A Monte Carlo analysis of the above described method has been made using various point and cylinder sources. Because LOR computation in a real scanner can hardly reach an absolute certainty, simulation data is used to assess the method's performance. Here a GATE model, described at http://www.opengatecollaboration.org/, is used to produce a model of a simple scanner, generating proper list-mode Monte Carlo data.
A custom GATE pulse adder has been coded to circumvent the built-in adder's inclusion in the singles' centroid computation of electronic interactions subsequent to photonic ones (such as the photoelectric photons in the case of Compton scattering). The custom adder reports the energy of electronic interactions at the proper point of photonic interaction, discarding their localization. That way, individual contributors to LOR identification errors can be studied independently because the Compton kinematics remains exact at the singles level.
Although the method is intended to run on a real scanner, study of the method's performance on a real scanner model is suboptimal. Because of detector blocks, of packaging, and of readout specifics, modifying such parameters as detector size, ring size or DOI would require significant rework of the model. It is easier to choose a simpler test geometry. The simulated scanner is also purposely chosen with very poor performance, representative of the poorest characteristics obtained with current technology, in order to demonstrate that the method may be portable to most machines.
The energy resolution was tested at 0% (perfect) and 35% (worst-case) FWHM. The inner diameter is set at 11 cm, since a small diameter along with rather large detectors worsens angle errors between close detectors. The detector size is quantized at 2.7×2.7×20 mm3. The scanner is assumed to have 8 rings of 128 detectors, and Gd2SiOL (OSO), a material with relatively low stopping power, is employed to obtain a low photoelectric fraction. The detectors are not grouped. They are just disposed around the ring. Individual readout of each detector is made necessary by the need to discriminate the scattered photons in adjacent detectors.
For doublets, defined as coincidences consisting of two 511-keV photoelectric detections, the energy window for perfect energy resolution is set at 500 to 520 keV, while at 35% resolution the window extends from 332 keV to 690 keV. For triplets, the low energy cut is set at 50 keV. With perfect energy resolution, triplets are considered valid when one photon lies in a 500-520 keV range, indicative of positron annihilation, and the total energy sum lies within the 1000-1040 keV range. At 35% FWHM resolution, triplets are retained when at least one photon lies in a 332-690 keV range, and the total energy sum is within the 664-1380 keV range.
In this embodiment, the neural network has a standard feedforward architecture, and the non-linear activation function of layers is the hyperbolic tangent function.
In this embodiment, the neural network is trained by backpropagation of the error, using the well-known Levenberg-Marquardt quasi-Newton optimization algorithm. Training uses a variable-size data set ranging from 600 to 15,000 random triplets indifferently, with similar outcome. Training is stopped using a validation set, and ends when the generalization capability of the network has not improved for 75 epochs.
The neural network is trained with discrete target values of −1 and 1 to indicate which of the scattered photons actually lies on the LOR, but in practice the value 0 is used as a discrimination boundary, everything lying on one side of the boundary being assumed belonging to the discrete value on that side.
Weights and biases within the neural network are initialized randomly before training. Like with many non-linear optimization methods, training is thus a non deterministic process, and no information can be recovered from the dispersion of the training results. After at least 15 training tries, the neural network with the best performance is simply retained.
Preliminary tests assessed the performance versus network complexity trade-off. Those tests used point sources and very small data sets with usually less than 20,000 triplets.
A radiation source was moved across a Field Of View (FOV) of the scanners to measure the LOR identification error rate, defined as the ratio of the number of triplets where the wrong scattered photon was computed as being on the LOR, over the total number of triplets. The sensitivity increase was also measured and defined as the ratio of the number of triplets over the number of doublets in a given test set. The sensitivity increase is a direct measure of the scanner sensitivity increase that would result from the inclusion of triplets in the image reconstruction.
The data set used for those tests is relatively small, with usually less than 75,000 triplets.
A cylinder source of 20 mm radius and 20 mm length was also simulated using approximately 250,000 triplets. For that cylinder a binary IDI set at half the detector height (10 mm) was also tried. Furthermore, smaller detectors were also tried, and the scanner was modified to have 11 rings of 172 detectors sized at 2×2×20 mm3, resulting in approximately the same FOV, also with binary DOI.
The method has been implemented in Matlab, from MathWorks™, for those tests and, again, in this embodiment, the resulting network complexity is 6 inputs (energy as well as x and y coordinates of the two scattered photons), 6 neurons on a single hidden layer, and a single output neuron, or [6 6 1],
The same cylinder configuration was used to reconstruct images, using at perfect energy resolution 5.64 million doublets and 3.85 million triplets, and at 35% FWHM energy resolution, 9.89 million doublets and 5.23 million triplets.
“Tomographic Image Reconstruction Interface of the Université de Sherbrooke” (TIRIUS), a reconstruction software described at http://www.pages.usherbrooke.ca/jdleroux/Tirius/TiriusHome.html, uses a 3D Maximum-Likelihood Expectation Maximization (MLEM) method with a system matrix approximated with Gaussian tubes of responses measuring 2.25 mm FWHM ending in the detector centers. Ten (10) iterations were sufficient to obtain the images.
The reconstructed Region Of Interest (ROI) measures 90 mm in diameter and 21.6 mm axial length. Images have 96×96×24 voxels, for an equivalent voxel size of 0.9375×0.9375×0.9 mm3.
A resolution-like source was also used to reconstruct images, with 6.21 million doublets and 4.66 million triplets at perfect energy resolution, and with 11.2 million doublets and 6.26 million triplets at 35% FWHM energy resolution. The resolution phantom has 8 cylindrical hotspots 5.0, 4.0, 3.0, 2.5, 2.0, 1.75, 1.50 and 1.25 mm in diameter and 20 mm in length, of equal activity density per unit volume, and arranged in symmetrical fashion at 10 mm around the FOV center.
Images were zoomed in 10-times post-reconstruction using bicubic interpolation,
Because of the sheer size of the files involved in image reconstruction, the process was ported to C++ programming language. However, pre-processing operations 5(B), 5(C) and 6 were not coded for simplicity. For the image results, the networks thus have 8 inputs (the 6 inputs previously stated plus the z coordinates of the two scattered photons), 10 neurons on a first hidden layer, 10 neurons on a second hidden layer and a single output neuron, or [8 10 10 1].
A preliminary analysis of the performance achievable along with the required network complexity is presented in Table 4, which represents performance and network complexity achieved as a function of used pre-processing operations. It should be observed that a performance attained with no pre-processing is similar to “traditional” methods employing explicit Compton kinematics models in similar conditions.
In the rightmost column of Table 4, the first number within each square bracket identifies a number of data inputs, the last number identifies a single output neuron, and each number in between identifies a number of neurons in distinct hidden neuron layers. Table 4 demonstrates that improvements in reduction of LOR identification error and neural network complexity are already possible even with a limited subset of the pre-processing operations listed hereinabove.
Table 5 summarizes performance results for a point source moved across the FOV for energy resolutions of 0% and 35% FWHM.
When the source is on the scanner axis, computing the correct LOR is in theory trivial since the LOR consistently passes through the scanner center. Most of the time, the network is able to learn that from the data, and the LOR identification error is low, below 10%.
Because of pre-processing, the LOR identification error shows otherwise no statistically significant dependence on the source position, consistently ranging roughly from 18 to 21%. The variability observed is attributable at least in part to the nondeterministic results of network training, as explained earlier. This is significant improvement over “traditional” methods, which were not able to achieve better than 38% LOR identification error.
The energy resolution shows no statistically significant impact on LOR identification error.
Returning to
With ideal energy resolution the impact of scanner geometry (
With degraded energy resolution (
Table 6 shows the cylinder phantom performance results, for a 40 mm diameter, 20 mm length cylindrical source.
A DOI resolution of 10 mm, as simulated here, has little impact on performance. It is anticipated that DOI does not improve the method when its resolution is worse than the average distance travelled by the scattered photon (
On a logarithmic scale (
In the triplet images, the hotspots look slightly oblong, but again this is dependent on using a proper system matrix, as shown on
Otherwise, the simulated triplet images presented herein are of comparable quality to doublet images, even with slightly poorer statistics, which means the sensitivity of a scanner could be substantially increased without compromising image quality.
As another embodiment example, the method has been implemented offline on a LabPET™ scanner.
The method presented hereinabove shows very good performance with low 1.0R identification error (15-25%), high sensitivity increase (70-100%) and images of very good quality. Real-time implementation of the method, including a simple neural network, may run in an FPGA, with more computationally intensive pre—processing offloaded to another processor such as, for example, a graphics processing unit.
The above described method can be used in real-time or offline, and its implementation can take several forms like, for example, software, DSP implementation or FPGA code. Results from the method, or the method itself, may eventually serve or aid in the analysis of other phenomena in the machines such as, for example, in random coincidence rate estimation.
Those of ordinary skill in the art will realize that the description of the method and apparatus for analysis of Compton-scattered photons in radiation detection machines are illustrative only and are not intended to be in any way limiting. Other embodiments will readily suggest themselves to such skilled persons having the benefit of this disclosure. Furthermore, the disclosed method and apparatus can be customized to offer valuable solutions to existing needs and problems of losses of spatial resolution at high sensitivity levels.
In the interest of clarity, not all of the routine features of the implementations of the method and apparatus are shown and described. It will, of course, be appreciated that in the development of any such actual implementation, numerous implementation-specific decisions are routinely made in order to achieve the developer's specific goals, such as compliance with application-, system-, and business-related constraints, and that these specific goals will vary from one implementation to another and from one developer to another, Moreover, it will be appreciated that a development effort might be complex and time-consuming, but would nevertheless be a routine undertaking of engineering for those of ordinary skill in the fields of artificial intelligence and of positron emission tomography having the benefit of this disclosure.
Although the present disclosure has been described hereinabove by way of non-restrictive illustrative embodiments thereof, these embodiments can be modified at will within the scope of the appended claims without departing from the spirit and nature of the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
2719381 | Oct 2010 | CA | national |
Number | Date | Country | |
---|---|---|---|
61408299 | Oct 2010 | US |