The invention relates to a method and a device for identifying emitting species emitting X- or gamma radiation, and preferentially for quantitatively determining the contribution of each species to the radiation. It relates to the technical field of nuclear instrumentation, and more specifically X- and gamma spectroscopy.
The invention applies whenever it is necessary to identify radionuclides and/or atomic species exhibiting an X-fluorescence in a sample or in an environment: radiochemistry, chemical and radiochemical analyses, decontamination and dismantling of nuclear sites, etc.
X- or gamma radiation is understood to be an electromagnetic radiation of energy greater than 100 eV; more particularly, in the context of the invention, the focus is on a radiation of energy lying between approximately 2 keV and 2 MeV. The distinction between X- and gamma radiation is not based on the nature of the radiation, but on its origin: an X-radiation has an electronic origin (typically, electronic transitions involving internal energy levels) whereas a gamma radiation is of nuclear origin. Also, the invention makes it possible to identify radionuclides (isotopes) from their gamma radiation spectrum and atomic species that are not necessarily radioactive based on their X-fluorescence spectrum. The atomic species that can be identified from their X-fluorescence spectrum and the radionuclides that can be identified from their gamma spectrum are indicated jointly by the expression “emitting species”. Hereinbelow, the case where the emitting species are radionuclides will be more specifically considered, but, unless indicated otherwise, everything stated can also be applied to X-emitters.
The conventional methods for identifying radionuclides based on their gamma radiation are based primarily on the extraction of the photoelectric peaks in the spectrum deriving from the acquisition of the detector or on the study of zones of interest in the spectrum. See for example (Lutter 2018).
In this type of method, spectral signatures characteristic of the radionuclides of interest (gamma or X-ray lines) are fitted to a Gaussian module and a continuous background to deduce therefrom their position in terms of energy. One or more peaks present in the spectrum are compared simultaneously to the tables of the nuclear lines of the radioactive isotopes (or to the X-fluorescence lines of the different elements), which allows them to be identified.
These methods present the drawback of requiring recourse to an expert to identify the zones of interest of the spectrum. Furthermore, they require sufficient photon statistics to clearly reveal the peaks to be identified. The information contained in the continuous background of the spectrum (Compton background) is lost.
Such methods can be accompanied by artificial intelligence techniques, and notably neural networks. As early as 1995 (Vigneron 1995) it was proposed to analyze gamma emission peaks by means of a neural network of “multilayer perceptron” type to determine the rate of enrichment of the uranium. More recently, (Yoshida 2002) applied a neural network of “multilayer perceptron” type to the identification of radionuclides in a mixture, and (Medhat 2012) to a quantitative radiochemical analysis. The use of artificial intelligence techniques does not however make it possible to overcome all of the drawbacks of these approaches, and notably the need for an “expert” preprocessing to determine the regions of interest of the spectra to be analyzed.
Other techniques allow all of a spectrum to be analyzed, without the need for expert preprocessing.
For example (Olmos 1991) proposes using a neural network to identify radionuclides in a mixture based on the entire gamma radiation spectrum emitted by the latter. The article does not specify the type of neural network actually used; furthermore, the required signal level must be relatively high (approximately 104 photons detected only in the main emission peak).
(Bobin 2016) uses “spiking neural networks” to determine the proportion of radionuclides in a mixture based on the gamma radiation spectrum emitted thereby. The method requires an accurate model of the spectrum of the source and is not robust to changes of configuration (variations of the rate of attenuation/scattering of the radiation). Furthermore, it does not make it possible to identify the radionuclides: also, it is not possible to know whether the atomic species found in low proportions are absent or are really present in low proportions.
(Kamuda 2017) uses a neural network of perceptron type, trained on synthetic data, to determine the proportion of each radionuclide contributing to the gamma radiation emitted by a source. Intrinsically, the method makes it possible to identify only a limited number (for example 4) of distinct radionuclides.
(Abdel-Aal 1997) uses neural networks of abductive AIM type to determine the relative intensities of several sources based on spectra with low resolution, in which regions of interest are identified automatically. As in (Bobin 2016), it is not possible to know whether the atomic species found in low proportions are absent or are really present in low proportions.
US 2019/034786 discloses, generally and with little detail, the use of a multilayer perceptron to detect or identify radionuclides. The perceptron can have several outputs, each corresponding to an emitting species or to a group of emitting species.
CN 109 063 741 also discloses the use of neural networks to detect or identify radionuclides. More specifically, the document discloses the conversion of the spectra into two-dimensional images by means of a Hilbert curve before the application of a neural network.
The invention aims to overcome the abovementioned drawbacks of the prior art. More particularly, it aims to allow the identification of any number of emitting atomic species in a mixture based on the X- or gamma spectrum thereof, and do so independently of the configuration of the scene (presence of absorbent or scattering material between the source or sources present in environment and the spectrometric detector) and without sophisticated preprocessing including the identification of regions of interest of the spectrum.
Advantageously, furthermore, the invention aims to determine the probability of presence of each of the emitting species, and preferentially an uncertainty on this probability.
Advantageously, furthermore, the invention aims to determine the proportion in the signal of each identified source, and give an uncertainty on this proportion.
Advantageously, furthermore, the invention aims to allow the use of different types of spectrometric detector (CdTe, CdZnTe, Hgl2, Nal, HPGe or any type of gamma spectrometer operating within the keV to MeV energy band, etc.), provided that it is capable of measuring and restoring the energy of each detected event, event by event, or at the very least, supplying the spectrum of the measured energies.
Advantageously, furthermore, the invention aims to make it possible to avoid lengthy and complex fine energy calibration operations and overcome any drifts of the detector or of the operational conditions over time. Ideally, a factory calibration with an error of the order of 2% should be sufficient to identify the emitting species and, if necessary, determine their proportion.
According to one aspect of the invention, at least some of these aims are achieved through the use of a plurality of neural networks of convolutional type, each charged with identifying a single emitting species. As a variant, as will be discussed in more detail later, each neuron will be able to be charged with identifying a distinct group of emitting species.
According to another aspect of the invention, at least some of these aims are achieved through the use of a second plurality of neural networks of convolutional type, each charged with determining the proportion of a single, already identified emitting species (or group of emitting species).
According to another aspect of the invention, these neural networks are trained on synthetic spectral data.
According to another aspect of the invention, the spectra supplied as input to the neural networks for identifying emitting species are converted to the logarithmic scale beforehand.
According to another aspect of the invention, a “dropout” operation (random switching off of a fraction of the neurons of the internal layers) is applied to these neural networks in order to determine the levels of uncertainty on the presence, and if necessary on the proportion, of each emitting species.
According to another aspect of the invention, the different neural networks each have a pair of neurons of complementary outputs, using, for example, an activation function of “softmax” type.
According to another aspect of the invention, once the emitting species are identified, this information is used to perform an energy “self-calibration” of the detector.
One subject of the invention is therefore a method for identifying emitting species emitting X- or gamma radiation in a scene, the method comprising the following steps:
Another subject of the invention is a computer program product comprising instructions which, when the program is run by a computer, lead the latter to implement steps b) and subsequent steps of such a method.
Yet another subject of the invention is a device for identifying emitting species emitting X- or gamma radiation in a scene, comprising:
According to particular embodiments of such a device:
The signal processing circuit processing signals generated by the radiation detector can also be configured or programmed to:
The signal processing circuit processing signals generated by the radiation detector can also be configured or programmed to determine optimal values of said calibration parameters by maximization of a correlation function between an acquired spectrum and a theoretical spectrum calculated as a function of the emitting species determined as being present in the scene.
Each convolutional neural network can be associated with a single respective emitting species.
Each convolutional neural network can comprise a pair of complementary output neurons.
The X- or gamma photons detected can exhibit an energy within at least a part of the range lying between 2 keV and 2 MeV.
The device can also comprise said spectrometric detector (SPM).
The attached drawings illustrate the invention:
The device of
The spectrometric detector SPM comprises a sensitive element ES, preferably pixelated, an analog reading circuit EL and an analog-digital converter ADC.
The spectrometric detector acquires photons emanating from a scene SC in which there are various radionuclides (or atomic species emitting a radiation of X-fluorescence) Si(S1 . . . SN) of activity Ai(A1 . . . AN)− the identity and the relative abundance of which are a priori unknown—potentially situated at different distances from the detector. An absorbent or scattering material ABS can be situated between one or more sources and the detector. Each radionuclide S1 emits photons Pi with energies Ek,i. For example, P1(Ek,1) is used to denote a photon of energy Ek,1 emitted by the first radionuclide S1.
The sensitive element ES can be of any type suitable for detecting X/gamma photons emanating from the scene within at least a portion of the spectral range 2 keV-2 MeV. It can be, for example, a semiconductor pixel made of Si, Ge, CdTe, etc., a scintillation sensor, a perovskite sensor, etc.
The sensitive element generates a signal in the form of a physical quantity, generally electrical, representing the energy of each X- or gamma photon received (typically, it is a current pulse of which the electrical charge is proportional to that energy). The electronic reading circuit EL performs a conventional analog preprocessing of the signals coming from the sensitive element: amplification, shaping of the pulses, detection of their height or energy. The analog signals SA coming from the reading electronics are converted to digital format by the converter ADC.
Preferably, the photons are detected one by one, their energy is recorded and date-stamped by the detector. This information, contained in the digital data stream FDN from the converter ADC, is transmitted to the processing circuit CTS. The latter can be embedded or remote; in the latter case, a telecommunications link must be established between the spectrometric detector and the processing circuit.
The signal processing circuit can comprise one or more generic or dedicated processors for the digital processing of signals, programmed appropriately. As a variant or in addition, it can comprise dedicated digital circuits. In general, furthermore, it comprises random-access memories for storing the data to be processed (notably, the events generated by the spectrometric detector) and random-access and/or read-only memories for storing calibration parameters, neural network coefficients, etc. Generally, the invention is not limited to a particular signal processing circuit production technology. In the description hereinbelow of this circuit, the breakdown into blocks and modules is purely functional, these blocks and modules do not necessarily correspond to distinct physical elements.
In the embodiment of [
The digital data stream FDN from the spectrometric detector is received by the identification module of the circuit CTS and stored in an event memory MEV. A spectra construction module MCS converts these events into spectra using a calibration table TE, stored in a memory, which makes it possible to associate a photon energy with each detection event. This calibration table, established beforehand, can be relatively imprecise, with an error on the energy values which can be as high as 2%. As will be explained in detail hereinbelow, the self-calibration module AE makes it possible to update the calibration tables to improve their accuracy. In the case of a pixelated detector, the calibration is done pixel by pixel, with a different calibration table for each pixel.
Each spectrum is in fact an energy histogram: an energy value is attributed to each event; the events are grouped into energy classes (“bins”) and the spectrum is composed of the number of events belonging to each class. “Spectrum” is therefore understood to mean the spectral distribution of the photons detected over a given time interval, the duration of which can be set or chosen by the user.
A data transformation module MTD then performs spectrum preprocessing operations. In the embodiment considered here, two distinct preprocessing operations are performed.
On the one hand, each acquired spectrum is converted to the logarithmic scale, then normalized. The thus-transformed spectrum, SNlog, is used for the identification of the emitting species. More particularly, let Si be the number of events in the i-th energy class. A first operation is applied.
Then, the normalized spectrum is calculated as follows:
The use of a logarithmic scale makes it possible to reveal spectral structures of low amplitude but which effectively contribute to the identification of the emitting species.
On the other hand, the spectrum is also normalized to norm1 (which consists in ensuring that the sum of the values associated with each energy class is 1) without logarithmic conversion. The thus-transformed spectrum, SN1, given by:
is used to determine the proportions of the identified emitting species.
More generally, the preprocessing can be a scale transformation. In all cases, and contrary to the teaching of CN 109 063 741, there is no change of the dimensionality of the data—the preprocessing spectra remain one-dimensional.
The spectrum SNlog is supplied as input to a module CBNN_ID which implements a plurality of “Bayesian” convolutional neural networks M, each taking all of the spectrum as input and supplying at its output a value PPj indicative of a probability of presence in the scene of a particular radionuclide (identified by the index “j”), and a level of confidence on that probability. There is therefore a distinct neural network for each emitting species that is desired to be able to be identified. The structure and the operation of these Bayesian convolutional neural networks will be described in more detail hereinbelow.
A thresholding module MS is used to determine what emitting species are considered as being effectively present in the scene. For that, the thresholding module takes into consideration the probability of presence and, possibly, its level of uncertainty.
At the output of the identification module, a list LEA of emitting species present in the scene is therefore obtained.
The spectrum normalized to norm 1, SN1, is supplied as input to a module CBNN_PRO which implements a plurality of “Bayesian” convolutional neural networks, each corresponding to a particular radionuclide. Each neural network of the module CBNN_PRO which corresponds to a radionuclide “j” already identified as being present in the scene takes as input all of the spectrum and supplies at its output a value PROj indicative of the proportion of this radionuclide in the recorded signal, and a level of confidence on this proportion. In other words, the value PROj corresponds to the percentage of photons recorded which can be attributed to the radionuclide “j” (through a misuse of language, the expression “the proportion of the radionuclide “j” in the scene” is more simply used, but that is exactly the same only in particular conditions, if all the emitters are at the same distance from the detector and in the presence of the same absorbent/scattering materials). There is therefore a distinct neural network for each atomic species. These neural networks can be of the same type as those used for the identification of the emitting species.
At the output of the identification module, a list LP of proportions of the atomic species identified is therefore also obtained.
The neural networks of the modules CBNN_ID and CBNN_PRO have been previously trained by supervised training from a synthetic database BDS, that is to stay simulated data, generated by the training module APP. The training produces two databases of parameters, PRN_ID and PRN_PRO, characterizing the neural networks for identifying and determining the proportions, which are stored in memories.
The synthetic spectra are created by a Monte-Carlo simulator (block SS in
Each emitting atomic species is simulated independently and the simulated data are restored in the form of a list of events giving the energy deposited by each photon in the spectrometric detector. Next, mixtures of different radio elements are generated, also synthetically (MIX block): for that, energies in energy lists simulated for each radioelement of the mixture are randomly drawn, with different statistics and different proportions. The proportion of photons attributed to each emitting atomic species is recorded.
An intentional “decalibration” is then applied for the neural networks to learn the effect of a drift of the calibration laws. A gain g is drawn in a Gaussian centered on 1 and of standard deviation again, as is an offset off in a Gaussian centered on 0 and of standard deviation Goff. Over all the energies E of one and the same mixture, the new decalibrated energy Ed is then calculated by:
E
d
=g(E−off) [Math. 3]
Several decalibration synthetic spectra of each source and source mixture are recorded in the database BDS used for the training.
The blocks A_JD1—A_JDM implement the supervised training algorithms of the M convolutional neural networks of the block CBNN_ID, and produce M sets of parameters each characterizing these neural networks; these parameters are stored in the abovementioned database BDN_ID. Likewise, the blocks A_PRO1—A_PROM implement the supervised training algorithms of the M convolutional neural networks of the block CBNN_PRO, and produce M sets of parameters each characterizing these neural networks; these parameters are stored in the abovementioned database BDN_PRO.
The synthetic sources of the block SS are also used by the module AE in order to implement a process of self-calibration of the spectrometric detector. By knowing the radionuclides present in the source and their proportions (data supplied by the identification module ID), it is in fact possible to use these synthetic sources to calculate an “expected” spectrum. An algorithm of adaptive mesh or genetic algorithm type is then used by the block ECC to find a set of calibration parameters which maximizes the correlation between this expected spectrum and the one supplied by the module MCS. These parameters are used to update the calibration tables used by the module ID. It will be noted that certain blocks (MEV, MCS, SS, TE) appear at several points in [
The block ECC implements the energy calibration algorithm by correlation described in (Maier 2016).
This calibration does not require the intervention of the user and relies only on the measurement performed in real time in the scene to be analyzed.
The Bayesian convolutional networks used for the identification of the emitting atomic sources and the determination of their proportions will now be described with the aid of
As is known per se—see for example (Aloysius 2017) —a convolutional network comprises a first part composed of several convolution layers making it possible to extract different characteristics of the input spectra and a second part consisting of a multilayer perceptron (in the literature, the expression “fully-connected” layers is also used).
In the embodiment of
The first convolutional layer CC1 comprises 16 convolutional neurons. Each convolutional neuron performs the following operations:
The size 2 Max Pooling operation consists in taking the spectrum and conserving only one channel in every two, the greater one. That reduces the dimension of the spectrum by a factor of 2.
The batch normalization operation—known per se, see (loffe 2015) —consists in collecting the data from the training database in subsets called batches, performing a training iteration on each of the batches (as will be described later), then normalizing in average and variance the outputs of each neuron corresponding to the batch considered. Once the training is done, the normalization parameters over the entire database are saved to apply the same normalization when the neural networks are used for the identification and the determination of the proportions of the emitting species.
The activation function ReLU is defined by:
ReLU(x)=max(0,x).
The output of the first convolutional layer therefore consists of 16 spectra of characteristics SP1,0-SP1,15 of 1000 dimension. These data are supplied as input to the second convolutional layer CC2 which is in all respects similar to the first, except that it operates on data of 1000 dimension.
The output of the second convolutional layer therefore consists of 16 spectra of characteristics SP2,0-SP2,15 of 1000 dimension. These data are supplied as input to the third convolutional layer CC3 which is in all respects similar to the first two, except that it operates on data of 500 dimension and its output therefore consists of 16 spectra of characteristics SP3,0-SP3,15 of 250 dimension.
The latter are flattened to form a vector SPA4 of 4000 elements which are all supplied as input to each of the 20 neurons of a perceptron layer CP using, like the convolutional layers, an activation function of ReLU type.
The last layer, or output layer, CS of the perceptron is preferentially composed of two neurons with an activation function of “softmax” type in which, using xi to denote the output of the neuron before the activation function:
j being the index which identifies the two output neurons.
For the identification networks, the first neuron of the output layer gives a number lying between 0 and 1 which represents the absence or the presence of the radioelement in the mixture, while the second neuron is the one's complement of the first neuron.
For the networks evaluating the proportions, the first neuron gives a number lying between 0 and 1 which corresponds to the signal proportion of the radioelement while the second neuron corresponds to the signal proportion of all the other elements.
The use of two complementary output neurons is not essential, but advantageous as will be discussed later with the aid of
The parameters or coefficients of the neural networks (kernels of the convolutional layers CC1, CC2 and CC3, synaptic weights of the perceptron layers) are learned, in a supervised manner, using, for example, a gradient descent algorithm. For example, in the embodiment discussed below with reference to
More specifically, in this embodiment, the cost function used and minimized during the training phase for this identification network is cross binary entropy, defined as:
L(ypred,yreal)yreal·log(ypred)+(1−yreal)·log(1−yreal) [Math. 6]
in which yreal is the true value that the neuron should have at the output and ypred is the value predicted by the network.
The training of the network for evaluating proportions is performed with norm 1 normalized synthetic spectra. The output to be predicted by the first neuron is the proportion of the radioelement in the mixture and that of the second neuron is the proportion of the other radioelements.
The cost function used and minimized for this network is the mean quadratic deviation:
L(ypred,yreal)=(ypred−yreal)2 [Math. 5]
The two cost functions are evaluated by average over all of the examples of the database.
The “Bayesian” character of the neural network of [
Step a) comprises the acquisition of a spectrum by means of the spectrometric detector SPM.
Step b) comprises the transformation of the data prior to the application of the identification neural networks—that is to say, in a preferred embodiment of the invention—the conversion of the spectrum to the logarithmic scale and the normalization thereof.
Step c) comprises the application of the convolutional neural networks, preferably Bayesian, to the spectrum transformed by step c).
Step d) comprises the identification of the emitting species present in the scene as a function of the outputs of the identification neural networks.
Step e) comprises the transformation of the data prior to the application of the proportion neural networks—that is to say, in a preferred embodiment of the invention—the normalization of the spectrum as norm 1.
Step f) comprises the application of the convolutional neural networks, preferably Bayesian, to the spectrum transformed by step e).
Step g) comprises the determination of the proportions of the emitting species identified in step d) as a function of the outputs of the proportion neural networks.
Step h) corresponds to the self-calibration, which updates the calibration tables used for the implementation of step a).
The prior step app) corresponds to the training of the neural networks used in steps c) and f).
The method can be stopped at step c) if only the identification of the emitting species is required, even at step g) if the self-calibration is not used.
The invention was tested by using, as spectrometric detector, a semiconductor-based pixelated spectro-imager CdTe and its optimized reading circuits (Gevin 2012). The spectro-imager comprises 256 pixels at a pitch of 800 μm with a thickness of 2 mm, an energy dynamic range of 1 keV to 1 MeV, and an energy resolution of 0.7 keV (total width at mid-height) to 60 keV.
Similar results were obtained by using a spectro-imager of the same type but having a thickness of 1 mm and a pitch of 625 μm. The change of detector does not even require retraining of the neural networks, which confirms the robustness of the identification method according to the invention.
The spectro-imager was installed in a vacuum chamber, cooled by Peltier modules to a temperature of approximately −15° C. and subjected to a polarization field of 300 V/mm. The signal recorded by the reading circuits is transmitted to a programmable digital circuit Zynq via an interface card CIF. The digital circuit stores the events and transmits them to a control computer on which the spectro-identification process is performed. The events are returned in the form of a list containing the interaction pixel, the non-calibrated deposited energy and the interaction date. The sum spectrum of all the pixels is pre-calibrated with a calibration table determined in advance by a single initial calibration in the laboratory (as a variant, the theoretical transfer function of the acquisition chain could be applied).
The synthetic data were generated using semi-analytical simulation by modelling the detector and its very near environment, namely the structure which contains the detector. The photon-material interactions in the environment and in the detector were simulated and the energy deposits in the detector as well as their position were recorded. A Geant4 simulator can also be used to perform these simulations.
For each energy deposit, the statistical energy fluctuations were taken into account by drawing an energy into a Gaussian centered on the deposited energy E0, of standard deviation
σ=√{square root over (εwE0F)}, in which [Math. 7]
εw=4.42 eV is the electron-hole pair creation energy in the CdTe and F=0.15 is the Fano factor for the CdTe.
To take account of the loss of charge, the weighting potential in the detector was calculated by solving the Poisson equation ΔUw=0 in terms of cylindrical coordinates on the detector using the finite differences resolution method, with the limiting condition of setting 1 on the electrode on which the signal is induced and 0 on the other electrodes and on the cathode. The simulation domain was extended to two electrodes on either side of the electrode on which the signal is induced. As a variant, a finite elements resolution method could have been used. Then, the loss of charge CCE was calculated as a function of the depth z0 using the Hecht equation:
in which, μeτe=1.14.10−3 cm2.V−1, μhτh=3.36.10−4 cm2.V−1, L=2 mm and V0=600 V.
The energy ECCE at the interaction depth z0 was calculated by
It would also have been possible to calculate the loss of charge in 3D in the detector and not only in 1D to obtain a more faithful model of the response of the detector which is not however essential for the training of the neural network.
The error linked to the electronic noise was taken into account by drawing the final energy recorded in a Gaussian centered on ECCE with a standard deviation σ=εw rms in which rms=50 e—corresponds to the number of electrons rms due to the electronic noise.
To create the lists of mixtures of atomic species, between 10 and 10 million photons were drawn in each list of events of each individually simulated radioelement, the drawing being performed with restoration, which means that the same energy can be drawn several times. The decalibration was performed by taking σgain=0.0055 and σoffset=0.5 keV. In this way, 200 000 synthetic mixtures were produced.
The synthetic spectra were obtained by making a histogram composed of 2000 channels of 0.5 keV width over an energy dynamic range of 0 to 1 MeV.
The neural networks used for the identification and the determination of the proportions are as described above with reference to
The detection threshold is set at 0.5 on the median, which is natural if it is considered that the output of an identification network represents the probability of presence of the corresponding emitting species. Note that the statistical dispersion of the outputs is negligible for the radionuclides actually present and, for the others, low enough so as not to create any risk of false positives.
The results show that the performance levels of the networks with one output neuron are equivalent to those of the networks with two output neurons in the case of the sources of 241Am and 152Eu. In the case of the sources of 57Co and 22Na, the relative performance levels depend on the number of photons, and in the case of the 22Na, the performance levels of the network with one output neuron exceed those of the network with two output neurons by very little. Finally, in the case of the 133Ba and 137Cs, the neural networks with two outputs surpass the neural network with a single output.
Generally, at least in the configuration tested, it appears preferable to use a neural network with two outputs.
In all cases, it will be noted that the rate of good responses is close to or greater than 0.9 as soon as the number of photons reaches 103.
An important aspect of the invention is the separation of the steps of identification of the emitting species and of determination of their proportions, which are implemented by distinct sets of neural networks.
That makes it possible to use different spectrum preprocessing operations for the two operations. For the identification, a “logarithmic normalization” is preferably used, which makes it possible to give importance even to the structures which are composed of few photons, which is particularly interesting because the probability of interaction of the photons in the detector decreases greatly when the energy of the photons increases. For the network for evaluating proportions, on the other hand, the invention preferably uses a linear normalization (norm 1) which “ignores” the small structures. That is advantageous because the proportions of each source in the signal are linked linearly to their contribution in the spectrum. Experimentally, it was found that the use of a “logarithmic normalization” for the evaluation of the proportions does not give good results.
That is illustrated by figures
The spectrum of
If only the proportions had been determined, as in certain methods of the prior art, it would have been impossible to know if 57Co was really present in the trace state, or it was in reality absent like, for example, 133Ba, 137Cs and 22Na. The use of distinct identification networks distinct from the proportion networks makes it possible to unambiguously determine that 57Co is really present, although in low proportion.
The method and the device of the invention offer several advantages over the prior art.
The use of a distinct neural network for each source that is desired to be identified makes it possible to not restrict the number of atomic species that can be identified. The addition of new sources, furthermore, is possible by adding new networks without changing the overall architecture and performing retraining only for the perceptron part. On the other hand, there is no need to perform the training again on the convolutional layers.
As a variant, it is possible to associate a neural network with a plurality of emitting species, for example a family of specifies exhibiting similar spectra. In this case, it is not the individual species which are identified, but groups or families of species.
The use of neural networks of convolutional type confers great robustness with respect to the calibration errors (temporal instability of the calibration law, variable instrumental responses from one spectrometric detector to another) and to changes of configuration (presence of absorbent or scattering materials between the source or sources and the spectrometric detector or detectors) by taking these problems into account in the database. Furthermore, it makes it possible to treat cases of spectra with low photon statistics with good performance levels (accuracy greater than 90% with spectra containing at least 1000 photons emanating from the mixture of sources). This method does not concentrate on particular peaks but on the general structure of the spectra (peaks, notably Compton structures).
The Bayesian approach makes it possible to quantify an error on the probability of presence of each source, which informs the user of the relevance of the results and therefore as to the degree of confidence that can be accorded to an automatic analysis.
Furthermore, no “expert” preprocessing is necessary, and the fine calibration is performed automatically. The recourse to qualified technicians is therefore greatly reduced.
As explained above, the separation of the steps of identification and of determination of the proportions makes it possible to reliably determine whether an atomic species is absent or indeed present but in low proportion.
The invention has been described with reference to a particular embodiment, but many variants can be envisaged.
For example, regarding the structure of the neural networks, the number of convolutional layers and of perceptron type, the activation functions, the data reduction method, the size of the convolution kernels etc. are given only by way of example. Moreover, it is not necessary for all the neural networks to be identical.
Other supervised learning techniques can be implemented. Moreover, the recourse to synthetic sources for the training is advantageous, but it is also possible to use real sources instead of or in addition.
Other data transformation operations prior to the application of the neural networks, and notably other normalization techniques, can be used.
The recourse to a random dropping out of the neurons is not essential, if information on the uncertainty of the measurements is not required. Moreover, when it is used, the random dropping out does not necessarily need to concern all of the intermediate layers.
Criteria other than a thresholding at 0.5 can be used to determine if an atomic species is considered to be present or absent, in particular if it is preferred to minimize the risk of false positives or, conversely, of false negatives.
Each neural network can have more than two outputs—for example one output indicative of the probability of presence of an emitting species, one output indicative of the probability of its absence and one output indicative of an indeterminate situation. Advantageously, these outputs are complementary (that is to say that their sum takes a set value, typically 1). The activation function of the outputs can be of Softmax type, but other possibilities can be envisaged by a person skilled in the art.
Data from several distinct spectrometric detectors can be combined.
Number | Date | Country | Kind |
---|---|---|---|
FR1905682 | May 2019 | FR | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2020/064790 | 5/28/2020 | WO |