The invention relates to a method and an apparatus for characterizing samples, and notably foods, medicines, and biological or environmental media, exploiting light scattering and fluorescence measurements and a statistical processing of said measurements.
The invention can be applied in particular, but not exclusively, to the agro-food or pharmaceutical industry. For example, it makes it possible to study the trend of the nutritional and/or toxicological properties of a food during its preparation or conservation, and to control the processes to which a food is subjected (cooking, roasting, etc.).
The invention can also be applied to the environmental industry, for example to the treatment of wastewaters, industrial muds, fermentation media, etc.
More generally, the invention can be applied to the determination of any indicator of quality of a sample, and/or of any parameter characterizing a method to which said sample has been subjected.
The method is based on chemometric methods, and in particular, in advantageous embodiments of the invention, on the multivariable or—preferably, multiway statistical analysis of spectroscopic data. Multiway analysis is the natural extension of multivariable analysis when the data are arranged in tables with three or more ways. It is based on the use of statistical models such as “PARAFAC” (“Parallel Factor”, i.e. parallel factor model) and NPLS (“N-ways Partial Least Squares regression”). Reference in this respect can be made to the reference work by R. Bro, “Multi-way Analysis in the Food Industry Models, Algorithms, and Applications”, PhD thesis, Amsterdam University, 1998.
The document WO 2011/080408 describes a method and an apparatus for the spectroscopic analysis of samples, in particular food samples, implementing a multiway processing of spectroscopic data. The method comprises the lighting of each sample to be analyzed by a plurality of excitation light radiations at respective wavelengths; the acquisition of a plurality of frontal fluorescence spectra from each sample, each corresponding to a respective excitation light radiation; a preprocessing of these fluorescence spectra, intended in particular to subtract from them a contribution due to Rayleigh scattering; the application to the preprocessed spectra of a multiway statistical model and the determination—for example by multilinear regression—of an indicator of quality of each sample and/or of a parameter characterizing a method to which each sample has been subjected.
The document WO 2011/158192 describes a method for characterizing one or more samples of an agro-food product which also uses the acquisition of a plurality of frontal fluorescence spectra from each sample and their processing by means of a multiway data analysis method. This method culminates in the representation of each sample by a point in a multidimensional space, which makes it possible to compute a distance relative to one or more reference samples. This distance makes it possible, for example, to quantify the naturality and/or the freshness of the sample. The articles:
describe methods for characterizing food samples, that make it possible in particular to study their cooking, based on the application of multiway analysis methods to reflectance spectra.
The implementation of these methods is relatively complex, and therefore costly, because it entails acquiring and processing a plurality of spectra, and therefore a significant volume of data.
The invention aims to overcome these drawbacks in the prior art.
One subject of the invention that makes it possible to achieve this aim is a method for characterizing at least one sample, comprising:
a) the lighting of said or each sample to be analyzed by N≧1 light radiations at respective lighting wavelengths;
b) the acquisition, for each said light radiation, of at least one fluorescence light intensity and of at least one elastic scattering light intensity emitted by said or by each sample;
c) for said or each sample, the determination of a vector indicator from said fluorescence and elastic scattering light intensities;
d) the determination of at least one parameter characterizing each sample, or a method to which said sample has been subjected, from the corresponding vector indicator.
By contrast to the abovementioned methods known from the prior art, and more particularly from the documents WO 2011/080408 and WO 2011/158192, a method according to the invention is characterized by the combined use of fluorescence and scattering data which makes it possible to improve the characterization of the sample and/or to reduce the number of lighting sources used, and therefore to simplify the instrumentation implemented. It should be stressed that, in the abovementioned documents WO 2011/080408 and WO 2011/158192, the intensity of the scattered light is seen only as a nuisance disturbing the acquisition of the fluorescence spectra, which are considered to be the sole bearers of usable information.
According to different particular embodiments of the method of the invention:
Another subject of the invention is an apparatus for characterizing at least one sample comprising:
Other features, details and advantages of the invention will become apparent on reading the description given with reference to the attached drawings which are given by way of example, in which:
The apparatus of
The radiations LE1-LE3 are directed—simultaneously or in turn—toward the sample S, which can be a solid, a powder, a liquid contained in a transparent container, etc. Following its lighting by each incident radiation LEi, the sample S emits an LDF radiation which essentially comprises two contributions: one, at the same wavelength as the lighting radiation, due to the elastic scattering; the other, polychromatic, due to the fluorescence. A diffraction grating RD breaks down the LDF radiation into its spectral components. The resultant light spectrum, SP, is acquired by a matrix sensor is DL, generating signals which, after conversion into digital format, are processed by the data processing means MTD.
In the embodiment of
Similarly, a dispersive element other than a diffraction grating—for example a prism—can be used to break down the LDF radiation into its spectral components; it is even possible to replace the dispersive element with a Fourier transformed spectrometer. According to another variant, it is possible to use a spot light detector, mobile or associated with a rotating grating or prism (apparatus of monochromator type).
The use of a diffraction grating offers an advantage, that of giving access to a number of orders of diffraction. The benefit is being able to access the scattering amplitude, even if the detector used is saturated by an excessively strong light intensity (typically scattering light intensity). The replica due to the second order of diffraction is in fact much weaker than that of the first order, and therefore makes it possible to access the amplitude of the first order spike (the ratio between the intensity diffracted in each order being constant, and depending only on the grating). It is thus possible to obtain, at the same time, a high fluorescence signal and the scattering amplitude.
Particularly when the “monochromatic” lighting sources exhibit relatively wide spectra (for example, they are light-emitting diodes), the spectrum acquired by the sensor DL comprises contributions due to the elastic scattering and to the fluorescence which partially overlap. In this case, a preprocessing can be provided in order to eliminate this contribution; the abovementioned document WO 2011/080408 describes a preprocessing method of this type, based on the prediction of the scattering region which overlaps the fluorescence via a generalized linear model (GLZ) with a log link function. This same document also describes other preprocessing operations (normalization, multiplicative correction of the dispersion, etc.) which can also be applied to the present invention.
As in the method of the abovementioned document WO 2011/080408, the application of a statistical model to the fluorescence light intensity values acquired by the sensors DLi provides a vector, called “scores” vector, which characterizes the sample. In accordance with one embodiment of the invention, a vector indicator, or explanatory vector, is constructed by concatenation of said scores vector and of one or more elastic scattering intensity values. In its turn, this vector indicator makes it possible to obtain a parameter—scalar or vector—characterizing the sample S or a method to which the latter has been subjected. Preferably, said or each scalar or vector parameter will be representative of a physical-chemical structure of a “matrix”—in solid, liquid or powder form—of said sample or of a transformation of such a structure. It may be, for example, the content in terms of a compound, or in terms of several compounds of one and the same family, or even a parameter quantifying the physical chemical modifications of said matrix induced by a method such as cooking. “Matrix” should be understood to be the main component of the sample, forming a relatively uniform mass, exhibiting a substantial continuity and potentially containing other minority components such as dispersed particles or droplets in suspension.
In certain cases, the fluorescence “scores” vector can be replaced, or complemented, by a vector made up of a certain number (generally between 1 and 6) of fluorescence intensity values at discrete wavelengths.
In certain cases, the vector indicator can also contain information on the form of one or more elastic scattering spectra. In effect, these spectra are not necessarily identical to the corresponding lighting spectra. Thus, for example, a vector indicator can be obtained by concatenating a fluorescence scores vector, one or more discrete elastic scattering intensity values and one or more scalar parameters representative of the form or of the width of one or more elastic scattering spectra.
Implementing the method first of all entails a calibration phase, involving a plurality of calibration samples Si. The duly acquired fluorescence spectra—if necessary, after “cleaning” the stray contributions due to the elastic scattering—are organized as a third order data tensor (“data cube”) whose three “pathways” are: the samples, the lighting radiations, wavelengths of the fluorescence spectra. If the lighting is monochromatic, the data are represented by a second order tensor.
The statistical model applied to the data can be of the “PARAFAC” type, which consists in breaking down a three-way tensor X into a sum of external products of three vectors (“triads”) ai, bi, ci, plus a residue E, also in the form of a “data cube”. The following can then be written:
in which: “i”, which runs from 1 to I>1, is the index of the samples; “j” which runs from 1 to J>1, is the index of the lighting wavelengths; “k”, which runs from 1 to K>1 is the index of the wavelengths of the fluorescence spectra corresponding to each lighting radiation; “f” is the index of the F PARAFAC breakdown factors. The number F of factors can be defined a priori, or using criteria known from the prior art.
The vector (with F components) ai.=(ai1 . . . aiF) is called the “scores” vector for the sample Si, whereas the vectors bj.=(bj1 . . . bjF) and ck.=(ck1 . . . ckF) are the lighting and emission “loadings” vectors, respectively, which define the statistical model. These vectors are determined, by known iterative methods (for example, alternate least squares), in such a way as to minimize the tensorial Frobenius norm of the residue E, i.e. the value of ∥X−Xmat∥FRO, where Xmat corresponds to the data cube of the PARAFAC model (see below) and ∥.∥FRO denotes the Frobenius norm (or any other suitable norm).
The PARAFAC model can be rewritten in matrix form as follows:
X
mat
=A(C|{circle around (×)} |B)T
in which:
This model makes it possible to obtain a scores vector for a new sample, after the calibration has been done. In effect, if Xnew is taken to be the vector with J*K components containing the fluorescence spectra acquired for said new sample; the vector (of dimensions F) of the scores for this new sample is given by
A
new=(|{circle around (×)} |C)+Xnew;
in which the exponent + indicates the generalized inverse of the tensorial product.
It is, however, important to note that the “PARAFAC” model is empirical; it is therefore valid only for samples similar to those which have been used in calibration.
Then, a vector indicator is constructed from the scores vectors and, for example, one or more elastic scattering intensity values, even also one or more fluorescence intensity values at discrete wavelengths. This will be able to be done simply by concatenation.
Then, a multilinear regression model can be constructed from the calibration data to allow for the prediction, as a function of said vector indicator, of a parameter (even several) characterizing the samples and/or a method to which they have been subjected. This parameter can, for example, be the content of a determined component, or a cooking time. Obviously, the value of the characterizing parameter must be known for the calibration samples.
Several variants of the characterization method described above can be envisaged without departing from the scope of the present invention. For example, the statistical model may not be of PARAFAC type but, for example, N-PLS type, or another known type (on this topic, see the abovementioned work by R. Bro). Furthermore, the transition from the vector indicators (which, it will be recalled, can be made up of the fluorescence scores vectors concatenated with parameters representative of the elastic scattering) to the parameters characterizing the samples can be done in ways other than by multilinear regression, for example by computing a distance—Euclidian or Mahalanobis—between the vector indicator of each sample considered and that of a reference sample. It is also possible to make use of supervised or unsupervised classification techniques, or of a supervised method known as “scoring”.
“Scoring” is a data ranking technique which makes it possible to evaluate, via a score, the probability that a sample or a group of samples resembles another sample or another group of samples. This probability is computed from the vector indicators defined above.
As a nonlimiting example, the following method can be used:
From the PARAFAC fluorescence scores and the scattering intensities, a matrix M(n×m) is constructed in which n is the number of samples, and m is the number of variables, that is to say of components of each vector indicator.
A vector indicator of distance y is then computed by using a linear regression model
y=b
1
a
1
+b
2
a
2
+ . . . +b
m
a
m
+e
The vector y is binary and takes the values: zero for the so-called reference samples, and one for the samples to be characterized. The vectors ai are the columns of the matrix M, the scalars bi are the coefficients of the linear model; e is the vector of the residues.
A Student test t is applied to the distances ŷ{circumflex over (yr)} predicted from the scores and scatterings of the reference sample and from the distances ŷ{circumflex over (ys)} predicted from the scores and scatterings of the sample to be characterized. The statistic t is computed by the following equation:
in which {circumflex over (
The “scoring” score is expressed as a function of the probability density:
in which Γ is the Euler gamma function and v the degree of freedom
The samples that exhibit a higher external probability, on the basis of the null hypothesis over the distributions (H0: {circumflex over (
The technical results of the invention will be illustrated above using two exemplary applications.
The first example relates to controlling the cooking of a food, and in particular of beef.
Measuring the scattering by analyzing the reflectance of the surface of a piece of meat during cooking makes it possible to plot a kinetic curve of the cooking level. The variations of the scattered light intensity over time notably reflect the formation of proteic aggregates deriving from the denaturing of the proteins with heat, a phenomenon which is reflected visually by the change of color of the medium. In the case of other foods, fish or egg white, a loss of transparency of the medium is, rather, noted. In the case of the heating or clotting of milk, no modification is perceptible to the naked eye, but a quantitative measurement shows that, even in this case, the scattering evolves in relation to the formation of aggregates of denatured serous proteins. The fluorescence is also affected by the chemical and physical-chemical transformations which occur during the cooking. It is therefore possible to define an optimal cooking level and its translation in terms of trend kinetic of the scattering or—better—of the scattering-fluorescence combination, measured in the form of distance (Euclidian or Mahalanobis) relative to the initial level. Another approach consists in constructing a regression over the cooking time and identifying the optimal time predicted by the combination of the fluorescence and scattering signals.
Table 1 indicates the correlation coefficients obtained between the different PARAFAC scores computed for samples of beef grill-cooked at 180° C. for 10 min, for the four identified factors (Fact1 to Fact4), as well as the maximum scattering and fluorescence intensities at different wavelengths (the term maximum intensity applies because the scattering and fluorescence spectra exhibit a finite width). More specifically:
The correlation table shows, on the one hand, that the fluorescence is significantly correlated with the scattering and that the fluorescence and light scattering, are correlated with the cooking time.
The second example relates to the trend of the acrylamide content of the chicory during its roasting.
Table II indicates the correlation coefficients obtained between the different PARAFAC scores for four factors (Fact1 to Fact4), as well as the maximum scattering and fluorescence intensities at different wavelengths. D1S1, D1S2 and D1S3 have been defined by precedence; D2S1 corresponds to the elastic scattering at 280 nm observed with the second order of diffraction of the grating RD.
Number | Date | Country | Kind |
---|---|---|---|
12 52569 | Mar 2012 | FR | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/IB2013/052207 | 3/20/2013 | WO | 00 |