The present disclosure is in the technical field of spectroscopy. More particularly, the disclosure relates to spectral variance compressive detection systems, devices, and processes.
Chemical analysis usually consists of two processes: calibration and prediction. Calibration is the process of defining a mathematical model to relate an instrumental response or responses to a chemical or physical property of a sample. An instrument may yield one, two or multiple responses which are termed as variables. One output variable is referred to as a univariate measurement whereas multiple output variables are referred to as a multivariate measurement. Prediction is the act of using a calibration model based on a known chemical or physical property of a sample and predicting the properties of future samples from the instrumental output response variables.
A specific example of multivariate calibration and prediction in analytical spectroscopy is employing measured optical phenomena like absorbance (UV-visible, near infrared or long wave infrared), fluorescence or Raman data at specific wavelengths to predict the concentration of a target analyte in a gas, liquid or solid. Analytical chemists strive to produce linear calibration models which possess the highest level of accuracy and precision to selectively relate an instrumental output to a property of a desired analyte species even in the presence of instrumental output interferences. These interferences may occur due to chemical or physical properties of the sample matrix or other species and ultimately affect the sensitivity of the instrumental calibration.
Calibration models capable of correlating a measured response with a chemical or physical attribute originate from the field of statistics and in chemical systems, chemometrics. Chemometrics encompasses the use of statistical information to analyze chemical data to transform measured values into information for making decisions. Hotelling published a paper in 1933 discussing the transformation of complex statistics into a set of simplified, orthogonal principal components describing the largest sources of variance in a data set. This is known today as Principal Component Analysis (PCA). PCA was further explored by Anderson in 1958, but it was not until computers were available to perform such rigorous calculations that multivariate statistics made a mainstream impact on calibrations. Prior to the 1970s, most of the chemometric implementations were done by hand which resulted in long analysis times and simplified expressions resulting in calculation variations from researcher to researcher.
Multivariate calibrations offer some distinct advantages in both analytical measurements as well as paradigm shifts in chemical analyses. Utilizing multiple variables in a calibration allows multiple components to be analyzed simultaneously. Highly correlated variables or neighboring wavelengths in spectroscopy offer increases in signal-to-noise ratios (SNR). Similar SNR enhancements may be obtained by averaging redundant measurements. Multiple calibration variables also increase the robustness of mathematical models by sampling a larger data region where interfering components may be readily observed.
In a simple system where the instrument response and analyte chemical or physical properties obey a linear relationship, Classical Least Squares (CLS) may be used to perform the univariate calibration. CLS assumes that a linear additive response exists among all of the chemical components in the sample system. Thus, for a spectrometer based instrument the response at a particular wavelength is a linear combination of the attributes of the chemical system under study. Pure spectra must also be measured to construct the calibration model in a CLS system although pure mixtures are also acceptable. Ideally all sources of measurement variance are explicitly accounted for in the model.
Using the Beer-Lambert law of absorption, an instrumental linear response of optical absorbance, A with analyte concentration, c may be expressed using the linear equation:
c 32 A(εd)−1 (Equation 1)
where ε is the molar absorptivity, and d is the pathlength of the sample. The product of ε and d may be replaced with the calibration sensitivity or regression, b. The pure spectra (or pure mixture spectra) of all of the analytes are collected at unit concentrations to calculate the sensitivity values exactly in a linear regression sense.
b*=Ac
−1
+e (Equation 2)
The b* represents the pseudo-inverse of b, and the e represents the model error according to the least squares fit. To perform a prediction of future sample component concentrations, ĉan absorbance measurement is collected and multiplied by b*.
ĉ=Ab* (Equation 3)
CLS is a well understood process with a statistically sound foundation, but it suffers most from the application in real-world systems where all sources of system variation cannot be accounted.
In a complex system where the number of analytes is unknown and thus implicitly accounting for all sources of variation, Inverse Least Squares (ILS) may be used to perform the multivariate calibration. ILS also assumes that a linear additive response exists among all of the chemical components in the sample system, but slight non-linear instrumental responses can be tolerated. Pure compound or mixture spectra are unnecessary for constructing the calibration model, and ILS offers data compression alternatives by transforming the instrumental variable space into PCs of variance. In real-world analyses, the various types of multivariate calibrations have been compared based on predictive performance, stability and the ability to deal with unmodeled interferences.
Stemming from the univariate example described above, the MLR model is constructed from two or more wavelengths that describe uncorrelated variance in the calibration set. By switching to matrix notation where matrices are boldface and the superscripts T and −1 correspond to the transpose and inverse respectively, the calculation of concentration may be expressed as:
ĉb=A
ĉbbT=AbT
ĉ(bbT)(bbT)−1=AbT(b
ĉ=Ab* (Equation 3)
The transpose steps are necessary because only square matrices may be inverted. When many variables or wavelengths are measured, the bbT matrix cannot be inverted and is singular.
MLR attempts to calibrate a spectroscopic system by using an optimal subset of wavelengths to describe all sources of variation. There must be at least the same number of measured wavelengths in the model as there are different sources of spectral variation, and the correlation among the wavelengths must be minimized to ensure a stable inversion of the bbT matrix. Various strategies have been developed in variable selection for an optimal MLR calibration. MLR can be used to design simple measurement systems based on filter photometers as opposed to expensive spectrometers, but it predominantly lacks in the multivariate advantages of signal averaging and error detection.
Linear multivariate models of complex data sets may also be developed through the transformation of the measured variables or spectral data into orthogonal basis vectors. These basis vectors, also known as principal components (PCs) model statistically significant variation in the data as well as measurement noise. Ultimately, the data dimensionality is reduced to a set of basis vectors that model only spectral and measurement variation spanning the space of the data matrix without prior knowledge of the chemical components. An example of PCA applied to NIR spectra is illustrated in
A popular method of calculating the PCs of a data matrix is through the Singular Value Decomposition (SVD) algorithm. A data matrix like absorbance measurements may be decomposed into three new matrices:
X=USVT (Equation 4)
where the columns of U contain the column-mode eigenvectors or PC scores of X, the diagonal of S contains the square root of the eigenvalues of XTX, and the rows of VT contain the row-mode eigenvectors or PC loadings of X. The first eigenvector of VT corresponds to the largest source of variation in the data set, while each additional eigenvector corresponds to a smaller source of variation in the data. The scores or projections of the original absorbance vectors in the PC space are computed by multiplying the U matrix by the S matrix.
Because the PCs are orthogonal, they may be used in a straight forward mathematical procedure to decompose a light sample into the component magnitudes which accurately describe the data in the original sample. Since the original light sample may also be considered a vector in the multi-dimensional wavelength space, the dot product of the original signal vector with a PC vector is the magnitude of the original signal in the direction of the normalized component vector. More specifically, it is the magnitude of the normalized PC present in the original signal. This is analogous to breaking a vector in a three dimensional Cartesian space into its X, Y and Z components. The dot product of the three-dimensional vector with each axis vector, assuming each axis vector has a magnitude of 1, gives the magnitude of the three dimensional vector in each of the three directions. The dot product of the original signal and some other vector that is not perpendicular to the other three dimensions provides redundant data, since this magnitude is already contributed by two or more of the orthogonal axes.
Because the PCs are orthogonal, or perpendicular, to each other, the dot, or direct product of any PC with any other PC is zero. Physically, this means that the components do not interfere with each other. If data is altered to change the magnitude of one component in the original light signal, the other components remain unchanged. In the analogous Cartesian example, reduction of the X component of the three-dimensional vector does not affect the magnitudes of the Y and Z components.
PCA provides the fewest orthogonal components that can accurately describe the data carried by the light samples. Thus, in a mathematical sense, the PCs are components of the original light that do not interfere with each other and that represent the most compact description of the entire data carried by the light. Physically, each PC is a light signal that forms a part of the original light signal. Each has a shape over some wavelength range within the original wavelength range. Summing the PCs produces the original signal, provided each component has the proper magnitude. An example of reconstructing a spectrum from a reduced set of PCs is illustrated in
The PCs comprise a compression of the data carried by the total light signal. In a physical sense, the shape and wavelength range of the PCs describe what data is in the total light signal while the magnitude of each component describes how much of that data is there. If several light samples contain the same types of data, but in differing amounts, then a single set of PCs may be used to exactly describe (except for noise) each light sample by applying appropriate magnitudes to the components.
Multivariate Optical Computing (MOC) combines the data collection and processing steps of a traditional multivariate chemical analysis in a single step. It offers an all-optical computing technology with little to no moving parts. MOC instrumentation is inexpensive to manufacture compared to scanning instrumentation in a compact, field-portable design. The speed benefit due to an optical regression can offer real-time measurements with relatively high SNR that realize the advantages of chemometrics in a simple instrument.
MOC may be separated into two categories defined by the method of applying a multivariate regression optically. The first focuses on the utilization of thin film interference filters called Multivariate Optical Elements (MOEs) to apply a dot product with an incident radiometric quantity yielding a single measured value related to a spectroscopically active chemical or physical property. An alternative optical regression method involves the modification of scanning or dispersive instrumentation with weighted integration intervals at each wavelength. This may be accomplished with an optical mask or by shuttering the detector or light source heterogeneously across the spectral range in intervals proportional to a calculated multivariate regression. Ultimately, an optical regression implements the complicated steps of a digital regression in a hardened apparatus where the chemometric advantages may be realized in a simple instrument that a non-expert can operate.
Interference filter pairs were introduced by Nelson et al. in 1998 as an optical regression technique. PCA was performed on Raman spectra from a polymer curing experiment to construct a multivariate regression. The positive portion of the regression vector was used as a template for designing an interference filter to express a similar dot product. The absolute value of the negative portion of the regression vector was also used as a template for an interference filter; an operation amplifier inverted the resulting signal. These filters were spatially homogeneous, and a photodiode sensed all wavelengths simultaneously. Spatial Light Modulators (SLM) and Digital Micro-mirror Devices (DMD) have also been utilized to apply spectroscopic regressions after the incident light has passed through a dispersive element. Such devices have allowed the real-time modification of the optical regression.
Compressive sensing and detection is the process in which a fully resolved waveform or image is reconstructed from a smaller set of sparse measurements. A sparse sample implies a waveform or image data set with coefficients close to or equal to zero. Compressive sensing utilizes the redundancy in information across the sampled signal similar to lossy compression algorithms utilized for digital data storage. A fully expanded data set may be created through the solution of an undetermined linear system, an equation where the compressive measurements collected are smaller than the size of the original waveform or image. To date, sensors employing MOEs have yielded a direct analytical concentration prediction or classification as opposed to reconstructing the original waveform or hyperspectral image.
A system, device, and process that show one or more improvements in comparison to the prior art would be desirable in the art.
In an embodiment, an optical analysis system includes one or more optical filter mechanisms disposed to receive light from a light source and a detector mechanism in operative communication with the one or more optical filter mechanisms to measure properties of filtered light, filtered by the one or more optical filter mechanisms from the received light. The one or more optical filter mechanisms are configured so that the magnitude of the properties measured by the detector mechanism is proportional to information carried by the light filtered.
In another embodiment, an optical device includes an optical filter mechanism capable of receiving light from a light source and capable of operation with a detector mechanism to measure properties of filtered light, filtered by the optical filter mechanism from the received light. The optical filter mechanism is configured so that the magnitude of the properties measured by the detector mechanism is proportional to information carried by the filtered light.
In another embodiment, an optical analysis process includes providing one or more optical filter mechanisms and a detector mechanism in operative communication with the one or more optical filter mechanisms, receiving light from a light source with the one or more optical filter mechanisms, filtering the received light to generate filtered light, and measuring properties of the filtered light by the optical filter mechanisms. The optical filter mechanisms are configured so that the magnitude of the properties measured by the detector mechanism is proportional to information carried by the filtered light.
Other features and advantages of the present invention will be apparent from the following more detailed description, taken in conjunction with the accompanying drawings which illustrate, by way of example, the principles of the invention.
Wherever possible, the same reference numbers will be used throughout the drawings to represent the same parts.
The present disclosure includes a system, device, and process that employ multivariate optical elements (MOEs) for use as spectral variance or PC loading vectors. These independent MOE amplitude measurements are utilized to reconstruct a fully resolved spectroscopic measurement of a sample. A fully resolved optical spectrum is calculated by linearly combining the known optical filter spectroscopic pattern vectors with the corresponding spectral variance or PC amplitude measurements.
Referring now to various embodiments of the disclosure in more detail, in
In further detail, in
Referring now to
Referring now to
Referring now to
Referring now to
In further detail, in
Referring now to
In further detail, in
Referring now to
In further detail, in
Referring now to
In further detail, in
Referring now to
Referring now to
Among other things, the embodiments of the present disclosure have the ability to compute a fully resolved optical spectrum or hyperspectral image with M discrete wavelength variables from a set of N optical filter measurements where N is smaller than M.
While the invention has been described with reference to one or more embodiment, it will be understood by those skilled in the art that various changes may be made and equivalents may be substituted for elements thereof without departing from the scope of the invention. In addition, many modifications may be made to adapt a particular situation or material to the teachings of the invention without departing from the essential scope thereof. Therefore, it is intended that the invention not be limited to the particular embodiment disclosed as the best mode contemplated for carrying out this invention, but that the invention will include all embodiments falling within the scope of the appended claims.
The present application is a Non-provisional Patent Application claiming priority and benefit to U.S. Provisional Patent Application No. 61/752,728, filed Jan. 15, 2013, the entirety of which is hereby incorporated by reference.
Number | Date | Country | |
---|---|---|---|
61752728 | Jan 2013 | US |