Embodiments of this invention relate to a method of data acquisition from a target biological substance by optical communication between the target substance and the III-V/IV semiconductor photonic sensor and the method of data processing to retrieve an absolute concentration level of a target molecule within the substance. This is applicable but not limited to transdermal sensing and monitoring of blood glucose, urea, lactate, creatinine, ethanol and other constituent molecules by means of tunable wavelength absorption spectroscopic sensing. The described technology is compatible with consumer electronics technology platforms in terms of manufacturing technology and size, weight, power, and cost requirements and offers a pivotal advantage in terms of usefulness for wearable healthcare device technology. This technology may be utilized by people impaired by chronic diseases such as diabetes where currently no non-invasive sensing solution exist. Moreover, a novel approach is provided for continuously monitoring vital physiological markers non-invasively, where currently only a point-of-care solution exists.
Many techniques for spectroscopic, non-invasive measurement of analytes, such as measurement of blood glucose using near-infrared spectroscopy, employ a broadband light source, such as halogen lamp. The electromagnetic radiation (EMR) emitted from such a source, and that received from a medium to be analyzed (e.g., diffusively reflected by or transmitted through the medium), have components at a number of wavelengths. The components from the EMR received from the medium are typically separated using a grating technique to obtain a spectrum. A spectrometer having a broadband source and a grating mechanism is typically a large, complex structure, that can be cumbersome or impractical for in-field or at-home use.
Photonic systems-on-a-chip (P-SoC) offer ultimate size reduction potential which is necessary for large volume applications such as consumer electronics markets, automotive, home-use medical devices, etc. The P-SoC concept combines all or most of the functions of a general photonic system and enables those functions to be realized within a single chip assembly. Typically, this can be realized as a monolithic photonic integrated circuit (PIC) based on III-V semiconductor or a combination of III-V semiconductor and group-IV semiconductor. The first approach allows all active and passive optical components be realized within the same wafer, allowing a completely monolithic device. This is ideal as all light sources and detectors are inherently aligned to the waveguides and do not require any assembly steps. However, inherent III-V material properties, such as higher absorption and lower light confinement in the waveguides and thus larger waveguides bending radii to reduce bend loss, together with complex technology requiring multiple epitaxial growths, limit the scaling potential to very large markets such as consumer electronics, as very low cost per chip is required by the market. As a trade-of, hybrid III-V/IV P-SoC offer a solution where light generation function is realized within the III-V semiconductor chip and light routing, filtering and other functions are realized within a group-IV semiconductor chip. Light detection depending on the wavelength of EMR, can be realized either within group-III-V or group-IV semiconductor chip. Hybrid approach proves to be beneficial for large volumes markets as group-IV semiconductor manufacturing technologies such as CMOS—for example offer unmatched scaling potential. Techniques for analyte measurement using P-SoC are generally not known, however.
Hybrid integration of III-V semiconductor chips with group-IV semiconductor photonic integrated circuit offers the potential to combine the best of two worlds, with light detection and light generation function being realized within direct bandgap III-V semiconductor for ultimate efficiency, performance, cost, and yield, whereas passive functions such as light filtering, routing, locking, feedback control are realized within a photonic integrated circuit (PIC) within a group-IV semiconductor, for example silicon-on-insulator, or silicon-on-silicon nitride, or silicon on silicon nitride or silicon on insulator. In various embodiments, a swept wavelength laser based photonic system on a chip with integrated emission wavelength tuning (sweeping), and wavelength shift tracking and absolute wavelength calibration functions, are deployed for remote acquisition of relevant data from a biological object such as a living object. In different embodiments, acquired data is then processed to provide biomolecule specific absolute value such as concentration level and/or concentration level as a function of time (trend). The combination of the hybrid III-V/IV semiconductor platform and techniques for processing the acquired data on-chip offers new opportunities for wearable device platforms such as for example, smart watches for monitoring important physiological parameters in real time.
Techniques for acquiring and processing data in combination with a photonic sensor system-on-a-chip to provide real-time calibrated concentration levels of an analyte (e.g., a constituent molecule within a biological substance) are described. The biological substance may be blood, interstitial fluid, tissue or a combination of substances. The photonic sensor system-on-a-chip (SoC) assembly includes a hybrid III-V and group-IV semiconductor assembly, with the III-V semiconductor elements providing optical gain and detection functions, and optical feedback, light routing, filtering, locking and other passive functions being provided within the group-IV semiconductor photonic integrated circuit.
In use, the assembly is in optical communication with the biological substance, and the sensor may be remote from the substance (in vivo scenario) or embedded within the substance (implanted). The sensor interacts with the target substance via optical communication, the light from the sensor interacts with the substance, and the light signal is modulated due to light-molecule interaction, where the interaction is molecule specific. After the interaction, the signal is collected by the sensor chip by means of diffuse reflectance or transmittance.
In practical scenarios, where such photonic sensor performs a direct transdermal measurement with a living object or is implanted within a living object, the raw signal collected by the sensor is very complex due to the complex nature of a typical biological substance, such as whole blood and or tissue. Various data analysis techniques described herein in combination with the hardware (e.g., the Soc) can be used to retrieve calibrated concentration level values from most complex biological substances. This is of particular importance for transdermal/implanted monitoring of vital metabolites such as glucose, lactate, urea, ethanol, serum albumin, creatinine, and others both for subjects impaired with chronic diseases such as diabetes, kidney or liver malfunctions as well as acute clinical cases such as sepsis or fitness level or diet monitoring for athletes and general public.
Accordingly, in one aspect a method is provided for calibrating a sensor for measurement of the concentration of an analyte in a medium. The method includes collecting, using a hybrid group III-V/group IV semiconductor photonics system-on-a-chip (SoC), a number of raw spectra from an object (e.g., the medium or sample) having the analyte. The method also includes partitioning the raw spectra according to respective spectral shapes thereof into a set of clusters, where each cluster includes a group of raw spectra. The method further includes, within each cluster: (i) applying a respective local scattering correction (LSC) to each raw spectrum belonging to the cluster to obtain a group of locally corrected spectra; and (ii) deriving a cluster-specific optimized set of pre-processing parameters and a cluster-specific calibration vector. The optimized set of pre-processing parameters and the calibration vector are derived using the locally corrected spectra and gold standard analyte concentration values corresponding to the group of raw spectra belonging to the cluster,
In some embodiments, deriving the cluster-specific optimized set of pre-processing parameters and the cluster-specific calibration vector for a particular cluster includes: (i) evaluating each one of a number of candidate sets of pre-processing parameters, where the evaluation of a particular candidate set includes: (A) pre-processing each locally corrected spectrum belonging to the particular cluster using the particular candidate set; (B) deriving a candidate calibration vector by applying multivariate regression calibration to the pre-processed locally corrected spectra and using the gold standard analyte concentration values corresponding to the group of raw spectra belonging to the particular cluster; and (C) computing a corresponding accuracy measure for the candidate calibration vector via cross-validation. Thereafter, the candidate set and the corresponding candidate calibration vector associated with a maximum accuracy measure are designated as the cluster-specific optimized set of pre-processing parameters and cluster-specific calibration vector, respectively.
The cluster-specific optimized set of pre-processing parameters may include a set of data processing parameters such as a) order of filtering, b) sort or type of filter used for smoothing, c) order of derivatives used for baseline removal, etc. The optimized set of parameters may be stored in the memory and may be used subsequently to preprocess data in the sensing mode.
The object may include tissue, and the analyte may include blood glucose, blood lactate, ethanol, creatinine, keratin, collagen, urea, serum albumin globulin, troponin, acetone, acetate, hydroxybutyrate, cholesterol, albumin, globulin, ketones-acetone, or water among others.
In some embodiments, the step of partitioning the raw spectra according to respective spectral shapes thereof includes applying a global scattering correction (GSC) to each of the raw spectra to obtain several globally corrected spectra. The partitioning step may also include clustering the several globally corrected spectra according to: (A) a specified number of clusters, (B) a specified maximum distance of a globally corrected spectrum from a centroid of a cluster, or (C) both the specified number of clusters and the specified maximum distance of a globally corrected spectrum from a centroid of a cluster. The partitioning step may further include, within each cluster, designating to that cluster a respective raw spectrum corresponding to a globally corrected spectrum belonging to the cluster. The clustering may include k-means clustering, affinity propagation, or agglomerative clustering.
In some embodiments, the method further includes storing in the SoC a GSC reference spectrum generated as part of the global scattering correction. The global scattering correction may be implemented as global multiplicative scattering correction, global standard normal variate (SNV) correction, global mean centering and normalization correction, Kubelka-Munk (K-M) correction, Saunderson correction, or a combination thereof. The local and/or global scattering correction may incorporate particle-size difference correction and/or pathlength-difference correction and may utilize K-M correction, Saunderson correction, multiplicative scattering correction, or a combination thereof. In some embodiments, the method includes storing in the SoC, for each cluster: (i) a corresponding LSC reference spectrum, and/or (ii) a corresponding calibration vector, (iii) cluster centroids, and/or (iv) the optimized set of pre-processing parameters for each cluster. The local scattering correction may also be implemented as local multiplicative scattering correction, or local standard normal variate (SNV) correction, local mean centering and normalization correction, K-M correction, Saunderson correction, or a combination of the aforementioned correction techniques, to achieve the linearization effect. Global and local scattering correction, when chosen appropriately, allow to account for particle-size difference effect on light scattering as well as account for optical path difference correction in tissue, e.g., to linearize the raw spectra, so that both linear Beer-Lambert absorption law as well as linear regression, including multivariate partial least square, techniques are applicable.
In some embodiments, determining the respective spectral shapes of the several raw spectra includes pre-processing the raw spectra by applying thereto a linear transformation and a baseline correction based on a reference spectrum of a selected analyte. The pre-processing may include Kubelka-Munk correction, Saunderson correction, multiplicative scattering correction, or a combination of any two or all three correction techniques.
In another aspect, a method is provided for measuring concentration of an analyte, where the method includes obtaining, using a hybrid group III-V/group IV semiconductor photonics system-on-a-chip (SoC), a raw spectrum from an object (e.g., a medium or a sample) having the analyte, and identifying from a number of clusters of spectra a cluster to which the raw spectrum belongs, where the cluster is identified based on the spectral shape of the raw spectrum. The method also includes applying a local scattering correction (LSC) to the raw spectrum to obtain a locally corrected spectrum, pre-processing the locally corrected spectrum using a cluster-specific optimized set of pre-processing parameters, and multiplying the pre-processed locally corrected spectrum with a cluster-specific calibration vector to obtain a corresponding calibrated concentration value for the analyte.
In some embodiments, obtaining the raw spectrum includes directing from the SoC to the object electromagnetic radiation (EMR) tunable at several different wavelengths, measuring using the SoC intensities of EMR received from the object at each of the different wavelengths, and converting the intensities into absorbance values, so that the raw spectrum includes an absorbance spectrum. The several different wavelengths may be selected from a range 1000-3500 nm or a range 1900-2500 nm.
In some embodiments, the clusters of spectra correspond to spectra collected previously using the SoC, and each of the clusters may be represented via a respective LSC reference, a respective cluster centroid, and/or a respective calibration vector, where the respective LSC reference, the respective cluster centroid, and the respective calibration vector for each cluster may be stored on the SoC. Identifying from the several clusters of spectra the cluster to which the raw spectrum belongs may include deriving a globally corrected spectrum using a global scattering correction (GSC) reference. Identification of the cluster to which the raw spectrum belongs may also include, within each of the several clusters, comparing the globally corrected spectrum with a respective LSC reference to obtain a distance corresponding to that cluster, and selecting a cluster for which the corresponding distance is minimum.
The global scattering correction may be implemented as global multiplicative scattering correction, global standard normal variate (SNV) correction, global mean centering and normalization correction, K-M correction, Saunderson correction, or a combination thereof. The local and/or global scattering correction may incorporate particle-size difference correction and/or pathlength-difference correction. The local scattering correction may be implemented as local multiplicative scattering correction, or local standard normal variate (SNV) correction, or local mean centering and normalization correction, K-M correction, Saunderson correction, or a combination thereof. LSC and GSC involve performing a linearizing transformation on the raw spectra to account for tissue/object scattering and absorption, to facilitate further data processing based on linear absorption techniques such as those based on the Beer-Lambert law, where the spectrum is decomposed into individual components and/or is processed further using PLS linear regression or a similar technique.
In some embodiments, determining the spectral shape of the raw spectrum includes pre-processing the raw spectrum by applying thereto a linear transformation and a baseline correction based on a reference spectrum of a selected analyte. The pre-processing may include Kubelka-Munk correction, Saunderson correction, multiplicative scattering correction, or a combination of any two or all three correction techniques.
In another aspect, a system for measuring concentration of an analyte includes a hybrid group III-V/group IV semiconductor photonics system-on-a-chip (SoC) for obtaining a raw spectrum from an object (e.g., a medium or a sample) having the analyte, and a processing unit that includes a processor and memory and that is configured to perform certain operations so as to measure the analyte concentration, store information, etc. Specifically, the processing unit is configured to obtain, using the hybrid group III-V/group IV semiconductor photonics system-on-a-chip (SoC), a raw spectrum from an object having the analyte, and to identify from a number of clusters of spectra a cluster to which the raw spectrum belongs, based on the spectral shape of the raw spectrum. The processing unit is also configured to apply a cluster-specific local scattering correction (LSC) to the raw spectrum, to obtain a locally corrected spectrum. The processing unit is further configured to pre-process the locally corrected spectrum using a cluster-specific optimized set of pre-processing parameters, and to multiply the pre-processed locally corrected spectrum with a cluster-specific calibration vector, to obtain a calibrated concentration value for the analyte.
In some embodiments, to obtain the raw spectrum, the SoC is configured to direct to the object electromagnetic radiation (EMR) that is tunable at several wavelengths, and to measure intensities of the EMR received from the object at each of the wavelengths. The processing unit is programmed to convert the intensities into absorbance values, so that the raw spectrum includes or is represented as an absorbance spectrum. The SoC may be configured to emit EMR at wavelengths in the range 1900-2500 nm or in the range 1000-3500 nm.
The several clusters of spectra may correspond to spectra collected previously using the SoC. Each of the clusters may represented via a respective LSC reference and a respective calibration vector. The SoC may include memory for storing, for each cluster, the respective LSC reference and the respective calibration vector, as well as a global scattering correction reference (also called a global scattering correction vector). The memory of the SoC may also store, for each cluster, the corresponding optimized set of pre-processing parameters.
In some embodiments, to identify among the several clusters of spectra the cluster to which the raw spectrum belongs, the processor is programmed to derive a globally corrected spectrum using a global scattering correction (GSC) reference stored in the memory. The processor may also be programmed to, within each cluster (i) compare the globally corrected spectrum with a respective LSC reference to obtain a distance corresponding to that cluster, and (ii) select a cluster for which the corresponding distance is minimum. The global scattering correction may include global multiplicative scattering correction, or global standard normal variate (SNV) correction, or global mean centering and normalization correction. Similarly, the local scattering correction may include local multiplicative scattering correction, or local standard normal variate (SNV) correction, or local mean centering and normalization correction. The local and/or global scattering correction may incorporate linearizing transformation for particle-size difference correction and/or pathlength-difference correction.
In some embodiments, the SoC includes a wavelength shift tracker to track a shift in wavelength of radiation emitted by the SoC, and/or a wavelength tracker to track absolute wavelength of the radiation emitted by the SoC, and/or a temperature sensor to measure temperature of the chip, and/or an SoC output power monitor to monitor or measure the intensity of the EMR emitted by the SoC during a wavelength sweep, so as to obtain a power curve.
In some embodiments, to determine the respective spectral shapes of the several raw spectra, the processor is configured to pre-processing the raw spectra by applying thereto a linear transformation and a baseline correction based on a reference spectrum of a selected analyte. In order to perform the pre-processing, the processor may be configured to apply Kubelka-Munk correction, Saunderson correction, multiplicative scattering correction, or a combination of any two or all three correction techniques.
Optical remote sensing is a developed technique for a broad range of applications. Sensing can be performed as a form of ranging—i.e., measuring distance by means of time-of-flight or frequency modulated continuous wave (FMCW) technique, or sensing can be performed to remotely detect, identify, and quantify the presence or absence of one or more molecules within an object by spectroscopic sensing.
The term spectroscopic sensing, as used herein, refers to deployment of a hybrid III-V/IV semiconductor photonic system-on-a-chip (P-SoC), which emits wavelength tunable laser radiation and is in communication with a remote target object. The wavelength change and absolute value are monitored and accounted within every sweep, such that the SoC is autocalibrated in terms of absolute wavelength and wavelength shift and power spectrum.
The light impinges the object and penetrates to a certain depth, defined by the optical length, which depends on the individual specifity of the object such as scattering matrix, content, etc. For example, using tunable laser radiation in the 1900-2500 nm spectral region to perform a transdermal sensing experiment with a living object, the light penetrates up to about a 1 mm below the skin surface, where it is scattered and is partially absorbed by the tissue, blood, and interstitial fluid. Such absorption is molecule-specific and each constituent molecule modifies the light spectrum with a unique spectral absorption signature. After interaction with the object, the transmitted, scattered or reflected light is collected and detected with a photodetector.
A schematic block diagram describing an embodiment of the invention is shown in
In the illustrated embodiment, the hybrid III-V/IV semiconductor chip 1 includes a hybrid III-V/IV external cavity laser 100, which emits swept-wavelength laser radiation via an optical path 10. A portion of the beam is split via path 11, and fed into the wavelength shift tracker 120 via path 11, absolute wavelength reference 130 via optical path 14, laser power curve monitoring block 140 via optical path 17 and output section via optical path 19. Chip 1 may also include a temperature sensor 110, for sensing the temperature of the chip, which in turn can be used for absolute wavelength reference calibration.
Wavelength shift tracker 120 can be any type of non-balanced interferometer such as Mach-Zender, Michelson, Fabry-Perot, etc. A non-balanced interferometer provides a beat signal at the output of the 120 via the optical path 12, and the photodetector block 121 register an oscillating signal, where the oscillation period depends on the optical path difference within the interferometer and wavelength. The optical path difference is defined by the design and is a known parameter. The wavelength shift value can thus be extracted if the absolute value of the wavelength at any given moment is known. This is provided by the absolute wavelength reference block 130, coupled to monitoring photodetector 131 via optical path 15. The absolute wavelength reference can be a distributed Bragg grating (DBR), micro-ring resonator (MRR), distributed feedback grating (DFB) or any other optical cavity structure with unambiguous characteristic transmission or reflectance feature within the spectral region covered by the hybrid laser 100 sweep. In such a way, the photodetector blocks 121 and 131 cooperatively provide information about the absolute wavelength value and wavelength shift value at any given moment of time within the sweep.
Tracking wavelength shift and absolute wavelength value is often necessary in order to decouple the system effects from the object related effects. For example, the emission wavelength might change in a non-linear manner on the system side, and thus without precise knowledge of absolute wavelength shift and value information, it can be difficult to perform signal conversion from time-domain to wavelength (or frequency domain). Another aspect is that the collected spectra will change due to changes on the object side—such as water displacement due to temperature, or changes in other strong baseline contributors. Without knowing the system output at all times, it is impossible to decouple whether the collected spectrum from the object is shifted due to changes at the output of the system or influenced by the changes within the object. Therefore, wavelength shift and absolute wavelength information tracking within every sweep allows one to decouple the system specific modulation on the collected spectra from the object specific modulation, the latter being the useful signal.
In practical cases, the target molecules such as glucose, lactate, ethanol etc. have concentrations which are very small compared to the main baseline contributors, which for the case of transdermal sensing are main proteins (collagen, albumin, keratin) and water. These main contributors provide signal which is 10 000 or more times stronger compared to the target molecules, and thus a small change in water displacement due to temperature effect can lead to the baseline change that if unnoticed can smear out any useful signal that can be attributed to glucose. Therefore, the ability to track the wavelength shift and absolute value within every sweep allows access to tracking the baseline change within every sweep.
The wavelength shift may be monitored as the beat signal during the sweep, whereas the absolute value is measured once per sweep, and the information from both the wavelength shift and absolute wavelength is used to calibrate the recorded information immediately afterwards the sweep is complete. The accuracy for the determination of the wavelength shift depends on the system design such the optical path difference within the wavelength shift tracker which in turn provide a beat signal. In a practical case scenario, this depend on the finesse of the absorption feature of the target molecular specie within the object. In case where the object is a biological substance and the molecules represent liquid phase, which are characterized by very broad spectroscopic signatures, the wavelength shift tracker can have the accuracy of 0.1 nm to few nm., 3-5 nm being a typical value.
In case of gas sensing, where the absorption line width of interest can be in the range of 100 MHz, the, wavelength shift tracking needs to be designed to possess a better resolution and the absolute wavelength reference needs to be designed to provide the absolute wavelength with a high enough resolution. In practical cases, this can be achieved with a very good accuracy. For instance, typical group-IV semiconductor fabrication technologies rely on node sizes as low as 160 nm or even down to 7 nm, which is three orders of magnitude compared to a typical emission wavelength. The time duration for one sweep is defined by the system architecture and last from minutes, when the tuning mechanism is performed by a mechanical motion of the tuning element, to few micro seconds, if the tuning is electronic. In a practical case for a hybrid III-V/IV sensor chip, the sweep rate can be from few tens of Hz up to MHz range, depending on actual practical system design and the application requirements.
Depending on the sensor design, and the requirement for the spectral bandwidth coverage, a single sweep can contain from several 10s to several 100s of discrete wavelengths. A typical practical case for transdermal glucose sensing requires around 100 or more discrete wavelengths to perform accurate prediction. Based on the existing state-of-the art widely tunable (swept-wavelength) laser concepts, the sweep can be almost continuous when the Vernier-filter is operated in combination with phase control. In some embodiments, the absolute value of the wavelength of emission is tuned within a specified range, e.g., 1000 to 3000 nm, 1900 nm to 2500 nm, etc. Thus, the tuned value of the wavelength of emission at a particular time may be 1898 nm, 1905 nm, etc. The corresponding wavelength shift can be 1 nm, 2 nm, 10 nm, etc.
The EMR received from the medium of interest is converted from the optical domain into electrical signals within the photodetectors 121 and 131, and the electrical signal from the photodetector is routed via electrical paths 13 and 16 to the electrical path 30 which connects to the drive and control electronics block 2, and analog-digital-converter (ADC) and amplifier block 210 therein. Here, the analog signal from the photonic chip is amplified and digitized. Digitized signal is fed into CPU 220, which performs signal filtering, averaging and other processing The CPU 220 contains a memory block with a calibration model. This calibration model is applied to the collected data to retrieve a calibrated concentration level value, which is then fed to an output port, e.g., the display 240 via electrical route 39. Another function of the CPU 220 is to provide control signals to the driver and digital-to-analog converter (DAC) block 230 via path 38, which in turn provides control and drive signals to the SoC via path 40. The entire sensor system is powered by the power supply 200 via electrical bus 31, 32, 33, 34, 35, 36.
A simplified version of the sensor system of
When deployed in the field, the photonic sensor on a chip 1 sends a wavelength tunable signal to a remote object 3 via optical path 20. The intensity I of the signal can be represented as an arbitrary function of frequency ω (or wavelength) and time t:
I=ƒ(ω,t) (1)
The light interacts with the object 3, and undergoes numerous scattering and absorption events within the object. A portion of scattered and diffusely reflected light is collected with signal photodetector 150 via the optical path 21. This light signal intensity can be represented by a frequency and time function I′:
I′=ƒ′(ω,t) (2)
This signal is modulated due to the interaction with the object and carries object-specific information such as concentration level of constituent elements. The latter can be evaluated as absorbance A, which can be represented as a linear superposition of individual absorbances Ai:
Here, ε(ω)i is frequency dependent individual molar absorptivity of a constituent i, ci—is individual molar concentration of the constituent i and l—is the effective optical length within the object.
In a practical case, where the object is a living body, the individual absorbance contributions can be represented as contributions by different constituent elements such as for example: 1—keratin, 2—glucose, 3—lactate, 4—urea, 5—collagen, etc. This provides a path of elemental decomposition of complex matrix and thus offers a possibility for sensing. The procedure for collecting and processing data and deriving calibrated concentration values is shown in the form of a block diagram in
The basic operation method for performing sensing includes first using the calibration algorithm in combination with the hardware to create a calibration model and store it in the memory of the CPU. This model can be considered universal and be deployed with every sensor in the field without needing to modify it during use. The next step is to then use the sensing algorithm in accordance with
According to an embodiment of the invention, when deployed in the sensing configuration, the photonic system on a chip provides several output channels that contain information about the state of the photonic chip, such as wavelength shift value via photodetector 121, absolute wavelength reference value via photodetector 131, laser intensity curve via laser power curve monitoring block 140, and/or the reflected signal containing object-specific information via signal photodetector 150. These electrical signals are routed to the control and signal processing electronics block 2. Here, the signals are fed into the analog-digital-converter and amplifier block 210.
The algorithm for processing the acquired analog signals received from the photonic SoC 1 starts by first amplifying and digitizing the received signal in the ADC and amplifier block 210. At this phase the signals are still processed as time domain signals. These amplified and digitized signals are then fed to the central processing unit (CPU) 220, where the object-specific signal 22 is processed and converted from time-domain into the frequency-domain using the information of wavelength shift received via electrical path 13, and absolute wavelength calibration received via electrical path 16, and is normalized with regard to the laser power curve received via electrical path 18. This procedure allows first having the signals in the frequency domain and also addresses the system-related nonlinearities, to further process the signal that primarily carries object-specific data, indicated as step 2210 in
Multiple spectra are collected, averaged, and filtered to reduce noise. For example, in
Referring to
As illustrated, the global MSC corrected data is only used for assigning raw spectra to each cluster. Thus, the assigned cluster contains raw or unprocessed data. Clustering can be performed in a number of ways. Two possible paths are shown in
Once the clustering is complete, an individual calibration model within each cluster is created at step 2280. An individual calibration model assigns a calibrated concentration level value to every spectrum within each cluster as measured by the gold standard as indicated. This set of calibration models is then stored in the CPU memory next to the MSC reference vector in step 2300.
The algorithm for constructing an individual calibration model 2280 is depicted in
The local reference from 2281 is then used to construct a partial-least-square (PLS) model within each cluster and obtain optimal model parameters such as noise filtering parameters, derivative order, number of PLS latent vectors using a cross-validation method within step 2282. The task yields an optimal set of data preprocessing parameters 2283 that are then applied to every cluster containing raw spectra to construct individual calibration model 2284. In other words, within each cluster, the raw spectra are modified using the local scattering correction reference. This ensures that all data is treated in the same manner, with the same set of parameters. A calibration model then assigns a calibrated concentration level(s) of the analyte(s) of interest that is/are measured by a selected reference technique (also called gold standard) to each locally corrected spectrum. The calibration model maps the absorbance represented by a spectrum at a particular wavelength to an analyte concentration level. Referring to
c=w1A1+w2A2+ . . . +wnAn, where n is the number of wavelengths in the spectrum. In some cases, when the sample is associated with a relatively simple scattering matrix, and when the sample includes fewer constituents, a reasonable concentration prediction can be obtained simply by preprocessing the spectral data obtained from the sample to correct for the nonlinear effects of scattering using Kubelka-Munk correction, MSC, Saunderson correction, or a combination thereof, and then by removing the baseline to obtain the spectrum of a constituent of interest. For greater accuracy and, especially for more complex samples such as biological tissue, scattering correction (or linearizing transformation) may be used in combination with multivariate linear regression such as PLS or similar.
In general during calibration, EMR is directed to a sample (also called medium), where the EMR is swept through a range of wavelengths. In response, EMR is received from the sample, where the received EMR is diffusively reflected by or is transmitted through the sample. The received EMR, having components at different wavelengths, is converted into a raw absorbance spectrum (also called a raw spectrum). This process may be repeated several times to obtain a number of raw spectra, which are then averaged to obtain an averaged raw spectrum. In the discussion below, we omit the term “averaged” for simplicity. These raw spectra may be denoted Xiraw, where the index i denotes respective, averaged raw samples and can range from 1 to M, where M can be any number such as 50; 100; 2000; 10,000, or more. The above-described process is repeated at different times, where the analyte concentration in the sample may be different at different times, using different regions of the samples or different samples, where the analyte concentration may be different in different regions of the same sample or in different samples.
Scattering correction (MSC, Kubelka-Munk correction, Saunderson correction, etc.) is then applied to the raw spectra Xiraw, to obtain a global reference denoted XrefG and, globally corrected spectra XiGC. Global reference XrefG is stored in the memory. Clustering is then performed using the globally corrected spectra XiGC to identify N clusters. The number N (e.g., 4, 5, 6, 10, etc.) may either be specified for the clustering operation or, alternatively, the clustering itself may determine the best N. For each XiGC, a corresponding cluster Ck, k∈[1,N], is identified and, thereafter, the corresponding raw spectrum Xiraw is designated to the same cluster. After clustering, the optimal number of clusters, cluster centroids and maximum allowable distance to cluster centroids are stored into the memory to be used for the sensing function.
Once all the raw spectra are designated to their respective clusters, within each cluster the above-described process is repeated. Specifically, scattering correction is applied to the raw spectra Xiraw within a particular cluster k, to obtain a local reference denoted XrefL
Recall, the different raw spectra Xiraw may correspond to different levels of analyte concentration. These concentrations levels, denoted i, are obtained from the samples using a selected gold-standard technique. Finally, a calibration vector Vk is generated for each cluster k via multivariate linear regression calibration. The calibration vector Vk, local reference XrefL
One example process for obtaining an optimal data preprocessing set is as follows:
Steps 1-4 may be repeated with different sets of parameters selected in step 1. An optimal set of parameters is the set that resulted in best average prediction accuracy.
A multivariate regression algorithm models a relation between predictor and response variables. Thus, a calibration spectral matrix χ∈d may be considered as predictor, where d is a number of wavelengths, and analytes concentration vector ∈ is considered as a response. Each ith row of spectral matrix corresponds to locally corrected and preprocessed spectrum (e.g., Savitzky-Golay filter and second derivative applied on locally corrected absorbance spectrum) and each ith row of response vector corresponds to analyte concentration measured with the gold standard. Once the relation between predictor and response is determined, unknown value of analyte concentration can be predicted based on the new locally corrected and preprocessed spectrum. Multivariate regression may include partial least squares regression and its modifications, multiple linear regression, support vector regression, artificial neural networks, and/or principal components regression.
Referring to
Next, the collected absorbance spectra undergo baseline correction, using global scattering correction GSC reference taken from the CPU memory in step 2221 in order to initiate the clustering procedure. For clustering the collected spectra, cluster centroids and maximum allowable distance to the cluster centroids are provided from the CPU memory, and the data is classified accordingly in step 2223. If the distance to the provided cluster centroids exceeds a maximum allowable distance, the CPU initiates an error message to indicate to the user to adjust the sensor position and restart the data collection until the error is not larger than the maximum allowable, in step 2224. If the collected data, after baseline correction, has the distance to cluster centroids within the allowable range in step 2225, the collected corresponding raw spectrum is assigned to a cluster with minimal distance to the centroid in step 2226.
Next, in step 2227, the raw spectrum within the newly assigned cluster undergoes baseline correction using a local scattering correction reference from the CPU memory and the data is preprocessed using the data processing set from the CPU memory in step 2228 in order to be eligible for data prediction step 2229, where it is multiplied with the individual calibration vector Vk from the CPU memory obtained by a multivariate regression calibration. Multiplying row vector of spectrum with column vector of regression weights, we obtain a single value for analyte concentration. Each different analyte will have different calibration vector and thus weights—i.e., different wavelength specificity for a particular analyte. For instance, 2100 nm can be relevant both for lactate and glucose, however the weights will be different. Analyte's concentration is c=w1*A1+w2*A2+ . . . +wn*An. Here, wn is the calibration weight at nth wavelength and An is locally corrected and pre-processed absorbance at nth wavelength. The output is then a calibrated concentration level of the analyte of interest.
In general the sensing process starts in a similar way as the calibration process. Specifically, EMR is directed to a sample (also called medium) from where analyte concentration is to be determined. The EMR is swept through a range of wavelengths. In response, EMR is received from the sample, where the received EMR is diffusively reflected by or is transmitted through the sample. The received EMR, having components at different wavelengths, is converted into a raw absorbance spectrum (also called a raw spectrum). This process may be repeated several times to obtain a number of raw spectra, which are then averaged to obtain an averaged raw spectrum denoted Yraw. Here again in the discussion below, we omit the term “averaged” for simplicity.
Scattering correction is then applied to the raw spectrum Yraw using the global reference denoted XrefG (generated during the calibration process), to obtain a globally corrected spectrum YGC. Clustering is then performed using the cluster centroid value σk and the distance to the centroid value from the memory. That cluster may be denoted Ck, where k∈[1,N], and where the number N was either specified for the clustering operation or, alternatively, was determined while performing clustering as part of the calibration process. The corresponding raw spectrum Yraw is then designated to the same cluster Ck.
Thereafter, scattering correction is applied again to the raw spectrum Yraw within the selected cluster Ck, using the corresponding local reference denoted XrefL
An example of a transdermal sensor performance with a piglet for three different analytes—i.e., blood glucose, blood lactate and blood ethanol, in accordance with an embodiment of the invention, is provided in
In
The representative results, demonstrate excellent sensor's performance in a wide dynamic glucose concentration level range from 75 mg/dl (4.16 mmol/l) until 400 mg/dl (22.2 mmol/l), the determination coefficient of 97.2%, root mean square error of prediction (RMSEP) of 14.7 mg/dl (or 0.8 mmol/l) and mean absolute relative difference of 6.7% in the entire range.
In
In
In
Due to the nonlinear nature of the diffuse reflectance one of the important steps in data preprocessing is the linearization and scattering correction of the collected spectrum, which, when correctly applied, allows further processing of the data for instance Beer-Lambert absorbance based analysis, where of the linearized and corrected spectrum is decomposed into individual components. This subsequent analysis may be performed in combination with other linear regression techniques to obtain a calibrated value of the concentration level of the constituent/analyte of interest.
In
Further processing of the isolated spectrum can be performed as shown in
In
In
The described embodiments of the invention are intended to be merely exemplary and numerous variations and modifications are intended to be within the scope of the present invention as defined in the appended claims.
This application claims priority to and benefit of U.S. Provisional Patent Application No. 62/944,644, entitled “Systems and Methods for Measuring Concentration of an Analyte,” filed on Dec. 6, 2019, the entire contents of which are incorporated herein by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/EP2020/084552 | 12/3/2020 | WO |
Number | Date | Country | |
---|---|---|---|
62944644 | Dec 2019 | US |