This application claims the benefit of Japanese Patent Application No. 2013-149741, filed on Jul. 18, 2013. The content of the aforementioned application is incorporated herein by reference in its entirety.
1. Technical Field
The present invention relates to a technology of creating a calibration curve used in acquiring content of a target component in blood, and a technology of acquiring the content of the target component in blood.
2. Related Art
In the related art, there is provided a method of performing independent component analysis of observation data of a test object which is obtained by observing at a plurality of different positions, setting an independent component calculated by the independent component analysis as a fundamental function, and representing the observation data as a linear sum of the fundamental function, to analyze concentration or the like of a target component (see JP-A-2007-44104). According to the method disclosed in JP-A-2007-44104, it is possible to acquire oxyhemoglobin concentration and deoxyhemoglobin concentration in blood, from non-invasive observation data.
However, in the technology of the related art, a plurality of different observation data items of the test object are necessary every time calibration of a target component of the test object is performed, and it is difficult to perform the calibration with high accuracy from one observation data item. In addition, since the test object is a living body, acquiring the plurality of observation data items places a burden on a test subject, and it is necessary to solve the aforementioned problems.
Various kinds of noise may be included in the observation data. In this case, the accuracy of the independent component analysis, or of the calibration using it, may be degraded.
Further, the observation data may vary depending on the test object, due to unevenness of a composition or a structure of the test object. In such a case, the accuracy of the independent component analysis, or of the calibration using it, may also be degraded.
An advantage of some aspects of the invention is to allow highly precise calibration of a test object from one observation data item, when performing calibration of a target component in blood for the test object.
The invention can be implemented as the following forms or application examples.
This application example is directed to a calibration curve creating method of creating a calibration curve used in acquiring content of a target component which is a specific component in blood from observation data of a living body which is a test object, the method including: (a) causing a computer to acquire the observation data of a plurality of samples of a living body, when near infrared light having a wavelength of 800 nm to 1300 nm is emitted to the living body and an absorbance spectrum obtained from transmitted light or diffusely-reflected light thereof is set as the observation data; (b) causing the computer to acquire content of the target component of each sample; (c) causing the computer to estimate a plurality of independent components when the observation data of each sample is separated into the plurality of independent components, and to acquire a mixing coefficient corresponding to the target component for each sample based on the plurality of independent components; and (d) causing the computer to acquire a regression equation of the calibration curve based on the content of the target component of the plurality of samples and the mixing coefficient for each sample, in which (i) causing the computer to acquire an independent component matrix including the independent component of each sample, (ii) causing the computer to acquire an estimated mixing matrix showing a vector set for regulating a ratio of an independent component element of each independent component in each sample, from the independent component matrix, and (iii) causing the computer to acquire a correlation of content of the target component of the plurality of samples, for each vector included in the estimated mixing matrix, and to select the vector which is determined to have a highest correlation, as a mixing coefficient corresponding to the target component, are included in (c), and in (i), the computer acquires the independent component matrix by performing a first 
preprocessing including normalization of the observation data, a second preprocessing including whitening, and an independent component analysis process in this order, and the computer performs normalization after a process performed by project on null space in the first preprocessing.
According to the calibration curve creating method of Application Example 1, a calibration curve for acquiring the content of the target component which is the specific component in the blood from the observation data of the living body which is the test object, is created from the observation data acquired from each sample and the content of the target component of the plurality of samples of the living body. Accordingly, it is possible to acquire the content of the target component with high accuracy by using this calibration curve, even when one observation data item of the living body which is the test object is used. Therefore, if the calibration curve is previously created according to the calibration curve creating method of Application Example 1, it is only necessary to acquire one observation data item of the living body which is the test object when performing calibration. As a result, it is possible to acquire a target component amount in the blood from one observation data item which is an actually-measured value, with high accuracy. In particular, according to the calibration curve creating method of Application Example 1, it is possible to allow the light to reach a blood vessel, by setting light to be emitted to the living body as near-infrared light having a wavelength of 800 nm to 1300 nm. Accordingly, it is possible to obtain observation data in which an effect of a blood component is directly reflected, and to improve calibration accuracy. In addition, the estimated mixing matrix is acquired and the vector having a high correlation with respect to the content of the target component of the sample is extracted from the estimated mixing matrix, and therefore it is possible to obtain a mixing coefficient having high estimation accuracy. 
Further, since the process performed by the project on null space is performed in the first preprocessing, it is possible to decrease an effect of baseline variation included in the observation data to improve calibration accuracy.
This application example is directed to the calibration curve creating method according to Application Example 1, wherein the computer performs whitening by factor analysis in the second preprocessing.
In this method, since the whitening by factor analysis is performed in the second preprocessing, it is possible to decrease an effect of noise (particularly, random noise) included in the observation data to improve calibration accuracy.
This application example is directed to the calibration curve creating method according to Application Example 1 or 2, wherein the computer uses β divergence as an independence index of the independent component analysis process.
In this method, since β divergence is used as the independence index of the independent component analysis process, it is possible to decrease an effect of an outlier such as spike noise included in the observation data to improve calibration accuracy.
This application example is directed to a calibration curve creation apparatus which creates a calibration curve used in acquiring content of a target component which is a specific component in blood from observation data of a living body which is a test object, the apparatus including: a sample observation data acquisition unit which acquires the observation data of a plurality of samples of a living body, when near infrared light having a wavelength of 800 nm to 1300 nm is emitted to the living body and an absorbance spectrum obtained from transmitted light or diffusely-reflected light thereof is set as the observation data; a sample target component amount acquisition unit which acquires content of the target component of each sample; a mixing coefficient estimation unit which estimates a plurality of independent components when the observation data of each sample is separated into the plurality of independent components, and acquires a mixing coefficient corresponding to the target component for each sample based on the plurality of independent components; and a regression equation calculation unit which acquires a regression equation of the calibration curve based on the content of the target component of the plurality of samples and the mixing coefficient for each sample, in which the mixing coefficient estimation unit includes an independent component matrix calculation unit which acquires an independent component matrix including each independent component of each sample, an estimated mixing matrix calculation unit which acquires an estimated mixing matrix showing a vector set for regulating a ratio of an independent component element of each independent component in each sample, from the independent component matrix, and a mixing coefficient selection unit which acquires a correlation of content of the target component of the plurality of samples, for each vector included in the estimated mixing matrix, and selects the vector which is determined to have a 
highest correlation, as a mixing coefficient corresponding to the target component, the independent component matrix calculation unit acquires the independent component matrix by performing a first preprocessing including normalization of the observation data, a second preprocessing including whitening, and an independent component analysis process in this order, and the independent component matrix calculation unit performs normalization after a process performed by project on null space in the first preprocessing.
According to the calibration curve creation apparatus of Application Example 4, in the same manner as the calibration curve creating method according to Application Example 1, it is only necessary to acquire one observation data item of the living body which is the test object when performing the calibration. Accordingly, it is possible to acquire the target component amount from one observation data item which is an actually-measured value, with high accuracy. In addition, since the process performed by the project on null space is performed in the first preprocessing, it is possible to decrease an effect of baseline variation included in the observation data to improve calibration accuracy.
This application example is directed to the calibration curve creation apparatus according to Application Example 4, wherein the independent component matrix calculation unit performs whitening by factor analysis in the second preprocessing.
In this apparatus, since the whitening by factor analysis is performed in the second preprocessing, it is possible to decrease an effect of noise (particularly, random noise) included in the observation data to improve calibration accuracy.
This application example is directed to the calibration curve creation apparatus according to Application Example 4 or 5, wherein the independent component matrix calculation unit uses β divergence as an independence index of the independent component analysis process.
In this apparatus, since β divergence is used as the independence index of the independent component analysis process, it is possible to decrease an effect of an outlier such as spike noise included in the observation data to improve calibration accuracy.
This application example is directed to the calibration curve creation apparatus according to any one of Application Examples 4 to 6, which further includes a storage unit which stores the independent component matrix calculated by the independent component matrix calculation unit, a target component order which shows a position of the mixing coefficient selected by the mixing coefficient selection unit in the estimated mixing matrix, and the regression equation calculated by the regression equation calculation unit.
According to this configuration, in the calibration curve creation apparatus, it is possible to store the independent component matrix, the target component order, and the regression equation in the storage unit.
This application example is directed to a blood component calibration apparatus which acquires content of a target component which is a specific component in blood for a living body which is a test object, the apparatus including: a test object observation data acquisition unit which emits near infrared light having a wavelength of 800 nm to 1300 nm to the test object, and acquires an absorbance spectrum obtained from transmitted light or diffusely-reflected light thereof as observation data; a calibrating data acquisition unit which acquires calibrating data including at least an independent component corresponding to the target component; a mixing coefficient calculation unit which acquires a mixing coefficient with respect to the target component of the test object, based on the observation data of the test object and the calibrating data; and a target component amount calculation unit which calculates content of the target component based on a constant of a regression equation showing a relationship between the mixing coefficient and the content corresponding to the target component, and the mixing coefficient acquired by the mixing coefficient calculation unit which are prepared in advance, in which the mixing coefficient calculation unit performs a first preprocessing including normalization of the observation data and a second preprocessing including whitening in this order, and performs normalization after a process performed by project on null space in the first preprocessing.
According to this blood component calibration apparatus, although only one observation data item is acquired for the living body which is the test object, it is possible to acquire the content of the target component for the test object with high accuracy. Since the process performed by the project on null space is performed in the first preprocessing, it is possible to decrease an effect of baseline variation included in the observation data to improve calibration accuracy.
This application example is directed to the blood component calibration apparatus according to Application Example 8, wherein the mixing coefficient calculation unit performs whitening by factor analysis in the second preprocessing.
In this apparatus, since the whitening by factor analysis is performed in the second preprocessing, it is possible to decrease an effect of noise (particularly, random noise) included in the observation data to improve calibration accuracy.
This application example is directed to the blood component calibration apparatus according to Application Example 8 or 9, wherein the calibrating data acquisition unit acquires the independent component which is previously acquired as a component corresponding to the target component, as the calibrating data, and the mixing coefficient calculation unit acquires an inner product of the independent component and the observation data of the test object, and sets the inner product as the mixing coefficient.
According to this blood component calibration apparatus, it is possible to easily acquire a mixing coefficient having a high correlation with the target component of the test object with high accuracy.
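As an illustration only, the inner-product calculation described above can be sketched as follows in Python with NumPy. The function names and the numerical values are assumptions introduced for this sketch, and the observation data is assumed to have already been subjected to the first and second preprocessing.

```python
import numpy as np

def mixing_coefficient(independent_component, observation):
    """Inner product of the stored independent component and one
    observation spectrum of the test object (hypothetical helper)."""
    y = np.asarray(independent_component, dtype=float)
    x = np.asarray(observation, dtype=float)
    return float(np.dot(y, x))

def target_content(independent_component, observation, u, v):
    """Content of the target component from the regression constants u and v,
    assuming a linear regression equation C = u * alpha + v."""
    alpha = mixing_coefficient(independent_component, observation)
    return u * alpha + v

# Hypothetical three-band example.
y_k = [1.0, 0.0, 2.0]     # stored independent component (assumed values)
x = [0.5, 3.0, 0.25]      # one observation spectrum (assumed values)
alpha = mixing_coefficient(y_k, x)
content = target_content(y_k, x, u=50.0, v=70.0)
```

In this sketch only one observation spectrum of the test object is needed, which corresponds to the advantage stated for this application example.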
This application example is directed to the blood component calibration apparatus according to Application Example 8 or 9, wherein the calibrating data acquisition unit acquires the plurality of independent components when each observation data item of the plurality of samples is separated into the plurality of independent components, as the calibrating data, and the mixing coefficient calculation unit calculates an estimated mixing matrix of the test object based on the observation data and the plurality of independent components of the test object, and extracts a mixing coefficient corresponding to the target component from the calculated estimated mixing matrix.
According to this blood component calibration apparatus, it is possible to acquire a mixing coefficient having a high correlation with the target component of the test object with high accuracy.
Further, the invention can be implemented in the following various aspects, and for example, can be implemented as an aspect of the blood component calibration apparatus which stores the regression equation acquired by the calibration curve creating method in a memory, an aspect of a computer program which implements the configuration of each unit included in the blood component calibration apparatus as a function, an aspect of the computer program or a non-transitory storage medium in which the computer program is recorded, or the like.
The invention will be described with reference to the accompanying drawings, wherein like numbers reference like elements.
Hereinafter, embodiments of the invention will be described in the following order.
A. Calibration Curve Creating Method
B. Calibrating Method of Target Component
C. Various Algorithms and Effect Thereof on Calibration Accuracy
D. Modification Examples
The following terms will be used in the description of the embodiments of the invention.
ICA: Independent Component Analysis
SNV: Standard Normal Variate Transformation
PNS: Project on Null Space
PCA: Principal Components Analysis
FA: Factor Analysis
Hereinafter, a first embodiment of the invention will be described. The first embodiment relates to a method of creating a calibration curve for acquiring glucose concentration in blood (blood sugar level) from an absorbance spectrum (=absorption spectrum) obtained from a human body, that is, a living body of a person as observation data.
The step 1 is a preparation step and is performed by an operator. The operator first prepares multiple people who will be living body samples. Herein, the prepared number of people is set as n people (n is an integer equal to or larger than 2).
A step 2 is a measurement step of a spectrum, and is performed by the operator using a blood component measuring apparatus.
The light receiving unit 220 includes a light receiving probe 222, a spectral element 224, and a light receiving element 226. The light receiving probe 222 is provided at a position facing the living body BD in the same manner as the light emitting probe 214 of the light emitting unit 210. The white light emitted by the light emitting probe 214 passes through the blood vessel BV along a flow path LP which has a so-called “banana shape” and reaches the light receiving probe 222. The light which reaches the light receiving probe 222 can be called “diffusely-reflected light” as it is light returning from (reflected by) an inner part of the living body BD, and the light receiving probe 222 receives the diffusely-reflected light from the inner part of the living body BD and transfers the light to the spectral element 224. The spectral element 224 is a spectral element using a Fabry-Perot type filter, a grating, a liquid crystal tunable filter, an acousto-optic tunable filter, or the like. The spectral element 224 disperses light by allowing selective transmission of light having a wavelength in accordance with a control command from the driving circuit 230. The driving circuit 230 performs driving control of the spectral element 224 so that light having a plurality of different wavelengths in a range of the wavelengths of 800 nm to 1300 nm passes through. By receiving, with the light receiving element 226, the light which exits from the spectral element 224, the blood component measuring apparatus 200 measures light intensity of the light having the plurality of wavelengths in a range of the wavelengths of 800 nm to 1300 nm, that is, a spectrum of spectral reflectance. The light receiving element 226 is, for example, a CCD, a CMOS, an InGaAs photodiode, or the like.
That is, the blood component measuring apparatus 200 emits light including near infrared light having a wavelength of 800 nm to 1300 nm (white light in a case of the first embodiment) to the living body BD and measures the spectrum of the spectral reflectance obtained from the diffusely-reflected light thereof. The operator images each of the plurality of living body samples prepared in the step 1 with the blood component measuring apparatus 200, to measure the spectrum of the spectral reflectance for each living body sample. A region of the wavelengths of 800 nm to 1300 nm is a so-called “biological window”, and has high transmittance with respect to the living body. Accordingly, the light having this wavelength easily reaches the blood vessel BV, and it is optimal for acquisition of signals in which an effect of a blood component is directly reflected.
A relationship represented by the following formula (1) is satisfied between the spectrum of the spectral reflectance and the absorbance spectrum.
[Absorbance]=−log10[Reflectance] (1)
Accordingly, the spectrum of the spectral reflectance measured by the blood component measuring apparatus 200 is converted into the absorbance spectrum using the formula (1). The conversion into the absorbance is performed because it is necessary to satisfy linear combination in a mixed signal analyzed in independent component analysis which will be described later and the linear combination is satisfied for the absorbance from the Lambert-Beer law.
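As an illustration only, the conversion according to the formula (1) can be sketched as follows in Python with NumPy. The function name and the sample reflectance values are assumptions introduced for this sketch and are not part of the embodiment.

```python
import numpy as np

def reflectance_to_absorbance(reflectance):
    """Apply formula (1): absorbance = -log10(reflectance)."""
    r = np.asarray(reflectance, dtype=float)
    if np.any(r <= 0):
        # The logarithm is undefined for non-positive reflectance values.
        raise ValueError("reflectance must be strictly positive")
    return -np.log10(r)

# Hypothetical reflectance values at three wavelength bands.
example_absorbance = reflectance_to_absorbance([1.0, 0.1, 0.01])
```

A reflectance that is ten times smaller gives an absorbance that is larger by one, which is the logarithmic relationship under which the Lambert-Beer law yields the linear combination required by the independent component analysis.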
In the step 2, the absorbance spectrum may be measured instead of the spectrum of the spectral reflectance. As a measured result, data of absorbance distribution showing a property of a test object with respect to the wavelength is output. The absorbance spectrum obtained as described above is also called “spectral data”, hereinafter.
In addition, instead of measuring the spectral reflectance spectrum and the absorbance spectrum with the blood component measuring apparatus as a spectroscopic measurement instrument, the spectra may be estimated from other measured values. For example, the living body sample may be measured with a multiband camera to estimate the spectral reflectance or absorbance spectrum from the obtained multiband image. As such an estimating method, a method disclosed in JP-A-2001-99710 can be used.
The embodiment will be described by returning to
A step 4 is an estimation step of a mixing coefficient and is performed using a personal computer.
The computer 100 is a well-known device which includes a CPU 10 which performs various processes or control by executing a computer program (hereinafter, simply referred to as a “program”), a memory 20 (storage unit) which is a data saving location, a hard disk drive 30 which stores programs, data, or information, an input interface (I/F) 50, and an output interface (I/F) 60.
The blood component measuring apparatus 200 shown in
As a result of acquisition of the spectral data and the glucose amount, a data set (hereinafter, referred to as a “measurement data set”) DS1 including the spectral data and the glucose amount is stored in the hard disk drive 30 of the computer 100.
The CPU 10 performs a process of estimating the mixing coefficient which is an operation of the step 4, by loading a predetermined program stored in the hard disk drive 30 into the memory 20 and executing the program. Herein, the predetermined program can be downloaded using a network such as the Internet from outside. In the step 4, the CPU 10 functions as the mixing coefficient estimation unit 430 of
Independent component analysis (ICA) is one of multi-dimensional signal analysis methods, and is a technology of measuring a mixed signal on which an independent signal is superimposed, under several different conditions, and separating an independent original signal from the mixed signal based thereon. If independent component analysis is used, by recognizing the spectral data obtained by the step 2 as data in which m independent components (unknowns) such as glucose are mixed with each other, the spectra of the independent components can be estimated from the spectral data (observation data) obtained by the step 2.
In the first embodiment, the independent component analysis is performed by performing the process by the three processing units 450, 460, and 470 shown in
It is not necessary to perform the process by the PNS 454 in a case of performing the SNV 452 with respect to the spectral data obtained in the step 2 of
A process other than the SNV or the PNS may be performed as the first preprocessing. In the first preprocessing, it is preferable to perform any normalization process, but the normalization process may be omitted. Hereinafter, the first preprocessing unit 450 is also called a “normalization processing unit”. The content of the two processes 452 and 454 will be further described. The first preprocessing can also be omitted in a case where the data to be processed which is applied to the independent component matrix calculation unit 432 is normalized data.
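As an illustration only, the first preprocessing in the order "project on null space, then normalization" can be sketched as follows in Python with NumPy. This is a sketch under stated assumptions and not the processing of the embodiment itself: the baseline is assumed to be a low-order polynomial in the wavelength index, the SNV is taken in its textbook form (each spectrum is centered and scaled by its own standard deviation), and all function names and the `degree` parameter are hypothetical.

```python
import numpy as np

def project_on_null_space(X, degree=1):
    """Remove baseline variation by projecting each spectrum onto the null
    space of an assumed baseline basis (polynomials up to `degree`).
    X has shape (n_samples, n_bands)."""
    X = np.asarray(X, dtype=float)
    n_bands = X.shape[1]
    t = np.linspace(-1.0, 1.0, n_bands)
    B = np.vander(t, degree + 1, increasing=True)   # columns: 1, t, t^2, ...
    P = np.eye(n_bands) - B @ np.linalg.pinv(B)     # symmetric projector
    return X @ P

def snv(X):
    """Standard normal variate: zero mean and unit variance per spectrum."""
    X = np.asarray(X, dtype=float)
    mean = X.mean(axis=1, keepdims=True)
    std = X.std(axis=1, keepdims=True)
    return (X - mean) / std

def first_preprocessing(X, degree=1):
    """PNS followed by normalization (SNV), in that order."""
    return snv(project_on_null_space(X, degree))
```

Under these assumptions, two spectra which differ only by a scale factor and by an offset-plus-slope baseline become identical after the first preprocessing.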
The second preprocessing unit 460 can perform preprocessing using any one of principal components analysis (PCA) 462 and factor analysis (FA) 464. A process other than PCA or FA may be used as the second preprocessing. Hereinafter, the second preprocessing unit 460 is also called a “whitening processing unit”. In the typical ICA method, dimensional compression and decorrelation of the data to be processed are performed as the second preprocessing. Since a transformation matrix to be acquired in the ICA is limited to an orthogonal transformation matrix by the second preprocessing, it is possible to decrease computational complexity of the ICA. Such a second preprocessing is called “whitening” and PCA is used in many cases. However, in a case where random noise is included in the data to be processed, the PCA may be affected by the noise, and accordingly an error may be generated in a result. Therefore, in order to decrease the effect of the random noise, it is preferable to perform the whitening using the FA, which is robust with respect to the noise, instead of the PCA. The second preprocessing unit 460 of
The independent component analysis processing unit (ICA processing unit) 470 performs the ICA with respect to the spectral data which is subjected to the first preprocessing and the second preprocessing, to estimate a spectrum of the independent component. The ICA processing unit 470 can perform analysis using any one of the first processing 472 which uses a kurtosis as an independence index, and the second processing 474 which uses β divergence as an independence index. As an index for separating the independent components, the ICA generally uses higher order statistics representing independence between the separated data items as the independence index. The kurtosis is a typical independence index. However, in a case where an outlier such as spike noise is included in the data to be processed, the outlier is included in the statistics calculated as the independence index. Therefore, an error may arise between the true statistics of the data to be processed and the calculated statistics, and this may cause a decrease in separation accuracy. Accordingly, in order to decrease the effect of outliers in the data to be processed, it is preferable to use an independence index which is hardly affected by them. β divergence can be used as the independence index having such properties. The content of the kurtosis and the β divergence will be further described later. An index other than the kurtosis or the β divergence may be used as the independence index of the ICA.
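As an illustration only, whitening by PCA and the kurtosis used as an independence index can be sketched as follows in Python with NumPy. The sketch covers only the PCA variant and the kurtosis; whitening by FA and the β divergence are not shown. The function names and the data layout (n mixed signals in rows, l wavelength bands in columns) are assumptions introduced for this sketch.

```python
import numpy as np

def pca_whiten(X, m):
    """Whitening by PCA with dimensional compression to m components.
    X has shape (n, l). Returns the whitened data Z of shape (m, l) and
    the whitening matrix V of shape (m, n)."""
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=1, keepdims=True)          # center each mixed signal
    cov = Xc @ Xc.T / Xc.shape[1]                   # (n, n) covariance matrix
    eigval, eigvec = np.linalg.eigh(cov)            # eigenvalues in ascending order
    idx = np.argsort(eigval)[::-1][:m]              # keep the m largest
    V = (eigvec[:, idx] / np.sqrt(eigval[idx])).T   # scale each axis to unit variance
    return V @ Xc, V

def excess_kurtosis(y):
    """Excess kurtosis of a signal; it is zero for a Gaussian signal."""
    y = np.asarray(y, dtype=float)
    y = y - y.mean()
    return np.mean(y ** 4) / np.mean(y ** 2) ** 2 - 3.0
```

After this whitening the covariance matrix of Z is the identity matrix, which is why the transformation remaining to be found by the ICA can be restricted to an orthogonal matrix. Because the kurtosis depends on the fourth power of the signal, a single spike can change it strongly, which is consistent with the sensitivity to outliers described above.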
Next, typical processing content of the independent component analysis will be described in detail. Spectra S of m unknown components (sources) (hereinafter, these spectra may be simply referred to as “unknown components”) are assumed to be given by a vector of the following formula (2), and n spectral data items X obtained by the step 2 are assumed to be given by a vector of the following formula (3). In addition, each of elements (S1, S2, . . . , Sm) included in the formula (2) is itself a vector (spectrum). That is, the element S1 is represented as a formula (4), for example. Elements (X1, X2, . . . , Xn) included in the formula (3) are also vectors, and the element X1 is represented as a formula (5), for example. An index l is the number of wavelength bands in which the spectra are measured. The number of elements m of the spectra S of the unknown components is an integer equal to or larger than 1, and is empirically and experimentally determined in advance.
$S = [S_1, S_2, \ldots, S_m]^T$ (2)
$X = [X_1, X_2, \ldots, X_n]^T$ (3)
$S_1 = \{S_{11}, S_{12}, \ldots, S_{1l}\}$ (4)
$X_1 = \{X_{11}, X_{12}, \ldots, X_{1l}\}$ (5)
Each unknown component is assumed to be independent statistically. A relationship of the following formula (6) is satisfied between the unknown components S and the spectral data items X.
X=A·S (6)
A of the formula (6) is the mixing matrix, and can also be represented by the following formula (7). Herein, the letter “A” should properly be shown in bold as in the formula (7), but it is shown as a normal letter in sentences due to limitation of letters of the specification. Hereinafter, other bold letters representing matrices are shown as normal letters, in the same manner.
$A = \begin{pmatrix} a_{11} & a_{12} & \cdots & a_{1m} \\ a_{21} & a_{22} & \cdots & a_{2m} \\ \vdots & \vdots & \ddots & \vdots \\ a_{n1} & a_{n2} & \cdots & a_{nm} \end{pmatrix}$ (7)
A mixing coefficient aij included in the mixing matrix A represents a contribution degree of an unknown component Sj (j=1 to m) to spectral data Xi (i=1 to n) which is the observation data.
In a case where the mixing matrix A is known, a least squares solution of the unknown component S can be simply acquired as A+·X using a pseudo inverse matrix A+ of A, but in a case of the first embodiment, since the mixing matrix A is also unknown, it is necessary to estimate the unknown component S and the mixing matrix A only from the observation data X. That is, as shown in the following formula (8), a matrix (hereinafter, referred to as an “independent component matrix”) Y showing the spectrum of the independent component is calculated using a separation matrix W of m×n, only from the observation data X. As an algorithm for acquiring this separation matrix W of the following formula (8), various algorithms such as Infomax, Fast Independent Component Analysis (FastICA), Joint Approximate Diagonalization of Eigenmatrices (JADE), and the like can be used.
Y=W·X (8)
The independent component matrix Y corresponds to an estimated value of the unknown component S. Accordingly, the following formula (9) can be obtained, and the following formula (10) can be obtained by transforming the formula (9).
$X = \hat{A} \cdot Y$ (9)
$\hat{A} = X \cdot Y^{+}$ (10)
Herein, ̂A is an estimated mixing matrix of A and Y+ is a pseudo inverse matrix of Y.
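As an illustration only, the formulas (8) and (10) can be sketched as follows in Python with NumPy. The data are synthetic, and the separation matrix W is obtained from a known mixing matrix purely for demonstration; in practice W is estimated by an ICA algorithm and the independent components are recovered only up to order and scale. All names and sizes are assumptions introduced for this sketch.

```python
import numpy as np

def separate(W, X):
    """Formula (8): independent component matrix Y = W . X."""
    return W @ X

def estimate_mixing_matrix(X, Y):
    """Formula (10): estimated mixing matrix A_hat = X . Y^+,
    where Y^+ is the pseudo inverse matrix of Y."""
    return X @ np.linalg.pinv(Y)

# Synthetic demonstration with assumed sizes.
rng = np.random.default_rng(0)
n, m, l = 6, 3, 200                     # samples, components, wavelength bands
S_true = rng.laplace(size=(m, l))       # unknown components (non-Gaussian)
A_true = rng.normal(size=(n, m))        # mixing matrix
X = A_true @ S_true                     # formula (6)

W = np.linalg.pinv(A_true)              # idealized separation matrix (m x n)
Y = separate(W, X)                      # shape (m, l)
A_hat = estimate_mixing_matrix(X, Y)    # shape (n, m)
```

In this idealized case Y coincides with the unknown components, so the estimated mixing matrix reproduces the mixing matrix used to generate the data.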
The estimated mixing matrix ̂A (denoted as this due to limitation of letters of the specification, but actually meaning the letter with attached symbol on left side of the formula (10), this applies to the other letters) obtained with the formula (10) can be represented by the following formula (11).
In Step S110 of
After performing Step S110, the CPU 10 performs a process of calculating the independent component matrix Y, based on the separation matrix W and the spectral data X of each sample which is obtained in the step 2 and is previously stored in the hard disk drive 30 (Step S120). This calculation process is a process of performing an arithmetic operation according to the formula (8). In the processes of Steps S110 and S120, the CPU 10 functions as the independent component matrix calculation unit 432 of
Next, the CPU 10 performs a process of calculating the estimated mixing matrix ̂A, based on the spectral data X of each sample which is previously stored in the hard disk drive 30, and the independent component matrix Y which is calculated in Step S120 (Step S130). This calculation process is a process of performing an arithmetic operation according to the formula (10).
The estimated mixing matrix ̂A is obtained by the process up to Step S130. That is, the coefficients (estimated mixing coefficient) ̂aij included in the estimated mixing matrix ̂A are obtained. After that, the process proceeds to Step S140.
In Step S140, the CPU 10 acquires a correlation (degree of similarity) between the glucose amounts C1, C2, . . . , Cn measured in the step 3, and a component (hereinafter, referred to as a vector ̂α) of each row included in the estimated mixing matrix ̂A calculated in Step S130. In detail, a correlation between the glucose amount C (C1, C2, . . . , Cn) and the vector ̂α1 (̂a11, ̂a21, . . . , ̂an1) of a first row is acquired, then a correlation between the glucose amount C (C1, C2, . . . , Cn) and the vector ̂α2 (̂a12, ̂a22, . . . , ̂an2) of a second row is acquired, and the correlation of each subsequent row with respect to the glucose amount C is acquired in the same manner, until lastly a correlation between the glucose amount C (C1, C2, . . . , Cn) and the vector ̂αm (̂a1m, ̂a2m, . . . , ̂anm) of an m-th row is acquired.
The correlations can be acquired by a correlation coefficient R according to the following formula (12). The correlation coefficient R is called Pearson's product-moment correlation coefficient.
Herein, the overlined symbols C and ̂αk in the formula (12) represent the average value of the glucose amount C and the average value of the vector ̂αk, respectively.
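The correlation calculation of Step S140 can be sketched as follows. This is a minimal illustration, not the implementation of the embodiment: the function names `pearson_r` and `target_component_order` and the sample values are hypothetical, and the vector with the highest correlation coefficient R is taken as the one corresponding to the target component.

```python
import math

def pearson_r(c, alpha):
    """Pearson's product-moment correlation coefficient of two equal-length sequences."""
    n = len(c)
    c_mean = sum(c) / n
    a_mean = sum(alpha) / n
    cov = sum((ci - c_mean) * (ai - a_mean) for ci, ai in zip(c, alpha))
    c_var = sum((ci - c_mean) ** 2 for ci in c)
    a_var = sum((ai - a_mean) ** 2 for ai in alpha)
    return cov / math.sqrt(c_var * a_var)

def target_component_order(c, alpha_vectors):
    """Return the 1-based order k of the vector most correlated with c, and all R values."""
    correlations = [pearson_r(c, alpha) for alpha in alpha_vectors]
    best = max(range(len(correlations)), key=lambda i: correlations[i])
    return best + 1, correlations

# Hypothetical data: glucose amounts of n = 4 samples and m = 2 coefficient vectors.
glucose = [80.0, 100.0, 120.0, 140.0]
alpha_vectors = [
    [0.9, 0.1, 0.5, 0.3],  # alpha_1: no clear relation to the glucose amount
    [0.2, 0.4, 0.6, 0.8],  # alpha_2: increases linearly with the glucose amount
]
k, r = target_component_order(glucose, alpha_vectors)
```

With these hypothetical values the second vector is selected, so the target component order would be k=2.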
As a result of Step S140 of
In the case of the Table TB of
The step 5 is a calculation step of a regression equation, and is performed using the computer 100 in the same manner as during the time of performing the step 4. In the step 5, the computer 100 performs a process of calculating the regression equation of the calibration curve. The step 5 may be performed by transferring the data which is subjected to the process up to the step 4, to another computer.
F: C=u·̂αk+v (13)
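The constants u and v of the formula (13) can be obtained, for example, by an ordinary least-squares fit of the measured content C of each sample against the mixing coefficient ̂αk of that sample. The following is a minimal sketch under that assumption; the fitting method, the function name `fit_regression`, and the sample values are illustrative and are not taken from the embodiment.

```python
def fit_regression(alpha_k, c):
    """Least-squares fit of C = u * alpha_k + v; returns the constants (u, v)."""
    n = len(c)
    a_mean = sum(alpha_k) / n
    c_mean = sum(c) / n
    sxx = sum((a - a_mean) ** 2 for a in alpha_k)
    sxy = sum((a - a_mean) * (ci - c_mean) for a, ci in zip(alpha_k, c))
    u = sxy / sxx              # slope of the calibration curve
    v = c_mean - u * a_mean    # intercept of the calibration curve
    return u, v

# Hypothetical mixing coefficients of the target component and measured contents.
alpha_k = [0.2, 0.4, 0.6, 0.8]
glucose = [80.0, 100.0, 120.0, 140.0]
u, v = fit_regression(alpha_k, glucose)
```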
After performing Step S210, the CPU 10 stores the constants u and v of the regression equation F acquired in Step S210, the target component order k (
Next, a calibrating method of the target component will be described. A test object is configured with the same components as the sample used when creating the calibration curve. In detail, the calibrating method of the target component is performed using the computer. In addition, the computer herein may be the computer 100 used when creating the calibration curve or may be another computer.
Xp={Xp1, Xp2, . . . , Xpl} (14)
In the process of Step S310, the CPU 10 functions as the test object observation data acquisition unit 510 of
After performing Step S315, the preprocessing is performed with respect to the absorbance spectrum Xp of the test object which is obtained in Step S310 (Step S325). As this preprocessing, it is preferable to perform the same process as the preprocessing (that is, normalization process by the first preprocessing unit 450 and the whitening process by the second preprocessing unit 460) used in the step 4 of
After that, the CPU 10 performs the process of acquiring the estimated mixing matrix ̂A of the test object, based on the independent component matrix Y included in the calibrating data set DS2 and the preprocessed spectrum obtained in Step S325 (Step S335). In detail, since the arithmetic process according to the formula (10) is performed, an inverse matrix (pseudo inverse matrix) Y+ of the independent component matrix Y included in the calibrating data set DS2 is acquired, and the pseudo inverse matrix Y+ is applied to the preprocessed vector obtained in Step S325, to acquire the estimated mixing matrix ̂A.
As shown in the following formula (15), the estimated mixing matrix ̂A in the calibration process is a row vector (1×m matrix) which is formed of the mixing coefficients corresponding to each independent component. Herein, after performing Step S335, the CPU 10 reads out the target component order k included in the calibrating data set DS2 from the hard disk drive 30, extracts the mixing coefficient ̂αk of a k-th component corresponding to the target component order k, from the estimated mixing matrix ̂A acquired in Step S335, and temporarily stores the mixing coefficient ̂αk in the memory 20 as a mixing coefficient of glucose which is the target component (Step S340). In the processes of Steps S325, S335, and S340, the CPU 10 functions as the mixing coefficient calculation unit 530 of
̂A=(̂α1, ̂α2, . . . , ̂αm) (15)
Next, the CPU 10 reads out the constants u and v of the regression equation included in the calibrating data set DS2 from the hard disk drive 30 and substitutes the constants u and v and the mixing coefficient ̂αk of glucose which is the target component obtained in Step S340 into the right side of the formula (13), to acquire the content C of the glucose (Step S350). The content C is acquired as glucose concentration, that is, a mass of glucose included in a unit volume of blood (for example, 100 ml) of the test object. In the process of Step S350, the CPU 10 functions as the target component amount calculation unit 540 of
In the first embodiment, the content C (mass per unit volume) acquired in Step S350 is set as the content of glucose of the test object, but instead of this, the content C acquired in Step S350 may be corrected with the normalization coefficient used in the normalization in Step S325 and the corrected value may be set as the content to be acquired. In detail, an absolute value (grams) of the content may be acquired by multiplying the content C by the standard deviation. According to this configuration, the accuracy of the content C can be further increased, depending on the kind of the target component.
According to the calibration curve creating method of the embodiment configured as described above, it is possible to acquire the glucose amount from one spectrum which is an actually-measured value of the blood which is the test object, with high accuracy.
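The flow of Steps S335 to S350 can be sketched as follows. This is a minimal illustration under stated assumptions: the rows of the independent component matrix Y are assumed to be linearly independent, in which case applying the pseudo inverse matrix Y+ is equivalent to solving the normal equations (Y·Y^T)·a=Y·xp; the function names and all numerical values are hypothetical, and the preprocessing of Step S325 is omitted.

```python
def solve_linear(g, b):
    """Solve g * a = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(g)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            f = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= f * aug[col][c]
    a = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(aug[i][j] * a[j] for j in range(i + 1, n))
        a[i] = (aug[i][n] - s) / aug[i][i]
    return a

def estimate_mixing_row(x_p, y):
    """Row vector a with x_p ~= a . Y, i.e. a = x_p . Y+ when Y has full row rank."""
    gram = [[sum(p * q for p, q in zip(yi, yj)) for yj in y] for yi in y]
    rhs = [sum(p * q for p, q in zip(yi, x_p)) for yi in y]
    return solve_linear(gram, rhs)

def target_content(x_p, y, k, u, v):
    """Extract the k-th mixing coefficient (1-based) and apply C = u * alpha_k + v."""
    alpha_k = estimate_mixing_row(x_p, y)[k - 1]
    return u * alpha_k + v

# Hypothetical calibrating data: m = 2 independent components over l = 4 wavelengths.
y = [[1.0, 0.0, 1.0, 0.0],
     [1.0, 1.0, 0.0, 1.0]]
x_p = [2.5, 2.0, 0.5, 2.0]  # equals 0.5 * y[0] + 2.0 * y[1]
a_hat = estimate_mixing_row(x_p, y)
content = target_content(x_p, y, k=1, u=100.0, v=60.0)
```

In this example the estimated mixing coefficients are approximately 0.5 and 2.0, and the content obtained for k=1 is approximately 110.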
Various algorithms used in the first preprocessing unit 450, the second preprocessing unit 460, and the independent component analysis processing unit 470 shown in
As the first preprocessing performed by the first preprocessing unit 450, the standard normal variate transformation (SNV) and the project on null space (PNS) can be used.
SNV is given by the following formula (16).
Herein, z represents processed data, x represents the data to be processed (absorbance spectrum in the first embodiment) and xave represents an average value of the data to be processed x, and σ represents a standard deviation of the data to be processed x. As a result of the standard normal variate transformation, the normalized data z in which the average value is 0 and the standard deviation is 1, is obtained.
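From the definitions above, the transformation amounts to z=(x−xave)/σ. A minimal sketch is shown below; the function name `snv` and the sample spectrum are hypothetical, and the use of the population standard deviation (division by the number of data points rather than by that number minus 1) is an assumption.

```python
import math

def snv(x):
    """Standard normal variate transformation of one spectrum x."""
    n = len(x)
    x_ave = sum(x) / n
    # Population standard deviation (assumption: divisor n, not n - 1).
    sigma = math.sqrt(sum((v - x_ave) ** 2 for v in x) / n)
    return [(v - x_ave) / sigma for v in x]

# Hypothetical absorbance values of one spectrum.
z = snv([1.0, 2.0, 3.0, 4.0])
```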
If the PNS is performed, it is possible to decrease the baseline variation included in the data to be processed. In measurement of the data to be processed (absorbance spectrum in the first embodiment), variation between data items, called baseline variation, such as an increase or a decrease of the average value of the data for each measurement data item occurs due to various reasons. Accordingly, it is preferable to remove the reasons for the variation before performing the independent component analysis process (ICA). The PNS can be used as the preprocessing which can decrease the baseline variation of the data to be processed. In particular, great baseline variation occurs in the measurement data of absorbed light spectrum or reflected light spectrum including the near-infrared region, and accordingly it is very advantageous to apply the PNS. Hereinafter, a principle of removing the baseline variation included in the data obtained by the measurement (also simply referred to as “measurement data x”) by the PNS will be described. In addition, as a typical example, a case where the measurement data is the absorbed light spectrum or the reflected light spectrum including the near-infrared region, will be described.
In general, in an ideal system, the measurement data x (data to be processed x) is represented by the following formula (17), using m (m is an integer equal to or larger than 2) independent components si (i=1 to m) and each mixing ratio ci.
Herein, A is a matrix (mixing matrix) formed by the mixing ratio ci.
The independent component analysis (ICA) is also performed on the assumption of this model. However, various variation factors (change of a state of a specimen or a measurement environment, and the like) exist in the actual measurement data. Therefore, as a model which takes these factors into consideration, a model which represents the measurement data x using the following formula (18) is considered.
Herein, b represents a parameter showing a variation amount of the spectrum in an amplitude direction; a, d, and e represent parameters showing, respectively, an amount of constant baseline variation E (also referred to as an “average value variation”), an amount of variation λ linearly dependent on a wavelength, and an amount of variation λ^2 quadratically dependent on a wavelength; and ε represents other variation components. In addition, the constant baseline variation E is a constant vector given by E={1, 1, 1, . . . 1}^T, whose data length is equal to the data length (number of sections of the wavelength band) of the measurement data x. The variations λ and λ^2 dependent on the wavelength are given by λ={λ1, λ2, . . . λN}^T and λ^2={λ1^2, λ2^2, . . . λN^2}^T, where N is the data length of the measurement data x. Higher-order variation of third order or higher can also be considered as the variation dependent on the wavelength, and in general up to g-order variation λ^g (g is an integer equal to or larger than 2) can be considered. Since these variation components are error factors in the ICA or the calibration, it is desirable to remove them in advance.
In the PNS, a space spanned by the baseline variation components E, λ, λ^2, . . . λ^g described above (g is an integer equal to or larger than 2) is considered, and the measurement data x is projected onto a space (null space) not including these variation components, so that data in which the baseline variation components are decreased is obtained. As the specific arithmetic operation, the processed data z from the PNS is calculated by the following formula (19).
Herein, P+ is a pseudo inverse matrix of P. ki is a component obtained by projecting the configuration component si of the formula (18) onto the null space not including the variation components. In addition, ε* is a component obtained by projecting the variation component ε of the formula (18) onto the null space.
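The principle of the PNS can be sketched as follows. This is an illustration under assumptions, not the arithmetic operation of the embodiment: P is assumed to be the matrix whose columns are E, λ, . . . λ^g, and the processed data is assumed to be what remains of x after its components along those columns are removed. Instead of a pseudo inverse matrix, the sketch orthonormalizes the columns by the Gram-Schmidt method, which yields the same projection when the columns are linearly independent. The function name `pns` and all values are hypothetical.

```python
import math

def pns(x, wavelengths, g=2):
    """Remove from x its components along E, lambda, ..., lambda^g."""
    # Baseline variation components: constant, linear, ..., g-th order terms.
    basis = [[w ** p for w in wavelengths] for p in range(g + 1)]
    # Gram-Schmidt orthonormalization of the baseline variation components.
    ortho = []
    for vec in basis:
        v = vec[:]
        for q in ortho:
            dot = sum(a * b for a, b in zip(v, q))
            v = [a - dot * b for a, b in zip(v, q)]
        norm = math.sqrt(sum(a * a for a in v))
        if norm > 1e-12:
            ortho.append([a / norm for a in v])
    # Subtract the projection of x onto each orthonormal baseline direction.
    z = list(x)
    for q in ortho:
        dot = sum(a * b for a, b in zip(z, q))
        z = [a - dot * b for a, b in zip(z, q)]
    return z

# Hypothetical spectrum measured twice with different baseline variation.
wavelengths = [0.0, 1.0, 2.0, 3.0, 4.0]
signal = [0.0, 1.0, 0.0, -1.0, 0.0]
x_a = [s + 3.0 + 0.5 * w for s, w in zip(signal, wavelengths)]
x_b = [s - 1.0 + 2.0 * w + 0.1 * w * w for s, w in zip(signal, wavelengths)]
z_a = pns(x_a, wavelengths, g=2)
z_b = pns(x_b, wavelengths, g=2)
```

The two processed results agree although the two inputs have different constant, linear, and quadratic baselines. For real wavelength values (for example, in nanometers), rescaling the wavelength axis before forming the powers would likely be advisable for numerical stability.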
If the normalization (for example, SNV) is performed after the processing of the PNS, it is possible to eliminate the effect of the variation amount b of the spectrum in the amplitude direction of the formula (18).
If ICA is performed with respect to the data which is subjected to the preprocessing by such PNS, the obtained independent component is an estimated value of the component ki of the formula (19) and is different from the actual configuration component si. However, the mixing ratio ci does not change from the original value of the formula (18), and accordingly the calibration process (
In addition, the PNS is specifically disclosed, for example, in “Extracting Chemical Information from Spectral Data with Multiplicative Light Scattering Effects by Optical Path-Length Estimation and Correction”, Zeng-Ping Chen, Julian Morris, and Elaine Martin, 2006.
Principal components analysis (PCA) and factor analysis (FA) can be used as the second preprocessing performed by the second preprocessing unit 460.
In the typical ICA method, dimensional compression and decorrelation of the data to be processed are performed as the preprocessing. Since the transformation matrix to be acquired in the ICA is limited to an orthogonal transformation matrix by this preprocessing, it is possible to decrease the computational complexity of the ICA. Such preprocessing is called “whitening” and PCA is used in many cases. The whitening using PCA is described, for example, in Chapter 6 of “Independent Component Analysis”, Aapo Hyvarinen, Juha Karhunen, Erkki Oja, 2001, John Wiley & Sons, Inc. (“Independent Component Analysis” February 2005, published by Tokyo Denki University Publishing Department).
However, in a case where random noise is included in the data to be processed, the PCA may be affected by the random noise, and accordingly error may be generated in the processed result. Therefore, in order to decrease the effect of the random noise, it is preferable to perform the whitening using the factor analysis (FA), which is robust with respect to the noise, instead of the PCA. Hereinafter, the principle of the whitening by FA will be described.
As described above, in the typical ICA, a linear mixing model (formula (17)) representing the data to be processed x as a linear sum of the configuration components si is assumed, and the mixing ratio ci and the configuration component si are acquired. However, the random noise other than that for the configuration component si is added to the actual data, in many cases. Herein, as the model obtained by considering the random noise, a model representing the measurement data x by the following formula (20) is considered.
x=A·s+ρ (20)
Herein, ρ represents the random noise.
The whitening considering this noise mixing model is performed, and then it is possible to obtain the estimation of the mixing matrix A and the independent component si by performing the ICA.
In the FA of the first embodiment, it is assumed that the independent components si and the random noise ρ are in accordance with normal distributions N (0, Im) and N (0, Σ), respectively. As generally known, a first parameter x1 of the normal distribution N (x1, x2) represents an expected value and a second parameter x2 thereof represents a variance (a covariance matrix in the multivariate case). At that time, since the data to be processed x is a linear sum of variables in accordance with the normal distribution, the data to be processed x is also in accordance with the normal distribution. Herein, when a covariance matrix of the data to be processed x is set as V[x], the normal distribution of the data to be processed x can be represented as N (0, V[x]). At that time, a likelihood function of the covariance matrix V[x] of the data to be processed x can be calculated in the following order.
First, if it is assumed that the independent components si are orthogonal to each other, the covariance matrix V [x] of the data to be processed x is calculated by the following formula (21).
V[x]=E[x·x^T]=A·A^T+Σ (21)
Herein, Σ represents the covariance matrix of the noise ρ.
As described above, the covariance matrix V[x] can be represented by the mixing matrix A and the covariance matrix Σ of the noise. At that time, a log likelihood function L (A, Σ) is given by the following formula.
Herein, n represents the number of data items of the data x, m represents the number of independent components, an operator tr represents a trace of the matrix (sum of diagonal components), and an operator det represents a determinant. In addition, C represents a sample covariance matrix acquired by sample calculation from the data x, and is calculated by the following formula.
The mixing matrix A and the covariance matrix Σ of the noise can be acquired by the maximum-likelihood method using the log likelihood function L (A, Σ) of the formula (22). As the mixing matrix A, a mixing matrix A which is substantially not affected by the random noise ρ of the formula (20) is obtained. This is the basic principle of the FA. There are also various FA algorithms which use methods other than the maximum likelihood method. Such various FA can also be used in the first embodiment.
Meanwhile, the estimated value obtained by the FA is merely a value of A·A^T. In a case where a mixing matrix A adapted for this value is determined, the decorrelation of the data can be performed while decreasing the effect of the random noise, but it is difficult to uniquely determine the plurality of configuration components si since a degree of freedom of rotation remains. On the other hand, the ICA is a process of decreasing the degree of freedom of the rotation of the plurality of configuration components si so that the plurality of configuration components si are orthogonal to each other. Therefore, in the first embodiment, the value of the mixing matrix A acquired by the FA is used as the whitened matrix (matrix subjected to the whitening), and the arbitrariness with respect to the remaining rotation is resolved by the ICA. Accordingly, by performing the ICA after performing the whitening process which is robust to the random noise, the independent configuration components si orthogonal to each other can be determined. In addition, as a result of such a process, it is possible to decrease the effect of the random noise and to improve the calibration accuracy related to the configuration components si.
In the independent component analysis (ICA), higher order statistics representing independence between the separated data items are generally used as the independence index for separating the independent components. The kurtosis is a typical independence index. The ICA using the kurtosis as the independence index is, for example, described in Chapter 8 of “Independent Component Analysis”, Aapo Hyvarinen, Juha Karhunen, Erkki Oja, 2001, John Wiley & Sons, Inc. (“Independent Component Analysis” February 2005, published by Tokyo Denki University Publishing Department).
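For reference, the kurtosis of a signal y with the average value removed is commonly defined as E[y^4]−3(E[y^2])^2. The following minimal sketch computes this quantity; the function name `kurtosis` and the sample values are hypothetical, and the sketch only evaluates the index and does not perform the ICA itself.

```python
def kurtosis(y):
    """Kurtosis E[y^4] - 3 * (E[y^2])^2 of the mean-removed signal y."""
    n = len(y)
    mean = sum(y) / n
    centered = [v - mean for v in y]
    m2 = sum(v ** 2 for v in centered) / n  # second moment
    m4 = sum(v ** 4 for v in centered) / n  # fourth moment
    return m4 - 3.0 * m2 ** 2

# Hypothetical signals: a regular signal, and the same kind of signal with one spike.
regular = [-1.0, 1.0, -1.0, 1.0]
spiky = [-1.0, 1.0, -1.0, 1.0, -1.0, 1.0, -1.0, 9.0]
k_regular = kurtosis(regular)
k_spiky = kurtosis(spiky)
```

In this example a single spike changes the sign of the kurtosis and makes its magnitude much larger, which shows how strongly one data point can affect this index.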
However, in a case where an outlier such as spike noise is included in the data to be processed, statistics including the outlier are calculated as the independence index. Therefore, error may be generated between original statistics and the calculated statistics of the data to be processed, and this may cause a decrease in separation accuracy. Herein, it is preferable to use the independence index which is hardly affected by the effect from the outlier in the data to be processed. β divergence can be used as the independence index having such properties. Hereinafter, a principle of the β divergence as the independence index in the ICA will be described.
As described above, in the typical ICA, a linear mixing model (formula (17)) representing the data to be processed x as a linear sum of the configuration components si is assumed, and the mixing ratio ci and the configuration component si are acquired. An estimated value y of the configuration component s acquired by the ICA is represented as y=W·x using the separation matrix W. At that time, the separation matrix W is desirably an inverse matrix of the mixing matrix A.
Herein, a log likelihood function L (̂W) of an estimated value ̂W of the separation matrix W can be represented by the following formula.
Herein, an element of a summation sign Σ is a log likelihood of each data point x (t). This log likelihood function L (̂W) can be used as the independence index of the ICA. A method of the β divergence is a method of applying a suitable function to the log likelihood function L (̂W) to convert the log likelihood function L (̂W) so as to suppress the effect of the outlier such as the spike noise in the data.
In a case of using the β divergence as the independence index, first, the log likelihood function L (̂W) is converted by the following formula using a function Φβ which is previously selected.
This function LΦ(̂W) is considered as a new likelihood function.
As the function Φβ for decreasing the effect of the outlier such as the spike noise, a function in which the function Φβ decreases in an exponential manner as the value of the log likelihood function (value in brackets of the function Φβ) decreases, is considered. As such a function Φβ, the following formula can be used, for example.
In this function, as the value of β increases, a function value with respect to each data point z (log likelihood in the formula (25)) decreases. The value of β can be determined empirically, and can be set as approximately 0.1, for example. The function Φβ is not limited to that of the formula (26), and it is possible to use another function in which, as the value of β increases, the function value with respect to each data point z decreases.
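The behavior of such a function can be illustrated with a minimal sketch. The form Φβ(z)=(exp(βz)−1)/β used below is an assumption: it is a form that appears in the literature on the β divergence, and it is not confirmed here to be identical to the formula (26). The function name `phi_beta` and the sample values are hypothetical.

```python
import math

def phi_beta(z, beta=0.1):
    """Assumed form (exp(beta * z) - 1) / beta; approaches z as beta approaches 0."""
    return (math.exp(beta * z) - 1.0) / beta

# Log likelihood of an ordinary data point and of an outlier (hypothetical values).
ordinary = phi_beta(-2.0, beta=0.1)
outlier = phi_beta(-1000.0, beta=0.1)
# With a very small beta the function is close to the log likelihood itself.
almost_identity = phi_beta(-2.0, beta=1e-6)
```

With this form, the contribution of a data point with an extremely low log likelihood is bounded below by −1/β, so an outlier such as spike noise cannot dominate the sum.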
When using the β divergence as the independence index, it is possible to suitably suppress the effect of the outlier such as the spike noise. In a case where the likelihood function LΦ(̂W) such as the formula (25) is considered, a pseudo distance among probability distributions which is minimized corresponding to maximization of the likelihood is β divergence. If the ICA using the β divergence as the independence index is performed, it is possible to decrease the effect of the outlier such as the spike noise to improve the calibration accuracy of the configuration component si.
The ICA using the β divergence is, for example, described in “Robust Blind Source Separation by β-Divergence”, Mihoko Minami, Shinto Eguchi, 2002.
The following two values are used as index values showing the calibration accuracy. R2 is the square of the correlation coefficient R between the actually-measured values and the calibration values obtained by the independent component analysis, and SEP is estimated standard error between the actually-measured value and the calibration value obtained by the independent component analysis. In general, the calibration accuracy is excellent when R2 is large (close to 1), and the calibration accuracy is excellent when SEP is small.
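The two index values can be computed as in the following minimal sketch. The definition of SEP used here (the standard deviation of the bias-corrected differences, with divisor n−1) is one common definition and is an assumption, since the exact definition is not given above; the function names and the sample values are hypothetical.

```python
import math

def r_squared(measured, predicted):
    """Square of the correlation coefficient R between measured and predicted values."""
    n = len(measured)
    m_mean = sum(measured) / n
    p_mean = sum(predicted) / n
    cov = sum((m - m_mean) * (p - p_mean) for m, p in zip(measured, predicted))
    m_var = sum((m - m_mean) ** 2 for m in measured)
    p_var = sum((p - p_mean) ** 2 for p in predicted)
    return cov ** 2 / (m_var * p_var)

def sep(measured, predicted):
    """Standard deviation of the bias-corrected differences (assumed SEP definition)."""
    n = len(measured)
    residuals = [p - m for m, p in zip(measured, predicted)]
    bias = sum(residuals) / n
    return math.sqrt(sum((e - bias) ** 2 for e in residuals) / (n - 1))

# Hypothetical actually-measured values and calibration values.
measured = [80.0, 100.0, 120.0, 140.0]
predicted = [82.0, 98.0, 122.0, 138.0]
r2 = r_squared(measured, predicted)
sep_value = sep(measured, predicted)
```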
In process conditions 1, the standard normal variate transformation (SNV) is used in the first preprocessing, the principal components analysis (PCA) is used in the second preprocessing, and the kurtosis is used as the independence index of the independent component analysis (ICA). In process conditions 2, the process is performed in the same manner as the process conditions 1, except for using the factor analysis (FA) in the second preprocessing. In process conditions 3, the process is performed in the same manner as the process conditions 1, except for using the project on null space (PNS) in the first preprocessing. In a case of using the PNS in the first preprocessing (process conditions 3, 5, 6, and 8), the SNV is performed after the PNS.
An effect of usage of the PNS in the first preprocessing can be recognized, when the process conditions 1 and the process conditions 3 of
An effect of usage of the FA in the second preprocessing can be recognized, when the process conditions 1 and the process conditions 2 of
An effect of usage of the β divergence as the independence index of the ICA can be recognized, when the process conditions 1 and the process conditions 4 of
Eight types of process conditions of
An effect of usage of the PNS in the first preprocessing is slight in comparison between the process conditions 1 and the process conditions 3 of
An effect of usage of the FA in the second preprocessing is considerably large so as to be recognized from comparison between the process conditions 1 and the process conditions 2 of
An effect of usage of the β divergence as the independence index of the ICA is slight in comparison between the process conditions 1 and the process conditions 4 of FIG. 15, but is considerably large in comparison between the process conditions 3 and the process conditions 5, and the considerably large effect thereof is also obtained, in the same manner, from comparison between the process conditions 2 and the process conditions 7 or comparison between the process conditions 6 and the process conditions 8. That is, the effect of usage of the β divergence as the independence index of the ICA becomes more significant, when employing at least one of usage of the PNS in the first preprocessing, and usage of the FA in the second preprocessing.
As recognized from the evaluation result of
As shown in
As shown in
As shown in
The invention is not limited to the embodiments described above or other modification examples, and can be executed in various embodiments within a range not departing from a gist thereof, and the following modifications can also be performed, for example.
In the embodiments described above, the test object observation data acquisition unit 510 (
(i) The calibrating data set DS2 stored in the hard disk drive 30 is read out to acquire the k-th row element (independent component) Yk corresponding to the target component order k from the independent component matrix Y included in the calibrating data set DS2. The independent component Yk has a highest correlation with respect to the glucose amount and corresponds to the glucose amount. (ii) Next, an inner product of the extracted independent component Yk and the spectrum Xp of the test object which is the observation data (for example, normalized spectrum obtained in Step S320) is acquired, and an inner product value thereof is set as the mixing coefficient αk of the target component. That is, an arithmetic operation according to the following formula (27) is performed.
αk=Xp·Yk (27)
Herein, since it is assumed that the observation data is a linear sum of the independent components, and orthogonality between the independent components is sufficiently high, by calculating the inner product between the spectrum which is the observation data and the independent component matrix of the target component, only the value of the independent component remains and values of all of the other components become 0. Accordingly, the mixing coefficient αk of the target component is easily calculated. However, in a case where the orthogonality between the independent components is not sufficiently high, it is preferable to acquire the estimated mixing matrix ̂A of the formula (15) without using the arithmetic operation of the formula (27).
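The arithmetic operation of the formula (27) can be illustrated with a minimal sketch. The independent components below are hypothetical and are chosen to be exactly orthogonal and of unit length, which is the idealized situation assumed in the explanation above; the function name `inner_product` is also hypothetical.

```python
def inner_product(a, b):
    """Inner product of two equal-length vectors."""
    return sum(p * q for p, q in zip(a, b))

# Hypothetical independent components, orthogonal to each other and of unit length.
y1 = [0.5, 0.5, 0.5, 0.5]
y2 = [0.5, -0.5, 0.5, -0.5]

# Spectrum of the test object formed as a linear sum: 3.0 * y1 + 0.7 * y2.
x_p = [3.0 * a + 0.7 * b for a, b in zip(y1, y2)]

# Taking the target component order as k = 2, only the coefficient of y2 remains.
alpha_k = inner_product(x_p, y2)
```

The inner product returns approximately 0.7, the coefficient of the second component, without requiring the first component at all.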
In the process of the part (i), the CPU 10 functions as the calibrating data acquisition unit. In the process of the part (ii), the CPU 10 functions as the mixing coefficient calculation unit. In addition, instead of the configuration of the part (i), the calibrating data acquisition unit may be configured to acquire the independent component Yk, from a storage unit such as the hard disk drive 30 in which the k-th row element (independent component) Yk corresponding to the target component order k from the independent component matrix Y is previously stored. In a case of using the inner product, only the independent component corresponding to the target component is necessary, and the other independent components are not necessary. In this case, the independent component is a vector, and it is not necessary to store the target component order.
In the embodiments and the modification example described above, the target component is set as glucose in the blood, but instead of glucose, other components in the blood, for example, oxyhemoglobin, deoxyhemoglobin, or the like may be used.
In the embodiments and the modification examples described above, the mixing coefficient estimation step has a configuration of acquiring the independent component matrix, acquiring the estimated mixing matrix, and extracting the mixing coefficient corresponding to the target component from the estimated mixing matrix, but it is not necessary to have this configuration. That is, any configuration can be used, as long as it is the configuration in which each independent component included in the observation data of each sample when separating the observation data into the plurality of the independent components is assumed, and the mixing coefficient corresponding to the target component is acquired for each sample, based on each independent component.
In the calibration curve creating method of the embodiments and the modification examples described above, it is configured to measure the content of the target component of the sample, but instead of this, the sample with known content of the target component may be prepared and the content may be input from a keyboard or the like.
In the embodiments and the modification examples described above, the number of elements m of the spectra S of the unknown component is empirically and experimentally determined in advance, but the number of elements m of the spectra S of the unknown component may be determined by minimum description length (MDL) or an information criterion such as the Akaike information criterion (AIC). In a case of using the MDL or the like, the number of elements m of the spectra S of the unknown component can be automatically determined by the arithmetic operation from the observation data of the sample. The MDL is, for example, described in “Independent component analysis for noisy data—MEG data analysis, 2000”.
In the embodiments and the modification examples described above, the test object which is a target of the calibration process is configured with the same component as the sample used when creating the calibration curve, but in a case of acquiring the mixing coefficient using the inner product as in Modification Example 1, unknown components other than the same component as in the sample used when creating the calibration curve may be included in the test object. This is because, since the inner products between the independent components are assumed to be 0, the inner products between the independent components corresponding to the unknown components are also considered to be 0, and the effect of the unknown components can be ignored in a case of acquiring the mixing coefficient with the inner product.
For the computer used in the embodiments and the modification examples described above, a dedicated apparatus can be used instead of the personal computer. For example, the personal computer which implements the calibrating method of the target component can be configured as a dedicated calibration apparatus.
In the embodiments described above, the input of the spectrum of spectral reflectance of the sample or the test object is performed by inputting the spectrum measured by the spectroscopic measurement instrument, but the invention is not limited thereto. For example, the optical spectrum may be estimated from a plurality of band images having different wavelength bands and the optical spectrum may be input. The band images are obtained, for example, by imaging the sample or the test object with a multiband camera including a filter capable of changing transmission wavelength bands.
In the embodiments, for the blood component measuring apparatus as the spectroscopic measurement instrument, a member having a probe shape as a point of contact with the living body is used, but this is not necessarily limited thereto, and the following configuration may be used.
A sensor module 750 is provided on the rear surface of the main body case 712 so as to come in contact with a skin surface of a person. The sensor module 750 is a measuring device which emits measurement light to the skin surface and receives reflected or transmitted light thereof, and is a thin image sensor with a built-in light source.
The blood component measuring apparatus 700 having the configuration described above, first causes all of the light emitting elements 752 of the sensor module 750 to emit light simultaneously, to emit light to the entire area of the measurement part of the person. Then, the entire area of the measurement part is imaged using all of the light receiving elements 754. Next, a blood vessel part suitable for the spectral measurement is selected from the captured image by a control device (not shown) provided in the blood component measuring apparatus 700, the light emission of measurement light is performed with respect to the blood vessel part by the light emitting elements 752, and the diffusely-reflected light from the blood vessel part is received by the light receiving elements 754. The “blood vessel part suitable for the spectral measurement” is a part of a thick blood vessel, a branched part of a blood vessel, a joining part thereof, or the like. The diffusely-reflected light received by the light receiving elements 754 is transferred to a spectral element (not shown) and is dispersed. By doing so, the absorbance spectrum is measured. According to the blood component measuring apparatus 700 of Modification Example 10 having such a configuration, since the blood vessel part suitable for the measurement of the spectral data is selected and measured, it is possible to increase accuracy of the independent component analysis or accuracy of calibration using that.
In the embodiments and Modification Examples 9 and 10, the light including the near-infrared light is emitted to the living body to obtain the absorbance spectrum from the diffusely-reflected light thereof, but instead of this, the absorbance spectrum may be obtained from transmitted light which has passed through the inside of the living body. That is, as shown in
In the embodiments and the modification examples described above, the light source included in the blood component measuring apparatus is a xenon flash tube, but this is not necessarily limited thereto, and a tungsten lamp, a halogen lamp, or a laser light source may be used, for example. In addition, the blood component measuring apparatus may be configured with a Raman spectroscopic measurement instrument which uses a laser light source as the light source, and includes a Rayleigh light removing filter for cutting Rayleigh scattering light, a spectral element which disperses Raman scattering light transmitted through the filter, and a light receiving element. According to this configuration, it is possible to perform measurement with respect to the target component with higher sensitivity and to further improve the measurement accuracy.
In the embodiments and the modification examples, the functions implemented by the software may be implemented by the hardware.
Among the configuration elements of the embodiments and the modification examples, the elements other than the elements disclosed in independent claims are additional elements and may be suitably omitted.
Number | Date | Country | Kind |
---|---|---|---|
2013-149741 | Jul 2013 | JP | national |