CALIBRATION CURVE CREATING METHOD AND APPARATUS FOR THE SAME, AND BLOOD COMPONENT CALIBRATION APPARATUS

Information

  • Patent Application
  • 20150025340
  • Publication Number
    20150025340
  • Date Filed
    July 17, 2014
    10 years ago
  • Date Published
    January 22, 2015
    9 years ago
Abstract
A calibration curve creating method includes: (a) acquiring observation data of a plurality of samples of a living body, when near infrared light is emitted to the living body and an absorbance spectrum obtained from transmitted light or diffusely-reflected light thereof is set as the observation data; (b) acquiring content of a target component of each sample; (c) estimating a plurality of independent components when the observation data of each sample is separated into the plurality of independent components, and acquiring a mixing coefficient corresponding to the target component for each sample; and (d) acquiring a regression equation of a calibration curve. (c) includes acquiring an independent component matrix by performing processes of normalization, whitening, and independent component analysis of the observation data, and the normalization is performed after a process performed by project on null space.
Description

This application claims the benefit of Japanese Patent Application No. 2013-149741, filed on Jul. 18, 2013. The content of the aforementioned application is incorporated herein by reference in its entirety.


BACKGROUND

1. Technical Field


The present invention relates to a technology of creating a calibration curve used in acquiring content of a target component in blood, and a technology of acquiring the content of the target component in blood.


2. Related Art


In the related art, there is provided a method of performing independent component analysis of observation data of a test object which is obtained by observing at a plurality of different positions, setting an independent component calculated by the independent component analysis as a fundamental function, and representing the observation data as a linear sum of the fundamental function, to analyze concentration or the like of a target component (see JP-A-2007-44104). According to the method disclosed in JP-A-2007-44104, it is possible to acquire oxyhemoglobin concentration and deoxyhemoglobin concentration in blood, from non-invasive observation data.


However, in the technology of the related art, a plurality of different observation data items of the test object are necessary every time when performing calibration of a target component of the test object, and it is difficult to perform the calibration with high accuracy from one observation data item. In addition, since the test object is a living body, acquiring the plurality of observation data items causes burden on a test subject, and solving of the aforementioned problems is necessary.


Various noise may be included in the observation data. In this case, accuracy of the independent component analysis or the calibration using that may be degraded.


Further, the observation data may vary depending on the test object, due to unevenness of a composition or a structure of the test object. In such a case, the accuracy of the independent component analysis or the calibration using that may also be degraded.


SUMMARY

An advantage of some aspects of the invention is to allow highly precise calibration of a test object from one observation data item, when performing calibration of a target component in blood for the test object.


The invention can be implemented as the following forms or application examples.


Application Example 1

This application example is directed to a calibration curve creating method of creating a calibration curve used in acquiring content of a target component which is a specific component in blood from observation data of a living body which is a test object, the method including: (a) causing a computer to acquire the observation data of a plurality of samples of a living body, when near infrared light having a wavelength of 800 nm to 1300 nm is emitted to the living body and an absorbance spectrum obtained from transmitted light or diffusely-reflected light thereof is set as the observation data; (b) causing the computer to acquire content of the target component of each sample; (c) causing the computer to estimate a plurality of independent components when the observation data of each sample is separated into the plurality of independent components, and to acquire a mixing coefficient corresponding to the target component for each sample based on the plurality of independent components; and (d) causing the computer to acquire a regression equation of the calibration curve based on the content of the target component of the plurality of samples and the mixing coefficient for each sample, in which (i) causing the computer to acquire an independent component matrix including the independent component of each sample, (ii) causing the computer to acquire an estimated mixing matrix showing a vector set for regulating a ratio of an independent component element of each independent component in each sample, from the independent component matrix, and (iii) causing the computer to acquire a correlation of content of the target component of the plurality of samples, for each vector included in the estimated mixing matrix, and to select the vector which is determined to have a highest correlation, as a mixing coefficient corresponding to the target component, are included in (c), and in (i), the computer acquires the independent component matrix by performing a first preprocessing including normalization of the observation data, a second preprocessing including whitening, and an independent component analysis process in this order, and the computer performs normalization after a process performed by project on null space in the first preprocessing.


According to the calibration curve creating method of Application Example 1, a calibration curve for acquiring the content of the target component which is the specific component in the blood from the observation data of the living body which is the test object, is created from the observation data acquired from each sample and the content of the target component of the plurality of samples of the living body. Accordingly, it is possible to acquire the content of the target component with high accuracy by using this calibration curve, even when one observation data item of the living body which is the test object is used. Therefore, if the calibration curve is previously created according to the calibration curve creating method of Application Example 1, it is only necessary to acquire one observation data item of the living body which is the test object when performing calibration. As a result, it is possible to acquire a target component amount in the blood from one observation data item which is an actually-measured value, with high accuracy. In particular, according to the calibration curve creating method of Application Example 1, it is possible to allow the light to reach a blood vessel, by setting light to be emitted to the living body as near-infrared light having a wavelength of 800 nm to 1300 nm. Accordingly, it is possible to obtain observation data in which an effect of a blood component is directly reflected, and to improve calibration accuracy. In addition, the estimated mixing matrix is acquired and the vector having a high correlation with respect to the content of the target component of the sample is extracted from the estimated mixing matrix, and therefore it is possible to obtain a mixing coefficient having high estimation accuracy. Further, since the process performed by the project on null space is performed in the first preprocessing, it is possible to decrease an effect of baseline variation included in the observation data to improve calibration accuracy.


Application Example 2

This application example is directed to the calibration curve creating method according to Application Example 1, wherein the computer performs whitening by factor analysis in the second preprocessing.


In this method, since the whitening by factor analysis is performed in the second preprocessing, it is possible to decrease an effect of noise (particularly, random noise) included in the observation data to improve calibration accuracy.


Application Example 3

This application example is directed to the calibration curve creating method according to Application Example 1 or 2, wherein the computer uses β divergence as an independence index of the independent component analysis process.


In this method, since β divergence is used as the independence index of the independent component analysis process, it is possible to decrease an effect of an outlier such as spike noise included in the observation data to improve calibration accuracy.


Application Example 4

This application example is directed to a calibration curve creation apparatus which creates a calibration curve used in acquiring content of a target component which is a specific component in blood from observation data of a living body which is a test object, the apparatus including: a sample observation data acquisition unit which acquires the observation data of a plurality of samples of a living body, when near infrared light having a wavelength of 800 nm to 1300 nm is emitted to the living body and an absorbance spectrum obtained from transmitted light or diffusely-reflected light thereof is set as the observation data; a sample target component amount acquisition unit which acquires content of the target component of each sample; a mixing coefficient estimation unit which estimates a plurality of independent components when the observation data of each sample is separated into the plurality of independent components, and acquires a mixing coefficient corresponding to the target component for each sample based on the plurality of independent components; and a regression equation calculation unit which acquires a regression equation of the calibration curve based on the content of the target component of the plurality of samples and the mixing coefficient for each sample, in which the mixing coefficient estimation unit includes an independent component matrix calculation unit which acquires an independent component matrix including each independent component of each sample, an estimated mixing matrix calculation unit which acquires an estimated mixing matrix showing a vector set for regulating a ratio of an independent component element of each independent component in each sample, from the independent component matrix, and a mixing coefficient selection unit which acquires a correlation of content of the target component of the plurality of samples, for each vector included in the estimated mixing matrix, and selects the vector which is determined to have a highest correlation, as a mixing coefficient corresponding to the target component, the independent component matrix calculation unit acquires the independent component matrix by performing a first preprocessing including normalization of the observation data, a second preprocessing including whitening, and an independent component analysis process in this order, and the independent component matrix calculation unit performs normalization after a process performed by project on null space in the first preprocessing.


According to the calibration curve creation apparatus of Application Example 4, in the same manner as the calibration curve creating method according to Application Example 1, it is only necessary to acquire one observation data item of the living body which is the test object when performing the calibration. Accordingly, it is possible to acquire the target component amount from one observation data item which is an actually-measured value, with high accuracy. In addition, since the process performed by the project on null space is performed in the first preprocessing, it is possible to decrease an effect of baseline variation included in the observation data to improve calibration accuracy.


Application Example 5

This application example is directed to the calibration curve creation apparatus according to Application Example 4, wherein the independent component matrix calculation unit performs whitening by factor analysis in the second preprocessing.


In this apparatus, since the whitening by factor analysis is performed in the second preprocessing, it is possible to decrease an effect of noise (particularly, random noise) included in the observation data to improve calibration accuracy.


Application Example 6

This application example is directed to the calibration curve creation apparatus according to Application Example 4 or 5, wherein the independent component matrix calculation unit uses β divergence as an independence index of the independent component analysis process.


In this apparatus, since β divergence is used as the independence index of the independent component analysis process, it is possible to decrease an effect of an outlier such as spike noise included in the observation data to improve calibration accuracy.


Application Example 7

This application example is directed to the calibration curve creation apparatus according to any one of Application Examples 4 to 6, which further includes a storage unit which stores the independent component matrix calculated by the independent component matrix calculation unit, a target component order which shows a position of the mixing coefficient selected by the mixing coefficient selection unit in the estimated mixing matrix, and the regression equation calculated by the regression equation calculation unit.


According to this configuration, in the calibration curve creation apparatus, it is possible to store the independent component matrix, the target component order, and the regression equation in the storage unit.


Application Example 8

This application example is directed to a blood component calibration apparatus which acquires content of a target component which is a specific component in blood for a living body which is a test object, the apparatus including: a test object observation data acquisition unit which emits near infrared light having a wavelength of 800 nm to 1300 nm to the test object, and acquires an absorbance spectrum obtained from transmitted light or diffusely-reflected light thereof as observation data; a calibrating data acquisition unit which acquires calibrating data including at least an independent component corresponding to the target component; a mixing coefficient calculation unit which acquires a mixing coefficient with respect to the target component of the test object, based on the observation data of the test object and the calibrating data; and a target component amount calculation unit which calculates content of the target component based on a constant of a regression equation showing a relationship between the mixing coefficient and the content corresponding to the target component, and the mixing coefficient acquired by the mixing coefficient calculation unit which are prepared in advance, in which the mixing coefficient calculation unit performs a first preprocessing including normalization of the observation data and a second preprocessing including whitening in this order, and performs normalization after a process performed by project on null space in the first preprocessing.


According to this blood component calibration apparatus, although only one observation data item is acquired for the living body which is the test object, it is possible to acquire the content of the target component for the test object with high accuracy. Since the process performed by the project on null space is performed in the first preprocessing, it is possible to decrease an effect of baseline variation included in the observation data to improve calibration accuracy.


Application Example 9

This application example is directed to the blood component calibration apparatus according to Application Example 8, wherein the mixing coefficient calculation unit performs whitening by factor analysis in the second preprocessing.


In this apparatus, since the whitening by factor analysis is performed in the second preprocessing, it is possible to decrease an effect of noise (particularly, random noise) included in the observation data to improve calibration accuracy.


Application Example 10

This application example is directed to the blood component calibration apparatus according to Application Example 8 or 9, wherein the calibrating data acquisition unit acquires the independent component which is previously acquired as a component corresponding to the target component, as the calibrating data, and the mixing coefficient calculation unit acquires an inner product of the independent component and the observation data of the test object, and set the inner product as the mixing coefficient.


According to this blood component calibration apparatus, it is possible to easily acquire a mixing coefficient having a high correlation with the target component of the test object with high accuracy.


Application Example 11

This application example is directed to the blood component calibration apparatus according to Application Example 8 or 9, wherein the calibrating data acquisition unit acquires the plurality of independent components when each observation data item of the plurality of samples is separated into the plurality of independent components, as the calibration data, and the mixing coefficient calculation unit calculates an estimated mixing matrix of the test object based on the observation data and the plurality of independent components of the test object, and extracts a mixing coefficient corresponding to the target component from the calculated estimated mixing matrix.


According to this blood component calibration apparatus, it is possible to acquire a mixing coefficient having a high correlation with the target component of the test object with high accuracy.


Further, the invention can be implemented in the following various aspects, and for example, can be implemented as an aspect of the blood component calibration apparatus which stores the regression equation acquired by the calibration curve creating method in a memory, an aspect of a computer program which implements the configuration of each unit included in the blood component calibration apparatus as a function, an aspect of the computer program or a non-transitory storage medium in which the computer program is recorded, or the like.





BRIEF DESCRIPTION OF THE DRAWINGS

The invention will be described with reference to the accompanying drawings, wherein like numbers reference like elements.



FIG. 1A is a flowchart showing a calibration curve creating method as a first embodiment of the invention.



FIG. 1B is an explanatory diagram showing a schematic configuration of a blood component measuring apparatus.



FIG. 2 is a graph showing a relationship between a wavelength of light for a living body and an absorbance spectrum.



FIG. 3A is an explanatory diagram showing a personal computer and peripheral devices used in a step 4 and a step 5.



FIG. 3B is a functional block diagram of a device used in a step 4 and a step 5.



FIG. 3C is a functional block diagram showing an example of an internal configuration of an independent component matrix calculation unit.



FIG. 4 is an explanatory diagram schematically showing a measurement data set stored in a hard disk drive.



FIG. 5 is a flowchart showing a mixing coefficient estimation process performed by a CPU.



FIG. 6 is an explanatory diagram for illustrating an estimated mixing matrix ̂A.



FIG. 7 is an explanatory diagram showing an example of a scatter diagram having a high correlation.



FIG. 8 is an explanatory diagram showing an example of a scatter diagram having a low correlation.



FIG. 9 is a flowchart showing a calculation process of a regression equation performed by a CPU of a computer.



FIG. 10 is a functional block diagram of a device used when performing calibration of a target component.



FIG. 11 is a flowchart showing a target component calibration process performed by a CPU of a computer.



FIG. 12A is a diagram showing calibration accuracy of independent component analysis (algorithm is SNV+PCA+kurtosis) relating to absorbance of a mixture of sucrose and salt.



FIG. 12B is a diagram showing calibration accuracy of independent component analysis (algorithm is SNV+FA+kurtosis) relating to absorbance of a mixture of sucrose and salt.



FIG. 12C is a diagram showing calibration accuracy of independent component analysis (algorithm is PNS+PCA+kurtosis) relating to absorbance of a mixture of sucrose and salt.



FIG. 12D is a diagram showing calibration accuracy of independent component analysis (algorithm is SNV+PCA+β divergence) relating to absorbance of a mixture of sucrose and salt.



FIG. 12E is a diagram showing calibration accuracy of independent component analysis (algorithm is PNS+PCA+β divergence) relating to absorbance of a mixture of sucrose and salt.



FIG. 12F is a diagram showing calibration accuracy of independent component analysis (algorithm is PNS+FA+kurtosis) relating to absorbance of a mixture of sucrose and salt.



FIG. 12G is a diagram showing calibration accuracy of independent component analysis (algorithm is SNV+FA+β divergence) relating to absorbance of a mixture of sucrose and salt.



FIG. 12H is a diagram showing calibration accuracy of independent component analysis (algorithm is PNS+FA+β divergence) relating to absorbance of a mixture of sucrose and salt.



FIG. 13 is a diagram showing comparison of calibration accuracy of FIG. 12A to FIG. 12H.



FIG. 14A is a diagram showing calibration accuracy of independent component analysis (algorithm is SNV+PCA+kurtosis) relating to a mixed signal of voices of people.



FIG. 14B is a diagram showing calibration accuracy of independent component analysis (algorithm is SNV+FA+kurtosis) relating to a mixed signal of voices of people.



FIG. 14C is a diagram showing calibration accuracy of independent component analysis (algorithm is PNS+PCA+kurtosis) relating to a mixed signal of voices of people.



FIG. 14D is a diagram showing calibration accuracy of independent component analysis (algorithm is SNV+PCA+β divergence) relating to a mixed signal of voices of people.



FIG. 14E is a diagram showing calibration accuracy of independent component analysis (algorithm is PNS+PCA+β divergence) relating to a mixed signal of voices of people.



FIG. 14F is a diagram showing calibration accuracy of independent component analysis (algorithm is PNS+FA+kurtosis) relating to a mixed signal of voices of people.



FIG. 14G is a diagram showing calibration accuracy of independent component analysis (algorithm is SNV+FA+β divergence) relating to a mixed signal of voices of people.



FIG. 14H is a diagram showing calibration accuracy of independent component analysis (algorithm is PNS+FA+β divergence) relating to a mixed signal of voices of people.



FIG. 15 is a diagram showing comparison of calibration accuracy of FIG. 14A to FIG. 14H.



FIG. 16A is a diagram showing calibration accuracy of independent component analysis (algorithm is SNV+PCA+kurtosis) relating to a signal obtained by adding Gaussian noise to a mixed signal of voices of people.



FIG. 16B is a diagram showing calibration accuracy of independent component analysis (algorithm is SNV+FA+kurtosis) relating to a signal obtained by adding Gaussian noise to a mixed signal of voices of people.



FIG. 16C is a diagram showing calibration accuracy of independent component analysis (algorithm is SNV+PCA+kurtosis) relating to a signal obtained by adding baseline variation to a mixed signal of voices of people.



FIG. 16D is a diagram showing calibration accuracy of independent component analysis (algorithm is PNS+PCA+kurtosis) relating to a signal obtained by adding baseline variation to a mixed signal of voices of people.



FIG. 16E is a diagram showing calibration accuracy of independent component analysis (algorithm is SNV+PCA+kurtosis) relating to a signal obtained by adding spike noise to a mixed signal of voices of people.



FIG. 16F is a diagram showing calibration accuracy of independent component analysis (algorithm is SNV+PCA+R divergence) relating to a signal obtained by adding spike noise to a mixed signal of voices of people.



FIG. 17 is a diagram showing comparison of calibration accuracy of FIG. 16A to FIG. 16F.



FIG. 18 is an explanatory diagram showing a measuring attachment included in a blood component measuring apparatus of Modification Example 9.



FIGS. 19A to 19C are explanatory diagrams showing a blood component measuring apparatus of Modification Example 10.



FIG. 20 is an explanatory diagram showing a blood component measuring apparatus of Modification Example 11.





DESCRIPTION OF EXEMPLARY EMBODIMENTS

Hereinafter, embodiments of the invention will be described in the following order.


A. Calibration Curve Creating Method


B. Calibrating Method of Target Component


C. Various Algorithms and Effect Thereof on Calibration Accuracy


D. Modification Examples


The following terms will be used in the description of the embodiments of the invention.


ICA: Independent Component Analysis


SNV: Standard Normal Variate Transformation


PNS: Project on Null Space


PCA: Principal Components Analysis


FA: Factor Analysis


Hereinafter, a first embodiment of the invention will be described. The first embodiment relates to a method of creating a calibration curve for acquiring glucose concentration in blood (blood sugar level) from an absorbance spectrum (=absorption spectrum) obtained from a human body, that is, a living body of a person as observation data.


A. CALIBRATION CURVE CREATING METHOD


FIG. 1A is a flowchart showing a calibration curve creating method as the first embodiment of the invention. As shown in the drawing, the calibration curve creating method is configured with five steps from a step 1 to a step 5. The steps 1 to 5 are performed in this order. The steps 1 to 5 will be described in order.


Step 1

The step 1 is a preparation step and is performed by an operator. The operator first prepares multiple people who will be living body samples. Herein, the prepared number of people is set as n people (n is an integer equal to or larger than 2).


Step 2

A step 2 is a measurement step of a spectrum, and is performed by the operator using a blood component measuring apparatus.



FIG. 1B is an explanatory diagram showing a schematic configuration of the blood component measuring apparatus. As shown in the drawing, a blood component measuring apparatus 200 is a so-called spectroscopic measurement instrument, and includes alight emitting unit 210, a light receiving unit 220, and a driving circuit 230. The light emitting unit 210 includes a light source (for example, xenon flash tube) 212 and a light emitting probe 214, and emits white light towards a living body BD as a living body sample from the light emitting probe 214. The light emitting probe 214 performs emission of white light towards the living body BD in a noninvasive manner. An emission position thereof is a position in which a blood vessel BV can be significantly irradiated, for example, an inner part of an elbow. The emission position thereof may be other parts such as a fingertip, an inner part of a wrist, and the like, instead of the inner part of the elbow. The “noninvasive manner” means it is not necessary to cut the living body.


The light receiving unit 220 includes a light receiving probe 222, a spectral element 224, and a light receiving element 226. The light receiving probe 222 is provided at a position facing the living body BD in the same manner as the light emitting probe 214 of the light emitting unit 210. The white light emitted by the light emitting probe 214 passes through the blood vessel BV in a flow path LP which is a so-called “banana shape” and reaches the light receiving probe 222. The light which reaches the light receiving probe 222 can be called “diffusely-reflected light” as it is light returning from (reflected by) an inner part of the living body BD, and the light receiving probe 222 receives the diffusely-reflected light from the inner part of the living body BD and transfers the light to the spectral element 224. The spectral element 224 is a spectral element using a Fabry-Perot type filter, a grating, a liquid crystal tunable filter, an acoustical engineering variable wavelength filter, or the like. The spectral element 224 disperses light by allowing selective transmission of light having a wavelength in accordance with a control command from the driving circuit 230. The driving circuit 230 performs driving control of the spectral element 224 so that light having a plurality of different wavelengths in a range of the wavelengths of 800 nm to 1300 nm passes through. By receiving a spectrum of light which exits from the spectral element 224 by the light receiving element 226, the blood component measuring apparatus 200 measures light intensity of the light having the plurality of wavelengths in a range of the wavelengths of 800 nm to 1300 nm, that is, a spectrum of spectral reflectance. The light receiving element 226 is, for example, a CCD, a CMOS, an InGaAs photodiode, or the like.


That is, the blood component measuring apparatus 200 emits light including near infrared light having a wavelength of 800 nm to 1300 nm (white light in a case of the first embodiment) to the living body BD and measures the spectrum of the spectral reflectance obtained from the diffusely-reflected light thereof. The operator images each of the plurality of living body samples prepared in the step 1 with the blood component measuring apparatus 200, to measure the spectrum of the spectral reflectance for each living body sample. A region of the wavelengths of 800 nm to 1300 nm is a so-called “biological window”, and has high transmittance with respect to the living body. Accordingly, the light having this wavelength easily reaches the blood vessel BV, and it is optimal for acquisition of signals in which an effect of a blood component is directly reflected.


A relationship represented by the following formula (1) is satisfied between the spectrum of the spectral reflectance and the absorbance spectrum.





[Absorbance]=−log10[Reflectance]  (1)


Accordingly, the spectrum of the spectral reflectance measured by the blood component measuring apparatus 200 is converted into the absorbance spectrum using the formula (1). The conversion into the absorbance is performed because it is necessary to satisfy linear combination in a mixed signal analyzed in independent component analysis which will be described later and the linear combination is satisfied for the absorbance from the Lambert-Beer law.


In the step 2, the absorbance spectrum may be measured instead of the spectrum of the spectral reflectance. As a measured result, data of absorbance distribution showing a property of a test object with respect to the wavelength is output. The absorbance spectrum obtained as described above is also called “spectral data”, hereinafter.



FIG. 2 is a graph showing a relationship between the wavelength of the light for the living body and the absorbance spectrum. As shown in the drawing, a peak of the absorbance exists in the range of the wavelengths of 800 nm to 1300 nm. This is because light absorption by glucose occurs with a peak in this wavelength region. Since the absorbance changes as the amount of glucose changes, the amount of glucose changes a spectrum waveform. Accordingly, in the step 2, the absorbance spectrum for each living body sample is obtained.


In addition, instead of measuring the spectral reflectance spectrum and the absorbance spectrum with the blood component measuring apparatus as a spectroscopic measurement instrument, the spectra may be estimated from other measured values. For example, the living body sample may be measured with a multiband camera to estimate the spectral reflectance or absorbance spectrum from the obtained multiband image. As such an estimating method, a method disclosed in JP-A-2001-99710 can be used.


Step 3

The embodiment will be described by returning to FIG. 1A. A step 3 is a measuring step of the glucose amount and is performed by the operator. The operator performs drawing of blood from each of the plurality of living body samples prepared in the step 1 to perform chemical analysis of the blood, and measures the glucose amount which is a content of the target component of each living body sample. Specifically, glucose which is the target component is extracted from the blood obtained by drawing of blood from each living body sample, and the amount (or concentration) thereof is measured. In addition, a position of the drawing of blood may be any part of the living body sample, but the position thereof preferably coincides with the part on which the spectrum is measured in the step 2.


Step 4

A step 4 is an estimation step of a mixing coefficient and is performed using a personal computer. FIG. 3A is an explanatory diagram showing a personal computer 100 and peripheral devices used in the step 4 and a step 5 which will be described later. As shown in the drawing, the personal computer (hereinafter, simply referred to as a “computer”) 100 is electrically connected to the blood component measuring apparatus 200 and a keyboard 300.


The computer 100 is a well-known device which includes a CPU 10 which performs various processes or control by executing a computer program (hereinafter, simply referred to as a “program”), a memory 20 (storage unit) which is a data saving location, a hard disk drive 30 which stores program, data, or information, an input interface (I/F) 50, and an output interface (I/F) 60.



FIG. 3B is a functional block diagram of a device used in the step 4 and the step 5. This device 400 includes a sample observation data acquisition unit 410, a sample target component amount acquisition unit 420, a mixing coefficient estimation unit 430, and a regression equation calculation unit 440. The mixing coefficient estimation unit 430 includes an independent component matrix calculation unit 432, an estimated mixing matrix calculation unit 434, and a mixing coefficient selection unit 436. The sample observation data acquisition unit 410 and the sample target component amount acquisition unit 420 are implemented by the CPU 10 of FIG. 3A in cooperation with the input I/F 50 and the memory 20, for example. The mixing coefficient estimation unit 430, the independent component matrix calculation unit 432, the estimated mixing matrix calculation unit 434, and the mixing coefficient selection unit 436 are implemented by the CPU 10 of FIG. 3A in cooperation with the memory 20, for example. The regression equation calculation unit 440 is implemented by the CPU 10 of FIG. 3A in cooperation with the memory 20, for example. These units can also be implemented by another specific apparatus or a hardware circuit other than the personal computer shown in FIG. 3A.



FIG. 3C is a functional block diagram showing an example of an internal configuration of the independent component matrix calculation unit 432. The independent component matrix calculation unit 432 includes a first preprocessing unit 450, a second preprocessing unit 460, and an independent component analysis processing unit 470. The three processing units 450, 460, and 470 acquire an independent component matrix (which will be described later) by processing data to be processed (absorbance spectrum in the first embodiment) in this order. Processing content of each unit will be described later.


The blood component measuring apparatus 200 shown in FIG. 3A is the unit shown in FIG. 1B which is used in the step 2. The computer 100 acquires the absorbance spectrum obtained from a spectral distribution measured by the blood component measuring apparatus 200 in the step 2, through the input I/F 50 as spectral data (corresponding to the sample observation data acquisition unit 410 of FIG. 3B). The computer 100 acquires the glucose amount measured in the step 3 through the input I/F 50 through manipulation of the keyboard 300 by the operator (corresponding to the sample target component amount acquisition unit 420 of FIG. 3B). The glucose amount measured in the step 3 may be input to the computer 100 as a glucose concentration which represents the amount of glucose in mg contained in 100 ml of the blood. Alternatively, the glucose amount may be input as an absolute value of mass.


As a result of acquisition of the spectral data and the glucose amount, a data set (hereinafter, referred to as a “measurement data set”) DS1 including the spectral data and the glucose amount is stored in the hard disk drive 30 of the computer 100.



FIG. 4 is an explanatory diagram schematically showing the measurement data set DS1 stored in the hard disk drive 30. As shown in the drawing, the measurement data set DS1 has a data structure including sample numbers B1, B2, . . . , Bn for identifying the plurality of living body samples (hereinafter, simply referred to as “samples”) prepared in the step 1, glucose amounts C1, C2, . . . , Cn of each sample, and spectral data items X1, X2, . . . , Xn of each sample. In the measurement data set DS1, the sample numbers B1, B2, . . . , Bn are given to the glucose amounts C1, C2, . . . , Cn, and spectral data items X1, X2, . . . , Xn, so as to determine which sample the amount and the data item are for.


The CPU 10 performs a process of estimating the mixing coefficient which is an operation of the step 4, by loading a predetermined program stored in the hard disk drive 30 into the memory 20 and executing the program. Herein, the predetermined program can be downloaded using a network such as the Internet from outside. In the step 4, the CPU 10 functions as the mixing coefficient estimation unit 430 of FIG. 3B.



FIG. 5 is a flowchart showing a mixing coefficient estimation process performed by the CPU 10. When the process is started, the CPU 10, first, performs the independent component analysis (Step S110).


Independent component analysis (ICA) is one of multi-dimensional signal analysis methods, and is a technology of measuring a mixed signal on which an independent signal is superimposed, under several different conditions, and separating an independent original signal from the mixed signal based thereon. If independent component analysis is used, by recognizing the spectral data obtained by the step 2 as data in which m independent components (unknowns) such as glucose are mixed with each other, the spectra of the independent components can be estimated from the spectral data (observation data) obtained by the step 2.


In the first embodiment, the independent component analysis is performed by performing the process by the three processing units 450, 460, and 470 shown in FIG. 3C in this order. The first preprocessing unit 450 can perform preprocessing using one or both of standard normal variate transformation (SNV) 452 and project on null space (PNS) 454. The SNV 452 is a process of subtracting an average value of data to be processed and dividing a resultant value by a standard deviation thereof, to obtain normalized data in which an average value is 0 and the standard deviation is 1. The PNS 454 is a process for removing baseline variation included in the data to be processed. In measurement of the spectrum, variation between data items, called baseline variation, such as an increase or a decrease of the average value of the data for each measurement data item occurs due to various reasons. Accordingly, it is preferable to remove the reasons for the variation before performing the independent component analysis process. The PNS can be used as preprocessing which can remove the arbitrary baseline variation. In addition, the PNS is, for example, illustrated in “Extracting Chemical Information from Spectral Data with Multiplicative Light Scattering Effects by Optical Path-Length Estimation and Correction”, Zeng-Ping Chen, Julian Morris, and Elaine Martin, 2006.


It is not necessary to perform the process by the PNS 454 in a case of performing the SNV 452 with respect to the spectral data obtained in the step 2 of FIG. 1A. On the other hand, in a case of performing the PNS 454, it is preferable to perform any normalization process (for example, SNV 452) after that.


A process other than the SNV or the PNS may be performed as the first preprocessing. In the first preprocessing, it is preferable to perform any normalization process, but the normalization process may be omitted. Hereinafter, the first preprocessing unit 450 is also called a “normalization processing unit”. The content of the two processes 452 and 454 will be further described. The first preprocessing can also be omitted in a case where the data to be processed which is applied to the independent component matrix calculation unit 432 is normalized data.


The second preprocessing unit 460 can perform preprocessing using any one of principal components analysis (PCA) 462 and factor analysis (FA) 464. A process other than PCA or FA may be used as the second preprocessing. Hereinafter, the second preprocessing unit 460 is also called a “whitening processing unit”. In the typical ICA method, dimensional compression of the data to be processed and non-correlating are performed, as the second preprocessing. Since a transformation matrix to be acquired in the ICA is limited to an orthogonal transformation matrix by the second preprocessing, it is possible to decrease computational complexity of the ICA. Such a second preprocessing is called “whitening” and PCA is used in many cases. However, in a case where random noise is included in the data to be processed, the PCA may be affected by an effect thereof, and accordingly error may be generated in a result. Herein, in order to decrease the effect of the random noise, it is preferable to perform the whitening using the FA having robustness with respect to the noise, instead of the PCA. The second preprocessing unit 460 of FIG. 3C can perform the whitening by any one of the PCA and the FA being selected. The content of the two processes 462 and 464 will be further described later. In addition, the whitening process may be omitted.


The independent component analysis processing unit (ICA processing unit) 470 performs the ICA with respect to the spectral data which is subjected to the first preprocessing and the second preprocessing, to estimate a spectrum of the independent component. The ICA processing unit 470 can perform analysis using any one of the first processing 472 which uses a kurtosis as an independence index, and the second processing 474 which uses β divergence as an independence index. As an index for separating the independent components, the ICA generally uses higher order statistics representing independence between the separated data items as the independence index. The kurtosis is a typical independence index. However, in a case where an outlier such as spike noise is included in the data to be processed, statistics including the outlier are calculated as the independence index. Therefore, an error may be generated between original statistics and the calculated statistics of the data to be processed, and this may cause a decrease in separation accuracy. Herein, in order to decrease an effect from the outlier in the data to be processed, it is preferable to use the independence index which is hardly affected by the effect thereof. β divergence can be used as the independence index having such properties. The content of the kurtosis and the β divergence will be further described later. An index other than the kurtosis or the β divergence may be used as the independence index of the ICA.


Next, typical processing content of the independent component analysis will be described in detail. Spectra S of m unknown components (sources) (hereinafter, this spectra may be simply referred to as “unknown components”) are assumed to be applied by a vector of the following formula (2), and n spectral data items X obtained by the step 2 are assumed to be applied by a vector of the following formula (3). In addition, each of elements (S1, S2, . . . , Sm) included in the formula (2) is set to be the vector (spectrum). That is, the element S1 is represented as a formula (4), for example. Elements (X1, X2, . . . , Xn) included in the formula (3) are also the vectors, and the element X1 is represented as a formula (5), for example. An index 1 is the number of wavelength bands in which the spectra are measured. The number of elements m of the spectra S of the unknown components is an integer equal to or larger than 1, and is empirically and experimentally determined in advance.






s=[s
1
,s
2
, . . . ,s
m]T  (2)






X=[X
1
,X
2
, . . . ,X
n]T  (3)






S
1
={S
11
,S
12
, . . . ,S
11}  (4)






X
1
={X
11
,X
12
, . . . ,X
11}  (5)


Each unknown component is assumed to be independent statistically. A relationship of the following formula (6) is satisfied between the unknown components S and the spectral data items X.






X=A·S  (6)


A of the formula (6) is the mixing matrix, and can also be represented by the following formula (7). Herein, it is necessary to show a latter “A” as a thick letter as shown in the formula (7), but it is shown as a normal letter in sentences due to limitation of letters of the specification. Hereinafter, other thick letters representing the matrix are shown as the normal letters, in the same manner.









A
=

(




a
11







a

1

m


















a

n





1








a
nm




)





(
7
)







A mixing coefficient aij included in the mixing matrix A represents a contribution degree of an unknown component Sj (j=1 to m) to spectral data Xi (i=1 to n) which is the observation data.


In a case where the mixing matrix A is known, a least squares solution of the unknown component S can be simply acquired as A+·X using a pseudo inverse matrix A+ of A, but in a case of the first embodiment, since the mixing matrix A is also unknown, it is necessary to estimate the unknown component S and the mixing matrix A only from the observation data X. That is, as shown in the following formula (8), a matrix (hereinafter, referred to as an “independent component matrix”) Y showing the spectrum of the independent component is calculated using a separation matrix W of m×n, only from the observation data X. As an algorithm for acquiring this separation matrix W of the following formula (8), various algorithms such as Infomax, Fast Independent Component Analysis (FastICA), Joint Approximate Diagonalization of Eigenmatrices (JADE), and the like can be used.






Y=W·X  (8)


The independent component matrix Y corresponds to an estimated value of the unknown component S. Accordingly, the following formula (9) can be obtained, and the following formula (10) can be obtained by transforming the formula (9).






X=·Y  (9)






Â=X·Y
+  (10)


Herein, ̂A is an estimated mixing matrix of A and Y+ is a pseudo inverse matrix of Y.


The estimated mixing matrix ̂A (denoted as this due to limitation of letters of the specification, but actually meaning the letter with attached symbol on left side of the formula (10), this applies to the other letters) obtained with the formula (10) can be represented by the following formula (11).










A
^

=

(





a
^

11








a
^


1

m



















a
^


n





1









a
^

nm




)





(
11
)







In Step S110 of FIG. 5, the CPU 10 performs the process up to a process of acquiring the separation matrix W described above. In detail, the spectral data X of each sample which is obtained in the step 2 and is previously stored in the hard disk drive 30 is input, and the separation matrix W is acquired based on this input, using any algorithm such as Infomax, FastICA, or JADE described above. As shown in FIG. 3C described above, it is preferable to perform the normalization process by the first preprocessing unit 450 and the whitening process by the second preprocessing unit 460, as the preprocessing of the independent component analysis.


After performing Step S110, the CPU 10 performs a process of calculating the independent component matrix Y, based on the separation matrix W and the spectral data X of each sample which is obtained in the step 2 and is previously stored in the hard disk drive 30 (Step S120). This calculation process is a process of performing an arithmetic operation according to the formula (8). In the processes of Steps 5110 and S120, the CPU 10 functions as the independent component matrix calculation unit 432 of FIG. 3B.


Next, the CPU 10 performs a process of calculating the estimated mixing matrix ̂A, based on the spectral data X of each sample which is previously stored in the hard disk drive 30, and the independent component matrix Y which is calculated in Step S120 (Step S130). This calculation process is a process of performing an arithmetic operation according to the formula (10).



FIG. 6 is an explanatory diagram for illustrating the estimated mixing matrix ̂A. A table TB shown in the drawing includes the sample numbers B1, B2, . . . , Bn in a vertical direction, and elements (hereinafter, referred to as “independent component elements”) Y1, Y2, . . . , Ym of the independent component matrix Y in a horizontal direction. An element in the table TB which is specified from the sample number Bi (i=1 to n) and the independent component element Yj (j=1 to m), is the same as a coefficient ̂aij included in the estimated mixing matrix ̂A (see the formula (11)). From the table TB, it is found that the coefficient ̂aij included in the estimated mixing matrix ̂A represents each ratio of the independent component elements Y1, Y2, . . . , Ym of each sample. A target component order k shown in FIG. 6 will be described later. In the process of Step S130, the CPU 10 functions as the estimated mixing matrix calculation unit 434 of FIG. 3B.


The estimated mixing matrix ̂A is obtained by the process up to Step S130. That is, the coefficients (estimated mixing coefficient) ̂aij included in the estimated mixing matrix ̂A are obtained. After that, the process proceeds to Step S140.


In Step S140, the CPU 10 acquires a correlation (degree of similarity) between the glucose amounts C1, C2, . . . , Cn measured in the step 3, and a component (hereinafter, referred to as a vector ̂α) of each row included in the estimated mixing matrix ̂A calculated in Step S130. In detail, a correlation between the glucose amount C (C1, C2, . . . , Cn) and the vector ̂α1 (̂a11, ̂a21, ̂an1) of a first row is acquired, then, a correlation between the glucose amount C (C1, C2, . . . , Cn) and the vector ̂α2 (̂a12, ̂a22, . . . , ̂an2) of a second row is acquired, and by doing so, a correlation of each row with respect to the glucose amount C is subsequently acquired, and lastly, a correlation between the glucose amount C (C1, C2, . . . , Cn) and the vector ̂αm (̂a1m, ̂a2m, . . . , ̂anm) of an m-th row is acquired.


The correlations can be acquired by a correlation coefficient R according to the following formula (12). The correlation coefficient R is called Pearson's product-moment correlation coefficient.









R
=





i
=
1

n




(


C
i

-

C
_


)



(



a
^

ik

-



α
^

_

k


)









i
=
1

n




(


C
i

-

C
_


)

2









i
=
1

n




(



a
^

ik

-



α
^

_

k


)

2









(
12
)








C and ̂αk each represent a chlorophyll amount and an average value of a vector ̂αk.



FIG. 7 is a graph of a scatter diagram. The scatter diagram of the drawing includes the glucose amount C on a vertical axis, and a coefficient of the estimated mixing matrix ̂A (hereinafter, referred to as an “estimated mixing coefficient”) ̂a on a horizontal axis. Points specified from each element C1, C2, . . . , Cn of the glucose amount C and the estimated mixing coefficients ̂a1j, ̂a2j, . . . , ̂anj (j=1 to m) included in the vector ̂α in the vertical direction of the estimated mixing matrix ̂A are plotted. In the example shown in the drawing, plotted points are relatively concentrated in a vicinity of a linear line L. In this case, the correlation between the glucose amount C and the estimated mixing coefficient ̂a is high. In contrast, when the correlation between the glucose amount C and the estimated mixing coefficient ̂a is decreased, the plotted points are not aligned linearly, but are widely scattered, as shown in FIG. 8. That is, as the correlation between the glucose amount C and the estimated mixing coefficient ̂a increases, the plotted points have an increasing tendency to be concentrated linearly. The correlation coefficient R shown in the formula (12) represents a degree of the tendency of the plotted points to be concentrated linearly.


As a result of Step S140 of FIG. 5, a correlation coefficient Rj (j=1, 2, . . . , m) of each independent component (independent component spectrum) Yj is obtained. After that, the CPU 10 specifies a correlation coefficient having highest correlation, that is, having a value close to 1, from the correlation coefficients Rj obtained in Step S140. In a case of the scatter diagram described above, the correlation coefficient Rj which is obtained with plotted points most concentrated in a linear shape, is specified. The row vector ̂α with which the highest correlation coefficient R is obtained, is selected from the estimated mixing matrix ̂A (Step S150).


In the case of the Table TB of FIG. 6, the selection in Step S150 is selection of one row from the plurality of rows. An element of the selected row is a mixing coefficient of the independent component corresponding to glucose which is the target component. As a result of the selection, the vector ̂αk (̂a1k, ̂a2k, . . . , ̂ank) is obtained. Herein, k is any integer of 1 to m. A value of k is temporarily stored in the memory 20 as the target component order which shows what number of the independent component corresponds to the target component. Each of ̂a1k, ̂a2k, . . . , ̂ank included in the vector ̂αk correspond to the “mixing coefficients corresponding to the target component” in Application Example 1. In the example of FIG. 6, target component order k=2 shows the row vector ̂α2 corresponding to the independent component Y2=(̂a12, ̂a22, . . . , ̂an2). In the present specification, a term “order” is used with a meaning of a “value showing a position in the matrix”. In the processes of Steps S140 and S150, the CPU 10 functions as the mixing coefficient selection unit 436 of FIG. 3B. After performing Step S150, the CPU ends the calculation process of the mixing coefficients. As a result, the step 4 ends and then the process proceeds to the step 5.


Step 5

The step 5 is a calculation step of a regression equation, and is performed using the computer 100 in the same manner as during the time of performing the step 4. In the step 5, the computer 100 performs a process of calculating the regression equation of the calibration curve. The step 5 may be performed by transferring the data which is subjected to the process up to the step 4, to another computer.



FIG. 9 is a flowchart showing the calculation process of the regression equation performed by the CPU 10 of the computer 100. When the process is started, the CPU 10 first calculates a regression equation F, based on the glucose amount C (C1, C2, . . . , Cn) measured in the step 3, and the vector ̂αk (̂a1k, ̂a2k, . . . , ̂ank) selected in Step S150 (Step S210). In a case where the scatter diagram shown in FIG. 7 has highest correlation, the linear line L in the drawing corresponds to the regression equation F. The calculation method of the regression equation is well known and therefore will not be specifically described herein, but the linear line L is acquired using a least-squares method so as to set a distance from the linear line L to each plotted point (residual error) to be close to 0, for example. The regression equation F can be represented by the following formula (13). In Step S210, constants u and v of the formula (13) are acquired.






F:C=u{circumflex over (α)}
k
+v  (13)


After performing Step S210, the CPU 10 stores the constants u and v of the regression equation F acquired in Step S210, the target component order k (FIG. 6) obtained in Step S150, and the independent component matrix Y calculated in Step S120 of the calculation process (FIG. 5) of the mixing coefficient, in the hard disk drive 30 as a calibrating data set DS2 (Step S220). After that, the CPU 10 temporarily ends the calculation process of the regression equation by proceeding to a “return” step. As a result, the regression equation of the calibration curve can be acquired, and the calibration curve creating method shown in FIG. 1A also ends. In the processes of Step S210 and S220, the CPU 10 functions as the regression equation calculation unit 440 of FIG. 3B.


B. CALIBRATING METHOD OF TARGET COMPONENT

Next, a calibrating method of the target component will be described. A test object is configured with the same components as the sample used when creating the calibration curve. In detail, the calibrating method of the target component is performed using the computer. In addition, the computer herein may be the computer 100 used when creating the calibration curve or may be another computer.



FIG. 10 is a functional block diagram of a device used when performing the calibration of the target component. This device 500 includes a test object observation data acquisition unit 510, a calibrating data acquisition unit 520, a mixing coefficient calculation unit 530, and a target component amount calculation unit 540. The mixing coefficient calculation unit 530 includes a preprocessing unit 532. The preprocessing unit 532 has functions of both the first preprocessing unit 450 and the second preprocessing unit 460 of FIG. 3C. The test object observation data acquisition unit 510 is implemented by the CPU 10 of FIG. 3A in cooperation with the input I/F 50 and the memory 20, for example. The calibrating data acquisition unit 520 is implemented by the CPU 10 of FIG. 3A in cooperation with the memory 20 and the hard disk drive 30, for example. The mixing coefficient calculation unit 530 and the target component amount calculation unit 540 are implemented by the CPU 10 of FIG. 3A in cooperation with the memory 20, for example. The computer which implements each function of FIG. 10 is set as the computer 100 used when creating the calibration curve, and the calibrating data set DS2 described above is stored in the storage unit such as the hard disk drive or the like.



FIG. 11 is a flowchart showing a target component calibration process performed by the CPU 10 of the computer 100. The target component calibration process is implemented by loading a predetermined program stored in the hard disk drive 30 into the memory 20 and executing the program by the CPU 10. As shown in the drawing, if the process is started, first, the CPU 10 performs a process of imaging the living body which is a test object with the spectroscopic measurement instrument (Step S310). The imaging in Step S310 can be performed in the same manner as in the step 2, and as a result, an absorbance spectrum Xp of the test object is obtained. For the spectroscopic measurement instrument used in the calibration process, it is preferable to use the same type as that of the spectroscopic measurement instrument used in creating the calibration curve, in order to suppress error. In order to further suppress error, it is more preferable to use the same instrument. In the same manner as in the step 2 of FIG. 1A, instead of measuring the spectral reflectance spectrum or the absorbance spectrum, the spectrum may be estimated from other measured values. The absorbance spectrum Xp of the test object which is obtained when one test object is imaged once, is represented by a vector as in the following formula (14).






X
p
={X
p1
,X
p2
, . . . ,X
pl}  (14)


In the process of Step S310, the CPU 10 functions as the test object observation data acquisition unit 510 of FIG. 10. Next, the CPU 10 acquires the calibrating data set DS2 from the hard disk drive 30 and stores the calibrating data set in the memory 20 (Step S315). In the process of Step S315, the CPU 10 functions as the calibrating data acquisition unit 520 of FIG. 10.


After performing Step S315, the preprocessing is performed with respect to the absorbance spectrum Xp of the test object which is obtained in Step S310 (Step S325). As this preprocessing, it is preferable to perform the same process as the preprocessing (that is, normalization process by the first preprocessing unit 450 and the whitening process by the second preprocessing unit 460) used in the step 4 of FIG. 1A (more specifically, Step S110 of FIG. 5) when creating the calibration curve.


After that, the CPU 10 performs the process of acquiring the estimated mixing matrix ̂A of the test object, based on the independent component matrix Y included in the calibrating data set DS2 and the preprocessed spectrum obtained in Step S325 (Step S335). In detail, since the arithmetic process according to the formula (10) is performed, an inverse matrix (pseudo inverse matrix) Y+ of the independent component matrix Y included in the calibrating data set DS2 is acquired, and the pseudo inverse matrix Y+ is applied to the preprocessed vector obtained in Step S325, to acquire the estimated mixing matrix ̂A.


As shown in the following formula (15), the estimated mixing matrix ̂A in the calibration process is a row vector (1×m matrix) which is formed of the mixing coefficients corresponding to each independent component. Herein, after performing Step S335, the CPU 10 reads out the target component order k included in the calibrating data set DS2 from the hard disk drive 30, extracts the mixing coefficient ̂αk of a k-th component corresponding to the target component order k, from the estimated mixing matrix ̂A acquired in Step S335, and temporarily stores the mixing coefficient ̂αk in the memory 20 as a mixing coefficient of glucose which is the target component (Step S340). In the processes of Steps S325, S335, and S340, the CPU 10 functions as the mixing coefficient calculation unit 530 of FIG. 10.





{circumflex over (A)}=({circumflex over (α)}1,{circumflex over (α)}2, . . . ,{circumflex over (α)}m)  (15)


Next, the CPU 10 reads out the constants u and v of the regression equation included in the calibrating data set DS2 from the hard disk drive 30 and substitutes the constants u and v and the mixing coefficient ̂αk of glucose which is the target component obtained in Step S340 into the right side of the formula (13), to acquire the content C of the glucose (Step S350). The content C is acquired as glucose concentration, that is, a mass of glucose included in capacitance unit volume of blood (for example, 100 ml) of the test object. In the process of Step S350, the CPU 10 functions as the target component amount calculation unit 540 of FIG. 10. After that, the target component calibration process ends by proceeding to a “return” step.


In the first embodiment, the content C (mass per unit volume) acquired in Step S350 is set as the content of glucose of the test object, but instead of this, the content C acquired in Step S350 may be corrected with the normalized coefficient used in the normalization in Step S325 and the corrected value may be set as the content to be acquired. In detail, an absolute value (grams) of the content may be acquired by multiplying the standard deviation by the content C. According to this configuration, the content C can have yet further higher accuracy depending on the kinds of the target components.


According to the calibration curve creating method of the embodiment configured as described above, it is possible to acquire the glucose amount from one spectrum which is an actually-measured value of the blood which is the test object, with high accuracy.


C. VARIOUS ALGORITHMS AND EFFECT THEREOF ON CALIBRATION ACCURACY

Various algorithms used in the first preprocessing unit 450, the second preprocessing unit 460, and the independent component analysis processing unit 470 shown in FIG. 3C, and the effects thereof on the calibration accuracy will be described in order.


C-1. First Preprocessing (Normalization Process Using SNV/PNS)

As the first preprocessing performed by the first preprocessing unit 450, the standard normal variate transformation (SNV) and the project on null space (PNS) can be used.


SNV is given by the following formula (16).









z
=


x
-

x
ave


σ





(
16
)







Herein, z represents processed data, x represents the data to be processed (absorbance spectrum in the first embodiment) and xave represents an average value of the data to be processed x, and σ represents a standard deviation of the data to be processed x. As a result of the standard normal variate transformation, the normalized data z in which the average value is 0 and the standard deviation is 1, is obtained.


If the PNS is performed, it is possible to decrease the baseline variation included in the data to be processed. In measurement of the data to be processed (absorbance spectrum in the first embodiment), variation between data items, called baseline variation, such as an increase or a decrease of the average value of the data for each measurement data item occurs due to various reasons. Accordingly, it is preferable to remove the reasons for the variation before performing the independent component analysis process (ICA). The PNS can be used as the preprocessing which can decrease the baseline variation of the data to be processed. In particular, great baseline variation occurs in the measurement data of absorbed light spectrum or reflected light spectrum including the near-infrared region, and accordingly it is very advantageous to apply the PNS. Hereinafter, a principle of removing the baseline variation included in the data obtained by the measurement (also simply referred to as “measurement data x”) by the PNS will be described. In addition, as a typical example, a case where the measurement data is the absorbed light spectrum or the reflected light spectrum including the near-infrared region, will be described.


In general, in an ideal system, the measurement data x (data to be processed x) is represented by the following formula (17), using m (m is an integer equal to or larger than 2) independent components si (i=1 to m) and each mixing ratio ci.












x
=






i
-
1

m




c
i



s
i









=



A
·
s








(
17
)







Herein, A is a matrix (mixing matrix) formed by the mixing ratio ci.


The process is performed also in the independent component analysis (ICA) with this model. However, various variation factors (change of a state of a specimen or a measurement environment, and the like) exist in the actual measurement data. Herein, as a model obtained by considering those, a model which represents the measurement data x is considered using the following formula (18).









x
=


b





i
=
1

m




c
i



s
i




+
aE
+

d





λ

+

e






λ
2


+
ɛ





(
18
)







Herein, b represents a parameter showing a variation amount of the spectrum in an amplitude direction, a, d, and e each represent constant baseline variation E (also referred to as an “average value variation”), a parameter showing an amount of variation λ linearly dependent on a wavelength, and a parameter showing an amount of variation λ2 secondarily dependent on a wavelength, and ε represents other variation components. In addition, the constant baseline variation E is given by E={1, 1, 1, . . . 1}T, and a data length thereof is a constant vector which is equivalent to a data length (number of sections of the wavelength band) of the measurement data x. The variation λ and λ2 dependent on the wavelength are given by λ={λ1, λ2, . . . λN}T and λ2={λ12, λ22, . . . λN2}T, and N herein is the data length of the measurement data x. High-order variation which is equal to or higher than tertiary variation can also be considered as the variation dependent on the wavelength, and up to g-order variation λg (g is an integer equal to or larger than 2) can be generally considered. Since the variation components are error factors in the ICA or the calibration, it is desirable to remove the variations in advance.


In PNS, by imaging the measurement data x in a space (null space) not including the variation components by considering a space configured with each of the baseline variation components E, λ, λ2, . . . λg described above, it is possible to obtain the data in which the baseline variation components E, λ, λ2, . . . λg (g is an integer equal to or larger than 2) are decreased. As the specific arithmetic operation, the processed data z from the PNS is calculated by the following formula (19).













z
=




(

1
-

PP
+


)


x







=




b





i
=
1

m




c
i



k
i




+

ɛ
*












P
=

{

1
,
λ
,


λ
2









λ
8



}






(
19
)







Herein, P+ is a pseudo inverse matrix of P. ki is a component obtained by imaging the configuration component si of the formula (18) in the null space not including the variation components. In addition, ε* is a component obtained by imaging the variation component ε of the formula (18) in the null space.


If the normalization (for example, SNV) is performed after processing of PNS, it is possible to eliminate an effect on the variation amount b of the spectrum in an amplitude direction of the formula (18).


If ICA is performed with respect to the data which is subjected to the preprocessing by such PNS, the obtained independent component is an estimated value of the component ki of the formula (19) and is different from the actual configuration component si. However, the mixing ratio ci does not change from the original value of the formula (18), and accordingly the calibration process (FIG. 11) using the mixing ratio ci is not affected. As described above, when the PNS is performed as the preprocessing of ICA, it is difficult to obtain the actual configuration component si by the ICA, and therefore applying the PNS to the preprocessing of the ICA is not considered. On the other hand, in the first embodiment, since the calibration process is not affected although the PNS is performed as the preprocessing of the ICA, if the PNS is performed as the preprocessing, it is possible to perform the calibration with higher accuracy.


In addition, the PNS is specifically disclosed, for example, in “Extracting Chemical Information from Spectral Data with Multiplicative Light Scattering Effects by Optical Path-Length Estimation and Correction”, Zeng-Ping Chen, Julian Morris, and Elaine Martin, 2006.


C-2. Second Preprocessing (Whitening Process Using PCA/FA)

Principal components analysis (PCA) and factor analysis (FA) can be used as the second preprocessing performed by the second preprocessing unit 460.


In the typical ICA method, dimensional compression of the data to be processed and non-correlating are performed, as the preprocessing. Since a transformation matrix to be acquired in the ICA is limited to the orthogonal transformation matrix by this preprocessing, it is possible to decrease the computational complexity of the ICA. Such preprocessing is called “whitening” and PCA is used in many cases. The whitening using PCA is described, for example, in Chapter 6 of “Independent Component Analysis”, Aapo Hyvarinen, Juha Karhumen, Erkki Oja, 2001, John Wiley & Sons, Inc. (“Independent Component Analysis” February 2005, published by Tokyo Denki University Publishing Department).


However, in PCA, in a case where random noise is included in the data to be processed, the PCA may be affected by the effect of the random noise, and accordingly error may be generated in the processed result. Herein, in order to decrease the effect of the random noise, it is preferable to perform the whitening using the factor analysis (FA) having robustness with respect to the noise, instead of the PCA. Hereinafter, the principle of the whitening by FA will be described.


As described above, in the typical ICA, a linear mixing model (formula (17)) representing the data to be processed x as a linear sum of the configuration components si is assumed, and the mixing ratio ci and the configuration component si are acquired. However, the random noise other than that for the configuration component si is added to the actual data, in many cases. Herein, as the model obtained by considering the random noise, a model representing the measurement data x by the following formula (20) is considered.






x=A·s+ρ  (20)


Herein, ρ represents the random noise.


The whitening considering this noise mixing model is performed, and then it is possible to obtain the estimation of the mixing matrix A and the independent component si by performing the ICA.


In the FA of the first embodiment, it is assumed that each of the independent components si and the random noise ρ is in accordance with normal distribution N (0, Im) and N (0, Σ). As generally known, a first parameter x1 of the normal distribution N (x1, x2) represents an expected value and a second parameter x2 thereof represents a standard deviation. At that time, since the data to be processed x is the linear sum of a variable in accordance with the normal distribution, the data to be processed x is also in accordance with the normal distribution. Herein, when a covariance matrix of the data to be processed x is set as V[x], the normal distribution of the data to be processed x can be represented as N (0, V[x]). At that time, a likelihood function of the covariance matrix V[x] of the data to be processed x can be calculated in the following order.


First, if it is assumed that the independent components si are orthogonal to each other, the covariance matrix V [x] of the data to be processed x is calculated by the following formula (21).






V[x]=E[xx
T
]=AA
T+Σ  (21)


Herein, Σ represents the covariance matrix of the noise ρ.


As described above, the covariance matrix V[x] can be represented by the mixing matrix A and the covariance matrix Σ of the noise. At that time, a log likelihood function L (A, Σ) is given by the following formula.










L


(

A
,


)


=


-

n
2




{


tr


(



(


AA
T

+


)


-
1



C

)


+

log


(

det
(


AA
T

+


)

)


+

m





log





2

π


}






(
22
)







Herein, n represents the number of data items of the data x, m represents the number of independent components, an operator tr represents a trace of the matrix (sum of diagonal components), and an operator det represents a determinant. In addition, C represents a sample covariance matrix acquired by sample calculation from the data x, and is calculated by the following formula.









C
=


1
n






i
=
1

n




x
i



x
i
T








(
23
)







The mixing matrix A and the covariance matrix E of the noise can be acquired by maximum-likelihood method using the log likelihood function L (A, Σ) of the formula (22). As the mixing matrix A, a mixing matrix A which is substantially not affected by the random noise ρ of the formula (20) is obtained. This is a basic principle of the FA. As the algorithm of the FA, there are various algorithms using the algorithm other than the maximum likelihood method. Such various FA can also be used in the first embodiment.


Meanwhile, the estimated value obtained by the FA is merely a value of AAT, in a case where the mixing matrix A adapted for this value is determined, the non-correlating of the data can be performed while decreasing the effect of the random noise, but it is difficult to uniquely determine the plurality of configuration components si since a degree of freedom of rotation remains. Meanwhile, the ICA is a process of decreasing the degree of freedom of the rotation of the plurality of configuration components si so that the plurality of configuration components si are orthogonal to each other. Herein, in the first embodiment, an arbitrary property with respect to the remaining rotation is specified by the ICA, using a value of the mixing matrix A acquired by the FA as the whitened matrix (matrix subjected to the whitening). Accordingly, after performing the whitening process which is robust to the random noise, by performing the ICA, the independent configuration components si orthogonal to each other can be determined. In addition, as a result of such a process, it is possible to decrease the effect of the random noise and to improve the calibration accuracy related to the configuration components si.


C-3. ICA (Kurtosis and β Divergence as Independence Index)

In the independent component analysis (ICA), as the index for separating the independent components, the higher order statistics representing independence between the separated data items are generally used as the independence index. The kurtosis is a typical independence index. The ICA using the kurtosis as the independence index is, for example, described in Chapter 8 of “Independent Component Analysis”, Aapo Hyvarinen, Juha Karhumen, Erkki Oja, 2001, John Wiley & Sons, Inc. (“Independent Component Analysis” February 2005, published by Tokyo Denki University Publishing Department).


However, in a case where an outlier such as spike noise is included in the data to be processed, statistics including the outlier are calculated as the independence index. Therefore, error may be generated between original statistics and the calculated statistics of the data to be processed, and this may cause a decrease in separation accuracy. Herein, it is preferable to use the independence index which is hardly affected by the effect from the outlier in the data to be processed. β divergence can be used as the independence index having such properties. Hereinafter, a principle of the β divergence as the independence index in the ICA will be described.


As described above, in the typical ICA, a linear mixing model (formula (17)) representing the data to be processed x as a linear sum of the configuration components si is assumed, and the mixing ratio ci and the configuration component si are acquired. An estimated value y of the configuration component s acquired by the ICA is represented as y=W·y using the separation matrix W. At that time, the separation matrix W is desirably an inverse matrix of the mixing matrix A.


Herein, a log likelihood function L (̂W) of an estimated value ̂W of the separation matrix W can be represented by the following formula.










L


(

W
^

)


=


1
N






i
=
1

N



l


(


x


(
t
)


,

W
^


)








(
24
)







Herein, an element of a summation sign Σ is a log likelihood of each data point x (t). This log likelihood function L (̂W) can be used as the independence index of the ICA. A method of the β divergence is a method of applying a suitable function to the log likelihood function L (̂W) to convert the log likelihood function L (̂W) so as to suppress the effect of the outlier such as the spike noise in the data.


In a case of using the β divergence as the independence index, first, the log likelihood function L (̂W) is converted by the following formula using a function Φβ which is previously selected.











L
Φ



(

W
^

)


=


1
N






i
=
1

N




Φ
β



(

l


(


x


(
t
)


,

W
^


)


)








(
25
)







This function LΦ(̂W) is considered as a new likelihood function.


As the function Φβ for decreasing the effect of the outlier such as the spike noise, a function in which the function Φβ decreases in an exponential manner as the value of the log likelihood function (value in brackets of the function Φβ) decreases, is considered. As such a function Φβ, the following formula can be used, for example.











Φ
β



(
z
)


=


1
β



{


exp


(

β





z

)


-
1

}






(
26
)







In this function, as the value of β increases, a function value with respect to each data point z (log likelihood in the formula (25)) decreases. The value of β can be determined empirically, and can be set as approximately 0.1, for example. The function Φβ is not limited to that of the formula (26), and it is possible to use another function in which, as the value of β increases, the function value with respect to each data point z decreases.


When using the β divergence as the independence index, it is possible to suitably suppress the effect of the outlier such as the spike noise. In a case where the likelihood function LΦ(̂W) such as the formula (25) is considered, a pseudo distance among probability distributions which is minimized corresponding to maximization of the likelihood is β divergence. If the ICA using the β divergence as the independence index is performed, it is possible to decrease the effect of the outlier such as the spike noise to improve the calibration accuracy of the configuration component si.


The ICA using the β divergence is, for example, described in “Robust Blind Source Separation by β-Divergence” Minami Mihoko, Shinto Eguchi, 2002.


C-4. Evaluation of Effects of Algorithms on Calibration Accuracy (1)


FIG. 12A to FIG. 12H are diagrams showing comparison of calibration accuracy obtained by processes of combination of algorithms under 8 types of different process conditions, and FIG. 13 is a diagram showing collection of the calibration accuracy of FIG. 12A to FIG. 12H. In the effects evaluation, the absorbance of 8 types of mixtures having different mixing ratios of sucrose and salt was measured with a spectrophotometer to acquire the data to be processed, and the calibration curve (similar to that in FIG. 7) is created in accordance with the procedures of FIG. 5 and FIG. 9. A calibration target is concentration (ratio with respect to unit volume) of sucrose. A reason for converting the absorbance of the mixture into the data to be processed, is for checking the effect of various algorithms with respect to the data to be processed including various variation. FIG. 12A to FIG. 12H each show a relationship between a calibration value obtained when using the calibration curve obtained as described above, and an actual value.


The following two values are used as index values showing the calibration accuracy. R2 is the square of the correlation coefficient R between the actually-measured values and the calibration values obtained by the independent component analysis, and SEP is estimated standard error between the actually-measured value and the calibration value obtained by the independent component analysis. In general, the calibration accuracy is excellent when R2 is large (close to 1), and the calibration accuracy is excellent when SEP is small.


In process conditions 1, the standard normal variate transformation (SNV) is used in the first preprocessing, the principal components analysis (PCA) is used in the second preprocessing, and the kurtosis is used as the independence index of the independent component analysis (ICA). In process conditions 2, the process is performed in the same manner as the process conditions 1, except for using the factor analysis (FA) in the second preprocessing. In process conditions 3, the process is performed in the same manner as the process conditions 1, except for using the project on null space (PNS) in the first preprocessing. In a case of using the PNS in the first preprocessing (process conditions 3, 5, 6, and 8), the SNV is performed after the PNS.


Effect of Usage of Project on Null Space (PNS) in First Preprocessing

An effect of usage of the PNS in the first preprocessing can be recognized, when the process conditions 1 and the process conditions 3 of FIG. 13 are compared to each other. That is, with the process conditions 3 in which the PNS is used in the first preprocessing, both the correlation coefficient R and the expected standard error SEP are improved and the calibration accuracy is improved, compared to the process conditions 1 in which only the SNV is used. This effect can also be recognized substantially in the same manner from the comparison between the process conditions 4 and the process conditions 5 or the comparison between the process conditions 7 and the process conditions 8. As described above, the PNS is the algorithm having a great effect for decreasing the effect of the baseline variation. In 8 analysis results shown in FIG. 13, the absorbance spectrum including the near-infrared region is set as a process target as the measurement data. In particular, great baseline variation occurs in the measurement data of absorbance light spectrum or reflected light spectrum including the near-infrared region, and accordingly it is very advantageous to apply the PNS.


Effect of Usage of Factor Analysis (FA) in Second Preprocessing

An effect of usage of the FA in the second preprocessing can be recognized, when the process conditions 1 and the process conditions 2 of FIG. 13 are compared to each other. That is, with the process conditions 2 in which the FA is used in the second preprocessing, both the correlation coefficient R and the expected standard error SEP are improved and the calibration accuracy is improved, compared to the process conditions 1 in which the PCA is used. This effect can also be recognized substantially in the same manner from the comparison between the process conditions 4 and the process conditions 7. However, in FIG. 13, the effect of the usage of the FA in the second preprocessing is slightly less than the effect of the usage of the PNS in the first preprocessing. The reason thereof is assumed to be that the FA is effective mainly in decreasing the effect of the random noise and the random noise is slightly included in the measurement data used in the analysis of FIG. 13.


Effect of Usage of β Divergence as Independence Index of ICA

An effect of usage of the β divergence as the independence index of the ICA can be recognized, when the process conditions 1 and the process conditions 4 of FIG. 13 are compared to each other. That is, with the process conditions 4 in which the β divergence is used as the independence index of the ICA, both the correlation coefficient R and the expected standard error SEP are slightly improved and the calibration accuracy is improved, compared to the process conditions 1 in which the kurtosis is used. This effect can also be recognized substantially in the same manner from the comparison between the process conditions 6 and the process conditions 8. However, in FIG. 13, the effect of the β divergence as the independence index of the ICA is slight, and in comparison between the process conditions 2 and the process conditions 7, the calibration accuracy is slightly degraded in a case where the β divergence is used. The reason thereof is assumed to be that the β divergence is effective mainly in decreasing the effect of the outlier such as the spike noise, and the spike noise is slightly included in the measurement data used in the analysis of FIG. 13.


C-5. Evaluation of Effects of Algorithms on Calibration Accuracy (2)


FIG. 14A to FIG. 14H are diagrams showing results obtained by evaluating the effects of various algorithms when various variations exist, and FIG. 15 is a diagram showing collection of the calibration accuracy of FIG. 14A to FIG. 14H. In the effects evaluation, 40 types of sample voice signals obtained by mixing two voice items v1 and v2 at random ratios, are created as the data to be processed, and the calibration curves are created in accordance with the procedure of FIG. 5 and FIG. 9. The calibration target is a ratio of the first voice v1. A reason of setting the voice signals as the data to be processed, is for checking the effect of various algorithms with respect to the data to be processed including various variations.



FIG. 14A to FIG. 14H each show a relationship between a calibration value obtained when using the calibration curve obtained as described above, and an actual value. The following three types of variations are added to the sample voice signals. (1) Gaussian noise: Gaussian noise having variance of 0.05 is added. (2) Spike noise: Spike noise according to χ2 distribution with a parameter 5 is added at a ratio of 1%. (3) Baseline variation: the constant baseline variation E, the variation λ, linearly dependent on a wavelength, and the variation λ2 secondarily dependent on a wavelength are randomly added in order of 10−1, 10−5, and 10−6.


Eight types of process conditions of FIG. 15 are the same as those shown in FIG. 13. The SNV is disclosed in brackets in FIG. 15 because the sample voice signals may be considered as signals subjected to the SNV, as those are previously created as normalized signals in which an average value is 0 and standard deviation is 1.


Effect of Usage of Project on Null Space (PNS) in First Preprocessing

An effect of usage of the PNS in the first preprocessing is slight in comparison between the process conditions 1 and the process conditions 3 of FIG. 15, but is considerably large in comparison between the process conditions 2 and the process conditions 6. The considerably large effect of the PNS can also be recognized, in comparison between the process conditions 4 and the process conditions 5, and comparison between the process conditions 7 and the process conditions 8. That is, the effect of usage of the PNS in the first preprocessing becomes more significant, when employing at least one of usage of the FA in the second preprocessing, and usage of β divergence as the independence index of the ICA.


Effect of Usage of Factor Analysis (FA) in Second Preprocessing

An effect of usage of the FA in the second preprocessing is considerably large so as to be recognized from comparison between the process conditions 1 and the process conditions 2 of FIG. 15, and the considerably large effect thereof can also be recognized, in the same manner, from comparison between the process conditions 3 and the process conditions 6, comparison between the process conditions 4 and the process conditions 7, and comparison between the process conditions 5 and the process conditions 8. In addition, the effect of usage of the FA in the second preprocessing becomes more significant, when employing at least one of usage of the PNS in the first preprocessing, and usage of β divergence as the independence index of the ICA.


Effect of Usage of β Divergence as Independence Index of ICA

An effect of usage of the β divergence as the independence index of the ICA is slight in comparison between the process conditions 1 and the process conditions 4 of FIG. 15, but is considerably large in comparison between the process conditions 3 and the process conditions 5, and the considerably large effect thereof is also obtained, in the same manner, from comparison between the process conditions 2 and the process conditions 7 or comparison between the process conditions 6 and the process conditions 8. That is, the effect of usage of the β divergence as the independence index of the ICA becomes more significant, when employing at least one of usage of the PNS in the first preprocessing, and usage of the FA in the second preprocessing.


As recognized from the evaluation result of FIG. 15 described above, the effect of usage of the PNS in the first preprocessing, the effect of usage of the FA in the second preprocessing, and the effect of usage of the β divergence as the independence index of the ICA are more significant, in a case where the data in which various variations exist is set as the data to be processed.


C-6. Evaluation of Effects of Algorithms on Calibration Accuracy (3)


FIG. 16A to FIG. 16F are diagrams showing results obtained by evaluating the effects of various algorithms when only one type of variation exists from three types of variations such as Gaussian noise, the baseline variation, and the spike noise, and FIG. 17 is a diagram showing collection of the calibration accuracy of FIG. 16A to FIG. 16F. In the effects evaluation, in the same manner as FIG. 15, 40 types of sample voice signals obtained by mixing two voice items v1 and v2 at random ratios, are created as the data to be processed, and the calibration curves are created in accordance with the procedure of FIG. 5 and FIG. 9. However, only one type of variation among the three types of variations such as the Gaussian noise, the baseline variation, and the spike noise, is added to the sample voice signals.


As shown in FIG. 17, the data to which the Gaussian noise is added is subjected to the process with the process conditions 1 and the process conditions 2. The process conditions 1 and the process conditions 2 are different from each other in a point of using the PCA or using FA as the second preprocessing. If the FA is used as the second preprocessing for the data to which the Gaussian noise is added, the calibration accuracy is significantly improved compared to the case of using the PCA. From this comparison, it is confirmed that the FA is effective in decreasing the effect of the random noise such as the Gaussian noise.


As shown in FIG. 17, the data to which the baseline variation is added is subjected to the process with the process conditions 1 and the process conditions 3. The process conditions 1 and the process conditions 3 are different from each other in a point of using the SNV or using PNS as the first preprocessing. If the PNS is used as the first preprocessing for the data to which the baseline variation is added, the calibration accuracy is significantly improved compared to the case of using only the SNV. From this comparison, it is confirmed that the PNS is effective in decreasing the effect of the baseline variation.


As shown in FIG. 17, the data to which the spike noise is added is subjected to the process with the process conditions 1 and the process conditions 4. The process conditions 1 and the process conditions 4 are different from each other in a point of using the kurtosis or using β divergence as the independence index of the ICA. If the β divergence is used as the independence index of the ICA for the data to which the spike noise is added, the calibration accuracy is significantly improved compared to the case of using the kurtosis. From this comparison, it is confirmed that the β divergence is effective in decreasing the effect of the spike noise.


D. MODIFICATION EXAMPLES

The invention is not limited to the embodiments described above or other modification examples, and can be executed in various embodiments within a range not departing from a gist thereof, and the following modifications can also be performed, for example.


Modification Example 1

In the embodiments described above, the test object observation data acquisition unit 510 (FIG. 10) acquires the calibrating data set DS2 from the hard disk drive 30 to acquire the independent component matrix Y including the independent component corresponding to the target component, and the mixing coefficient calculation unit 530 (FIG. 10) acquires the estimated mixing matrix ̂A of the test object based on the absorbance spectrum of the test object and the independent component matrix Y and extracts the k-th row mixing coefficient αk corresponding to the target component order k from the estimated mixing matrix ̂A, to acquire the mixing coefficient of the target component for the test object, but the invention is not limited thereto. For example, the modification example may have a configuration of performing the following parts (i) and (ii) in order.


(i) The calibrating data set DS2 stored in the hard disk drive 30 is read out to acquire the k-th row element (independent component) Yk corresponding to the target component order k from the independent component matrix Y included in the calibrating data set DS2. The independent component Yk has a highest correlation with respect to the glucose amount and corresponds to the glucose amount. (ii) Next, an inner product of the extracted independent component Yk and the spectrum Xp of the test object which is the observation data (for example, normalized spectrum obtained in Step S320) is acquired, and an inner product value thereof is set as the mixing coefficient αk of the target component. That is, an arithmetic operation according to the following formula (27) is performed.





αk=Xp·Yk  (27)


Herein, since it is assumed that the observation data is a linear sum of the independent components, and orthogonality between the independent components is sufficiently high, by calculating the inner product between the spectrum which is the observation data and the independent component matrix of the target component, only the value of the independent component remains and values of all of the other components become 0. Accordingly, the mixing coefficient αk of the target component is easily calculated. However, in a case where the orthogonality between the independent components is not sufficiently high, it is preferable to acquire the estimated mixing matrix ̂A of the formula (15) without using the arithmetic operation of the formula (27).


In the process of the part (i), the CPU 10 functions as the calibrating data acquisition unit. In the process of the part (ii), the CPU 10 functions as the mixing coefficient calculation unit. In addition, instead of the configuration of the part (i), the calibrating data acquisition unit may be configured to acquire the independent component Yk, from a storage unit such as the hard disk drive 30 in which the k-th row element (independent component) Yk corresponding to the target component order k from the independent component matrix Y is previously stored. In a case of using the inner product, the independent component corresponding to the target component is only necessary, and therefore the other independent components are not necessary. In this case, the independent component is the vector, and it is not necessary to store the target component order.


Modification Example 2

In the embodiments and the modification example described above, the target component is set as glucose in the blood, but instead of glucose, other components in the blood, for example, oxyhemoglobin, deoxyhemoglobin, or the like may be used.


Modification Example 3

In the embodiments and the modification examples described above, the mixing coefficient estimation step has a configuration of acquiring the independent component matrix, acquiring the estimated mixing matrix, and extracting the mixing coefficient corresponding to the target component from the estimated mixing matrix, but it is not necessary to have this configuration. That is, any configuration can be used, as long as it is the configuration in which each independent component included in the observation data of each sample when separating the observation data into the plurality of the independent components is assumed, and the mixing coefficient corresponding to the target component is acquired for each sample, based on each independent component.


Modification Example 4

In the calibration curve creating method of the embodiments and the modification examples described above, it is configured to measure the content of the target component of the sample, but instead of this, the sample with known content of the target component may be prepared and the content may be input from a keyboard or the like.


Modification Example 5

In the embodiments and the modification examples described above, the number of elements m of the spectra S of the unknown component is empirically and experimentally determined in advance, but the number of elements m of the spectra S of the unknown component may be determined by minimum description length (MDL) or information criteria known as Akaike information criteria (AIC). In a case of using the MDL or the like, the number of elements m of the spectra S of the unknown component can be automatically determined by the arithmetic operation from the observation data of the sample. The MDL is, for example, described in “Independent component analysis for noisy data—MEG data analysis, 2000”.


Modification Example 6

In the embodiments and the modification examples described above, the test object which is a target of the calibration process is configured with the same component as the sample used when creating the calibration curve, but in a case of acquiring the mixing coefficient using the inner product as in Modification Example 1, unknown components other than the same component as in the sample used when creating the calibration curve may be included in the test object. This is because, since the inner products between the independent components are assumed to be 0, the inner products between the independent components corresponding to the unknown components are also considered to be 0, and the effect of the unknown components can be ignored in a case of acquiring the mixing coefficient with the inner product.


Modification Example 7

For the computer used in the embodiments and the modification examples described above, an exclusive apparatus can be used instead of the personal computer. For example, the personal computer which implements the calibrating method of the target component can be set as an exclusive calibration apparatus.


Modification Example 8

In the embodiments described above, the input of the spectrum of spectral reflectance of the sample or the test object is performed by inputting the spectrum measured by the spectroscopic measurement instrument, but the invention is not limited thereto. For example, the optical spectrum may be estimated from a plurality of band images having different wavelength bands and the optical spectrum may be input. The band images are obtained, for example, by imaging the sample or the test object with a multiband camera including a filter capable of changing transmission wavelength bands.


Modification Example 9

In the embodiments, for the blood component measuring apparatus as the spectroscopic measurement instrument, a member having a probe shape as a point of contact with the living body is used, but this is not necessarily limited thereto, and the following configuration may be used.



FIG. 18 is an explanatory diagram showing a measuring attachment included in a blood component measuring apparatus of Modification Example 9. As shown in the drawing, a measuring attachment 600 includes a contact surface 610 which comes in contact with a fingertip, and includes a light projecting unit and a light receiving unit (not shown) inside of the contact surface 610. A person who is a test object places a finger FG on an upper portion of the measuring attachment 600 so that a fingertip comes in contact with the contact surface 610, to perform the measurement.


Modification Example 10


FIGS. 19A to 19C are explanatory diagrams showing a blood component measuring apparatus of Modification Example 10. FIG. 19A shows a usage embodiment of a blood component measuring apparatus 700 and FIG. 19B shows a rear surface of the blood component measuring apparatus 700. The blood component measuring apparatus 700 has a watch shape, and includes a main body case 712 and a fixing band 714. The fixing band 714 is, for example, a Magic Tape (trade mark) for installing and fixing the main body case 712 on a measurement part such as a wrist or an arm of a person who is a test object.


A sensor module 750 is provided on the rear surface of the main body case 712 so as to come in contact with a skin surface of a person. The sensor module 750 is a measuring device which emits measurement light to the skin surface and receives reflected or transmitted light thereof, and is alight source built-in thin image sensor.



FIG. 19C shows a plan view of the sensor module 750. The sensor module 750 is a device configured by laminating a light emitting layer in which a plurality of light emitting elements 752 are two-dimensionally arranged in a flat surface shape, and a light receiving layer in which a plurality of light receiving elements 754 are two-dimensionally arranged in a flat surface shape, to each other. The light emitting element 752 is a light emitting unit which emits the measurement light, and is implemented with, for example, an LED or an OLED. The light receiving element 754 is implemented with an imaging element such as a CCD or a CMOS.


The blood component measuring apparatus 700 having the configuration described above, first causes all of the light emitting elements 752 of the sensor module 750 to emit light simultaneously, to emit light to the entire area of the measurement part of the person. Then, the entire area of the measurement part is imaged using all of the light receiving elements 754. Next, a blood vessel part suitable for the spectral measurement is selected from the captured image by a control device (not shown) provided in the blood component measuring apparatus 700, the light emission of measurement light is performed with respect to the blood vessel part by the light emitting elements 752, and the diffusely-reflected light from the blood vessel part is received by the light receiving elements 754. The “blood vessel part suitable for the spectral measurement” is a part of a thick blood vessel, a branched part of a blood vessel, a joining part thereof, or the like. The diffusely-reflected light received by the light receiving elements 754 is transferred to a spectral element (not shown) and is dispersed. By doing so, the absorbance spectrum is measured. According to the blood component measuring apparatus 700 of Modification Example 10 having such a configuration, since the blood vessel part suitable for the measurement of the spectral data is selected and measured, it is possible to increase accuracy of the independent component analysis or accuracy of calibration using that.


Modification Example 11

In the embodiments and Modification Examples 9 and 10, the light including the near-infrared light is emitted to the living body to obtain the absorbance spectrum from the diffusely-reflected light thereof, but instead of this, the absorbance spectrum may be obtained from transmitted light which transmits the inside of the living body. That is, as shown in FIG. 20, in a blood component measuring apparatus 800 of Modification Example 11, a light emitting probe 814 provided on a light emitting unit 810, and a light receiving probe 822 provided on a light receiving unit 820 are disposed to oppose each other with a finger FG interposed therebetween. According to this configuration, it is possible to obtain the absorbance spectrum from the transmitted light which transmits the inside of the living body.


Modification Example 12

In the embodiments and the modification examples described above, the light source included in the blood component measuring apparatus is set to the xenon flash tube, but this is not necessarily limited thereto, and a tungsten lamp, a halogen lamp, or a laser light source may be used, for example. In addition, the blood component measuring apparatus may be configured with a Raman spectroscopic measurement instrument which uses a laser light source as the light source, and includes a Rayleigh light removing filter for cutting Rayleigh scattering light, a spectral element which disperses Raman scattering light transmitted the filter, and a light receiving element. According to this configuration, it is possible to perform measurement with respect to the target component with higher sensitivity and to further improve the measurement accuracy.


Modification Example 13

In the embodiments and the modification examples, the functions implemented by the software may be implemented by the hardware.


Among the configuration elements of the embodiments and the modification examples, the elements other than the elements disclosed in independent claims are additional elements and may be suitably omitted.

Claims
  • 1. A calibration curve creating method of creating a calibration curve used in acquiring content of a target component which is a specific component in blood from observation data of a living body which is a test object, the method comprising: (a) causing a computer to acquire the observation data of a plurality of samples of a living body, when near infrared light having a wavelength of 800 nm to 1300 nm is emitted to the living body and an absorbance spectrum obtained from transmitted light or diffusely-reflected light thereof is set as the observation data;(b) causing the computer to acquire content of the target component of each sample;(c) causing the computer to estimate a plurality of independent components when the observation data of each sample is separated into the plurality of independent components, and to acquire a mixing coefficient corresponding to the target component for each sample, based on the plurality of independent components; and(d) causing the computer to acquire a regression equation of the calibration curve based on the content of the target component of the plurality of samples and the mixing coefficient for each sample,wherein (c) includes (i) causing the computer to acquire an independent component matrix including the independent component of each sample, (ii) causing the computer to acquire an estimated mixing matrix showing a vector set for regulating a ratio of an independent component element of each independent component in each sample, from the independent component matrix, and (iii) causing the computer to acquire a correlation of content of the target component of the plurality of samples, for each vector included in the estimated mixing matrix, and to select the vector which is determined to have a highest correlation, as a mixing coefficient corresponding to the target component, andin (i), the computer acquires the independent component matrix by performing a first preprocessing including normalization of the observation data, a second preprocessing including whitening, and an independent component analysis process in this order, and the computer performs normalization after a process performed by project on null space in the first preprocessing.
  • 2. The calibration curve creating method according to claim 1, wherein the computer performs whitening by factor analysis in the second preprocessing.
  • 3. The calibration curve creating method according to claim 1, wherein the computer uses β divergence as an independence index of the independent component analysis process.
  • 4. A calibration curve creation apparatus which creates a calibration curve used in acquiring content of a target component which is a specific component in blood from observation data of a living body which is a test object, the apparatus comprising: a sample observation data acquisition unit which acquires the observation data of a plurality of samples of a living body, when near infrared light having a wavelength of 800 nm to 1300 nm is emitted to the living body and an absorbance spectrum obtained from transmitted light or diffusely-reflected light thereof is set as the observation data;a sample target component amount acquisition unit which acquires content of the target component of each sample;a mixing coefficient estimation unit which estimates a plurality of independent components when the observation data of each sample is separated into the plurality of independent components, and acquires a mixing coefficient corresponding to the target component for each sample based on the plurality of independent components; anda regression equation calculation unit which acquires a regression equation of the calibration curve based on the content of the target component of the plurality of samples and the mixing coefficient for each sample,wherein the mixing coefficient estimation unit includes an independent component matrix calculation unit which acquires an independent component matrix including each independent component of each sample, an estimated mixing matrix calculation unit which acquires an estimated mixing matrix showing a vector set for regulating a ratio of an independent component element of each independent component in each sample, from the independent component matrix, and a mixing coefficient selection unit which acquires a correlation of content of the target component of the plurality of samples, for each vector included in the estimated mixing matrix, and selects the vector which is determined to have a highest correlation, as a mixing coefficient corresponding to the target component,the independent component matrix calculation unit acquires the independent component matrix by performing a first preprocessing including normalization of the observation data, a second preprocessing including whitening, and an independent component analysis process in this order, andthe independent component matrix calculation unit performs normalization after a process performed by project on null space in the first preprocessing.
  • 5. The calibration curve creation apparatus according to claim 4, wherein the independent component matrix calculation unit performs whitening by factor analysis in the second preprocessing.
  • 6. The calibration curve creation apparatus according to claim 4, wherein the independent component matrix calculation unit uses β divergence as an independence index of the independent component analysis process.
  • 7. The calibration curve creation apparatus according to claim 4, further comprising: a storage unit which stores the independent component matrix calculated by the independent component matrix calculation unit, a target component order which shows a position of the mixing coefficient selected by the mixing coefficient selection unit in the estimated mixing matrix, and the regression equation calculated by the regression equation calculation unit.
  • 8. A blood component calibration apparatus which acquires content of a target component which is a specific component in blood for a living body which is a test object, the apparatus comprising: a test object observation data acquisition unit which emits near infrared light having a wavelength of 800 nm to 1300 nm to the test object, and acquires an absorbance spectrum obtained from transmitted light or diffusely-reflected light thereof as observation data;a calibrating data acquisition unit which acquires calibrating data including at least an independent component corresponding to the target component;a mixing coefficient calculation unit which acquires a mixing coefficient with respect to the target component of the test object, based on the observation data of the test object and the calibrating data; anda target component amount calculation unit which calculates content of the target component based on a constant of a regression equation showing a relationship between the mixing coefficient and the content corresponding to the target component, and the mixing coefficient acquired by the mixing coefficient calculation unit which are prepared in advance,wherein the mixing coefficient calculation unit performs a first preprocessing including normalization of the observation data and a second preprocessing including whitening in this order, and performs normalization after a process performed by project on null space in the first preprocessing.
  • 9. The blood component calibration apparatus according to claim 8, wherein the mixing coefficient calculation unit performs whitening by factor analysis in the second preprocessing.
  • 10. The blood component calibration apparatus according to claim 8, wherein the calibrating data acquisition unit acquires the independent component which is previously acquired as a component corresponding to the target component, as the calibrating data, andthe mixing coefficient calculation unit acquires an inner product of the independent component and the observation data of the test object, and sets the inner product as the mixing coefficient.
  • 11. The blood component calibration apparatus according to claim 8, wherein the calibrating data acquisition unit acquires the plurality of independent components when each observation data item of the plurality of samples is separated into the plurality of independent components, as the calibrating data, andthe mixing coefficient calculation unit calculates an estimated mixing matrix of the test object based on the observation data and the plurality of independent components of the test object, and extracts a mixing coefficient corresponding to the target component from the calculated estimated mixing matrix.
Priority Claims (1)
Number Date Country Kind
2013-149741 Jul 2013 JP national