1. Technical Field
The present invention relates to a technique of creating a calibration curve, which is used to derive the content of a target component in a subject, from observation data of the subject, and a technique of calculating the content of the target component in the subject.
2. Related Art
A method has been proposed in which the concentration or the like of a target component is analyzed by performing independent component analysis of observation data, which is observed at a plurality of different positions of the subject, and expressing the observation data as a linear sum of a basic function with an independent component calculated by the independent component analysis as the basic function (refer to JP-A-2007-44104).
In the known technique described above, however, there is a problem in that a plurality of different pieces of observation data for a subject are required whenever a target component of the subject is measured and the measurement can not be accurately performed from a piece of observation data.
In addition, a variety of noises may be included in the observation data. In addition, depending on the subject, the observation data may be changed due to variations in the composition or the structure of the subject. In such a case, there is a problem in that the accuracy of independent component analysis or measurement using the same is reduced.
On the other hand, in order to prevent the reduction in accuracy of independent component analysis or accuracy of measurement using the same, there is a method of performing pre-processing to reduce noise or the variation in observation data. However, there are many methods for pre-processing. For this reason, there has been a problem in that it is difficult to know which pre-processing is suitable for the observation data and which pre-processing should be selected to perform accurate measurement.
An advantage of some aspects of the invention is that accurate measurement from a piece of observation data regarding a subject can be achieved when measuring a target component of the subject.
The invention can be implemented as the following forms or application examples.
This application example is directed to a calibration curve creation method of creating a calibration curve, which is used to derive a content of a target component in a subject, from observation data of the subject. The calibration curve creation method includes: (a) acquiring the observation data for a plurality of samples of the subject; (b) acquiring the content of the target component in each sample; (c) executing pre-processing for the observation data of each sample, a pre-processing method is selected from a plurality of options; (d) estimating a plurality of independent components when separating the pre-processed observation data of each sample into a plurality of independent components and calculating a mixing coefficient corresponding to the target component for each sample based on the plurality of independent components; and (e) calculating a regression equation of the calibration curve based on the content of the target component of each of the plurality of samples and the mixing coefficient of each sample. In the process (c), the pre-processing includes first pre-processing including processing for correcting the observation data and second pre-processing including whitening, and a plurality of processing methods are prepared as processing methods of each of the first pre-processing and the second pre-processing and the pre-processing method is set by combining one or more of the processing methods of each of the first pre-processing and the second pre-processing. The process (d) includes: (i) calculating an independent component matrix including the independent component of each sample; (ii) calculating an estimated mixing matrix, which indicates a set of vectors defining a ratio of an independent component element of each independent component in each sample, from the independent component matrix; and (iii) calculating a correlation between each of the vectors included in the estimated mixing matrix and the content of the target component of each of the plurality of samples and select the vector, which is determined to have the highest correlation, as a mixing coefficient corresponding to the target component. In the process (i), the first pre-processing, the second pre-processing, and independent component analysis processing are executed in this order using the pre-processing method selected in the process (c).
According to the calibration curve creation method of Application Example 1, for a plurality of samples of the subject, the calibration curve for deriving the amount of target component included in the subject from the observation data of the subject is created from the content of the target component and the observation data acquired from each sample. For this reason, if this calibration curve is used, the content of the target component can be accurately calculated even if the number of pieces of observation data of the subject is one. Therefore, if the calibration curve is created in advance according to the calibration curve creation method of Application Example 1, it is sufficient to acquire a piece of observation data for the subject at the time of measurement. As a result, the amount of target component can be accurately calculated from a piece of observation data that is an actual measurement value. In addition, since an estimated mixing matrix is calculated and a vector highly correlated with the content of the target component of the sample is extracted from the estimated mixing matrix, it is possible to obtain the mixing coefficient with high estimation accuracy.
In addition, since the appropriate pre-processing is selected and executed according to the characteristics of the observation data of the subject, information included in the observation data of the subject can be appropriately extracted. As a result, it is possible to improve the measurement accuracy.
This application example is directed to the calibration curve creation method according to Application Example 1, wherein in the process (c), the processing methods of the first pre-processing includes a projection on null space.
According to this configuration, it is possible to improve the measurement accuracy by reducing the baseline variation of the observation data by pre-processing based on the projection on null space.
This application example is directed to the calibration curve creation method according to Application Example 1, wherein in the process (c), the processing methods of the first pre-processing includes centering.
According to this configuration, since it is possible to align the baseline of the observation data by subtracting the average value of the observation data by pre-processing based on the centering, it is possible to improve the measurement accuracy.
This application example is directed to the calibration curve creation method according to Application Example 1, wherein in the process (c), the processing methods of the first pre-processing includes normalization.
According to this configuration, since it is possible to reduce the variation in the observation data due to changes in the measurement conditions by setting the average value of the observation data to 0 and the variance to 1 by pre-processing based on the normalization, it is possible to improve the measurement accuracy.
This application example is directed to the calibration curve creation method according to Application Example 1, wherein in the process (c), the processing methods of the first pre-processing includes smoothing processing.
According to this configuration, since it is possible to reduce unnecessary random noise included in the observation data by pre-processing based on the smoothing processing, it is possible to improve the measurement accuracy.
This application example is directed to the calibration curve creation method according to Application Example 1, wherein in the process (c), the processing methods of the first pre-processing includes differential spectrum processing.
According to this configuration, since it is possible to emphasize the variation component of the observation data by pre-processing based on the differential spectrum processing, it is possible to improve the measurement accuracy.
This application example is directed to the calibration curve creation method according to Application Example 1, wherein in the process (c), the processing methods of the first pre-processing includes differential processing.
According to this configuration, since it is possible to extract a variation portion of the observation data by pre-processing based on the differential processing, it is possible to improve the measurement accuracy.
This application example is directed to the calibration curve creation method according to Application Example 1, wherein in the process (c), the processing methods of the second pre-processing includes a principal component analysis.
According to this configuration, since it is possible to perform orthogonalization and dimensional reduction of the observation data by pre-processing based on the principal component analysis, the independent component analysis processing of the process (d) can be accurately performed at high speed.
This application example is directed to the calibration curve creation method according to Application Example 1, wherein in the process (c), the processing methods of the second pre-processing includes a factor analysis.
According to this configuration, since it is possible to perform orthogonalization and dimensional reduction considering the random noise included in the observation data by pre-processing based on the factor analysis, the independent component analysis processing of the process (d) can be accurately performed at high speed.
This application example is directed to a calibration curve creation apparatus that creates a calibration curve, which is used to derive a content of a target component in a subject, from observation data of the subject. The calibration curve creation apparatus includes: a sample observation data acquisition unit that acquires the observation data for a plurality of samples of the subject; a sample target component amount acquisition unit that acquires the content of the target component in each sample; a pre-processing method selection unit that selects a processing method of a pre-processing of the observation data from a plurality of options, the pre-processing includes first pre-processing including correction processing and second pre-processing including whitening; a mixing coefficient estimation unit that estimates a plurality of independent components when separating the observation data of each sample into a plurality of independent components and calculates a mixing coefficient corresponding to the target component for each sample based on the plurality of independent components; and a regression equation calculation unit that calculates a regression equation of the calibration curve based on the content of the target component of each of the plurality of samples and the mixing coefficient of each sample. A plurality of processing methods are prepared as processing methods of each of the first pre-processing and the second pre-processing, and the pre-processing method selection unit combines one or more of the processing methods of each of the first pre-processing and the second pre-processing to set the pre-processing method having a plurality of options and selects an optimal combination from the set pre-processing method. The mixing coefficient estimation unit includes: an independent component matrix calculation section that calculates an independent component matrix including the independent component of each sample; an estimated mixing matrix calculation section that calculates an estimated mixing matrix, which indicates a set of vectors defining a ratio of an independent component element of each independent component in each sample, from the independent component matrix; and a mixing coefficient selection section that calculates a correlation between each of the vectors included in the estimated mixing matrix and the content of the target component of each of the plurality of samples and selects the vector, which is determined to have the highest correlation, as a mixing coefficient corresponding to the target component. The independent component matrix calculation section calculates the independent component matrix by executing the first pre-processing, the second pre-processing, and independent component analysis processing in this order using the pre-processing method selected by the pre-processing method selection unit.
According to the calibration curve creation apparatus of Application Example 10, similar to the calibration curve creation method of Application Example 1, it is sufficient to acquire a piece of observation data for the subject at the time of measurement. Therefore, an effect that the amount of target component can be accurately calculated from a piece of observation data, which is an actual measurement value, is obtained. In addition, since the appropriate pre-processing is selected and executed according to the characteristics of the observation data by the pre-processing method selection unit, information included in the observation data can be appropriately extracted by the mixing coefficient estimation unit. As a result, it is possible to improve the measurement accuracy.
This application example is directed to the calibration curve creation apparatus according to Application Example 10, wherein the pre-processing method selection unit includes a projection on null space as an option of the processing method of the first pre-processing.
According to this configuration, it is possible to improve the measurement accuracy by reducing the baseline variation of the observation data by pre-processing based on the projection on null space.
This application example is directed to the calibration curve creation apparatus according to Application Example 10, wherein the pre-processing method selection unit includes centering as an option of the processing method of the first pre-processing.
According to this configuration, since it is possible to align the baseline of the observation data by subtracting the average value of the observation data by pre-processing based on centering, it is possible to improve the measurement accuracy.
This application example is directed to the calibration curve creation apparatus according to Application Example 10, wherein the pre-processing method selection unit includes normalization as an option of the processing method of the first pre-processing.
According to this configuration, since it is possible to reduce the variation in the observation data due to changes in the measurement conditions by setting the average value of the observation data to 0 and the variance to 1 by pre-processing based on normalization, it is possible to improve the measurement accuracy.
This application example is directed to the calibration curve creation apparatus according to Application Example 10, wherein the pre-processing method selection unit includes smoothing processing as an option of the processing method of the first pre-processing.
According to this configuration, since it is possible to reduce unnecessary random noise included in the observation data by pre-processing based on the smoothing processing, it is possible to improve the measurement accuracy.
This application example is directed to the calibration curve creation apparatus according to Application Example 10, wherein the pre-processing method selection unit includes differential spectrum processing as an option of the processing method of the first pre-processing.
According to this configuration, since it is possible to emphasize the variation component of the observation data by pre-processing based on the differential spectrum processing, it is possible to improve the measurement accuracy.
This application example is directed to the calibration curve creation apparatus according to Application Example 10, wherein the pre-processing method selection unit includes differential processing as an option of the processing method of the first pre-processing.
According to this configuration, since it is possible to extract a variation portion of the observation data by pre-processing based on the differential processing, it is possible to improve the measurement accuracy.
This application example is directed to the calibration curve creation apparatus according to Application Example 10, wherein the pre-processing method selection unit includes a principal component analysis as an option of the processing method of the second pre-processing.
According to this configuration, since it is possible to perform orthogonalization and dimensional reduction of the observation data by pre-processing based on the principal component analysis, the calculation of the independent component matrix calculation section can be accurately performed at high speed.
This application example is directed to the calibration curve creation apparatus according to Application Example 10, wherein the pre-processing method selection unit includes a factor analysis as an option of the processing method of the second pre-processing.
According to this configuration, since it is possible to perform orthogonalization and dimensional reduction considering the random noise included in the observation data by pre-processing based on the factor analysis, the calculation of the independent component matrix calculation section can be accurately performed at high speed.
This application example is directed to the calibration curve creation apparatus according to Application Example 10, wherein the calibration curve creation apparatus further includes a storage unit that stores the independent component matrix calculated by the independent component matrix calculation section, a target component rank indicating at which position of the estimated mixing matrix the mixing coefficient selected by the mixing coefficient selection section is present, and a regression equation calculated by the regression equation calculation unit.
According to this configuration, the calibration curve creation apparatus can store the independent component matrix, the target component rank, and the regression equation in the storage unit.
This application example is directed to a target component gauging apparatus that calculates a content of a target component in a subject. The target component gauging apparatus includes: a subject observation data acquisition unit that acquires observation data of the subject; a data-for-measurement acquisition unit that acquires measurement data including at least an independent component corresponding to the target component; a mixing coefficient calculation unit that calculates a mixing coefficient with respect to the target component for the subject based on the measurement data and the observation data of the subject; and a target component amount calculation unit that calculates the content of the target component based on a constant of a regression equation indicating a relationship between a content and a mixing coefficient corresponding to the target component, which is prepared in advance, and the mixing coefficient calculated by the mixing coefficient calculation unit. The mixing coefficient calculation unit executes a pre-processing method, which is selected by a pre-processing method selection unit of a calibration curve creation apparatus that calculates the independent component, as first pre-processing including processing for correcting the observation data and second pre-processing including whitening, in this order.
According to the target component gauging apparatus, the content of the target component in the subject can be accurately calculated just by acquiring apiece of observation data regarding the subject.
This application example is directed to the target component gauging apparatus according to Application Example 20, wherein the data-for-measurement acquisition unit acquires an independent component, which corresponds to the target component and is calculated in advance, as the measurement data, and the mixing coefficient calculation unit calculates an inner product of the independent component and the observation data of the subject and sets an value of the inner product as the mixing coefficient.
According to the target component gauging apparatus, a mixing coefficient highly correlated with the target component of the subject can be accurately and easily calculated.
This application example is directed to the target component gauging apparatus according to Application Example 20, wherein the data-for-measurement acquisition unit acquires, as the data for measurement, a plurality of independent components when separating observation data of a plurality of samples into a plurality of independent components, and the mixing coefficient estimation unit calculates an estimated mixing matrix for the subject based on the observation data of the subject and the plurality of independent components, and extracts a mixing coefficient corresponding to the target component from the calculated estimated mixing matrix.
According to the target component gauging apparatus, a mixing coefficient highly correlated with the target component of the subject can be accurately calculated.
In addition, the invention can be realized in various forms other than those described above. For example, the invention can also be realized in a form as a target component gauging apparatus that stores the regression line calculated by the calibration curve creation method in a memory, a form as a computer program to realize as a function the configuration of each unit included in the target component gauging apparatus, and a storage medium (non-transitory storage medium) on which the computer program or the computer program is recorded.
The invention will be described with reference to the accompanying drawings, wherein like numbers reference like elements.
Hereinafter, embodiments of the invention will be described in the following order.
A. Calibration curve creation method
B. Target component measuring method
C. Various algorithms and influences on the measurement accuracy
D. Modification examples
In the present embodiment, the following abbreviations are used.
Hereinafter, embodiments of the invention will be described. An embodiment is related to a method of creating the calibration curve for deriving the chlorophyll content in green vegetables from the spectrum of the spectral reflectance of the green vegetables as observation data. The green vegetables are spinach, Chinese cabbage, and a green pepper, for example.
The step 1 is a preparatory step, and is performed by the operator. The operator prepares a plurality of green vegetables (for example, spinach) of the same type, which have different freshness, as samples. In the present embodiment, n (n is an integer of 2 or more) samples are used.
The step 2 is a spectrum measurement step, and is performed by the operator using a spectrometer. The operator measures the spectrum of the spectral reflectance for each sample by imaging each of the plurality of samples prepared in step 1 using the spectrometer. The spectrometer is a known instrument that measures a spectrum by making light from a measured object be transmitted through a spectroscope and receiving the spectrum output from the spectroscope on the imaging surface of an imaging device. The relationship expressed as in the following Expression (1) is satisfied between the spectrum of the spectral reflectance and the spectrum of absorbance.
[Absorbance]=−log10[Reflectance] (1)
The spectrum of the measured spectral reflectance is converted into the absorbance spectrum using Expression (1). Conversion into the absorbance spectrum is performed because a linear combination needs to be established in the mixed signal analyzed in the independent component analysis, which will be described later, and the linear combination is established for the absorbance from the Lambert-Beer's law. Therefore, in step 2, it is also possible to measure the absorbance spectrum instead of the spectral reflectance spectrum. As a measurement result, data of absorbance distribution showing the characteristics with respect to the wavelength of the measured object is output. The data of absorbance distribution is also referred to as spectral data.
Specifically, in step 2, the operator images a predetermined portion for each sample, and measures the spectrum of the predetermined portion. The predetermined portion may be any portion in each sample, but a portion having freshness that is not greatly different from that of the entire sample is preferable. For example, when the freshness of a certain portion in a sample is extremely low, a portion excluding the portion with low freshness is set as a predetermined portion to be measured.
In addition, instead of measuring the spectral reflectance spectrum or the absorbance spectrum using a spectroscope, it is possible to estimate these spectra from other measured values. For example, it is also possible to measure a sample with a multi-band camera and estimate the spectral reflectance or the absorbance spectrum from the obtained multi-band image. As such an estimation method, for example, a method disclosed in JP-A-2001-99710 can be used.
The step 3 is a step of measuring the chlorophyll content, and is performed by the operator. The operator measures the chlorophyll content, which is the content of a target component in each sample, by chemically analyzing each of the plurality of samples prepared in step 1. Specifically, a predetermined portion is extracted from each sample, chlorophyll that is a target component is extracted from the predetermined portion, and the chlorophyll content is measured. Although the “predetermined portion” may be any portion of the sample, it is preferable that the “predetermined portion” be the same as the portion in which the spectrum has been measured in step 2.
The step 4 is a pre-processing selection step, and is performed using a personal computer.
The computer 100 is a known apparatus including a CPU 10 that executes various kinds of processes and control when executing a computer program (hereinafter, simply referred to as a “program”), a memory 20 (storage unit) that is a data storage location, a hard disk drive 30 that stores a program or data and information, an input interface (I/F) 50, and an output interface (I/F) 60.
The step 4 is a step of selecting the combination of pre-processing, and is performed by a personal computer.
A first pre-processing section 470 can select processing from variations of standard normal variate transformation (SNV) 472 and projection on null space (PNS) 474 and perform the pre-processing in combination.
The SNV 472 is a process for obtaining normalized data, in which the average value is 0 and the standard deviation is 1, by subtracting the average value of data to be processed and dividing the result by the standard deviation.
The PNS 474 is a process for removing a baseline variation included in the data to be processed. In the measurement of the spectrum, a variation between data called a baseline variation, such as an increase or decrease in the average value of data, occurs in the measurement data due to various factors. For this reason, it is preferable to remove the variation factors before performing the independent component analysis. The PNS can be used as pre-processing that can remove any baseline variation.
Assuming that the order of the target baseline variation is zero-order, first-order, and second-order, the PNS can remove the baseline variation of any combination thereof.
In addition, the PNS is described in Zeng-Ping Chen, Julian Morris, and Elaine Martin, “Extracting Chemical Information from Spectral Data with Multiplicative Light Scattering Effects by Optical Path-Length Estimation and Correction”, 2006, for example.
Although the PNS is a method for removing the variation of the baseline whose influence changes algebraically functionally in the data length direction, it depends on target measurement data which-order algebraic function influence is to be removed. Therefore, there is a plurality of variations in the PNS depending on a method of selecting the baseline to be removed.
In addition, when performing the SNV 472 on the spectral data obtained in step 2 of
In addition, as the first pre-processing, it is possible to perform processing other than the SNV or the PNS. In the first pre-processing, it is preferable to perform certain normalization processing, but the normalization processing may be omitted. The first pre-processing section 470 is also referred to as a “correction processing section” hereinbelow. Details of these two processes 472 and 474 will be further described later.
A second pre-processing section 480 can perform pre-processing using either a principal component analysis (PCA) 482 or a factor analysis (FA) 484. In addition, as the second pre-processing, it is possible to use processing other than the PCA or the FA. The second pre-processing section 480 is also referred to as a “whitening processing section” hereinbelow. In a general ICA method, dimensional compression of data to be processed and decorrelation are performed as the second pre-processing. Since a transformation matrix to be calculated by the ICA is limited to an orthogonal transformation matrix by the second pre-processing, it is possible to reduce the amount of calculation in the ICA. Such second pre-processing is called “whitening”, and the PCA is used in many cases. In the PCA, however, when random noise is included in the data to be processed, an erroneous result may be obtained due to the influence. Therefore, in order to reduce the influence of random noise, it is preferable to perform the whitening using the FA, which has robustness against noise, instead of the PCA. The second pre-processing section 480 shown in
As the pre-processing, it is preferable to select an appropriate combination of processes from the above processes according to the characteristics of observation data and perform the selected combination of processes. In order to determine which combination of processes is appropriate, possible combinations of pre-processing are evaluated, and the most accurate combination is selected as the pre-processing. In order to find a combination of pre-processing that is optimal for the target sample observation data, the regression equation of the calibration curve is calculated for each combination, and the accuracy is evaluated.
When the SNV and the PNS are used in the first pre-processing and the PCA and the FA are used in the second pre-processing, examples of the combination of pre-processing shown in
In step 4, these combinations of pre-processing are selected sequentially from the start, and are executed for the observation data. For the pre-processed observation data, the regression equation of the calibration curve is obtained through steps 5 and 6 to be described later. Then, the accuracy is evaluated in step 7. These steps are repeated to evaluate the accuracy for all combinations of pre-processing and select an optimal combination of pre-processing.
Although the result of pre-processing is evaluated to select pre-processing in the present embodiment, other methods may be used as a method of selecting pre-processing. The operator may select pre-processing from the list of pre-processing.
The step 5 is a step of estimating a mixing coefficient, and is performed using a personal computer.
The spectrometer 200 shown in
As a result of the acquisition of the spectral data and the chlorophyll content described above, a data set including the spectral data and the chlorophyll content (hereinafter, referred to as a “measured data set”) DS1 is stored in the hard disk drive 30 of the computer 100.
The CPU 10 loads a predetermined program stored in the hard disk drive 30 to the memory 20 and executes the program to perform a process for estimating the mixing coefficient that is the operation of step 4. The predetermined program can also be downloaded from the outside using a network, such as the Internet. In step 4, the CPU 10 functions as the mixing coefficient estimation unit 440 shown in
The independent component analysis (ICA) is one of the multi-dimensional signal analysis methods, and is a technique for observing a mixed signal, in which independent signals overlap each other, in some different conditions and separating the original independent signals based on the result. By using the independent component analysis, the spectrum as an independent component can be estimated from the spectral data (observation data) obtained in step 2 by regarding the spectral data obtained in step 2 as mixed data of “m” (unknown) independent components including the spectrum due to chlorophyll.
In the present embodiment, the independent component analysis is performed when the three processing sections 470, 480, and 490 shown in
Subsequent to the first pre-processing of the first pre-processing section 470 and the second pre-processing of the second pre-processing section 480, the independent component analysis processing of the independent component analysis processing section (ICA processing section) 490 is performed.
Although a plurality of methods are prepared for the first pre-processing and the second pre-processing, it depends on target measurement data which pre-processing is appropriate. Therefore, each process in each combination of pre-processing prepared is performed for evaluation, and pre-processing determined to have the highest performance is used eventually.
The independent component analysis processing section (ICA processing section) 490 estimates the spectrum as an independent component by performing the ICA on the spectral data subjected to the first pre-processing and the second pre-processing. Generally, in the ICA, a high-order statistic indicating the independence of separated pieces of data is used as an indicator for the separation of independent components (independence indicator). For example, kurtosis is a typical independence indicator. In addition to kurtosis, it is also possible to use indicators, such as β divergence, as independence indicators of ICA.
Next, the typical processing of independent component analysis will be described in detail. It is assumed that the spectrum S (hereinafter, this spectrum may be simply referred to as an “unknown component”) of “m” unknown components (source) is given as a vector of the following Expression (2) and “n” spectral data X obtained in step 2 is given as a vector of the following Expression (3). Each element (S1, S2, . . . , Sm) included in Expression (2) is a vector (spectrum). That is, for example, an element S1 is expressed as in Expression (4). Elements (X1, X2, . . . , Xn) included in Expression (3) are also vectors. For example, the element X1 is expressed as in Expression (5). Subscript 1 is the number of wavelength ranges where the spectrum has been measured. In addition, the number of elements m of the spectrum S of unknown components is an integer of 1 or more, and is determined experimentally or empirically in advance according to the type (here, spinach) of sample.
S=[S
1
,S
2
, . . . ,S
m]T (2)
X=[X
1
,X
2
, . . . ,X
n]T (3)
S
1
={S
11
,S
12
, . . . ,S
11} (4)
X
1
={X
11
,X
12
, . . . ,X
11} (5)
Each unknown component is assumed to be statistically independent. The relationship of the following Expression (6) is satisfied between the unknown component S and the spectra data X.
X=A·S (6)
A in Expression (6) is a mixing matrix, and can be expressed as in the following Expression (7). Although the letter “A” needs to be expressed in bold as shown in Expression (7), the letter “A” is expressed in a normal letter herein from the limitation of letters used in the specification. Hereinafter, other bold letters representing the matrix are also similarly expressed in normal letters.
The mixing coefficient aij included in the mixing matrix A indicates the degree of contribution of the unknown component Sj (j=1 to m) to the spectral data Xi (i=1 to n) that is observation data.
When the mixing matrix A is known, the least square solution of the unknown component S can be easily calculated as A+·X by using a pseudo-inverse matrix A+ of A. In the present embodiment, however, since the mixing matrix A is unknown, the unknown component S and the mixing matrix A should be estimated only from the observation data X. That is, as shown in the following Expression (8), a matrix showing the spectrum as an independent component (hereinafter, referred to as an “independent component matrix”) Y is calculated only from the observation data X using the separation matrix W (m×n). As an algorithm for calculating the separation matrix W in the following Expression (8), it is possible to adopt various algorithms, such as Infomax, Fast Independent Component Analysis (Fast ICA), and Joint Approximate Diagonalization of Eigenmatrices (JADE).
Y=W·X (8)
The independent component matrix Y corresponds to the estimate of the unknown component S. Therefore, the following Expression (9) can be obtained, and the following Expression (10) can be obtained by transforming Expression (9).
X=·Y (9)
Â=X·Y
+ (10)
The estimated mixing matrix ΛA obtained by Expression (10) (written in this manner from the limitation of letters used in the specification, but means a signed letter on the left side of Expression (10) in practice. The same for the other letters) can be expressed as in the following Expression (11).
In step S110 of
After the execution of step S110, the CPU 10 performs processing for calculating the independent component matrix Y based on the separation matrix W and the spectral data X of each sample, which is obtained in step 2 and is stored in advance in the hard disk drive 30 (step S120). In this calculation processing, calculation is performed according to Expression (8) described above. In the processing of steps S110 and S120, the CPU 10 functions as the independent component matrix calculation section 442 shown in
Then, the CPU 10 performs processing for calculating the estimated mixing matrix ΛA based on the spectral data X of each sample stored in advance in the hard disk drive 30 and the independent component matrix Y calculated in step S120 (step S130). In this calculation processing, calculation is performed according to Expression (10) described above.
The estimated mixing matrix ΛA is obtained by the processing up to step S130. That is, the coefficient (estimated mixing coefficient) Λaij included in the estimated mixing matrix ΛA is obtained. Then, the process proceeds to step S140.
In step S140, CPU 10 calculates a correlation (degree of similarity) between the chlorophyll content C1, C2, . . . , Cn measured in step 3 and components (hereinafter, referred to as a vector Λα) of each column included in the estimated mixing matrix ΛA calculated in step S130. Specifically, a correlation between the chlorophyll content C (C1, C2, . . . , Cn) and the vector Λα1 (Λa11, Λa21, . . . , Λan1) of the first column is calculated, and then a correlation between the chlorophyll content C (C1, C2, . . . , Cn) and the vector Λα2 (Λa12, Λa22, . . . , Λan2) of the second column is calculated. In this manner, a correlation between the chlorophyll content C and the vector of each column is sequentially calculated, and a correlation between the chlorophyll content C (C1, C2, . . . , Cn) and the vector Λαm (Λa1m, Λa2m, . . . , Λanm) of the m-th column is finally calculated.
Such a correlation can be calculated by using a correlation coefficient R according to the following Expression (12). The correlation coefficient R is called a Pearson's product-moment correlation coefficient.
where
As a result of step S140 of
The selection in step S150 means selecting a column from a plurality of columns in the table TB shown in
The step 6 is a step of calculating the regression equation, and is performed using the computer 100 in the same manner as when performing step S. In step 6, the computer 100 performs processing for calculating the regression equation of the calibration curve. In addition, data up to step S may be transferred to another computer to perform step 6.
F:C=u{circumflex over (α)}
k
+v (13)
After the execution of step S210, the CPU 10 stores a combination method of the constants u and v of the regression equation F calculated in step S210, the target component rank k (
The step 7 is an algorithm evaluation step, and is performed using the computer 100 in the same manner as when performing steps 5 and 6.
One of the combinations of pre-processing is selected in step 4, mixing coefficient calculation processing is performed, and the regression line of the calibration curve is calculated. The accuracy of the calibration curve in this case is evaluated, it is evaluated how much the combination of pre-processing selected in step 4 is effective for the observation data, and the combination of pre-processing selected in step 4 is compared with other combinations of pre-processing. A correlation coefficient between the mixing coefficient and the true value can be used for the evaluation. A result when calculating the measurement accuracy SEP by measuring the sample data using the calibration curve can be used.
Based on the evaluation result, a combination of pre-processing with the highest accuracy among the combinations of pre-processing evaluated up to now is determined as a candidate of pre-processing. When there is a combination of pre-processing that has not been evaluated yet, the process returns to step 4 to evaluate the next pre-processing. When the evaluation of all pre-processing ends, the current pre-processing candidate is adopted as pre-processing for the target observation data.
Next, the target component measuring method will be described. A subject is assumed to contain the same components as a sample used when creating the calibration curve. Specifically, the target component measuring method is performed using a computer. In addition, the computer herein may be the computer 100 used when creating the calibration curve, or may be a different computer.
X
p
={X
p1
,X
p2
, . . . ,X
pl} (14)
In the processing of step S310, the CPU 10 functions as the subject observation data acquisition unit 510 shown in
In the processing of step S315, the CPU 10 functions as the data-for-measurement acquisition unit 520 shown in
After the execution of step S315, pre-processing is performed on the absorbance spectrum Xp of the subject obtained in step S310 (step S325). As this pre-processing, the same processing as the pre-processing (that is, the normalization processing of the first pre-processing section 470 and the whitening processing of the second pre-processing section 480) used in step 4 of
Then, the CPU 10 performs processing for calculating the estimated mixing matrix ΛA regarding the subject based on the independent component matrix Y included in the data set for measurement DS2 and the pre-processed spectrum obtained in step S325 (step S335). Specifically, arithmetic processing according to Expression (10) described above is performed. The estimated mixing matrix ΛA is obtained by calculating the inverse matrix (pseudo-inverse matrix) Y+ of the independent component matrix Y included in the data set for measurement DS2 and multiplying the pre-processed spectrum obtained in step S325 by the pseudo-inverse matrix Y.
As shown in the following Expression (15), the estimated mixing matrix ΛA in the measurement process is a row vector (“1×m” matrix) configured to include mixing coefficients corresponding to the respective independent components. After the execution of step S335, the CPU 10 reads the target component rank k included in the data set for measurement DS2 from the hard disk drive 30, extracts the mixing coefficient Λαk of the k-th component corresponding to the target component rank k from the estimated mixing matrix ΛA calculated in step S335, and temporarily stores the mixing coefficient Λαk in the memory 20 as a mixing coefficient of chlorophyll that is a target component (step S340). In the processing of steps S325, S335, and S340, the CPU 10 functions as the mixing coefficient calculation unit 530 shown in
{circumflex over (Λ)}=({circumflex over (α)}1,{circumflex over (α)}2, . . . ,{circumflex over (α)}m) (15)
Then, the CPU 10 reads the constants u and v of the regression equation included in the data set for measurement DS2 from the hard disk drive 30, and calculates the content C of chlorophyll by substituting the constants u and v and the mixing coefficient Λαk of chlorophyll as a target component, which is obtained in step S340, into the right side of Expression (13) (step S350). The content C is calculated as a mass of chlorophyll per unit mass (for example, per 100 g) of the subject. In the processing of step S350, the CPU 10 functions as the target component amount calculation unit 540 shown in
In the present embodiment, the content C (mass per unit mass) calculated in step S350 is used as the content of chlorophyll in the subject. However, instead of this, the content C calculated in step S350 may be corrected using the normalization coefficient used in the normalization of step S325 and the corrected value may be used as the content to be calculated. Specifically, the absolute value (gram) of the content may be calculated by multiplying the content C by the standard deviation. According to this configuration, it is possible to calculate the content C more accurately depending on the type of target component.
According to the calibration curve creation method of the embodiment configured as described above, the chlorophyll content can be accurately calculated from one spectrum that is an actual measurement value of the green vegetable as a subject.
Hereinafter, various algorithms used in the first pre-processing section 470, the second pre-processing section 480, and the independent component analysis processing section 490 shown in
The difference in accuracy by the combination of pre-processing is shown using actual observation data as an example. Food data is used as a target.
As the first pre-processing performed by the first pre-processing section 470, standard normal variate transformation (SNV) and projection on null space (PNS) can be used.
The SNV is given as the following Expression (16).
Here, z is data after processing, x is data to be processed (absorbance spectrum in the present embodiment), xave is the average value of the data to be processed x, and σ is a standard deviation of the data to be processed x. As a result of standard normal variate transformation, the normalized data z whose average value is 0 and standard deviation is 1 is obtained.
By performing the PNS, it is possible to reduce the baseline variation included in the data to be processed. In the measurement of data to be processed (absorbance spectrum in the present embodiment), a variation between data called a baseline variation, such as an increase or decrease in the average value of data, occurs in the measurement data due to various factors. For this reason, it is preferable to remove the variation factors before performing the independent component analysis (ICA). The PNS can be used as pre-processing that can reduce any baseline variation of the data to be processed. In particular, for the measurement data of the absorbed light spectrum or the reflected light spectrum including an infrared region, the advantage of applying the PNS is large since such a baseline variation occurs frequently. The principle of removing the baseline variation, which is included in data obtained by measurement (simply referred to as “measurement data x”), by the PNS will be described below. In addition, as a typical example, a case will be described in which the measurement data is an absorbed light spectrum or a reflected light spectrum including an infrared region will be described. However, the PNS can also be similarly applied for other types of measurement data (for example, sound data).
Generally, in an ideal system, the measurement data x (data to be processed x) is expressed as in the following Expression (17) using “m” (m is an integer of 2 or more) independent components si (i=1 to m) and each mixture ratio ci.
Here, A is a matrix (mixing matrix) formed with the mixture ratio ci.
Also in the independent component analysis (ICA), processing is performed on the assumption that this model is used. However, there are various variation factors (condition of a sample, changes in the measurement environment, and the like) in actual measurement data. Therefore, as a model that takes these variation factors into consideration, a model that expresses the measurement data x as in the following Expression (18) can be considered.
Here, b is a parameter indicating a variation in the amplitude direction of the spectrum, a, d, and e are parameters indicating the amount of constant baseline variation E (also referred to as “average value variation”), the amount of variation λ that linearly depends on the wavelength, and the amount of variation λ2 that depends on the square of the wavelength, respectively, and ε is other variation components. In addition, the constant baseline variation E is given as E={1, 1, 1, . . . , 1}T, and is a constant vector whose data length is equal to the data length (the number of segments of the wavelength range) of the measurement data x. The variations λ and λ2 depending on the wavelength are given as λ={λ1, λ2, . . . , λN}T and λ2={λ12, λ22, . . . , λN2}T, respectively. Here, N is the data length of the measurement data x. In addition, as a variation depending on the wavelength, third-order or higher variations can also be taken into consideration. In general, it is possible to take into consideration up to the g-th order variation λg (g is an integer of 2 or more).
In the PNS, data in which the baseline variation components E, λ, λ2, . . . , λg (g is an integer of 2 or more) have been reduced can be obtained by considering the space including the baseline variation components E, λ, λ2, . . . , λg and projecting the measurement distance x to the space (null space) that does not include these variation components. As specific calculation, the data z after processing of the PNS is calculated by the following Expression (19).
Here, P+ is a pseudo-inverse matrix of P. ki is a result obtained by projecting the component si of Expression (18) to the null space not including that does not include variation components. In addition, ε* is a result obtained by projecting the variation component ε of Expression (18) to the null space.
In addition, by performing normalization (for example, the SNV) after processing of the PNS, it is also possible to eliminate the influence of the variation b in the amplitude direction of the spectrum in Expression (18).
An independent component obtained by performing the ICA on the data pre-processed by the PNS is an estimate of the component ki of Expression (19), which is different from the true component si. However, since the mixture ratio ci is not changed from the value in original Expression (18), there is no influence on the measurement process (
The order of the variation to be removed by the PNS can be removed in any combination. Since these variations are error factors in the ICA or the measurement, removing the variations in advance is desirable in many cases. However, not only the variation components but also information required for the measurement may be removed together. Depending on the characteristics of the observation data, it may be better to leave the required information even if there are variations in order to improve the measurement accuracy. Therefore, as the processing of the PNS, when zero-order, first-order, and second-order variations are considered, it is possible to remove variation component combinations, such as [zero-order, first-order, second-order], [zero-order, first-order], [zero-order, second-order], and [zero-order], for example.
In addition, details of the PNS is described in Zeng-Ping Chen, Julian Morris, and Elaine Martin, “Extracting Chemical Information from Spectral Data with Multiplicative Light Scattering Effects by Optical Path-Length Estimation and Correction”, 2006, for example.
As second pre-processing performed by the second pre-processing section 480, principal component analysis (PCA) and factor analysis (FA) can be used.
In a general ICA method, dimensional compression of data to be processed and decorrelation are performed as pre-processing. Since a transformation matrix to be calculated by the ICA is limited to an orthogonal transformation matrix by this pre-processing, it is possible to reduce the amount of calculation in the ICA. Such pre-processing is called “whitening”, and the PCA is used in many cases. The whitening using the PCA is described in detail in Chapter 6 of Aapo Hyvarinen, Juha Karhumen, Erkki Oja, “Independent Component Analysis”, 2001, John Wiley & Sons, Inc., for example.
In the PCA, however, when random noise is included in the data to be processed, an erroneous result may be obtained due to the influence of the random noise. Then, in order to reduce the influence of random noise, it is preferable to perform the whitening using the factor analysis (FA), which has robustness against noise, instead of the PCA. Hereinafter, the principle of the whitening using the FA will be described.
As described above, generally, in the ICA, a linear mixture model (above Expression (17)) that expresses the data to be processed x as a linear sum of the component si is assumed, and the mixture ratio ci and the component si are calculated. However, random noise as well as the component si is added to actual data in many cases. Therefore, as a model that takes random noise into consideration, a model that expresses the measurement data x as in the following Expression (20) can be considered.
X=A·s+ρ (20)
Here, ρ is random noise.
In addition, it is possible to obtain an estimation of the mixing matrix A and the independent component si by performing whitening considering the noise mixture model and then performing the ICA.
In the FA of the present embodiment, it is assumed that the independent component si and the random noise ρ follow the normal distribution N(0, Im) and N(0, Σ), respectively. In addition, as is generally known, the first parameter x1 of the normal distribution N(x1, x2) indicates an expected value, and the second parameter x2 indicates a standard deviation. In this case, since the data to be processed x is a linear sum of the variable according to the normal distribution, the data to be processed x also follows the normal distribution. Here, assuming that the covariance matrix of the data to be processed x is V [x], the normal distribution that the data to be processed x follow can be expressed as N(0, V[x]). In this case, the likelihood function regarding the covariance matrix V[x] of the data to be processed x can be calculated by the following procedure.
First, assuming that the independent components si are perpendicular to each other, the covariance matrix V[x] of the data to be processed x is calculated by the following Expression (21).
V[x]=E[xx
T
]=AA
T+Σ (21)
Here, Σ is a covariance matrix of the noise ρ.
Thus, the covariance matrix V[x] can be expressed by the mixing matrix A and the covariance matrix Σ of noise. In this case, the logarithmic likelihood function L(A, Σ) is given as the following Expression.
Here, n is the number of pieces of data x, m is the number of independent components, an operator tr is a trace (sum of diagonal elements) of a matrix, and an operator det is a determinant. In addition, C is a sample covariance matrix obtained by sample calculation from the data x, and is calculated by the following Expression.
The mixing matrix A and the covariance matrix Σ of noise can be calculated from the maximum likelihood method using the logarithmic likelihood function L(A, Σ) of the above Expression (22). As the mixing matrix A, it is possible to obtain a matrix that is hardly influenced by the random noise ρ of the above Expression (20). This is the basic principle of the FA. In addition, as the algorithm of the FA, there are various algorithms using the algorithm other than the maximum likelihood method. Also in the present embodiment, it is possible to use such various kinds of FA.
Incidentally, the estimate obtained by the FA is just the value of AAT. When the mixing matrix A suitable for this value is determined, it is possible to de-correlate the data while reducing the influence of random noise. However, since the degree of freedom of rotation remains, it is not possible to determine each of the plurality of components si uniquely. On the other hand, the ICA is processing for reducing the degree of freedom of rotation of the plurality of components si so that the plurality of components si are perpendicular to each other. In the present embodiment, therefore, the value of the mixing matrix A calculated by the FA is used as a whitening matrix (matrix after whitening), and the arbitrary property with respect to the left rotation is specified by the ICA. Thus, by performing the ICA after performing the whitening processing robust against noise, it is possible to determine the independent component si perpendicular to each other. In addition, as a result of such processing, it is possible to improve the measurement accuracy regarding the component si by reducing the influence of random noise.
The FA can be considered to be an extension corresponding to the noise of the PCA. In the FA, as a precondition for this extension, it is assumed that noise is normally distributed. This assumption is reasonable in many cases, and better performance can be expected. However, depending on the characteristics of the observation data, the accuracy may not be stable or may not be improved by the FA from the reason that the noise distribution deviates from the normal distribution, for example. In this case, it is appropriate to perform the known process using the PCA.
Generally, in the independent component analysis (ICA), a high-order statistic indicating the independence of separated pieces of data is used as an indicator for the separation of independent components (independence indicator). Kurtosis is a typical independence indicator. The ICA using kurtosis as an independence indicator is described in detail in chapter 8 of Aapo Hyvarinen, Juha Karhumen, Erkki Oja, “Independent Component Analysis”, 2001, John Wiley & Sons, Inc., for example.
The invention is not limited to the above-described embodiment or modification examples thereof, but various modifications may be made within the scope without departing from the subject matter or the spirit of the invention. For example, the following modification examples are also possible.
In the embodiment described above, the subject observation data acquisition unit 510 (
(i) The data set for measurement DS2 stored in the hard disk drive 30 is read, and an element (independent component) Yk of the k-th column corresponding to the target component rank k is acquired from the independent component matrix Y included in the data set for measurement DS2. The independent component Yk has the highest correlation to the chlorophyll content, and corresponds to the chlorophyll content.
(ii) Subsequently, an inner product of the extracted independent component Yk and the spectrum Xp (for example, the normalized spectrum obtained in step S320) of the subject that is observation data is calculated, and the inner product value is set as the mixing coefficient αk of the target component. That is, calculation according to the following Expression (24) is performed.
αk=Xp·Yk (24)
Here, the observation data is a linear sum of independent components, and it is assumed that the orthogonality of independent components is sufficiently high. Therefore, by calculating the inner product of the independent component matrix of the target component and the spectrum that is observation data, only the values of the independent components remain and all of the other components become 0. As a result, it becomes easy to calculate the mixing coefficient αk of the target component. However, when the orthogonality of independent components are not sufficiently high, it is preferable to calculate the estimated mixing matrix ΛA of Expression (15) without using the calculation of Expression (27).
In the process (i) described above, the CPU 10 functions as a data-for-measurement acquisition unit. In the process (ii) described above, the CPU 10 functions as a mixing coefficient calculation unit. Instead of the configuration of the above (i), the data-for-measurement acquisition unit may be configured to acquire the independent component Yk from a storage unit, such as the hard disk drive 30 in which the element (independent component) Yk of the k-th column corresponding to the target component rank k in the independent component matrix Y is stored in advance. This is because only independent components corresponding to the target component are necessary and other independent components are not necessary when using the inner product. In this case, the independent component becomes a vector, and it is not necessary to store the target component rank.
In the embodiment and the modification example described above, the chlorophyll content of a subject, which is a green vegetable, is detected. However, instead of the chlorophyll content of the green vegetable, applications to various subjects and target components, such as oleic acid in meat and collagen in the skin. In short, if a sample having the same components as a subject is prepared to create a calibration curve, it is possible to correspond to various subjects and target components. In the embodiment and each modification example described above, a configuration is adopted in which measurement is performed with the absorbance spectrum as observation data. However, even if sound data in which sound emitted from a plurality of sound sources is mixed is used as the observation data instead of the absorbance spectrum, it is possible to measure the magnitude of the sound from the specific sound source with the same configuration. In short, in the case of a signal having a sufficient amount of information to know the statistical properties of the signal source, the invention can be applied to various kinds of observation data.
In the embodiment and each modification example described above, in the mixing coefficient estimation step, an independent component matrix is calculated, an estimated mixing matrix is calculated, and a mixing coefficient corresponding to the target component is extracted from the estimated mixing matrix. However, this configuration does not necessarily need to be adopted. In short, it is possible to adopt any configuration in which each independent component, which is included in observation data of each sample, when dividing the observation data into a plurality of independent components is estimated and a mixing coefficient corresponding to the target component is calculated for each sample based on each independent component.
In the calibration curve creation methods of the embodiment and each modification example described above, the content of the target component in each sample is measured. However, instead of this configuration, it is also possible to prepare a sample containing a target component whose content is known and input the content through a keyboard or the like.
In the embodiment and each modification example described above, the number of elements m of the spectrum S of an unknown component is determined experimentally or empirically in advance. However, the number of elements m of the spectrum S of the unknown component may also be determined according to the information criteria known as Minimum Description Length (MDL) or Akaike Information Criteria (AIC). When the MDL or the like is used, the number of elements m of the spectrum S of the unknown component can be automatically determined by calculation from the observation data of the sample. In addition, the MDL is described in “Independent component analysis for noisy data—MEG data analysis, 2000”, for example.
In the embodiment and each modification example described above, a subject that is the target of the measurement process has the same components as a sample used when creating the calibration curve. However, when calculating the mixing coefficient using an inner product as in the modification example 1, an unknown component other than the same component as the sample used when creating the calibration curve may be contained in the subject. Since the inner product of independent components is assumed to be 0, the inner product of independent components corresponding to the unknown component can also be considered to be 0. Therefore, the influence of the unknown component can be neglected when calculating the mixing coefficient using an inner product.
The computer used in the embodiment and each modification can be replaced with a dedicated apparatus instead of a personal computer. For example, the personal computer to realize the target component measuring method can be replaced with a dedicated gauging apparatus.
In the embodiment described above, the input of the spectrum of the spectral reflectance of a sample or a subject is performed by inputting the spectrum measured by the spectrometer. However, the invention is not limited to this. For example, it is also possible to estimate a spectrum from a plurality of band images having different wavelength bands and input this spectrum. The band images are obtained by imaging a sample or a subject using a multi-band camera including a filter capable of changing the transmission wavelength band.
In the embodiment and each modification example described above, the function realized by software may also be realized by hardware.
In addition, elements in the embodiment and each modification example described above, which are not elements mentioned in the appended independent claims, are additional elements, and may be appropriately omitted.
In the embodiment described above, as a pre-processing selection method, a method of selecting the optimal pre-processing by repeating the selection of pre-processing in step 4 and the evaluation in step 7 is adopted. However, it is possible to use other methods. For example, the operator may select pre-processing in step 4, and step 7 may not be performed.
The entire disclosure of Japanese Patent Application No. 2013-065763, filed Mar. 27, 2013 is expressly incorporated by reference herein.
Number | Date | Country | Kind |
---|---|---|---|
2013-065763 | Mar 2013 | JP | national |