MATERIAL SAMPLE, DISPLAY METHOD, AND ESTIMATION METHOD

Information

  • Patent Application
  • 20200293934
  • Publication Number
    20200293934
  • Date Filed
    March 13, 2020
    4 years ago
  • Date Published
    September 17, 2020
    4 years ago
Abstract
Provided is a technique capable of adequately presenting the product specifications of a material sample even when the concentration distribution of the material is asymmetrical. A material sample according to the present disclosure stores a material having variation according to a probability distribution. As the product specifications of the material sample, the representative value of the probability distribution as well as an interval in which the amount of the material is greater than or equal to a target probability on the probability distribution is displayed.
Description
CROSS REFERENCE TO RELATED APPLICATIONS

The present application claims priority from Japanese patent applications JP 2019-046887 filed on Mar. 14, 2019, JP 2019-232236 filed on Dec. 24, 2019, and JP 2020-015263 filed on Jan. 31, 2020, the entire content of which are hereby incorporated by reference into this application.


BACKGROUND
Field of the Invention

The present disclosure relates to a technique for quantitative determination of a material stored in a container.


Description of the Related Art

To measure the detection limit of a genetic testing device or create a calibration curve for a low-concentration region for quantitative determination, it is necessary to use a low-copy-number nucleic acid reference material with guaranteed accuracy for quantitative determination. As a method of creating a quantified nucleic acid reference material, a method of performing limiting dilution of a DNA (deoxyribo nucleic acid) solution, or a method of dispensing cells containing DNAs using ink-jet is used, for example.


The limiting dilution method is known to provide a diluted concentration that obeys a Poisson distribution. Such a distribution can be approximated to a normal distribution if the concentration is relatively high. However, if the concentration is low (in particular, if the copy number of nucleic acid molecules is 1 to 100), the right side (i.e., positive side) of the distribution becomes relatively long. Further, there is a relatively prominent tendency that the distribution becomes discrete.


Japanese Patent Application No. 2018-096636 describes a method of dispensing cells using ink-jet. This method does not have a cell measurement accuracy of 100% during dispensing, and the resulting concentration distribution is asymmetrical, with the right side longer than the left side due to aggregation of cells, for example. In addition, such a distribution is not a common distribution like a Poisson distribution that can be represented by a mathematical expression.


To show the product specifications of a nucleic acid reference material, it is necessary to present the concentration of the nucleic acid reference material included in the product. Specifically, specification values representing the representative concentration (typically, a mean value is used) and variation in the concentration are important. In particular, a smaller specification value regarding variation means higher concentration accuracy. Thus, it is important to use an adequate method of representing variation. In addition, the concentration distribution of a nucleic acid reference material may differ depending on a method of creating the material, and some concentration distribution may not be able to be represented using a common distribution. Thus, it would be desirable to use a method capable of computing a concentration irrespective of the shape of a distribution. That is, it is necessary to compute a concentration using a non-parametric method.


As a method of computing and displaying the product specifications of a reference material, a method that is based on the assumption that variation in the concentration obeys a normal distribution is known (hereinafter referred to as a normal distribution method). The normal distribution method is a method of estimating an interval in which the concentration of a reference material is in the range of the mean value±2σ (where a is the standard deviation of the concentration distribution). This method assumes that the concentration distribution obeys a normal distribution and thus is said to be one of the parametric methods.


A percentage-point method, which is used to compute a process capability index, identifies a probability distribution function or a p % point and a (1−p) % point on a probability distribution with a given shape. The p % point means a point where the probability of occurrence of a phenomenon having a given value or less is p %. For example, the 2.275% point and the 97.725% point on a standard normal distribution correspond to −2 and +2, respectively. In addition, when a cumulative probability for an interval [p % point, (1−p) % point] is to satisfy a target probability α, p=(1−α)×100/2. Percentage points can be used irrespective of the shape of a probability distribution. Thus, percentage points can be used to determine a concentration in a non-parametric way. When the way of thinking a process capability index based on the percentage-point method is used, for example, it is possible to determine an interval in which the probability that the concentration is within the range of a non-normal distribution is 95.45%, which is equivalent to the mean value±2σ on a normal distribution, for example.


WO 2006/030822 describes the following technique as a technique of correcting unevenness in a data distribution of a DNA chip: “[U]nevenness in data on a DNA chip is adequately detected and, if possible, corrected. In a gene expression data processing method for processing array data obtained on the basis of the expression level of genes on a DNA chip and thus obtaining analyzable data, the DNA chip is divided into small regions. The values of data forming the array data are standardized (step 300). The mean value of the standardized data values or the standard deviation of the median is calculated for each small region (step 310). The presence or absence of unevenness in the data on the DNA chip is detected on the basis of an increase in the standard deviation (step 310)” (see Abstract).


Japanese Unexamined Patent Application Publication No. 2004-257809 describes a technique of, based on an object of “providing a standard plastic material prepared by dispersing a known amount of a known element in a plastic base material, the standard plastic material having a predetermined thickness and being capable of providing an accurate analysis result, and a production method therefor,” “irradiating a plurality of portions of the standard plastic material with predetermined exciting beams to suppress variation in the intensity of fluorescent X rays generated from the element to less than or equal to 10% of the relative standard deviation” (see Abstract).


SUMMARY

When a concentration is computed based on the assumption that the resulting concentration distribution obeys a normal distribution, the computation result for a low-concentration reference material sample, in particular, may greatly deviate from the actual concentration. This occurs when the actual concentration distribution is bilaterally asymmetrical and a relatively large part of the distribution is not within the range of the mean value±2σ, for example. Alternatively, even when the actual concentration distribution concentrates in a range narrower than the mean value±2σ, the computation result will greatly deviate from the actual concentration. Further, when a normal distribution is assumed, the representative value of the concentration distribution has a decimal point and numbers following the decimal point, but the concentration of a low-concentration reference material sample is represented by an integer value (that is, the copy number of molecules). Thus, it would be unnatural to present the numbers following the decimal point as the values of the product specifications. Such problems are difficult to be solved with the technique described in Japanese Patent Application No. 2018-096636.


The aforementioned percentage-point method is a non-parametric method. Thus, it can also be used for concentration distributions other than normal distributions. Meanwhile, the method determines percentage points that are bilaterally symmetrical irrespective of a distribution (for example, the 2.25% point and the 97.75% point). Thus, when the actual concentration distribution is an asymmetrical, discrete distribution, an increase in the percentage on one side of the distribution is sharp (Example in Table 1: the 37.07% point follows the 0.43% point), and thus, the computation result may include a redundant interval.


In WO 2006/030822, the representative value and the standard deviation of a probability distribution are used, and the data distribution is evaluated based on the premise that the probability distribution is bilaterally symmetrical. Thus, the method is considered to have problems similar to Japanese Patent Application No. 2018-096636 in terms of the shape of the non-parametric probability distribution. Japanese Unexamined Patent Application Publication No. 2004-257809 is also considered to have similar problems to Japanese Patent Application No. 2018-096636 as it uses the relative standard deviation of a probability distribution.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 is a perspective view of a reference material sample 1;



FIG. 2A is a graph illustrating the concentration distribution of a reference material contained in the reference material sample 1;



FIG. 2B is a schematic diagram for comparing the probability distribution of FIG. 2A with a normal distribution; and



FIG. 3 is a flowchart illustrating the procedures for computing the product specifications of the reference material sample 1.





DETAILED DESCRIPTION

In view of the foregoing, exemplarily embodiments relate to providing a technique capable of adequately presenting the product specifications of a material sample even when the concentration distribution of the material is asymmetrical.


A material sample according to the present disclosure stores a material having variation according to a probability distribution. As the product specifications, the representative value of the probability distribution as well as an interval in which the amount of the material is greater than or equal to a target probability on the probability distribution is displayed. The interval is bilaterally asymmetrical.


According to the material sample of the present disclosure, the product specifications of even a material that has a bilaterally asymmetrical, discrete probability distribution can be presented accurately. Accordingly, the product specifications of even a low-concentration sample, which stores a small number of molecules, for example, can be presented accurately.



FIG. 1 is a perspective view of a reference material sample 1 according to an embodiment of the present disclosure. The reference material sample 1 includes one or more containers 11 (also referred to as wells) on a plate. Each container 11 stores a reference material that has been quantified in advance. For example, the container 11 can store a predetermined number of DNA molecules each having a target base sequence for use in an analytical operation. In such a case, storing a predetermined number of copied DNA molecules in each container 11 can create the reference material sample 1. Material samples other than the DNA molecules will be described later.



FIG. 2A is a graph illustrating an exemplary concentration distribution of the reference material contained in the reference material sample 1. Herein, the graph illustrates the probability distribution of the copy number of DNA molecules stored in a single container 11. In the example of FIG. 2A, the probability that 9 DNA molecules are stored in a single container 11 is the highest, and the probability that 10 DNA molecules is stored is the second highest. The probabilities of the other numbers of DNA molecules are significantly lower than that of the 9 or 10 DNA molecules. Thus, 10 can be determined as the representative value, for example.


To produce the reference material sample 1, a target probability is set in advance. In the example of FIG. 2A, suppose that the reference material sample 1 was produced so that the probability that 9 to 12 copies of DNA molecules are stored in a single container 11 would become 95.45%, for example. As the target probability in such a case is 95.45%, a concentration interval corresponding to the target probability is expressed as [9, 12] (the copy number of molecules, the representative value=10), for example. Such numerical values can be presented as the product specifications of the reference material sample 1.


A probability distribution such as the one illustrated in FIG. 2A has the following characteristics. As seen in FIG. 2A, the shape of the distribution is not bilaterally symmetrical about the representative value as the center. In particular, a low-concentration sample (for which the concentration is represented by the copy number of molecules) tends to have a longer distribution on its lower-probability portion on the right side (i.e., the side of a greater copy number). Further, since the reference material sample 1 has been created by copying DNA molecules, the abscissa axis of the concentration distribution represents the copy number. That is, the concentration distribution is a discrete distribution. According to such characteristics, the concentration distribution of the low-concentration sample represented by the copy number of molecules appears very different from a normal distribution.



FIG. 2B is a schematic diagram for comparing the probability distribution of FIG. 2A with a normal distribution. When a method based on a normal distribution is used for a sample with a concentration distribution such as the one illustrated in FIG. 2A, the following inconvenience occurs. Suppose a case where an interval in which the concentration is in the range of ±2σ, for example, based on a normal distribution, is presented as the product specifications of the reference material sample 1. However, as illustrated in FIG. 2B, a major part of the actual concentration distribution is not in compliance with the normal distribution on the left side (i.e., the side of a smaller copy number), in particular. Therefore, display of the concentration based on a normal distribution is not considered to be appropriate as the product specifications.


Specifically, the following case would cause inconveniences. In FIG. 2B, if a relatively large part of the actual concentration distribution is located outside of the range of ±2a on the normal distribution, the actual concentration distribution that is included in the range of ±2σ on the normal distribution may become smaller than the original target probability. Alternatively, if the actual concentration distribution concentrates around the representative value and on one side, the target probability may be met within a range narrower than ±2G on the normal distribution. In such a case, if the range of ±2σ on the normal distribution is presented as the product specifications, it follows that a range greater than the target probability is presented as the product specifications. That is, an excessively wide concentration interval is presented as the product specifications. Further, the representative value and the interval of ±2a on the normal distribution are not necessarily integers, whereas the representative value and the interval on the actual concentration distribution are integers, thus causing a mismatch between the two.


The present embodiment provides a technique for solving the foregoing inconveniences. That is, the present embodiment aims at adequately computing and displaying the product specifications of the reference material sample 1 according to the actual concentration distribution even when the concentration distribution is bilaterally asymmetrical and discrete. Accordingly, it is considered that the product specifications of, in particular, a low-concentration sample that is represented by the copy number of molecules, for example, can be presented more adequately than is conventionally done.



FIG. 3 is a flowchart illustrating the procedures for computing the product specifications of the reference material sample 1 in the present embodiment. The flowchart of FIG. 3 can be carried out either manually or through implementation of each step with hardware, such as a circuit device, or implementation of each step with software and execution of the step with an arithmetic unit. Each step in FIG. 3 will be described below.


(FIG. 3: Step S301)


A target probability for the reference material sample 1 is set. When a target probability equal to that of ±2σ on a normal distribution is set, for example, such a target probability is 95.45%. It is also possible to set other appropriate target probabilities.


(FIG. 3: Step S302)


The representative value on the concentration distribution of the reference material sample 1 is computed. Specifically, the actual concentration distribution of the reference material sample 1 is measured or computed in advance to obtain a concentration distribution such as the one illustrated in FIG. 2A. The following description is also based on FIG. 2A. Then, the representative value on the concentration distribution is determined. For the representative value, an appropriate value may be selected according to the shape of the distribution as described below, or an appropriate value may be selected according the intended use of the reference material sample 1. The representative value may also be selected using other appropriate criteria.


(FIG. 3: Step S303)


An interval [the representative value−a, the representative value+b] is set as an initial interval to execute the following steps. The representative value is the one computed in step S302, and is 10 in the example of FIG. 2A. Symbols a and b are integers greater than or equal to zero, and need not be the same numerical value. For example, an initial interval of [9, 10] is set in the example of FIG. 2A. In such a case, a=1 and b=0.


(FIG. 3: Step S304)


A variable width w for changing the initial interval is set. For example, when the initial interval is changed in increments of 1 in the abscissa axis direction in the example of FIG. 2A, w is set to 1 (w=1). When the copy number is large (for example, 100 copies or more), w may be set to 2 or more (w=2 or more). With a smaller width w, each interval on the concentration distribution will be checked in detail. Meanwhile, with a larger width w, the computation efficiency can be increased. The following steps are performed based on the assumption that w is 1 (w=1).


(FIG. 3: Step S304: Supplementary Notes)


When this step is executed again (second time or later), the previous width w may be changed. For example, the width w may be initially set large to allow for a general search, and then gradually set smaller to allow for a more detailed search.


(FIG. 3: Step S305)


The cumulative probability of the concentrations included in the interval [the representative value−(a+w), the representative value+b] on the actual concentration distribution is computed. If w=1, the cumulative probability (i.e., first probability) of the concentrations included in the interval [8, 10] is computed. Specifically, the probabilities of the copy number=8 to the copy number=10 are added together.


(FIG. 3: Step S306)


The cumulative probability of the concentrations included in the interval [the representative value−a, the representative value+(b+w)] on the actual concentration distribution is computed. If w=1, the cumulative probability (i.e., second probability) of the concentrations included in the interval [9, 11] is computed. Specifically, the probabilities of the copy number=9 to the copy number=11 are added together.


(FIG. 3: Step S307-1)


The first probability computed in step S305 is compared with the second probability computed in step S306. If the first probability is greater than or equal to the second probability, the process proceeds to step S307-2, and if not, the process proceeds to step S307-3.


(FIG. 3: Step S307-2 to Step S307-3)


If the first probability is greater than or equal to the second probability, it means that extending the interval leftward will allow a more portion of the actual concentration distribution to be included. Therefore, in such a case, the interval set in step S305 is fixed (S307-2). If the first probability is less than the second probability, it means that extending the interval rightward will allow a more portion of the actual concentration distribution to be included. Therefore, in such a case, the interval set in step S306 is fixed (S307-3).


(FIG. 3: Step S308)


Steps S304 to S307 (S307-1 to S307-3) are repeated until the cumulative probability computed in step S305 or S306 becomes greater than or equal to the target probability. The flowchart ends at a time point when the cumulative probability becomes greater than or equal to the target probability.


The product specifications computed according to the flowchart of FIG. 3 can be displayed as the product specifications of the reference material sample 1. For example, a label describing the product specifications can be attached to the reference material sample 1 so that the product specifications can be displayed. That is, the reference material sample 1 includes a display portion displaying the product specifications.


Computation Example 1

Table 1 illustrates an example of the actual concentration distribution of the reference material sample 1, which correspond to the real values in FIG. 2A. In the following, product specifications computed according to the procedures described with reference to FIG. 3 and product specifications computed with a conventional method are compared using the reference material sample 1 of Table 1.












TABLE 1







Copy/Well
Probability



















8
0.43%



9
36.64%



10
35.57%



11
18.31%



12
6.62%



13
1.88%



14
0.44%



15
0.09%



16
0.02%










Table 2 illustrates the results of the product specifications computed with each method. The representative value of the normal distribution method is the mean value. For the percentage-point method B, a Poisson distribution with a mean value of 10 was used as the concentration distribution. Thus, the mean value (=10) of the Poisson distribution was selected as the representative value. For the present embodiment and the percentage-point method A, the probabilities of an interval of 9 molecules and an interval of 10 molecules are almost the same. Thus, 10 as the median was selected as the representative value. For each of the percentage-point methods A and B, the target probability α is 95.45%. Thus, an interval was identified through a search for the 2.275% point and the 97.725% point.















TABLE 2







Target
Rep-


Probability



Probability
Probability
resentative
Estimated
Interval
within


Method
Distribution
α
Value
Interval
Width
Interval





















Present
Non-
95.45%
10
[9, 12]
3
97.14%


Embodiment
Parametric
Note:






Percentage-
Non-
This
10
[9, 13]
4
99.02%


Point Method
Parametric
corresponds






A

to






Percentage-
Poisson
±2σ of
10
[4, 17]
13
100.00%


Point Method
Distribution
the






B

normal






Normal
Normal
distribution.
10.02
[7.92,
4.20
97.57%


Distribution
Distribution


12.12]




Method









As a result, each method is found to achieve the target probability a. However, the fixed interval of the normal distribution method has a decimal value. Thus, the cumulative probability of an interval of the closest integers [8, 12] within the interval of the computation result is displayed. As illustrated in Table 2, the computation result of the present embodiment has the narrowest interval width of all the other methods. Thus, the present embodiment is deemed to display the properties of the reference material sample 1 most accurately. For the percentage-point method A, the probability of the copy number being less than the lower limit value 9 is 0.43%, but a percentage point with a lower probability of 2.275% was searched for. Thus, a redundant interval width of 2.275−0.43%=1.845% was secured. For this reason, it is considered that the interval width of the estimation result is greater than that of the present embodiment. For the percentage-point method B, the shape of the probability distribution greatly differs from that of the Poisson distribution. For this reason, it is considered that the interval width of the estimation result is significantly greater than that of the present embodiment. For the normal distribution method, an interval that is bilaterally symmetrical about the representative value is the estimation result. Thus, the interval on the left side, in particular, is larger than that of the present embodiment.


Computation Example 2

Table 3 illustrates another example of the actual concentration distribution of the reference material sample 1. In the following, product specifications computed according to the procedures described with reference to FIG. 3 and product specifications computed with a conventional method are compared using the reference material sample 1 of Table 3.












TABLE 3







Copy/Well
Probability



















4
0.29%



5
41.44%



6
35.86%



7
15.98%



8
4.96%



9
1.19%



10
0.23%



11
0.04%



12
0.01%










Table 4 illustrates the results of the product specifications computed with each method. The representative value of the normal distribution method is the mean value. For the percentage-point method B, a Poisson distribution with a mean value of 6 was used as the concentration distribution. Thus, the mean value (=6) of the Poisson distribution was selected as the representative value. For each of the present embodiment and the percentage-point method A, the probability of the mode 5 is obviously higher than those of the other intervals. Thus, the mode 5 was selected as the representative value. For each of the percentage-point methods A and B, the target probability a is 95.45%. Thus, an interval was identified through a search for the 2.275% point and the 97.725% point.















TABLE 4







Target
Rep-


Probability



Probability
Probability
resentative
Estimated
Interval
within


Method
Distribution
α
Value
Interval
Width
Interval





















Present
Non-
95.45%
5
[5, 8]
3
98.24%


Embodiment
Parametric
Note:






Percentage-
Non-
This
5
[5, 8]
3
98.24%


Point Method
Parametric
corresponds






A

to ±2σ






Percentage-
Poisson
of the
6
 [2, 11]
9
99.99%


Point Method
Distribution
normal






B

distribution.






Normal
Normal

5.89
[3.96,
3.85
93.57%


Distribution
Distribution


7.81]




Method















As a result, the present embodiment and the percentage-point method A are found to achieve the target probability a. For each of the present embodiment and the percentage-point method A, the interval width is 3, which is the smallest of all. For the percentage-point method B, the shape of the probability distribution greatly differs from that of the Poisson distribution. For this reason, it is considered that the interval width of the estimation result is significantly greater than that of the present embodiment. For the normal distribution method, the fixed interval has decimal values. Thus, an interval of the closest integers [4, 7] within the interval of the computation result is the substantial estimation result. For the normal distribution method, an interval that is bilaterally symmetrical about the representative value is the estimation result. Thus, an interval on the left side, in particular, is large and an interval on the right is insufficient, and for this reason, it is considered that the cumulative probability does not satisfy the target probability α.


Computation Example 3

Table 5 illustrates the results obtained by computing the product specifications of the reference material sample 1, which has been created with the limiting dilution method, using each method. Unlike Tables 1 to 4, the reference material sample 1 has been created with the limiting dilution method. Thus, the concentration distribution obeys a Poisson distribution. In Table 5, examples in which a single container 11 stores an average of 1 to 10000 DNA molecules were used. Since the concentration distribution obeys a Poisson distribution, there is no difference between the percentage-point methods A and B. Thus, such methods are collectively referred to as a “percentage-point method” in Table 5.














TABLE 5






Average Copy







Number



Probability


Target
(Representative

Estimated
Interval
within


Probability α
Value)
Method
Interval
Width
Interval




















95.45%
1
Present
[0, 3] 
3
98.10%


Note:

Embodiment


This

Percentage-
[0, 3] 
3
98.10%


corresponds

Point Method


to ±2σ of

Normal
[−1, 3] 
4

100%



the normal

Distribution


distribution.

Method



10
Present
[4, 16]
12
96.26%




Embodiment




Percentage-
[4, 17]
13
97.54%




Point Method




Normal
[3.68, 16.32]
12.65
96.26%




Distribution




Method



100
Present
[81, 120]
39
95.47%




Embodiment




Percentage-
[81, 120]
39
95.47%




Point Method




Normal
[80, 120]
40
95.99%




Distribution




Method



1000
Present
[937, 1063]
126
95.54%




Embodiment




Percentage-
[937, 1064]
127
95.70%




Point Method




Normal
[936.75, 1063.25]
126.49
95.54%




Distribution




Method



10000
Present
[9801, 10200]
399
95.45%




Embodiment




Percentage-
[9801, 10200]
399
95.45%




Point Method




Normal
[9800, 10200]
400
95.50%




Distribution




Method









As a result, each method is found to achieve the target probability α. For each number of molecules, the interval width of the present embodiment is the smallest. Thus, it is found that the present embodiment can also be advantageously used when the concentration distribution obeys a typical Poisson distribution. The interval width according to the present embodiment is smaller than those of the normal distribution method and the percentage-point method by one. In particular, regarding a case where the number of molecules is less than or equal to 100, when the present embodiment is compared with one of the two other methods that has a greater interval width, the interval width of the present embodiment is smaller than that of the other method by 2.5 to 25%. This can confirm that the present embodiment is particularly advantageous for a low-concentration sample containing an average of 100 molecules or less.


Conclusion of the Present Embodiment

The reference material sample 1 according to the present embodiment includes a display portion that displays the product specifications computed according to the procedures described with reference to FIG. 3. As the product specifications, (a) the representative value of the concentration distribution and (b) the lower limit value and the upper limit value representing an interval on the probability distribution can be displayed. Further, (c) the target probability a may also be displayed. The target probability a need not be displayed on the display portion if the numerical value can be known without the display portion such as when the value is determined as the industry standard, for example. Displaying the product specifications via the display portion will allow a user to know the correct specifications of the reference material sample 1.


In the present embodiment, after an initial interval is set on the probability distribution, the initial interval is widened to a side with a higher cumulative probability while being gradually widened to the right or left side by the variable width w. Setting the variable width w according to the properties of the probability distribution can, for a given shape of the probability distribution, estimate with high accuracy the minimum interval for which the cumulative probability is greater than or equal to the target probability α. Accordingly, product specifications with higher accuracy than those of conventional methods can be presented for a low-concentration sample containing 100 molecules or less, in particular.


Another Embodiment 1

The amount y of the reference material is influenced by other amounts (for example, the concentration x1 of the solution and the volume x2 of the solution dispensed). When a single sample is created by mixing a reference material sample with the distribution of Table 1 and a reference material sample with the distribution of Table 3, a representative value of a new sample as well as an interval having a probability greater than or equal to 95.45% can be easily estimated according to the following procedures. The result is that the representative value is 15 copies/wells and the estimated interval is [10.30, 14.42]. When the estimated interval is represented by integers, it becomes an interval of the minimum integers [10, 15] that can totally include the estimated interval.


Provided that the amount y of the material is influenced by other amounts x1, x2, . . . , xn, y=f(x1, x2, . . . , xn), the absolute value of the difference between the representative value Ry of the amount y and the lower limit tolerance is Ay, the absolute value of the difference between the upper limit tolerance and the representative value Ry is By, the absolute value of the difference between the representative value Rxi of each of the amounts x1, x2, . . . , xn and the lower limit tolerance is Ax1, Ax2, . . . , Axn, and the absolute value of the difference between the upper limit tolerance and the representative value Rxi is Bx1, Bx2, . . . , B, the values of Ay, By, and Ry can be estimated with the following formulae.











A
y

=





n


i
=
1





(




y




x
i



·

A

x

i



)

2











B
y

=





i
=
1

n




(




y




x
i



·

B

x

i



)

2











R
y

=

f


(


R

x





1


,

R

x





2


,





,

R

x

n



)







[

Math
.




1

]







Another Embodiment 2

The aforementioned embodiment illustrates an example in which the amount of the reference material contained in the reference material sample is estimated and displayed. Instead, the average amount of each sample of the reference material contained in the reference material sample may also be estimated and displayed. In such a case, the measurement result of the average amount of the reference material has measurement uncertainty according to a probability distribution.


For example, when there are 14 samples of a reference material sample having the distribution of Table 1, the representative value of the average amount of each sample as well as an interval having a probability greater than or equal to 95.45% can be easily estimated according to the following procedures. The result is that the representative value is 10 copies/wells and the estimated interval is [7.59, 13.21]. When the estimated interval is represented by integers, it becomes an interval of the minimum integers [7, 14] that can totally include the estimated interval.


Provided that the absolute value of the difference between the representative value Ry of the amount y of the material and the lower limit tolerance is Ay, the absolute value of the difference between the upper limit tolerance and the representative value Ry is By, the absolute value of the difference between the representative value Rym of the mean value of the amount y of the material and the lower limit tolerance is Aym, the absolute value of the difference between the representative value Rym and the upper limit tolerance is B and the number of samples of the material is Ny, the values of Aym, Bym, and Rym can be estimated with the following formulae.











A
ym

=


A
y



N
y











B

y

m


=


B
y



N
y











R

y

m


=

R
y






[

Math
.




2

]







Another Embodiment 3

When a given amount of a nucleic acid solution sample is dispensed into each of the 14 wells, the average amount y of the nucleic acid dispensed into each well is influenced by the concentration x1 of the solution, the volume x2 of the solution dispensed, and a deviation x3 attributed to a Poisson distribution. Table 6 illustrates the representative value of each element and an interval having a probability greater than or equal to 9.5.4.5%. Each value can be determined according to the following procedures. Since y=x1×x2+x3, the representative value of the average amount of each well is 1 copy/well, and the estimated interval is [0.60, 1.63]. When the estimated interval is represented by integers, it becomes an interval of the minimum integers [0, 2] that can totally include the estimated interval.












TABLE 6







Representative Value
Estimated Interval



















Concentration of Reference
0.25
Copy/μL
[0.174, 0.331]


Material


Volume of Dispensed
4
μL
[3.94, 4.06]


Solution


Deviation Attributed to
0
Copy
[−0.267, 0.535]


Poisson Distribution









Provided that the amount y of the material is influenced by other amounts x1, x2, . . . , xn described above, y=f(x1, x2, . . . , xn), the absolute value of the difference between the representative value Rym of the mean value of the amount y and the lower limit tolerance is Aym, the absolute value of the difference between the representative value Rym and the upper limit tolerance is Bym, the absolute value of the difference between the representative value Rxm1, Rxm2, . . . , Rxmn of the mean value of each of the amounts x1, x2, . . . , xn and the lower limit tolerance is Axm1, Axm2, . . . , Axmn, and the absolute value of the difference between the representative value Rxm1, Rxm2, . . . , Rxmn and the upper limit tolerance is Bxm1, Bxm2, . . . , Bxmn, the values of Aym, Bym, and Rym can be computed with the following formulae.











A

y

m


=





i
=
1

n




(




y




x
i



·

A

x

m

i



)

2











B

y

m


=





i
=
1

n




(




y




x
i



·

B

x

m

i



)

2











R

y

m


=

f


(


R

xm





1


,

R

xm





2


,





,

R

x

m

n



)







[

Math
.




3

]







Another Embodiment 4

When a calibration curve is created with six types of the designated amounts of a reference material, using a real-time PCR (polymerase chain reaction) for quantitative determination of nucleic acids through nucleic acid amplification, the uncertainty of the average quantitative result y is influenced by variation x1 in the quantitative results of the samples for quantitative determination, variation x2 in the designated amounts of the reference material used to create the calibration curve, and variation x3 in the amplification results of the designated amounts of the reference material for creation of the calibration curve. The representative value of y is the mean value of the quantitative results and thus is known. Since the relationship between y and x1 to x3 cannot be represented by a formula, ∂y/∂xi is typically regarded as 1. Provided that the absolute value of the difference between the representative value Rxm of the mean value of each element and the lower limit of the estimated interval is Axm, and the absolute value of the difference between the representative value Rxm and the upper limit of the estimated interval is Bxm, each relative value with respect to the representative value Rxm is illustrated in Table 7. Each value can be determined according to the following procedures. When the representative value of y is 7.03, the estimated interval of y is [5.35, 9.33].












TABLE 7







Axm/Rxm
Bxm/Rxm




















x1
0.220
0.287



x2
0.034
0.125



x3
0.085
0.094










Provided that the amount y of the material is influenced by other amounts x1, x2, . . . , xn described above, y=f(x1, x2, . . . , xn), the absolute value of the difference between the representative value Rym of the mean value of the amount y and the lower limit tolerance is Aym, the absolute value of the difference between the representative value Rym and the upper limit tolerance is Bym, the absolute value of the difference between the representative value Rxm1, Rxm2, . . . , Rxmn of the mean value of each of the amounts x1, x2, . . . , xn and the lower limit tolerance is Axm1, Axm2, . . . , Axmn, and the absolute value of the difference between the representative value Rxm1, Rxm2, . . . , Rxmn and the upper limit tolerance is Bxm1, Bxm2, . . . , Bxmn, the values of Aym, Bym, and Rym can be estimated with the following formulae.











A

y

m


=


R

y

m








i
=
1

n




(




y




x
i



·


A

x

m

i



R

x

m

i




)

2












B

y

m


=


R

y

m








i
=
1

n




(




y




x
i



·


B

x

m

i



R

x

m

i




)

2












R

y

m


=

f


(


R

x

m

1


,

R

x

m

2


,





,

R

x

m

n



)







[

Math
.




4

]







To create a calibration curve, six types of a reference material that include 1, 5, 10, 20, 40, and 80 copies/wells are used. Provided that the representative value of the mean value of each reference material is Rz, the absolute value of the difference between the representative value and the lower limit of the estimated interval is Az, the absolute value of the difference between the representative value and the upper limit of the estimated interval is Bz, and the number of the samples of the reference material is Nz, each relative value with respect to the representative value Rz is illustrated in Table 8. Each value can be determined according to the following procedures. Axm/Rxm and Bxm/Rxm with respect to the variation x2 in the designated amounts of the reference material used to create the calibration curve are 0.034 and 0.125, respectively.














TABLE 8







RZ
AZ/RZ
BZ/RZ
NZ






















Z1
1
0.000
0.277
14



Z2
5
0.053
0.107
14



Z3
10
0.053
0.053
14



Z4
20
0.027
0.040
14



Z5
40
0.020
0.033
14



Z6
80
0.017
0.020
14










Table 9 illustrates AZ/RZ, BZ/RZ, and NZ obtained by performing an inverse operation on the copy number of nucleic acids of each reference material using the created calibration curve. As a result of the computation performed through the same procedures, it is found that Axm/Rxm and Bxm/Rxm with respect to the variation x3 in the amplification results of the designated amounts of the reference material for creation of the calibration curve are 0.085 and 0.094, respectively.














TABLE 9







RZ
AZ/RZ
BZ/RZ
NZ






















Z1
1.06
0.095
0.105
14



Z2
5.00
0.109
0.123
14



Z3
9.36
0.089
0.098
14



Z4
21.17
0.092
0.102
14



Z5
40.64
0.057
0.060
14



Z6
81.65
0.056
0.059
14










Regarding the plurality of types of material samples having different mean values of the amounts x, provided that the absolute value of the difference between the representative value RZ1, RZ2, . . . , RZn of each of the mean values and the lower limit tolerance is AZ1, AZ2, . . . , AZn, the absolute value of the difference between the representative value RZ1, RZ2, . . . , RZn and the upper limit tolerance is BZ1, BZ2, . . . , BZn, and the number of the material samples is NZ1, NZ2, . . . , NZn, the values of Axm/Rxm and Bxm/Rxm can be estimated with the following formulae.












A

x

m



R
xm


=






i
=
1

n




(


N
Zi

-
1

)

·


(


A

Z

i



R

Z

i



)

2







i
=
1

n



(


N

Z

i


-
1

)













B

x

m



R

x

m



=






i
=
1

n




(


N

Z

i


-
1

)

·


(


B

Z

i



R

Z

i



)

2







i
=
1

n



(


N

Z

i


-
1

)









[

Math
.




5

]







Regarding Modified Example of the Present Embodiment

The present embodiment has been described above using a low-concentration sample as an example of a reference material with a discrete, bilaterally asymmetrical concentration distribution. For example, the reference material sample 1 created by copying DNA molecules corresponds to such a sample. The present embodiment can also be advantageously used for a concentration distribution that is not discrete and a concentration distribution that is bilaterally symmetrical as described with reference to Computation Example 3.


Although the aforementioned embodiment illustrates a nucleic acid reference material sample as an exemplary material sample having containers 11 each storing a quantified material, the present embodiment is also applicable to a material sample having containers storing other quantified materials. That is, although the aforementioned embodiment illustrates an example in which copied nucleic acid molecules are stored in each container 11, the material stored in each container 11 is not necessarily limited to copied nucleic acid molecules. The present embodiment is also applicable to other material samples because it is applicable to any probability distributions. Further, the present embodiment is also applicable to probability distributions other than concentration probability distributions.


In the foregoing embodiments, the target base sequences of molecules stored in each container 11 may be either the same or different. Further, the target base sequence of each molecule stored in a single container 11 may be either the same or different. For material samples other than DNA molecules, the material composition may also be either the same or different among the containers 11.


Although the aforementioned embodiment exemplarily illustrates a label to be attached to the reference material sample 1 as an example of a display portion, the display portion may be configured in different ways. For example, a piece of paper, such as a manual of a product describing its specifications, may be displayed by being packed together with the reference material sample 1. Further, the product specifications may be displayed by being presented over a network. The product specifications may also be displayed using any other appropriate methods.


In the aforementioned embodiment, information displayed on the display portion may be expressed with any method as long as it can identify product specifications. For example, the lower limit and the upper limit of an interval may be displayed like an “interval [8, 12],” or information that can identify an interval according to some procedures may be presented. This is also true of the target probability a and the representative value.


The reference material sample 1 can be created by ejecting molecules of a material (for example, DNA molecules) into the containers 11 from an ink-jet apparatus (i.e., liquid droplet ejection apparatus), for example (see Japanese Patent Application No. 2018-096636). In such a case, the representative value can be computed using the properties of the ink-jet apparatus in step S302. For example, using properties, such as a single liquid droplet ejection amount and the concentration of a material contained in the liquid droplet, is considered.

Claims
  • 1. A material sample storing a quantified material, comprising: a container adapted to store the material; anda display portion configured to display an amount of the material stored in the container,wherein:the amount of the material stored in the container has variation according to a probability distribution,a probability that the amount of the material stored in the container is within a range of a lower limit tolerance to an upper limit tolerance on the probability distribution is greater than or equal to a target probability,the display portion is configured to display a representative value of the amount of the material on the probability distribution,a lower limit value representing the lower limit tolerance, andan upper limit value representing the upper limit tolerance, andan absolute value of a difference between the representative value and the lower limit value is different from an absolute value of a difference between the representative value and the upper limit value.
  • 2. The material sample according to claim 1, wherein the display portion is further configured to display a target probability value representing the target probability.
  • 3. The material sample according to claim 1, wherein the material includes nucleic acid molecules.
  • 4. The material sample according to claim 3, wherein:the material is nucleic acid molecules, andeach of the representative value, the lower limit tolerance, and the upper limit tolerance is represented by the number of the nucleic acid molecules stored in the container.
  • 5. The material sample according to claim 3, wherein:the material is nucleic acid molecules, andthe representative value is a numerical value representing 100 nucleic acid molecules or less.
  • 6. A display method for displaying an amount of a quantified material, the amount of the material having variation according to a probability distribution, and a probability that the amount of the material is within a range of a lower limit tolerance to an upper limit tolerance on the probability distribution being greater than or equal to a target probability, the display method comprising: displaying a representative value of the amount of the material on the probability distribution;displaying a lower limit value representing the lower limit tolerance; anddisplaying an upper limit value representing the upper limit tolerance,wherein:an absolute value of a difference between the representative value and the lower limit value is different from an absolute value of a difference between the representative value and the upper limit value.
  • 7. The display method according to claim 6, further comprising displaying a target probability value representing the target probability.
  • 8. The display method according to claim 6, wherein the material includes nucleic acid molecules.
  • 9. The display method according to claim 8, wherein:the material is nucleic acid molecules, andeach of the representative value, the lower limit tolerance, and the upper limit tolerance is represented by the number of the nucleic acid molecules.
  • 10. The display method according to claim 8, wherein:the material is nucleic acid molecules, andthe representative value is a numerical value representing 100 nucleic acid molecules or less.
  • 11. An estimation method for estimating the amount of the material to be displayed with the display method according to claim 6, comprising: a first step of setting the target probability;a second step of computing the representative value;a third step of setting an interval of a lower limit to an upper limit including the representative value on the probability distribution;a fourth step of setting a variable width of the interval;a fifth step of increasing a size of the interval by the variable width from the representative value toward the lower limit, and computing a first probability that the amount of the material is within the interval with the increased size;a sixth step of increasing the size of the interval by the variable width from the representative value toward the upper limit, and computing a second probability that the amount of the material is within the interval with the increased size; anda seventh step of fixing the size of the interval set in the fifth step if the first probability is greater than or equal to the second probability, or fixing the size of the interval set in the sixth step if the first probability is less than the second probability,wherein:the fourth step, the fifth step, the sixth step, and the seventh step are repeated until the probability that the amount of the material is within the interval becomes greater than or equal to the target probability,the lower limit tolerance is the lower limit of the interval at a time point when the probability that the amount of the material is within the interval becomes greater than or equal to the target probability, andthe upper limit tolerance is the upper limit of the interval at a time point when the probability that the amount of the material is within the interval becomes greater than or equal to the target probability.
  • 12. The estimation method according to claim 11, further comprising calculating the probability distribution, wherein:the calculating the probability distribution includes computing the probability distribution using a property of an injection apparatus configured to inject the material into a container.
  • 13. The estimation method according to claim 12, wherein:the injection apparatus is a liquid droplet ejection apparatus configured to eject a liquid droplet, andthe calculating the probability distribution includes computing the probability distribution using a performance property of the liquid droplet ejection apparatus.
  • 14. An estimation method for estimating the amount of the material to be displayed with the display method according to claim 6, wherein provided that an amount y of the material is influenced by other amounts x1, x2, . . . , xn, y=f(x1,x2, . . . ,xn),an absolute value of a difference between a representative value Ry of the amount y and the lower limit tolerance is Ay,an absolute value of a difference between the upper limit tolerance and the representative value Ry is By,an absolute value of a difference between a representative value Rxi of each of the amounts x1, x2, . . . , xn and the lower limit tolerance is Ax1, Ax2, . . . , Axn, andan absolute value of a difference between the upper limit tolerance and the representative value Rxi is Bx1, Bx2, . . . , Bxn,values of Ay, By, and Ry are estimated with the following formulae.
  • 15. A display method for displaying a measurement result of an average amount of a quantified material, the measurement result of the average amount of the material having a measurement uncertainty according to a probability distribution, and a probability that the average amount of the material is within a range of a lower limit tolerance to an upper limit tolerance on the probability distribution being greater than or equal to a target probability, the display method comprising: displaying a representative value of the average amount of the material on the probability distribution;displaying a lower limit value representing the lower limit tolerance; anddisplaying a upper limit value representing the upper limit tolerance,wherein:an absolute value of a difference between the representative value and the lower limit value is different from an absolute value of a difference between the representative value and the upper limit value.
  • 16. An estimation method for estimating the average amount of the material to be displayed with the display method according to claim 15, wherein provided that an absolute value of a difference between a representative value Ry of an amount y of the material and the lower limit tolerance is Ay,an absolute value of a difference between the upper limit tolerance and the representative value Ry is By,an absolute value of a difference between a representative value Rym of a mean value of the amount y of the material and the lower limit tolerance is Aym,an absolute value of a difference between the representative value Rym and the upper limit tolerance is Bym, andthe number of samples of the material is Ny,values of Aym, Bym, and Rym are estimated with the following formulae.
  • 17. An estimation method for estimating the average amount of the material to be displayed with the display method according to claim 15, wherein provided that an amount y of the material is influenced by other amounts x1, x2, . . . , xn, y=f(x1,x2, . . . ,xn),an absolute value of a difference between a representative value Rym of a mean value of the amount y and the lower limit tolerance is Aym,an absolute value of a difference between the representative value Rym and the upper limit tolerance is Bym,an absolute value of a difference between a representative value Rxm1, Rxm2, . . . , Rxmn of a mean value of each of the amounts x1, x2, . . . , xn and the lower limit tolerance is Axm1, Axm2, Axmn, andan absolute value of a difference between the representative value Rxm1, Rxm2, . . . , Rxmn and the upper limit tolerance is Bxm1, Bxm2, . . . , Bxmn,values of Aym, Bym, and Rym are estimated with the following formulae.
  • 18. An estimation method for estimating the average amount of the material to be displayed with the display method according to claim 15, wherein provided that an amount y of the material is influenced by other amounts xn, y=f(x1,x2, . . . ,xn),an absolute value of a difference between a representative value Rym of a mean value of the amount y and the lower limit tolerance is Aym,an absolute value of a difference between the representative value Rym and the upper limit tolerance is Bym,an absolute value of a difference between a representative value Rxm1, Rxm2, . . . , Rxnm of a mean value of each of the amounts x1, x2, . . . , xn and the lower limit tolerance is Axm1, Axm2, . . . , Axmn, andan absolute value of a difference between the representative value Rxm1, Rxm2, . . . , Rxmn and the upper limit tolerance is Bxm1, Bxm2, . . . , Bxmn,values of Aym, Bym, and Rym are estimated with the following formulae.
  • 19. The estimation method according to claim 18, wherein for a plurality of types of material samples having different mean values of the amounts x, provided that an absolute value of a difference between a representative value RZ1, RZ2, . . . , RZn of each of the mean values and the lower limit tolerance is AZ1, AZ2, . . . , AZn,an absolute value of a difference between the representative value RZ1, RZ2, . . . , RZn and the upper limit tolerance is BZ1, BZ2, . . . , BZn, andthe number of the material samples is NZ1, NZ2, . . . , NZn,values of Axm/Rxm and Bxm/Rxm are estimated with the following formulae.
  • 20. The estimation method according to claim 17, wherein at least one of the plurality of amounts x1, x2, and xn is an average amount of the material.
  • 21. The estimation method according to claim 20, wherein at least one of the materials for which the average amount is estimated is a material injected into a container using a liquid droplet ejection apparatus.
Priority Claims (3)
Number Date Country Kind
2019-046887 Mar 2019 JP national
2019-232236 Dec 2019 JP national
2020-015263 Jan 2020 JP national