The technology of the present disclosure relates to a drug discovery support apparatus, a method for operating a drug discovery support apparatus, and a program for operating a drug discovery support apparatus.
In a field of drug discovery, a test is performed which administers a candidate substance for a drug to a subject, such as a rat, and evaluates the efficacy and toxicity of the candidate substance. In this evaluation test, a section image obtained by imaging a tissue section (for example, a brain section, a liver section, or a heart section) collected by autopsying the subject is used. In the related art, a pathologist observes the section image to perform the evaluation. However, with the recent progress of an image analysis technique, a technique has been developed that automatically performs the evaluation without causing the pathologist trouble.
For example, JP2021-508373A discloses a technique that performs evaluation with a machine learning model trained using a section image obtained by imaging a tissue section (referred to as a “tissue sample without a substantial abnormality” in JP2021-508373A) in which an abnormality, such as a tumor or inflammation, has not occurred. Specifically, in JP2021-508373A, first, a section image of a subject, to which a candidate substance has been administered, is input to the machine learning model. Then, a difference (referred to as “deviation from a normal score” in JP2021-508373A) between a feature amount extracted from the section image and a feature amount extracted from a section image for learning obtained by imaging a tissue section, in which no abnormality has occurred, is calculated. Then, it is determined whether or not an abnormality occurs in the tissue section of the subject, to which the candidate substance has been administered, on the basis of the calculated difference.
The subjects are divided into an administration group to which the candidate substance is administered and a control group to which the candidate substance is not administered for comparison with the administration group. Abnormalities may occur in some subjects in the control group even though the candidate substance is not administered. It is considered that the abnormalities occur in the subjects in the control group due to an attribute, a weekly age, and/or a breeding environment, such as feed. The subject in which an abnormality occurs due to the attribute and/or the breeding environment is also present in the administration group. Therefore, in a case where an abnormality occurs in the tissue section of the subject in the administration group, it is necessary to determine whether or not the cause of the abnormality is really the candidate substance. However, JP2021-508373A does not disclose the abnormality that occurs due to the attribute and/or the breeding environment and, of cause, does not disclose a method for determining whether or not the cause of the abnormality is the candidate substance.
An embodiment according to the technology of the present disclosure provides a drug discovery support apparatus, a method for operating a drug discovery support apparatus, and a program for operating a drug discovery support apparatus that can determine whether or not an abnormality caused by a candidate substance for a drug has occurred in a subject in an administration group.
According to an aspect of the present disclosure, there is provided a drug discovery support apparatus comprising a processor configured to: acquire a first section image obtained by imaging a tissue section of at least one of a plurality of subjects constituting an administration group to which a candidate substance for a drug has been administered; extract a first feature amount from the first section image; acquire a plurality of second section images including tissue sections of a plurality of subjects that have the same attributes and are placed under the same breeding environment as the subjects constituting the administration group and that constitute a control group to which the candidate substance for a drug has not been administered; extract a second feature amount from each of the plurality of second section images; acquire third feature amount information related to a third feature amount extracted from each of a plurality of third section images including tissue sections of a plurality of subjects constituting a control group to which the candidate substance was not administered in a past evaluation test; and determine whether or not an abnormality is present in the tissue section included in the first section image on the basis of the first feature amount, the second feature amount, and the third feature amount information.
Preferably, the processor is configured to extract the first feature amount and the second feature amount using a machine learning model trained using the plurality of third section images.
Preferably, the first feature amount, the second feature amount, and the third feature amount have the same number of dimensions and are comparable in the same feature amount space, and the third feature amount information is information of a representative position of the plurality of third feature amounts in the feature amount space. Preferably, the processor is configured to: calculate a first distance between the representative position and a position of the first feature amount in the feature amount space; calculate a second distance between the representative position and a position of each of the plurality of second feature amounts in the feature amount space; and perform the determination on the basis of the first distance and the second distance.
Preferably, the processor is configured to: extract the first feature amount for each of a plurality of first patch images obtained by subdividing the first section image and calculate the first distance for each of the plurality of first patch images; extract the second feature amount for each of a plurality of second patch images obtained by subdividing the second section image and calculate the second distance for each of the plurality of second patch images; and perform the determination on the basis of a first area ratio of the first patch images, for which the first distance is out of a setting range, to the first section image and a second area ratio of the second patch images, for which the second distance is out of the setting range, to the second section image.
Preferably, the processor is configured to determine that the abnormality is present in the tissue section included in the first section image in a case where there is a statistically significant difference between the first area ratio and the second area ratio.
Preferably, the processor is configured to determine that the abnormality is present in the tissue section included in the first section image for which the first area ratio is greater than a threshold value set on the basis of the second area ratio.
Preferably, the first feature amount, the second feature amount, and the third feature amount have the same number of dimensions and are comparable in the same feature amount space, and the third feature amount information is information of a representative position of the plurality of third feature amounts in the feature amount space. Preferably, the processor is configured to: calculate a first distance between the representative position and a position of the first feature amount in the feature amount space; calculate a third distance between a representative position of a plurality of the second feature amounts and the position of the first feature amount in the feature amount space; and perform the determination on the basis of the first distance and the third distance.
Preferably, the processor is configured to determine that the abnormality is present in the tissue section included in the first section image in a case where a weighted sum of the first distance and the third distance is out of a setting range.
Preferably, the processor is configured to receive an instruction to set weighting coefficients, by which the first distance and the third distance are multiplied, in the weighted sum.
Preferably, the first section image and the second section image are images obtained by imaging a slide specimen on which tissue sections of a plurality of types of organs are placed, and the processor is configured to: identify the tissue sections of each organ from the first section image and the second section image; extract the first feature amount and the second feature amount of each of the identified tissue sections of each organ; and determine whether or not an abnormality is present in each of the tissue sections of each organ included in the first section image on the basis of the first feature amount, the second feature amount, and the third feature amount information.
According to another aspect of the present disclosure, there is provided a method for operating a drug discovery support apparatus. The method comprises: acquiring a first section image obtained by imaging a tissue section of at least one of a plurality of subjects constituting an administration group to which a candidate substance for a drug has been administered; extracting a first feature amount from the first section image; acquiring a plurality of second section images including tissue sections of a plurality of subjects that have the same attributes and are placed under the same breeding environment as the subjects constituting the administration group and that constitute a control group to which the candidate substance for a drug has not been administered; extracting a second feature amount from each of the plurality of second section images; acquiring third feature amount information related to a third feature amount extracted from each of a plurality of third section images including tissue sections of a plurality of subjects constituting a control group to which the candidate substance was not administered in a past evaluation test; and determining whether or not an abnormality is present in the tissue section included in the first section image on the basis of the first feature amount, the second feature amount, and the third feature amount information.
According to still another aspect of the present disclosure, there is provided a program for operating a drug discovery support apparatus. The program causes a computer to execute a process comprising: acquiring a first section image obtained by imaging a tissue section of at least one of a plurality of subjects constituting an administration group to which a candidate substance for a drug has been administered; extracting a first feature amount from the first section image; acquiring a plurality of second section images obtained by imaging tissue sections of a plurality of subjects that have the same attributes and are placed under the same breeding environment as the subjects constituting the administration group and that constitute a control group to which the candidate substance for a drug has not been administered; extracting a second feature amount from each of the plurality of second section images; acquiring third feature amount information related to a third feature amount extracted from each of a plurality of third section images obtained by imaging tissue sections of a plurality of subjects constituting a control group to which the candidate substance was not administered in a past evaluation test; and determining whether or not an abnormality is present in the tissue section included in the first section image on the basis of the first feature amount, the second feature amount, and the third feature amount information.
According to the technology of the present disclosure, it is possible to provide a drug discovery support apparatus, a method for operating a drug discovery support apparatus, and a program for operating a drug discovery support apparatus that can determine whether or not an abnormality caused by a candidate substance for a drug has occurred in a subject in an administration group.
Exemplary embodiments according to the technique of the present disclosure will be described in detail based on the following figures, wherein:
For example, as illustrated in
A section image 15 is input to the drug discovery support apparatus 10. The section image 15 is generated, for example, by the following procedure. First, a subject S, such as a rat, that is prepared to evaluate the candidate substance 26 is autopsied, and a plurality of tissue sections (hereinafter, referred to as liver sections) LVS of cross sections of an organ, here, a liver LV of the subject S are collected. Then, the collected liver sections LVS are attached one by one to a slide glass 16. Then, the liver sections LVS are stained, here, with a hematoxylin-eosin dye. Then, the stained liver sections LVS are covered with a cover glass 17 to complete a slide specimen 18. Then, the slide specimen 18 is set in an imaging device 19, such as a digital optical microscope, and the section image 15 is captured by the imaging device 19. The section image 15 obtained in this way is given subject identification data (ID) for uniquely identifying the subject S, an image ID for uniquely identifying the section image 15, an imaging date and time, and the like. In addition, the staining may be staining with a hematoxylin dye alone, staining with a nuclear fast red dye, or the like.
For example, as illustrated in
The second section image 152 is an image obtained by imaging the liver section LVS of the subject S in a current control group 27. The current control group 27 is composed of a plurality of subjects S to which the candidate substance 26 for a drug has not been administered, contrary to the current administration group 25. The subjects S constituting the current control group 27 have the same attributes and are placed under the same breeding environment as the subjects S constituting the current administration group 25. The same attributes include, for example, the same weekly age and/or the same gender. In addition, the same attributes also include the same weekly age composition ratio and/or the same gender composition ratio (for example, five males and five females). The same breeding environment means, for example, that feed is the same, that the temperature and humidity of a breeding space are the same, and/or that the size of the breeding space is the same. The “same” in the same breeding environment indicates not only the exact same, but also the same including an error that is generally allowed in the technical field to which the technology of the present disclosure belongs and that does not go against the gist of the technology of the present disclosure. The number of subjects S constituting the current control group 27 is also, for example, about 5 to 10 that is equal to the number of subjects S in the current administration group 25. Similarly to the first section images 151, a plurality of second section images 152 are obtained from one subject S. Therefore, the number of second section images 152 obtained from the current control group 27 is a value obtained by multiplying the number of second section images 152 obtained from one subject S by the number of subjects S.
Further, a plurality of groups having different doses of the candidate substance 26 are present in the current administration group 25. For example, the dose of the candidate substance 26 varies in three stages of a high-dose group, a medium-dose group, and a low-dose group. This makes it possible to determine the influence of the dose of the candidate substance 26 on the subject S.
For example, as illustrated in
The storage 30 is a hard disk drive that is provided in the computer constituting the drug discovery support apparatus 10 or that is connected via a cable or a network. Alternatively, the storage 30 is a disk array in which a plurality of hard disk drives are connected in series. The storage 30 stores a control program, such as an operating system, various application programs, various types of data associated with these programs, and the like. In addition, a solid state drive may be used instead of the hard disk drive.
The memory 31 is a work memory for the CPU 32 to execute processes. The CPU 32 loads the program stored in the storage 30 into the memory 31 and executes a process corresponding to the program. Therefore, the CPU 32 controls the overall operation of each unit of the computer. In addition, the CPU 32 is an example of a “processor” according to the technology of the present disclosure. Further, the memory 31 may be provided in the CPU 32. The communication unit 33 controls the transmission of various types of information to an external device such as the imaging device 19.
For example, as illustrated in
In a case where the operation program 40 is started, the CPU 32 of the computer constituting the drug discovery support apparatus 10 functions as a read write (hereinafter, abbreviated to RW) control unit 50, a first feature amount extraction unit 511, a second feature amount extraction unit 512, a determination unit 52, and a display control unit 53 in cooperation with the memory 31 and the like.
The RW control unit 50 controls the storage of various types of data in the storage 30 and the reading-out of various types of data in the storage 30. For example, the RW control unit 50 stores the first section image 151 and the second section image 152 from the imaging device 19 in the storage 30.
The RW control unit 50 reads out the first section image 151 from the storage 30 to acquire the first section image 151. The RW control unit 50 outputs the first section image 151 to the first feature amount extraction unit 511 and the display control unit 53. The first section image 151 output from the RW control unit 50 to the first feature amount extraction unit 511 is an object in which whether or not an abnormality is present in the liver section LVS is to be determined. Hereinafter, the first section image 151 in which whether or not an abnormality is present in the liver section LVS is to be determined is referred to as a target first section image 151T (see
The RW control unit 50 reads out a plurality of second section images 152 from the storage 30 to acquire the plurality of second section images 152. The RW control unit 50 outputs all of the second section images 152 to the second feature amount extraction unit 512.
The RW control unit 50 reads out the feature amount extractor 41 from the storage 30 and outputs the feature amount extractor 41 to the first feature amount extraction unit 511 and the second feature amount extraction unit 512. In addition, the RW control unit 50 reads out the third feature amount information 42 from the storage 30 to acquire the third feature amount information 42 and outputs the third feature amount information 42 to the determination unit 52. Further, the RW control unit 50 reads out the setting range information 43 from the storage 30 and outputs the setting range information 43 to the determination unit 52.
The first feature amount extraction unit 511 extracts a first feature amount 601 from the target first section image 151T using the feature amount extractor 41. The first feature amount extraction unit 511 outputs the first feature amount 601 to the determination unit 52.
The second feature amount extraction unit 512 extracts a second feature amount 602 from each of the second section images 152 using the feature amount extractor 41. The second feature amount extraction unit 512 outputs the second feature amount 602 to the determination unit 52.
The determination unit 52 determines whether or not an abnormality caused by the candidate substance 26 is present in the liver section LVS included in the target first section image 151T on the basis of the first feature amount 601, the second feature amount 602, the third feature amount information 42, and the setting range information 43. Examples of the abnormality include lesions, such as tumors, inflammation, cysts, and infiltrates, that are not seen in the normal liver section LVS. The determination unit 52 outputs, to the display control unit 53, a determination result 61 of whether or not the abnormality caused by the candidate substance 26 is present in the liver section LVS included in the target first section image 151T.
The display control unit 53 performs control to display various screens on the display 11. The various screens include, for example, a drug discovery support screen 100 (see
For example, as illustrated in
For example, as illustrated in
For example, as illustrated in
As is well known, the encoder unit 71 includes a convolutional layer that performs a convolution process using a filter, a pooling layer that performs a pooling process, such as a maximum pooling process, and the like. The same applies to the decoder unit 72. The encoder unit 71 repeatedly performs the convolution process using the convolutional layer and the pooling process using the pooling layer on the input patch image 65 a plurality of times to extract the feature amount 60. The extracted feature amount 60 indicates the shape and texture features of the liver section LVS included in the patch image 65.
The feature amount 60 is a set of a plurality of numerical values. That is, the feature amount 60 is multi-dimensional data. The number of dimensions of the feature amount 60 is, for example, 512, 1024, 2048, or the like. The first feature amount 601, the second feature amount 602, and a third feature amount 603 (see
For example, as illustrated in
In the learning phase of the autoencoder 70, the series of processes of the input of the third patch image 653L for learning to the autoencoder 70, the output of the restored image 73L for learning from the autoencoder 70, the loss calculation, the update setting, and the update of the autoencoder 70 is repeatedly performed while the third patch image 653L for learning is exchanged. The repetition of the series of processes is ended in a case where the accuracy of the restoration from the third patch image 653L for learning to the restored image 73L for learning reaches a predetermined setting level. The encoder unit 71 of the autoencoder 70 whose restoration accuracy has reached the setting level in this way is stored as the feature amount extractor 41 in the storage 30 of the drug discovery support apparatus 10. In addition, in a case where the series of processes is repeated a set number of times, the training may be ended, regardless of the accuracy of the restoration from the third patch image 653L for learning to the restored image 73L for learning.
This training of the autoencoder 70 may be performed by the drug discovery support apparatus 10 or may be performed by an apparatus different from the drug discovery support apparatus 10. In the latter case, the feature amount extractor 41 is transmitted from another apparatus to the drug discovery support apparatus 10, and the RW control unit 50 stores the feature amount extractor 41 in the storage 30.
For example, as illustrated in
Next, the structure of the third feature amount information 42 will be described. First, for example, as illustrated in
Similarly to the training of the autoencoder 70, the third feature amount information 42 may be created by the drug discovery support apparatus 10 or may be created by an apparatus different from the drug discovery support apparatus 10. In the latter case, the third feature amount information 42 is transmitted from another apparatus to the drug discovery support apparatus 10, and the RW control unit 50 stores the third feature amount information 42 in the storage 30. In addition, the third feature amount information 42 may be the plurality of third feature amounts 603. In this case, the drug discovery support apparatus 10 derives the representative position coordinates 81.
For example, as illustrated in
The determination unit 52 calculates a first distance D1 between the representative position of the third feature amount 603 indicated by the representative position coordinates 81 of the third feature amount information 42 and the position of the first feature amount 601 in the feature amount space 80. The determination unit 52 calculates the first distances D1 of a plurality of first feature amounts 601 extracted for each of a plurality of first patch images 651 obtained by subdividing one target first section image 151T. In addition, the determination unit 52 calculates a second distance D2 between the representative position of the third feature amount 603 and the position of the second feature amount 602 in the feature amount space 80. The determination unit 52 calculates the second distances D2 of a plurality of second feature amounts 602 extracted for each of a plurality of second patch images 652 obtained by subdividing all of the second section images 152. The first distance D1 and the second distance D2 are Mahalanobis distances.
For example, as illustrated in
For example, as illustrated in
For example, as illustrated in
As in the case of the first area ratio 951, the determination unit 52 calculates a second area ratio 952 of the second patch images 652, for which the second distance D2 is out of the setting range, to the second section image 152. However, since the second distance D2 is calculated on the basis of all of the second section images 152 as described above, the determination unit 52 calculates the second area ratio 952 for each of the second section images 152 as illustrated in
For example, as illustrated in
For example, as illustrated in
An abnormality determination button 102 is provided in a lower portion of the drug discovery support screen 100. In a case where the operator selects the abnormality determination button 102, the instruction receiving unit 110 receives an abnormality determination instruction. The abnormality determination instruction is an instruction for causing the first feature amount extraction unit 511, the second feature amount extraction unit 512, and the determination unit 52 to extract the first feature amount 601 and the second feature amount 602 and to determine whether or not an abnormality is present in the liver section LVS included in the first section image 151 displayed on the drug discovery support screen 100. That is, the first section image 151 displayed on the drug discovery support screen 100 corresponds to the target first section image 151T.
In a case where the abnormality determination button 102 has been selected and the determination unit 52 has determined whether or not an abnormality is present in the liver section LVS included in the target first section image 151T, for example, the display control unit 53 displays the determination result 61 below each first section image 151 as illustrated in
Next, the operation of the above-described configuration will be described with reference to, for example, a flowchart illustrated in
First, the imaging device 19 captures the first section image 151 obtained by imaging the liver section LVS of the subject S in the current administration group 25 and the second section image 152 obtained by imaging the liver section LVS of the subject S in the current control group 27. The first section image 151 and the second section image 152 are output from the imaging device 19 to the drug discovery support apparatus 10. In the drug discovery support apparatus 10, the RW control unit 50 stores the first section image 151 and the second section image 152 from the imaging device 19 in the storage 30 (Step ST100).
In a case where the operator inputs an instruction to display the first section image 151 through the input device 12 (YES in Step ST110), the RW control unit 50 reads out the first section image 151 designated by the display instruction from the storage 30 (Step ST120). The first section image 151 is output from the RW control unit 50 to the display control unit 53. As illustrated in
In a case where the operator selects the abnormality determination button 102 on the drug discovery support screen 100 to issue an abnormality determination instruction (YES in Step ST140), the RW control unit 50 reads out the first section image 151 displayed on the drug discovery support screen 100 at that time as the target first section image 151T from the storage 30 to acquire the first section image 151 (Step ST150). In addition, the RW control unit 50 reads out all of the second section images 152 from the storage 30 to acquire the second section images 152 (Step ST150). The target first section image 151T is output from the RW control unit 50 to the first feature amount extraction unit 511. Further, the second section image 152 is output from the RW control unit 50 to the second feature amount extraction unit 512.
The feature amount extractor 41 is read out from the storage 30 by the RW control unit 50 (Step ST150) and is output to the first feature amount extraction unit 511 and the second feature amount extraction unit 512. In addition, the third feature amount information 42 is read out from the storage 30 by the RW control unit 50 to be acquired (Step ST150) and is output to the determination unit 52. Further, the setting range information 43 is read out from the storage 30 by the RW control unit 50 (Step ST150) and is output to the determination unit 52.
As illustrated in
Then, as illustrated in
As illustrated in
As illustrated in
As illustrated in
As described above, the CPU 32 of the drug discovery support apparatus 10 comprises the RW control unit 50, the first feature amount extraction unit 511, the second feature amount extraction unit 512, and the determination unit 52. The RW control unit 50 reads out the first section image 151, the second section image 152, and the third feature amount information 42 from the storage 30 to acquire them. The first section image 151 is an image obtained by imaging the liver section LVS of the subject S constituting the current administration group 25 to which the candidate substance 26 for a drug has been administered. The second section images 152 are a plurality of images obtained by imaging the liver sections LVS of a plurality of subjects S that have the same attributes and are placed in the same breeding environment as the subjects S constituting the current administration group 25 and that constitute the current control group 27 to which the candidate substance 26 for a drug has not been administered. The third feature amount information 42 is information related to the third feature amount 603 extracted from the third section image 153. The third section images 153 are a plurality of images obtained by imaging the liver sections LVS of a plurality of subjects S constituting the past control group 75 to which the candidate substance was not administered in the past evaluation test.
The first feature amount extraction unit 511 extracts the first feature amount 601 from the target first section image 151T. The second feature amount extraction unit 512 extracts the second feature amount 602 from each of the plurality of second section images 152. The determination unit 52 determines whether or not an abnormality is present in the liver section LVS included in the target first section image 151T on the basis of the first feature amount 601, the second feature amount 602, and the third feature amount information 42.
The current control group 27 includes the subject S in which an abnormality has occurred due to the attribute and/or the breeding environment. The same applies to the past control group 75. In the technology of the present disclosure, the second feature amount 602 and the third feature amount information 42 based on the current control group 27 and the past control group 75 are compared with the first feature amount 601 based on the current administration group 25 to determine whether or not an abnormality is present in the liver section LVS of the subject S constituting the current administration group 25. Therefore, in a case where the determination result 61 output by this determination indicates that an abnormality is present, it can be seen that the cause of the abnormality is not the attribute and/or the breeding environment, in other words, that the cause of the abnormality is the candidate substance 26. Therefore, according to the technology of the present disclosure, it is possible to determine whether or not the abnormality caused by the candidate substance 26 for a drug has occurred in the subject S in the current administration group 25.
The subjects S constituting the current control group 27 have the same attributes and are placed in the same breeding environment as the subjects S constituting the current administration group 25. However, since the number of subjects S constituting the current control group 27 is small, it is difficult to perform the determination on the basis of only the number of subjects S constituting the current control group 27. Therefore, in the technology of the present disclosure, the small number of subjects S constituting the current control group 27 is compensated for by the past control group 75. On the other hand, the number of subjects S constituting the past control group 75 is large, but the subjects S constituting the past control group 75 may have attributes and breeding environments different from those of the subjects S constituting the current administration group 25. As a result, in a case where the determination is performed on the basis of only the number of subjects S constituting the past control group 75, there is a concern that there will be many erroneous determinations. Therefore, in the technology of the present disclosure, the differences in the attributes and breeding environments of the subjects S constituting the past control group 75 are compensated for by the current control group 27. That is, the current control group 27 and the past control group 75 have a complementary relationship. In the technology of the present disclosure, the current control group 27 and the past control group 75 having the complementary relationship therebetween can be appropriately combined to determine whether or not the cause of the abnormality occurring in the subject S is the candidate substance 26.
The first feature amount extraction unit 511 and the second feature amount extraction unit 512 extract the first feature amount 601 and the second feature amount 602, using the feature amount extractor 41 used as the encoder unit 71 of the autoencoder 70 trained using a plurality of third section images 153. Therefore, it is possible to easily extract the first feature amount 601 and the second feature amount 602 that well represent the shape and texture features of the liver section LVS.
The first feature amount 601, the second feature amount 602, and the third feature amount 603 have the same number of dimensions and can be compared in the same feature amount space 80. The third feature amount information 42 is information of the representative positions of a plurality of third feature amounts 603 in the feature amount space 80. The determination unit 52 calculates the first distance D1 between the representative position of the third feature amount 603 and the position of the first feature amount 601 in the feature amount space 80. In addition, the determination unit 52 calculates the second distance D2 between the representative position of the third feature amount 603 and the position of each of a plurality of second feature amounts 602 in the feature amount space 80. The determination unit 52 performs the determination on the basis of the first distance D1 and the second distance D2.
The first distance D1 indicates the degree of deviation between the target first section image 151T and the third section image 153, and the second distance D2 indicates the degree of deviation between the second section image 152 and the third section image 153. The degree of deviation between the target first section image 151T and the third section image 153 indicates the degree of difference between the abnormality occurring in the current administration group 25 and the abnormality occurring in the past control group 75. In addition, the degree of deviation between the second section image 152 and the third section image 153 indicates the degree of difference between the abnormality occurring in the current control group 27 and the abnormality occurring in the past control group 75. Therefore, in a case where the determination is performed on the basis of the first distance D1 and the second distance D2, it is possible to perform the determination, considering both the degree of difference between the abnormality occurring in the current administration group 25 and the abnormality occurring in the past control group 75 and the degree of difference between the abnormality occurring in the current control group 27 and the abnormality occurring in the past control group 75. As a result, it is possible to increase the accuracy of the determination.
The first feature amount extraction unit 511 extracts the first feature amount 601 for each of a plurality of first patch images 651 obtained by subdividing the target first section image 151T. In addition, the second feature amount extraction unit 512 extracts the second feature amount 602 for each of a plurality of second patch images 652 obtained by subdividing the second section image 152. The determination unit 52 calculates the first distance D1 for each of a plurality of first feature amounts 601 extracted from the plurality of first patch images 651. Further, the determination unit 52 calculates the second distance D2 for each of a plurality of second feature amounts 602 extracted from the plurality of second patch images 652. The determination unit 52 performs the determination on the basis of the first area ratio 951 of the first patch images 651, for which the first distance D1 is out of the setting range, to the target first section image 151T and the second area ratio 952 of the second patch images 652, for which the second distance D2 is out of the setting range, to the second section image 152. The amount of information used as a reference for determination increases significantly, as compared to a case where the determination is performed without subdividing the target first section image 151T and the second section image 152 into the first patch images 651 and the second patch images 652. Therefore, it is possible to further increase the accuracy of the determination.
The value of the first area ratio 951 is larger as the degree of difference between the abnormality occurring in the current administration group 25 and the abnormality occurring in the past control group 75 is larger (the abnormality occurring in the current administration group 25 is more caused by the candidate substance 26). On the other hand, the value of the second area ratio 952 is smaller as the degree of difference between the abnormality occurring in the current control group 27 and the abnormality occurring in the past control group 75 is smaller (the abnormality occurring in the current control group 27 is more similar to the abnormality occurring in the past control group 75). Therefore, in a case where it is considered that there is a statistically significant difference between the first area ratio 951 and the second area ratio 952, it can be said that the abnormality caused by the candidate substance 26 is highly likely to be present in the liver section LVS included in the target first section image 151T. Therefore, in a case where there is a statistically significant difference between the first area ratio 951 and the second area ratio 952, it is determined that an abnormality is present in the liver section LVS included in the target first section image 151T, which makes it possible to further increase the accuracy of the determination.
In addition, a criterion for determination may be, for example, a threshold value TH set on the basis of the second area ratio 952 as illustrated in
For example, as illustrated in
The determination unit 52 calculates a weighted sum DS=W1×D1+W3×D3 of the first distance D1 and the third distance D3. Here, W1 is a weighting coefficient by which the first distance D1 is multiplied, and W3 is a weighting coefficient by which the third distance D3 is multiplied. 0≤W1≤1 and 0≤W3≤1 are established, and W1+W3=1 is established. The determination unit 52 compares the weighted sum DS with a setting range of setting range information 105. The setting range information 105 is stored in the storage 30 and includes one threshold value indicating the setting range. In a case where the weighted sum DS is less than the threshold value of the setting range, the determination unit 52 outputs a comparison result 106 indicating that the weighted sum DS is within the setting range. On the other hand, in a case where the weighted sum DS is equal to or greater than the threshold value of the setting range, the determination unit 52 outputs a comparison result 106 indicating that the weighted sum DS is out of the setting range.
The determination unit 52 calculates the first distance D1, the third distance D3, and the weighted sum DS for each of the first feature amounts 601 constituting the distribution 85 illustrated in
As described above, in the 2_1-th embodiment, the determination unit 52 calculates the first distance D1 between the representative position of the third feature amount 603 and the position of the first feature amount 601 in the feature amount space 80 and calculates the third distance D3 between the representative position of a plurality of second feature amounts 602 and the position of the first feature amount 601 in the feature amount space 80. Then, the determination unit 52 performs the determination on the basis of the first distance D1 and the third distance D3.
As described above, the first distance D1 indicates the degree of deviation between the target first section image 151T and the third section image 153, and the degree of deviation between the target first section image 151T and the third section image 153 indicates the degree of difference between the abnormality occurring in the current administration group 25 and the abnormality occurring in the past control group 75. In addition, the third distance D3 indicates the degree of deviation between the target first section image 151T and the second section image 152, and the degree of deviation between the target first section image 151T and the second section image 152 indicates the degree of difference between the abnormality occurring in the current administration group 25 and the abnormality occurring in the current control group 27. Therefore, in a case where the determination is performed on the basis of the first distance D1 and the third distance D3, it is possible to perform the determination, considering both the degree of difference between the abnormality occurring in the current administration group 25 and the abnormality occurring in the past control group 75 and the degree of difference between the abnormality occurring in the current administration group 25 and the abnormality occurring in the current control group 27. As a result, it is possible to increase the accuracy of the determination. However, in the 2_1-th embodiment, it is necessary to derive the representative position of the second feature amounts 602. Therefore, in the first embodiment that does not require the derivation, it takes a small amount of time and effort to perform the process.
In a case where the weighted sum DS of the first distance D1 and the third distance D3 is out of the setting range, the determination unit 52 determines that an abnormality is present in the liver section LVS included in the target first section image 151T. Therefore, it is possible to perform the determination, considering both the degree of difference between the abnormality occurring in the current administration group 25 and the abnormality occurring in the past control group 75 and the degree of difference between the abnormality occurring in the current administration group 25 and the abnormality occurring in the current control group 27. As a result, it is possible to increase the accuracy of the determination.
In addition, the determination may be performed on the basis of a simple sum D1+D3 of the first distance D1 and the third distance D3 instead of the weighted sum DS. Further, in a case where the area ratio of the first patch images 651, for which the comparison result 106 is out of the setting range, to the first section image 151 is equal to or greater than the threshold value, it may be determined that an abnormality is present in the liver section LVS included in the first section image 151.
For example, as illustrated in
In a case where the current evaluation test related to the current administration group 25 and the current control group 27 and the past evaluation test related to the past control group 75 have similar content, the operator sets the weighting coefficients such that the weighting coefficient W1 is larger than the weighting coefficient W3. On the other hand, in a case where the current evaluation test related to the current administration group 25 and the current control group 27 and the past evaluation test related to the past control group 75 have different content, the operator sets the weighting coefficients such that the weighting coefficient W3 is larger value than the weighting coefficient W1.
As described above, in the 2_2-th embodiment, the instruction receiving unit 110 receives the coefficient setting instruction 111 which is an instruction to set the weighting coefficients W1 and W3 that are multiplied by the first distance D1 and the third distance D3 in the weighted sum DS. Therefore, the operator can set the weighting coefficients W1 and W3 adapted to the similarity between the current evaluation test related to the current administration group 25 and the current control group 27 and the past evaluation test related to the past control group 75. As a result, it is possible to increase the reliability of the determination result 61.
In addition, the similarity between the current evaluation test related to the current administration group 25 and the current control group 27 and the past evaluation test related to the past control group 75 may be derived by the comparison between the test conditions of the current evaluation test and the test conditions of the past evaluation test, and the weighting coefficients W1 and W3 adapted to the derived similarity may be automatically set.
For example, as illustrated in
The CPU 32 of the drug discovery support apparatus 10 according to the third embodiment functions as an identification unit 121 in addition to each of the processing units 50 to 53 according to the first embodiment. For example, the identification unit 121 identifies tissue sections of each organ from the section image 15 (the first section image 151 and the second section image 152), using a machine learning model and a template for identifying tissue sections of each organ. The identification unit 121 outputs coordinate information of frames 122 to 125 surrounding the tissue sections of each organ as identification results. The frame 122 is a frame surrounding the heart section HS, and the frame 123 is a frame surrounding the liver section LVS. In addition, the frame 124 is a frame surrounding the brain section BS, and the frame 125 is a frame surrounding the bone marrow section BMS.
As described above, in the third embodiment, the first section image 151 and the second section image 152 are images obtained by imaging the slide specimen 120 on which the tissue sections of a plurality of types of organs are placed. The identification unit 121 identifies the tissue sections of each organ from the first section image 151 and the second section image 152. The first feature amount extraction unit 511 and the second feature amount extraction unit 512 extract the first feature amount 601 and the second feature amount 602 of each of the identified tissue sections of each organ. The determination unit 52 determines whether or not an abnormality is present in each of the tissue sections of each organ included in the target first section image 151T on the basis of the first feature amount 601, the second feature amount 602, and the third feature amount information 42. Therefore, it is also possible to respond to the slide specimen 120 on which the tissue sections of a plurality of types of organs are placed. For the slide specimen, the slide specimen 120 on which the tissue sections of a plurality of types of organs are placed as in the third embodiment is more general than the slide specimen 18 on which the tissue section of one organ is placed as in the first embodiment. Therefore, it is possible to perform drug discovery support that is more suitable for general operations.
A configuration may be adopted in which the operator manually defines the frames indicating the tissue sections of each organ in the first section image 151 and the second section image 152.
In addition, as in a drug discovery support screen 135 illustrated in
The encoder unit 71 of the autoencoder 70 is given as an example of the feature amount extractor 41. However, the present disclosure is not limited thereto. An encoder unit of a convolutional neural network which has been trained to output a determination result of whether or not an abnormality is present in a certain class, for example, the liver section LVS included in the patch image 65 in response to the input of the patch image 65 may be used as the feature amount extractor 41. In addition, another machine learning model, such as a support vector machine, may be used as the feature amount extractor 41.
In the tissue section included in the third patch image 653L for learning, a region that is likely to be erroneously detected as an abnormality may be masked and then used for learning. Examples of the region that is likely to be erroneously detected as an abnormality include a blood vessel region in the liver section LVS and dust stuck in a case where the slide specimen 18 or 120 is created.
The feature amount 60 is not limited to the feature amount extracted by the feature amount extractor 41. The feature amount 60 may be, for example, the average value, maximum value, minimum value, mode value, or variance of the pixel values of the patch image 65.
The organ is not limited to the liver LV or the like given as an example. The organ may be a stomach, a lung, a small intestine, a large intestine, or the like. In addition, the subject S is not limited to the rat. The subject S may be a mouse, a guinea pig, a sand mouse, a hamster, a ferret, a rabbit, a dog, a cat, a monkey, or the like.
The technique described in JP2021-508373A may be applied. It is determined that an abnormality has occurred in the tissue section of the subject S in the current administration group 25 by the technique described in JP2021-508373A. However, the technology of the present disclosure shows that, in a case where the content of the determination result 61 of the subject S is that there is no abnormality, the cause of the abnormality is not the candidate substance 26 but the attribute and the breeding environment.
The drug discovery support apparatus 10 may be a personal computer that is installed in a pharmaceutical facility as illustrated in
In a case where the drug discovery support apparatus 10 is configured by the server computer, the first section image 151 and the second section image 152 are transmitted from the personal computer installed in each pharmaceutical facility to the server computer via a network such as the Internet. The server computer delivers the drug discovery support screen 100 or 135 to the personal computer in the form of screen data for web delivery created by a markup language such as Extensible Markup Language (XML). The personal computer reproduces the drug discovery support screen 100 or 135 to be displayed on the web browser on the basis of the screen data and displays the drug discovery support screen 100 or 135 on the display. Further, instead of XML, another data description language, such as JavaScript (registered trademark) Object Notation (JSON), may be used.
The drug discovery support apparatus 10 according to the technology of the present disclosure can be widely used throughout all stages of drug development from the setting of a drug discovery target, which is the initial stage, to a clinical trial which is the final stage.
The hardware configuration of the computer constituting the drug discovery support apparatus 10 according to the technology of the present disclosure can be modified in various ways. For example, the drug discovery support apparatus 10 may be configured by a plurality of computers that are separated as hardware for the purpose of improving processing capability and reliability. For example, the functions of the first feature amount extraction unit 511 and the second feature amount extraction unit 512 and the functions of the determination unit 52 may be distributed to two computers. In this case, the drug discovery support apparatus 10 is configured by the two computers.
As described above, the hardware configuration of the computer of the drug discovery support apparatus 10 can be changed as appropriate depending on required performance such as processing capacity, safety, and reliability. Further, not only the hardware but also an application program, such as the operation program 40, can be duplicated or distributed and stored in a plurality of storages for the purpose of ensuring safety and reliability.
In each of the above-described embodiments, for example, the following various processors can be used as a hardware structure of processing units executing various processes, such as the RW control unit 50, the first feature amount extraction unit 511, the second feature amount extraction unit 512, the determination unit 52, the display control unit 53, the instruction receiving unit 110, and the identification unit 121. The various processors include, for example, the CPU 32 which is a general-purpose processor executing software (operation program 40) to function as various processing units as described above, a programmable logic device (PLD), such as a field programmable gate array (FPGA), which is a processor whose circuit configuration can be changed after manufacture, and a dedicated electric circuit, such as an application specific integrated circuit (ASIC), which is a processor having a dedicated circuit configuration designed to execute a specific process.
One processing unit may be configured by one of the various processors or by a combination of two or more processors of the same type or different types (for example, a combination of a plurality of FPGAs and/or a combination of a CPU and an FPGA). Further, a plurality of processing units may be configured by one processor.
A first example of the configuration in which a plurality of processing units are configured by one processor is an aspect in which one processor is configured by a combination of one or more CPUs and software and functions as a plurality of processing units. A representative example of this aspect is a client computer or a server computer. A second example of the configuration is an aspect in which a processor that implements the functions of the entire system including a plurality of processing units using one integrated circuit (IC) chip is used. A representative example of this aspect is a system on chip (SoC). As described above, the various processing units are configured by using one or more of the above various processors as the hardware structure.
In addition, more specifically, an electric circuit (circuitry) in which circuit elements, such as semiconductor elements, are combined can be used as the hardware structure of these various processors.
In the technology of the present disclosure, the above-described various embodiments and/or various modification examples may be combined with each other as appropriate. In addition, it goes without saying that the present disclosure is not limited to each of the above-described embodiments, and various configurations can be adopted without departing from the gist of the present disclosure. Furthermore, the technology of the present disclosure extends to a storage medium that non-temporarily stores a program, in addition to the program.
The above descriptions and illustrations are detailed descriptions of portions related to the technology of the present disclosure and are merely examples of the technology of the present disclosure. For example, the above description of the configurations, functions, operations, and effects is the description of examples of the configurations, functions, operations, and effects of portions related to the technology of the present disclosure. Therefore, unnecessary portions may be deleted or new elements may be added or replaced in the above descriptions and illustrations without departing from the gist of the technology of the present disclosure. In addition, in the above descriptions and illustrations, the description of, for example, common technical knowledge that does not need to be particularly described to enable the implementation of the technology of the present disclosure is omitted in order to avoid confusion and facilitate the understanding of portions related to the technology of the present disclosure.
In the present specification, “A and/or B” is synonymous with “at least one of A or B”. That is, “A and/or B” means only A, only B, or a combination of A and B. Further, in the present specification, the same concept as “A and/or B” is applied to a case where the connection of three or more matters is expressed by “and/or”.
All of the publications, the patent applications, and the technical standards described in the present specification are incorporated by reference herein to the same extent as each individual document, each patent application, and each technical standard are specifically and individually stated to be incorporated by reference.
Number | Date | Country | Kind |
---|---|---|---|
2022-054509 | Mar 2022 | JP | national |
This application is a continuation application of International Application No. PCT/JP2023/002930, filed on Jan. 30, 2023, the disclosure of which is incorporated herein by reference in its entirety. Further, this application claims priority from Japanese Patent Application No. 2022-054509, filed on Mar. 29, 2022, the disclosure of which is incorporated herein by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/JP2023/002930 | Jan 2023 | WO |
Child | 18898698 | US |