The present disclosure relates to an information processing apparatus, an information processing method, and a program.
In fields such as medicine and biochemistry, to measure the properties of large numbers of cells rapidly, the use of a flow cytometer is becoming commonplace. A flow cytometer is an apparatus that optically measures the properties of cells by irradiating cells flowing through a flow cell with light, and detecting fluorescence emitted from the cells, scattered light, and the like.
Recently, for flow cytometers, the number of fluorescences that can be measured at one time is increasing. With this arrangement, since the increase in the dimensionality of the measurement data causes a combinatorial explosion to occur, it is becoming difficult to manually analyze the data measured with a flow cytometer.
For this reason, the analysis of multidimensional data measured with a flow cytometer by clustering through machine learning is being investigated, as disclosed in Elad David Amir, et al, “viSNE enables visualization of high dimensional single-cell data and reveals phenotypic heterogeneity of leukemia”, Nature Biotechnology, 2013 Jun. 31(6), 545-552.
However, in the case in which the amount of noise is the same in each dimension, the clustering performance drops for data of high dimensionality. For this reason, in the case of clustering the measurement data of a flow cytometer, it is typical to reduce the dimensionality by performing fluorescence separation and the like on the measurement data, and cluster the dimensionally-compressed data.
However, in the case of performing dimension compression on multidimensional data, part of the information included in the measurement data is lost due to the dimension compression. For this reason, for example, in the case in which the clustering result of the multidimensional data is not appropriate, it is difficult for a user to go back to the measurement data of the flow cytometer and verify the validity of the clustering result.
Accordingly, the present disclosure proposes a novel and improved information processing apparatus, information processing method, and program capable of clustering measurement data using dimensionally-compressed data, while also enabling verification of the clustering result going back to the measurement data.
According to an embodiment of the present disclosure, there is provided an information processing apparatus including: an information storage unit that stores a result of sensing light from cells, namely first data, and a result of separating the first data into a plurality of fluorescences, namely second data, in association with each other; a clustering unit that clusters the cells into a plurality of clusters on the basis of the second data; and an output unit that outputs a clustering result from the clustering unit. The output unit additionally outputs at least one or more of the first data and the second data about the cells included in a cluster selected by a user from among the plurality of clusters.
In addition, according to an embodiment of the present disclosure, there is provided an information processing method including: storing a result of sensing light from cells, namely first data, and a result of separating the first data into a plurality of fluorescences, namely second data, in association with each other; clustering the cells into a plurality of clusters on the basis of the second data; and outputting a clustering result from the clustering unit; and additionally outputting at least one or more of the first data and the second data about the cells included in a cluster selected by a user from among the plurality of clusters.
In addition, according to an embodiment of the present disclosure, there is provided a program causing a computer to function as: an information storage unit that stores a result of sensing light from cells, namely first data, and a result of separating the first data into a plurality of fluorescences, namely second data, in association with each other; a clustering unit that clusters the cells into a plurality of clusters on the basis of the second data; and an output unit that outputs a clustering result from the clustering unit. The output unit is made to function to additionally output at least one or more of the first data and the second data about the cells included in a cluster selected by a user from among the plurality of clusters.
According to an embodiment of the present disclosure, measurement data and data obtained by dimensionally compressing the measurement data may be stored in association with each other.
According to an embodiment of the present disclosure as described above, it is possible to cluster measurement data using dimensionally-compressed data, while also enabling verification of the clustering result going back to the measurement data.
Note that the effects described above are not necessarily limitative. With or in the place of the above effects, there may be achieved any one of the effects described in this specification or other effects that may be grasped from this specification.
Hereinafter, (a) preferred embodiment(s) of the present disclosure will be described in detail with reference to the appended drawings. Note that, in this specification and the appended drawings, structural elements that have substantially the same function and structure are denoted with the same reference numerals, and repeated explanation of these structural elements is omitted.
1. Exemplary Configuration of Overall System
2. Exemplary Configuration of Information Processing Apparatus
3. Example of Operations of Information Processing Apparatus
4. Modifications
5. Exemplary Hardware Configuration
First,
As illustrated in
The measurement apparatus 10 is a measurement apparatus capable of detecting the fluorescence of each color from cells or the like targeted for measurement. The measurement apparatus 10 may be, for example, a flow cytometer that causes fluorescently stained cells to flow through a flow cell at high speed, and by irradiating the flowing cells with light rays, detects the fluorescence of each color of light from the cells.
The information processing apparatus 20 clusters each of the cells targeted for measurement on the basis of information related to the fluorescence of the cells measured by the measurement apparatus 10. With this arrangement, the information processing apparatus 20 is able to divide each of the cells measured by the measurement apparatus 10 into multiple groups (that is, clusters). Also, the information processing apparatus 20 stores the measurement data measured by the measurement apparatus 10 in association with clustering data obtained by performing dimension compression and the like to make the measurement data suitable for clustering. With this arrangement, the information processing apparatus 20 is able to reference the measurement data from before the dimension compression when judging the validity of the analysis result, while also reducing the time and cost taken for clustering by performing dimension compression and the like on the measurement data. For example, the information processing apparatus 20 may be a server or the like that is capable of processing large amounts of data quickly.
The terminal apparatus 30 and 40 are display apparatus or the like on which a clustering result from the information processing apparatus 20 is output, for example. For example, each of the terminal apparatus 30 and 40 may be a computer, a laptop, a smartphone, a tablet, or the like provided with a display unit that displays an analysis result received from the information processing apparatus 20 with an image, text, or the like.
In the system 100 including the information processing apparatus 20 according to the present embodiment, first, the information processing apparatus 20 acquires measurement data measured by the measurement apparatus 10 provided in each of a hospital, a clinic, or a laboratory via the network N. After that, the information processing apparatus 20 clusters the acquired measurement data, and outputs a clustering result to the terminal apparatus 30 and 40. Since clustering imposes a heavy information processing load, by intensively executing such processing on the information processing apparatus 20 that includes a dedicated server or the like, the efficiency of the system 100 as a whole may be improved. Furthermore, on the basis of a user selection, the information processing apparatus 20 controls the output of the measurement data and the clustering data obtained by processing the measurement data. With this arrangement, the information processing apparatus 20 is able to appropriately switch to and output information according to the user's demand.
Note that in the above, the measurement apparatus 10, the information processing apparatus 20, and the terminal apparatus 30 and 40 are described as being interconnected through the network N, but the technology according to the present disclosure is not limited to such an example. For example, the measurement apparatus 10, the information processing apparatus 20, and the terminal apparatus 30 and 40 may also be connected directly.
Next,
As illustrated in
The input unit 201 acquires a result of measuring cells targeted for measurement from the measurement apparatus 10. Specifically, the input unit 201 acquires information related to the fluorescence of the cells targeted for measurement from the measurement apparatus 10. The input unit 201 is provided with an external input interface including a connection port, a communication apparatus, or the like for acquiring information from the measurement apparatus 10 through the network N, for example.
Herein, the content of the fluorescence-related information acquired by the input unit 201 is different depending on the fluorescence detection mechanism in the measurement apparatus 10. The specific content of the fluorescence-related information will be described together with the fluorescence detection mechanism in the measurement apparatus 10 illustrated in
As illustrated in
The dichroic mirrors 15 are mirrors that reflect light of a specific wavelength band while transmitting light of another wavelength band. The photodetectors 17 are photomultiplier tubes, photodiodes, or the like, for example. In the first measurement method, by providing the dichroic mirrors 15 that reflect light of different wavelength bands on the optical path of the fluorescent light from the sample 13, the fluorescent light from the sample 13 may be spectrally separated into each wavelength band. For example, in the first detection mechanism, by providing each of a dichroic mirror 15 that reflects light of the wavelength band corresponding to red, a dichroic mirror 15 that reflects light of the wavelength band corresponding to green, and a dichroic mirror 15 that reflects light of the wavelength band corresponding to blue in order from the side where light from the sample 13 is incident, the fluorescent light from the sample 13 may be spectrally separated into each wavelength band.
In the case in which the measurement apparatus 10 detects fluorescence with such a first detection mechanism, the fluorescence-related information acquired by the input unit 201 becomes information related to the intensity of the fluorescent light in each wavelength band.
Also, as illustrated in
The prism 16 is an optical member that disperses incident light, and the photodetector array 18 is a sensor in which multiple photodetectors (photomultipliers or photodiodes) are disposed in an array. In the second detection mechanism, by dispersing fluorescent light from the sample 13 with the prism 16 and detecting the dispersed light with the photodetector array 18, the fluorescent light from the sample 13 may be detected as a continuous spectrum.
In the case in which the measurement apparatus 10 detects fluorescence with such a second detection mechanism, the fluorescence-related information acquired by the input unit 201 becomes information related to the optical spectrum of the fluorescent light.
The fluorescence separation unit 203 separates each fluorescence included in the fluorescent light measured by the measurement apparatus 10, and thereby derives an expression level of a fluorescent substance corresponding to each fluorescence. The cells targeted for measurement are marked by multiple fluorescent substances, and the wavelength distributions of the fluorescence emitted from each of the fluorescent substances overlap each other. For this reason, by correcting the overlap of the wavelength distributions of the fluorescence emitted from each of the fluorescent substances and deriving net amount of light in each fluorescence, the fluorescence separation unit 203 is able to derive the expression level of each fluorescent substance and the expression level of biomolecules and the like marked by each fluorescent substance.
More specifically, in the case in which the fluorescence-related information acquired by the input unit 201 is information related to the intensity of the fluorescent light in each wavelength band, the fluorescence separation unit 203 is able to derive the expression level of each fluorescent substance by the method described with reference to
As illustrated in
For this reason, as illustrated in
Also, in the case in which the fluorescence-related information acquired by the input unit 201 is information related to the optical spectrum of the fluorescent light, the fluorescence separation unit 203 is able to derive the expression level of each fluorescent substance by the method described with reference to
As illustrated in
For this reason, as illustrated in
With this arrangement, even if the content of the fluorescence-related information acquired by the input unit 201 is any of the above, the fluorescence separation unit 203 is able to derive the expression level of the fluorescent substance corresponding to each fluorescence exhibited by the cells.
The information storage unit 205 stores the fluorescence-related information acquired by the input unit 201 in association with information related to the expression level of each fluorescent substance derived by the fluorescence separation unit 203. Specifically, the information storage unit 205 stores information related to fluorescence from cells obtained by the measurement apparatus 10 sensing a sample as first data, and stores information related to the expression level of each fluorescent substance obtained by separating the fluorescence from the cells into multiple fluorescences as second data. At this time, the information storage unit 205 stores this data as consolidated data by storing the first data and the second data derived from the first data in association with each other.
For example, the information storage unit 205 may store data consolidating the first data and the second data in a format as illustrated in
As illustrated in
Herein, the first data is N-dimensional data including information about N fluorescence intensities “PMT1” to “PMTN”, and the second data is M-dimensional data including information about M expression levels “Pigment 1” to “Pigment M”. The dimensionality M of the second data is smaller than the dimensionality of the first data due to the fluorescence separation of the fluorescence separation unit 203.
At this point, since the efficiency and accuracy of clustering rises as the dimensionality becomes smaller, in the clustering unit 207 described later, cells are clustered using the second data. However, since the second data is dimensionally compressed in the fluorescence separation by the fluorescence separation unit 203, there is a possibility that information loss and the like may occur. For this reason, by storing both types of data in association with each other, the information storage unit 205 makes it possible to verify or confirm clustering going back to the measurement data of the measurement apparatus 10 easily, while also raising the efficiency and accuracy of the clustering.
Note that the first data and the second data stored in association with each other in the information storage unit 205 do not have to be information output from the input unit 201 and the fluorescence separation unit 203. For example, in the case in which the measurement apparatus 10 is provided with the fluorescence separation unit 203, the information processing apparatus 20 may acquire information related to the fluorescence of cells from the measurement apparatus 10 and information related to the expression level of each fluorescent substance in the cells, and the information storage unit 205 may store the acquired information in association with each other as the first data and the second data. Alternatively, the information processing apparatus 20 may acquire information related to the fluorescence of cells and information related to the expression level of each fluorescent substance in the cells stored in an external storage apparatus, and the information storage unit 205 may store the acquired information in association with each other as the first data and the second data.
The clustering unit 207 clusters cells on the basis of the expression level of each fluorescent substance in the cells derived by the fluorescence separation unit 203. In other words, the clustering unit 207 clusters cells on the basis of the second data stored by the information storage unit 205. Since the second data indicating the expression level of each fluorescent substance of the cells is multidimensional data, the information processing apparatus 20 is able to use clustering technology based on machine learning to divide the cells into multiple groups (clusters) faster than manually.
The clustering technique used by the clustering unit 207 is not particularly limited, and may be a publicly available clustering technique. For example, the clustering unit 207 may perform clustering using a typical clustering technique such as the ward method, the group average method, the single-link method, or the k-means method, or may also perform clustering using the self-organization map method.
The output unit 209 outputs the result of clustering by the clustering unit 207 to the terminal apparatus 30 and 40 or the like. For example, in the terminal apparatus 30 and 40, the output clustering result may be presented to a user as an image display.
For example, the result of clustering by the clustering unit 207 may be displayed by the image displays illustrated in
For example, as illustrated in
In the display illustrated in
For example, as illustrated in
In the display illustrated in
Furthermore, in the display illustrated in
Also, for a cluster selected by the user, the output unit 209 additionally outputs data about the cells included in the cluster to the terminal apparatus 30 and 40 or the like. Specifically, the output unit 209 additionally outputs one or both of the first data and the second data to the terminal apparatus 30 and 40 or the like as data about the cells included in a cluster selected as a display target by the user. Whether the output unit 209 outputs the first data, the second data, or both the first data and the second data to the terminal apparatus 30 and 40 may be selected by the user, for example.
For example, the output unit 209 may output the first data to the terminal apparatus 30 and 40 as the image display illustrated in
As illustrated in
The output unit 209 may also output the second data to the terminal apparatus 30 and 40 as the image display illustrated in
As illustrated in
According to the information processing apparatus 20 provided with the above configuration, the user becomes able to reference information going back to the measurement data that has not undergone fluorescence separation or the like from the clustering result, and therefore is able to judge the reliability of the clustering and the reliability of the measurement result more easily. Consequently, the information processing apparatus 20 according to the present embodiment is able to improve the traceability of information with respect to the clustering result.
Next,
As illustrated in
Next, the information storage unit 205 stores the first data in association with the second data generated from the first data (S105). Next, the clustering unit 207 clusters the cells on the basis of the second data (S107). Specifically, the clustering unit 207 clusters the cells on the basis of the expression level of each fluorescent substance in the cells. The technique of the clustering by the clustering unit 207 is not particularly limited, and it is possible to use a publicly available technique.
After that, the output unit 209 outputs the result of clustering by the clustering unit 207 to the terminal apparatus 30 and 40 or the like (S109). At this time, assume that from the user who has checked the terminal apparatus 30 and 40 to which the clustering result has been output to an image display or the like, a cluster targeted for additional output is selected (S111), and which of the first data and the second data is to be output is selected (S113). With this arrangement, the output unit 209 confirms whether or not the data selected for output from the user is the first data (S113), and in the case in which the selected data is the first data (S113/Yes), the output unit 209 outputs the first data of each cell belonging to the selected cluster to the terminal apparatus 30 and 40 or the like (S121). On the other hand, in the case in which the selected data is the second data (S113/No), the output unit 209 causes the user to select a combination of fluorescent substances in the second data (S117), and outputs data about the expression levels of the selected combination of fluorescent substances from among the second data to the terminal apparatus 30 and 40 or the like (S119).
According to the above operations, the information processing apparatus 20 is able to go back to the first data and the second data from the clustering result and present information to the user. Consequently, the information processing apparatus 20 according to the present embodiment is able to improve the traceability of information with respect to the clustering result.
Next,
As illustrated in
The sample comparison unit 211 compares the clustering results of multiple samples, and specifies a cluster for which a disparity exists between the compared multiple samples. Specifically, in the case of comparing a first sample and a second sample, first, the sample comparison unit 211 maps each of the cells in the second sample onto the result of the clustering of the first sample by the clustering unit 207. Next, the sample comparison unit 211 compares the clustering result of the first sample to the mapping result of the second sample, and specifies a cluster in which the change between the clustering result of the first sample and the mapping result of the second sample is a threshold value or greater as a disparate cluster. The first data or the second data of the specified disparate cluster may be output to the terminal apparatus 30 and 40 by the output unit 209, for example. Note that, the first sample is a sample gathered from a healthy individual, for example, while the second sample is a sample gathered from a diseased individual, for example.
At this point,
As illustrated in
Next, in each of the clusters clustering each first sample, the sample comparison unit 211 computes representative values of the second data of the cells belonging to each cluster (S203). For example, the sample comparison unit 211 may treat the mean, the mode, or the median of each expression level of each fluorescent substance in the second data as the representative values.
Next, on the basis of the second data of the second sample, the sample comparison unit 211 maps each cell of the second sample onto the cluster with the shortest distance among the clusters in the clustering result of the first sample (S205). Specifically, the sample comparison unit 211 calculates the Euclidean distance or the Manhattan distance between a vector of the second data of each cell of the second sample and the representative value of a cluster that is a clustering result of the first sample.
Next, the sample comparison unit 211 compares the clustering result of the first sample and the mapping result of the second sample, and determines the existence or non-existence of a cluster in which the number of belonging cells has changed by a threshold value or greater between the first sample and the second sample (S207). In the case in which a cluster in which the number of belonging cells has changed by the threshold value or greater between the first sample and the second sample does not exist (S207/No), the information processing apparatus 21 ends operation.
On the other hand, in the case in which a cluster in which the number of belonging cells has changed by the threshold value or greater between the first sample and the second sample exists (S207/Yes), the sample comparison unit 211 specifies the cluster as a disparate cluster (S209). For example, the sample comparison unit 211 may specify a cluster in which the number of belonging cells has changed by a threshold value (such as 2, for example) or greater between the clustering result of the first sample and the mapping result of the second sample as a disparate cluster. Alternatively, the sample comparison unit 211 may specify a cluster in which the proportion of the number of belonging cells with respect to the sample as a whole has changed by a threshold value or greater between the clustering result of the first sample and the mapping result of the second sample as a disparate cluster.
The first data and/or second data of the disparate cluster specified by the sample comparison unit 211 is output by the output unit 209 as an image display or the like to the terminal apparatus 30 and 40, and thereby presented to the user. With this arrangement, the user is able to check information related to the fluorescence and information related to the expression level of each fluorescent substance in a cell group for which a disparity exists between the first sample and the second sample.
Next,
In the example of operations according to the flowchart illustrated in
As illustrated in
Next, the relationships of clustering and mapping of the first sample and the second sample are interchanged, and the operations of S201 to S205 above are executed (S211).
Specifically, each second sample is clustered on the basis of the second data by the clustering unit 207. Next, in each of the clusters clustering each second sample, the sample comparison unit 211 computes representative values of the second data of the cells belonging to each cluster. Next, on the basis of the second data of the first sample, the sample comparison unit 211 maps each cell of the first sample onto the cluster with the shortest distance among the clusters in the clustering result of the second sample. Herein, the combination of the clustered sample (second sample) and the mapped sample (first sample) in S211 will also be designated the second combination.
Next, the sample comparison unit 211 determines whether or not the amount of change in each cluster for the case of mapping the second sample onto the clustering result of the first sample (the first combination) is greater than the amount of change in each cluster for the case of mapping the first sample onto the clustering result of the second sample (the second combination) (S213). In the case in which the amount of change in each cluster of the first combination is greater (S213/Yes), the sample comparison unit 211 selects the first combination (S217), whereas in the case in which the amount of change in each cluster of the second combination is greater (S213/No), the sample comparison unit 211 selects the second combination (S215). The comparison between the amount of change in each cluster of the first combination and the amount of change in each cluster of the second combination may be made according to the maximum value of the amounts of change for all clusters, or according to the number of clusters in which the amount of change is a threshold value or greater, for example.
After that, the sample comparison unit 211 compares the clustering result and the mapping result in the selected combination, and determines the existence or non-existence of a cluster in which the number of belonging cells has changed by a threshold value or greater between the clustering result and the mapping result (S207). In the case in which a cluster in which the number of belonging cells has changed by the threshold value or greater between the clustering result and the mapping result does not exist (S207/No), the information processing apparatus 21 ends operation.
On the other hand, in the case in which a cluster in which the number of belonging cells has changed by the threshold value or greater between the clustering result and the mapping result exists (S207/Yes), the sample comparison unit 211 specifies the cluster as a disparate cluster (S209).
The first data and/or second data of the disparate cluster specified by the sample comparison unit 211 is output by the output unit 209 as an image display or the like to the terminal apparatus 30 and 40, and thereby presented to the user. With this arrangement, the user is able to check information related to the fluorescence and information related to the expression level of each fluorescent substance in a cell group for which a disparity exists between the first sample and the second sample.
Next,
As illustrated in
The cell inquiry unit 213 queries an external database to specify which cell type the disparate cluster specified by the sample comparison unit 211 corresponds to biologically. Specifically, the cell inquiry unit 213 generates information related to an expression pattern of cells included in a disparate cluster from the second data of the disparate cluster specified by the sample comparison unit 211. Next, by inputting information related to the generated expression pattern into an external ontology database, the cell inquiry unit 213 specifies which cell group the disparate cluster corresponds to. Information about the specified cell group may be presented to the user by being output to the terminal apparatus 30 and 40 by the output unit 209, for example.
For the external ontology database, a public database such as the “cell ontology database (https://bioportal.bioontology.org/ontologies/CL)”, for example, or the “flowCL (https://bioconductor.org/packages/release/bioc/html/flowCL.html)” database may be used.
At this point,
As illustrated in
At this point, the cell inquiry unit 213 computes a representative value of the expression level of each fluorescent substance in the disparate cluster from the second data of the cells included in the disparate cluster (S303). For example, the cell inquiry unit 213 may treat the mean, the mode, or the median of the expression level of each fluorescent substance in each cell included in the disparate cluster as the representative value of the expression level of each fluorescent substance in the disparate cluster.
Next, the cell inquiry unit 213 generates information that is inputtable into an external database, on the basis of the computed expression level of each fluorescent substance in the disparate cluster (S305). For example, in the case of using “flowCL” as the external database, the cell inquiry unit 213 may generate information (such as CD3+; CD8−; CD20+, for example) that specifies the positivity or negativity of expression of each marker molecule in the cells.
The positivity or negativity of expression of each marker molecule may be decided relatively by setting an appropriate threshold value such that the expression levels of fluorescent substances in all clusters are dichotomized, and determining whether or not the representative value of the expression level of each fluorescent substance in the disparate cluster exceeds the threshold value. Alternatively, the positivity or negativity of each marker molecule may be decided absolutely on the basis of whether or not the representative value of the expression level of each fluorescent substance in the disparate cluster exceeds a predesignated threshold value.
Next, by inputting the generated information into the external database, the cell inquiry unit 213 queries the cell type of the cells included in the disparate cluster (S307). After that, the cell inquiry unit 213 specifies the cell type of the cells belonging to the disparate cluster on the basis of the query result (S309).
The cell type of the disparate cluster specified by the cell inquiry unit 213 is output to the terminal apparatus 30 and 40 by the output unit 209, and presented to the user as an image display or the like. In addition, the output unit 209 may also output the first data or the second data of the disparate cluster to the terminal apparatus 30 and 40 as well. With this arrangement, the user is able to check the kind of biological cell type of a cell group for which a disparity exists between the first sample and the second sample.
Next,
As illustrated in
The CPU 901 functions as a computational processing device and control device, and controls the overall operation of the information processing apparatus 20 by following various programs stored in the ROM 902 and the like. The ROM 902 stores programs and computational parameters used by the CPU 901, while the RAM 903 stores programs used during execution by the CPU 901 and parameters that change appropriately during such execution. For example, the CPU 901 may execute the functions of the fluorescence separation unit 203, the clustering unit 207, the sample comparison unit 211, and the cell inquiry unit 213.
The CPU 901, the ROM 902, and the RAM 903 are interconnected by the bridge 907, the internal buses 905 and 906, and the like. In addition, the CPU 901, the ROM 902, and the RAM 903 are also connected to the input apparatus 911, the output apparatus 912, the storage apparatus 913, the drive 914, the connection port 915, and the communication apparatus 916 through the interface 908. For example, the RAM 903 may execute the functions of the information storage unit 205.
The input apparatus 911 includes input devices that accept the input of information, such as a touch panel, a keyboard, a mouse, a button, a microphone, a switch, or a lever. Additionally, the input apparatus 911 also includes an input control circuit or the like that generates an input signal on the basis of input information, and outputs the generated input signal to the CPU 901. The input apparatus 911 may execute the functions of the input unit 201, for example.
The output apparatus 912 includes a display device such as a cathode ray tube (CRT) display device, a liquid crystal display device, or an organic electroluminescence (EL) display device, for example. Additionally, the output apparatus 912 may also include an audio output device such as a speaker or headphones. The output apparatus 912 may execute the functions of the output unit 209, for example.
The storage apparatus 913 is a storage device used for data storage in the information processing apparatus 20. The storage apparatus 913 may include a storage medium, a storage device that stores data in the storage medium, a readout device that reads out data from the storage medium, and a deletion device that deletes data stored in the storage medium.
The drive 914 is a reader/writer for a storage medium, and is internally housed inside, or externally attached to, the information processing apparatus 20. For example, the drive 914 reads out information stored in a removable storage medium such as an inserted magnetic disk, optical disc, magneto-optical disc, or semiconductor memory, and outputs the information to the RAM 903. It is also possible for the drive 914 to write information to a removable storage medium.
The connection port 915 is a connection interface including connection ports for connecting with externally connected equipment, such as a Universal Serial Bus (USB) port, an Ethernet (registered trademark) port, an IEEE 802.11 standard port, and an optical audio terminal.
The communication apparatus 916 is a communication interface including a communication device or the like that connects to the network N, for example. Also, the communication apparatus 916 may be a communication apparatus supporting wired or wireless LAN, and may also be a cable communication apparatus that communicates over a wired cable. The communication apparatus 916 and the connection port 915 may execute the functions of the input unit 201 and the output unit 209, for example.
Note that it is also possible to create a computer program for causing hardware such as a CPU, ROM, and RAM built into the information processing apparatus 20 to exhibit functions similar to each component of the information processing apparatus according to the present embodiment described above. It is also possible to provide a storage medium having such a computer program stored therein.
It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and alterations may occur depending on design requirements and other factors insofar as they are within the scope of the appended claims or the equivalents thereof.
Further, the effects described in this specification are merely illustrative or exemplified effects, and are not limitative. That is, with or in the place of the above effects, the technology according to the present disclosure may achieve other effects that are clear to those skilled in the art from the description of this specification.
Additionally, the present technology may also be configured as below.
(1) An information processing apparatus including:
an information storage unit that stores a result of sensing light from cells, namely first data, and a result of separating the first data into a plurality of fluorescences, namely second data, in association with each other;
a clustering unit that clusters the cells into a plurality of clusters on the basis of the second data; and
an output unit that outputs a clustering result from the clustering unit, in which
the output unit additionally outputs at least one or more of the first data and the second data about the cells included in a cluster selected by a user from among the plurality of clusters.
(2) The information processing apparatus according to (1), in which
a dimensionality of the first data is greater than a dimensionality of the second data.
(3) The information processing apparatus according to (1) or (2), in which
the first data is optical spectrum data of light from the cells.
(4) The information processing apparatus according to any one of (1) to (3), in which
the second data is output as a combination of fluorescences selected from among the plurality of fluorescences.
(5) The information processing apparatus according to any one of (1) to (4), in which
the clustering result is output as an image display.
(6) The information processing apparatus according to any one of (1) to (5), further including:
a sample comparison unit that compares a first sample, for which the first data and the second data are stored in the information storage unit, and a second sample to each other.
(7) The information processing apparatus according to (6), in which
the sample comparison unit clusters cells of the first sample into a plurality of clusters, and maps cells of the second sample onto the plurality of clusters based on the clustering result of the first sample.
(8) The information processing apparatus according to (7), in which
the sample comparison unit compares the clustering result of the first sample to a mapping result of the second sample, and thereby specifies a cluster in which an amount of change between the first sample and the second sample is a threshold value or greater, and
the output unit additionally outputs at least one or more of the first data and the second data about the cells included in the specified cluster.
(9) The information processing apparatus according to (6), in which
the sample comparison unit performs each of a first clustering that maps the second sample onto a plurality of clusters based on a clustering result of the first sample and a second clustering that maps the first sample onto a plurality of clusters based on a clustering result of the second sample.
(10) The information processing apparatus according to (9), in which
in each of the first clustering and the second clustering, the sample comparison unit compares each of the clustering result and the mapping result of the first sample and the second sample, and thereby specifies a cluster in which an amount of change between the first sample and the second sample is a threshold value or greater, and
the output unit additionally outputs at least one or more of the first data and the second data about the cells included in the specified cluster.
(11) The information processing apparatus according to (8) or (10), further including:
a cell inquiry unit that inputs information related to the cells included in the cluster specified by the sample comparison unit into a database for specifying the cells.
(12) The information processing apparatus according to (11), in which
the cell inquiry unit specifies a cell type of the cells included in the cluster on the basis of a result of inquiring the database, and
the output unit additionally outputs at least one or more of the first data and the second data about the cells with the specified cell type.
(13) The information processing apparatus according to (11) or (12), in which
the information related to the cells that is input into the database is generated on the basis of the second data.
(14) The information processing apparatus according to (13), in which
the information related to the cells that is input into the database is information related to an expression level of a marker molecule corresponding to each of the plurality of fluorescences.
(15) An information processing method including:
storing a result of sensing light from cells, namely first data, and a result of separating the first data into a plurality of fluorescences, namely second data, in association with each other;
clustering the cells into a plurality of clusters on the basis of the second data; and
outputting a clustering result from the clustering unit; and
additionally outputting at least one or more of the first data and the second data about the cells included in a cluster selected by a user from among the plurality of clusters.
(16) A program causing a computer to function as:
an information storage unit that stores a result of sensing light from cells, namely first data, and a result of separating the first data into a plurality of fluorescences, namely second data, in association with each other;
a clustering unit that clusters the cells into a plurality of clusters on the basis of the second data; and
an output unit that outputs a clustering result from the clustering unit, in which
the output unit is made to function to additionally output at least one or more of the first data and the second data about the cells included in a cluster selected by a user from among the plurality of clusters.
This Application claims priority under 35 U.S.C. § 119(e) to U.S. Provisional Application Ser. No. 62/782,688, entitled “INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND PROGRAM,” filed on Dec. 20, 2018, which is herein incorporated by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
62782688 | Dec 2018 | US |