The present specification discloses a cell analysis method, a training method for a deep learning algorithm, a cell analyzer, a training apparatus for a deep learning algorithm, a cell analysis program, and a training program for a deep learning algorithm.
Japanese Laid-Open Patent Publication No. S63-180836 discloses a cell analyzer that analyzes the type of a blood cell or the like contained in peripheral blood. In such a cell analyzer, for example, light is applied to each cell in peripheral blood flowing in a flow cell, and signal strengths of scattered light and fluorescence obtained from the cell to which light has been applied are obtained. Peak values of the signal strengths obtained from a plurality of cells are each extracted and plotted on a scattergram. Cluster analysis is performed on the plurality of cells on the scattergram, to identify the type of cells belonging to each cluster.
International Publication WO2018/203568 describes a method for classifying the type of each cell, using an imaging flow cytometer.
The scope of the present invention is defined solely by the appended claims, and is not affected to any degree by the statements within this summary.
In a case where the type of a cell is to be identified on the basis of a scattergram, when, for example, a cell that usually does not appear in peripheral blood of a healthy individual, such as a blast or a lymphoma cell, is present in a specimen, there are cases where the cell is classified as a normal cell in cluster analysis.
Since the cluster analysis is a statistical analysis technique, when the number of cells plotted on the scattergram is small, the cluster analysis becomes difficult in some cases.
Further, in the method described in International Publication WO2018/203568, in order to perform more accurate determination of the type of each cell, a method of capturing an image of each cell that flows in a flow cell and applying structure illumination is adopted. Therefore, International Publication WO2018/203568 has a problem that a detection system conventionally used for obtaining a scattergram cannot be used.
An object of an embodiment of the present invention is to further improve the accuracy of determination also of different types of cells that appear in the same cluster. Another object of an embodiment of the present invention is to provide a cell type determination method applicable to a measurement apparatus that has conventionally performed measurement on a scattergram.
With reference to
In the cell analysis method, preferably, from the individual cells passing through a predetermined position in the flow path, the signal strength is obtained, for each of the cells, at a plurality of time points in a time period while the cell is passing through the predetermined position, and each obtained signal strength is stored in association with information regarding a corresponding time point at which the signal strength has been obtained. According to this embodiment, the types of cells that cannot be determined by a conventional cell analyzer can be determined. Since information regarding the time points at each of which the signal strength has been obtained is obtained, when a plurality of signals have been received from a single cell, data can be synchronized.
In the cell analysis method, preferably, the obtaining of the signal strength at the plurality of time points is started at a time point at which the signal strength of each of the individual cells has reached a predetermined value, and ends after a predetermined time period after the start of the obtaining of the signal strength. According to this embodiment, more accurate determination can be performed. In addition, the volume of data to be obtained can be reduced.
In the cell analysis method, preferably, the signal is a light signal or an electric signal.
More preferably, the light signal is a signal obtained by light being applied to each of the individual cells passing through the flow cell. The predetermined position is a position where the light is applied to each cell in the flow cell (4113, 551). Further preferably, the light is laser light, and the light signal is at least one type selected from a scattered light signal and a fluorescence signal. Still more preferably, the light signal is a side scattered light signal, a forward scattered light signal, and a fluorescence signal. According to this embodiment, the determination accuracy of the types of cells in the flow cytometer can be improved.
In the cell analysis method, the numerical data corresponding to the signal strength inputted to the deep learning algorithm (60) includes information obtained by combining signal strengths of the side scattered light signal, the forward scattered light signal, and the fluorescence signal that have been obtained for each cell at the same time point. According to this embodiment, the determination accuracy by the deep learning algorithm can be further improved.
In the analysis method, when the signal is an electric signal, a measurement part includes a sheath flow electric resistance-type detector. According to this embodiment, the types of cells can be determined on the basis of data measured by a sheath flow electric resistance method.
In the cell analysis method, the deep learning algorithm (60) calculates, for each cell, a probability that the cell for which the signal strength has been obtained belongs to each of a plurality of types of cells associated with an output layer (60b) of the deep learning algorithm (60). Preferably, the deep learning algorithm (60) outputs a label value 82 of a type of a cell that has a highest probability that the cell for which the signal strength has been obtained belongs thereto. According to this embodiment, the determination result can be presented to a user.
In the cell analysis method, on the basis of the label value of the type of the cell that has the highest probability that the cell for which the signal strength has been obtained belongs thereto, the number of cells that belong to each of the plurality of types of cells is counted, and a result of the counting is outputted; or on the basis of the label value of the type of the cell that has the highest probability that the cell for which the signal strength has been obtained belongs thereto, a proportion of cells that belong to each of the plurality of types of cells is calculated, and a result of the calculation is outputted. According to this embodiment, the proportions of the type of cells contained in the biological sample can be obtained.
In the cell analysis method, preferably, the biological sample is a blood sample. More preferably, the type of a cell includes at least one type selected from a group consisting of neutrophil, lymphocyte, monocyte, eosinophil, and basophil. Further preferably, the type of a cell includes at least one type selected from the group consisting of (a) and (b) below. Here, (a) is immature granulocyte; and (b) is at least one type of abnormal cell selected from the group consisting of tumor cell, lymphoblast, plasma cell, atypical lymphocyte, nucleated erythrocyte selected from proerythroblast, basophilic erythroblast, polychromatic erythroblast, orthochromatic erythroblast, promegaloblast, basophilic megaloblast, polychromatic megaloblast, and orthochromatic megaloblast, and megakaryocyte. According to this embodiment, the types of immature granulocytes and abnormal cells contained in a blood sample can be determined.
In the cell analysis method, in a case where the biological sample is a blood sample and the type of cell includes abnormal cell, when there is a cell that has been determined to be an abnormal cell by the deep learning algorithm (60), a processing part (20) may output information indicating that an abnormal cell is contained in the biological sample.
In the cell analysis method, the biological sample may be urine. According to this embodiment, determination can be performed also for cells contained in urine.
A certain embodiment of the present embodiment relates to an analysis method for cells contained in a biological sample. In the cell analysis method, the cells are caused to flow in a flow path; from the individual cells passing through a predetermined position in the flow path, a signal strength regarding each of scattered light and fluorescence is obtained, for each of the cells, at a plurality of time points in a time period while the cell is passing through the predetermined position; and on the basis of a result of recognizing, as a pattern, the obtained signal strengths at the plurality of time points regarding each of the individual cells, a type of the cell is determined for each cell. According to the present embodiment, the types of cells that cannot be determined by a conventional cell analyzer can be determined.
A certain embodiment of the present embodiment relates to a method for training a deep learning algorithm (50) having a neural network structure for analyzing cells in a biological sample. The cells contained in the biological sample are caused to flow in a cell detection flow path in a measurement part capable of detecting cells individually; numerical data corresponding to a signal strength obtained for each of the individual cells passing through the flow path is inputted as first training data to an input layer of the deep learning algorithm; and information of a type of a cell that corresponds to the cell for which the signal strength has been obtained is inputted as second training data to the deep learning algorithm. According to the present embodiment, it is possible to generate a deep learning algorithm for determining the types of individual cells that cannot be determined by a conventional cell analyzer.
A certain embodiment of the present embodiment relates to a cell analyzer (4000, 4000′) configured to determine a type of each cell, by using a deep learning algorithm (60) having a neural network structure. The cell analyzer (4000, 4000′) includes a processing part (20). The processing part (20) is configured to: obtain, when cells contained in a biological sample and caused to pass through a cell detection flow path in a measurement part capable of detecting cells individually, a signal strength regarding each of the individual cells; input, to the deep learning algorithm (60), numerical data corresponding to the obtained signal strength regarding each of the individual cells; and on the basis of a result outputted from the deep learning algorithm, determine, for each cell, a type of the cell for which the signal strength has been obtained. According to the present embodiment, the types of cells that cannot be determined by a conventional cell analyzer can be determined.
Further, the cell analyzer (4000, 4000′) includes a measurement part (400) capable of detecting cells individually and configured to obtain, when the cells contained in the biological sample and caused to flow in the cell detection flow path of the measurement part pass through the flow path, a signal strength regarding each of the individual cells. According to the present embodiment, due to the cell analyzer including the measurement part, the types of cells that cannot be determined by a conventional cell analyzer can be determined.
A certain embodiment of the present embodiment relates to a training apparatus (100) for training a deep learning algorithm (50) having a neural network structure for analyzing cells in a biological sample. The training apparatus includes a processing part (10). The processing part (10) is configured to: cause the cells contained in the biological sample to flow in a cell detection flow path in a measurement part capable of detecting cells individually, and input, as first training data to an input layer of the deep learning algorithm, numerical data corresponding to a signal strength obtained for each of the individual cells passing through the flow path; and input, as second training data to the deep learning algorithm, information of a type of a cell that corresponds to the cell for which the signal strength has been obtained. According to the present embodiment, it is possible to generate a deep learning algorithm for determining the types of cells that cannot be determined by a conventional cell analyzer.
A certain embodiment of the present embodiment relates to a computer-readable storage medium having stored therein a computer program for analyzing cells contained in a biological sample, by using a deep learning algorithm (60) having a neural network structure. The computer program is configured to cause a processing part (20) to execute a process including: causing the cells contained in the biological sample to flow in a cell detection flow path in a measurement part capable of detecting cells individually, and obtaining a signal strength regarding each of the individual cells passing through the flow path; inputting, to the deep learning algorithm, numerical data corresponding to the obtained signal strength regarding each of the individual cells; and on the basis of a result outputted from the deep learning algorithm, determining, for each cell, a type of the cell for which the signal strength has been obtained. According to the present embodiment, due to the cell analyzer including the measurement part, the types of cells that cannot be determined by a conventional cell analyzer can be determined.
A certain embodiment of the present embodiment relates to a computer-readable storage medium having stored therein a computer program for training a deep learning algorithm (50) having a neural network structure for analyzing cells in a biological sample. The computer program is configured to cause a processing part (10) to execute a process including: causing the cells contained in the biological sample to flow in a cell detection flow path in a measurement part capable of detecting cells individually, and inputting, as first training data to an input layer of the deep learning algorithm, numerical data corresponding to a signal strength obtained for each of the individual cells passing through the flow path; and inputting, as second training data to the deep learning algorithm, information of a type of a cell that corresponds to the cell for which the signal strength has been obtained. According to the present embodiment, it is possible to generate a deep learning algorithm for determining the types of cells that cannot be determined by a conventional cell analyzer.
The types of cells that cannot be determined by a conventional cell analysis method can be determined. Therefore, the determination accuracy for cells can be improved.
Hereinafter, the outline and embodiments of the present invention will be described in detail with reference to the attached drawings. In the description below and the drawings, the same reference characters represent the same or similar components. Thus, description of the same or similar components is not repeated.
The present embodiment relates to a cell analysis method for analyzing cells contained in a biological sample. In the analysis method, numerical data corresponding to a signal strength regarding each of individual cells is inputted to a deep learning algorithm that has a neural network structure. Then, on the basis of the result outputted from the deep learning algorithm, the type of the cell for which the signal strength has been obtained is determined for each cell.
With reference to
The present embodiment is focused on data indicating the signal strength that is derived from each of individual cells and that is obtained when creating a scattergram. In (d) of
In the present embodiment, a deep learning algorithm 50, 60 shown in (f) of
An example of a biological sample is a biological sample collected from a subject. Examples of the biological sample can include blood such as peripheral blood, venous blood, or arterial blood, urine, and a body fluid other than blood and urine. Examples of the body fluid other than blood and urine can include bone marrow, ascites, pleural effusion, spinal fluid, and the like. Hereinafter, the body fluid other than blood and urine may be simply referred to as a “body fluid”. The blood sample may be any blood sample that is in a state where the number of cells can be counted and the types of cells can be determined. Preferably, blood is peripheral blood. Examples of blood include peripheral blood collected using an anticoagulant agent such as ethylenediamine tetraacetate (sodium salt or potassium salt), heparin sodium, or the like. Peripheral blood may be collected from an artery or may be collected from a vein.
The types of cells to be determined in the present embodiment are those according to the types of cells based on morphological classification, and are different depending on the kind of the biological sample. When the biological sample is blood and the blood is collected from a healthy individual, the types of cells to be determined in the present embodiment include red blood cell, nucleated cell such as white blood cell, platelet, and the like. Nucleated cells include neutrophils, lymphocytes, monocytes, eosinophils, and basophils. Neutrophils include segmented neutrophils and band neutrophils. Meanwhile, when blood is collected from an unhealthy individual, nucleated cells may include at least one type selected from the group consisting of immature granulocyte and abnormal cell. Such cells are also included in the types of cells to be determined in the present embodiment. Immature granulocytes can include cells such as metamyelocytes, bone marrow cells, promyelocytes, and myeloblasts.
The nucleated cells may include abnormal cells that are not contained in peripheral blood of a healthy individual, in addition to normal cells. Examples of abnormal cells are cells that appear when a person has a certain disease, and such abnormal cells are tumor cells, for example. In a case of the hematopoietic system, the certain disease can be a disease selected from the group consisting of: myelodysplastic syndrome; leukemia such as acute myeloblastic leukemia, acute promyelocytic leukemia, acute myelomonocytic leukemia, acute monocytic leukemia, erythroleukemia, acute megakaryoblastic leukemia, acute myeloid leukemia, acute lymphoblastic leukemia, lymphoblastic leukemia, chronic myelogenous leukemia, or chronic lymphocytic leukemia; malignant lymphoma such as Hodgkin's lymphoma or non-Hodgkin's lymphoma; and multiple myeloma.
Further, abnormal cells can include cells that are not usually observed in peripheral blood of a healthy individual, such as: lymphoblasts; plasma cells; atypical lymphocytes; reactive lymphocytes; erythroblasts, which are nucleated erythrocytes, such as proerythroblasts, basophilic erythroblasts, polychromatic erythroblasts, orthochromatic erythroblasts, promegaloblasts, basophilic megaloblasts, polychromatic megaloblasts, and orthochromatic megaloblasts; megakaryocytes including micromegakaryocytes; and the like.
When the biological sample is urine, the types of cells to be determined in the present embodiment can include red blood cells, white blood cells, epithelial cells such as those of transitional epithelium, squamous epithelium, and the like. Examples of abnormal cells include bacteria, fungi such as filamentous fungi and yeast, tumor cells, and the like.
When the biological sample is a body fluid that usually does not contain blood components, such as ascites, pleural effusion, or spinal fluid, the types of cells can include red blood cell, white blood cell, and large cell. The “large cell” here means a cell that is separated from an inner membrane of a body cavity or a peritoneum of a viscus, and that is larger than white blood cells. Specifically, mesothelial cells, histiocytes, tumor cells, and the like correspond to the “large cell”.
When the biological sample is bone marrow, the types of cells to be determined in the present embodiment can include, as normal cells, mature blood cells and immature hematopoietic cells. Mature blood cells include red blood cells, nucleated cells such as white blood cells, platelets, and the like. Nucleated cells such as white blood cells include neutrophils, lymphocytes, plasma cells, monocytes, eosinophils, and basophils. Neutrophils include segmented neutrophils and band neutrophils. Immature hematopoietic cells include hematopoietic stem cells, immature granulocytic cells, immature lymphoid cells, immature monocytic cells, immature erythroid cells, megakaryocytic cells, mesenchymal cells, and the like. Immature granulocytes can include cells such as metamyelocytes, bone marrow cells, promyelocytes, and myeloblasts. Immature lymphoid cells include lymphoblasts and the like. Immature monocytic cells include monoblasts and the like. Immature erythroid cells include nucleated erythrocytes such as proerythroblasts, basophilic erythroblasts, polychromatic erythroblasts, orthochromatic erythroblasts, promegaloblasts, basophilic megaloblasts, polychromatic megaloblasts, and orthochromatic megaloblasts. Megakaryocytic cells include megakaryoblasts, and the like.
Examples of abnormal cells that can be included in bone marrow include hematopoietic tumor cells of a disease selected from the group consisting of: myelodysplastic syndrome; leukemia such as acute myeloblastic leukemia, acute promyelocytic leukemia, acute myelomonocytic leukemia, acute monocytic leukemia, erythroleukemia, acute megakaryoblastic leukemia, acute myeloid leukemia, acute lymphoblastic leukemia, lymphoblastic leukemia, chronic myelogenous leukemia, or chronic lymphocytic leukemia; malignant lymphoma such as Hodgkin's lymphoma or non-Hodgkin's lymphoma; and multiple myeloma, which have been described above, and metastasized tumor cells of a malignant tumor developed in an organ other than bone marrow.
In the cell analysis method of the present embodiment, the determination method of the type of cell is not limited to a method that uses a deep learning algorithm. From individual cells passing through a predetermined position in a flow path, a signal strength is obtained, for each of the cells, at a plurality of time points in a time period while the cell is passing through the predetermined position, and on the basis of a result obtained by recognizing, as a pattern, the obtained signal strengths at the plurality of time points regarding the individual cells, the types of cells may be determined. The pattern may be recognized as a numerical pattern of signal strengths at a plurality of time points, or may be recognized as a shape pattern obtained when signal strengths at a plurality of time points are plotted on a graph. When the pattern is recognized as a numerical pattern, if a numerical pattern of an analysis target cell and a numerical pattern for which the type of cell is already known are compared with each other, the type of cell can be determined. For the comparison between the numerical pattern of an analysis target cell and a control numerical pattern, Spearman rank correlation, z-score, or the like can be used, for example. When the pattern of the graph shape of an analysis target cell and the pattern of a graph shape for which the type of cell is already known are compared with each other, the type of cell can be determined. For the comparison between the pattern of the graph shape of an analysis target cell and the pattern of the graph shape for which the type of cell is already known, geometric shape pattern matching may be used, or a feature descriptor represented by SIFT Descriptor may be used, for example.
Next, with reference to the examples shown in
The example shown in
In the example shown in
When the respective pieces of the training waveform data 70a, 70b, 70c in
For the sequence data 76a, 76b, 76c, the obtained signal strength values may be directly used, but processing such as noise removal, baseline correction, and normalization may be performed as necessary. In the present specification, “numerical data corresponding to a signal strength” can include an obtained signal strength value itself, and a value that has been subjected to noise removal, baseline correction, normalization, and the like as necessary.
With reference to
Preferably, at least the obtain merit condition and the condition for generating, from each piece of waveform data or the like, data to be inputted to the neural network are the same between generation of the analysis data 85 and generation of the training data 75. With respect to the sequence data 82a, 82b, 82c, for each analysis target cell, the time points of obtainment of the signal strengths are synchronized, and sequence data 86a (forward scattered light), sequence data 86b (side scattered light), and sequence data 86c (side fluorescence) are obtained. The sequence data 86a, 86b, 86c are combined such that three signal strengths (a signal strength of forward scattered light, a signal strength of side scattered light, and a signal strength of side fluorescence) at the same time point form one set, and is inputted as the analysis data 85 to the deep learning algorithm 60.
When the analysis data 85 has been inputted to an input layer 60a of the neural network 60 serving as a trained deep learning algorithm 60, a probability that the analysis target cell from which the analysis data 85 has been obtained belongs to each of types of cells inputted as training data is outputted from an output layer 60b. The reference character 60c in
Waveform data according to the present embodiment can be obtained in a first cell analyzer 4000 or a second cell analyzer 4000′.
With reference to
The detector 410 includes: a nucleated cell detector 411 which detects nucleated cells such as white blood cells at least; a red blood cell/platelet detector 412 which measures the number of red blood cells and the number of platelets; and a hemoglobin detector 413 which measures the amount of hemoglobin in blood as necessary. The nucleated cell detector 411 is implemented as an optical detector, and more specifically, includes a component for performing detection by flow cytometry.
As shown in
The digital value calculation part 483 is connected to the interface part 489 via an interface part 484 and a bus 485. The interface part 489 is connected to the display/operation part 450 via the bus 485 and the interface part 486, and is connected to the detector 410, the apparatus mechanism part 430, and a sample preparation part 440 via the bus 485 and the interface part 488.
The A/D converter 482 converts a reception light signal, which is an analogue signal outputted from the analogue processing part 420, into a digital signal, and outputs the digital signal to the digital value calculation part 483. The digital value calculation part 483 performs predetermined arithmetic processing on the digital signal outputted from the A/D converter 482. Examples of the predetermined arithmetic processing include, but not limited to: a process in which, during a time period from the start, upon forward scattered light reaching a predetermined threshold, of obtainment of the signal strength of forward scattered light, the signal strength of side scattered light, and the signal strength of side fluorescence, until the end of the obtainment after a predetermined time period, each piece of waveform data is obtained for a single training target cell at a plurality of time points at a certain interval; a process of extracting a peak value of the waveform data; and the like. Then, the digital value calculation part 483 outputs the calculation result (measurement result) to the processing unit 300 via the interface part 484, the bus 485, and the interface part 489.
The processing unit 300 is connected to the digital value calculation part 483 via the interface part 484, the bus 485, and the interface part 489, and the processing unit 300 can receive the calculation result outputted from the digital value calculation part 483. In addition, the processing unit 300 performs control of the apparatus mechanism part 430 including a sampler (not shown) that automatically supplies sample containers, a fluid system for preparation/measurement of a sample, and the like, and performs other controls.
The nucleated cell detector 411 causes a measurement sample containing cells to flow in a cell detection flow path, applies light to each cell flowing in the cell detection flow path, and measures scattered light and fluorescence generated from the cell. The red blood cell/platelet detector 412 causes a measurement sample containing cells to flow in a cell detection flow path, measures electric resistance of each cell flowing in the cell detection flow path, and detects the volume of the cell.
In the present embodiment, the measurement unit 400 preferably includes a flow cytometer and/or a sheath flow electric resistance-type detector. In
Flow Cytometer
As shown in
In the present embodiment, scattered light may be any scattered light that can be measured by a flow cytometer that is distributed in general. Examples of scattered light include forward scattered light (e.g., light reception angle: about 0 to 20 degrees), and side scattered light (light reception angle: about 90 degrees). It is known that side scattered light reflects internal information of a cell, such as a nucleus or granules of the cell, and forward scattered light reflects information of the size of the cell. In the present embodiment, forward scattered light intensity and side scattered light intensity are preferably measured as scattered light intensity.
Fluorescence is light that is emitted from a fluorescent dye bound to a nucleic acid or the like in a cell when excitation light having an appropriate wavelength is applied to the fluorescent dye. The excitation light wavelength and the reception light wavelength depend on the kind of the fluorescent dye that is used.
In the present embodiment, the light source 4111 of the flow cytometer is not limited in particular, and a light source 4111 that has a wavelength suitable for excitation of the fluorescent dye is selected. As such a light source 4111, a semiconductor laser including a red semiconductor laser and/or a blue semiconductor laser, a gas laser such as an argon laser or a helium-neon laser, a mercury arc lamp, or the like is used, for example. In particular, a semiconductor laser is suitable because the semiconductor laser is very inexpensive when compared with a gas laser.
As shown in
Reception light signals outputted from the respective light receiving elements 4116, 4121, and 4122 are subjected to analogue processing such as amplification/waveform processing by the analogue processing part 420 shown in
With reference back to
In
The obtained measurement sample is sent to the flow cell 4113 in the nucleated cell detector 411, together with a sheath liquid (e.g., CELLPACK (II) manufactured by Sysmex Corporation), to be measured by flow cytometry in the nucleated cell detector 411.
Sheath Flow-Type Electric Resistance Detector
As shown in
As a configuration example of the second cell analyzer 4000′, an example of a block diagram when the measurement unit 500 is a flow cytometer for measuring a urine sample or a body fluid sample is shown.
In the reaction chamber 512u, the distributed biological sample is mixed with a first reagent 519u as a diluent and a third reagent 518u that contains a dye. Due to the dye contained in the third reagent 518u, solid components in the specimen are stained. When the biological sample is urine, the sample prepared in the reaction chamber 512u is used as a first measurement sample for analyzing solid components in urine that are relatively large, such as red blood cells, white blood cells, epithelial cells, or tumor cells. When the biological sample is a body fluid, the sample prepared in the reaction chamber 512u is used as a third measurement sample for analyzing red blood cells in the body fluid.
Meanwhile, in the reaction chamber 512b, the distributed biological sample is mixed with a second reagent 519b as a diluent and a fourth reagent 518b that contains a dye. As described later, the second reagent 519b has a hemolytic action. Due to the dye contained in the fourth reagent 518b, solid components in the specimen are stained. When the biological sample is urine, the sample prepared in the reaction chamber 512b serves as a second measurement sample for analyzing bacteria in the urine. When the biological sample is a body fluid, the sample prepared in the reaction chamber 512b serves as a fourth measurement sample for analyzing nucleated cells (white blood cells and large cells) and bacteria in the body fluid.
A tube extends from the reaction chamber 512u to the flow cell 551 of the optical detector 505, whereby the measurement sample prepared in the reaction chamber 512u can be supplied to the flow cell 551. A solenoid valve 521u is provided at the outlet of the reaction chamber 512u. A tube extends also from the reaction chamber 512b, and this tube is connected to a portion of the tube extending from the reaction chamber 512u. Accordingly, the measurement sample prepared in the reaction chamber 512b can be supplied to the flow cell 551. A solenoid valve 521b is provided at the outlet of the reaction chamber 512b.
The tube extending from the reaction chamber 512u, 512b to the flow cell 551 is branched before the flow cell 551, and a branched tube is connected to a syringe pump 520a. A solenoid valve 521c is provided between the syringe pump 520a and the branched point.
Between the connection point of the tubes extending from the respective reaction chambers 512u, 512b and the branched point, the tube is further branched. A branched tube is connected to a syringe pump 520b. Between the branched point of the tube extending to the syringe pump 520b and the connection point, a solenoid valve 521d is provided.
The sample preparation part 502 has connected thereto a sheath liquid storing part 522 which stores a sheath liquid, and the sheath liquid storing part 522 is connected to the flow cell 551 by a tube. The sheath liquid storing part 522 has connected thereto a compressor 522a, and when the compressor 522a is driven, compressed air is supplied to the sheath liquid storing part 522, and the sheath liquid is supplied from the sheath liquid storing part 522 to the flow cell 551.
As for the two kinds of suspensions (measurement samples) prepared in the respective reaction chambers 512u, 512b, the suspension (the first measurement sample when the biological sample is urine, and the third measurement sample when the biological sample is a body fluid) of the reaction chamber 512u is first led to the optical detector 505, to form a thin flow enveloped by the sheath liquid in the flow cell 551, and laser light is applied to the thin flow. Then, in a similar manner, the suspension (the second measurement sample when the biological sample is urine, and the fourth measurement sample when the biological sample is a body fluid) of the reaction chamber 512b is led to the optical detector 505, to form a thin flow in the flow cell 551, and laser light is applied to the thin flow. Such operations are automatically performed by causing the solenoid valves 521u, 521b, 521c, 521d, a drive part 503, and the like to operate by control of the microcomputer 511 (controller) described later.
The first reagent to the fourth reagent are described in detail. The first reagent 519u is a reagent having a buffer as a main component, contains an osmotic pressure compensation agent so as to allow obtainment of a stable fluorescence signal without hemolyzing red blood cells, and is adjusted to have 100 to 600 mOsm/kg so as to realize an osmotic pressure suitable for classification measurement. Preferably, the first reagent 519u does not have a hemolytic action on red blood cells in urine.
Different from the first reagent 519u, the second reagent 519b has a hemolytic action. This is for facilitating passage of the later-described fourth reagent 518b through cell membranes of bacteria so as to promote staining. Further, this is also for contracting contaminants such as mucus fibers and red blood cell fragments. The second reagent 519b contains a surfactant in order to acquire a hemolytic action. As the surfactant, a variety of anionic, nonionic, and cationic surfactants can be used, but a cationic surfactant is particularly suitable. Since the surfactant can damage the cell membranes of bacteria, nucleic acids of bacteria can be efficiently stained by the dye contained in the fourth reagent 518b. As a result, bacteria measurement can be performed through a short-time staining process.
As still another embodiment, the second reagent 519b may acquire a hemolytic action not by a surfactant but by being adjusted to be acidic or to have a low pH. The second reagent 519b having a low pH means that the second reagent 519b has a lower pH than the first reagent 519u. When the first reagent 519u is neutral or weakly acidic to weakly alkaline, the second reagent 519b is acidic or strongly acidic. When the pH of the first reagent 519u is 6.0 to 8.0, the pH of the second reagent 519b is lower than that, and is preferably 2.0 to 6.0.
The second reagent 519b may contain a surfactant and be adjusted to have a low pH.
As still another embodiment, the second reagent 519b may acquire a hemolytic action by having a lower osmotic pressure than the first reagent 519u.
Meanwhile, the first reagent 519u does not contain any surfactant. In another embodiment, the first reagent 519u may contain a surfactant, but the kind and concentration thereof need to be adjusted so as not to hemolyze red blood cells. Therefore, preferably, the first reagent 519u does not contain the same surfactant as that of the second reagent 519b, or even if the first reagent 519u contains the same surfactant as that of the second reagent 519b, the concentration of the surfactant in the first reagent 519u is lower than that in the second reagent 519b.
The third reagent 518u is a staining reagent to be used in measurement of solid components in urine (red blood cells, white blood cells, epithelial cells, casts, or the like). As the dye contained in the third reagent 518u, a dye that stains membranes is selected, in order to also stain solid components that do not have nucleic acids. Preferably, the third reagent 518u contains an osmotic pressure compensation agent for the purpose of preventing hemolysis and for the purpose of obtaining a stable fluorescence intensity, and is adjusted to have 100 to 600 mOsm/kg so as to realize an osmotic pressure suitable for classification measurement. The cell membrane and nucleus (membrane) of solid components in urine are stained by the third reagent 518u. As the staining reagent containing a dye that stains membranes, a condensed benzene derivative is used, and a cyanine-based dye can be used, for example. The third reagent 518u stains not only cell membranes but also nuclear membranes. When the third reagent 518u is used in nucleated cells such as white blood cells and epithelial cells, the staining intensity in the cytoplasm (cell membrane) and the staining intensity in the nucleus (nuclear membrane) are combined, whereby the staining intensity becomes higher than in the solid components in urine that do not have nucleic acids. Accordingly, nucleated cells such as white blood cells and epithelial cells can be discriminated from solid components in urine that do not have nucleic acids such as red blood cells. As the third reagent, the reagents described in U.S. Pat. No. 5,891,733 can be used. U.S. Pat. No. 5,891,733 is incorporated herein by reference. The third reagent 518u is mixed with urine or a body fluid, together with the first reagent 519u.
The fourth reagent 518b is a staining reagent that can accurately measure bacteria even when the specimen contains contaminants having sizes equivalent to those of bacteria and fungi. The fourth reagent 518b is described in detail in EP Patent Application Publication No. 1136563. As the dye contained in the fourth reagent 518b, a dye that stains nucleic acids is suitably used. As the staining reagent containing a dye that stains nuclei, the cyanine-based dyes of U.S. Pat. No. 7,309,581 can be used, for example. The fourth reagent 518b is mixed with urine or a specimen, together with the second reagent 519b. EP Patent Application Publication No. 1136563 and U.S. Pat. No. 7,309,581 are incorporated herein by reference.
Therefore, preferably, the third reagent 518u contains a dye that stains cell membranes, whereas the fourth reagent 518b contains a dye that stains nucleic acids. Solid components in urine may include those that do not have a nucleus, such as red blood cells. Therefore, by the third reagent 518u containing a dye that stains cell membranes, solid components in urine including those that do not have a nucleus can be detected. In addition, the second reagent can damage cell membranes of bacteria, and nucleic acids of bacteria and fungi can be efficiently stained by the dye contained in the fourth reagent 518b. As a result, bacteria measurement can be performed through a short-time staining process.
A third embodiment in the present embodiment relates to a waveform data analysis system.
With reference to
The deep learning apparatus 100A is implemented as a general-purpose computer, for example, and performs a deep learning process on the basis of a flow chart described later. The analyzer 200A is implemented as a general-purpose computer, for example, and performs a waveform data analysis process on the basis of a flow chart described later. The storage medium 98 is a computer-readable non-transitory tangible storage medium such as a DVD-ROM or a USB memory, for example.
The deep learning apparatus 100A is connected to a measurement unit 400a or a measurement unit 500a. The configuration of the measurement unit 400a or the measurement unit 500a is the same as that of the measurement unit 400 or the measurement unit 500 described above. The deep learning apparatus 100A obtains training waveform data 70 obtained by the measurement unit 400a or the measurement unit 500a. The generation method of the training waveform data 70 is as described above. The analyzer 200A is also connected to the measurement unit 400b or the measurement unit 500b. The configuration of the measurement unit 400b or the measurement unit 500b is the same as that of the measurement unit 400 or the measurement unit 500 described above.
As shown in
The processing part 10 includes: a CPU (Central Processing Unit) 11 which performs data processing described later; a memory 12 to be used as a work area for data processing; a storage 13 which stores a program and processing data described later; a bus 14 which transmits data between parts; an interface part 15 which inputs/outputs data with respect to an external apparatus; and a GPU (Graphics Processing Unit) 19. The input part 16 and the output part 17 are connected to the processing part 10 via the interface part 15. For example, the input part 16 is an input device such as a keyboard or a mouse, and the output part 17 is a display device such as a liquid crystal display. The GPU 19 functions as an accelerator that assists arithmetic processing (e.g., parallel arithmetic processing) performed by the CPU 11. That is, the processing performed by the CPU 11 described below also includes processing performed by the CPU 11 using the GPU 19 as an accelerator. Here, instead of the GPU 19, a chip that is suitable for calculation in a neural network may be installed. Examples of such a chip include FPGA (Field-Programmable Gate Array), ASIC (Application specific integrated circuit), and Myriad X (Intel).
In order to perform the process of each step described below with reference to
In the description below, unless otherwise specified, the processes performed by the processing part 10 mean processes performed by the CPU 11 on the basis of the program stored in the storage 13 or the memory 12, and the neural network 50. The CPU 11 temporarily stores necessary data (such as intermediate data being processed) using the memory 12 as a work area, and stores, as appropriate in the storage 13, data to be saved for a long time such as calculation results.
With reference to
The processing part 20 includes: a CPU (Central Processing Unit) 21 which performs data processing described later; a memory 22 to be used as a work area for data processing; the storage 23 which stores a program and processing data described later; a bus 24 which transmits data between parts; an interface part 25 which inputs/outputs data with respect to an external apparatus; and a GPU (Graphics Processing Unit) 29. The input part 26 and the output part 27 are connected to the processing part 20 via the interface part 25. For example, the input part 26 is an input device such as a keyboard or a mouse, and the output part 27 is a display device such as a liquid crystal display. The GPU 29 functions as an accelerator that assists arithmetic processing (e.g., parallel arithmetic processing) performed by the CPU 21. That is, the processing performed by the CPU 21 described below also includes processing performed by the CPU 21 using the GPU 29 as an accelerator.
In order to perform the process of each step described in the waveform data analysis process below, the processing part 20 has previously stored, in the storage 23, a program and the deep learning algorithm 60 having a trained neural network structure according to the present invention, in an executable form, for example. The executable form is a form generated through conversion of a programming language by a compiler, for example. The processing part 20 uses the program and the deep learning algorithm 60 stored in the storage 23 to perform processes.
In the description below, unless otherwise specified, the processes performed by the processing part 20 mean, in actuality, processes performed by the CPU 21 of the processing part 20 on the basis of the program and the deep learning algorithm 60 stored in the storage 23 or the memory 22. The CPU 21 temporarily stores data (such as intermediate data being processed) using the memory 22 as a work area, and stores, as appropriate in the storage 23, data to be saved for a long time such as calculation results.
With reference to
The training waveform data 70a, 70b, 70c is obtained in advance by the measurement unit 400, 500, and is stored in advance in the storage 13 or the memory 12 of the processing part 10A. The deep learning algorithm 50 is stored in advance in the algorithm database 105 in association with the kind of cell to which each analysis target cell belongs, for example.
The processing part 10A of the deep learning apparatus 100A performs the process shown in
With reference to
First, the processing part 10A obtains the training waveform data 70a, 70b, 70c. The training waveform data 70a is waveform data of forward scattered light, the training waveform data 70b is waveform data of side scattered light, and the training waveform data 70c is waveform data of side fluorescence. The training waveform data 70a, 70b, 70c is obtained via the I/F part 15 in accordance with an operation by an operator, from the measurement unit 400, 500, from the storage medium 98, or via a network. When the training waveform data 70a, 70b, 70c is obtained, information regarding which kind of cell the training waveform data 70a, 70b, 70c indicates is also obtained. The information regarding which kind of cell is indicated may be associated with the training waveform data 70a, 70b, 70c, or may be inputted by the operator through the input part 16.
In step S11, the processing part 10A provides: information that indicates which kind of cell is indicated and that is associated with the training waveform data 70a, 70b, 70c; label values associated with the kinds of cells stored in the memory 12 or the storage 13; and a label value 77 that corresponds to the sequence data 76a, 76b, 76c obtained by synchronizing the sequence data 72a, 72b, 72c in terms of the time of obtainment of the waveform data of forward scattered light, side scattered light, and side fluorescence. Accordingly, the processing part 10A generates training data 75.
In step S12 shown in
In the cell type analysis method according to the present embodiment, a convolution neural network is used, and a stochastic gradient descent method is used. Therefore, in step S13, the processing part 10A determines whether or not training results of a previously-set predetermined number of trials have been accumulated. When the training results of the predetermined number of trials have been accumulated (YES), the processing part 10A advances to the process of step S14, and when the training results of the predetermined number of trials have not been accumulated (NO), the processing part 10A advances to the process of step S15.
Next, when the training results of the predetermined number of trials have been accumulated, the processing part 10A updates, in step S14, connection weights w of the neural network 50, by using the training results accumulated in step S12. In the cell type analysis method according to the present embodiment, since the stochastic gradient descent method is used, the connection weights w of the neural network 50 are updated at the stage where the learning results of the predetermined number of trials have been accumulated. Specifically, the process of updating the connection weights w is a process of performing calculation according to the gradient descent method, expressed by Formula 11 and Formula 12 described later.
In step S15, the processing part 10A determines whether or not the neural network 50 has been trained using a prescribed number of pieces of training data 75. When the training has been performed using the prescribed number of pieces of training data 75 (YES), the deep learning process ends.
When the neural network 50 has not been trained using the prescribed number of pieces of training data 75 (NO), the processing part 10A advances from step S15 to step S16, and performs the processes from step S11 to step S15 with respect to the next training waveform data 70.
In accordance with the processes described above, the neural network 50 is trained, whereby a deep learning algorithm 60 is obtained.
As described above, a convolution neural network is used in the present embodiment.
In the neural network 50, a plurality of nodes 89 arranged in a layered manner are connected between the layers. Accordingly, information is propagated only in one direction indicated by an arrow D in
[Math 1]
u=w
1
x
1
+w
2
x
2
+w
3
x
3
+w
4
x
4
+b (Formula 1)
Each input is multiplied by a different weight. In Formula 1, b is a value called bias. The output (z) of the node serves as an output of a predetermined function f with respect to the total input (u) expressed by Formula 1, and is expressed by Formula 2 below. The function f is called an activation function.
[Math 2]
z=f(u) (Formula 2)
[Math 3]
u
1
=w
11
x
1
+w
12
x
2
+w
13
x
3
+w
14
x
4
+b
1 (Formula 3-1)
u
2
=w
21
x
1
+w
22
x
2
+w
23
x
3
+w
24
x
4
+b
2 (Formula 3-2)
u
3
=w
31
x
1
+w
32
x
2
+w
33
x
3
+w
34
x
4
+b
3 (Formula 3-3)
When Formula 3-1 to Formula 3-3 are generalized, Formula 3-4 is obtained. Here, i=1, . . . I, j=1, . . . J.
[Math 4]
u
j=Σi=11wjixi+bj (Formula 3-4)
When Formula 3-4 is applied to the activation function, an output is obtained. The output is expressed by Formula 4 below.
[Math 5]
z
f
=f(uj)(j=1,2,3) (Formula 4)
(Activation Function)
In the cell type analysis method according to the embodiment, a rectified linear unit function is used as the activation function. The rectified linear unit function is expressed by Formula 5 below.
[Math 6]
f(u)=max(u,0) (Formula 5)
Formula 5 is a function obtained by setting u=0 to the part u<0 in the linear function with z=u. In the example shown in
[Math 7]
z
1=max((w11x1+w12x2+w13x3+w14x4+b1),0)
(Neural Network Learning)
If the function expressed by use of a neural network is defined as y(x:w), the function y(x:w) varies when a parameter w of the neural network is varied. Adjusting the function y(x:w) such that the neural network selects a more suitable parameter w with respect to the input x is referred to as neural network learning. It is assumed that a plurality of pairs of an input and an output of the function expressed by use of the neural network have been provided. If a desirable output for an input x is defined as d, the pairs of the input/output are given as {(x1,d1), (x2,d2), . . . , (xn,dn)}. The set of pairs each expressed as (x,d) is referred to as training data. Specifically, the set of pieces of waveform data (forward scattered light waveform data, side scattered light waveform data, fluorescence waveform data) shown in
The neural network learning means adjusting the weight w such that, with respect to any input/output pair (xn,dn), the output y(xn:w) of the neural network when given an input xn, becomes as close to the output dn as much as possible. An error function is a measure for the closeness
[Math 8]
y(xn:w)≈dn
between the training data and the function expressed by use of the neural network. The error function is also called a loss function. An error function E(w) used in the cell type analysis method according to the embodiment is expressed by Formula 6 below. Formula 6 is also called cross entropy.
[Math 9]
E(w)=−Σn=1NΣk=1Kdnk log yk(xn:w) (Formula 6)
A method for calculating the cross entropy in Formula 6 is described. In the output layer 50b of the neural network 50 used in the cell type analysis method according to the embodiment, i.e., in the last layer of the neural network, an activation function for classifying inputs x into a finite number of classes according to the contents, is used. The activation function is called a softmax function, and expressed by Formula 7 below. It is assumed that, in the output layer 50b, the nodes are arranged by the same number as the number of classes k. It is assumed that the total input u of each node k (k=1, . . . , K) of an output layer L is given as uk(L) from the outputs of the previous layer L−1. Accordingly, the output of the k-th node in the output layer is expressed by Formula 7 below.
Formula 7 is the softmax function. The sum of output y1, . . . yK determined by Formula 7 is always 1.
When each class is expressed as C1, . . . , CK, output yK of node k in the output layer L (i.e., uk(L)) represents the probability that the given input x belongs to class CK. Refer to Formula 8 below. The input x is classified into a class in which the probability expressed by Formula 8 becomes largest.
[Math 11]
p(Ck|x)=yk=zk(L) (Formula 8)
In the neural network learning, a function expressed by the neural network is considered as a model of the posterior probability of each class, the likelihood of the weight w with respect to the training data is evaluated under such a probability model, and a weight w that maximizes the likelihood is selected.
It is assumed that target output dn by the softmax function of Formula 7 is 1 only if the output is a correct class, and otherwise, target output dn is 0. In a case where the target output is expressed in a vector format of dn=[dn1, . . . , dnK], if, for example, the correct class of input xn is C3, only target output dn3 becomes 1, and the other target outputs become 0. When coding is performed in this manner, the posterior distribution is expressed by Formula 9 below.
[Math 12]
p(d|x)=Πk=1Kp(Ck|x)d
Likelihood L(w) of weight w with respect to the training data {(xn,dn)} (n=1, N) is expressed by Formula 10 below. When the logarithm of likelihood L(w) is taken and the sign is inverted, the error function of Formula 6 is derived.
Learning means minimizing error function E(w) calculated on the basis of the training data, with respect to parameter w of the neural network. In the cell type analysis method according to the embodiment, error function E(w) is expressed by Formula 6.
Minimizing error function E(w) with respect to parameter w has the same meaning as finding a local minimum point of function E(w). Parameter w is a weight of connection between nodes. The local minimum point of weight w is obtained by iterative calculation of repeatedly updating parameter w from an arbitrary initial value as a starting point. An example of such calculation is the gradient descent method.
In the gradient descent method, a vector expressed by Formula 11 below is used.
In the gradient descent method, a process of moving the value of current parameter w in the negative gradient direction (i.e., −∇E) is repeated many times. When the current weight is w(t) and the weight after the moving is w(t+1), the calculation according to the gradient descent method is expressed by Formula 12 below. Value t means the number of times the parameter w is moved.
[Math 15]
w
(t+1)
=w
(t)
−ϵ∇E (Formula 12)
[Math 16]
ϵ
The above symbol is a constant that determines the magnitude of the update amount of parameter w, and is called a learning coefficient. As a result of repetition of the calculation expressed by Formula 12, error function E(w(t)) decreases in association with increase of value t, and parameter w reaches a local minimum point.
It should be noted that the calculation according to Formula 12 may be performed on all of the training data (n=1, . . . , N) or may be performed on only part of the training data. The gradient descent method performed on only part of the training data is called a stochastic gradient descent method. In the cell type analysis method according to the embodiment, the stochastic gradient descent method is used.
(Waveform Data Analysis Process)
The analysis waveform data 80a, 80b, 80c is obtained by the measurement unit 400, 500 and is stored in the storage 23 or the memory 22 of the processing part 20A. The trained deep learning algorithm 60 including the trained connection weight w is associated with, for example, the kind of cell to which the analysis target cell belongs, and is stored in the algorithm database 105, and functions as a program module, which is part of the program that causes the computer to execute the waveform data analysis process. That is, the deep learning algorithm 60 is used by the computer including a CPU and a memory, and is used for calculating the probability of which kind of cell the analysis target cell corresponds to, and generating an analysis result 83 regarding the cell.
The generated analysis result 83 is outputted in the following manner. The CPU 21 of the processing part 20A causes the computer to function so as to execute calculation or processing of specific information according to the intended use. Specifically, the CPU 21 of the processing part 20A generates an analysis result 83 regarding the cell, by using the deep learning algorithm 60 stored in the storage 23 or the memory 22. The CPU 21 of the processing part 20A inputs the analysis data 85 into the input layer 60a, and outputs, from the output layer 60b, the label value of the type of cell to which the analysis target cell belongs, i.e., the label value of the kind of the cell identified as the one to which the cell corresponding to the analysis waveform data belongs.
With reference to the flow chart shown in
With reference to
First, the processing part 20A obtains analysis waveform data 80a, 80b, 80c. The analysis waveform data 80a, 80b, 80c is obtained via the I/F part 25, in accordance with an operation by the user or automatically, from the measurement unit 400, 500, from the storage medium 98, or via a network.
In step S21, from the sequences 82a, 82b, 82c, the processing part 20A generates analysis data in accordance with the procedure described in the analysis data generation method above.
Next, in step S22, the processing part 20A obtains the deep learning algorithm stored in the algorithm database 105. The order of steps S21 and S22 may be reversed.
Next, in step S23, the processing part 20A inputs the analysis data, to the deep learning algorithm. In accordance with the procedure described in the waveform data analysis method above, the processing part 20A outputs a label value of the type of cell to which the analysis target cell from which the analysis waveform data 80a, 80b, 80c has been obtained has been determined to belong, on the basis of the deep learning algorithm. The processing part 20A stores this label value into the memory 22 or the storage 23.
In step S24, the processing part 20A determines whether the identification has been performed on all of the pieces of the analysis waveform data 80a, 80b, 80c obtained first. When the identification of all of the pieces of the analysis waveform data 80a, 80b, 80c has ended (YES), the processing part 20A advances to step S25, and outputs an analysis result including information 83 regarding each cell. When the identification of all of the pieces of the analysis waveform data 80a, 80b, 80c has not ended (NO), the processing part 20A advances to step S26, and performs the processes from step S22 to step S24, on the analysis waveform data 80a, 80b, 80c for which the identification has not yet been performed.
According to the present embodiment, it is possible to identify the kind of cell irrespective of the skill of the examiner.
The present embodiment includes a computer program, for waveform data analysis for analyzing the type of cell, that causes a computer to execute the processes of step S11 to S16 and/or S21 to S26.
Further, a certain embodiment of the present embodiment relates to a program product, such as a storage medium, having stored therein the computer program. That is, the computer program is stored in a storage medium such as a hard disk, a semiconductor memory device such as a flash memory, or an optical disk. The storage form of the program into the storage medium is not limited, as long as the vendor-side apparatus 100 and/or the user-side apparatus 200 can read the program. Preferably, the program is stored in the storage medium in a nonvolatile manner.
Another aspect of the waveform data analysis system is described.
In
The hardware configuration of the analyzer 200B is the same as the hardware configuration of the user-side apparatus 200 shown in
The processing part 20B of the analyzer 200B performs the process shown in
The procedure of the deep learning process and the procedure of the waveform data analysis process that are performed by the analyzer 200B are similar to the procedures respectively performed by the deep learning apparatus 100A and the analyzer 200A. However, the analyzer 200B obtains the training waveform data 70a, 70b, 70c from the measurement unit 400b, 500b.
In the case of the analyzer 200B, the user can confirm the identification accuracy by the trained deep learning algorithm 60. Should the determination result by the deep learning algorithm 60 be different from the determination result according to the observation of the waveform data by the user, if the analysis waveform data 80a, 80b, 80c is used as the training data 70a, 70b, 70c, and the determination result according to the observation of the waveform data by the user is used as the label value 77, it is possible to train the deep learning algorithm again. Accordingly, the training efficiency of the deep learning algorithm 50 can be improved.
Another aspect of the waveform data analysis system is described.
In the third waveform data analysis system, the integrated-type analyzer 100B provided on the vendor side has both functions of the deep learning apparatus 100A and the analyzer 200A. Meanwhile, the third waveform data analysis system includes the terminal apparatus 200C, and provides the user-side terminal apparatus 200C with an input interface for the analysis waveform data 80a, 80b, 80c, and an output interface for the analysis result of waveform data. That is, the third waveform data analysis system is a cloud-service type system in which the vendor side that performs the deep learning process and the waveform data analysis process has an input interface for providing the analysis waveform data 80a, 80b, 80c to the user side, and an output interface for providing information 83 regarding cells to the user side. The input interface and the output interface may be integrated.
The analyzer 100B is connected to the measurement unit 400a, 500a, and obtains the training waveform data 70a, 70b, 70c obtained by the measurement unit 400a, 500a.
The terminal apparatus 200C is connected to the measurement unit 400b, 500b, and obtains the analysis waveform data 80a, 80b, 80c obtained by the measurement unit 400b, 500b.
The hardware configuration of the analyzer 100B is the same as the hardware configuration of the vendor-side apparatus 100 shown in
The training waveform data 70a, 70b, 70c is obtained in advance by the measurement unit 400a, 500a as described above, and is stored in advance in the training data database (DB) 104 or in the storage 13 or the memory 12 of the processing part 10B. It is assumed that the analysis waveform data 80a, 80b, 80c is obtained by the measurement unit 400b, 500b, and is stored in advance in the storage 23 or the memory 22 of the processing part 20C of the terminal apparatus 200C.
The processing part 10B of the analyzer 100B performs the process shown in
The procedure of the deep learning process and the procedure of the waveform data analysis process that are performed by the analyzer 100B are similar to the procedures respectively performed by the deep learning apparatus 100A and the analyzer 200A according to the present embodiment.
The processing part 10B receives the training waveform data 70a, 70b, 70c from the user-side terminal apparatus 200C, and generates training data 75 in accordance with steps S11 to S16 shown in
In step S25 shown in
As described above, by transmitting the analysis waveform data 80a, 80b, 80c to the analyzer 100B, the user of the terminal apparatus 200C can obtain analysis results 83 regarding the types of cells, as an analysis result.
According to the analyzer 100B of the third embodiment, the user can use a discriminator without obtaining the training data database 104 and the algorithm database 105 from the deep learning apparatus 100A. Accordingly, a service of identifying the kinds of cells can be provided as a cloud service.
Although the outline and specific embodiments of the present invention have been described, the present invention is not limited to the outline and the embodiments described above.
In each waveform data analysis system, the processing part 10A, 10B is realized as a single apparatus. However, the processing part 10A, 10B need not be a single apparatus. The CPU 11, the memory 12, the storage 13, the GPU 19, and the like may be provided at separate places and connected to each other through a network. The processing part 10A, 10B, the input part 16, the output part 17 also need not necessarily be provided at one place, and may be respectively provided at different places and communicably connected to each other through a network. This also applies to the processing part 20A, 20B, 20C.
In the first to third embodiments, the function blocks of the training data generation part 101, the training data input part 102, the algorithm update part 103, the analysis data generation part 201, the analysis data input part 202, and the analysis part 203 are executed by the single CPU 11 or the single CPU 21. However, these function blocks need not necessarily be executed by a single CPU, and may be executed in a distributed manner by a plurality of CPUs. These function blocks may be executed in a distributed manner by a plurality of GPUs, or may be executed in a distributed manner by a plurality of CPUs and a plurality of GPUs.
In the second and third embodiments, the program for performing the process of each step described in
In each waveform data analysis system, the input part 16, 26 is an input device such as a keyboard or a mouse, and the output part 17, 27 is realized as a display device such as a liquid crystal display. Instead of this, the input part 16, 26 and the output part 17, 27 may be integrated to be realized as a touch panel-type display device. Alternatively, the output part 17, 27 may be implemented as a printer or the like.
In each waveform data analysis system, the measurement unit 400a, 500a is directly connected to the deep learning apparatus 100A or the analyzer 100B. However, the measurement unit 400a, 500a may be connected to the deep learning apparatus 100A or the analyzer 100B via the network 99. Similarly, although the measurement unit 400b, 500b is directly connected to the analyzer 200A or the analyzer 200B, the measurement unit 400b, 500b may be connected to the analyzer 200A or the analyzer 200B via the network 99.
1. Construction of Deep Learning Model
Using Sysmex XN-1000, blood collected from a healthy individual was measured as a healthy blood sample, and XN CHECK Lv2 (control blood from Streck (having been subjected to processing such as fixation)) was measured as an unhealthy blood sample. As a fluorescence staining reagent, Fluorocell WDF manufactured by Sysmex Corporation was used. As a hemolytic agent, Lysercell WDF manufactured by Sysmex Corporation was used. For each cell contained in each specimen, waveform data of forward scattered light, side scattered light, and side fluorescence was obtained at 1024 points at a 10 nanosecond interval from the measurement start of forward scattered light. With respect to the healthy blood sample, waveform data of cells in blood collected from 8 healthy individuals was pooled as digital data. With respect to the waveform data of each cell, classification of neutrophil (NEUT), lymphocyte (LYMPH), monocyte (MONO), eosinophil (EO), basophil (BASO), and immature granulocyte (IG) was manually performed, and each piece of waveform data was provided with annotation (labelling) of the type of cell. The time point at which the signal strength of forward scattered light exceeded a threshold was defined as the measurement start time point, and the time points of obtainment of pieces of waveform data of forward scattered light, side scattered light, and side fluorescence were synchronized to each other, to generate training data. In addition, the control blood was provided with annotation “control blood-derived cell (CONT)”. The training data was inputted to the deep learning algorithm to be learned by the deep learning algorithm.
With respect to blood cells of another healthy individual different from the healthy individual from whom the cell data having been learned was obtained, analysis waveform data was obtained by Sysmex XN-1000 in a manner similar to that for training data. Waveform data derived from the control blood was mixed, to create analysis data. With respect to this analysis data, blood cells derived from the healthy individual and blood cells derived from the control blood overlapped each other on the scattergram, and were not able to be discerned at all by a conventional method. This analysis data was inputted to a constructed deep learning algorithm, and data of the types of individual cells was obtained.
Next, with respect to each type of cell, ROC analysis was performed, and sensitivity and specificity were evaluated.
From the result above, it has been clarified that type of cell can be determined by using the deep learning algorithm on the basis of signals obtained from a cell contained in a biological sample and on the basis of waveform data.
Further, there are cases where, when unhealthy blood cells such as a control blood are mixed with healthy blood cells, it is difficult to make determination by a conventional scattergram method. However, it has been shown that, when the deep learning algorithm of the present embodiment is used, even when unhealthy blood cells are mixed with healthy blood cells, it is possible to make determination about these cells.
This application is a continuation of International Application PCT/JP2020/011596 filed on Mar. 17, 2020, which claims benefit of Japanese patent application No. JP2019-055385 filed on Mar. 22, 2019, both of which are incorporated herein by reference in their entireties.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/JP2020/011596 | Mar 2020 | US |
Child | 17480683 | US |