The present invention relates to a cell selecting apparatus, a cell selecting method, and a computer program product.
A technique for analysis of cell culture data and the like is described by conventional methods relating to bioinformatics.
For example, according to the analytical methods disclosed in Non Patent Literature 1 and Non Patent Literature 2, an analytical technique using GI50 (50% Growth Inhibition), which indicates the sensitivity of an anti-cancer agent against cancer cells cultured under a certain culture condition, and cluster analysis is disclosed.
Further, a method for diversified investment of assets is disclosed by conventional investment theories.
For example, according to the portfolio selection adopted for modern investment theories described in Non Patent Literature 3, a method of lowering investment risk by combining plural investment options is disclosed. Specifically, since financial goods with high expected return (profit) are generally accompanied with high risk while financial goods with low profit have low risk, it is believed in the modern investment theories that a tradeoff is present between the expected return and the risk (deviation in return represented by standard deviation). Thus, with regard to the portfolio selection, a method of finding an optimum combination (portfolio) having desirable profit rate and acceptable risk among the financial goods (investment goods) each having different expected profit rate and risk is suggested for an investor or others who purchase multiple financial goods.
However, according to the analytical method described in Non Patent Literature 1 and Non Patent Literature 2, presence or absence of proliferation inhibition was examined by administering an anti-cancer agent to each cancer cells cultured under fixed culture condition that is different from the diverse environments inside a living body. For such reasons, according to the analytical method, only the cells fitted to the fixed culture condition (over adaptation) can proliferate and survive selectively, while other cells are reduced in accordance with the passage. Thus, according to the analytical method, there is a problem that during the screening of an anti-cancer agent it is difficult to specify an anti-cancer agent or a combination of an anti-cancer agent, which can stably inhibit cancers under various environments.
Further, as the conventional portfolio selection described in Non Patent Literature 3 is directed only to a method of reasonable and diversified investment of financial goods, an application to bioinformation is not considered at all. Thus, there is a problem that, even when bioinformation is simply linked to portfolio selection, it cannot determine an optimum combination of cells or the like.
The present invention is devised in view of the problems described above, and an object of the invention is to provide a cell selecting apparatus, a cell selecting method, and a computer program product for selecting an optimum combination of cells or microorganisms by mapping, in a yield risk space, a portfolio including production outputs based on cell proliferation rate, production of biomass by microorganisms, or the like, and variables indicating the extent of the variation of production outputs, that are calculated from suppressed proliferation, reduced production outputs, or the like considering multiple disturbances and probability of its occurrence.
In order to attain this object, the cell selecting apparatus according to one aspect of the present invention is a cell selecting apparatus comprising at least a storage unit and a control unit, wherein the storage unit includes a measured data storage unit that stores, for each cells, measured data about a proliferation of the cells cultured under multiple conditions or a production of biomass, and the control unit includes a setting unit that sets, for each cells or each condition, a production output of the cells or the biomass based on the measured data that are stored in the measured data storage unit and a variable indicating an extent of variation of the production output, a variable range calculating unit that calculates a variable range of the production output and the variable in case of combining the multiple cells or conditions based on the production output and the variable set by the setting unit, and an optimizing unit that calculates an optimum combination within the variable range by using an optimization method.
The cell selecting apparatus according to another aspect of the present invention is the cell selecting apparatus, wherein the variable is standard deviation of the production output.
The cell selecting apparatus according to still another aspect of the present invention is the cell selecting apparatus, wherein the variable range calculating unit calculates, based on the production output and the variable, a variable range of the production output and the variable in case of combining the multiple cells or conditions on the basis of covariance.
The cell selecting apparatus according to still another aspect of the present invention is the cell selecting apparatus, wherein the optimizing unit calculates a combination closest to effective frontier for maximizing the production output under the constant variable within the variable range by using the optimization method.
The cell selecting apparatus according to still another aspect of the present invention is the cell selecting apparatus, wherein the optimization method is Pareto optimization, genetic algorithm, least square method, or a suitable probabilistic or analytical optimization method.
The cell selecting apparatus according to still another aspect of the present invention is the cell selecting apparatus, wherein the multiple cells are the cells of multiple types.
The cell selecting apparatus according to still another aspect of the present invention is the cell selecting apparatus, wherein the multiple cells are the multiple cells of a single type.
The cell selecting apparatus according to still another aspect of the present invention is the cell selecting apparatus, wherein the cells are microorganisms.
The cell selecting apparatus according to still another aspect of the present invention is the cell selecting apparatus, wherein the cells are cancer cells.
The cell selecting apparatus according to still another aspect of the present invention is the cell selecting apparatus, wherein the measured data is GI50 that is a concentration of a compound for inhibiting cell proliferation by 50% and the production output is a reciprocal number of GI50.
The cell selecting method according to still another aspect of the present invention is a cell selecting method executed by a cell selecting apparatus including at least a storage unit and a control unit, wherein the storage unit includes a measured data storage unit that stores, for each cells, measured data about a proliferation of the cells cultured under multiple conditions or a production of biomass, the method executed by the control unit comprising a setting step of setting, for each cells or each condition, a production output of the cells or the biomass based on the measured data that are stored in the measured data storage unit and a variable indicating an extent of variation of the production output, a variable range calculating step of calculating a variable range of the production output and the variable in case of combining the multiple cells or conditions based on the production output and the variable set at the setting step, and an optimizing step of calculating an optimum combination within the variable range by using an optimization method.
The computer program product according to still another aspect of the present invention is a computer program product having a non-transitory computer readable mediums including programmed instructions for a cell selecting method executed by a cell selecting apparatus including at least a storage unit and a control unit, wherein the storage unit includes a measured data storage unit that stores, for each cells, measured data about a proliferation of the cells cultured under multiple conditions or a production of biomass, wherein the instructions, when executed by the control unit, cause the control unit to execute a setting step of setting, for each cells or each condition, a production output of the cells or the biomass based on the measured data that are stored in the measured data storage unit and a variable indicating an extent of variation of the production output, a variable range calculating step of calculating a variable range of the production output and the variable in case of combining the multiple cells or conditions based on the production output and the variable set at the setting step, and an optimizing step of calculating an optimum combination within the variable range by using an optimization method.
According to the invention, because the present invention sets, for each cells or each condition, a production output of the cells or the biomass based on the measured data and a variable indicating an extent of variation of the production output, calculates a variable range of the production output and the variable in case of combining the multiple cells or conditions based on the production output and the variable, and calculates an optimum combination within the variable range by using an optimization method, for producing antibiotics and biomass such as bioethanol, an optimum combination of microorganisms can be found for a plant in which the conditions are easily varies, and therefore an effect of achieving effective operation of biomass production can be obtained. Further, for screening of pharmaceuticals such as anti-cancer agent, an effect of developing a new pharmaceutical for efficient cancer inhibition can be obtained regardless of many differences among individuals, inside body environments, and the like.
Further, according to the invention, because the variable is standard deviation of the production output, an effect of enabling the numeric expression of a deviation of cell proliferation rate or biomass production outputs can be obtained.
Further, according to the invention, because the present invention calculates, based on the production output and the variable, a variable range of the production output and the variable in case of combining the multiple cells or conditions on the basis of covariance, an effect of enabling the setting of a portfolio selectable region considering correlation and anti-correlation between cell lines can be obtained.
Further, according to the invention, because the present invention calculates a combination closest to effective frontier for maximizing the production output under the constant variable within the variable range by using the optimization method, an effect of enabling the selection of a combination of cells and the like that is close to the combination of cells and the like performing the theoretically most effective production under constant risk can be obtained.
Further, according to the invention, because the optimization method includes Pareto optimization, genetic algorithm, least square method, or a suitable probabilistic and analytical optimization method, an effect of enabling the shortening of process time can be obtained by using a general calculation method.
Further, according to the invention, because the multiple cells are the cells of multiple types, an effect of enabling the application on screening of an anti-cancer agent and the like which can efficiently suppress proliferation of a group of cancer cells and the like having genetic diversity can be obtained.
Further, according to the invention, because the multiple cells are the multiple cells of single type, an effect of enabling the specification of a combination of conditions and the like for efficient proliferation of specific cells can be obtained.
Further, according to the invention, because the cells are microorganisms, an effect of enabling the utilization for production of antibiotics, bioethanol, and the like can be obtained.
Further, according to the invention, because the cells are cancer cells, an effect of enabling the utilization for development of an anti-cancer agent can be obtained.
Further, according to the invention, because the measured data are GI50, i.e., concentration of a compound for inhibiting cell proliferation by 50%, and the production outputs are a reciprocal number of GI50, an effect of enabling the specification of an optimum combination of cells or anti-cancer agent by using the data accumulated all over the world until now can be obtained by using generic indices even when the measured data itself is small.
Herein below, embodiments for carrying out the cell selecting apparatus, cell selecting method, and computer program product according to the invention will be described in detail with reference to the drawings. However, it is evident that the invention is not limited to the embodiments.
In the embodiments given below, an example of applying the invention to pharmaceuticals and biomass production will be described, in particular. However, the present invention is not limited thereto, the same application can be made to all technical fields in which cell culture is employed.
Outline of the embodiment of the invention will be described with reference to
Briefly, the present embodiment has basic characteristics as follows. That is, as illustrated in
In addition, the control unit of the cell selecting apparatus calculates, based on the production outputs (yield) and variables (risk) set in the step SA-1, variable ranges (portfolio selectable ranges) of the production outputs (yield) and variables (risk) in case of forming a combination (portfolio) of multiple cells or conditions (step SA-2). In this regard, the control unit may calculate, based on the production outputs and variables, the variable ranges of the production outputs and variables in case of combining multiple cells or conditions on the basis of covariance. Further, the control unit may calculate, based on the production outputs and variables, the variable ranges of the production outputs and variables in case of combining multiple cells or conditions by calculating the correlation and anti-correlation between cells using the covariance calculation method. Further, the multiples cells may be cells of multiple types or multiple cells of a single type.
In addition, the control unit of the cell selecting apparatus calculates an optimum combination of cells or conditions (portfolio selection) within the variable ranges (portfolio selectable ranges) by using an optimization method (step SA-3) and ends the process. Herein, the control unit may calculate a combination closest to effective frontier for maximizing the production outputs under constant variables within the variable ranges by using an optimization method. Further, the optimization method can be Pareto optimization, genetic algorithm, least square method, or other suitable probabilistic or analytical optimization method.
Herein below, basic concept for portfolio selection will be described with reference to
As illustrated in
Herein, as illustrated in
Back to
In
As used herein, the effective frontier indicates a group of points in a yield risk space which consists of an optimum combination of cells for maximizing the expected yield under constant risk. A combination of cells outside the effective frontier is inferior to the combination present on the effective frontier. However, on the effective frontier, since the change in expected yield is directly linked to a change in risk, a tradeoff between risk and expected yield occurs in the optimized portfolio present on the effective frontier. Further, in theory, any portfolio not present on the effective frontier can maximize the expected yield without increasing the risk or can reduce the risk without lowering the expected yield. For example, as shown by the arrow extended from the portfolio X illustrated in
Further, as illustrated in
As above, descriptions for the outline of the embodiment have been explained.
Next, constitution of a cell selecting apparatus 100 will be described with reference to
In
Various database or tables that are stored in the storage unit 106 (a measurement database 106a) are a storage unit such as a fixed disc device. For example, the storage unit 106 stores various programs, tables, files, database, and web pages or the like that are used for various processes.
Among the constitutional elements of the storage unit 106, the measurement database 106a is a measured data storage unit, that stores, for each cells, measured data regarding proliferation of cells cultured under multiple conditions (culture condition, disturbance, and the like) or production of biomass (for example, indices or the like to be adopted as yield (production outputs) such as proliferation rate by each cells under each condition, in which each cells has been cultured under multiple conditions, or biomass production). Herein, the cell can be a microorganism. Further, the cell can be a cancer cell. Further, the measured data can be GI50, which indicates the concentration of a compound inhibiting the cell proliferation by 50%.
Further, in
Among those described above, the setting unit 102a is a setting unit that sets production outputs of cells or biomass based on the measured data that are stored in the measurement database 106a and variables for each cells or condition in which the variables indicate the extent of the variation of the production outputs. Herein, the variables can be standard deviation of the production outputs. Further, the production outputs can be reciprocal number of GI50. Further, the setting unit 102a may set the production outputs by considering the disturbance may probability of cell generation for each culture condition.
Further, the variable range calculating unit 102b is a variable range calculating unit that calculates the variable ranges of production outputs and variables in case of combining multiple cells or conditions, based on the production outputs and variables that are set by the setting unit 102a. Herein, based on the production outputs and variables, the variable range calculating unit 102b may calculate the variable ranges of production outputs and variables in case of combining multiple cells or conditions on the basis of covariance. Further, based on the production outputs and variables, the variable range calculating unit 102b may calculate the correlation and anti-correlation among cells by a method for calculating covariance and calculate the variable ranges of production outputs and variables in case of combining multiple cells or conditions. Further, the multiple cells may be cells of multiple types or multiple cells of a single type.
Still further, the optimizing unit 102c is an optimizing unit that calculates, by using an optimization method, an optimum combination within the variable ranges. Herein, the optimizing unit 102c may calculate, using the optimization method, a combination closest to effective frontier for maximizing the production output under constant variables within the variable ranges. Further, the optimization method may be Pareto optimization, genetic algorithm, least square method, or any probabilistic or analytical optimization.
Next, an example of the process of the system of the embodiment that is constituted as described above will be described in detail with reference to
First, details about the process with the cell selecting apparatus 100 in biomass production according to the embodiment will be described with reference to
As illustrated in
Further, the setting unit 102a calculates the standard deviation about variation in production outputs of the biomass obtained in the step SB-1 and sets the calculated standard deviation as risk for each subgroup (step SB-2).
Further, the setting unit 102a projects (maps) the yield set in the step SB-1 and the risk set in the step SB-2 into a yield risk space by using the yield and the risk as a coordinate of subgroups (step SB-3).
Further, the variable range calculating unit 102b calculates correlation and anti-correlation among subgroups using a method for calculating covariance based on the coordinates for each subgroup projected into a yield risk space by the setting unit 102a in the step SB-3, and calculates the variable ranges (portfolio selectable ranges) of a combination (portfolio) of multiple subgroups (step SB-4).
Further, by performing optimization which uses a separately defined indifference curve in a yield risk space (for example, Pareto optimization, genetic algorithm, or least square method), the optimizing unit 102c calculates the ratio and the combination (portfolio) of a subgroup having the maximum effect as being closest to the effective frontier in the portfolio selectable ranges obtained by the calculation in the step SB-4 (step SB-5). Accordingly, an optimum operation condition for the biomass plant can be set (for example, what kind of microorganisms should be cultured at which ratio in a tank in order to stably obtain the production outputs), and thus the invention is effective for optimum operation of a plant having variable culture conditions.
Next, details of the process of the cell selecting apparatus 100 for screening of an anti-cancer agent will be described with reference to
As illustrated in
Herein, referring to
In general, for screening of an anti-cancer agent, cancer cells are cultured under a fixed culture condition that is different from the environment inside a living body. Since the cancer cells fitted to the fixed culture condition will exhibit the highest proliferation, even when the developments are made by having as a candidate for new pharmaceuticals a compound exhibiting an effect against the cancer cells suitable for the condition, there is a possibility that the results different from those expected may be obtained in real clinical practice. Thus, as measured data for the embodiment, it is possible that multiple disturbances in condition are presumed to prevent over adaptation to the culture condition and the measured data obtained from cancer cells that are actually cultured with certain probability are used. Accordingly, the measured data obtained from cancer cells according to the embodiment is not limited to the measured data obtained from over adaptation to culture condition, and as a result, screening of suitable anti-cancer agent can be made. Herein, examples of the disturbances in condition may include variation in factors which fluctuate depending on one-day period, such as NADP, ATP, and glucose, and periodic variation in gene expression. The disturbances in condition may be approximated by various methods such as change in medium composition, and introduction of RNAi to cells during the cell culture.
Herein, referring to
As illustrated in
Herein, as a result of examining in more detail the cell group having increased ratio and the cell group having decreased ratio illustrated in
Accordingly, as the measured data stored in the measurement database 106a of the embodiment of the invention, data of proliferation rate measured by adding an anti-cancer agent for each condition of disturbance, and data obtained by measuring GI50 under each condition of disturbances by culturing each cancer cell line under multiple conditions of disturbance may be also used. For example, when several fragments under actual environment are cultured under C kinds of disturbance without an anti-cancer agent or with an anti-cancer agent (a candidate compound) of N types at D concentrations, culture can be carried out with C*(1+N*D) kinds. Accordingly, having the culture results with no anti-cancer agent as a reference data, C*N*D kinds of GI50 of the cells can be calculated. Further, when it is believed that the presumed disturbance occurs at presumed probability at the time of administering an anti-cancer agent in the embodiment of the invention, an anti-cancer agent having small GI50 and small risk (standard deviation) will be the most desired one. Thus, by setting the combination of disturbances that are close to actual environment inside a living body and probability of the occurrence thereof, a compound believed to have the highest effect in actual environment can be predicted among the candidate anti-cancer agents.
In addition, the setting unit 102a, when culture condition of C types occurs with constant probability, the setting may be used for yield calculation. Further, the setting unit 102a may presume multiple cases with varying probability of occurrence and calculate yield for each case.
Referring to
From various studies on cancer cells, it has been found that genetically diverse cancer cells are present as a mixture even in a tumor of single patient (Baisse B, Bouzourene H, Saraga E P, Bosman F T, Benhattar J. Intratumor genetic heterogeneity in advanced human colorectal adenocarcinoma. Int J Cancer. 2001 Aug. 1; 93(3): 346-52.). For such reasons, there is a case in which one anti-cancer agent has an effect on cancer cells having specific mutation but has no effect on cancer cells having other mutation, that is, resistance to the anti-cancer agent may be exhibited. Thus, when an anti-cancer agent is administered in such state, cancer having an ability of maintaining proliferation even in the presence of the anti-cancer agent may be present at great ratio due to genetic diversity of cancer. Thus, as a way of coping with the genetic diversity of cancer, use of an additional anti-cancer agent, that is, use of a mixture of multiple anti-cancer agents, is required.
Thus, as illustrated in
Further, the setting unit 102a may set the average value (0.1276) of reciprocal number of GI50 obtained by the conversion into a log scale as the yield (proliferation inhibition efficiency) of the anti-cancer agent 1 and the standard deviation (0.011) of GI50 obtained by the conversion into a log scale as the risk of the anti-cancer agent 1.
Further, as an example of the embodiment of the invention, when the proliferation inhibition efficiency of the anti-cancer agent 1 for each cancer cell subgroup is set by the setting unit 102a, the opposite pattern of proliferation inhibition efficiency of the anti-cancer agent 2 is set by the setting unit 102a, and they are consequently selected as an optimum combination by the optimizing unit 102c to be described below, thus design of having anti-cancer agents that are used simultaneously to evenly inhibit the proliferation of the cancer cells can be achieved. Such calculation is usable for selection of pharmaceuticals in clinical practice and it allows a pharmaceutical company to perform the calculation of an anti-cancer agent that is commercially available at present or under development with a screening condition that they are recommended as an anti-cancer agent to be administered as a combination.
Further, with regard to the cell lines from which measured data stored in the measurement database 106a of the embodiment of the invention is collected, standard cancer cell lines exhibiting the most similar drug response to that of an identified cancer cell subgroup, in which the response is determined in certain form by examining the diversity of cancer cells in a patient, may be selected as substitute cells (that is, surrogate cells) for screening. For example, the cell lines from which measured data stored in the measurement database 106a of the embodiment of the invention is collected may be selected as surrogate cell lines exhibiting the most similar response to known cancer cell lines, as illustrated in
Back to
The setting unit 102a projects (maps) the yield set in the step SC-1 and the risk set in the step SC-2 into a yield risk space as coordinates of the cells or anti-cancer agent (step SC-3).
Herein, referring to
As illustrated in
Back to
Further, the optimizing unit 102c calculates optimum combination (portfolio) in the variable ranges using an optimization method based on a pre-determined indifference curve, specifies most effective combination of cancer cells cultured under which condition (that is, administered with an anti-cancer agent), and the like, and selects an optimum combination of cancer agent from multiple cancer agents (step SC-5). The process is terminated thereafter. Accordingly, the user such as a pharmaceutical company can decide which candidate compound should be further followed as a subject of development.
The embodiment of the present invention is explained above. However, the present invention may be implemented in various different embodiments other than the embodiment described above within a technical scope described in claims.
For example, an example in which the cell selecting apparatus 100 performs the processing as a standalone apparatus is explained. However, the cell selecting apparatus 100 can be configured to perform processes in response to request from the client terminal (which is a separate case from the cell selecting apparatus 100) and return the process results to the client terminal.
All the automatic processes explained in the present embodiment can be, entirely or partially, carried out manually. Similarly, all the manual processes explained in the present embodiment can be, entirely or partially, carried out automatically by a known method.
The process procedures, the control procedures, specific names, information including registration data for each process and various parameters such as search conditions, display example, and database construction, mentioned in the description and drawings can be changed as required unless otherwise specified.
Further, each constitutional element of the cell selecting apparatus 100 is based on functional concept, and it is not necessarily constituted physically as illustrated in the figures.
For example, the process functions performed by each device of the cell selecting apparatus 100, especially the each process function performed by the control unit 102, can be entirely or partially realized by CPU and a computer program executed by the CPU or by a hardware using wired logic. The computer program, recorded on a non-transitory computer readable recording medium including programmed commands for causing a computer to execute the method of the present invention, can be mechanically read by the cell selecting apparatus 100 as the situation demands. In other words, the storage unit 106 such as read-only memory (ROM) or hard disk drive (HDD) stores the computer program that can work in coordination with an operating system (OS) to issue commands to the CPU and cause the CPU to perform various processes. The computer program is first loaded to the random access memory (RAM), and forms the control unit in collaboration with the CPU.
Further, the computer program may be stored in an application program server which is connected to the cell selecting apparatus 100 via any network, and if necessary, part or entire program can be downloaded.
The computer program may be stored in a computer-readable recording medium, or may be structured as a program product. Here, the “recording medium” includes any “portable physical medium” such as a memory card, a USB (Universal Serial Bus) memory, an SD (Secure Digital) card, a flexible disk, an optical disk, a ROM, an EPROM (Erasable Programmable Read Only Memory), an EEPROM (Electronically Erasable and Programmable Read Only Memory), a CD-ROM (Compact Disk Read Only Memory), an MO (Magneto-Optical disk), a DVD (Digital Versatile Disk), and a Blu-ray Disc.
“Program” refers to a data processing method written in any computer language and written method, and can have software codes and binary codes in any format. The “program” can be a dispersed form in the form of a plurality of modules or libraries, or can perform various functions in collaboration with a different program such as the OS. Any known configuration in the each device according to the embodiment can be used for reading the recording medium. Similarly, any known process procedure for reading or installing the computer program can be used.
Various databases (the measurement database 106a) stored in the storage unit 106 is a storage unit such as a memory device such as a RAM or a ROM, a fixed disk device such as a HDD, a flexible disk, and an optical disk, and stores therein various programs, tables, databases, and web page files used for providing various processing or web sites.
The cell selecting apparatus 100 may be structured as an information processing apparatus such as known personal computers or workstations, or may be structured by connecting any peripheral devices to the information processing apparatus. Furthermore, the cell selecting apparatus 100 may be realized by mounting software (including programs, data, or the like) for causing the information processing apparatus to implement the method according of the invention.
The distribution and integration of the device are not limited to those illustrated in the figures. The device as a whole or in parts can be functionally or physically distributed or integrated in an arbitrary unit according to various attachments or how the device is to be used. That is, any embodiments described above can be combined when implemented, or the embodiments can selectively be implemented.
As described in detail in the above, the invention is capable of providing the cell selecting apparatus, the cell selecting method, and the computer program product for selecting an optimum combination of cells or microorganisms by mapping, in a yield risk space, a portfolio including production outputs based on cell proliferation rate, production of biomass by microorganisms, or the like, and variables indicating the extent of the variation of production outputs, that are calculated from suppressed proliferation, reduced production outputs, or the like considering multiple disturbances and probability of its occurrence.
Number | Date | Country | Kind |
---|---|---|---|
2010-130513 | Jun 2010 | JP | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/JP2011/058299 | 3/31/2011 | WO | 00 | 12/5/2012 |