Claims
- 1. A method for developing a model to predict a chemical compound property, the method comprising:
obtaining at least one descriptor from structural data for each of a plurality of compounds; obtaining at least one descriptor from experimental or predicted data for each of a plurality of compounds; obtaining at least one chemical compound property for each of the plurality of compounds; and developing the model by mapping the descriptors to the chemical compound property.
- 2. The method of claim 1, wherein the chemical property is an ADME property.
- 3. The method of claim 2, wherein the ADME property is absorption.
- 4. The method of claim 2, wherein the ADME property is Caco-2 Effective Permeability.
- 5. The method of claim 1, wherein the chemical property is a toxicity property.
- 6. The method of claims 1-5 wherein obtaining at least one descriptor comprises selecting the descriptors applicable to the characteristic to be predicted based on an analysis of the plurality of compounds.
- 7. The system of claim 6 wherein the analysis used to select the descriptors for predicting the characteristic is selected from at least one of the following: Affine Regressions, Kernel Methods, Artificial neural networks, Finite State Machines—Maximum A Posteriori, Nearest Neighbor Methods, Fisher's Linear Discriminate Analysis, or other regression/classification methods.
- 8. The system of claim 6 further comprising:
performing a chemical space analysis of the plurality of compounds; if the chemical space analysis indicates that the plurality of compounds selected should be modified to improve diversity of the chemical space, then modifying the plurality of compounds by addition or deletion of a compound to improve the diversity of the chemical space covered by the plurality of compounds.
- 9. A system for predicting an ADME/Tox of a compound in a mammalian body, the system comprising:
a database facility, the database facility configured to store and to provide structural and experimental or predicted data; and a processor facility, the processor facility configured to allow the entry of data relating to a new proposed chemical compound including structural data and experimental or predicted data, to perform an analysis of the chemical compound by mapping the data entered to produce a predicted ADME/Tox property of the chemical compound based on the analysis.
- 10. A method for compiling chemical compound data to be used for evaluating the characteristics of a proposed compound, the method comprising:
selecting a plurality of compounds; obtaining a descriptor analysis for each of the plurality of compounds; obtaining test results related to the characteristics being evaluated; and loading the descriptor analysis and the test results into a database used to predict the characteristics of proposed compounds.
- 11. The method of claim 10 further comprising:
performing a chemical space analysis of the plurality of compounds; if the chemical space analysis indicates that the plurality of compounds selected should be modified to improve diversity of the chemical space, then modifying the plurality of compounds by addition or deletion of a compound to improve the diversity of the chemical space covered by the plurality of compounds.
- 12. A system for predicting the chemical properties of a proposed compound comprising:
a database facility configured to store and to serve data relating to the characteristics of selected compound, including structure data, descriptor data, and test data; and a processor facility coupled to the database facility and configured to predict the characteristics of a proposed compound by:
(a) receiving at least one proposed compound via a user input means; (b) selecting training compounds from the database facility based on the characteristics to be predicted of the proposed compounds; (c) selecting the most meaningful descriptors applicable to the characteristic to be predicted based on an analysis of the training compounds selected in step (b); (d) creating validation data subsets of the training data based upon the distribution of descriptors and target characteristics of compounds selected in (b/c); (e) mapping the training set obtained in (d) to the target data resulting in a model which could predict the target data of a proposed compound; (f) modifying (for example: boosting, bootstrap aggregation (bagging)), and other model enhancement methods, etc.) one or more models produced in (e) based upon performance on validation sets obtained in (d) to form a composite model; (g) combining (via boosting, committee machines etc,) a set of two or more models produced in (e or f) based upon performance on validation sets obtained in (d) to form a composite model; and (h) running the model determined in either step (e), (f) or (g) using the required input data (the identity of the subset of input data itself was determined in step (c)) to predict the required target data.
- 13. The system of claim 12 wherein the analyses consider model biases and over training.
- 14. A method for predicting a characteristic of a chemical compound, the method comprising:
receiving as an input structure data for the compound; and mapping the data to at least one chemical characteristic.
- 15. A predictive model of a chemical compound property produced according to the method of any of claims 1-3.
- 16. A computer readable medium containing a chemical compound characteristic model, the medium comprising:
a computer readable medium; and a data structure on the medium that generates at least one characteristic for a compound from structure data and experimental or predictive data for the compound.
- 17. The medium of claim 16, wherein the characteristic is an ADME property.
- 18. The method of claim 17, wherein the ADME property is absorption.
- 19. The method of claim 16, wherein the characteristic is a toxic property.
Parent Case Info
[0001] This application claims the benefit of U.S. Provisional Application Nos. 60/221,548 filed Jul. 28, 2000, entitled PHARMACOKINETIC-BASED DRUG DESIGN TOOL AND METHOD; and 60/267,435 filed Feb. 9, 2001 entitled SYSTEM AND METHOD FOR PREDICTING ADME CHARACTERISTICS OF A COMPOUND BASED ON ITS STRUCTURE.
PCT Information
Filing Document |
Filing Date |
Country |
Kind |
PCT/US01/23763 |
7/30/2001 |
WO |
|