Claims
- 1. A method of classifying a biological sample comprising:
determining the expression pattern of one or more markers in a biological sample; providing a model generated by a supervised learning algorithm based on a dataset of expression values from known biological classes; and comparing the expression pattern of the markers in the sample to the model, thereby classifying said biological sample.
- 2. The method of claim 1, wherein the biological sample is classified as a disease sample or as a normal sample.
- 3. The method of claim 2, wherein the disease is selected from the group consisting of cancer, coronary artery disease, neurodegenerative disease and pulmonary disease.
- 4. The method of claim 3, wherein the disease is cancer.
- 5. The method of claim 1, wherein the dataset comprises data from known classes of a particular disease.
- 6. The method of claim 5, wherein the particular disease is selected from the group consisting of cancer, coronary artery disease, neurodegenerative disease and pulmonary disease.
- 7. The method of claim 6, wherein the disease is cancer.
- 8. The method of claim 7, wherein the classes of cancer are selected from the group consisting of breast adenocarcinoma, prostate adenocarcinoma, lung adenocarcinoma, colorectal adenocarcinoma, lymphoma, bladder transitional cell carcinoma, melanoma, uterine adenocarcinoma, leukemia, renal cell carcinoma, pancreatic adenocarcinoma, ovarian carcinoma, pleural mesothelioma and central nervous system.
- 9. The method of claim 1, wherein the biological sample is compared to the model in a pairwise manner for each biological class.
- 10. The method of claim 9, wherein the pairwise comparison is a one class versus all other comparison.
- 11. The method of claim 1, wherein the supervised learning algorithm is a support vector machine algorithm.
- 12. The method of claim 11, wherein the support vector machine algorithm is linear or non-linear.
- 13. The method of claim 1, wherein the steps are performed in a computer system.
- 14. The method of claim 1, wherein the a digital processor is used to compare the expression pattern of the markers in the sample to the model.
- 15. In a computer system, a method for classifying at least one biological sample to be tested that is obtained from an individual, wherein expression values of more than one marker are determined for the sample to be tested, comprising:
receiving the gene expression values for more than one marker in the sample to be tested; providing a model generated by a supervised learning algorithm based on a dataset of expression values from known biological classes; comparing the gene expression values of the sample to that of the model, to thereby produce a classification of the sample; and providing an output indication of the classification.
- 16. A computer apparatus for providing an indication of the classification of a biological sample, wherein the sample is obtained from an individual, wherein the apparatus comprises:
a source of expression values of more than one marker in the sample; means for providing a model generated by a supervised learning algorithm based on a dataset of expression values from known biological classes; a processor routine executed by a digital processor, coupled to receive the expression values from the source, the processor routine determining classification of the sample by comparing the expression values of the sample to the model; and an output assembly, coupled to the digital processor, for providing an indication of the classification of the sample.
- 17. A method of determining a treatment plan for an individual having a disease, comprising:
obtaining a biological sample from the individual; providing a model generated by a supervised learning algorithm based on a dataset of expression values from known biological classes; assessing the sample for the level of expression of more than one marker; using the model to perform one or more pairwise comparisons of the sample versus at least one disease class, thereby resulting in the disease class classification of the sample; and using the disease class to determine a treatment plan.
- 18. A method of determining the efficacy of a drug designed for the treatment of a disease, comprising:
obtaining a biological sample from an individual having the disease; subjecting the sample to the drug; assessing the drug-exposed sample for the level of expression of more than one marker; providing a model generated by a supervised learning algorithm based on a dataset of expression values from known samples on which the drug has different levels of efficacy; and using a computer to compare the drug-exposed sample to the model to determine the efficacy of the drug in treating the disease.
- 19. A model produced from a dataset of expression data comprising a plurality of markers from known biological samples formed using a supervised learning algorithm to define a hyperplane that characterizes a biological class.
- 20. A method of classifying a biological sample comprising:
determining the expression pattern of one or more markers in a biological sample; providing a model generated by a linear support vector machine algorithm based on a dataset of expression values from multiple known biological classes; and using a digital processor to compare the expression pattern of the markers in the sample to the model using one or more one versus all other pairwise comparisons, thereby classifying said biological sample.
RELATED APPLICATION
[0001] This application claims the benefit of U.S. Provisional Application No. 60/332,268, filed Nov. 14, 2001.
[0002] The entire teachings of the above application are incorporated herein by reference.
GOVERNMENT SUPPORT
[0003] The invention was supported, in whole or in part, by training grant 5T32 HL07623 from the National Institutes of Health. The Government has certain rights in the invention.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60332268 |
Nov 2001 |
US |