I. Field of the Invention
The invention relates generally to the field of molecular biology. More particularly, it concerns methods and compositions involving microRNA molecules (miRNAs). Certain aspects of the invention include applications for miRNAs in diagnostics, prognostics, and therapy for thyroid cancer.
II. Background
In 2001, several groups used a cloning method to isolate and identify a large group of “microRNAs” (miRNAs) from C. elegans, Drosophila, and humans (Lagos-Quintana et al., 2001; Lau et al., 2001; Lee and Ambros, 2001). Several hundreds of miRNAs have been identified in plants and animals—including humans—which do not appear to have endogenous siRNAs. Thus, while similar to siRNAs, miRNAs are nonetheless distinct.
miRNAs thus far observed have been approximately 21-22 nucleotides in length and arise from longer precursors, which are transcribed from non-protein-encoding genes. See review of Carrington et al. (2003). The precursors form structures that fold back on themselves in self-complementary regions; they are then processed by the nuclease Dicer in animals or DCL1 in plants. miRNA molecules interrupt translation through precise or imprecise base-pairing with their targets.
Many miRNAs are conserved among diverse organisms, and this has led to the suggestion that miRNAs are involved in essential biological processes throughout the life span of an organism (Esquela-Kerscher and Slack, 2006). In particular, miRNAs have been implicated in regulating cell growth and cell and tissue differentiation, cellular processes that are associated with the development of cancer. For instance, lin-4 and let-7 both regulate passage from one larval state to another during C. elegans development (Ambros, 2001). miR-14 and bantam are Drosophila miRNAs that regulate cell death, apparently by regulating the expression of genes involved in apoptosis (Brennecke et al., 2003, Xu et al., 2003).
Research on miRNAs is increasing as scientists are beginning to appreciate the broad role that these molecules play in the regulation of eukaryotic gene expression. In particular, several recent studies have shown that expression levels of numerous miRNAs are associated with various cancers (reviewed in Esquela-Kerscher and Slack, 2006). Reduced expression of two miRNAs correlates strongly with chronic lymphocytic leukemia in humans, providing a possible link between miRNAs and cancer (Calin et al., 2002). Others have evaluated the expression patterns of large numbers of miRNAs in multiple human cancers and observed differential expression of almost all miRNAs across numerous cancer types (Lu et al., 2005).
Thus, compelling evidence indicates that miRNAs play key roles in the development of many diseases and cancers, including thyroid cancer. Thyroid cancer is the fastest growing cancer diagnosis in the US with a total of 44,670 new cases and 1,690 deaths expected in 2010. From nuclear disasters such as Chemobyl in 1986, it is clear that radiation exposure is a significant risk factor for thyroid cancer but, the majority of thyroid cancers appear to be sporadic in nature. Thyroid cancers encompass a variety of lesions that range from benign adenoma to malignant tumors. They also span the spectrum from, well-differentiated, poorly differentiated or undifferentiated (anaplastic). More than 95% of thyroid cancers are derived from thyroid follicular cells, while 2-3% of thyroid tumors (medullary thyroid cancers) are derived from the calcitonin producing C-cells.
A number of genetic alterations have been shown to be involved in the development of follicular cell-derived cancers. These point mutations and translocations occur in genes for several important signaling pathways, in particular the mitogen-activated protein kinase (MAPK) pathway, and are required for transformation of well-differentiated follicular cell-derived thyroid cancers, i.e., papillary thyroid cancer (PTC) and follicular thyroid cancer (FTC).
Asuragen miRInform Thyroid Panel Assay tests for the most common gene alterations identified in follicular cell-derived thyroid carcinomas. The panel consists of 7 genes that are either mutated (BRAF, NRAS, HRAS, KRAS) or rearranged by translocation (RET/PTC1, RET/PTC3, and PAX8/PPARγ). However, approximately 30% of thyroid cancers do not carry any known mutation and therefore cannot be identified using a test based on these markers only.
Methods and compositions that improve the ability to distinguish between benign thyroid conditions from malignant thyroid tumors, including mutation- and translocation-negative tumors.
Disclosed herein are methods and compositions for evaluating a thyroid sample from a patient to provide a clinician with information that is useful for determining diagnosis and/or treatment options. Methods involve obtaining information about the levels of expression of certain microRNAs or miRNAs whose expression levels differ between malignant and benign cells and tissue in the thyroid. In some embodiments, differences in miRNA expression between thyroid malignancies and benign thyroid cells and tissue are highlighted when expression level differences are first compared among two or more miRNAs and those differential values are compared to or contrasted with the differential values of malignant thyroid growths from non-malignant or benign thyroid growths. Such growths refer to abnormal growth of the thyroid and they include but are not limited to what are referred to as tumors, cysts, and nodules.
Embodiments concern methods and compositions that can be used for evaluating a thyroid sample; categorizing a thyroid sample; evaluating a thyroid sample that has been tested for a mutation associated with malignancy; distinguishing a thyroid sample or growth; identifying a thyroid sample or growth as malignant; identifying a thyroid sample as benign; characterizing a BRAF V600E-negative thyroid sample or growth; characterizing a BRAF V600E-positive thyroid sample or growth; characterizing a thyroid sample or growth whose BRAF V600E status is not yet known; characterizing a thyroid sample or growth suspected of being malignant; characterizing a thyroid sample or growth that has already undergone a cytology or histology evaluation; testing a thyroid sample or growth known to be negative for mutations associated with thyroid malignancies; identifying thyroid tissue as a target for thyroidectomy; determining whether a thyroid sample or growth is malignant; determining the thyroid should not be subject to a thyroidectomy; categorizing a thyroid tumor, cyst, or nodule; diagnosing a thyroid tumor, growth, or cyst; evaluating a recurrent thyroid tumor, growth or cyst; providing a prognosis to a patient regarding a thyroid tumor, growth or cyst; evaluating treatment options for a thyroid sample suspected of being malignant; monitoring a patient's thyroid; or, treating a patient with malignant thyroid cancer. These methods can be implemented involving steps and compositions described below in different embodiments.
In some embodiments methods for evaluating a thyroid sample are provided in which methods involve measuring the level of expression of one or more miRNAs or the miRNA precursors, or one or more targets of the miRNA. In some embodiments, the thyroid sample has not been evaluated for any mutations associated with malignancy. In other embodiments, the thyroid sample has been tested for a BRAF V600E mutation. In certain cases, the thyroid sample from a patient has been determined to be negative for a BRAF V600E mutation. In further embodiments, the thyroid sample has been alternatively or additionally tested for a point mutation in one or more of the following: N-Ras, H-Ras or K-Ras and/or for the following genetic alterations: RET/PTC 1 (translocation), RET/PTC3 (translocation), and/or PAX8-PPARg Fusion Protein (PPFP) (translocation); these will be collectively referred to as mutations. In certain embodiments, a thyroid sample has been determined to be negative for a BRAF V600E mutation. In other embodiments, the thyroid sample has been determined to be negative for the following mutations: N-Ras, H-Ras, K-Ras, RET/PTC 1 (translocation), RET/PTC3 (translocation), PAX8-PPARg Fusion Protein (PPFP) (translocation). In further embodiments, however, a thyroid sample has been determined to be positive for a mutation related to thyroid malignancy, such as N-Ras, H-Ras, K-Ras, RET/PTC 1 (translocation), RET/PTC3 (translocation), PAX8-PPARg Fusion Protein (PPFP) (translocation). In some embodiments, a patient may be tested for a BRAF V600E mutation by having a thyroid sample assayed for miR-146b expression (Chou et al. 2010, which is hereby incorporated by reference).
In some embodiments, methods and steps discussed below are implemented on a thyroid sample that has been determined to be negative for a BRAF V600E mutation, to be positive for a mutation in N-Ras, H-Ras, K-Ras, RET/PTC 1 (translocation), RET/PTC3 (translocation), and/or PAX8-PPARg Fusion Protein (PPFP) (translocation), or to be negative for a mutation in BRAF V600E, N-Ras, H-Ras, K-Ras, RET/PTC 1 (translocation), RET/PTC3 (translocation), and PAX8-PPARg Fusion Protein (PPFP) (translocation), or to have an unknown mutation status related to BRAF V600E, N-Ras, H-Ras, K-Ras, RET/PTC 1 (translocation), RET/PTC3 (translocation), and PAX8-PPARg Fusion Protein (PPFP) (translocation). In some embodiments, methods include determining whether a thyroid sample has a BRAF V600E mutation. In even further embodiments, methods include assaying the sample for a BRAF V600E mutation. This assay may be performed before, after, or at the same time that the expression level of one or more miRNAs or other biomarkers is measured. Also, this assay may be performed before, after, or at the same time that a sample is assayed for a mutation in N-Ras, H-Ras, and/or K-Ras and/or for RET/PTC 1 (translocation), RET/PTC3 (translocation), and/or PAX8-PPARg Fusion Protein (PPFP) (translocation).
Methods for evaluating a thyroid sample generally concern determining expression levels of one or more miRNAs because patterns of expression levels differ between malignant thyroid samples and benign thyroid samples. In some embodiments, methods are particularly valuable because they are able to distinguish malignant thyroid samples from benign thyroid samples when those samples were previously undistinguishable either from their pathology and/or from other genetic analysis. The sample may or may not have been evaluated by a cytology or histology to determine whether there were malignant cells present.
In some embodiments there are methods for evaluating a thyroid sample from a patient determined to be BRAF V600E-negative comprising a) measuring the level of expression of at least one biomarker miRNAs, wherein the biomarker is indicative of a thyroid malignancy; b) comparing the expression level of the at least one biomarker miRNA to the expression level of a comparative marker from the sample; and, c) evaluating the thyroid sample by calculating a score based on the compared expression levels, wherein the score indicates probability that the thyroid sample is benign or malignant.
Additional embodiments concern methods for determining whether a thyroid sample from a patient is malignant comprising: a) measuring expression levels of at least two differentially expressed diff pair miRNAs in the sample, wherein the expression of one miRNA is increased in a malignant thyroid sample relative to its level in a benign thyroid sample or other control and the expression of a second miRNA is decreased in malignant thyroid sample relative toits level in a benign thyroid sample or other control; b) calculating a diff pair value based on the expression levels of the increased miRNA and the decreased miRNA; and, c) evaluating the thyroid sample by calculating a score based on the compared expression levels, wherein the score indicates probability that the thyroid sample is benign or malignant.
In some embodiments there are methods for evaluating a thyroid sample from a patient comprising: a) measuring the level of expression of miR-375 or its precursor in the sample; b) comparing the expression level of miR-375 or its precursor in the sample to the level of expression of a first comparative miRNA; and, c) evaluating the sample based on the expression levels of miR-375 or its precursor and the comparative miRNA, wherein the evaluation assesses whether the sample comprises a malignant thyroid tumor. In some embodiments, methods further comprise measuring the level of expression of miR-34a-5p in the sample and comparing the miR-34a-5p expression level to the level of expression of the first or a second comparative miRNA. In additional embodiments, the first comparative miRNA is miR-7-5p. In particular embodiments, the sample's BRAF V600E mutation status has already been determined and known; in other embodiments, methods include determining whether the sample has a BRAF V600E mutation. In certain other embodiments, methods include assaying the sample for a BRAF V600E mutation.
Methods may involve measuring the level of expression of a biomarker miRNA. The term “biomarker miRNA” refers to a miRNA whose expression level is indicative of a particular disease or condition. In some embodiments the biomarker miRNA is one whose expression level is detectably increased in one population relative to another population, such as in malignancies compared to a benign thyroid growth. In other embodiments, a biomarker miRNA is one whose expression level is detectably decreased in one population relative to another population. A biomarker miRNA may be a diff pair miRNA in certain embodiments. As part of a diff pair, the level of expression of a biomarker miRNA may highlight or emphasize differences in miRNA expression between different populations, such as a thyroid malignancy and a thyroid growth that is benign. In some embodiments, when miRNA expression is different in a particular population relative to another population, differences between miRNA expression levels can be increased, highlighted, emphasized, or otherwise more readily observed in the context of a diff pair. In some embodiments a biomarker miRNA is miR-10b, miR-15a, miR-21, miR-24-1-star, miR-24-2-star, miR-27a, miR-27b-star, miR-31, miR-31-star, miR-34a, miR-34a-star, miR-34b-star, miR-127-3p, miR-132, miR-132-star, miR-138-1-star, miR-140-5p, miR-142-5p, miR-146a, miR-146b-3p, miR-146b-5p, miR-155, miR-181a-star, miR-181c, miR-182, miR-196a, miR-205, miR-221, miR-221-star, miR-222, miR-222-star, miR-224, miR-301a, miR-329, miR-335, miR-340, miR-346, miR-369-5p, miR-375, miR-376c, miR-409-3p, miR-425, miR-493-star, miR-495, miR-539, miR-551b, miR-1827, miR-1910, miR-3065-5p, miR-3200-3p, and/or miR-4288. In further embodiments, a biomarker miRNA is measuring the level of expression of at least two of the following biomarker miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-3p, miR-3065-5p, miR-34b-5p, miR-31-3p, miR-146a, miR-181c, and/or miR-182-5p. In certain embodiments, the first 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25 or more (or any range derivable therein) or more of the miRNAs listed in the previous paragraph may be used in certain embodiments. In some embodiments a biomarker miRNA is one or more of the following: miR-10b, miR-15a, miR-21, miR-24-1-5p, miR-24-2-5p, miR-27a, miR-27b-5p, miR-31-5p, miR-31-3p, miR-34a-5p, miR-34a-3p, miR-34b-5p, miR-127-3p, miR-132-3p, miR-132-5p, miR-138-1-3p, miR-140-5p, miR-142-5p, miR-146a-5p, miR-146b-3p, miR-146b-5p, miR-155-5p, miR-181a-2-3p, miR-181c-5p, miR-182-5p, miR-196a-5p, miR-205-5p, miR-221-3p, miR-221-5p, miR-222-3p, miR-222-5p, miR-224-5p, miR-301a-3p, miR-329, miR-335-5p, miR-340-5p, miR-346, miR-369-5p, miR-375, miR-376c-3p, miR-409-3p, miR-425, miR-493-5p, miR-495-3p, miR-539-5p, miR-551b-3p, miR-1827, miR-1910, miR-3065-5p, miR-3200-3p, and/or miR-4288.
It is contemplated that in some embodiments 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 biomarker miRNAs (or any range derivable therein) may be measured. In some embodiments, a biomarker miRNA is an miRNA listed in Table 1, 4D, or 5B. In other embodiments at least or at most 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 biomarker miRNAs are measured. In certain embodiments, any miRNA shown in Table 4A or 4B may be a biomarker or comparative miRNA. In further embodiments it is contemplated that a method or other embodiments may include at least or at most embodiments 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 (or any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-3p, miR-3065-5p, miR-34b-5p, miR-31-3p, miR-146a, miR-181c, and/or miR-182-5p. In specific embodiments, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or all 12 of the following miRNAs are included: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, and miR-15a-5p. In further embodiments, 1, 2, 3, 4, 5, 6, 7, 8, or all 9 (any range derivable therein) of the following miRNAs are included additional or alternatively to the previous set: miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p. In additional embodiments, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12 (or any range derivable therein) of the following miRNAs may be included: miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-3p, miR-3065-5p, miR-34b-5p, miR-31-3p, miR-146a, miR-181c, and miR-182-5p. In some cases, miR-224-5p and/or miR-222-5p may be included.
Measuring a microRNA or miRNA refers to measuring the amount of a mature microRNA or miRNA, though it is contemplated that a mature miRNA may be indirectly determined by measuring the level of an immature or unprocessed form of the miRNA, such as the double-stranded RNA molecule or RNA hairpin structure. Moreover, in some embodiments, the amount of a mature miRNA is determined by measuring the amount of one or more of the miRNA's target or the targets complement. An miRNA's target refers to the endogenous RNA in the thyroid cell that is the target for the miRNA and whose expression is affected by the miRNA. Consequently, any embodiments discussed herein in the context of determining the amount of a microRNA (i.e., the mature form of a microRNA) can be implemented instead by measuring a precursor of the miRNA or one or more of the miRNA's target (or the complement thereof). Unless qualified, the term “measuring” refers to directly measuring. Mature miRNAs may be indirectly determined by directly measuring precursor microRNA molecules. It will be understood that the term “star” in the context of a miR refers to an asterisk (*); for example, miR-222-star is the same as miR-222*.
Measuring or assaying for expression levels of a microRNA can be accomplished by a variety of different chemical and/or enzymatic reactions that are well known to those of skill in the art. In certain embodiments, methods may involve amplification and/or hybridization. It is contemplated that the level of a mature microRNA (miRNA) may be indirectly determined by measuring the level of the immature or unprocessed microRNA. Whether the mature or immature form of a microRNA is measured depends on the detection method, such as which primer or probe is used in the method. A person of ordinary skill in the art knows how this would be implemented.
Some methods also involve comparing the expression level of the at least one biomarker miRNA to the expression level of a comparative marker from the sample. In other embodiments, methods involve comparing the expression level of at least one biomarker miRNA to the expression level of that biomarker miRNA in a standardized sample, such as a sample known to be thyroid malignancy or known to be a benign thyroid condition. An increase or decrease in the level of expression will be evaluated. In some embodiments, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 comparative markers (or any range derivable therein) may be used in comparisons or compared to the expression level of a biomarker or of a biomarker miRNA. In other embodiments at least or at most 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 comparative markers are measured. In particular embodiments, at least or at most 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 comparative markers are compared to one or more biomarker miRNAs. A “comparative marker” refers to a gene product (such as a protein, RNA transcript, miRNA, or unprocessed miRNA) whose expression level is used to evaluate the level of an miRNA in the sample; in some embodiments, the expression level of a comparative miRNA is used to evaluate a biomarker miRNA expression level. In specific embodiments, the comparative marker is an miRNA. Moreover, in certain embodiments, a comparative marker may also be a biomarker miRNA. In particular cases, a comparative marker is one whose expression level appears to change in the opposite direction as a biomarker miRNA against which it is compared. For example, a biomarker may have an increased expression level in a malignant thyroid sample as compared to a benign thyroid sample. The comparative marker that is used to compare against that biomarker may have a decreased expression level in a malignant thyroid sample as compared to a benign thyroid sample.
In some embodiments, the comparative marker is miR-7, miR-139-5p, miR-185-star, miR-486-5p, miR-519a, miR-1910, or miR-3123. In additional embodiments, a comparative miRNA is an miRNA listed in Table 1, 4D, or 5B. In further embodiments, a comparative marker is any of miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-3p, miR-3065-5p, miR-34b-5p, miR-31-3p, miR-146a, miR-181c, and/or miR-182-5p.
Some methods involve measuring expression levels of at least two differentially expressed diff pair miRNAs in the sample, wherein the expression of one miRNA is increased in a malignant thyroid sample relative to a benign thyroid sample and the expression of a second miRNA is decreased in malignant thyroid sample relative to a benign thyroid sample. In some embodiments, the increased miRNA is miR-10b, miR-15a, miR-21, miR-24-1-star, miR-24-2-star, miR-27a, miR-27b-star, miR-31, miR-31-star, miR-34a, miR-34a-star, miR-34b-star, miR-127-3p, miR-132, miR-132-star, miR-138-1-star, miR-140-5p, miR-142-5p, miR-146a, miR-146b-3p, miR-146b-5p, miR-155, miR-181a-star, miR-181c, miR-182, miR-196a, miR-205, miR-221, miR-221-star, miR-222, miR-222-star, miR-224, miR-301a, miR-329, miR-335, miR-340, miR-346, miR-369-5p, miR-375, miR-376c, miR-409-3p, miR-425, miR-493-star, miR-495, miR-539, miR-551b, miR-1827, miR-1910, miR-3065-5p, miR-3200-3p, or miR-4288. It is contemplated that 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 of these increased miRNAs (or any range derivable therein) may be measured. In some embodiments, an increased miRNA is an miRNA listed in Tables 1, 4D, or 5B that shows a relative increase in expression. The term “diff pair miRNA” refers to a miRNA that is one member of a pair of miRNAs where the expression level of one miRNA of the diff pair in a sample is compared to the expression level of the other miRNA of the diff pair in the same sample. The expression levels of two diff pair miRNAs may be evaluated with respect to each other, i.e., compared, which includes but is not limited to subtracting, dividing, multiplying or adding values representing the expression levels of the two diff pair miRNAs. A “diff pair” refers to the pair of diff pair miRNAs (or other marker). The diff pair is identified by an entity in front of a slash (/) and an entity following the slash. In embodiments described herein, the biomarker miRNA is understood as the entity in front of the slash and the comparative marker follows the slash. Moreover, it is readily apparent that the miRNA used as a biomarker and the miRNA used as the comparative miRNA may be switched, and that any calculated value can be evaluated accordingly by a person of ordinary skill in the art. However, a person of ordinary skill in the art understands that different pair analysis may be adjusted, particular with respect to altering the comparative miRNA in a pair without affecting the concept of the embodiments discussed herein.
In some embodiments it is specifically contemplated that the expression level of miR-467b in the sample is not measured. In other embodiments, the expression level of miR-221 and/or miR-222 in the sample is not measured. In further embodiments, they may be measured, but not to determine whether a sample is benign or malignant but for subclassifying a sample. In some embodiments, methods involve determining at least one diff pair value based on the expression levels of a diff pair, wherein the biomarker miRNA and a comparative marker are the diff pair. A differential pair value or diff pair value between the biomarker miRNA and the comparative marker can be calculated or determined or evaluated; this value is a number that is referred to as a “diff pair value” when it is based on the expression level of two miRNAs (or their precursors or targets). A diff pair value can be calculated, determined or evaluated using one or more mathematical formulas or algorithms. In some embodiments, the value is calculated, determined or evaluated using computer software. In particular embodiments, the diff pair is miR-375/miR-7. In other embodiments, the diff pair is miR-34a/miR-7 (miR-7 is synonymous with miR-7-5p). In additional embodiments a diff pair used in embodiments is 1, 2, 3, 4, 5, 6, 7, 8, or all 9 (and any range derivable therein) of the following diff pairs: miR-375/miR-7-5p, miR-375/miR-204-5p, miR-375/miR-141-3p, miR-146b-5p/miR-7-5p, miR-146b-5p/miR-204-5p, miR-146b-5p/miR-141-3p, miR-146b-3p/miR-204-5p, miR-146b-3p/mir-7, and/or miR-146b-3p/miR-141-3p.
In some embodiments, a weighted coefficient may be applied to a diff pair value. In certain embodiments, a diff pair value for miR-375/miR-7-5p is weighted more heavily than a diff pair value for miR-34a/miR-7-5p. In other embodiments, a diff pair value for miR-34a/miR-7 is weighted more heavily than the diff pair value for miR-375/miR-7-5p. In some embodiments, the diff pair value for miR-375/miR-7-5p is weighted more heavily than a diff pair value for 1, 2, 3, 4, 5, 6, 7, or all 8 of miR-375/miR-204-5p, miR-375/miR-141-3p, miR-146b-5p/miR-7-5p, miR-146b-5p/miR-204-5p, miR-146b-5p/miR-141-3p, miR-146b-3p/miR-204-5p, miR-146b-3p/miR-7-5p, and/or miR-146b-3p/miR-141-3p, miR-375/miR-7-5p, miR-375/miR-204-5p, miR-375/miR-141-3p, miR-146b-5p/miR-7-5p, miR-146b-5p/miR-204-5p, miR-146b-5p/miR-141-3p, miR-146b-3p/miR-204-5p, miR-146b-3p/miR-7-5p, and/or miR-146b-3p/miR-141-3p. In some embodiments, the diff pair value for miR-375/miR-204-5p is weighted more heavily than a diff pair value for 1, 2, 3, 4, 5, 6, 7, or all 8 of miR-375/miR-7-5p, miR-375/miR-141-3p, miR-146b-5p/miR-7-5p, miR-146b-5p/miR-204-5p, miR-146b-5p/miR-141-3p, miR-146b-3p/miR-204-5p, miR-146b-3p/miR-7-5p, and/or miR-146b-3p/miR-141-3p. In some embodiments, the diff pair value for miR-375/miR-141-3p is weighted more heavily than a diff pair value for 1, 2, 3, 4, 5, 6, 7, or all 8 of miR-375/miR-7-5p, miR-375/miR-204-5p, miR-146b-5p/miR-7-5p, miR-146b-5p/miR-204-5p, miR-146b-5p/miR-141-3p, miR-146b-3p/miR-204-5p, miR-146b-3p/miR-7-5p, and/or miR-146b-3p/miR-141-3p. In some embodiments, the diff pair value for miR-146b-5p/miR-7-5p is weighted more heavily than a diff pair value for 1, 2, 3, 4, 5, 6, 7, or all 8 of miR-375/miR-7-5p, miR-375/miR-204-5p, miR-375/miR-141-3p, miR-146b-5p/miR-204-5p, miR-146b-5p/miR-141-3p, miR-146b-3p/miR-204-5p, miR-146b-3p/miR-7-5p, and/or miR-146b-3p/miR-141-3p. In some embodiments, the diff pair value for miR-miR-146b-5p/miR-204-5p is weighted more heavily than a diff pair value for 1, 2, 3, 4, 5, 6, 7, or all 8 of miR-375/miR-7-5p, miR-375/miR-204-5p, miR-375/miR-141-3p, miR-146b-5p/miR-7-5p, miR-146b-5p/miR-141-3p, miR-146b-3p/miR-204-5p, miR-146b-3p/miR-7-5p, and/or miR-146b-3p/miR-141-3p, miR-375/miR-7-5p, miR-375/miR-204-5p, miR-375/miR-141-3p, miR-146b-5p/miR-7-5p, miR-146b-5p/miR-204-5p, miR-146b-5p/miR-141-3p, miR-146b-3p/miR-204-5p, miR-146b-3p/miR-7-5p, and/or miR-146b-3p/miR-141-3p. In some embodiments, the diff pair value for miR-146b-3p/miR-7-5p is weighted more heavily than a diff pair value for 1, 2, 3, 4, 5, 6, 7, or all 8 of miR-375/miR-7-5p, miR-375/miR-204-5p, miR-375/miR-141-3p, miR-146b-5p/miR-7-5p, miR-146b-5p/miR-204-5p, miR-146b-5p/miR-141-3p, miR-146b-3p/miR-204-5p, and/or miR-146b-3p/miR-141-3p. In some embodiments, the diff pair value for miR-146b-3p/miR-141-3p is weighted more heavily than a diff pair value for 1, 2, 3, 4, 5, 6, 7, or all 8 of miR-375/miR-7-5p, miR-375/miR-204-5p, miR-375/miR-141-3p, miR-146b-5p/miR-7-5p, miR-146b-5p/miR-204-5p, miR-146b-5p/miR-141-3p, miR-146b-3p/miR-204-5p, and/or miR-146b-3p/miR-7-5p.
Methods also include evaluating the thyroid sample by calculating a score based on the compared expression levels or diff pair values, wherein the score indicates probability that the thyroid sample is benign or malignant.
In certain embodiments, methods may involve classifying a sample as a hyperplastic nodule, follicular adenoma, oncocytic follicular adenoma, multi nodular goiter or Hashimoto's thyroiditis. In some cases, the sample is classified as a papillary carcinoma, follicular variant of papillary carcinoma, or a follicular carcinoma. The sample may be generally categorized as benign or malignant and subcategorized as one of the classifications discussed above. In certain embodiments, the categorization as benign or malignant occurs prior to subcategorization as one of the classifications, however, the testing for these may be done at the same (parallel) or different (serial) times. In some embodiments, miR-224-5p and/or miR-155-5p may be used for identifying Hashimoto's thyroiditis.
In other embodiments, a diff pair need not be evaluated and instead, a coefficient value is applied to each miRNA expression level. The coefficient value reflects the weight that the expression level of that particular miRNA has in assessing the chances that a particular pancreatic cyst is a high risk lesion or a low risk lesion. In some embodiments, instead of a diff pair value, a coefficient value is used for the measured expression level of miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and/or miR-182. In certain embodiments, the coefficient values for a plurality of miRNAs whose expression levels are measured add up to zero (0). The plurality may be, be at least, or be at most 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26 of these miRNAs, as well as any miRNAs discussed herein. Methods and computer readable medium can be implemented with coefficient values instead of or in addition to diff pair values. In some cases, a coefficient may be multiplied against a diff pair value to reflect the weight of that diff pair in any analysis or diagnostic score calculation.
In some embodiments, the coefficient for miR-375 is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-146b-3p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-146b-5p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-221-3p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-222-3p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-551b-3p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-204-5p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-7-5p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-141-3p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-31-5p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-221-3p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-222-5p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-3p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-21-5p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-424-5p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-34a-5p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-197-3p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-19b-3p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-155-5p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and/or miR-182. In other embodiments, the coefficient for miR-138-1-3p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-139-5p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-151a-3p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-29b-3p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-96-5p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-34a-star, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-34a-star is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-3065-5p, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-3065-5p is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-34b-star, and miR-182. In other embodiments, the coefficient for miR-34b-star is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, and miR-182. In other embodiments, the coefficient for miR-182 is greater than or less than the coefficient for 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, or all 25 (any range derivable therein) of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-star, miR-3065-5p, and miR-34b-star.
A difference between or among weighted coefficients may be, be at least or be at most 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2.0, 2.5, 3.0, 3.5, 4.0, 4.5, 5.0, 5.5, 6.0, 6.5, 7.0, 7.5, 8.0, 8.5, 9.0, 9.5, 10.0, 10.5, 11.0, 11.5, 12.0, 12.5, 13.0, 13.5, 14.0, 14.5, 15.0, 15.5, 16.0, 16.5, 17.0, 17.5, 18.0, 18.5, 19.0. 19.5, 20.0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 105, 110, 115, 120, 125, 130, 135, 140, 145, 150, 155, 160, 165, 170, 175, 180, 185, 190, 195, 200, 205, 210, 215, 220, 225, 230, 235, 240, 245, 250, 255, 260, 265, 270, 275, 280, 285, 290, 295, 300, 305, 310, 315, 320, 325, 330, 335, 340, 345, 350, 355, 360, 365, 370, 375, 380, 385, 390, 395, 400, 410, 420, 425, 430, 440, 441, 450, 460, 470, 475, 480, 490, 500, 510, 520, 525, 530, 540, 550, 560, 570, 575, 580, 590, 600, 610, 620, 625, 630, 640, 650, 660, 670, 675, 680, 690, 700, 710, 720, 725, 730, 740, 750, 760, 770, 775, 780, 790, 800, 810, 820, 825, 830, 840, 850, 860, 870, 875, 880, 890, 900, 910, 920, 925, 930, 940, 950, 960, 970, 975, 980, 990, 1000 times or -fold (or any range derivable therein). Moreover, a person of ordinary skill in the art would understand that the coefficients could be negative or positive (in order to make their sum be zero, for example), but that the coefficient still reflected a difference in weight that is shown above.
In some embodiments, the accuracy of methods to determine whether a thyroid sample is benign or malignant or provide a diagnostic score is, is at least, or is at most 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 96, 97, 98, 99 or 100% (and any range derivable therein). In other embodiments, the accuracy of categorizing the sample into a subtype of benign or malignant conditions is, is at least, or is at most 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 96, 97, 98, 99 or 100% (and any range derivable therein).
Methods for treating a patient with thyroid cancer are also provided. In some embodiments such methods comprise: a) obtaining a diagnostic score based on expression levels of miRNAs in a thyroid sample from the patient, wherein the expression levels differ between malignant thyroid cancer cells compared to benign thyroid cells and wherein the thyroid sample has been determined to be negative for a BRAF V600E mutation; and, b) performing a thyroidectomy on a patient determined to have a diagnostic score indicative of malignant thyroid cancer.
In some embodiments, a patient is also administered radioactive iodine, radiation and/or chemotherapy as part of a treatment regimen. In further embodiments, methods may involve determining the patient as having or likely having a benign thyroid condition, or determining the patient as not having or likely not having a malignancy. In such cases, a clinician may then decide not to subject the patient to surgery. In such cases, the patient may continue to be monitored. In certain embodiments, methods involve imaging an unresected thyroid growth or doing a biopsy before and/or after miRNA levels are measured. In further embodiments, the imaging or biopsy occurs after 1, 2, 3, 4, 5, 6 months following a test that involves measuring one or more miRNA expression levels.
In some embodiments, methods will involve determining or calculating a diagnostic score based on data concerning the expression level of one or more miRNAs, meaning that the expression level of the one or more miRNAs is at least one of the factors on which the score is based. A diagnostic score will provide information about the biological sample, such as the general probability that the thyroid sample is malignant and/or an aggressive tumor or that the thyroid sample is benign. In some embodiments, the diagnostic score represents the probability that the thyroid sample is more likely than not either malignant or benign. In certain embodiments, a probability value is expressed as a numerical integer or number that represents a probability of 0% likelihood to 100% likelihood that a patient has a thyroid malignancy or a benign thyroid condition. In some embodiments, the probability value is expressed as a numerical integer or number that represents a probability of 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% likelihood (or any range derivable therein) that a patient has a particular type of thyroid condition or growth. Alternatively, the probability may be expressed generally in percentiles, quartiles, or deciles.
In some embodiments, methods include evaluating one or more differential pair values using a scoring algorithm to generate a diagnostic score for the thyroid growth in terms of being malignant or being benign, wherein the patient is identified as having or as not having such a growth based on the score. It is understood by those of skill in the art that the score is a predictive value about the classification of the thyroid growth. In some embodiments, a report is generated and/or provided that identifies the diagnostic score or the values that factor into such a score. In some embodiments, a cut-off score is employed to characterize a sample as likely having a thyroid malignancy (or alternatively a benign thyroid condition). In some embodiments, the risk score for the patient is compared to a cut-off score to characterize the biological sample from the patient with respect to a malignancy or a benign condition. In certain embodiments, the diagnostic score is calculated using a weighted coefficient for each of the measured miRNA levels of expression. The weighted coefficients will typically reflect the significance of the expression level of a particular miRNA for determining risk of a certain disease state, such as malignancy or benign status, or any one of hyperplastic nodule, follicular adenoma, oncocytic follicular adenoma, multi nodular goiter, Hashimoto's thyroiditis, papillary carcinoma, follicular variant of papillary carcinoma, or a follicular carcinoma.
In some embodiments, determination of calculation of a diagnostic score is performed by applying classification algorithms based on the expression values of biomarkers with differential expression p values of about, between about, or at most about 0.005, 0.006, 0.007, 0.008, 0.009, 0.01, 0.011, 0.012, 0.013, 0.014, 0.015, 0.016, 0.017, 0.018, 0.019, 0.020, 0.021, 0.022, 0.023, 0.024, 0.025, 0.026, 0.027, 0.028, 0.029, 0.03, 0.031, 0.032, 0.033, 0.034, 0.035, 0.036, 0.037, 0.038, 0.039, 0.040, 0.041, 0.042, 0.043, 0.044, 0.045, 0.046, 0.047, 0.048, 0.049, 0.050, 0.051, 0.052, 0.053, 0.054, 0.055, 0.056, 0.057, 0.058, 0.059, 0.060, 0.061, 0.062, 0.063, 0.064, 0.065, 0.066, 0.067, 0.068, 0.069, 0.070, 0.071, 0.072, 0.073, 0.074, 0.075, 0.076, 0.077, 0.078, 0.079, 0.080, 0.081, 0.082, 0.083, 0.084, 0.085, 0.086, 0.087, 0.088, 0.089, 0.090, 0.091, 0.092, 0.093, 0.094, 0.095, 0.096, 0.097, 0.098, 0.099, 0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9 or higher (or any range derivable therein). In certain embodiments, the diagnostic score is calculated using one or more statistically significantly differentially expressed biomarkers (either individually or as diff pairs).
Methods may involve obtaining from the patient a thyroid sample, which means the sample is obtained directly from the patient. In other embodiments, a patient's thyroid tissue sample may be obtained from an entity that is not the patient, such as the doctor, clinician, hospital or laboratory. In certain embodiments, methods involve a thyroid tissue sample or a thyroid cyst or nodule sample. In particular embodiments, the sample is a tissue sample, while in other embodiments, the sample is a cystic fluid sample. In some cases, methods involve fixing the tissue sample in formalin and embedding it in paraffin prior to measuring the level of expression of one or more miRNAs or diff pair miRNAs in the sample. In additional embodiments, the sample is obtained by fine needle aspirate or FNA. In other embodiments, the sample is retrieved from a biopsy, such as a fine needle aspiration biopsy (FNAB) or a needle aspiration biopsy (NAB).
In some embodiments, a patient is determined to have a thyroid sample indicative of a malignancy or aggressive cancer. The term “indicative of a malignancy” means the data indicate that the patient likely has a malignancy, where “likely” means “greater than not.” In other embodiments, the patient is determined to have a thyroid sample indicative of a benign condition. The determination may or may not be based on a diagnostic score that is calculated based on one or more miRNA expression levels or diff pair values. In additional embodiments, methods involve determining a treatment for the patient based on one or more diff pair values. In some embodiments, methods include determining a treatment for the patient based on a calculated diagnostic score. In some embodiments, a patient may be suspected of having a thyroid malignancy. In other embodiments, the patient may have previously had a thyroid condition suspected of being malignant that was then subsequently treated. In other embodiments, the patient has recurring thyroid growth, which may or may not be malignant. In still further embodiments, the patient has a familial history of thyroid growths, particularly malignant thyroid growths. In some circumstances, a patient also presents with symptoms of a thyroid growth, such as fatigue, change in appetite and other symptoms relating to a change in thyroid hormone levels.
Some embodiments further involve isolating ribonucleic or RNA from a biological sample. Other steps may or may not include amplifying a nucleic acid in a sample and/or hybridizing one or more probes to an amplified or non-amplified nucleic acid. In certain embodiments, a microarray may be used to measure or assay the level of miRNA expression in a sample.
The term “miRNA” is used according to its ordinary and plain meaning and refers to a microRNA molecule found in eukaryotes that is involved in RNA-based gene regulation. See, e.g., Carrington et al., 2003, which is hereby incorporated by reference. The term will be used to refer to the single-stranded RNA molecule processed from a precursor. Individual miRNAs have been identified and sequenced in different organisms, and they have been given names. Names of miRNAs that are related to the disclosed methods and compositions, as well as their sequences, are provided herein. The name of the miRNAs that are used in methods and compositions refers to an miRNA that is at least 90% identical to the named miRNA based on its mature sequence listed herein and that is capable of being detected under the conditions described herein using the designated ABI part number for the probe. In most embodiments, the sequence provided herein is the sequence that is being measured in methods described herein. In some methods, a step may involve using a nucleic acid with the sequence comprising or consisting of any of the complements of any of SEQ ID NOs:1-65 or any sequence found herein to measure expression of a miRNA in the sample. Alternatively, probes directed to the immature form of these miRNAs may be used, as may be probes directed to the targets of the miRNAs. In some embodiments, a complement of SEQ ID NO:1 (5′-GGAGAAAUUAUCCUUGGUGUGU′3′) is used to measure expression of naturally occurring miR-539 in a sample. In some embodiments, a complement of SEQ ID NO:2 (5′-UGGAAGACUAGUGAUUUUGUUGU′3′) is used to measure expression of naturally occurring miR-7-5p in a sample. In some embodiments, a complement of SEQ ID NO:3 (5′-UGAGAACUGAAUUCCAUAGGCU′3′) is used to measure expression of naturally occurring miR-146b-5p in a sample. In some embodiments, a complement of SEQ ID NO:4 (5′-UGCCCUGUGGACUCAGUUCUGG′3′) is used to measure expression of naturally occurring miR-146b-3p in a sample. In some embodiments, a complement of SEQ ID NO:5 (5′-AACACACCUGGUUAACCUCUUU′3′) is used to measure expression of naturally occurring miR-329 in a sample. In some embodiments, a complement of SEQ ID NO:6 (5′-UCGGAUCCGUCUGAGCUUGGCU′3′) is used to measure expression of naturally occurring miR-127-3p in a sample. In some embodiments, a complement of SEQ ID NO:7 (5′-ACCUGGCAUACAAUGUAGAUUU′3′) is used to measure expression of naturally occurring miR-221* in a sample. In some embodiments, a complement of SEQ ID NO:8 (5′-CCAGUCCUGUGCCUGCCGCCU′3′) is used to measure expression of naturally occurring miR-1910 in a sample. In some embodiments, a complement of SEQ ID NO:9 (5′-AAACAAACAUGGUGCACUUCUU′3′) is used to measure expression of naturally occurring miR-495 in a sample. In some embodiments, a complement of SEQ ID NO:10 (5′-GAAUGUUGCUCGGUGAACCCCU′3′) is used to measure expression of naturally occurring miR-409-3p in a sample. In some embodiments, a complement of SEQ ID NO:11 (5′-AACAUAGAGGAAAUUCCACGU′3′) is used to measure expression of naturally occurring miR-376c in a sample. In some embodiments, a complement of SEQ ID NO:12 (5′-UCCUUCAUUCCACCGGAGUCUG′3′) is used to measure expression of naturally occurring miR-205 in a sample. In some embodiments, a complement of SEQ ID NO:13 (5′-UUUGUUCGUUCGGCUCGCGUGA′3′) is used to measure expression of naturally occurring miR-375 in a sample. In some embodiments, a complement of SEQ ID NO:14 (5′-CUCAGUAGCCAGUGUAGAUCCU′3′) is used to measure expression of naturally occurring miR-222* in a sample. In some embodiments, a complement of SEQ ID NO:15 (5′-AGAUCGACCGUGUUAUAUUCGC′3′) is used to measure expression of naturally occurring miR-369-5p in a sample. In some embodiments, a complement of SEQ ID NO:16 (5′-UGUCUGCCCGCAUGCCUGCCUCU′3′) is used to measure expression of naturally occurring miR-346 in a sample. In some embodiments, a complement of SEQ ID NO:17 (5′-AGCUACAUUGUCUGCUGGGUUUC′3′) is used to measure expression of naturally occurring miR-221 in a sample. In some embodiments, a complement of SEQ ID NO:18 (5′-AGGCAAGAUGCUGGCAUAGCU′3′) is used to measure expression of naturally occurring miR-31 in a sample. In some embodiments, a complement of SEQ ID NO:19 (5′-UUGUACAUGGUAGGCUUUCAUU′3′) is used to measure expression of naturally occurring miR-493* in a sample. In some embodiments, a complement of SEQ ID NO:21 (5′-CACCUUGCGCUACUCAGGUCUG′3′) is used to measure expression of naturally occurring miR-3200-3p in a sample. In some embodiments, a complement of SEQ ID NO:22 (5′-UAGCUUAUCAGACUGAUGUUGA′3′) is used to measure expression of naturally occurring miR-21 in a sample. In some embodiments, a complement of SEQ ID NO:23 (5′-UGGCAGUGUCUUAGCUGGUUGU′3′) is used to measure expression of naturally occurring miR-34a in a sample. In some embodiments, a complement of SEQ ID NO:24 (5′-GCUACUUCACAACACCAGGGCC′3′) is used to measure expression of naturally occurring miR-138-1* in a sample. In some embodiments, a complement of SEQ ID NO:25 (5′-UCUACAGUGCACGUGUCUCCAG′3′) is used to measure expression of naturally occurring miR-139-5p in a sample. In some embodiments, a complement of SEQ ID NO:26 (5′-UAACACUGUCUGGUAAAGAUGG′3′) is used to measure expression of naturally occurring miR-141-3p in a sample. In some embodiments, a complement of SEQ ID NO:28 (5′-UUAAUGCUAAUCGUGAUAGGGGU′3′) is used to measure expression of naturally occurring miR-155 in a sample. In some embodiments, a complement of SEQ ID NO:30 (5′-UUCACCACCUUCUCCACCCAGC′3′) is used to measure expression of naturally occurring miR-197-3p in a sample. In some embodiments, a complement of SEQ ID NO:31 (5′-AGCUACAUCUGGCUACUGGGU′3′) is used to measure expression of naturally occurring miR-222 in a sample. In some embodiments, a complement of SEQ ID NO:32 (5′-CAAGUCACUAGUGGUUCCGUU′3′) is used to measure expression of naturally occurring miR-224 in a sample. In some embodiments, a complement of SEQ ID NO:33 (5′-AAUGACACGAUCACUCCCGUUGA′3′) is used to measure expression of naturally occurring miR-425 in a sample. In some embodiments, a complement of SEQ ID NO:34 (5′-UGAGAACUGAAUUCCAUGGGUU′3′) is used to measure expression of naturally occurring miR-146a in a sample. In some embodiments, a complement of SEQ ID NO:35 (5′-UACCCUGUAGAACCGAAUUUGUG′3′) is used to measure expression of naturally occurring miR-10b in a sample. In some embodiments, a complement of SEQ ID NO:36 (5′-UAGCAGCACAUAAUGGUUUGUG′3′) is used to measure expression of naturally occurring miR-15a in a sample. In some embodiments, a complement of SEQ ID NO:37 (5′-UGCCUACUGAGCUGAUAUCAGU′3′) is used to measure expression of naturally occurring miR-24-1* in a sample. In some embodiments, a complement of SEQ ID NO:38 (5′-UGCCUACUGAGCUGAAACACAG′3′) is used to measure expression of naturally occurring miR-24-2* in a sample. In some embodiments, a complement of SEQ ID NO:39 (5′-UUCACAGUGGCUAAGUUCCGC′3′) is used to measure expression of naturally occurring miR-27a in a sample. In some embodiments, a complement of SEQ ID NO:40 (5′-AGAGCUUAGCUGAUUGGUGAAC′3′) is used to measure expression of naturally occurring miR-27b* in a sample. In some embodiments, a complement of SEQ ID NO:41 (5′-UGCUAUGCCAACAUAUUGCCAU′3′) is used to measure expression of naturally occurring miR-31* in a sample. In some embodiments, a complement of SEQ ID NO:42 (5′-CAAUCAGCAAGUAUACUGCCCU′3′) is used to measure expression of naturally occurring miR-34a* in a sample. In some embodiments, a complement of SEQ ID NO:43 (5′-UAGGCAGUGUCAUUAGCUGAUUG′3′) is used to measure expression of naturally occurring miR-34b* in a sample. In some embodiments, a complement of SEQ ID NO:44 (5′-UAACAGUCUACAGCCAUGGUCG′3′) is used to measure expression of naturally occurring miR-132 in a sample. In some embodiments, a complement of SEQ ID NO:45 (5′-ACCGUGGCUUUCGAUUGUUACU′3′) is used to measure expression of naturally occurring miR-132* in a sample. In some embodiments, a complement of SEQ ID NO:46 (5′-CAUAAAGUAGAAAGCACUACU′3′) is used to measure expression of naturally occurring miR-140-5p in a sample. In some embodiments, a complement of SEQ ID NO:47 (5′-ACCAUCGACCGUUGAUUGUACC′3′) is used to measure expression of naturally occurring miR-181a* in a sample. In some embodiments, a complement of SEQ ID NO:48 (5′-AACAUUCAACCUGUCGGUGAGU′3′) is used to measure expression of naturally occurring miR-181c in a sample. In some embodiments, a complement of SEQ ID NO:49 (5′-UUUGGCAAUGGUAGAACUCACACU′3′) is used to measure expression of naturally occurring miR-182 in a sample. In some embodiments, a complement of SEQ ID NO:50 (5′-UAGGUAGUUUCAUGUUGUUGGG′3′) is used to measure expression of naturally occurring miR-196a in a sample. In some embodiments, a complement of SEQ ID NO:51 (5′-CAGUGCAAUAGUAUUGUCAAAGC′3′) is used to measure expression of naturally occurring miR-301a in a sample. In some embodiments, a complement of SEQ ID NO:52 (5′-UCAAGAGCAAUAACGAAAAAUGU′3′) is used to measure expression of naturally occurring miR-335 in a sample. In some embodiments, a complement of SEQ ID NO:53 (5′-UUAUAAAGCAAUGAGACUGAUU′3′) is used to measure expression of naturally occurring miR-340 in a sample. In some embodiments, a complement of SEQ ID NO:54 (5′-GCGACCCAUACUUGGUUUCAG′3′) is used to measure expression of naturally occurring miR-551b in a sample. In some embodiments, a complement of SEQ ID NO:55 (5′-UGAGGCAGUAGAUUGAAU′3′) is used to measure expression of naturally occurring miR-1827 in a sample. In some embodiments, a complement of SEQ ID NO:56 (5′-UCAACAAAAUCACUGAUGCUGGA′3′) is used to measure expression of naturally occurring miR-3065-5p in a sample. In some embodiments, a complement of SEQ ID NO:57 (5′-UUGUCUGCUGAGUUUCC′3′) is used to measure expression of naturally occurring miR-4288 in a sample. In some embodiments, a complement of SEQ ID NO:63 (5′CAUAAAGUAGAAAGCACUACU 3′) is used to measure expression of naturally occurring miR-142-5p in a sample. In some embodiments, a complement of SEQ ID NO:64 (5′-GUCCCUGUUCAGGCGCCA′3′) is used to measure expression of naturally occurring miR-1274a in a sample. In some embodiments, a complement of SEQ ID NO:90 (5′-UGUGCAAAUCCAUGCAAAACUGA′3′) is used to measure expression of naturally occurring miR-19b-3p in a sample. In some embodiments, a complement of SEQ ID NO:129 (5′-UUUGGCACUAGCACAUUUUUGCU′3′) is used to measure expression of naturally occurring miR-96-5p in a sample. In some embodiments, a complement of SEQ ID NO:139 (5′-UAGCACCAUUUGAAAUCAGUGUU′3′) is used to measure expression of naturally occurring miR-29b-3p in a sample. In some embodiments, a complement of SEQ ID NO:178 (5′-ACCACUGACCGUUGACUGUACC′3′) is used to measure expression of naturally occurring miR-181a-2-3p in a sample. In some embodiments, a complement of SEQ ID NO:192 (5′-UUCCCUUUGUCAUCCUAUGCCU′3′) is used to measure expression of naturally occurring miR-204-5p in a sample. In some embodiments, a complement of SEQ ID NO:388 (5′-CUAGACUGAAGCUCCUUGAGG′3′) is used to measure expression of naturally occurring miR-151a-3p in a sample. In some embodiments, a complement of SEQ ID NO:413 (5′-CAGCAGCAAUUCAUGUUUUGAA′3′) is used to measure expression of naturally occurring miR-424-5p in a sample. It is contemplated that a probe used in methods may be 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, or 100% complementary (and any range derivable therein) to any of SEQ ID NOs:1-64 or any sequence found herein. Such probes may be 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28. 29, or 30 residues in length (or any range derivable therein).
It is contemplated that the target of a miRNA may be measured as a surrogate of miRNA expression levels. For example, a target for miR-221 and miR-222 is CDKN1B (p27Kip1).
Methods may also involve one or more of the following steps: obtaining a thyroid sample of a patient; preparing the sample to characterize miRNA in the sample (for instance, for hybridization and/or amplification); storing a sample from a patient; assessing the integrity or adequacy of the sample, such as of the nucleic acids; doing a cytology analysis of the sample; staining all or part of the sample using tissue stains; fixing all or part of the sample; freezing all or part of the sample; transporting the sample; providing or being provided one or more images of the sample; visually accessing the sample directly or remotely, such as with telemedicine; measuring the level of expression of at least one biomarker miRNAs; comparing the level of expression of each biomarker miRNA to the level of expression of another biomarker miRNA; calculating a diagnostic score that indicates the probability the thyroid sample is benign or is malignant, wherein the diagnostic score is based on comparisons between the expression levels of the biomarker miRNAs to the expression level of at least one other biomarker miRNA; reporting the diagnostic score or the probability the thyroid sample is benign or malignant; storing information related to the expression levels of measured miRNA or related to comparisons between levels of miRNA or related to calculations from measured and/or compared expression levels or related to diagnostic scores; implementing an algorithm on a computer to calculate values that reflect comparisons between or among expression levels of biomarker miRNAs; implementing an algorithm on a computer to calculate diagnostic scores; diagnosing the patient as having or not having a thyroid malignancy; diagnosing the patient as having or not having a benign thyroid condition; categorizing a thyroid sample as a particular subtype or category of malignant thyroid conditions; categorizing a thyroid sample as a particular subtype or category of benign thyroid conditions; monitoring a patient's thyroid; and/or, treating the patient for a thyroid malignancy. It is contemplated that the score may indicate the probability that the sample is benign. In other embodiments, the score may indicate the probability that the sample is malignant. It will also be understood that a sample may be tested or evaluated more than one time either at the same time and/or at different times. In some cases, another test is run to obtain a second opinion on the same sample or a different sample from the patient.
Any of the methods described herein may be implemented on tangible computer-readable medium comprising computer-readable code that, when executed by a computer, causes the computer to perform one or more operations. In some embodiments, there is a tangible computer-readable medium comprising computer-readable code that, when executed by a computer, causes the computer to perform operations comprising: a) receiving information corresponding to a level of expression in a thyroid sample from a patient of at least two of the following diff pair miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-3p, miR-3065-5p, miR-34b-5p, or miR-182-5p, wherein at least one of the miRNAs is a biomarker miRNA; and b) determining a biomarker diff pair value using information corresponding to the at least one biomarker miRNA and information corresponding to the level of expression of a comparative microRNA, the diff pair value being indicative of whether the thyroid growth is malignant or benign. In some embodiments, receiving information comprises receiving from a tangible data storage device information corresponding to a level of expression in a thyroid sample from a patient of at least two of the following diff pair miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-3p, miR-3065-5p, miR-34b-5p, miR-31-3p, miR-146a, miR-181c, or miR-182-5p, wherein at least one of the miRNAs is a biomarker miRNA. In additional embodiments, information is used that corresponds to the level of expression of a comparative miRNA that is miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-3p, miR-3065-5p, miR-34b-5p, miR-31-3p, miR-146a, miR-181c, or miR-182-5p. In additional embodiments the medium further comprises computer-readable code that, when executed by a computer, causes the computer to perform one or more additional operations comprising: sending information corresponding to the biomarker diff pair value to a tangible data storage device. In specific embodiments, it further comprises computer-readable code that, when executed by a computer, causes the computer to perform one or more additional operations comprising: sending information corresponding to the biomarker diff pair value to a tangible data storage device. In certain embodiments, receiving information comprises receiving from a tangible data storage device information corresponding to a level of expression in a thyroid sample from a patient of at least two of the following diff pair miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-3p, miR-3065-5p, miR-34b-5p, miR-31-3p, miR-146a, miR-181c, or miR-182-5p, wherein at least one of the miRNAs is a biomarker miRNA. In even further embodiments, the tangible computer-readable medium has computer-readable code that, when executed by a computer, causes the computer to perform operations further comprising: c) calculating a diagnostic score for the thyroid sample, wherein the diagnostic score is indicative of the probability that the thyroid sample is malignant. It is contemplated that any of the methods described above may be implemented with tangible computer readable medium that has computer readable code, that when executed by a computer, causes the computer to perform operations related to the measuring, comparing, and/or calculating a diagnostic score related to the probability of a malignancy or a benign thyroid condition.
A processor or processors can be used in performance of the operations driven by the example tangible computer-readable media disclosed herein. Alternatively, the processor or processors can perform those operations under hardware control, or under a combination of hardware and software control. For example, the processor may be a processor specifically configured to carry out one or more those operations, such as an application specific integrated circuit (ASIC) or a field programmable gate array (FPGA). The use of a processor or processors allows for the processing of information (e.g., data) that is not possible without the aid of a processor or processors, or at least not at the speed achievable with a processor or processors. Some embodiments of the performance of such operations may be achieved within a certain amount of time, such as an amount of time less than what it would take to perform the operations without the use of a computer system, processor, or processors, including no more than one hour, no more than 30 minutes, no more than 15 minutes, no more than 10 minutes, no more than one minute, no more than one second, and no more than every time interval in seconds between one second and one hour.
Some embodiments of the present tangible computer-readable media may be, for example, a CD-ROM, a DVD-ROM, a flash drive, a hard drive, or any other physical storage device. Some embodiments of the present methods may include recording a tangible computer-readable medium with computer-readable code that, when executed by a computer, causes the computer to perform any of the operations discussed herein, including those associated with the present tangible computer-readable media. Recording the tangible computer-readable medium may include, for example, burning data onto a CD-ROM or a DVD-ROM, or otherwise populating a physical storage device with the data.
Also provided are kits containing the disclosed compositions or compositions used to implement the disclosed methods. In some embodiments, kits can be used to evaluate one or more miRNA molecules. In certain embodiments, a kit contains, contains at least, or contains at most 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 or more, or any range and combination derivable therein, miRNA probes including those that may specifically hybridize under stringent conditions to miRNAs disclosed herein. In other embodiments, kits or methods may involve 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, or 12 or more (or any range derivable therein) miRNA probes or corresponding primers, which may be capable of specifically detecting any of the following miRNAs: miR-375, miR-146b-3p, miR-146b-5p, miR-221-3p, miR-222-3p, miR-551b-3p, miR-204-5p, miR-7-5p, miR-141-3p, miR-31-5p, miR-181a-2-3p, miR-15a-5p, miR-224-5p, miR-221-5p, miR-222-5p, miR-21-5p, miR-424-5p, miR-34a-5p, miR-197-3p, miR-19b-3p, miR-155-5p, miR-138-1-3p, miR-139-5p, miR-151a-3p, miR-29b-3p, miR-96-5p, miR-34a-3p, miR-3065-5p, miR-34b-5p, miR-31-3p, miR-146a, miR-181c, or miR-182-5p.
The use of the word “a” or “an” when used in conjunction with the term “comprising” in the claims and/or the specification may mean “one,” but it is also consistent with the meaning of “one or more,” “at least one,” and “one or more than one.”
It is contemplated that any embodiment discussed herein can be implemented with respect to any disclosed method or composition, and vice versa. Any embodiment discussed with respect to a method can be implemented in the context of a kit or computer readable medium. Furthermore, the disclosed compositions and kits can be used to achieve the disclosed methods.
Throughout this application, the term “about” is used to indicate that a value includes the standard deviation of error for the device or method being employed to determine the value.
The use of the term “or” in the claims is used to mean “and/or” unless explicitly indicated to refer to alternatives only, or the alternatives are mutually exclusive.
As used in this specification and claim(s), the words “comprising” (and any form of comprising, such as “comprise” and “comprises”), “having” (and any form of having, such as “have” and “has”), “including” (and any form of including, such as “includes” and “include”) or “containing” (and any form of containing, such as “contains” and “contain”) are inclusive or open-ended and do not exclude additional, unrecited elements or method steps. However, for a claim using any of these terms, embodiments are also contemplated where the claim is closed and does exclude additional, unrecited elements or method steps.
Other objects, features and advantages of the invention will be apparent from the following detailed description. It should be understood, however, that the detailed description and the specific examples, while indicating specific embodiments, are given by way of illustration only, because various changes and modifications within the spirit and scope of the invention will be apparent to those skilled in the art from this detailed description.
The following drawings form part of the present specification and are included to further demonstrate certain aspects of the invention. The invention may be better understood by reference to one or more of these drawings in combination with the detailed description of specific embodiments presented herein.
Certain embodiments are directed to compositions and methods relating to preparation and characterization of miRNAs, as well as use of miRNAs for prognostic and diagnostic applications, particularly those methods and compositions related to assessing and/or identifying thyroid cancer.
Histological subtypes of thyroid cancers have distinct miRNA expression profiles, and recent studies have suggested that miRNAs may be used in preoperative FNA samples to distinguish between hyperplastic thyroid nodules and malignant thyroid tumors. The use of miRNAs, as described herein, will complement and improve the performance (sensitivity and NPV) of the mutation Panel assay and allow the accurate classification of malignant FNAs, including those that are mutation-negative.
MicroRNA molecules (“miRNAs”) are generally 21 to 22 nucleotides in length, though lengths of 19 and up to 23 nucleotides have been reported. The miRNAs are each processed from a longer precursor RNA molecule (“precursor miRNA”). Precursor miRNAs are transcribed from non-protein-encoding genes. The precursor miRNAs have two regions of complementarity that enable them to form a stem-loop- or fold-back-like structure, which is cleaved in animals by a ribonuclease III-like nuclease enzyme called Dicer. The processed miRNA is typically a portion of the stem.
The processed miRNA (also referred to as “mature miRNA”) becomes part of a large complex to down-regulate a particular target gene. Examples of animal miRNAs include those that imperfectly basepair with the target, which halts translation of the target (Olsen et al., 1999; Seggerson et al., 2002). siRNA molecules also are processed by Dicer, but from a long, double-stranded RNA molecule. siRNAs are not naturally found in animal cells, but they can direct the sequence-specific cleavage of an mRNA target through an RNA-induced silencing complex (RISC) (Denli et al., 2003). In some embodiments, the mature miRNAs may be any of the miRNAs listed in Table 1.
In the disclosed compositions and methods miRNAs can be labeled, used in array analysis, or employed in diagnostic, therapeutic, or prognostic applications, particularly those related to pathological conditions of the thyroid. The RNA may have been endogenously produced by a cell, or been synthesized or produced chemically or recombinantly. They may be isolated and/or purified. The term “miRNA,” unless otherwise indicated, refers to the single-stranded processed RNA, after it has been cleaved from its precursor. The name of the miRNA is often abbreviated and referred to without a hsa-, mmu-, or mo-prefix and will be understood as such, depending on the context. Unless otherwise indicated, miRNAs referred to are human sequences identified as miR-X or let-X, where X is a number and/or letter.
In certain experiments, a miRNA probe designated by a suffix “5P” or “3P” can be used. “5P” indicates that the mature miRNA derives from the 5′ end of the precursor and a corresponding “3P” indicates that it derives from the 3′ end of the precursor, as described on the World Wide Web at sanger.ac.uk. Moreover, in some embodiments, a miRNA probe is used that does not correspond to a known human miRNA. It is contemplated that these non-human miRNA probes may be used in embodiments or that there may exist a human miRNA that is homologous to the non-human miRNA. While the methods and compositions are not limited to human miRNA, in certain embodiments, miRNA from human cells or a human biological sample is used or evaluated. In other embodiments, any mammalian miRNA or cell, biological sample, or preparation thereof may be employed.
In some embodiments, methods and compositions involving miRNA may concern miRNA and/or other nucleic acids. Nucleic acids may be, be at least, or be at most 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 441, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750, 760, 770, 780, 790, 800, 810, 820, 830, 840, 850, 860, 870, 880, 890, 900, 910, 920, 930, 940, 950, 960, 970, 980, 990, or 1000 nucleotides, or any range derivable therein, in length. Such lengths cover the lengths of processed miRNA, miRNA probes, precursor miRNA, miRNA containing vectors, control nucleic acids, and other probes and primers. In many embodiments, miRNAs are 19-24 nucleotides in length, while miRNA probes are 19-35 nucleotides in length, depending on the length of the processed miRNA and any flanking regions added. miRNA precursors are generally between 62 and 110 nucleotides in humans.
Nucleic acids used in methods and compositions disclosed herein may have regions of identity or complementarity to another nucleic acid. It is contemplated that the region of complementarity or identity can be at least 5 contiguous residues, though it is specifically contemplated that the region is, is at least, or is at most 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, 240, 250, 260, 270, 280, 290, 300, 310, 320, 330, 340, 350, 360, 370, 380, 390, 400, 410, 420, 430, 440, 441, 450, 460, 470, 480, 490, 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 610, 620, 630, 640, 650, 660, 670, 680, 690, 700, 710, 720, 730, 740, 750, 760, 770, 780, 790, 800, 810, 820, 830, 840, 850, 860, 870, 880, 890, 900, 910, 920, 930, 940, 950, 960, 970, 980, 990, or 1000, or any range derivable therein, contiguous nucleotides from any nucleic acid discussed or provided herein, including any miR or precursor miR sequence. It is further understood that the length of complementarity within a precursor miRNA or between a miRNA probe and a miRNA or a miRNA gene are such lengths. Moreover, the complementarity may be expressed as a percentage, meaning that the complementarity between a probe and its target is 90% or greater over the length of the probe. In some embodiments, complementarity is or is at least 90%, 95% or 100%. In particular, such lengths may be applied to any nucleic acid comprising a nucleic acid sequence identified in any of the sequences disclosed herein. The commonly used name of the miRNA is given (with its identifying source in the prefix, for example, “hsa” for human sequences) and the processed miRNA sequence. Unless otherwise indicated, a miRNA without a prefix will be understood to refer to a human miRNA. A miRNA designated, for example, as miR-1-2 in the application will be understood to refer to hsa-miR-1-2. Moreover, a lowercase letter in the name of a miRNA may or may not be lowercase; for example, hsa-mir-130b can also be referred to as miR-130B. In addition, miRNA sequences with a “mu” or “mmu” sequence will be understood to refer to a mouse miRNA and miRNA sequences with a “mo” sequence will be understood to refer to a rat miRNA. The term “miRNA probe” refers to a nucleic acid probe that can identify a particular miRNA or structurally related miRNAs.
It is understood that a miRNA is derived from genomic sequences or a gene. In this respect, the term “gene” is used for simplicity to refer to the genomic sequence encoding the precursor miRNA for a given miRNA. However, embodiments may involve genomic sequences of a miRNA that are involved in its expression, such as a promoter or other regulatory sequences.
The term “recombinant” generally refers to a molecule that has been manipulated in vitro or that is a replicated or expressed product of such a molecule.
The term “nucleic acid” is well known in the art. A “nucleic acid” as used herein will generally refer to a molecule (one or more strands) of DNA, RNA or a derivative or analog thereof, comprising a nucleobase. A nucleobase includes, for example, a naturally occurring purine or pyrimidine base found in DNA (e.g., an adenine “A,” a guanine “G,” a thymine “T” or a cytosine “C”) or RNA (e.g., an A, a G, an uracil “U” or a C). The term “nucleic acid” encompasses the terms “oligonucleotide” and “polynucleotide,” each as a subgenus of the term “nucleic acid.”
The term “miRNA” generally refers to a single-stranded molecule, but in specific embodiments, molecules will also encompass a region or an additional strand that is partially (between 10 and 50% complementary across length of strand), substantially (greater than 50% but less than 100% complementary across length of strand) or fully complementary to another region of the same single-stranded molecule or to another nucleic acid. Thus, nucleic acids may encompass a molecule that comprises one or more complementary or self-complementary strand(s) or “complement(s)” of a particular sequence comprising a molecule. For example, precursor miRNA may have a self-complementary region, which is up to 100% complementary. miRNA probes or nucleic acids can include, can be, or can be at least 60, 65, 70, 75, 80, 85, 90, 95, 96, 97, 98, 99 or 100% complementary to their target.
As used herein, “hybridization”, “hybridizes” or “capable of hybridizing” is understood to mean the forming of a double or triple stranded molecule or a molecule with partial double or triple stranded nature. The term “anneal” is synonymous with “hybridize.” The term “hybridization”, “hybridize(s)” or “capable of hybridizing” encompasses the terms “stringent condition(s)” or “high stringency” and the terms “low stringency” or “low stringency condition(s).”
As used herein, “stringent condition(s)” or “high stringency” are those conditions that allow hybridization between or within one or more nucleic acid strand(s) containing complementary sequence(s), but preclude hybridization of random sequences. Stringent conditions tolerate little, if any, mismatch between a nucleic acid and a target strand. Such conditions are well known to those of ordinary skill in the art, and are preferred for applications requiring high selectivity. Non-limiting applications include isolating a nucleic acid, such as a gene or a nucleic acid segment thereof, or detecting at least one specific mRNA transcript or a nucleic acid segment thereof, and the like.
Stringent conditions may comprise low salt and/or high temperature conditions, such as provided by about 0.02 M to about 0.5 M NaCl at temperatures of about 42° C. to about 70° C. It is understood that the temperature and ionic strength of a desired stringency are determined in part by the length of the particular nucleic acid(s), the length and nucleobase content of the target sequence(s), the charge composition of the nucleic acid(s), and to the presence or concentration of formamide, tetramethylammonium chloride or other solvent(s) in a hybridization mixture.
It is also understood that these ranges, compositions and conditions for hybridization are mentioned by way of non-limiting examples only, and that the desired stringency for a particular hybridization reaction is often determined empirically by comparison to one or more positive or negative controls. Depending on the application envisioned it is preferred to employ varying conditions of hybridization to achieve varying degrees of selectivity of a nucleic acid towards a target sequence. In a non-limiting example, identification or isolation of a related target nucleic acid that does not hybridize to a nucleic acid under stringent conditions may be achieved by hybridization at low temperature and/or high ionic strength. Such conditions are termed “low stringency” or “low stringency conditions,” and non-limiting examples of such include hybridization performed at about 0.15 M to about 0.9 M NaCl at a temperature range of about 20° C. to about 50° C. Of course, it is within the skill of one in the art to further modify the low or high stringency conditions to suite a particular application.
Embodiments may concern detecting miRNA expression levels through hybridization such as with probes. In other embodiments, expression levels may be determined using PCR, such as quantitative PCR. Primers to amplify target miRNA sequences (or corresponding cDNA sequences) may be employed in some embodiments.
Labeling methods and kits may use nucleotides that are both modified for attachment of a label and can be incorporated into a miRNA molecule. Such nucleotides include those that can be labeled with a dye, including a fluorescent dye, or with a molecule such as biotin. Labeled nucleotides are readily available; they can be acquired commercially or they can be synthesized by reactions known to those of skill in the art.
Modified nucleotides for use in the methods and compositions are not naturally occurring nucleotides, but instead, refer to prepared nucleotides that have a reactive moiety on them. Specific reactive functionalities of interest include: amino, sulfhydryl, sulfoxyl, aminosulfhydryl, azido, epoxide, isothiocyanate, isocyanate, anhydride, monochlorotriazine, dichlorotriazine, mono- or dihalogen substituted pyridine, mono- or disubstituted diazine, maleimide, epoxide, aziridine, sulfonyl halide, acid halide, alkyl halide, aryl halide, alkylsulfonate, N-hydroxysuccinimide ester, imido ester, hydrazine, azidonitrophenyl, azide, 3-(2-pyridyl dithio)-propionamide, glyoxal, aldehyde, iodoacetyl, cyanomethyl ester, p-nitrophenyl ester, o-nitrophenyl ester, hydroxypyridine ester, carbonyl imidazole, and other such chemical groups. In some embodiments, the reactive functionality may be bonded directly to a nucleotide, or it may be bonded to the nucleotide through a linking group. The functional moiety and any linker cannot substantially impair the ability of the nucleotide to be added to the miRNA or to be labeled. Representative linking groups include carbon containing linking groups, typically ranging from about 2 to 18, usually from about 2 to 8 carbon atoms, where the carbon containing linking groups may or may not include one or more heteroatoms, e.g. S, O, N etc., and may or may not include one or more sites of unsaturation. Of particular interest in some embodiments are alkyl linking groups, typically lower alkyl linking groups of 1 to 16, usually 1 to 4 carbon atoms, where the linking groups may include one or more sites of unsaturation. The functionalized nucleotides (or primers) used in the above methods of functionalized target generation may be fabricated using known protocols or purchased from commercial vendors, e.g., Sigma, Roche, Ambion, etc. Functional groups may be prepared according to ways known to those of skill in the art, including the representative information found in U.S. Pat. Nos. 4,404,289; 4,405,711; 4,337,063 and 5,268,486, and U.K. Patent 1,529,202, which are all incorporated by reference.
Amine-modified nucleotides are used in some embodiments. The amine-modified nucleotide is a nucleotide that has a reactive amine group for attachment of the label. It is contemplated that any ribonucleotide (G, A, U, or C) or deoxyribonucleotide (G, A, T, or C) can be modified for labeling. Examples include, but are not limited to, the following modified ribo- and deoxyribo-nucleotides: 5-(3-aminoallyl)-UTP; 8-[(4-amino)butyl]-amino-ATP and 8-[(6-amino)butyl]-amino-ATP; N6-(4-amino)butyl-ATP, N6-(6-amino)butyl-ATP, N4-[2,2-oxy-bis-(ethylamine)]-CTP; N6-(6-Amino)hexyl-ATP; 8-[(6-Amino)hexyl]-amino-ATP; 5-propargylamino-CTP, 5-propargylamino-UTP; 5-(3-aminoallyl)-dUTP; 8-[(4-amino)butyl]-amino-dATP and 8-[(6-amino)butyl]-amino-dATP; N6-(4-amino)butyl-dATP, N6-(6-amino)butyl-dATP, N4-[2,2-oxy-bis-(ethylamine)]-dCTP; N6-(6-Amino)hexyl-dATP; 8-[(6-Amino)hexyl]-amino-dATP; 5-propargylamino-dCTP, and 5-propargylamino-dUTP. Such nucleotides can be prepared according to methods known to those of skill in the art. Moreover, a person of ordinary skill in the art could prepare other nucleotide entities with the same amine-modification, such as a 5-(3-aminoallyl)-CTP, GTP, ATP, dCTP, dGTP, dTTP, or dUTP in place of a 5-(3-aminoallyl)-UTP.
Nucleic acids may be isolated using techniques well known to those of skill in the art, though in particular embodiments, methods for isolating small nucleic acid molecules, and/or isolating RNA molecules can be employed. Chromatography is a process often used to separate or isolate nucleic acids from protein or from other nucleic acids. Such methods can involve electrophoresis with a gel matrix, filter columns, alcohol precipitation, and/or other chromatography. If miRNA from cells is to be used or evaluated, methods generally involve lysing the cells with a chaotropic (e.g., guanidinium isothiocyanate) and/or detergent (e.g., N-lauroyl sarcosine) prior to implementing processes for isolating particular populations of RNA.
In particular methods for separating miRNA from other nucleic acids, a gel matrix is prepared using polyacrylamide, though agarose can also be used. The gels may be graded by concentration or they may be uniform. Plates or tubing can be used to hold the gel matrix for electrophoresis. Usually one-dimensional electrophoresis is employed for the separation of nucleic acids. Plates are used to prepare a slab gel, while the tubing (glass or rubber, typically) can be used to prepare a tube gel. The phrase “tube electrophoresis” refers to the use of a tube or tubing, instead of plates, to form the gel. Materials for implementing tube electrophoresis can be readily prepared by a person of skill in the art or purchased.
Methods may involve the use of organic solvents and/or alcohol to isolate nucleic acids, particularly miRNA used in methods and compositions disclosed herein. Some embodiments are described in U.S. patent application Ser. No. 10/667,126, which is hereby incorporated by reference. Generally, this disclosure provides methods for efficiently isolating small RNA molecules from cells comprising: adding an alcohol solution to a cell lysate and applying the alcohol/lysate mixture to a solid support before eluting the RNA molecules from the solid support. In some embodiments, the amount of alcohol added to a cell lysate achieves an alcohol concentration of about 55% to 60%. While different alcohols can be employed, ethanol works well. A solid support may be any structure, and it includes beads, filters, and columns, which may include a mineral or polymer support with electronegative groups. A glass fiber filter or column may work particularly well for such isolation procedures.
In specific embodiments, miRNA isolation processes include: a) lysing cells in the sample with a lysing solution comprising guanidinium, wherein a lysate with a concentration of at least about 1 M guanidinium is produced; b) extracting miRNA molecules from the lysate with an extraction solution comprising phenol; c) adding to the lysate an alcohol solution for forming a lysate/alcohol mixture, wherein the concentration of alcohol in the mixture is between about 35% to about 70%; d) applying the lysate/alcohol mixture to a solid support; e) eluting the miRNA molecules from the solid support with an ionic solution; and, f) capturing the miRNA molecules. Typically the sample is dried down and resuspended in a liquid and volume appropriate for subsequent manipulation.
In some embodiments, miRNAs are labeled. It is contemplated that miRNA may first be isolated and/or purified prior to labeling. This may achieve a reaction that more efficiently labels the miRNA, as opposed to other RNA in a sample in which the miRNA is not isolated or purified prior to labeling. In particular embodiments, the label is non-radioactive. Generally, nucleic acids may be labeled by adding labeled nucleotides (one-step process) or adding nucleotides and labeling the added nucleotides (two-step process).
In some embodiments, nucleic acids are labeled by catalytically adding to the nucleic acid an already labeled nucleotide or nucleotides. One or more labeled nucleotides can be added to miRNA molecules. See U.S. Pat. No. 6,723,509, which is hereby incorporated by reference.
In other embodiments, an unlabeled nucleotide(s) is catalytically added to a miRNA, and the unlabeled nucleotide is modified with a chemical moiety that enables it to be subsequently labeled. In some embodiments, the chemical moiety is a reactive amine such that the nucleotide is an amine-modified nucleotide. Examples of amine-modified nucleotides are well known to those of skill in the art, many being commercially available.
In contrast to labeling of cDNA during its synthesis, the issue for labeling miRNA is how to label the already existing molecule. Some aspects concern the use of an enzyme capable of using a di- or tri-phosphate ribonucleotide or deoxyribonucleotide as a substrate for its addition to a miRNA. Moreover, in specific embodiments, a modified di- or tri-phosphate ribonucleotide is added to the 3′ end of a miRNA. The source of the enzyme is not limiting. Examples of sources for the enzymes include yeast, gram-negative bacteria such as E. coli, Lactococcus lactis, and sheep pox virus.
Enzymes capable of adding such nucleotides include, but are not limited to, poly(A) polymerase, terminal transferase, and polynucleotide phosphorylase. In specific embodiments, a ligase is contemplated as not being the enzyme used to add the label, and instead, a non-ligase enzyme is employed.
Terminal transferase may catalyze the addition of nucleotides to the 3′ terminus of a nucleic acid. Polynucleotide phosphorylase can polymerize nucleotide diphosphates without the need for a primer.
Labels on miRNA or miRNA probes may be colorimetric (includes visible and UV spectrum, including fluorescent), luminescent, enzymatic, or positron emitting (including radioactive). The label may be detected directly or indirectly. Radioactive labels include 125I, 32P, 33P, and 35S. Examples of enzymatic labels include alkaline phosphatase, luciferase, horseradish peroxidase, and β-galactosidase. Labels can also be proteins with luminescent properties, e.g., green fluorescent protein and phicoerythrin.
The colorimetric and fluorescent labels contemplated for use as conjugates include, but are not limited to, Alexa Fluor dyes, BODIPY dyes, such as BODIPY FL; Cascade Blue; Cascade Yellow; coumarin and its derivatives, such as 7-amino-4-methylcoumarin, aminocoumarin and hydroxycoumarin; cyanine dyes, such as Cy3 and Cy5; eosins and erythrosins; fluorescein and its derivatives, such as fluorescein isothiocyanate; macrocyclic chelates of lanthanide ions, such as Quantum Dye™; Marina Blue; Oregon Green; rhodamine dyes, such as rhodamine red, tetramethylrhodamine and rhodamine 6G; Texas Red; fluorescent energy transfer dyes, such as thiazole orange-ethidium heterodimer; and, TOTAB.
Specific examples of dyes include, but are not limited to, those identified above and the following: Alexa Fluor 350, Alexa Fluor 405, Alexa Fluor 430, Alexa Fluor 488, Alexa Fluor 500. Alexa Fluor 514, Alexa Fluor 532, Alexa Fluor 546, Alexa Fluor 555, Alexa Fluor 568, Alexa Fluor 594, Alexa Fluor 610, Alexa Fluor 633, Alexa Fluor 647, Alexa Fluor 660, Alexa Fluor 680, Alexa Fluor 700, and, Alexa Fluor 750; amine-reactive BODIPY dyes, such as BODIPY 493/503, BODIPY 530/550, BODIPY 558/568, BODIPY 564/570, BODIPY 576/589, BODIPY 581/591, BODIPY 630/650, BODIPY 650/655, BODIPY FL, BODIPY R6G, BODIPY TMR, and, BODIPY-TR; Cy3, Cy5, 6-FAM, Fluorescein Isothiocyanate, HEX, 6-JOE, Oregon Green 488, Oregon Green 500, Oregon Green 514, Pacific Blue, REG, Rhodamine Green, Rhodamine Red, Renographin, ROX, SYPRO, TAMRA, 2′,4′,5′,7′-Tetrabromosulfonefluorescein, and TET.
Specific examples of fluorescently labeled ribonucleotides include Alexa Fluor 488-5-UTP, Fluorescein-12-UTP, BODIPY FL-14-UTP, BODIPY TMR-14-UTP, Tetramethylrhodamine-6-UTP, Alexa Fluor 546-14-UTP, Texas Red-5-UTP, and BODIPY TR-14-UTP. Other fluorescent ribonucleotides include Cy3-UTP and Cy5-UTP.
Examples of fluorescently labeled deoxyribonucleotides include Dinitrophenyl (DNP)-11-dUTP, Cascade Blue-7-dUTP, Alexa Fluor 488-5-dUTP, Fluorescein-12-dUTP, Oregon Green 488-5-dUTP, BODIPY FL-14-dUTP, Rhodamine Green-5-dUTP, Alexa Fluor 532-5-dUTP, BODIPY TMR-14-dUTP, Tetramethylrhodamine-6-dUTP, Alexa Fluor 546-14-dUTP, Alexa Fluor 568-5-dUTP, Texas Red-12-dUTP, Texas Red-5-dUTP, BODIPY TR-14-dUTP, Alexa Fluor 594-5-dUTP, BODIPY 630/650-14-dUTP, BODIPY 650/665-14-dUTP; Alexa Fluor 488-7-OBEA-dCTP, Alexa Fluor 546-16-OBEA-dCTP, Alexa Fluor 594-7-OBEA-dCTP, and Alexa Fluor 647-12-OBEA-dCTP.
It is contemplated that nucleic acids may be labeled with two different labels. Furthermore, fluorescence resonance energy transfer (FRET) may be employed in disclosed methods (e.g., Klostermeier et al., 2002; Emptage, 2001; Didenko, 2001, each incorporated by reference).
Alternatively, the label may not be detectable per se, but indirectly detectable or allowing for the isolation or separation of the targeted nucleic acid. For example, the label could be biotin, digoxigenin, polyvalent cations, chelator groups and other ligands, include ligands for an antibody.
A number of techniques for visualizing or detecting labeled nucleic acids are readily available. Such techniques include, microscopy, arrays, fluorometry, light cyclers or other real time PCR machines, FACS analysis, scintillation counters, phosphoimagers, Geiger counters, MRI, CAT, antibody-based detection methods (Westerns, immunofluorescence, immunohistochemistry), histochemical techniques, HPLC (Griffey et al., 1997), spectroscopy, capillary gel electrophoresis (Cummins et al., 1996), spectroscopy; mass spectroscopy; radiological techniques; and mass balance techniques.
When two or more differentially colored labels are employed, fluorescent resonance energy transfer (FRET) techniques may be employed to characterize association of one or more nucleic acids. Furthermore, a person of ordinary skill in the art is well aware of ways of visualizing, identifying, and characterizing labeled nucleic acids, and accordingly, such protocols may be used. Examples of tools that may be used also include fluorescent microscopy, a BioAnalyzer, a plate reader, Storm (Molecular Dynamics), Array Scanner, FACS (fluorescent activated cell sorter), or any instrument that has the ability to excite and detect a fluorescent molecule.
A. Array Preparation
Some embodiments involve the preparation and use of miRNA arrays or miRNA probe arrays, which are ordered macroarrays or microarrays of nucleic acid molecules (probes) that are fully or nearly complementary or identical to a plurality of miRNA molecules or precursor miRNA molecules and that are positioned on a support or support material in a spatially separated organization. Macroarrays are typically sheets of nitrocellulose or nylon upon which probes have been spotted. Microarrays position the nucleic acid probes more densely such that up to 10,000 nucleic acid molecules can be fit into a region typically 1 to 4 square centimeters. Microarrays can be fabricated by spotting nucleic acid molecules, e.g., genes, oligonucleotides, etc., onto substrates or fabricating oligonucleotide sequences in situ on a substrate. Spotted or fabricated nucleic acid molecules can be applied in a high density matrix pattern of up to about 30 non-identical nucleic acid molecules per square centimeter or higher, e.g. up to about 100 or even 1000 per square centimeter. Microarrays typically use coated glass as the solid support, in contrast to the nitrocellulose-based material of filter arrays. By having an ordered array of miRNA-complementing nucleic acid samples, the position of each sample can be tracked and linked to the original sample. A variety of different array devices in which a plurality of distinct nucleic acid probes are stably associated with the surface of a solid support are known to those of skill in the art. Useful substrates for arrays include nylon, glass, metal, plastic, and silicon. Such arrays may vary in a number of different ways, including average probe length, sequence or types of probes, nature of bond between the probe and the array surface, e.g. covalent or non-covalent, and the like. The labeling and screening methods are not limited by with respect to any parameter except that the probes detect miRNA; consequently, methods and compositions may be used with a variety of different types of miRNA arrays.
Representative methods and apparatuses for preparing a microarray have been described, for example, in U.S. Pat. Nos. 5,143,854; 5,202,231; 5,242,974; 5,288,644; 5,324,633; 5,384,261; 5,405,783; 5,412,087; 5,424,186; 5,429,807; 5,432,049; 5,436,327; 5,445,934; 5,468,613; 5,470,710; 5,472,672; 5,492,806; 5,525,464; 5,503,980; 5,510,270; 5,525,464; 5,527,681; 5,529,756; 5,532,128; 5,545,531; 5,547,839; 5,554,501; 5,556,752; 5,561,071; 5,571,639; 5,580,726; 5,580,732; 5,593,839; 5,599,695; 5,599,672; 5,610,287; 5,624,711; 5,631,134; 5,639,603; 5,654,413; 5,658,734; 5,661,028; 5,665,547; 5,667,972; 5,695,940; 5,700,637; 5,744,305; 5,800,992; 5,807,522; 5,830,645; 5,837,196; 5,871,928; 5,847,219; 5,876,932; 5,919,626; 6,004,755; 6,087,102; 6,368,799; 6,383,749; 6,617,112; 6,638,717; 6,720,138, as well as WO 93/17126; WO 95/11995; WO 95/21265; WO 95/21944; WO 95/35505; WO 96/31622; WO 97/10365; WO 97/27317; WO 99/35505; WO 09923256; WO 09936760; WO0138580; WO 0168255; WO 03020898; WO 03040410; WO 03053586; WO 03087297; WO 03091426; WO03100012; WO 04020085; WO 04027093; EP 373 203; EP 785 280; EP 799 897 and UK 8 803 000, which are each herein incorporated by reference.
It is contemplated that the arrays can be high density arrays, such that they contain 2, 20, 25, 50, 80, 100, or more, or any integer derivable therein, different probes. It is contemplated that they may contain 1000, 16,000, 65,000, 250,000 or 1,000,000 or more, or any integer or range derivable therein, different probes. The probes can be directed to targets in one or more different organisms or cell types. In some embodiments, the oligonucleotide probes may range from 5 to 50, 5 to 45, 10 to 40, 9 to 34, or 15 to 40 nucleotides in length. In certain embodiments, the oligonucleotide probes are 5, 10, 15, 20, 25, 30, 35, 40 nucleotides in length, including all integers and ranges there between.
Moreover, the large number of different probes can occupy a relatively small area providing a high density array having a probe density of generally greater than about 60, 100, 600, 1000, 5,000, 10,000, 40,000, 100,000, or 400,000 different oligonucleotide probes per cm2. The surface area of the array can be about or less than about 1, 1.6, 2, 3, 4, 5, 6, 7, 8, 9, or 10 cm2.
Moreover, a person of ordinary skill in the art could readily analyze data generated using an array. Such protocols are disclosed herein or may be found in, for example, WO 9743450; WO 03023058; WO 03022421; WO 03029485; WO 03067217; WO 03066906; WO 03076928; WO 03093810; WO 03100448A1, all of which are specifically incorporated by reference.
B. Sample Preparation
It is contemplated that the miRNA of a wide variety of samples can be analyzed using arrays, miRNA probes, or array technology. While endogenous miRNA is contemplated for use with compositions and methods disclosed herein, recombinant miRNA—including nucleic acids that are complementary or identical to endogenous miRNA or precursor miRNA—can also be handled and analyzed as described herein. Samples may be biological samples, in which case, they can be from biopsy, fine needle aspirates, exfoliates, blood, tissue, organs, semen, saliva, tears, other bodily fluid, hair follicles, skin, or any sample containing or constituting biological cells. In certain embodiments, samples may be, but are not limited to, fresh, frozen, fixed, formalin fixed, paraffin embedded, or formalin fixed and paraffin embedded. Alternatively, the sample may not be a biological sample, but a chemical mixture, such as a cell-free reaction mixture (which may contain one or more biological enzymes).
1. Biological Sample Collection
In certain aspects, methods involve obtaining a sample from a subject. The term subject may refer to an animal (for example a mammal), including but not limited to humans, non-human primates, rodents, dogs, or pigs. The methods of obtaining provided herein include methods of biopsy such as fine needle aspiration, core needle biopsy, vacuum assisted biopsy, incisional biopsy, excisional biopsy, punch biopsy, shave biopsy or skin biopsy. The sample may be obtained from any of the tissues provided herein that include but are not limited to gall bladder, skin, heart, lung, breast, pancreas, liver, muscle, kidney, smooth muscle, bladder, colon, intestine, brain, prostate, esophagus, or thyroid tissue. Alternatively, the sample may be obtained from any other source including but not limited to blood, sweat, hair follicle, buccal tissue, tears, menses, feces, or saliva. In certain aspects of the current methods, any medical professional such as a doctor, nurse or medical technician may obtain a biological sample for testing.
A sample may include but is not limited to, tissue, cells, or biological material from cells or derived from cells of a subject. The biological sample may be a heterogeneous or homogeneous population of cells or tissues. The biological sample may be obtained using any method known to the art that can provide a sample suitable for the analytical methods described herein. The sample may be obtained by non-invasive methods including but not limited to: scraping of the skin or cervix, swabbing of the cheek, saliva collection, urine collection, feces collection, collection of menses, tears, or semen.
The sample may be obtained by methods known in the art. In certain embodiments the samples are obtained by biopsy. In other embodiments the sample is obtained by swabbing, scraping, phlebotomy, or any other methods known in the art. In some cases, the sample may be obtained, stored, or transported using components of a kit of the present methods. In some cases, multiple samples, such as multiple thyroid samples may be obtained for diagnosis by the methods described herein. In other cases, multiple samples, such as one or more samples from one tissue type (for example thyroid) and one or more samples from another tissue (for example, buccal) may be obtained for diagnosis by the methods of the present methods. In some cases, multiple samples such as one or more samples from one tissue type (e.g., thyroid) and one or more samples from another tissue (e.g., buccal) may be obtained at the same or different times. Samples may be obtained at different times are stored and/or analyzed by different methods. For example, a sample may be obtained and analyzed by routine staining methods or any other cytological analysis methods.
In some cases, further samples may be obtained from a subject based on the results of such a cytological analysis. A cancer diagnosis may include an examination of a subject by a physician, nurse or other medical professional. The examination may be part of a routine examination, or the examination may be due to a specific complaint. A specific complaint may include but is not limited to: pain, illness, anticipation of illness, presence of a suspicious lump or mass, a disease, or a condition.
In some embodiments the subject may or may not be aware of the disease or condition.
In some cases, the subject may be referred to a specialist such as an oncologist, surgeon, or endocrinologist. The specialist may likewise obtain a biological sample for testing or refer the individual to a testing center or laboratory for submission of the biological sample. In some cases the medical professional may refer the subject to a testing center or laboratory for submission of the biological sample. In other cases, the subject may provide the sample. In some cases, a molecular profiling business may obtain the sample.
In some embodiments the biological sample may be obtained by a physician, nurse, or other medical professional such as a medical technician, endocrinologist, cytologist, phlebotomist, radiologist, or a pulmonologist. The medical professional may indicate the appropriate test or assay to perform on the sample. In certain aspects a molecular profiling business may consult on which assays or tests are most appropriately indicated.
In other cases, the sample is obtained by an invasive procedure including but not limited to: biopsy, alveolar or pulmonary lavage, needle aspiration, or phlebotomy. The method of needle aspiration may further include fine needle aspiration, core needle biopsy, vacuum assisted biopsy, or large core biopsy. In some embodiments, multiple samples may be obtained by the methods herein to ensure a sufficient amount of biological material. Methods of obtaining suitable samples of thyroid are known in the art and are further described in the ATA Guidelines for thyroid nodule management (Cooper et al. Thyroid Vol. 16 No. 2 2006), herein incorporated by reference in its entirety.
General methods for obtaining biological samples are also known in the art. Publications such as Ramzy, Ibrahim Clinical Cytopathology and Aspiration Biopsy 2001, which is herein incorporated by reference in its entirety, described general methods for biopsy and cytological methods. In one embodiment, the sample is a fine needle aspirate of a thyroid nodule or a suspected thyroid tumor. In some cases, the fine needle aspirate sampling procedure may be guided by the use of an ultrasound, X-ray, or other imaging device.
In some embodiments of the present methods, the molecular profiling business may obtain the biological sample from a subject directly, from a medical professional, from a third party, or from a kit provided by a molecular profiling business or a third party. In some cases, the biological sample may be obtained by the molecular profiling business after the subject, a medical professional, or a third party acquires and sends the biological sample to the molecular profiling business. In some cases, the molecular profiling business may provide suitable containers, and excipients for storage and transport of the biological sample to the molecular profiling business.
In some embodiments of the methods described herein, a medical professional need not be involved in the initial diagnosis or sample acquisition. An individual may alternatively obtain a sample through the use of an over the counter (OTC) kit. An OTC kit may contain a means for obtaining said sample as described herein, a means for storing said sample for inspection, and instructions for proper use of the kit. In some cases, molecular profiling services are included in the price for purchase of the kit. In other cases, the molecular profiling services are billed separately. A sample suitable for use by the molecular profiling business may be any material containing tissues, cells, nucleic acids, genes, gene fragments, expression products, gene expression products, or gene expression product fragments of an individual to be tested. Methods for determining sample suitability and/or adequacy are provided.
2. Biological Sample Storage
In certain aspects, a sample may be obtained and prior to analysis by one or more methods described herein, the sample may be stored for a length of time. A length of time may include a time interval such as seconds, minutes, hours, days, weeks, months, years or longer. In some cases, the sample obtained from a subject is subdivided prior to the step of storage or further analysis. In some cases where the sample is subdivided different portions of the sample are subjected to different downstream methods or processes. Such methods or processes may include storage, cytological analysis, integrity tests, nucleic acid extraction, molecular profiling or any combination of these.
In some cases where storage is contemplated, some part of the sample may be stored while another portion of the sample is further processed. Processing may include but is not limited to molecular profiling, cytological staining, gene or gene expression product (RNA or protein) extraction, detection, or quantification, fixation or examination.
In other cases, the sample is obtained and stored and subdivided after the step of storage for further analysis such that different portions of the sample are subject to different downstream methods or processes including but not limited to storage, cytological analysis, adequacy tests, nucleic acid extraction, molecular profiling or a combination thereof.
In some cases, samples are obtained and analyzed by cytological analysis, and the resulting sample material is further analyzed by one or more molecular profiling methods described herein. In such cases, the samples may be stored between the steps of cytological analysis and the steps of molecular profiling. Samples may be stored upon acquisition to facilitate transport, or to wait for the results of other analyses. In another embodiment, samples may be stored while awaiting instructions a medical professional.
An acquired sample may be placed in short term or long term storage by placing in a suitable medium, excipient, solution, or container. In certain cases storage may require keeping the sample in a refrigerated, or frozen environment. The sample may be quickly frozen prior to storage in a frozen environment. In certain instances the frozen sample may be contacted with a suitable cryopreservation medium or compound. Examples of cryopreservation mediums or compounds include but are not limited to: glycerol, ethylene glycol, sucrose, or glucose.
A suitable medium, excipient, or solution may include but is not limited to: hanks salt solution, saline, cellular growth medium, an ammonium salt solution such as ammonium sulphate or ammonium phosphate, or water.
Suitable concentrations of ammonium salts include solutions of about 0.1 g/ml, 0.2 g/ml, 0.3 g/ml, 0.4 g/ml, 0.5 g/ml, 0.6 g/ml, 0.7 g/ml, 0.8 g/ml, 0.9 g/ml, 1.0 g/ml, 1.1 g/ml, 1.2 g/ml, 1.3 g/ml, 1.4 g/ml, 1.5 g/ml, 1.6 g/ml, 1.7 g/ml, 1.8 g/ml, 1.9 g/ml, 2.0 g/ml, 2.2 g/ml, 2.3 g/ml, 2.5 g/ml, 2.7 g/ml, 3.0 g/ml or higher. The medium, excipient, or solution may or may not be sterile.
The medium, excipient, or solution may contain preservative agents to maintain the sample in an adequate state for subsequent diagnostics or manipulation, or to prevent coagulation. Said preservatives may include citrate, ethylene diamine tetraacetic acid, sodium azide, or thimerosol. The sample may be fixed prior to or during storage by any method known to the art such as using glutaraldehyde, formaldehyde, or methanol. The container may be any container suitable for storage and or transport of the biological sample including but not limited to: a cup, a cup with a lid, a tube, a sterile tube, a vacuum tube, a syringe, a bottle, a microscope slide, or any other suitable container. The container may or may not be sterile. In some cases, the sample may be stored in a commercial preparation suitable for storage of cells for subsequent cytological analysis such as but not limited to Cytyc ThinPrep, SurePath, or Monoprep.
The storage temperature may be explicitly defined or defined by a temperature range. The sample may be stored at room temperature or at reduced temperatures such as cold temperatures (e.g. between about 20° C. and about 0° C.), or freezing temperatures, including for example 0° C., −1° C., −2° C., −3° C., −4° C., −5° C., −6° C., −7° C., −8° C., −9° C., −10° C., −12° C., −14° C., −15° C., −16° C., −20° C., −22° C., −25° C., −28° C., −30° C., −35° C., −40° C., −45° C., −50° C., −60° C., −70° C., −80° C., −100° C., −120° C., −140° C., −180° C., −190° C., or about −200° C. The sample may be stored in any condition or environment that allows or achieves the desired temperature condition. In some cases, the samples may be stored in a refrigerator, on ice or a frozen gel pack, in a freezer, in a cryogenic freezer, on dry ice, in liquid nitrogen, or in a vapor phase equilibrated with liquid nitrogen.
The sample container may be any container suitable for storage and or transport of the biological sample including but not limited to: a cup, a cup with a lid, a tube, a sterile tube, a vacuum tube, a syringe, a bottle, a microscope slide, or any other suitable container. The container may or may not be sterile.
3. Sample Conveyance and Transportation
Additionally contemplated in the current methods are methods of transporting a sample. Transport may involve moving or conveyance of a sample to or from a clinic, hospital, doctor's office, or other location to a second location. Upon transport the sample may be stored and/or analyzed by for example, cytological analysis or molecular profiling. In some embodiments some aspect of analysis, processing or profiling may begin or take place during transport. In some cases, the sample may be transported to a molecular profiling company in order to perform the analyses described herein. In other cases, the sample may be transported to a laboratory such as a laboratory authorized or otherwise capable of performing the methods described herein, such as a Clinical Laboratory Improvement Amendments (CLIA) laboratory.
In some instances the subject may transport the sample. Transportation by an individual may include the individual appearing at a molecular profiling business or a designated sample receiving point and providing a sample. Providing of the sample may involve any of the techniques of sample acquisition described herein, or the sample may have already have been acquired and stored in a suitable container. In other cases the sample may be transported to a molecular profiling business using a courier service, the postal service, a shipping service, or any method capable of transporting the sample in a suitable manner.
In some cases, the sample may be provided to a molecular profiling business by a third party testing laboratory (e.g. a cytology lab). In other cases, the sample may be provided to a molecular profiling business by the subject's primary care physician, endocrinologist or other medical professional. The cost of transport may be billed to the individual, medical provider, or insurance provider. The molecular profiling business may begin analysis of the sample immediately upon receipt, or may store the sample in any manner described herein. The method of storage may or may not be the same as chosen prior to receipt of the sample by the molecular profiling business.
The sample may be transported in any medium or excipient including any medium or excipient provided herein suitable for storing the sample such as a cryopreservation medium or a liquid based cytology preparation. In some cases, the sample may be transported frozen or refrigerated such as at any of the suitable sample storage temperatures provided herein.
Once the sample is received, the sample may be assayed using a variety of routine analyses known to the art such as cytological assays, and genomic analysis by a molecular profiling business, a representative or licensee thereof, a medical professional, researcher, or a third party laboratory or testing center (e.g. a cytology laboratory). Such tests may be indicative of cancer, the type of cancer, any other disease or condition, the presence of disease markers, or the absence of cancer, diseases, conditions, or disease markers. The tests may take the form of cytological examination including microscopic examination as described below. The tests may involve the use of one or more cytological stains. The biological material may be manipulated or prepared for the test prior to administration of the test by any suitable method known to the art for biological sample preparation. The specific assay performed may be determined by the molecular profiling company, the physician who ordered the test, or a third party such as a consulting medical professional, cytology laboratory, the subject from whom the sample derives, or an insurance provider. The specific assay may be chosen based on the likelihood of obtaining a definite diagnosis, the cost of the assay, the speed of the assay, or the suitability of the assay to the type of material provided.
4. Sample Integrity Tests
In some embodiments, concurrent with sample acquisition, sample storage or sample analysis the sample may be subjected to tests or examination that detail or reveal the integrity of the sample for use in the compositions or methods described herein. As a result of an integrity test a sample may be determined to be adequate or inadequate for further analysis.
In some embodiments sample integrity tests may pertain to the quality, integrity or adequacy of cells and or tissue in the sample. Metrics employed to determine quality, integrity or adequacy may involve but are not limited to cell number tests, cell viability tests, nuclear content tests, genetic content tests, biochemical assays, cell mass tests, cell volume tests, PCR tests, Q-PCR tests, RT-PCR tests, immunochemical analysis, histochemical analysis, microscopic analysis or visual analysis.
In certain aspects sample integrity may be ascertained by tests that measure nucleic acid content or integrity. Nucleic acid content tests may measure DNA content, RNA content or a some mixture of DNA or RNA. In some aspects nucleic acids are extracted or purified from other cellular components prior to a nucleic acid content test. In some embodiments nucleic acid specific dyes are used to assay nucleic acid integrity. In cases of nucleic acid extraction, spectrophotometric or electrophoretic methods may be used to assay nucleic acid integrity.
In yet other aspects, sample integrity may be ascertained by tests that measure protein content or integrity. Methods that measure protein content or integrity are well known to those skilled in the art. Such methods include but are not limited to ultraviolet absorbance reading (e.g. 280 nm absorbance readings), cell staining, protein staining or immunocytochemical methods. In some instances tests may be performed in parallel in intact samples or the samples may be divided and tests performed serially or in parallel.
Integrity tests may be performed on small subsets or aliquots of a sample or on the entirety of a sample.
C. Hybridization
After an array or a set of miRNA probes is prepared and the miRNA in the sample is labeled, the population of target nucleic acids is contacted with the array or probes under hybridization conditions, where such conditions can be adjusted, as desired, to provide for an optimum level of specificity in view of the particular assay being performed. Suitable hybridization conditions are well known to those of skill in the art and reviewed in Sambrook et al. (2001) and WO 95/21944. Of particular interest in embodiments is the use of stringent conditions during hybridization. Stringent conditions are known to those of skill in the art.
It is specifically contemplated that a single array or set of probes may be contacted with multiple samples. The samples may be labeled with different labels to distinguish the samples. For example, a single array can be contacted with a tumor tissue sample labeled with Cy3, and normal tissue sample labeled with Cy5. Differences between the samples for particular miRNAs corresponding to probes on the array can be readily ascertained and quantified.
The small surface area of the array permits uniform hybridization conditions, such as temperature regulation and salt content. Moreover, because of the small area occupied by the high density arrays, hybridization may be carried out in extremely small fluid volumes (e.g., about 250 μl or less, including volumes of about or less than about 5, 10, 25, 50, 60, 70, 80, 90, 100 μl, or any range derivable therein). In small volumes, hybridization may proceed very rapidly.
D. Differential Expression Analyses
Arrays can be used to detect differences between two samples. Specifically contemplated applications include identifying and/or quantifying differences between miRNA from a sample that is normal and from a sample that is not normal, between a cancerous condition and a non-cancerous condition, or between two differently treated samples. Also, miRNA may be compared between a sample believed to be susceptible to a particular disease or condition and one believed to be not susceptible or resistant to that disease or condition. A sample that is not normal is one exhibiting phenotypic trait(s) of a disease or condition or one believed to be not normal with respect to that disease or condition. It may be compared to a cell that is normal with respect to that disease or condition. Phenotypic traits include symptoms of, or susceptibility to, a disease or condition of which a component is or may or may not be genetic or caused by a hyperproliferative or neoplastic cell or cells.
An array comprises a solid support with nucleic acid probes attached to the support. Arrays typically comprise a plurality of different nucleic acid probes that are coupled to a surface of a substrate in different, known locations. These arrays, also described as “microarrays” or colloquially “chips” have been generally described in the art, for example, U.S. Pat. Nos. 5,143,854, 5,445,934, 5,744,305, 5,677,195, 6,040,193, 5,424,186 and Fodor et al., 1991), each of which is incorporated by reference in its entirety for all purposes. These arrays may generally be produced using mechanical synthesis methods or light directed synthesis methods that incorporate a combination of photolithographic methods and solid phase synthesis methods. Techniques for the synthesis of these arrays using mechanical synthesis methods are described in, e.g., U.S. Pat. No. 5,384,261, incorporated herein by reference in its entirety. Although a planar array surface is used in certain aspects, the array may be fabricated on a surface of virtually any shape or even a multiplicity of surfaces. Arrays may be nucleic acids on beads, gels, polymeric surfaces, fibers such as fiber optics, glass or any other appropriate substrate (see U.S. Pat. Nos. 5,770,358, 5,789,162, 5,708,153, 6,040,193 and 5,800,992, each of which is hereby incorporated in its entirety). Arrays may be packaged in such a manner as to allow for diagnostics or other manipulation of an all inclusive device (see for example, U.S. Pat. Nos. 5,856,174 and 5,922,591, each incorporated in its entirety by reference). See also U.S. patent application Ser. No. 09/545,207, filed Apr. 7, 2000, which is incorporated by reference in its entirety for additional information concerning arrays, their manufacture, and their characteristics.
Particularly, arrays can be used to evaluate samples with respect to diseases or conditions that include, but are not limited to: benign thyroid conditions and malignant thyroid tumors, including mutation- and translocation-negative tumors.
Moreover, miRNAs can be evaluated with respect to the following diseases, conditions, and disorders: benign thyroid conditions and malignant thyroid tumors, including mutation- and translocation-negative tumors; conventional follicular carcinomas (FC); oncocytic follicular carcinomas (OFC); nodular hyperplasias (NH); follicular adenomas (FA); papillary carcinomas (PC). In addition, miRNAs can be evaluated with respect to thyroid tumors and the determination of whether a particular tumor is not generally malignant or aggressive. Also, miRNAs can be evaluated with respect to determining whether a particular tumor is generally considered malignant and/or aggressive.
It is specifically contemplated that the disclosed methods and compositions can be used to evaluate differences between stages of disease, such as between hyperplasia, neoplasia, pre-cancer and cancer, or between a primary tumor and a metastasized tumor, or between a lesion that is low risk (i.e., not generally malignant or aggressive) and a lesion that is high risk (i.e., generally considered malignant and/or aggressive).
E. Other Assays
In addition to the use of arrays and microarrays, it is contemplated that a number of different assays could be employed to analyze miRNAs, their activities, and their effects. Such assays include, but are not limited to, nucleic acid amplification, polymerase chain reaction, quantitative PCR, RT-PCR, in situ hybridization, Northern hybridization, hybridization protection assay (HPA), branched DNA (bDNA) assay, rolling circle amplification (RCA), single molecule hybridization detection, Invader assay, and/or Bridge Litigation Assay.
F. Evaluation of Expression Levels
A variety of different models can be employed to evaluate expression levels and/or other comparative values based on expression levels of miRNAs (or their precursors or targets). One model used in the Examples described below is a logistic regression model (see the Wikipedia entry on the World Wide Web at en.wikipedia., which is hereby incorporated by reference).
Start by computing the weighted sum of the DiffPair values:
z=β
0+β1*Diff(miR1a,miR1b)+β2*Diff(miR2a,MiR2b)+ . . .
where the β0 is the (Intercept) term identified in the spreadsheets, while the remaining βi are the weights corresponding to the various DiffPairs in the model in question. Once z is computed, the score pmalignant (which may be interpreted as predicted probability of malignancy) is calculated as
This functions to turn the number z, which may be any value from negative infinity to positive infinity, into a number between 0 and 1, with negative values for z becoming scores/probabilities of less than 50% and positive values for z becoming scores/probabilities of greater than 50%.
Other examples of models include but are not limited to Decision Tree, Linear Disciminant Analysis, Neural Network, Support Vector Machine, and k-Nearest Neighbor Classifier. A person of ordinary skill in the art could use these different models to evaluate expression level data and comparative data involving expression levels of one or more miRs (or their precursors or their targets).
Models may take into account one or more diff pair values or they may also take into account differential expression of one or more miRNAs not specifically as part of a diff pair. A diagnostic score may be based on 1, 2, 3, 4, 5, 6, 7, 8 or more diff pair values (or any range derivable therein), but in some embodiments, it takes into account additionally or alternatively, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more miRNA expression levels (or any range derivable therein), wherein the miRNA expression level detectably differs between low risk and high risk lesions.
It will be understood by those of skill in the art that instead of a diff pair value, methods may involve a coefficient value that can be used in conjunction with the level of expression of a particular miRNA based on classifier analysis. Whereas the expression value associated with a diff pair is 1 times the expression value of the first miRNA of the pair summed with −1 times the expression value of the second miRNA of the diff pair, a more general classifier may be built with features composed of combinations of 2 or more miRNA expression values with coefficient values other than just 1 or −1.
Additionally, embodiments may be based on a constrained logistic regression model. For illustrative purposes, consider a model built with three biomarker predictors A, B, and C. The classifier is specified by the weights w0, w1, w2, and w3: given a sample with expression value x1 for marker A, x2 for marker B, and x3 for marker C, the model score may be computed as
Since a model may be constrained to satisfy w1+w2++w3=0, the last term in the above equation must vanish. But what's left expresses our model score in terms of the diff pairs Diff(A,B) (with expression value x1−x2) and Diff(B,C) (with expression value x2−x3). This argument can be extended in a straight-forward manner to apply to constrained logistic regression models with any number of predictors.
Alternatively, the converse may be done. A logistic regression model built using two DiffPairs, Diff(A,B) and Diff(C,D), with weights W1 and W2 can be equivalently described directly in terms of the underlying miRs A, B, C, and D with weights w1=W1, w2=−W1, w3=W2, and w4=−W2 (in case of miR overlap, we just add together the relevant coefficients: e.g., if B=C, we would have w1=W1, w2=−W1+W2, w3=−W2, with x3 now describing the expression value of D). Because the weight of each diff pair appears twice in the resulting individual miRs, once with a positive sign and once with a negative sign, the sum of the resulting individual miR coefficients (the lower-case wi's) must be zero, satisfying our constraint condition.
Any of the compositions described herein may be comprised in a kit. In a non-limiting example, reagents for isolating miRNA, labeling miRNA, and/or evaluating a miRNA population using an array, nucleic acid amplification, and/or hybridization can be included in a kit, as well as reagents for preparation of samples from thyroid samples. The kit may further include reagents for creating or synthesizing miRNA probes. Such kits may thus comprise, in suitable container means, an enzyme for labeling the miRNA by incorporating labeled nucleotides or unlabeled nucleotides that are subsequently labeled. In certain aspects, the kit can include amplification reagents. In other aspects, the kit may include various supports, such as glass, nylon, polymeric beads, and the like, and/or reagents for coupling any probes and/or target nucleic acids. Kits may also include one or more buffers, such as a reaction buffer, labeling buffer, washing buffer, or hybridization buffer, compounds for preparing the miRNA probes, and components for isolating miRNAs. Other kits may include components for making a nucleic acid array comprising miRNAs, and thus, may include, for example, a solid support.
Kits for implementing methods described herein are specifically contemplated. In some embodiments, there are kits for preparing miRNAs for multi-labeling and kits for preparing miRNA probes and/or miRNA arrays. In such embodiments, kits comprise, in suitable container means, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more of the following: 1) poly(A) polymerase; 2) unmodified nucleotides (G, A, T, C, and/or U); 3) a modified nucleotide (labeled or unlabeled); 4) poly(A) polymerase buffer; 5) at least one microfilter; 6) label that can be attached to a nucleotide; 7) at least one miRNA probe; 8) reaction buffer; 9) a miRNA array or components for making such an array; 10) acetic acid; 11) alcohol; 12) solutions for preparing, isolating, enriching, and purifying miRNAs or miRNA probes or arrays. Other reagents include those generally used for manipulating RNA, such as formamide, loading dye, ribonuclease inhibitors, and DNase.
In specific embodiments, kits include an array containing miRNA probes, as described in the application. An array may have probes corresponding to all known miRNAs of an organism or a particular tissue or organ in particular conditions, or to a subset of such probes. The subset of probes on arrays may be or include those identified as relevant to a particular diagnostic, therapeutic, or prognostic application. For example, the array may contain one or more probes that are indicative or suggestive of 1) a disease or condition (thyroid cancer), 2) susceptibility or resistance to a particular drug or treatment; 3) susceptibility to toxicity from a drug or substance; 4) the stage of development or severity of a disease or condition (prognosis); and 5) genetic predisposition to a disease or condition.
For any kit embodiment, including an array, there can be nucleic acid molecules that contain or can be used to amplify a sequence that is a variant of, identical to, or complementary to all or part of any of the sequences disclosed herein. In certain embodiments, a kit or array can contain one or more probes for the miRNAs identified by sequences disclosed herein. Any nucleic acid discussed above may be implemented as part of a kit.
Components of kits may be packaged either in aqueous media or in lyophilized form. The container means of the kits will generally include at least one vial, test tube, flask, bottle, syringe, or other container means, into which a component may be placed, and preferably, suitably aliquotted. Where there is more than one component in the kit (e.g., labeling reagent and label may be packaged together), the kit also will generally contain a second, third, or other additional container into which the additional components may be separately placed. However, various combinations of components may be comprised in a vial. The kits also may include a means for containing the nucleic acids, and any other reagent containers in close confinement for commercial sale. Such containers may include injection or blow molded plastic containers into which the desired vials are retained.
When the components of a kit are provided in one and/or more liquid solutions, the liquid solution may be an aqueous solution, with a sterile aqueous solution being particularly preferred.
However, the components of a kit may be provided as dried powder(s). When reagents and/or components are provided as a dry powder, the powder can be reconstituted by the addition of a suitable solvent. It is envisioned that the solvent may also be provided in another container means. In some embodiments, labeling dyes are provided as a dried power. It is contemplated that 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 120, 120, 130, 140, 150, 160, 170, 180, 190, 200, 300, 400, 500, 600, 700, 800, 900, 1000 μg, or at least or at most those amounts, of dried dye are provided in kits. The dye may then be resuspended in any suitable solvent, such as DMSO.
The container means will generally include at least one vial, test tube, flask, bottle, syringe and/or other container means, into which the nucleic acid formulations are placed, for example, suitably allocated. Kits may also comprise a second container means for containing a sterile, pharmaceutically acceptable buffer and/or other diluent.
Kits may include a means for containing the vials in close confinement for commercial sale, such as, e.g., injection and/or blow-molded plastic containers into which the desired vials are retained.
Such kits may also include components that facilitate isolation of the labeled miRNA. It may also include components that preserve or maintain the miRNA or that protect against its degradation. Such components may be RNAse-free or protect against RNAses. Such kits generally will comprise, in suitable means, distinct containers for each individual reagent or solution.
A kit may also include instructions for employing the kit components as well the use of any other reagent not included in the kit. Instructions may include variations that can be implemented.
Kits may also include one or more of the following: control RNA; nuclease-free water; RNase-free containers, such as 1.5 ml tubes; RNase-free elution tubes; PEG or dextran; ethanol; acetic acid; sodium acetate; ammonium acetate; guanidinium; detergent; nucleic acid size marker; RNase-free tube tips; and RNase or DNase inhibitors.
It is contemplated that such reagents are embodiments of kits. Such kits, however, are not limited to the particular items identified above and may include any reagent used for the manipulation or characterization of miRNA.
The following examples are included to demonstrate preferred embodiments of the invention. It should be appreciated by those of skill in the art that the techniques disclosed in the examples that follow represent techniques discovered by the inventors to function well in the practice of the invention, and thus can be considered to constitute preferred modes for its practice. However, those of skill in the art will, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments that are disclosed and still obtain a like or similar result without departing from the spirit and scope of the invention.
The goal was to identify classifiers than can accurately distinguish benign thyroid conditions from malignant thyroid tumors, including mutation- and translocation-negative tumors, in preoperative thyroid FNAs. Papillary thyroid carcinomas are characterized, in particular, by the up-regulation of three miRNAs, miR-146b, miR-221, and miR-222, which is even more pronounced in BRAF V600E-positive tumors.
The classifier consists of a mathematical algorithm to predict probability of malignancy from the qRT-PCR measurements of the expression levels of the following microRNA species in FNA samples: miR-7, miR-21, miR31, miR-34a, miR-138-1*, miR-139-5p, miR-141, miR-145, miR-146b-3p, miR-146b-5p, miR-155, miR-183, miR-197, miR-222, miR-224, miR-375, and miR-425. These microRNAs were identified through statistical differential expression analysis of fresh frozen and FFPE samples comparing various benign and malignant sample types.
This was followed by a second round of feature selection conducted by L1-constrained logistic regression modeling. Prior indication of differential expression of microRNA species in the literature was considered as an additional factor for those miRs which were found to be strongly predictive of malignancy status in regression modeling.
The first analysis revealed BRAF V600E mutation to be a confounding factor for the development of classifiers.
As BRAF V600E is highly specific of malignancy, the inventors focused subsequent analyses on FNA specimens that were BRAF wild-type. This allowed the inventors to show that classifiers based on miRs-375 and miR-7, among others, show promise for identification of BRAF wild-type malignancies in FNA biopsies.
To identify candidate miRNAs, two independent sets of thyroid samples were procured that corresponded to a variety of benign and malignant conditions. The two sample sets consisted of 46 fresh frozen biopsies (data not shown) and 55 formalin-fixed paraffin-embedded (FFPE) samples, respectively. Total RNA was isolated from FFPE samples using the RecoverAll kit (Ambion). miRNA expression profiles were established on Affymetrix GeneChip miRNA 2.0 Arrays (Affymetrix) that target 1,105 human miRNAs from the Sanger database V15 (FFPE specimens). The samples were divided into benign and malignant groups for analysis, according to histology.
FNAs were obtained from an on-going multi-center study. Total nucleic acids were isolated from the FNAs using a proprietary method developed by Asuragen. qRT-PCR reactions were performed using specific TaqMan miRNAs assays (Applied Biosystems), as selected from the array data analysis.
Affymetrix GeneChip miRNA Service Methods Summary
Sample and array processing. Samples for miRNA profiling studies were processed by Asuragen Services (Austin, Tex.), according to the company's standard operating procedures. Following incoming sample quality control (QC) assessment, the 3′ ends of RNA molecules in total RNA samples were labeled with biotin according to the company's standard protocol. Labeled RNA (100 ng total RNA per sample) was purified and hybridized to Affymetrix GeneChip® miRNA Arrays (Affymetrix, Santa Clara, Calif.). Hybridization, washing, staining, imaging, and signal extraction were performed according to Affymetrix-recommended procedures. Arrays were scanned on an Affymetrix GeneChip® Scanner 3000 7G.
Normalization and Summarization.
Probe level summaries were extracted using median polish and Variance Stabilization Normalization (VSN) was applied to the array data. This is a global normalization approach that models the quadratic relationship between the variance of microarray data and the signal intensity, and transforms the data such that the variance is roughly constant. The post-normalized data scale is reported as generalized log 2 data. As the distribution of microarray data tends to approximately follow a normal distribution pattern after log transformation, this transformation helps to render normalized data amendable to classical statistical treatments, including t-tests and ANOVA.
Statistical Hypothesis Testing.
The signal processing implemented for the Affymetrix miRNA array is a multi-step process involving probe specific signal detection calls, background correction, and array scaling. For each probe, the contribution of signal due to background was estimated and removed by the feature extraction software as part of the data file output. Similarly, detection calls were based on the feature extraction software. Arrays within a specific analysis experiment were normalized together according to the VSN scaling method.
The signal processing implemented for the Affymetrix miRNA array is a multi-step process involving probe specific signal detection calls, background correction, and global normalization. For each probe, the contribution of signal due to background was estimated and removed by the feature extraction software as part of the data file output. Similarly, detection calls were based on the feature extraction software. Arrays within a specific analysis experiment were normalized together according to the VSN method described by Bolstad B et al [1].
The signal processing implemented for the Affymetrix miRNA array is a multi-step process involving probe specific signal detection calls, background estimate and correction, constant variance stabilization and log 2 transformation. For each probe, an estimated background value is subtracted that is derived from the median signal of a set of G-C-matched anti-genomic controls. Detection calls were based on a Wilcoxon rank-sum test of the miRNA probe signal compared to the distribution of signals from GC-content matched anti-genomic probes.
The signal processing implemented for the Affymetrix miRNA array is a multi-step process involving probe specific signal detection calls, background correction, and array scaling. For each probe, the contribution of signal due to background was estimated and removed by the feature extraction software as part of the data file output. Similarly, detection calls were based on the feature extraction software. Arrays within a specific analysis experiment were normalized together according to the VSN scaling method to the 1th percentile.
For statistical hypothesis testing, both equal variance two-sample t-Tests and one-way ANOVA were applied using empirical Bayes methods (as implemented by the Limma R package) to shrink individual miRNA variances towards a common value. This approach of borrowing information from the entire set of miRNAs increases the stability and power of the statistical test. All p-values are adjusted for false discovery rate using the method described by Benjamini & Hochberg (Benjamini & Hochberg (1995)) to account for testing all human mature miRNAs for differential expression.
Sample Information.
A total of 55 formalin-fixed paraffin-embedded (FFPE) samples from 31 benign and 24 malignant neoplasms were purchased from Asterand (Asterand Plc., Detroit, Mich., USA). The benign samples corresponded to 11 hyperplastic nodules (HN), 10 follicular adenomas (FA), 2 oncocytic follicular adenoma (OFA), 4 multi nodular goiter (MNG), and 4 Hashimoto's thyroiditis (HASH). The malignant samples corresponded to 8 papillary carcinomas (PTC), 5 follicular variant of papillary carcinomas (FVPTC), 2 follicular carcinomas (FTC), 2 oncocytic variant of follicular carcinoma (OFTC), 3 anaplastic carcinoma (ATC), and 4 medullary carcinomas (MED). Information regarding these samples is provided in Table 2. Total RNA was isolated from each sample, using the RecoverAll RNA Isolation Kit (Ambion; Austin, Tex., USA). Total RNA was quantified using a NanoDrop® ND-1000 spectrophotometer (NanoDrop Technologies Inc, Wilmington, Del., USA).
Two sets of fine needle aspiration (FNA) biopsies were used, each of these sets containing both benign and malignant FNA samples, as determined by cytology. Malignancy was confirmed by histology after resection of the lesion (Tables 3A and 3B). Molecular testing was performed using Asuragen miRInform™ Thyroid mutation panel assay.
FNA specimens, corresponding to an entire extra-pass, were collected according to the institutional IRB-approved protocol as a part of standard clinical care, and all samples were de-identified. Upon collection, FNAs were placed in 1 mL of Asuragen's RNARetain™ nucleic acid stabilization solution and shipped to Asuragen at room temperature for molecular testing. Total RNA was extracted using a modified mirVana PARIS™ procedure and proprietary Asuragen protocols optimized for recovery of nucleic acids from RNARetain. Concentration and purity of RNA were measured using the NanoDrop ND-1000 Spectrophotometer (NanoDrop Technologies Inc, Wilmington, Del., USA).
A total of 55 FFPE samples from 31 benign and 24 malignant neoplasms were profiled using the Affymetrix GeneChip® miRNA Arrays V2 in Asuragen Services and then analyzed by Asuragen's bioinformatics group (Asuragen). The Affymetrix miRNA arrays contain probes for more than 1,000 human miRNAs from Sanger miRBase V11.0 (available on the world wide web at asuragen.com/Services/services). A total of 200 ng of total RNA corresponding to each of the 55 samples was labeled as described in the materials and methods. The samples were separated in 2 different groups for analysis, corresponding to Benign (hyperplastic nodules (HN; n=11), follicular adenoma (FA; n=10), oncocytic follicular adenoma (OFA; n=2), multi nodular goiter (MNG; n=4), and Hashimoto's thyroiditis (HASH; n=4)) and Malignant (papillary carcinomas (PTC; n=8), follicular variant of papillary carcinomas (FVPTC; n=5), follicular carcinomas (FTC; n=2), oncocytic variant of follicular carcinoma (OFTC; n=2), anaplastic carcinoma (ATC; n=4), and medullary carcinomas (MED; n=4)). The average levels of expression of each miRNA within each group of samples are shown in Tables 4A-D.
A total of 156 human miRNAs were significantly differentially expressed between the Benign and the Malignant groups of samples (p<0.05). Among these, 41 miRNAs were over-expressed (Log 2 diff (Malignant vs Benign)≧1) and 7 miRNAs were under-expressed (Log 2 diff (Malignant vs Benign)≦1) by at least 2-fold in the Malignant group compared to the Benign group of samples, with a p-value≦0.05. Of the miRNAs that were over-expressed, hsa-miR-146b was over-expressed by more than 15-fold in Malignant vs Norm, and hsa-miR-221*, -375, and -31* were over-expressed by more than 5-fold in the Malignant group of lesions compared to the Benign group of specimens (Tables 5A-C). In addition, hsa-miR-31*, -146b-3p, -21, -222, -221, -551b, -34a, and -31, were 3- to 5-fold overexpressed in Malignant lesions compared to Benign specimens (Tables 5A-C). Among the miRNAs that were under-expressed, hsa-miR-7 was under-expressed in the Malignant group by more than 4-fold compared to the Benign group of samples (Tables 5A-C).
Samples and qRT-PCR experiments: Two sets of FNA specimens were used for the development of miRNA classifier. The first set (Set 1) contained a total of 14 samples corresponding to 8 benign and 6 malignant specimens, respectively, as described in Table 3A. qRT-PCR reactions were performed using TaqMan MicroRNA Assays (Applied Biosystems; Foster City, Calif., USA) according to the manufacturer's instructions. Reactions were done in duplicates and included 5 ng of total nucleic acids (TNA) per reaction. The reactions were incubated in the 7900HT Fast Real-Time PCR System (Applied Biosystems). The miRNAs tested for Set 1 were: miRs-138-1#, -139-5, -146b-3p, -146b-5p, -21, -222, -320a, -34a, -375, -423-5p, -625#, -7, -126, -143, -144, -193a-3p, -193a-5p, -195, -204, -222#, -320a, -451, -486-3p, and miR-486-5p. These miRNAs were selected based on array analysis of the FFPE samples, as well as additional information obtained from previous work on thyroid tissues using fresh frozen biopsies.
miR-625#
miR-423-5p
miR-320a
The second set of FNAs (Set 2) contained a total of 21 samples corresponding to a benign group of 5 samples in which no mutation had been identified, a benign group of 5 samples in which mutations had been identified, and a group of 5 malignant samples in which no mutation had been identified and a group of 5 malignant samples in which mutations had been identified, with the exclusion of BRAF mutation. An additional malignant sample was used that contained a BRAF V600E mutation (Table 3B). The miRNAs tested in the second FNA set were: miRs-138-1#, -139-5, -146b-3p, -146b-5p, -21, -222, -320a, -34a, -375, -423-5p, -625#, -7, -141, -145, -155, -183, -197, -224, -31, and miR-425-5p. These miRNAs were selected based on array analysis of the FFPE samples excluding BRAF-positive FFPE tumors, as well as additional information obtained from previous work on thyroid tissues using fresh frozen biopsies and data recently published in the literature (Kitano et al., 2011; Nikiforova et al., 2009).
miR-625#
miR-423-5p
miR-320
DiffPairs, normalized biomarkers whose values are defined as the difference in expression values (Ct values for qRT-PCR data) between two distinct miRNAs, were used as the normalization method for qRT-PCR data. DiffPair normalization is similar to normalization using a specified normalizer reference gene with the modification that different DiffPairs use different reference genes. Because the DiffPair itself is regarded as a biomarker, DiffPair normalization does not require the assumption that either miRNA composing a given DiffPair is stably expressed across all samples, making DiffPair normalization appropriate in some conditions for which use of fixed normalizer genes can be problematic.
L1-penalized regression (Tibshirani 1996, Goeman 2010), as implemented by the R package penalized, was used to fit logistic model classifier to DiffPaired FNA data sets. The use of an L1-penalty term integrates feature selection, whereby only those DiffPairs who combine to produce the best fitting model without requiring large model weights are allowed to be used as model predictors, directly into the model-training process in order to prevent statistical over-fitting. The L1-penalty used for model-fitting was always fixed at a constant value based on prior assessment of reasonable values for logistic regression models using DiffPaired qRT-PCR data. Performance of this logistic regression modeling methodology was assessed using 5-fold cross-validation (Hastie 2009).
We tested an independent set of 65 thyroid FFPE specimens (Set 2) corresponding to 29 benign and 36 malignant neoplasms. The benign samples corresponded to 9 hyperplastic nodules (HN), 8 follicular adenomas (FA), 3 oncocytic follicular adenoma (OFA), 4 multi nodular goiters (MNG), and 5 Hashimoto's thyroiditis (HASH). The malignant samples corresponded to 19 papillary carcinomas (PTC), 8 follicular variant of papillary carcinomas (FVPTC), and 9 follicular carcinomas (FTC). Information regarding these samples is provided in Table X. Total RNA was isolated from each sample, using the RecoverAll RNA Isolation Kit (Ambion; Austin, Tex., USA). Total RNA was quantified using a NanoDrop® ND-1000 spectrophotometer (NanoDrop Technologie Inc, Wilmington, Del., USA). The samples were tested for mutations and translocations, using Asuragen miRInform® Thyroid Marker Panel. The mutational results of the assay are indicated in Table X.
miRNA Profiling Analysis.
A total of 65 FFPE samples from 29 benign and 36 malignant neoplasms were profiled using the Affymetrix GeneChip® miRNA Arrays V2 in Asuragen Services and then analyzed by Asuragen's bioinformatics group (Asuragen). The Affymetrix miRNA arrays contain probes for more than 1,000 human miRNAs from Sanger miRbase V11.0 (on the World Wide Web at asuragen.com/Services/services). A total of 200 ng of total RNA corresponding to each of the 65 thyroid FFPE was labeled as described in the materials and methods. The thyroid samples were separated in 2 different groups for analysis, corresponding to Benign (hyperplastic nodules (HN; n=9), follicular adenoma (FA; n=8), oncocytic follicular adenoma (OFA; n=3), multi nodular goiter (MNG; n=4), and Hashimoto's thyroiditis (HASH; n=5)) and Malignant (papillary carcinomas (PTC; n=19), follicular variant of papillary carcinomas (FvPTC; n=8), and follicular carcinomas (FTC; n=9). The average levels of expression of each miRNA in each subset of the different groups of samples are shown in Table Y, which is provided at the end of the description.
miRNAs Can Distinguish Benign and Malignant Thyroid Neoplasms
A total of 410 human miRNAs were significantly differentially expressed between the Benign and the Malignant groups of samples (p<0.05). Among these, 115 miRNAs were over-expressed (Log 2 diff (Malignant vs Benign)≧1) by at least 2-fold in the Malignant group compared to the Benign group of samples, with a p-value≦0.05. Of these, hsa-miR-146b-5p was over-expressed by more than 30-fold in Malignant vs Benign, and hsa-miR-146b-3p and miR-375 were over-expressed between 9- and 15-fold in the Malignant group of lesions compared to the Benign group of specimens (Table Y). In addition, hsa-miR-221, -221*, -222, -31*, -181a-2*, -551b, -424, -31, -21, and -429 were overexpressed between 5- and 9-fold in Malignant lesions compared to Benign specimens (Table Y). Only one miRNA, miRNA-7, was down-regulated by at least 2-fold in Malignant compared to Benign lesions (Table Y).
As follicular thyroid cancers are more difficult to diagnose than papillary thyroid cancers, we also compared the miRNA expression profiles of the Follicular thyroid cancer (FTC) group of samples with those of the Benign group of samples. A total of 195 human miRNAs were significantly differentially expressed between the FTC and the Benign groups of samples (p<0.05). Among these, 66 miRNAs were over-expressed (Log 2 diff (FTC vs Benign)≧1) by at least 2-fold in the FTC group compared to the Benign group of samples, with a p-value≦0.05. Of these, two miRNAs, hsa-miR-96 and -182, were over-expressed by more than 5-fold in FTC vs Benign (Table Y). There was no miRNA down-regulated by at least 2-fold in FTC compared to Benign lesions (Table Y).
A particularly challenging diagnostic issue is to distinguish Follicular carcinoma from Follicular adenoma due to overlapping cytological features between the two entities. We therefore also compared the expression profiles of the Follicular thyroid cancer (FTC) group of samples with those of Follicular adenoma (FA). A total of 101 human miRNAs were significantly differentially expressed between the FTC and the FA groups of samples (p<0.05). Among these, 22 miRNAs were over-expressed (Log 2 diff (FTC vs Benign)≧1) by at least 2-fold in the FTC group compared to the FA group of samples, with a p-value≦0.05. Of these, two miRNAs, hsa-miR-146b-5p and -181a-2*, were over-expressed by more than 3-fold in FTC vs FA, and 20 miRNAs were over-expressed between 2- and 3-fold in FTC compared to FA (Table Y). There was no miRNA down-regulated by at least 2-fold in FTC compared to FA lesions (Table Y).
Hashimoto's lesions represent also a challenging diagnostic issue. We compared the miRNA expression profiles of the Hashimoto's group of samples (Hash) with those of other benign conditions. A total of 108 human miRNAs were significantly differentially expressed between the Hash and the Benign groups of samples (p<0.05). Among these, 8 miRNAs were over-expressed (Log 2 diff (Hash vs Benign)≧1) by at least 2-fold in the Hash group compared to the other Benign group of samples, with a p-value≦0.05. Of these, one miRNA, hsa-miR-142-5p was over-expressed by more than 5-fold in Hash vs Benign. In addition, seven miRNAs, hsa-miR-150, -146b-5p, -155, -146a, -223, -199b-5p, and -31 were over-expressed between 1- and 5-fold in Hash compared to Benign (Table Y). A total of 12 miRNAs were under-expressed in Hash vs Benign by more than 2-fold (Log 2 diff (Hash vs Benign)≦1). Among these, only one miRNA, hsa-miR-7, was down-regulated by more than 5-fold in Hash vs Benign and 11 other miRNas were under-expressed by at least 2-fold in Hash samples compared to the Benign specimens (Table Y). The same comparison was made between Hash samples and the Malignant group of Thyroid specimens. A total of 350 human miRNAs were significantly differentially expressed between the Hash and the Malignant groups of samples (p<0.05). Among these, 27 miRNAs were over-expressed (Log 2 diff (Hash vs Malignant)≧1) by at least 2-fold in the Hash group compared to the Malignant group of samples, with a p-value≦0.05 (Table Y). A total of 126 miRNAs were under-expressed in Hash vs Malignant by more than 2-fold (Log 2 diff (Hash vs Malignant)≦1). Among these, three miRNAs, hsa-miR-146b-3p, -200b, and -181a-2*, were down-regulated by at least 10-fold in Hash samples compared to the Malignant specimens (Table Y). A total of 21 miRNAs were down-regulated between 5- and 10-fold in the Hash versus the Malignant groups of specimens, and 102 miRNAs were under-regulated between 2- and 5-fold in the Hash compared to the Malignant groups of samples (Table Y).
A set of 41 FFPE samples containing 21 malignant and 20 benign thyroid neoplasms, including 6 Hashimoto's specimens, was also analyzed to further to build a classifier that can identify Hashimoto's disease among other benign and malignant thyroid conditions. Information regarding the samples is provided in Table Z1. Total nucleic acids (TNA) were isolated from each sample, using the RecoverAll RNA Isolation Kit (Ambion; Austin, Tex., USA). Total RNA was quantified using a NanoDrop® ND-1000 spectrophotometer (NanoDrop Technologie Inc, Wilmington, Del., USA). The samples were tested for mutations and translocations, using Asuragen miRInform® Thyroid Marker Panel. The mutational results of the assay are indicated in Table Z1. These samples were analyzed by RT-qPCR, as described below.
Constrained Logistic Regression Method for Constructing miRNA-Based Classifiers.
A variety of different models can be employed to evaluate expression levels and/or other comparative values based on expression levels of miRNAs (or their precursors or targets). In particular, a logistic regression model (see the Wikipedia entry on the World Wide Web at en.wikipedia.org/wiki/Logistic regression, which is hereby incorporated by reference) distinguishing between two diagnostic groups consists of a set of predictor variables, Xi for i between 1 and n together with a set of weight coefficients wi for i between 0 and n, from which the probability that a sample with predictor values Xi xi is in the first diagnostic group can be computed as
Other examples of models include but are not limited to decision trees, linear or quadratic discriminant analysis, neural networks, support vector machines, and k-nearest neighbor classifiers. A person of ordinary skill in the art could use these different modeling procedures to evaluate expression level data and comparative data involving expression levels of one or miRNAs (or their precursors or their targets).
Because of the difficulties involved in precisely controlling the amount of intact RNA input for qRT-PCR assays, it is generally desirable to construct models which take as inputs comparative differences in expression between two or more biomarkers instead of the raw cycle threshold (Ct) values measured for individual miRNA biomarkers. One method for accomplishing this is to consider a DiffPair consisting of two biomarkers A and B associated with the value computed as the difference in Ct value between marker A and marker B (i.e, if xA is the Ct value of marker A and xB is the Ct value of marker B, then xA−xB is the value of the DiffPair Diff(A,B)).
In fitting logistic regression models, an alternative method for adjusting for potential differential intact RNA input levels to the DiffPair method described above is to constrain the sum of the weight coefficients wi for i>0 to be equal to zero:
The result of fitting such as a constrained logistic regression model is that, as in the case with DiffPair values, the model output scores are insensitive to any changes to biomarker Ct values that change the measured Ct values of all predictors upwards or downwards by the same amount, so that only relative expression levels between multiple biomarkers are used by the resulting model to predict the probability of malignancy. Note that in the case of exactly two biomarkers, this constrained logistic regression model becomes a logistic regression model built on a single DiffPair. More generally, models built with this type of constraint can be equivalently described in terms of an unconstrained logistic regression model built using a set of DiffPairs for more than two biomarkers as well (although this may increase the complexity of modeling process).
The classifier algorithms presented below were constructed using only subsets of the singleplex-measured miRNA biomarkers. For each model, the strategy for selection of the miRNA subset used by that model was to choose biomarkers using an unconstrained L2-penalized stepwise logistic regression strategy (L2 penalty parameter set in all cases to λ2=2.5). Once the subset of biomarkers to be used as predictors for a given model had been identified, the final model was constructed by fitting using the constrained logistic regression described above (again using an L2-penalty λ2=2.5) to the selected predictors. Classifier performance was estimated using leave-one-out-cross-validation evaluating the entire modeling process, including feature selection.
The intercept (w0) and weight coefficients (wi for i>0) for the constrained logistic regression classifier for Hashimoto's disease are indicated in Table DX.
As estimated by leave-one-out-cross-validation (LOOCV) for the modeling strategy described above, the observed accuracy was 93% (95% CI: 80%-98%), sensitivity 67% (95% CI: 22%-96%), specificity 97% (95% CI: 85%-100%), with an estimated AUC of 0.98. The confidence interval for sensitivity is very wide because the data set used to train this classifier contained only 6 Hashimoto's samples. The LOOCV classification scores produced by this model are shown in
A total of 62 FNA specimens were analyzed. These samples were classified as benign or malignant, according to their cytology and/or histology report, as described in Table W1. Molecular testing was performed using Asuragen miRInform Thyroid mutation panel assay (Table W1). FNA specimens, corresponding to an entire extra-pass, were collected according to the institutional IRB-approved protocol as a part of standard clinical care, and all samples were de-identified. Upon collection, FNAs were placed in 1 mL of Asuragen's RNARetain™ nucleic acid stabilization solution and shipped to Asuragen at room temperature for molecular testing. Total RNA was extracted using a modified mirVana PARIS™ procedure and proprietary Asuragen protocols optimized for recovery of nucleic acids from RNARetain. Concentration and purity of RNA were measured using the NanoDrop ND-1000 Spectrophotometer (NanoDrop Technologies Inc, Wilmington, Del., USA).
Real-Time Assays and miRNA Classifiers
Real-time RT-qPCR reactions were performed for the 26 candidate miRNAs described in Table 0 (hsa-miR-375, -146b-3p, -146b-5p, -221, -222, -551b, -204, -7, -141, -31, -181a*-2, -15a, -221*, -21, -424, -34a, -197, -19b, -138-1*, -139-5p, -155, -151-3p, -29b, -96, -328, and 29b-1*). RT-qPCR reactions were performed in duplicates, using Exiqon miRCURY LNA™ Universal RT microRNA PCR Assays (Exiqon A/S, Denmark), using an optimizedversion of the manufacturer's protocol. Reactions included either 20 ng (FFPE samples) or 50 ng (FNAs) of total nucleic acids per reaction and were incubated in the 7900 Fast System SDS 2.3 software.
The 41 FFPE samples (Set3), including 21 malignant and 20 benign thyroid neoplasms were analyzed. RT-qPCR results for each replicate of the 26 miRNA assays are summarized in Table Z2. Similarly, the 62 FNA samples (Set3) were also analyzed. RT-qPCR results for each replicate of the 26 miRNA assays are summarized in Table W2.
A total of 62 FNAs were also analyzed. RT-qPCR results for each replicate of the 26 miRNA assays are summarized in Table W2. The intercept (w0) and weight coefficients (wi for i>0) for the constrained logistic regression classifiers trained as described in section entitled “Constrained logistic regression method for constructing miRNA-based classifiers” above are indicated in Table D1. The leave-one-out-cross-validation (LOOCV) estimated accuracies and AUCs of the models were: (1) 5-miRNA model: accuracy 90% (95% CI: 80%-96%), AUC 0.93; (2) 10-miRNA model: accuracy 94% (95% CI: 84%-98%), AUC 0.96. For both the 5- and 10-miRNA models, model sensitivity with regard to BRAF-mutation negative malignant samples was estimated by LOOCV at 79% (95% CI: 49%-95%), while sensitivity for BRAF-mutation positive malignant samples was estimated at 100% (95% CI: 85%-100%). The specificity of (1) the 5-miRNA model was estimated at 88% (95% CI: 70%-98%), while the specificity of (2) the 10-miRNA model was estimated at 96% (95% CI: 80%-100%). The LOOCV classification scores produced by these models are shown in
The miRNA predictors used by the classifiers break down into two categories according to the sign of their coefficients in the models: those miRs which are substantially overexpressed in malignant samples as compared to benign samples tend to receive strongly negative model coefficients, while those miRs which exhibit similar or lower espression levels in malignant samples as compared with benign enter the models with positive coefficients. The most negatively-weighted miRNA predictor in the 5-miRNA model was found to be miR-146b-3p, with miR-375 a close second (Table D1); for the 10-miRNA model, miR-375 had the largest magnitude coefficient among the negatively weighted miRNAs, with miR-221* coming second. For both the 5- and 10-miRNA models, miR-141 (which showed little difference in expression levels between malignant and benign samples) was assigned the largest positive coefficient.
The following references, to the extent that they provide exemplary procedural or other details supplementary to those set forth herein, are specifically incorporated herein by reference.
The application claims priority to U.S. Provisional Patent Application 61/552,451 filed on Oct. 27, 2011 and U.S. Provisional Patent Application 61/552,762 filed on Oct. 28, 2011, both of which are hereby incorporated by reference.
Number | Date | Country | |
---|---|---|---|
61552451 | Oct 2011 | US | |
61552762 | Oct 2011 | US |