The present disclosure relates to a method capable of pathologic stratification or identifying type of neoplasm with respect to breast tissue of a subject. More particularly, the disclosed method identifies neoplasm type, stage or group by way of detecting or mapping mutations concurrently present in the breast tissue according to predetermined test modules associated with the mutations of interest. A kit being configured to enable the disclosed method is provided in the present disclosure as well.
Fibroepithelial neoplasms of the breast are disease entities characterized by a biphasic proliferation of both epithelial and stromal components. Fibroepithelial breast tumors include fibroadenomas (FAs) and phyllodes tumors (PTs), the latter of which can be further subdivided into benign, borderline, and malignant grades based on their histological features1. While FAs affect millions of women worldwide annually3, PTs occur at a lower frequency of approximately 1% or less of breast tumors and up to 7% of Asian breast cancers4. Compared to FAs, PTs have a later median age of onset (35 years vs 43 years), and a higher propensity for local recurrence, with distant metastasis also occurring in some malignant PTs5.
Previous studies have suggested that PTs and FAs may be highly related4. FA-like areas are not uncommonly encountered during histopathological examination of PTs, and some studies have proposed a clonal progression from FA to PT6-10. At the molecular level, frequent Mediator of RNA polymerase II transcription subunit 12 (MED12) exon 2 mutations have recently been observed in FAs and PTs2,11-14, while gene expression and DNA methylation analyses have implicated genes such as HOXB13 and HMGA2 in PT development15-17. Higher rates of copy number alterations (CNAs) have been also associated with PTs of higher grade18,19, and a recent study profiling a small number of PTs (n=15, five per grade) using a targeted cancer gene panel revealed recurrent mutations in tumour protein p53 (TP53) and singleton mutations in retinoblastoma protein (RB1) and Neurofibromin 1 (NF1) exclusively in higher-grade PTs11. However, unlike breast carcinomas (BCs) whose comprehensive mutational landscapes have been extensively studied20-23, comparatively little is known about the genetic and molecular relationships linking different types of breast fibroepithelial lesions.
The diagnosis and classification of PTs often present challenges to pathologists, particularly in the distinction of benign PT from FA. Such classification can be of clinical importance in offering disease-specific treatment to patients suffering from FA, PT or BC.
The present disclosure aims to provide a method for grading, identifying or categorizing types of neoplasm relating to breast tissue of a subject. By having the type, stage or group of neoplasm correctly identified, the present disclosure facilitates disease-specific therapies towards the subject.
Another object of the present disclosure is to employ one or more nucleic-acid based assays in assisting the grading, identifying or categorizing of the neoplasm type relating to breast tissue of the subject. The disclosed method utilizing nucleic-acid based assays provides reliable results to serve as a supportive diagnostic tool in addition to physical examination of the specimen for neoplasm grading. Particularly, the disclosed method allows pathologists to substantially differentiate benign PT from FA.
Further object of the present disclosure is to offer a kit containing at least partly the essential reagents to facilitate the performance of the aforesaid one or more nucleic-acid based assays in generating the desired neoplasm grading. The disclosed kit can be of various embodiments to operate under different platforms of nucleic-acid based assays.
At least one of the preceding objects is met, in whole or in part, by the present disclosure, in which one of the embodiments of the present disclosure is a method for identifying type of neoplasm in a breast tissue of a subject comprising the steps of performing one or more nucleic-acid based assays to identify mutations present in the breast tissue acquired from the subject through a first test module and a second test module, each of the first and second test module being associated with detection of at least one predetermined mutation of one or more genes and configured to provide a positive outcome corresponding to at least one predetermined mutation detected in the tissue or a negative outcome corresponding to absence of detectable predetermined mutation in the sample, the first test module being associated with detection of mutation in MED12 gene and/or mutation in Retinoic acid receptor alpha (RARA) gene and the second test module being associated with detection of mutation in Filamin A alpha (FLNA) gene, mutation in SET domain containing 2 (SETD2) gene and/or mutation in mixed-lineage leukemia protein 2 (MLL2) gene; and identifying the type of neoplasm of the breast tissue based upon the provided outcome of the first and second test modules. Preferably, the type of neoplasm is regarded as fibroadenomas when the outcome of the first test module and the second test module are respectively positive and negative. Alternatively, the type of neoplasm is regarded as phyllodes tumor when the outcome of the first test module and the second test module are both positive. Also, the first test module can be further associated with detection of mutation in Telomerase reverse transcriptase (TERT) gene of the subject.
According to a number of the preferred embodiments, the step of performing one or more nucleic-acid based assays further comprises a third test module being associated with detection of mutation in NF1 gene, mutation in RB1 gene and/or mutation in phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit alpha (PIK3CA) gene. Preferably, the type of neoplasm is regarded as malignant phyllodes tumor when the outcome of the first test module, the second test module and the third test module are all positive.
According to a plurality of the preferred embodiments of the disclosed method, the mutation in MED12 gene is a splice site mutation located at position −8 of exon 2 of the MED12 gene, a missense mutation located at codon 44 of cDNA of the MED12 gene or a missense mutation located at codon 36 of cDNA of the MED12 gene.
According to more preferred embodiments, the mutation in RARA gene corresponds to or results in p.F286del, p.F287L, p.N299H, p.R394Q, p.L409del and/or p.G289R found in a polypeptide translated thereof from the RARA gene of the subject.
For several embodiments, the mutation in FLNA gene corresponds to p.A1191T, p.S1199L, p.P1244S, p. 1687-1688TV>M and/or p.S1186W found in a polypeptide translated thereof from the FLNA gene of the subject. These mutations to be detected relates particularly to missense mutation on the produced polypeptide.
For a number of embodiments, the mutation in SETD2 gene relates to p.R1674-1675EA>D, p.K1587fs, p.Q1545*, p.Y1605fs and/or p.F1651fs found in a polypeptide translatable thereof. The mutations found in SETD2 gene are generally relating to missense or somatic mutation.
In a plurality of embodiments, the mutation to be detected in TERT gene is preferably located at the promoter region. For instance, mutation located at −124 and/or −146 of the promoter region of the TERT gene leading to missense mutation.
In another aspect of the present disclosure, a kit for identifying type of neoplasm in a breast tissue of a subject is provided. Preferable, the kit comprises at least one platform capable of performing one or more nucleic-acid based assays to identify mutations present in the breast tissue acquired from the subject corresponding to a first test module and a second test module that each test module is associated with detection of at least one predetermined mutation of one or more genes, each test module being configured to provide a positive outcome corresponding to at least one predetermined mutation detected in the tissue or a negative outcome corresponding to absence of detectable predetermined mutation in the sample, the first test module being associated with detection of mutation in MED12 gene, TERT and/or mutation in RARA gene, the second test module being associated with detection of mutation in FLNA gene, mutation in SETD2 gene and/or mutation in MLL2 gene. Preferably, the test modules are configured to emit a detectable or visual signal corresponds to any positive outcome. The type of neoplasm is regarded as fibroadenomas when the outcome of the first test module and the second test module are respectively positive and negative. Alternatively, the type of neoplasm is regarded as benign phyllodes tumor when the outcome of the first test module and the second test module are both positive.
In one or more embodiments of the disclosed kit, the at least one platform further comprises a third test module being associated with detection of mutation in NF1 gene, mutation in RB1 gene and/or mutation in PIK3CA gene. With the inclusion of the third test module, the kit of the present disclosure can further regard, grade or identify the type of neoplasm as malignant phyllodes tumor when the outcome of the first test module, the second test module and the third test module are all positive.
For some embodiments, the breast tissue is stromal cells to be used with the disclosed kit for neoplasm grading or identification.
The present invention may be embodied in other specific forms without departing from its structures, methods, or other essential characteristics as broadly described herein and claimed hereinafter. The described embodiments are to be considered in all respects only as illustrative, and not restrictive. The scope of the invention is, therefore, indicated by the appended claims, rather than by the foregoing description. All changes that come within the meaning and range of equivalency of the claims are to be embraced within their scope.
Unless specified otherwise, the terms “comprising” and “comprise” as used herein, and grammatical variants thereof, are intended to represent “open” or “inclusive” language such that they include recited elements but also permit inclusion of additional, un-recited elements.
As used herein, the phrase “in embodiments” means in some embodiments but not necessarily in all embodiments.
As used herein, the terms “approximately” or “about”, in the context of concentrations of components, conditions, other measurement values, etc., means+/−5% of the stated value, or +/−4% of the stated value, or +/−3% of the stated value, or +/−2% of the stated value, or +/−1% of the stated value, or +/−0.5% of the stated value, or +/−0% of the stated value.
The term “polynucleotide” or “nucleic acid” as used herein designates mRNA, RNA, cRNA, cDNA or DNA. The term typically refers to oligonucleotides greater than 30 nucleotide residues in length.
The term “primer” used herein throughout the specification refers to an oligonucleotide which, when paired with a strand of DNA, is capable of initiating the synthesis of a primer extension product in the presence of a suitable polymerizing agent. The primer is preferably single-stranded for maximum efficiency in amplification but can alternatively be double-stranded. A primer must be sufficiently long to prime the synthesis of extension products in the presence of the polymerization agent. Primers can be “substantially complementary” to the sequence on the template to which it is designed to hybridize and serve as a site for the initiation of synthesis. By “substantially complementary”, it is meant that the primer is sufficiently complementary to hybridize with a target polynucleotide. Preferably, the primer contains no mismatches with the template to which it is designed to hybridize but this is not essential. For example, non-complementary nucleotide residues can be attached to the 5′ end of the primer, with the remainder of the primer sequence being complementary to the template. Alternatively, non-complementary nucleotide residues or a stretch of non-complementary nucleotide residues can be interspersed into a primer, provided that the primer sequence has sufficient complementarity with the sequence of the template to hybridize therewith and thereby form a template for synthesis of the extension product of the primer.
The term “gene” as used herein may refer to a DNA sequence with functional significance. It can be a native nucleic acid sequence, or a recombinant nucleic acid sequences derived from natural source or synthetic construct. The term “gene” may also be used to refer to, for example and without limitation, a cDNA and/or an mRNA encoded by or derived from, directly or indirectly, genomic DNA sequence.
According to one major aspect of the present disclosure, a method for identifying type of neoplasm in a breast tissue of a subject is disclosed. Preferably, the disclosed method comprises the steps of performing one or more nucleic-acid based assays to identify mutations present in the breast tissue acquired from the subject corresponding to a first test module and a second test module that each test module is associated with detection of at least one predetermined mutation of one or more genes, each test module being configured to provide a positive outcome corresponding to at least one predetermined mutation detected in the tissue or a negative outcome corresponding to absence of detectable predetermined mutation in the sample, the first test module being associated with detection of mutation in MED12 gene, mutation in TERT gene and/or mutation in RARA gene, the second test module being associated with detection of mutation in FLNA gene, mutation in SETD2 gene and/or mutation in MLL2 gene; and identifying the type of neoplasm of the breast tissue based upon the provided outcome of the first and second test modules.
One ordinary skilled artisan shall appreciate the fact that at least part of the nucleic acid-based assay described herein can include also at least some universally known procedures or steps to complete the assay despite such procedures may not be completely detailed in this specification. For instance, part of the nucleic acid-based assay can involve the step of extracting or isolating deoxyribonucleic acid (DNA) and/or ribonucleic acid (RNA) materials containing genetic information relating to the subject or the tissue sampled of the subject using any method or commercial kit known in the field. Part of the nucleic acid-based assay, as in some of the preferred embodiments, may involve also conducting polymerase chain reaction (PCR) towards the isolated DNA or RNA materials in conjunction with predetermined thermal cycles and conditions to amplify gene or part of the gene to be analyzed in the test module for concluding the pathologic grading. These pre-treatments or processes can form part of the nucleic acid-based assay to finally lead to identification of the allele, mutations and/or genotype of the desired genes for neoplasm grading and categorizing. Pursuant to some preferred embodiments of the disclosed method, the nucleic-acid based assay is preferably performed to identify, detect and/or genotype potential mutations resided in one or more genes of the subject giving rise to neoplasm or cancer development. The nucleic acid-based assay of the present disclosure comprises sequencing the genes of interested. The sequencing can be performed onto polynucleotides amplified and/or duplicated from the DNA or RNA materials isolated from the breast tissue. More specifically, the sequencing approach implementable in the present disclosure to effect the mutation identification or detection can be Sanger sequencing and/or ultra-deep targeted amplicon sequencing, which is effective and capable of catering highly precise and reliable result in identifying the interested mutations setting forth in the test modules. The details of the Sanger sequencing and/or ultra-deep targeted amplicon sequencing are further elaborated in the examples incorporated hereafter. It is important for other skilled artisans to appreciate the fact that the disclosed method can be conducted utilizing other known sequencing equivalent or non-equivalent procedures or approaches to detect presence of the interested mutation in the analyzed polynucleotides and such modification shall not depart from the scope of the present disclosure. Other known processes implementable to identify or assist in identifying these mutations can be any one of, but not limited to, temperature gradient gel electrophoresis, capillary electrophoresis, amplification-refractory mutation system-polymerase chain reaction (ARMS-PCR), dynamic allele-specific hybridization (DASH), target capture for next generation sequencing (NGS), high-density oligonucleotide SNP arrays or Restriction fragment length polymorphism (RFLP). The present disclosure utilizes pattern, outcome or results generated from the test modules, with regard to the predetermined gene correlated to the given module, to grade the neoplasm stage or type of the tissue sample rather than mere resorting specific primers or a single platform to realize the grading or categorizing. Modification towards the primers or platform implementable to effect the disclosed method in neoplasm grading such as varying length or hybridizing location of the sequencing primers shall fall within the scope of the present disclosure.
As described in the foregoing, the first test module and the second test module are respectively associated with the detection of at least one mutation of one or more gene specifically correlated to the test module and provide an outcome applicable for subsequent grading the neoplasm stage or type of the tissue sampled. More specifically, in a number of preferred embodiments, the first test module is associated to the detection of mutations resided in the MED12 gene, TERT gene and/or RARA gene. More preferable, the first module is associated to the detection of more than one mutation relating to the MED12 gene, TERT gene and/or RARA gene. For instance, the mutation detectable for MED12 gene and being associated with the first test module can be any one of a splice site mutation located at position −8 of exon 2 of the MED12 gene, a missense mutation located at codon 44 of cDNA of the MED12 gene or a missense mutation located at codon 36 of cDNA of the MED12 gene. Likewise, detectable mutations for RARA gene associated with the first test module can be any one of missense mutations resulting p.F286del, p.F287L, p.N299H, p.R394Q, p.L409del and/or p.G289R in polypeptide translated thereof. Furthermore, the mutation to be detected in TERT gene is preferably located at the promoter region. For instance, mutation located at −124 and/or −146 of the promoter region of the TERT gene leading to missense mutation
According to a number of embodiments of the disclosed method, the second test module is preferably associated with detectable predetermined mutations resided in FLNA gene, SETD2 gene and/or MLL2 gene. More specifically, the one or more mutations to be detected for FLNA gene in association with the second test module generally gives rise to p.A1191T, p.S1199L, p.P1244S, p. 1687-1688TV>M and/or p.S1186W in a polypeptide translated thereof. Mutation relating to SETD2 gene and associating to the second test module is any one or combined mutations give rises to p.R1674-1675EA>D, p.K1587fs, p.Q1545*, p.Y1605fs and/or p.F1651fs translatable from the SETD2 gene. Similarly, mutations of MLL2 gene to be detect and associated with the second test module is any mutation generally causing inactivating mutation such as p.V5482fs, p.Q1139*, p.G2668fs, p.Q3814* and/or p.L3457fs found in a polypeptide encoded by the MLL2 gene. For few preferred embodiments, the second test module of the disclosed method can be further associated with mutations of other genes in addition to the FLNA gene, SETD2 gene and/or MLL2 gene. These extra genes with interested mutations being inferable or indicative of the breast neoplasm type or stage that can be associated to the second test module are BCL-6 corepressor protein (BCOR) gene and Mitogen-activated protein kinase kinase kinase 1 (MAP3K1) gene.
In some embodiments of the disclosed method, the second test module may be configured to detect presence of mutations in genes of the subject other than the FLNA gene, SETD2 gene and/or MLL2 gene. For example, the second test module can be arranged to discover inactivating mutations such as p.K460*, p.W534* and/or p.K175fs in B-Cell CLL/Lymphoma 6 corepressor (BCOR) gene of the subject, or somatic mutation like p.M312fs and/or p.Q409fs in Mitogen-Activated Protein Kinase Kinase Kinase 1 (MAP3K1) gene of the subject.
To better grade or identify the neoplasm stage of the sampled breast tissue, the performing one or more nucleic-acid based assays may further comprise a third test module being associated with detection of mutation in NF1 gene, mutation in RB1 gene and/or mutation in PIK3CA gene. With the aid of the third testing module, the disclosed method can further grade or identify the tissue sample, which has advance to the stage of borderline malignant or malignant phyllodes tumor. Preferably, mutations of NF1 gene being associated with the third test module are mutations relating to p.K1014*, p.R416* and/or p.D2283fs found in polypeptide translatable thereof. These mutations targeted in the third test module for NF1 gene are resulting in either nonsense mutation or frameshift mutation that leads to neoplasm development. Similarly, mutations of RB1 gene being associated with the third test module are mutation regarding p.Q504*, p.N316fs, and/or p.P796fs found in polypeptide translated thereof. These mutations of RB1 gene cause also either nonsense or frameshift mutation. For PIK3CA gene, the interested mutation associated with the third test module mainly relates to p.H1047R/L, which is a missense mutation.
Further embodiments of the disclosed method can have more mutations of other relevant genes discovered through the third test module besides mutation in NF1 gene, RB1 gene and/or PIK3CA gene. With more mutated sites covered, the disclosed method is able to offer higher accuracy to timely detect, diagnose, categorize, group or recognize various stage of neoplasm development in the subject. The third test module can be used to detect recurrent mutation associated to p.L62R in a polypeptide encoded by epidermal growth factor receptor (EGFR) gene, inframe deletion mutation associated to p.I33del found in polypeptide encoded by Phosphatase and Tensin Homolog (PTEN) gene, inactivating frameshift mutation associated to p. 294fs or p.C229fs found in polypeptide encoded by Tumor Protein P53 (TP53) gene, somatic mutation associated to p.W407R found in polypeptide encoded by Erb-B2 Receptor Tyrosine Kinase 4 (ERBB4) gene, and/or duplication of Insulin-Like Growth Factor 1 Receptor (IGF1R) gene in the subject.
In accordance with the preferred embodiments, the first, second and/or the third module is fashioned to yield a positive outcome as far as at least one predetermined mutation of the gene associated to that particular test module is detected and vice versa. For example, the first test module brings forth a positive outcome in response to at least one mutation detected in MED12 gene, TERT gene and/or RARA gene of the breast tissue of the subject. On the contrary, the first test module yields a negative outcome in the absence of any detectable predetermined mutations relating to MED12 gene, TERT gene and/or RARA gene. The like principle is applicable for the second and the third test module to realize the disclosed method in neoplasm grading and identification. In a number of the preferred embodiments, each mutation of the gene under consideration in a test module may be subjected to separate nucleic acid-based assay operating under different known principles or platform for detection or identification. For these embodiments, the nucleic-acid based assays for respective mutations of the involved gene may not be performed in a concurrent basis but rather the results generated from each of the assays are retrieved or collected to the associated test module to compute or yield an outcome thereof. The disclosed method may simultaneously run the like assays for detecting mutations or allele of genes respectively associated to different modules in a single operation of the same platform according to other preferred embodiments that the results of each analyzed mutation will be then pulled to associate with the predetermined modules to compute the outcome for subsequent neoplasm grading. In line with the aforesaid, the nucleic acid-based assays described herein are free from being tied to a single operating platform or mechanism though identifying the interested mutations of the relevant genes under one platform is preferable for cost and/or time saving.
As setting forth in the foregoing description, the present disclosed method effectively grades the type of neoplasm in response to the outcome computed, generated or signaled by the test module used. In accordance with a plurality of the preferred embodiments, the disclosed method regards the type of neoplasm of the sampled breast tissue as fibroadenomas when the outcome of the first test module and the second test module are respectively positive and negative. It was found by inventors of the present disclosure that biomarker related to early onset of FA can be linked to mutations detectable in MED12, TERT gene and/or RARA gene of the subject. Whilst, FA samples are generally free from any detectable mutations in those gene associated to the second test module such as FLNA, SETD2, MLL2, BCOR, MAP3K1. Similarly, the present disclosure also correlates the analyzed breast tissues as FA when the third module delivers a negative outcome, in addition to respective positive and negative outcome of the first and second test modules, indicating no substantial interested mutations can be detected in those genes in association to the third module.
The disclosed method may regard the type of neoplasm as phyllodes tumor when the outcome of the first test module and the second test module are both positive according to the preferred embodiments. Based upon test and experiments performed, the present disclosure recognizes that developed phyllodes tumor appears to possess mutations for genes associated with both first and second test modules. Particularly, the neoplasm type of the sampled breast tissues can be conveniently regarded as phyllodes tumor in line with presence of the considered mutations in FLNA, SETD2, MLL2, BCOR or MAP3K1 gene besides identified interest mutations in gene associated with the first test module.
In order to further differentiate or sub-grade the identified phyllodes tumor, the present disclosure, in some preferred embodiments, bring forth the third test module to detect mutations of extra genes of the subject in addition to those present in the first and second modules. Particularly, the type of neoplasm of the acquired breast tissue is regarded as malignant phyllodes tumor when the outcome of the first test module, the second test module and the third test module are all positive, meaning that the acquired sample carries at least one mutated gene in each of the test module. It is possible also the sample or subjected tested with positive outcome of a test module may harbor two or more mutations in one or more genes associated with that particular test module. On the other hand, the disclosed method preferably regards the type of neoplasm of the sampled breast tissue as benign phyllodes tumor when the outcome of the third test module is negative and outcomes of both first and second test modules are positive.
Another aspect of the present disclosure relates to a kit for identifying type of neoplasm in a breast tissue of a subject. Preferably, the kit comprises at least one platform capable of performing one or more nucleic-acid based assays to identify mutations present in the breast tissue acquired from the subject corresponding to a first test module and a second test module, each of the first and second test modules being associated with detection of at least one predetermined mutation of one or more genes and configured to provide a positive outcome corresponding to at least one predetermined mutation detected in the tissue or a negative outcome corresponding to absence of detectable predetermined mutation in the sample, the first test module being associated with detection of mutation in MED12 gene, TERT gene and/or mutation in RARA gene, the second test module being associated with detection of mutation in FLNA gene, mutation in SETD2 gene and/or mutation in MLL2 gene.
The at least one platform operable for the disclosed kit to identify or assist in identifying these mutations can be any one of, but not limited to, temperature gradient gel electrophoresis, capillary electrophoresis, amplification-refractory mutation system-polymerase chain reaction (ARMS-PCR), dynamic allele-specific hybridization (DASH), target capture for next generation sequencing (NGS), high-density oligonucleotide SNP arrays or Restriction fragment length polymorphism (RFLP). Referring to a number of the preferred embodiments of the disclosed kit, each mutation of the gene under consideration in the test module may be subjected to separate nucleic acid-based assay carried in at least one of the aforesaid platforms for detection or identification. In some embodiments, the nucleic-acid based assays for respective mutations of the involved gene may not be performed in a concurrent basis but rather the results generated from each of the assays are collected to the associated test module to compute or yield an outcome thereof. For other preferred embodiments, the disclosed kit can be used, at least be part of, to simultaneously run the like assays for detecting mutations or allele of genes respectively associated to different modules under a single platform that the results of each analyzed mutation will be then pulled to associate with the predetermined modules to compute the outcome for subsequent neoplasm grading.
In more preferred embodiments, the nucleic acid-based assay of the disclosed kit comprises sequencing the genes of interested. Sequencing can be performed onto polynucleotides amplified and/or duplicated from DNA or RNA materials isolated from the breast tissue, more preferably stromal or epithelial cells of the breast tissue. More specifically, the sequencing approach implementable in the present disclosure to effect the mutation identification or detection can be Sanger sequencing and/or ultra-deep targeted amplicon sequencing which is effective and capable of catering highly precise and reliable result in identifying the interested mutations setting forth in the test modules.
In line with the foregoing description, the first test module and the second test module of the disclosed kit are respectively associated with the detection of at least one mutation of one or more gene specifically correlated to the test module and provide an outcome applicable for subsequent grading the neoplasm stage or type of the tissue sampled. More specifically, in a number of preferred embodiments of the disclosed kit, the first test module is associated to the detection of mutations resided in the MED12 gene, TERT gene and/or RARA gene. More preferable, the first module is associated to the detection of more than one mutation relating to the MED12 gene, TERT gene and/or RARA gene. For instance, the mutation detectable for MED12 gene and being associated with the first test module can be any one of a splice site mutation located at position −8 of exon 2 of the MED12 gene, a missense mutation located at codon 44 of cDNA of the MED12 gene or a missense mutation located at codon 36 of cDNA of the MED12 gene. Likewise, detectable mutations for RARA gene associated with the first test module can be any one of missense mutations corresponds to p.F286del, p.F287L, p.N299H, p.R394Q, p.L409del and/or p.G289R found in a polypeptide translated thereof. For TERT gene, the mutation to be detected is preferably located at the promoter region, e.g. mutation located at −124 and/or −146 of the promoter region of the TERT gene that finally results in missense mutation
In accordance with some preferred embodiments, the second test module is preferably associated with detectable predetermined mutations resided in FLNA gene, SETD2 gene and/or MLL2 gene. More specifically, the one or more mutations to be detected for FLNA gene in association with the second test module is mutation corresponding to p.A1191T, p.S1199L, p.P1244S, p. 1687-1688TV>M and/or p.S1186W in a polypeptide translatable from the FLNA gene of the subject. Mutation relating to SETD2 gene and associating to the second test module is any one or combined mutations resulting in p.R1674-1675EA>D, p.K1587fs, p.Q1545*, p.Y1605fs and/or p.F1651fs found in a polypeptide produced thereof. Similarly, mutations of MLL2 gene to be detect and associated with the second test module is any one or combined mutations causing inactivating mutations like p.V5482fs, p.Q1139*, p.G2668fs, p.Q3814* and/or p.L3457fs found in a polypeptide encoded thereof. For few preferred embodiments, the second test module of the disclosed kit can be further associated with mutations of other genes in addition to the FLNA gene, SETD2 gene and/or MLL2 gene. These extra genes with interested mutations being inferable or indicative of the breast neoplasm type or stage that can be associated to the second test module are BCL-6 corepressor protein (BCOR) gene and Mitogen-activated protein kinase kinase kinase 1 (MAP3K1) gene.
The disclosed kit facilitates utilization of the pattern, outcome or results generated from the test modules, with regard to the predetermined and correlated gene, to grade the neoplasm stage or type of the tissue sample. Preferably, the type of neoplasm of the sampled breast tissue is regarded as fibroadenomas when the outcome of the first test module and the second test module are respectively positive and negative. Conversely, the neoplasm type of the tested breast tissue is regarded as benign phyllodes tumor when the outcome of the first test module and the second test module are both positive.
In more preferable embodiments, the disclosed kit can further comprise a third test module being associated with detection of mutation in NF1 gene, mutation in RB1 gene and/or mutation in PIK3CA gene. With the aid of the third testing module, the disclosed kit favors further grading or identification of neoplasm type relating to the tissue sample, which may have advanced to the stage of borderline malignant or malignant phyllodes tumor. Preferably, mutations of NF1 gene being associated with the third test module are mutations relating to p.K1014*, p.R416* and/or p.D2283fs found in polypeptide encoded thereof. Similarly, mutations of RB1 gene being associated with the third test module are mutations regarding p.Q504*, p.N316fs, and/or p.P796fs found in correspondingly encoded polypeptide. For PIK3CA gene, the interested mutations associated with the third test module generally relates to mutation resulting in p.H1047R/L of the polypeptide encoded. Accordingly, the type of neoplasm of the acquired breast tissue is regarded, through the disclosed kit, as malignant phyllodes tumor when the outcome of the first test module, the second test module and the third test module are all positive, meaning that the acquired sample carries at least one mutated gene in each of the test module. On the other hand, the disclosed kit preferably enables the user of the kit to regard the type of neoplasm of the sampled breast tissue as benign phyllodes tumor when the outcome of the third test module is negative and outcomes of both first and second test modules are positive.
To accelerate results generation from the disclosed kit, the test modules are preferably configured to emit a detectable or visual signal corresponds to any positive outcome and vice versa. As far as one interested mutation associated to a given module is identified, a machine or user readable signal will be produced to highlight a positive outcome obtained thereof. For example, the disclosed kit can adopt an embodiment in the form of DNA chip on which various polynucleotides anchored to readily hybridize with target gene fragments, potentially harboring the interested mutation, amplified from the sampled breast tissue. The test module can be a group of polynucleotides or a dedicated area on the chip. A discrete spot, attached with the polynucleotide designed specifically to hybridize only with the target gene fragment bearing the mutation of interest under stringent condition, on DNA chip and belong to a particular test module shall give rise to a signal readable by a microarray machine in the occurrence of a successful hybridization to notify and associate detection of the interested mutation to that particular test module for yielding a positive outcome.
The above described method and/or kit can cater supportive diagnosis in addition to conventional neoplasm grading or categorizing based on physical examination of the breast biopsy with or without further staining. The physician may have to conduct re-examination of the sampled tissues if there exist discrepancies between the results concluded from histologic examination and the disclosed method and/or kit. For instance, a sample being regarded as FA or benign PT with positive outcome in all three test modules shall be subjected to re-examination by the physician. It is clearly shown in the experiments of the present disclosure that FA or benign PT sample shall be clear of any mutations resided in genes associated to the third module of the disclosed method and/or kit. It is highly possible that the result of the physical or histologic examination is false negative. Health of the tested subject can be in jeopardy due to delay of treatment in view of the false result. The disclosed method and/or kit of the present disclosure offers extra mechanism to work against false negative or positive results arisen from subjective histologic examination, which is primarily relied upon experience of the physician performing the session.
The following example is intended to further illustrate the invention, without any intent for the invention to be limited to the specific embodiments described therein.
Fibroepithelial tumors were diagnosed and subtyped according to clinical features and histopathological examination of surgically excised tumors. All cases were histologically reviewed by at least 2 expert breast pathologists. Criteria for diagnosis and grading were based on recommendations of the WHO Classification of Tumours of the Breast1. Briefly, phyllodes tumors were diagnosed when the fibroepithelial neoplasms showed an exaggerated intracanalicular pattern with leaf-like fronds accompanied by stromal hypercellularity. A benign phyllodes tumor was concluded when the lesion showed mild stromal cellularity with minimal nuclear atypia, pushing borders and mitoses of 4 or less per 10 high power fields, without stromal overgrowth. A diagnosis of malignant phyllodes tumor was rendered when there was marked stromal cellularity and atypia, presence of stromal overgrowth and permeative margins, with mitotic activity of 10 or more per 10 high power fields. Tumors with intermediate features were regarded as borderline. All 100 cases consisting of 21 FAs and 79 PTs were from fresh frozen tissue. Details of the samples employed in the present study are provided in Table 1 below. Of these, 69 cases had matching normal tissue. An additional five cases from FFPE (formalin-fixed paraffin embedded) slides, comprising concurrent (n=3) and longitudinal (n=2) cases were later included in the study.
Tumors and whole-blood were obtained from patients undergoing surgical excision of fibroepithelial tumors with informed consent. Genomic DNA (gDNA) from fresh frozen tissue was extracted and purified using the Qiagen Blood and Cell Culture DNA kit. Genomic DNA yield and quality were determined using Picogreen™ fluorometric analysis as well as visual inspection of agarose gel electrophoresis images. For FFPE samples from concurrent or longitudinal fibroepithelial tumors, the Qiagen FFPE Tissue Kit was used.
Whole-exome sequencing was performed in 22 phyllodes tumors with matched tumor-normal pairs. Native genomic DNA was fragmented with the Covaris S2 (Covaris) system using recommended settings. Sequencing adaptor ligation was performed using the TruSeq Paired-End Genomic DNA kit (Illumina). For enrichment of coding sequences, we used the TruSeq Exome Enrichment kit (Illumina) according to the manufacturer's recommended protocol. Exome-enriched libraries were then sequenced on the 11lumina HiSeq 2000 instrument to generate 76 bp paired-end reads. Bioinformatics analysis, comprising sequence alignment, variant calling and identification of candidate somatic variants was performed as described in previous work46. Variants were filtered to retain only those covered by at least 15 reads and having at least three variant reads. Furthermore, those with variant allele frequencies (VAFs) lower than 5% were excluded. Indels overlapping simple repeat regions were discarded. All remaining candidate variants were visually inspected in the IGV genome browser47 to exclude likely germline mutations and sequencing artifacts. The synonymous mutations identified in the exome sequencing are included in Table 2 below.
PCR amplification was conducted with Platinum Taq Polymerase (Life Technologies). The PCR program included one cycle at 95° C. for 10 min, 35 cycles at 95° C. for 30 s, 58° C. for 30 s and 72° C. for 1 min and one cycle at 72° C. for 10 min. The BigDye Terminator v.3.1 kit (Applied Biosystems) was used for bidirectional sequencing on generated PCR amplicons, and products were fractionated using the ABI PRISM 3730 Genetic Analyzer (Applied Biosystems). Sequencing traces were aligned to reference sequences using Lasergene 10.1 (DNASTAR) and were visually analysed. The present disclosure selected 60 putative somatic mutations for Sanger validation (both tumor and normal sample) comprising mutations in recurrently mutated genes, cancer-associated genes and randomly selected genes. Of these, 54 mutations were successfully validated, 4 were found to be false positives and 2 failed to sequence, indicating a true positive rate of 90%. Validated mutations are highlighted with an asterisk in Table 3.
BCOR*
BCOR*
FLNA*
FLNA*
FLNA*
FLNA*
MED12*
MED12*
MED12*
MED12*
MED12*
MED12*
MED12*
MED12*
MED12*
MED12*
MLL2*
MLL2*
RARA*
RARA*
RARA*
RARA*
RARA*
TP53*
TP53*
The present disclosure performed whole-exome sequencing of 22 matched tumor-normal pairs of PTs, including 10 benign, 8 borderline and 4 malignant PTs (Table 1). The PT exomes and matched normal samples were sequenced to a mean coverage of 66-fold, and on average 78% of bases were covered by at least 20 reads (Table 2). Inventors of the present disclosure identified a total of 333 non-synonymous or splice site somatic mutations in 310 genes. Sanger sequencing of recurrent (mutated in at least two cases) and singleton mutations of interest attained a 90% validation rate (Table 3). Despite a relatively lower depth of coverage compared to previous FA study (66× vs 124×) conducted by the inventors, the median count of non-silent somatic mutations/case in PTs was higher than in FAs (13 vs 5, p<0.001) as shown in
A panel of 50 selected genes (including recurrently mutated genes in the PT discovery cohort, genes mutated in FA2, and also genes associated with breast cancer20) was designed using the SureDesign tool (Agilent). Sequencing libraries were prepared from extracted DNA from 68 paired tumor-normal samples and 32 tumors using the SureSelect XT2 Target Enrichment System for Illumina Multiplexed Sequencing platform (Illumina) according to manufacturer's instructions. Target-enriched libraries were then sequenced on Illumina's HiSeq 2000 sequencing platform to generate 76 bp paired-end reads. For paired tumor-normal samples, analysis was performed as described in the exome sequencing analysis portion. In addition, due to higher sequencing coverage (samples had an average coverage in target region of at least 228×), the Strelka48 (Illumina) somatic variant caller was used to identify low-allele frequency variants (at least 3%). All candidate variants were visually inspected in IGV to confirm that they are probably somatic. For patients where only the tumor sample was available, only variants in genes that were recurrently mutated among the paired tumor-normal samples were considered. The present disclosure also used a stricter variant allele frequency cut-off for the SNVs (at least 5%) and indels (at least 10%). Variants overlapping simple repeat regions were discarded, as were variants with dbSNP49 (version 137) entries. Variants were also filtered against an in-house database containing germline variants identified in approximately 512 East Asian exomes to further remove likely germline polymorphisms. These variants were also visually inspected in IGV to exclude probable sequencing artefacts.
To validate the PT exome-sequencing data, inventors of the present disclosure performed targeted deep sequencing in a prevalence cohort of 100 fibroepithelial tumors (21 FAs, 34 benign, 35 borderline and 10 malignant PTs as shown in Table 1), which included 22 cases from the discovery cohort. The present disclosure sequenced a total of 50 genes, comprising recurrently mutated genes and singletons of interest in our discovery cohort, as well as genes previously reported to be mutated in FAs2 and BCs20-22,27. The mean average coverage of target genes was 524× (minimum of 228×). The present disclosure acknowledges that the relatively low average depth of coverage (66×) attained in the exome sequencing of PTs is a limitation of the present study and may have resulted in under-calling of sequence variants. This is supported by further observation that 11 of 59 mutations identified by targeted sequencing (cut-off at 20% variant frequency) were missed by exome sequencing, resulting in a false-negative rate of 18.6%, likely due to low coverage. Also, due to the rarity of PTs and a relatively small discovery cohort, this study may have missed mutations occurring at low frequencies across patients as these would have been excluded from the targeted sequencing panel.
Copy number estimates for each of the genes in the targeted sequencing study were obtained using the OncoCNV28. Briefly, depth of coverage information for each targeted regions were generated from BAM files and normalized against a pool of normal samples as well as GC content. Probe-level copy number estimates were then aggregated to obtain gene-level copy number estimates. Genes with copy number estimates less than 1.5 or more than 3 were considered to have copy number gains or losses. To identify copy number alterations (CNAs) and regions with LOH in our exome sequencing cohort, we used Control-FREEC29.
To investigate the potential functions of the point mutations, we performed the mutation prediction algorithms, such as SIFT50, Polyphen251, CHASM52 and PROVEAN53, respectively. The functional mutations were shown as damaging or probably damaging and deleterious in Table 4. Cancer-specific mutations were shown as drivers or passengers. Neutral mutations were shown as tolerated or benign.
BCOR
BCOR
BCOR
BCOR
BCOR
CHD8
CHD8
DNAH11
DNAH11
DNAH11
EGFR
EGFR
ERBB4
ERBB4
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA
FLNA #
MAP3K1
MAP3K1
MAP3K1
MAP3K1
MAP3K1
MAP3K1
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12
MED12**
MED12**
MLL2
MLL2
MLL2
MLL2
MLL2
MLL2
MLL2
MLL2
MLL2
MLL2
MLL2
MLL2
MLL2
NF1
NF1
NF1
NF1
NF1
NF1
NF1
PCLO
PCLO
PCLO
PCNXIA
PCNXIA
PIK3CA
PIK3CA
PIK3CA
PIK3CA
PIK3CA
PIK3CA
PIK3CA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RARA
RB1
RB1
RB1
RB1
RB1
RB1
RB1
ROS1
ROS1
RUNX1
RUNX1
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SETD2
SYNE1
SYNE1
TP53
TP53
TP53
From the experiments carried as described above, the present disclosure identified 20 recurrently mutated genes in the fibroepithelial tumors as being summarized in
A comparison of recurrent mutations across the fibroepithelial tumors revealed distinct patterns of mutations and pathways associated with different phases in the breast fibroepithelial tumor spectrum as shown in
Second, the present disclosure also observed novel mutations in FLNA, SETD2, MLL2, BCOR and MAP3K1 in PTs (benign, borderline and malignant) that were rarely present in FAs (
Besides FLNA, over one third (35%) of PTs also harboured mutations in at least one of two chromatin modifying enzymes; SETD2 (21%) and MLL2 (12%) (Fisher's test p-value compared to FA=0.0058 and further in view of
Third, compared to benign PTs, borderline and malignant PTs also exhibited additional mutations in NF1, RB1, TP53, PIK3CA, ERBB4 and EGFR, which are known cancer-driver genes that have transforming ability. Copy number alterations (CNAs) of these genes were also found in borderline/malignant PTs. These findings are consistent with previous studies whereby TP53 and RB1 were found to be deregulated in malignant PTs39-41. Interestingly, although the frequency of alterations in each individual cancer-related gene was low, 29% (13/45) of borderline/malignant PTs exhibited probable driver alterations (defined as either COSMIC recurrent mutations and loss-of-function mutations (nonsense/frameshift) or high-level CNAs) in at least one cancer-related gene. In contrast, none of the 55 FAs and benign PTs (0/55, 0%) harboured genetic alterations in these genes (Fisher's exact test, p-value=1.02E-05). These results suggest that these cancer-related genes may be involved in a subset of higher-grade PTs. Notably, two tumors clearly contained bona-fide PIK3CA activating mutations (H1047R/L), and two tumors harboured high level EGFR amplifications as shown in
Like FAs, PTs are fibroepithelial tumors, comprising an admixture of epithelial and stromal compartments. To determine the location and distribution of the PT-associated mutations identified in this study, the present disclosure further performed laser capture microdissection (LCM) on 6 PTs from the discovery series. Isolated epithelial and stromal components were analysed separately for mutations in MED12, RARA, FLNA, SETD2, BRCA1 and PIK3CA, the latter two genes being frequently mutated in BCs but less so in PTs (see next paragraph). Briefly, 6 fresh frozen tissues from phyllodes tumors were embedded in Optimal Cutting Temperature (OCT) compound (Tissue-Tek, Sakura Finetek), and sections (8 μm thick) were cut in a Microtome-cryostat (Leica), mounted onto Arcturus® PEN membrane glass slides (Life Technologies), and then stored at −80° C. till required. Slides were dehydrated & stained with Arcturus® Histogene® following manufacturer's recommendations. The stained slide was loaded onto the laser capture microscope stage (ArcturusXT™ Laser Capture Microdissection (LCM) System). A Capsure™ Macro LCM cap (Life Technologies) was then placed automatically over the chosen area of the tissue. Once the cells of interest that were highlighted by the software were verified by the user, the machine automatically dissected out the highlighted cells of interest using a near infrared laser or UV pulse that transferred them onto the Capsure™ Macro LCM Cap. The DNA was extracted directly from LCM caps using Qiagen FFPE DNA Tissue kit following manufacturer's protocol with the following modifications. Each sample cap was incubated with the lysis buffer (ATL & Proteinase K) in a 500 μl microcentrifuge at 60° C. for 5 hrs and enzyme deactivation was carried out at 90° C. for 10 minutes. The eluted DNA was used directly for PCR and Big Dye® sequencing.
The present disclosure found that all of the PT-associated mutations were present in the stromal cells and not in the epithelial cells. These observations are consistent with previous study in FAs where MED12 mutations are detected exclusively in the tumor stroma, suggesting that FAs and PTs likely originate from stromal cells rather than epithelial cells, in spite of their biphasic epithelial-stromal morphological appearance. It is important to note, however, that the present results do not exclude the possibility that genetic alterations may also be present in the epithelial compartment of fibroepithelial tumors. By using OncoCNV analysis, LOH in chrlq was observed in 21% of fibroepithelial tumors in the present disclosure, consistent with a previous study42. Furthermore, epithelial alterations are frequently observed in PT43 and histopathological assessment, as in Table 6, indicates that 49% of PTs (32/65, with assessable epithelial compartments) can exhibit moderate-to-florid usual ductal hyperplasia, an epithelial phenotype4. It is thus possible that breast fibroepithelial tumor development may involve a complex interplay between the epithelial and stromal compartments of these tumors, warranting further studies.
Comparisons between the spectrum and frequency of mutations in fibroepithelial tumors compared to BCs revealed significant distinctions as summarized in
Full-length RARA cDNAs were cloned into pcDNA3.1 with a 3× Flag tag. The patient-derived mutations were introduced using the QuikChange II XL site-directed mutagenesis kit (Agilent) as described by manufacturer's instructions. The transcriptional activity of wild-type and mutant RARA was assessed by a luciferase assay using the RARE (retinoic acid response element) Cignal reporter assay kit (Qiagen). HEK293 cells were transiently transfected with the RARE reporter construct and Renilla luciferase constructs from the kit, together with the wild-type or mutant RARA plasmids as described above. The transfected cells were then incubated with the indicated concentrations of RA for 24 hours. The luciferase assay was performed using the Dual Luciferase Reporter Assay System (Promega) according to the manufacturer's instructions. Results were normalized to co-expressed Renilla.
For mammalian two-hybrid assays, the RARA ligand binding domain was cloned into pACT vector (Promega) to generate the bait plasmid while the cDNA sequence coding for the CoRNR1 peptide region (THRLITLADHICQIITQDFARNQV) of the NCoR1 protein was inserted into pBIND create the prey plasmid. Mammalian two hybrid screens were carried out with CheckMate Mammalian Two-Hybrid System (Promega) following the manufacturer's protocol. Briefly, transfected HEK293T cells were treated with the indicated concentrations of RA for 24 hours and assayed for luciferase activity. Results were normalized to co-expressed Renilla.
Given the strikingly high frequency of RARA mutations specifically in fibroepithelial tumors as shown in
To confirm the expression of mutant FLNA, the present study also sequenced the cDNA of three FLNA mutant samples with available fresh frozen tissue. One hundred ng of RNA were converted to cDNA with SuperScript III First-Strand Synthesis SuperMix from Invitrogen according to manufacturer's recommended protocol. PCR was performed according to the primers listed in the Table 7. PCR amplification, sequencing and fractionation were performed as described above for Sanger sequencing of genomic DNA.
Finally, the present disclosure investigated if FAs might progress to malignant PT in a linear fashion, as proposed in previous studies6-10. Using the same targeted 50-gene panel, the experiment sequenced a set of paired concurrent FA and PT-like regions isolated from the same patients (N=3). The present disclosure also analysed paired longitudinal tumors from two patients originally diagnosed with FAs that were subsequently followed by PT-like recurrences. It was found that even in the same patient, higher-grade PTs harboured more mutations than the paired FA regions, especially in cancer-associated genes as indicated in Table 8 and
The genomic landscape of breast fibroepithelial tumors described in the present disclosure may have significant clinical implications. As mentioned earlier, the diagnosis and histopathologic classification of PT often present challenges to pathologists. The present disclosure provides the foundation for a genomics-based classification of breast fibroepithelial tumors, which may increase diagnostic accuracy when used in combination with histopathological criteria. For example, based on the acquired sequencing data, Sample 004, previously classified histologically as a benign FA, was found to harbour RB1 truncating and EGFR activating mutations, in addition to MED12, RARA and FLNA mutations referring to
Although disclosed method and kit have been described in their preferred form with a degree of particularity, it is understood that the present disclosure of the preferred forms have been made only by way of example and that numerous changes in the details of construction and the combination and arrangements of parts may be resorted to without departing from the scope of the present disclosure.
Number | Date | Country | Kind |
---|---|---|---|
10201500462U | Jan 2015 | SG | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/SG2015/050368 | 10/4/2015 | WO | 00 |