The present disclosure relates to methods of detecting cancer cells, and specifically detecting breast cancer cells from breast milk-derived cells.
Detection and treatment prior to lymphatic spread is associated with favorable outcomes in breast cancer. For example, in the case of women who have estrogen receptor alpha-positive (ERα+) breast cancer and are treated with anti-estrogens, recurrences occur at a steady rate from years 5 to 20 after diagnosis. However, this recurrence is dependent on the status of lymph nodes at the time of diagnosis. Distal recurrence is 13% in women without lymph node involvement but is 34% in women with four or more lymph nodes involved. Radiologic techniques remain the mainstay of breast cancer detection but these methods detect tumors above 5 mm in diameter. Unfortunately, these methods are still not always reliable, with high rate of false positives and false negatives. An examination of the Dutch national registry for the frequency of missed breast cancers in women participating in a high-risk MRI screening program has revealed the limitations of using only radiological techniques for diagnosis. Among 131 breast cancer cases with a negative MRI 0-24 months before cancer detection, 31% of cases had MRI detectable cancers that were missed, whereas 34% of cases showed minimal signs. Therefore, complementary molecular assays are needed to improve earlier cancer detection.
Breast milk can be one source of cancer cells as it contains a variety of cell types. Cells from breast milk have been cultured and characterized for stemness activity and have been suggested to be a source of cells for regenerative medicine. Earlier culturing methods favored the propagation of stem/basal and mesenchymal stem cells from the breast milk.
Since the majority of breast cancers likely originate from luminal progenitor cells, earlier methods are not ideal for the expansion of luminal and cancer cells from breast milk for robust characterization. We recently reported a breast epithelial culturing method that did not require feeder layer cells and allowed for the propagation of breast epithelial cells that phenotypically correspond to stem, luminal progenitor and mature/differentiated luminal cells.
We initiated this study upon the contribution of breast milk from a patient who was diagnosed with ER+/Progesterone Receptor (PR)-positive (ER+/PR+) cancer in her right breast while breast feeding. We determined whether the above technique can be used to propagate and characterize cancer cells in breast milk and show that breast milk contains cells that display cancer stem cell (CSC) properties and carry mutations that are enriched in breast cancer. Since incidence of breast cancer during pregnancy is 1 in 3000, is expected to increase due to delay in childbearing, and breast cancers diagnosed during pregnancy or postpartum tend to be highly metastatic, the method described here is useful for rapid characterization of cancer cells in these patients including identification of driver mutations and screening for drugs that may potentially target these mutations.
Radiologic techniques remain the main method for early detection for breast cancer and are critical to achieve a favorable outcome from cancer. However, more sensitive detection methods to complement radiologic techniques are needed to enhance early detection and treatment strategies. Using our recently established culturing method that allows propagation of normal and cancerous breast epithelial cells of luminal origin, flow cytometry characterization, and genomic sequencing, we show that cancer cells can be detected in breast milk. Cells derived from milk from the breast with cancer were enriched for CD49f+/EpCAM−, CD44+/CD24− and CD271+ cancer stem-like cells (CSC). These CSC carried mutations within the cytoplasmic retention domain of HDAC6, stop/gain insertion in MORF4L1, and deletion mutations within SWI/SNF complex component SMARCC2. CSC were sensitive to HDAC6 inhibitors, BET bromodomain inhibitors, and EZH2 inhibitors, as mutations in SWI/SNF complex components are known to increase sensitivity to these drugs. Among cells derived from breast milk of additional 10 women not known to have breast cancer, two of them contained cells that were enriched for the CSC phenotype and carried mutations in NF1 or KMT2D, which are frequently mutated in breast cancer. Breast milk-derived cells with NF1 mutations also carried copy number variations in CDKN2C, PTEN, and REL genes.
In some embodiments, the methods described herein can enable rapid cancer cell characterization including driver mutation detection and therapeutic screening for pregnancy/post-partum breast cancers. Furthermore, the methods can be used as a surveillance or early detection tool for women at high risk for developing breast cancer.
In some embodiments, the methods presented herein can detect cancer cells non-invasively, which increases the feasibility of using this method in a surveillance setting. Our previous studies have shown that culturing of primary cells does not introduce mutations; therefore, any mutations detected in milk-derived cells are not likely due to culturing artifacts. The methods have an inbuilt control (milk from the other breast without aberrations from the same woman), which should help to eliminate any culture-induced artifacts corrupting cancer-specific mutation detection as well as bias attributable to inter-individual variations in the genotype and phenotype of breast epithelial cells.
In some embodiments, the methods described herein are useful for screening women who are considered to be at a higher risk of developing breast cancer due to family history, germ line breast cancer risk alleles, and dense breasts that hinder radiologic detection. Additionally, since breast milk is enriched for cells with proliferative capacity, the methods can help in cataloguing somatic mutation rates in BRCA1/BRCA2 mutation carriers or any other risk alleles associated with impaired DNA repair pathways.
The present disclosure is generally related to methods of collecting, extracting, and analyzing DNA in a breast milk-derived cell. In particular, methods are provided in accordance with the present disclosure for collecting, extracting, and analyzing DNA in a breast milk sample comprising cancer stem-like cells. In some embodiments, a method is provided for culturing breast milk-derived cells.
In some embodiments, the breast milk is mixed with media and plated directly on a 804G conditioned media coated plate. In some embodiments, the milk to media ratio is 1 to 1 (i.e., 1:1). In some embodiments, the ratio of milk to media is 1:2, 2:1, 1:3, 3:1, 1:4, or 4:1.
In one embodiment, a method of detecting cancer stem-like cells in a breast milk sample is described comprising: (i) extracting the nucleotides from the sample; (ii) amplifying the nucleotides using primers, and (iii) analyzing the nucleotides in the sample. In some embodiments, the method comprises detecting an increase in p63 expression compared to a control.
The method may include extracting the nucleotides from a breast milk sample; analyzing the nucleotides in the sample for gene expression and breast cancer enriched gene aberrations, and quantifying the expression level of a cancer marker. The method may further include a step of amplifying the nucleotides. The method may further comprise detecting the cancer marker. In some embodiments, the cancer marker is p63.
The disclosed method may indicate cancer when the cancer marker expression level as quantified in the breast milk is increased compared to a control sample. In some embodiments, a control sample is provided by the same individual. For example, the opposite breast than the one tested. In some embodiments, a control value is provided by an average population data set. When the p63 expression is at least about 1.5× higher than the control level, the results may indicate the presence of cancer. When p63 expression is about 1.5×, about 2.0×, about 2.5×, about 3.0× about 3.5×, about 4.0×, about 4.5×, about 5.0×, about 5.5×, about 6.0×, about 6.5×, about 7.0×, about 7.5×, about 8.0×, about 8.5×, about 9.0×, about 9.5×, or about 10.0× higher than the control, the results may indicate the presence of cancer.
In one embodiment, the method further comprises quantifying the expression of a caner marker. In some embodiments, the method comprises comparing the expression levels of a cancer marker identified in the sample with a control. In some embodiments, the cancer marker is p63. In some embodiments, the method is performed close in time to the collection of the sample. In other embodiments, the sample is stored prior to performing the method.
In some embodiments, the sample may include the entire amount of breast milk produced in one sitting from one or both breasts. In another embodiment, the sample may include the entire amount of breast milk produced over a 24 hour period from one or both breasts. In some embodiments, the sample may be collected at different time points of breast milk released during a sitting. For example, one sample may be taken during the initial release of breast milk and a second sample may be taken at the end of release of breast milk.
In some embodiments, the sample is collected using a breast pump and collection container.
In one embodiment, the nucleotides are analyzed for mutations. For example, the mutation may indicate a disease state in the breast milk-derived cells. In one embodiment, the disease is cancer, e.g., the cancer may be breast cancer. In one embodiment, the cancer is a non-metastatic cancer. In one embodiment, the nucleotides are analyzed by PCR, e.g., overlap extension PCR (OE PCR), emulsion PCR (EmPCR), or digital droplet PCR (ddPCR) may be used. In one embodiment, the nucleotides are analyzed for the presence of a tumor marker or a tumor recurrence marker. In some embodiments, the tumor marker is p63. In one embodiment, a DNA elongation method can be employed that transforms small DNA molecules into longer DNA fragments that can be analyzed using standard methods, thus allowing the analysis of more DNA molecules of interest than otherwise possible. In one embodiment, the method further comprises monitoring the patient for cancer progression. In one embodiment, the method further comprises determining if the patient is eligible for a targeted cancer therapy.
As described herein, a genetic marker is a specific sequence of DNA at a known location on a chromosome. Examples of genetic markers may include single polymorphism nucleotides (SNPs) and microsatellites. A genetic marker of susceptibility is a specific change in a person's DNA that makes the person more likely to develop certain diseases such as cancer.
As described herein, a biomarker, or molecular marker, is a biological molecule found in blood, urine, other body fluids such as lymph fluid or breast milk, or tissues that is a sign of a normal or abnormal process, or of a condition or disease. A biomarker may be used to see how well the body responds to a treatment for a disease or condition. Many specific biomarkers have been well characterized and repeatedly shown to correctly predict relevant clinical outcomes across a variety of treatments and populations.
As described herein, tumor markers are substances that are produced by cancer cells or by other cells of the body in response to cancer or certain benign (noncancerous) conditions. Most tumor markers are made by normal cells as well as by cancer cells; however, they are produced at much higher levels in cancerous conditions. These substances can be found in the blood, urine, stool, tumor tissue, or other tissues or bodily fluids of some patients with cancer. In particular here, the substances are found in breast milk. Tumor markers may include, e.g., proteins, DNA, RNA, etc. For example, certain patterns of gene expression and changes to DNA can be used as tumor markers. A tumor recurrence marker is a tumor marker used in monitoring the tumor recurrence in a patient.
The methods described herein may be useful to determine specific molecules, e.g., a tumor marker or a tumor recurrence marker, to predict the risk of tumor relapse after a specific treatment or curative surgery. In one embodiment, ctDNA from a patient sample is analyzed for the presence of a tumor marker, a tumor recurrence marker, a genetic marker, and/or, a biomarker. In one embodiment, the mutant allele fraction of a specific gene is determined. In one embodiment, the heterogeneity of the mutant allele fraction is determined.
In accordance with the invention, “patient” may refer to a human or an animal. Accordingly, the methods and compositions disclosed herein can be used for both human clinical medicine and veterinary applications. Thus, as described herein, a “patient” can be a human or, in the case of veterinary applications, the patient can be a laboratory, an agricultural, a domestic, or a wild animal. In various aspects, the patient can be a laboratory animal such as a rodent (e.g., mouse, rat, hamster, etc.), a rabbit, a monkey, a chimpanzee, a domestic animal such as a dog, a cat, or a rabbit, an agricultural animal such as a cow, a horse, a pig, a sheep, a goat, or a wild animal in captivity such as a bear, a panda, a lion, a tiger, a leopard, an elephant, a zebra, a giraffe, a gorilla, a dolphin, or a whale. Exemplary patients include cancer patients, post-operative patients, transplant patients, patients undergoing chemotherapy, immunosuppressed patients, and the like. In one embodiment, the sample is obtained from a patient. In another embodiment, the patient is a female producing breast milk. In some embodiments, the female is pregnant. In some embodiments, the female is postpartum. In another embodiment, the sample is a breast milk sample from a patient. In some embodiments, the sample is collected from a woman who is nursing. The samples can be prepared for testing as described herein.
In various embodiments, the nucleotides (DNA or RNA) are collected from a breast milk sample comprising breast milk-derived cells. In some embodiments, the breast milk-derived cell is a cancer-like stem cell. In some embodiments, the breast milk-derived cell comprises markers to indicate a cancer. In some embodiments, the cancer is a breast cancer. In some embodiments, the breast cancer is HER2+. HER2−, or triple negative breast cancer. In some embodiments, the cancer originated from another source and metastasized into the breast tissue.
In one embodiment, the method is used to extract DNA and detect a disease. In one embodiments, the method is used to extract RNA and detect a disease. In a further embodiment, the disease is cancer. In some embodiments, the cancer comprises a primary tumor. In yet other embodiments, the cancer comprises non-metastatic tumor cells. In yet other embodiments, the cancer comprises metastatic tumor cells.
In one embodiment, detecting the nucleotides in the sample comprises quantifying the copy number of a gene in the nucleotides sample. In one embodiment, detecting the nucleotides in the sample comprises detecting a mutation in the nucleotides sample. In some embodiments, the gene copy number is quantified per ml of sample. The methods described herein can be used to detect or identify specific nucleic acid sequences in a DNA sample. Techniques for isolation of DNA are well-known in the art. Methods for isolating DNA are described in Sambrook et al., “Molecular Cloning: A Laboratory Manual”, 3rd Edition, Cold Spring Harbor Laboratory Press, (2001), incorporated herein by reference.
In one embodiment, a method of analyzing the nucleotides for a mutation is provided including: providing primer(s) and/or probe(s), amplifying the nucleotides, sequencing the nucleotides, and analyzing the sequenced nucleotides for mutations. In various embodiments, the mutation is indicative of a disease, e.g., cancer. In one embodiment, the nucleotides may be analyzed for the presence of a specific mutant allele fraction, a genetic marker, a biomarker, a tumor marker, or a tumor recurrence marker. In one embodiment, the amplification method is a PCR method, such as OE PCR or ddPCR.
In the methods herein described, DNA/mRNA may be detected and/or quantified using any DNA detection method known in the art. In one embodiment, the nucleic acid may be detected using conventional polymerase chain reaction (PCR) methods. In one embodiment, the nucleic acid may be detected using conventional polymerase chain reaction (PCR), quantitative PCR (qPCR), overlap extension PCR (OE PCR), Emulsion PCR (EmPCR), or digital PCR (dPCR). As described herein, PCR techniques may be used to amplify specific, target DNA fragments from low quantities of source DNA or RNA (for example, after a reverse transcription step to produce complementary DNA (cDNA), or detection of small fragment DNAs in a sample). When performing conventional PCR, the final concentration of template is proportional to the starting copy number and the number of amplification cycles. In one embodiment, a given number of reactions is performed on a single sample and the result is an analysis of fragment sizes or, for quantitative real-time PCR (qPCR), the analysis is an estimate of the concentration of the target sequences in the reaction-based on the number of cycles required to reach a quantification cycle (Cq).
For qPCR methods, a fluorescent reporter dye is used as an indirect measure of the amount of nucleic acid present during each amplification cycle. The increase in fluorescent signal is directly proportional to the quantity of exponentially accumulating PCR product molecules (amplicons) produced during the repeating phases of the reaction. Reporter molecules may be categorized as; double-stranded DNA (dsDNA) binding dyes, dyes conjugated to primers, or additional dye-conjugated oligonucleotides, referred to as probes. The use of a dsDNA-binding dye, such as SYBR® Green I, represents the simplest form of detection chemistry. When free in solution or with only single-stranded DNA (ssDNA) present, SYBR Green I dye emits light at low signal intensity. As the PCR progresses and the quantity of dsDNA increases, more dye binds to the amplicons and hence, the signal intensity increases. Alternatively, a probe (or combination of two depending on the detection chemistry) can add a level of detection specificity beyond the dsDNA-binding dye, since it binds to a specific region of the template that is located between the primers. The most commonly used probe format is the Dual-Labeled Probe (DLP; also referred to as a Hydrolysis or TaqMan® Probe). The DLP is an oligonucleotide with a 5′ fluorescent label, e.g., 6-FAM™ and a 3′ quenching molecule, such as one of the dark quenchers e.g., BHQ®1 or OQ™ (see Quantitative PCR and Digital PCR Detection Methods). These probes are designed to hybridize to the template between the two primers and are used in conjunction with a DNA polymerase that has 5′ to 3′ exonuclease activity.
For digital PCR (dPCR), the sample can be diluted and separated into a large number of reaction chambers or partitions. In various embodiments, each partition contains either one copy of the target DNA or no copies of the target DNA. In some embodiments, the partition may contain one or more copies of the target DNA. In some embodiments, the partition may contain two or more copies of the target DNA. The number of reaction chambers or partitions varies between systems, from several thousand to millions. The PCR is then performed in each partition and the amplicon detected using a fluorescent label such that the collected data are a series of positive and negative results.
In one embodiment, the methods described herein may include droplet digital PCR (ddPCR) technology. ddPCR is a method for performing digital PCR that is based on water-oil emulsion droplet technology. For example, a sample is fractionated into thousands of droplets (e.g., 10,000, 15,000, 20,000, 25,000, 30,000, 40,000, or 50,000 droplets, or more depending on the reaction to be performed), and PCR amplification of the template molecules occurs in each individual droplet. The droplets for use in ddPCR are typically nanoliter-sized droplets. ddPCR has a small sample requirement reducing cost and preserving samples.
As described herein, for methods employing ddPCR, the sample(s) may be partitioned into 20,000 nanoliter-sized droplets. This partitioning allows the measurement of thousands of independent amplification events within a single sample. ddPCR technology uses reagents and workflows similar to those used for most standard TaqMan probe-based assays. ddPCR allows the detection of rare DNA target copies, allows the determination of copy number variation, and allows the measurement of gene expression levels with high accuracy and sensitivity. Digital PCR is an end-point PCR method that is used for absolute quantification and for analysis of minority sequences against a background of similar majority sequences, e.g., quantification of somatic mutations. When using this technique, the sample is taken to limiting dilution and the number of positive and negative reactions is used to determine a precise measurement of target concentration. The digital PCR (dPCR) methods may be employed using emulsion beads (e.g., Bio-Rad QX100™ Droplet Digital™ PCR, ddPCR™ system and RainDance Technologies' RainDrop™ instrument). In an alternative format, the reactions may be run on integrated fluidic circuits (chips). These chips have integrated chambers and valves for partitioning samples and reaction reagents (e.g., BioMark™, Fluidigm).
For overlap extension PCR (OE-PCR) the method may be used for DNA elongation, to insert specific mutations at specific points in a sequence, or to splice smaller DNA fragments into a larger polynucleotide. In one embodiment, a method is described for detection of mutations in very small DNA fragments. Small fragment DNA elongation can be accomplished using a variation of overlap extension (OE) PCR. PCR parameters including primer concentration, concentration of PCR additives, PCR annealing temperatures, and temperature ramp speeds are analyzed for contribution to maximizing elongation efficiency. In another embodiment, a method of analyzing the ctDNA for a mutation is provided including: providing primer(s) and/or probe(s), amplifying the ctDNA, sequencing the ctDNA, and analyzing the sequenced ctDNA for mutations. In various embodiments, the mutation is indicative of a disease, e.g., cancer. In one embodiment, the amplification method is a PCR method, such as OE PCR, EmPCR, or ddPCR.
Emulsion PCR (EmPCR) may be used for template amplification, e.g., in multiple NGS-based sequencing platforms. The basic principle of emPCR is dilution and compartmentalization of template molecules in water droplets in a water-in-oil emulsion. Ideally, the dilution is to a degree where each droplet contains a single template molecule and functions as a micro-PCR reaction. As described herein, emulsion PCR can overcome possible OE PCR bias for elongation of ultra-low frequency mutations. Elongation efficiency and false positive mutation rates may be analyzed to determine optimal PCR conditions, utilizing in vitro systems of varying mutant and wildtype DNA fragment sizes and ratios, and modeling human breast milk, which contains DNA of differing fragment lengths.
Techniques for isolation of DNA and RNA are well-known in the art. In one embodiment, cells may be ruptured by using a detergent or a solvent, such as phenolchloroform. In another embodiment, cells remain intact and cell-free DNA may be extracted. DNA may be separated from other components in the sample by physical methods including, but not limited to, centrifugation, pressure techniques, or by using a substance with affinity for DNA, such as, for example, silica beads. After sufficient washing, the isolated DNA may be suspended in either water or a buffer. In other embodiments, commercial kits are available, such as Quiagen™, Nuclisensm™, and Wizard™ (Promega), and Promegam™. Methods for isolating DNA are described in Sambrook et al., “Molecular Cloning: A Laboratory Manual”, 3rd Edition, Cold Spring Harbor Laboratory Press, (2001), incorporated herein by reference.
In various embodiments described herein, primers and/or probes are used for amplification of the target DNA or RNA are oligonucleotides from about ten to about one hundred, more typically from about ten to about thirty or about twenty to about twenty-five base pairs long, but any suitable sequence length can be used. In illustrative embodiments, the primers and probes may be double-stranded or single-stranded, but the primers and probes are typically single-stranded. The primers and probes described herein are capable of specific hybridization, under appropriate hybridization conditions (e.g., appropriate buffer, ionic strength, temperature, formamide, or MgCl2 concentrations), to a region of the target DNA. The primers and probes described herein may be designed based on having a melting temperature within a certain range, and substantial complementarity to the target DNA. Methods for the design of primers and probes are described in Sambrook et al., “Molecular Cloning: A Laboratory Manual”, 3rd Edition, Cold Spring Harbor Laboratory Press, (2001), incorporated herein by reference.
Also within the scope of the invention are nucleic acids complementary to the probes and primers described herein, and those that hybridize to the nucleic acids described herein or those that hybridize to their complements under highly stringent conditions. In accordance with the invention “highly stringent conditions” means hybridization at 65° C. in 5×SSPE and 50% formamide, and washing at 65° C. in 0.5×SSPE. Conditions for low stringency and moderately stringent hybridization are described in Sambrook et al., “Molecular Cloning: A Laboratory Manual”, 3rd Edition, Cold Spring Harbor Laboratory Press, (2001), incorporated herein by reference. In some illustrative aspects, hybridization occurs along the full-length of the nucleic acid.
In some embodiments, also included are nucleic acid molecules having about 60%, about 70%, about 75%, about 80%, about 85%, about 90%, about 95%, at least 96%, at least 97%, at least 98%, or at least 99% homology to the probes and primers described herein. Determination of percent identity or similarity between sequences can be done, for example, by using the GAP program (Genetics Computer Group, software; now available via Accelrys on http://www.accelrys.com), and alignments can be done using, for example, the ClustalW algorithm (VNTI software, InforMax Inc.). A sequence database can be searched using the nucleic acid sequence of interest. Algorithms for database searching are typically based on the BLAST software. In some embodiments, the percent identity can be determined along the full-length of the nucleic acid. As used herein, the term “complementary” refers to the ability of purine and pyrimidine nucleotide sequences to associate through hydrogen bonding to form double-stranded nucleic acid molecules. Guanine and cytosine, adenine and thymine, and adenine and uracil are complementary and can associate through hydrogen bonding resulting in the formation of double-stranded nucleic acid molecules when two nucleic acid molecules have “complementary” sequences. The complementary sequences can be DNA or RNA sequences. The complementary DNA or RNA sequences are referred to as a “complement.”
Techniques for synthesizing the probes and primers described herein are well-known in the art and include chemical syntheses and recombinant methods. Such techniques are described in Sambrook et al., “Molecular Cloning: A Laboratory Manual”, 3rd Edition, Cold Spring Harbor Laboratory Press, (2001), incorporated herein by reference. Primers and probes can also be made commercially (e.g., CytoMol, Sunnyvale, CA or Integrated DNA Technologies, Skokie, IL). Techniques for purifying or isolating the probes and primers described herein are well-known in the art. Such techniques are described in Sambrook et al., “Molecular Cloning: A Laboratory Manual”, 3rd Edition, Cold Spring Harbor Laboratory Press, (2001), incorporated herein by reference. The primers and probes described herein can be analyzed by techniques known in the art, such as restriction enzyme analysis or sequencing, to determine if the sequence of the primers and probes is correct.
The following examples are exemplary embodiments of the disclosure. One of ordinary skill in the art will understand that slight variations or substitutions may be made to achieve the same results. Those slight variations and substitutions are considered a part of the disclosure herein.
Breast milk was collected in sterile cups and cultured within an hour of donation directly on 60-mm tissue culture plates pre-coated with conditioned media from 804G cells overnight in 50% culture media along with ROCK, TGFβ and BMP inhibitors and Cipro. In total, breast milk from 11 women were cultured and characterized. Five of them (all Caucasian) completed a questionnaire, which requested information including the number of pregnancies, the duration of breast-feeding, smoking, and contraceptive use. Time of milk collection varied between 1-5 months after delivery. Four women were nursing from their first pregnancy and one was nursing from her second pregnancy. None of the women had a breast cancer diagnosis or known BRCA mutations. Two women had a family history of breast cancer. All of the women had used birth control pills. There were no discernable differences between donors and the age range of donors was 29-37 years.
Cell culture plates with breast milk were washed extensively with PBS the next day and cells were passaged once plates became confluent. We usually observed faster growth of cells from milk from the breasts with abnormalities. Flow cytometry characterization and antibodies were used.
Mammosphere assays were performed using MammoCult media (Stem Cell Technologies) as described previously. Cell proliferation rates were measured using Bromodeoxyuridine ELISA by plating 1000 cells in 96 well plates precoated with conditioned media from 804G cells overnight. Sensitivity to drugs was measured after five days of treatment with one media and drug exchange. HDAC6 inhibitor CAY10603 (#S7596), EZH2 inhibitor GSK126 (#S7061), and the BET bromodomain inhibitor JQ1 (#S7110) were purchased from Selleckchem.com. We used an unpaired t test (Graphpad.com) to determine statistical differences in proliferation rate and data from technical replicates are presented although drug sensitivity studies were performed thrice.
RNA from biologic triplicates was prepared using the RNAeasy kit from Qiagen and. Total DNA was prepared using the Qiagen DNA mini kit (Cat #51304). The primers were purchased from Applied Biosciences.
RNA sequencing was performed. The sequencing data were first assessed using FastQC (v.0.11.5, Babraham Bioinformatics, Cambridge, UK) for quality control. All sequenced libraries were mapped to the human genome (UCSC hg38) using the STAR RNA-seq aligner (v.2.5) with the following parameter: “--outSAMmapqUnique 60”. The read distribution across the genome was assessed using bamutils (from ngsutils v.0.5.9). Uniquely mapped sequencing reads were assigned to hg38 refGene genes using featureCounts (subread v.1.5.1) with the following parameters: “-s 2-p-Q 10”. Each sample was analyzed independently and genes with read count per million (CPM)<0.5 in more than 3 replicate samples were removed from the comparisons. The data was normalized using the TMM (trimmed mean of M values) method. Multi-dimensional scaling analysis was done with limma (v.3.38.3). Differential expression analyses were performed using edgeR (v.3.12.1). The false discovery rate (FDR) was computed from p-values using the Benjamini-Hochberg procedure.
Whole genome sequencing libraries were generated with Illumina Nextera DNA Flex Library Prep Kit according to the manufacturer's instruction. The libraries were sequenced on Illumina NovaSeq 6000 S4 flow cell with 150 bp paired-end reads. Breast milk-derived cell DNA and blood samples were targeted for ˜80× and 40× coverage, respectively. All sequence data will be submitted to NCBI dbGaP server.
The paired-end sequence reads were first processed to remove Illumine adapter sequences and low-quality base calls with Trim Galore (http://www.bioinformatics.babraham.ac.uk/projects/trim_galore/). The resulting high-quality reads were aligned to human reference genome hg38 using BWA-MEM and BWAKIT (v0.7.15). Sentieon version 201911 (Sentieon, Inc, https://www.sentieon.com/) was applied in the following step to identify somatic variants. The sentieon analysis included duplicated reads marking, indel realignment, base quality recalibration (BQSR), and co-realignment of tumor and normal samples. Quality metrics including alignment stats, whole genome sequence metrics and insert size metrics were generated during the process. Somatic variants were detected with the Sentieon TNSeq algorithm. The variants passed filters were next functionally annotated with ANNOVAR using various databases.
We used nCounter® v2 Cancer Copy number assay (115000112: XT_PCN_HuV2_Cancercopy-CSO from Nanostring) to determine CNVs in 87 genes commonly deleted or amplified in cancer. The assay was done as per manufacturer recommendations using 300 ngs of genomic DNA. Values 1.6-2.4 are considered normal, 2.6-3.4 considered amplification, whereas 0.4 to 1.4 are considered deletions. For M1 and M2, DNA from the right breast milk-derived cells was compared to DNA from the left breast milk-derived cells, whereas for M10, DNA from the left breast milk-derived cells was compared to DNA from the right breast milk-derived cells. Thus, controls for the assay is from the same individual. Raw numbers of triplicate assays are presented in Tables 1, 2, and 3.
Breast milk was collected from both the breasts prior to initiating any treatment and was cultured using the protocol in Example 1 with the addition of Cipro at 10 μgs/ml to reduce growth of bacteria in milk. Breast epithelial cells grew within a day (
We subjected cells from milk from both breasts to a mammosphere assay to determine whether cells derived from milk from the right breast with cancer show unique properties compared to cells derived from milk from the left breast without cancer. While cells from the left breast milk generated only very small mammospheres (<30 micrometers), larger mammospheres (>50 micrometers) were detected in the right breast milk derived cells (
The tumor in the right breast of this donor was a 2.1 cm, moderately differentiated, ER+/PR+(80% nuclear positivity for both), HER2−, grade 2/3, 2/6 lymph node positive, E-cadherin-positive pT2, pN1a invasive ductal carcinoma. E-cadherin positivity indicated that it is not a lobular carcinoma. Treatment consisted of a lumpectomy, followed by chemotherapy, a mastectomy, and radiation. We propagated cells from mastectomy breast tissues. At the time of surgery, there was no evidence of cancer. Flow cytometry showed distinct patterns between the right and left breasts. Cells from the breast with cancer (right) were negative for EpCAM+ epithelial cells but were likely enriched for mesenchymal stem cells as all cells were CD44+ and >75% of cells were CD90+/CD73+. The remaining cells were CD90+/CD73− (
We obtained breast milk from 10 nursing women, propagated cells and characterized cells using flow cytometry. Note that cells from both breasts were cultured for a similar duration and the analyses were done within five passages. Profiles of milk-derived cells from these women are presented in
Cells from the left breast milk and the right breast milk of M10 showed differences in subpopulation of CD49f+/EpCAM−, CD44+/CD24− and CD271+/EpCAM− cells, although none of the differences reached statistical difference (
We employed RNA sequencing to determine whether phenotypic changes between breast milk-derived cells of the two breasts of the same individual could be aligned with transcriptomic changes to obtain evidence for the presence of abnormal cells in one of the breasts. RNA sequencing was uninformative as breast milk-derived cells from the right and the left breasts with and without cancer or aberrant cells showed significant differences in gene expression patterns. MDS plots of all samples are shown in
We previously demonstrated a close relationship between cancer stem cell phenotype and epithelial to mesenchymal transition (EMT). We examined RNA-seq data for differences in the expression of EMT-associated genes between the right and the left breast milk-derived cells. Expression levels of SNAIL ZEB1, ZEB2, and SOX2 were extremely low, which is consistent with luminal characteristics of cells. SNAI2 (Slug) showed two-fold expression differences between the right breast milk and the left breast milk-derived cells in few cases. The major difference was noted with TP63, which controls self-renewal of breast cancer stem cells through sonic hedgehog pathway. While two-fold differences in expression between the right and the left breast milk-derived cells were common across all samples, the difference was 36-fold in case of M10. We verified these differences at protein levels. Consistent with mRNA data, M1 right breast milk-derived cells contained 2-fold higher levels of p63 protein (
We performed whole genome sequencing of breast milk-derived cell DNA and compared the DNA sequencing results with sequencing data of the corresponding germ line DNA from blood. Summary of the results are presented
We performed CNV analysis of 87 cancer-relevant genes using DNA from breast milk-derived cells of donors M1. M2 and M10. Breast milk-derived cell DNA from the breast without aberrant cells from the same donor was used as control for comparison. While the right breast milk-derived cells of M1 and the left breast milk-derived cells of M10 did not carry any CNVs in these 87 genes, the right breast milk-derived cells of M2 showed amplification in CDKN2C, PTEN, and REL (
Prior studies have demonstrated synthetic lethality of SWI/SNF mutated cancers to inhibitors of EZH2 and bromodomains. Similarly, NF1 mutated tumors are susceptible to bromodomain inhibitors. Since cells from the right breast milk of M1 contained mutations in SWI/SNF complex and the right breast milk of M2 had a mutation in NF1 gene, we examined the sensitivity of these cells to EZH2 inhibitor GSK126 and bromodomain inhibitor JQ1. Since cells from right breast milk of M1 contained a mutation in HDAC6 gene, we also tested the effects of the HDAC6 inhibitor CAY10603. Unfortunately, we were unable to use cells from the other breast of the same donors as controls as these cells ceased to proliferate by the time we reached this stage of studies. Cells from the donor M10 also could not be tested because antibiotics used in the culture media failed to suppress milk-resident bacterial growth in this sample. While cells from both donors showed modest sensitivity to the HDAC6 inhibitor, they were extremely sensitive to GSK126 and JQ1 (
This study, initiated due to foresight of a patient, has established a mechanism to identify and characterize aberrant cells in the breast potentially before a radiologic and/or routine breast exam can detect suspicious abnormalities. Cells from breast milk of a donor with known cancer were enriched with CSCs and contained mutations in genes that are linked to breast cancer initiation and/or progression. In 82% of studies that correlated CSC biomarker expression in tumors with outcomes showed an association of CSC marker enrichment with poor overall survival. Since there are no universal CSC biomarkers and characterized biomarkers are associated with different aspects of cancer progression such as drug resistance and metastasis, the method described here would allow for the comprehensive characterization of CSC marker enrichment during early stage of the disease and may potentially aid in monitoring disease progression and making treatment decisions. These phenotypic characterizations coupled with DNA sequencing provided details of earliest lesions associated with breast cancer. For example, in addition to a mutation in HDAC6, which itself is an ERα-regulated gene and mediates estradiol-induced cell migration, DNA from breast milk with ER+/PR+ cancer cells showed mutations in SMARCC2 (BAF170). BAF170 is part of the SWI/SNF complex and the activity of ARID1A-SWI/SNF is essential for luminal cell identity and the control of ER activity and anti-estrogen response in breast cancer. Thus, two major players in ERα functions are aberrant in this cancer.
One in 3000 women are diagnosed with breast cancer during pregnancy or post-partum. These cancers tend to be highly metastatic because remodeling of organs such as liver during lactation provides fertile niche for metastatic cells. In certain instances, as in the case of donor 1, it is possible to propagate cancer cells from breast milk for genomic sequencing and focused screening of drugs that may target driver mutations. Thus, the method described here is beneficial for characterizing at least pregnancy and post-partum breast cancers.
This application claims priority to U.S. Provisional Application Ser. No. 63/074,101, filed Sep. 3, 2021, the contents of which are hereby incorporated in their entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2021/047506 | 8/25/2021 | WO |
Number | Date | Country | |
---|---|---|---|
63074101 | Sep 2020 | US |