For the purpose of only those jurisdictions that permit incorporation by reference, all of the references cited in this disclosure are hereby incorporated by reference in their entireties (numbers in parentheses or in superscript following text in this patent disclosure refer to the numbered references provided in the “Reference List” section of this patent specification). In addition, any manufacturers' instructions or catalogues for any products cited or mentioned herein are incorporated by reference. Documents incorporated by reference into this text, or any teachings therein, can be used in the practice of the present invention.
Tissues are comprised of different cell types whose interactions elicit distinct gene expression patterns that regulate tissue formation, regeneration, homeostasis and repair. Analysis of these gene expression patterns requires methods that can capture as closely as possible the transcriptomes of cells of interest in their tissue microenvironment. The tissue microenvironment plays a crucial role in determining gene expression in cells in vivo, yet tools that can accurately and sensitively capture the transcriptomics of rare cells of interest in the context of an intact tissue microenvironment are lacking. While currently available techniques such as TRAP-seq and TU-tagging can capture in situ transcriptomics, these methods are limited in their ability to study rare cell populations due to their inherent noise (1-3). The application of TU-tagging is limited to cell populations that constitute at least 5% of the total tissue (1), while TRAP-seq has slightly higher sensitivity and can be used for cell populations that constitute at least 1% of the total population (2). Furthermore, current technologies designed to study in situ transcriptomics are limited by the requirement to perform multiple steps after tissue dissociation (3-5), and the requirement for the use of sophisticated tools (6), making it challenging to transcriptionally profile rare cell populations rapidly isolated from their native microenvironment. Many crucial questions of tremendous biological and clinical significance involve cell populations representing far fewer than 1% of the total cells in a tissue, such as adult stem cells, subtypes of neuronal cells and immune cells, dormant cancer cells, and others. As such, there is a need in the art for new and improved methods for studying in situ transcriptomics in rare cell populations. The present invention addresses this need.
The present invention provides new and improved methods for labeling RNA and performing in situ transcriptomics from cell populations, including rare cell populations, present within complex multicellular environments.
The enzyme cytosine deaminase or “CD” is naturally expressed in prokaryotes and fungi, but not in mammalian cells. The CD enzyme converts cytosine to uracil, which can then be converted by further enzymatic action to uridine and uridine triphosphate—which can then be incorporated into RNA. The present invention exploits the activity of the CD enzyme to selectively label/tag RNA in cells of interest present in mixed cell populations. In particular, the methods of the present invention involve expressing a recombinant CD enzyme in mammalian cells of interest, and also supplying an exogenous non-naturally occurring labeled/tagged substrate for the CD enzyme—specifically a halogenated cytosine, such as, for example, 5-fluoro-cytosine. Using this system, non-native metabolites of the halogenated cytosine substrate (e.g. halogenated uracil, halogenated uridine, and halogenated uridine triphosphate) are generated and incorporated into RNA in the cells of interest—thereby specifically labeling/tagging newly synthesized RNAs with halogenated uridine derivatives. Because mammalian cells cannot convert cytosine (or halogenated cytosine derivatives) to uracil without expression of the recombinant CD enzyme, this CD-based RNA tagging system can be used to selectively tag RNA in the cells of interest in a highly controlled manner.
While others have suggested employing CD to label RNA with reactive moieties (see WO 2016/154040), to the best of Applicant's knowledge, such an approach has never been successfully performed. In contrast, and as shown in the Examples section of the present application, Applicants have now successfully demonstrated that CD-based labeling of RNA with halogen atoms can be successfully performed. Importantly, such methods were found to be effective even though halogenated uracil derivatives are widely known to be toxic. For example, 5-fluoro uracil is highly cytotoxic, and is, in fact, used extensively as a chemotherapeutic agent for the treatment of tumors—based on its ability to induce cell death by inhibiting thymidylate synthase and by causing DNA and RNA damage (12-14). Nonetheless, the inventors have found that halogenated uracil derivatives can, surprisingly, be used to efficiently label RNA in the present methods. Indeed, the inventors have also demonstrated that efficient RNA labeling can even be achieved when the timing and dosing of the halogenated cytosine exposure is reduced to a level that causes minimal-to-no cytotoxicity. Furthermore, the use of halogenated cytosine (such as fluoro-cytosine) as the substrate in these CD-based RNA tagging methods appears to provide several advantages over other potential tagging and/or labeling systems. For example, because the tag is very small in size (a single halogen atom), the tagged cytosine remains a suitable substrate for the cytosine deaminase enzyme—and can be effectively converted to halogenated uracil by the CD enzyme. This is in contrast to labeling/tagging of cytosine with larger tags, (such as multi-atomic reactive moieties), which may compromise the ability of the cytosine deaminase enzyme to convert the labeled/tagged cytosine to uracil and thus severely reduce the efficiency of, or even prevent, RNA labelling. (Indeed, preliminary testing data suggests that the efficiency of conversion of cytosine to uracil by CD is inversely correlated with the size of any tag present on the cytosine molecule).
In some embodiments the CD-based RNA tagging systems of the present invention summarized above also involve expressing recombinant uracil phosphoribosyl transferase (UPRT) in the cells of interest. This limits the diffusion of halogenated uracil to neighboring cells—thus increasing the signal-to-noise ratio of the present methods.
In some embodiments the CD-based RNA tagging systems of the present invention summarized above also involve supplying exogenous thymine to the cells of interest. The thymine acts as a competitive inhibitor of halogenated uracil/halogenated uridine export from the CD-expressing cells—thus further increasing the signal-to-noise ratio of the present methods.
In some embodiments the tagged RNAs generated in the cells of interest using the CD-based RNA tagging systems of the present invention can be detected and/or purified very efficiently using antibodies that bind to tagged uridine-containing RNAs specifically, but that do not bind to (or have significantly lower levels of binding to) tagged cytidine containing RNAs. Such antibodies include, but are not limited to, anti-bromodeoxyuridine (“BrdU”) antibodies, which bind specifically to halogenated-uridine containing nucleic acid molecules. Anti-BrdU antibodies are well known in the art and are widely available. The use of such antibodies to detect and/or purify RNA from the cells of interest provides several important advantages. For example, the use of such antibodies for RNA purification reduces or eliminates contamination by RNA that contains the halogenated cytosine substrate or its cytidine derivatives (e.g. cytidine triphosphate). This is in contrast to detection and/or purification methods that are based purely on the existence of the tag itself (whether that is a reactive moiety or any other type of tag)—irrespective of whether the tag is present on a cytidine derivative or on a uridine derivative. This is important because derivatives of the tagged cytosine substrates used in the present methods can be incorporated into RNA in a CD-independent manner—such that cells other than the cells of interest contain tagged RNA. Avoiding contamination by RNA that contains tagged cytidine is particularly advantageous where the cells of interest are very rare. Furthermore, the use of such antibodies eliminates the need to perform additional steps and/or utilize additional chemistry before the tagged RNA can be detected and/or purified. This is in contrast to RNA detection and purification systems that involve, for example, tagging RNA with a reactive moiety and then performing chemical reactions to render the RNA detectable.
The purified tagged RNAs obtained using the methods described herein (which represent the transcriptomes of the cells of interest), can be analyzed using any desired technique used for RNA analysis, including, but not limited to, microarray analysis, RT-PCR and/or RNA-sequencing (“RNA-seq”).
Using the new and improved methods of the present invention it is now possible to obtain and define the transcriptomes of cells of interest representing as few as 0.003% of the total cell population in an organ or tissue. As such, the methods of the present invention can be used to study the transcriptomes of rare cell populations including, but not limited to, stem cells, micro-metastatic cells, post-treatment residual cancer cells, sub-populations of neurons, and specific types of immune cells, and can also be used to study such cell types in a variety of different situations, including in response to diverse stimuli and/or in multiple different physiological and pathological contexts. Furthermore, because the methods of the invention only measure newly synthesized RNAs, the methods can be used to identify rapid changes in gene expression in cells of interest in response to a variety of external factors such as ligands, hormones, and drugs.
In some embodiments, the present invention provides a method of producing tagged RNA from mammalian cells of interest present in a tissue or tissue culture that contains multiple cell types, the method comprising contacting a tissue or a tissue culture that contains multiple cell types with an effective amount of a halogenated cytosine, wherein mammalian cells of interest in the tissue or tissue culture have been engineered to express a recombinant cytosine deaminase enzyme, thereby generating halogenated uridine-tagged RNA in the cells of interest.
In some such embodiments such methods may further comprise subsequently separating halogenated uridine-tagged RNA from other components of the tissue, tissue culture, and/or cells of interest. Similarly, in some such embodiments such methods may further comprise subsequently contacting a cell lysate or RNA sample derived from the tissue or tissue culture with an antibody that binds specifically to halogenated uridine-tagged RNA. Similarly, in some such embodiments such methods may further comprise subsequently separating halogenated uridine-tagged RNA from other components in the cell lysate or RNA sample based on its binding to the antibody, thereby obtaining halogenated uridine-tagged RNA from the cells of interest present in the tissue or tissue culture.
In some embodiments, the present invention provides a method of obtaining tagged RNA from mammalian cells of interest present in a tissue or tissue culture that contains multiple cell types, the method comprising: (a) contacting a tissue or a tissue culture that contains multiple cell types with an effective amount of a halogenated cytosine, wherein mammalian cells of interest in the tissue or tissue culture have been engineered to express a recombinant cytosine deaminase enzyme; (b) contacting a cell lysate or RNA sample derived from the tissue or tissue culture with an antibody that binds specifically to halogenated uridine-tagged RNA; and (c) separating halogenated uridine-tagged RNA from other components in the cell lysate or RNA sample based on its binding to the antibody; thereby obtaining tagged RNA from the cells of interest present in the tissue or tissue culture.
Each of the embodiments described above, or elsewhere herein, may, optionally, further comprise contacting the tissue or tissue culture with an effective amount of exogenous thymine.
Each of the embodiments described above, or elsewhere herein, may, optionally, be performed using cells of interest that also express a recombinant uracil phosphoribosyltransferase (UPRT) enzyme.
In some embodiments, the present invention provides a method of producing tagged RNA from mammalian cells of interest present in a tissue or tissue culture that contains multiple cell types, the method comprising: (a) contacting a tissue or a tissue culture that contains multiple cell types with an effective amount of a halogenated cytosine, wherein mammalian cells of interest in the tissue or tissue culture have been engineered to express a recombinant cytosine deaminase enzyme, and (b) contacting the tissue or a tissue culture with an effective amount of exogenous thymine, thereby producing tagged RNA from the cells of interest present in the tissue or tissue culture.
In some embodiments, the present invention provides a method of producing tagged RNA from mammalian cells of interest present in a tissue or tissue culture that contains multiple cell types, the method comprising: (a) contacting a tissue or a tissue culture that contains multiple cell types with an effective amount of a halogenated cytosine, wherein mammalian cells of interest in the tissue or tissue culture have been engineered to express both (i) a recombinant cytosine deaminase enzyme, and (ii) a recombinant uracil phosphoribosyltransferase (UPRT) enzyme, and (b) contacting the tissue or a tissue culture with an effective amount of an exogenous thymine, thereby producing tagged RNA from the cells of interest present in the tissue or tissue culture.
Each of the embodiments described above, or elsewhere herein, may, optionally, further comprise performing RNA sequencing of the tagged RNA, and/or reverse transcribing the tagged RNA to produce cDNA, and/or performing RT-PCR with the tagged RNA, and/or amplifying the tagged RNA, or cDNA derived therefrom, and/or performing microarray analysis of the tagged RNA.
In each of the embodiments described above, or elsewhere herein, the halogenated cytosine may be selected from the group consisting of fluoro-cytosine, chloro-cytosine, bromo-cytosine, and iodo-cytosine. For example, in some embodiments the halogenated cytosine is 5-fluoro-cytosine and the halogenated uridine-tagged RNA is 5-fluoro-uridine-tagged RNA.
In each of the embodiments described above, or elsewhere herein, the methods may, optionally, also comprise contacting a cell lysate or RNA sample derived from the tissue or tissue culture with an anti-BrdU antibody.
In each of the embodiments described above, or elsewhere herein, the methods may, optionally, also comprise performing immuno-affinity chromatography or immunoprecipitation to separate halogenated uridine-tagged RNA from other components in the cell lysate or RNA sample. For example, in some embodiments two successive rounds of immuno-affinity chromatography or immunoprecipitation may be performed to separate the halogenated uridine-tagged RNA from other components in the cell lysate or RNA sample.
In each of the embodiments described above, or elsewhere herein, the cells of interest used in the methods of the invention may, optionally, comprise a recombinant nucleic acid molecule that comprises a nucleotide sequence encoding a cytosine deaminase enzyme operatively linked to a promoter. Similarly, in each of the embodiments described above, or elsewhere herein, the cells of interest used in the methods of the invention may, optionally, comprise a recombinant nucleic acid molecule that comprises a nucleotide sequence encoding a UPRT enzyme operatively linked to a promoter. In some embodiments the cells of interest may, optionally, comprise both (a) a recombinant nucleotide sequence encoding a cytosine deaminase enzyme, and (b) a recombinant nucleotide sequence encoding a UPRT enzyme. In some such embodiments the nucleotide sequence encoding the cytosine deaminase enzyme and the nucleotide sequence encoding the UPRT enzyme may, optionally, be present on the same nucleic acid molecule, for example on a nucleic acid molecule comprising an internal ribosome entry site (IRES) sequence or viral 2A peptide encoding sequence located between the nucleotide sequence encoding the cytosine deaminase enzyme and the nucleotide sequence encoding the UPRT enzyme. Alternatively, in other embodiments, the nucleotide sequence encoding the cytosine deaminase enzyme and the nucleotide sequence encoding the UPRT enzyme may each be present on a separate nucleic acid molecule. In each of such embodiments the nucleotide sequence encoding the cytosine deaminase enzyme, and/or the nucleotide sequence encoding the UPRT enzyme may, optionally, comprise an inducible promoter, and/or a tissue-specific promoter.
In each of the embodiments described above, or elsewhere herein, the cells of interest may be in vitro or in vivo.
In embodiments where the cells of interest are in vivo the cells may be contacted with the halogenated cytosine in vivo. In some embodiments the cells of interest may have been injected into a living animal or may be derived from cells that have been injected into a living animal. In some embodiments the cells of interest may be present in a genetically engineered animal that has been engineered to express the recombinant cytosine deaminase enzyme, and/or to express both the recombinant cytosine deaminase enzyme and UPRT.
In embodiments where the cells of interest are in vitro the cells may be contacted with the halogenated cytosine in vitro. For example, a tissue or tissue culture comprising the cells of interest may be contacted in vitro with the halogenated cytosine at a concentration of up to about 50 micro molar, or of up to about 125 micro molar, or of up to about 250 micro molar, or of up to about 500 micro molar.
In embodiments where the cells of interest are in vivo the cells may be contacted with the halogenated cytosine in vivo. For example, in some embodiments a dose of up to about 50 mg/kg, or up to about 125 mg/kg, or up to about 250 mg/kg, or up to about 500 mg/kg, of the halogenated cytosine may be administered to an animal comprising the cells of interest.
In some embodiments, the cells of interest, or the tissue or tissue culture, may be contacted with the halogenated cytosine in vitro or in vivo for a period of at least about 2 hours. In some embodiments, the cells of interest, or the tissue or tissue culture, may be contacted with the halogenated cytosine in vitro or in vivo for or for a period of up to about 48 hours, or for a period of up to about 24 hours, or for a period of up to about 12 hours, or for a period of up to about 8 hours, or for a period of up to about 6 hours, or for a period of up to about 4 hours. In some embodiments, the cells of interest, or the tissue or tissue culture, may be contacted with the halogenated cytosine in vitro or in vivo for a period of from about 2 hours to about 24 hours, or for a period of from about 2 hours to about 12 hours, or for a period of from about 2 hours to about 8 hours, or for a period of from about 2 hours to about 6 hours, or for a period of from about 2 hours to about 4 hours. In some embodiments the present invention provides kits that may be useful in carrying out the methods described herein. For example, in some embodiments the present invention provides a kit for obtaining tagged RNA from mammalian cells of interest present in a tissue or tissue culture that contains multiple mammalian cell types, the kit comprising two or more components selected from the group consisting of (a) a halogenated cytosine (for example 5-fluoro-cytosine), (b) thymine, (c) a nucleotide molecule encoding a cytosine deaminase enzyme, (d) a nucleotide molecule encoding a UPRT enzyme, and (e) an antibody that binds to halogenated uridine-tagged RNA (for example an anti-BrdU antibody).
The present invention also provides substantially pure samples of halogenated uridine-tagged RNA, such as substantially pure samples of 5-fluoro-uridine-tagged RNA. In some embodiments such samples may be produced using the methods of the present invention.
These and other embodiments of the invention are further described in the “Brief Description of the Figures,” “Detailed Description,” “Examples,” “Figures,” and “Claims” sections of this patent disclosure, each of which sections is intended to be read in conjunction with, and in the context of, all other sections of the present patent disclosure. Furthermore, one of skill in the art will recognize that the various embodiments of the present invention described herein can be combined in various different ways, and that such combinations are within the scope of the present invention.
a-g. Flura-tagging of mRNA in vivo.
a-b. CD expression produces fluorouracil (5-FU) derivatives upon fluorocytosine (5-FC) treatment that can be detected using anti-BrdU antibody.
a-e. Flura-tagging enables transcriptional profiling from micrometastatic lung xenografts.
a-b. Validation of differential gene expression by MDA231 cells in vitro vs. as lung micrometastases in vivo. Expression of representative differentially expressed genes identified by Flura-seq in MDA231 cells cultured in vitro or in mice lungs as micrometastases, was quantified by RT-PCR using human specific primers.
Some of the main embodiments of the present invention are described in the “Summary of the Invention,” “Examples,” “Brief Description of the Figures,” and “Figures” sections of this patent disclosure. This Detailed Description section provides certain additional description and details, and is intended to be read in conjunction with all other sections of the present patent disclosure.
As used in this specification and the appended claims, the singular forms “a,” “an,” and “the” include plural referents, unless the context clearly dictates otherwise. The terms “a” (or “an”) as well as the terms “one or more” and “at least one” can be used interchangeably.
Furthermore, “and/or” is to be taken as specific disclosure of each of the two specified features or components with or without the other. Thus, the term “and/or” as used in a phrase such as “A and/or B” is intended to include A and B, A or B, A (alone), and B (alone). Likewise, the term “and/or” as used in a phrase such as “A, B, and/or C” is intended to include A, B, and C; A, B, or C; A or B; A or C; B or C; A and B; A and C; B and C; A (alone); B (alone); and C (alone).
Units, prefixes, and symbols are denoted in their Systeme International de Unites (SI) accepted form. Numeric ranges provided herein are inclusive of the numbers defining the range.
Where a numeric term is preceded by “about” or “approximately,” the term includes the stated number and values ±10% of the stated number.
Numbers in parentheses or superscript following text in this patent disclosure refer to the numbered references provided in the “Reference List” section at the end of this patent disclosure.
Other terms are defined elsewhere in this patent disclosure, or else are used in accordance with their usual meaning in the art.
Where the present disclosure provides specified nucleotide and/or amino acid sequences (for example as identified using SEQ ID Nos), in addition to the specific sequences that are disclosed herein, variants of such sequences are also contemplated and are intended to fall within the scope of the present invention. For example, in some embodiments variants of the specific sequences disclosed herein from other species (orthologs) may be used. Similarly, in other embodiments variants that comprise fragments of any of the specific sequences disclosed herein may be used. Likewise, in some embodiments variants of the specific sequences disclosed herein that comprise one or more substitutions, additions, deletions, or other mutations may be used. In some embodiments the variant sequences have at least about 40% or 50% or 60% or 65% or 70% or 75% or 80% or 85% or 90% or 95% or 98% or 99% identity with the specific sequences described herein.
The various methods described herein require that a halogenated cytosine is contacted with the cells of interest. Any suitable halogenated derivative of cytosine may be used. In some embodiments the halogenated cytosine is selected from the group consisting of fluoro-cytosine, chloro-cytosine, bromo-cytosine, and iodo-cytosine. Furthermore, in some embodiments the halogenated cytosine is selected from the group consisting of 5-fluoro-cytosine, 5-chloro-cytosine, 5-bromo-cytosine, and 5-iodo-cytosine. In some embodiments the halogenated cytosine is 5-fluoro-cytosine. Halogenated cytosine derivatives such as those identified herein are known in the art and are either available from commercial sources or can be produced using published protocols. For example, 5-fluoro-cytosine is commercially available from Sigma-Aldrich (catalog number F7129).
Some of the methods described herein require that thymine is contacted with the cells of interest. Thymine is well known in the art and is available from multiple commercial sources and/or can be produced using published protocols. For example, thymine is commercially available from Sigma-Aldrich (catalog number T0376).
Any suitable method may be used to bring the halogenated cytosine and/or thymine into contact with the cells of interest. For example, when the cells of interest are present in vitro, such as in a tissue culture, the halogenated cytosine and/or thymine may be simply added to the tissue culture medium. When the cells of interest are present in an animal in vivo, the halogenated cytosine and/or thymine may be administered to the animal by any means that will result in the halogenated cytosine and/or thymine reaching and coming into contact with the cells of interest in that animal. For example, the halogenated cytosine and/or thymine may be administered to the animal by oral, intravenous, intraperitoneal, or subcutaneous routes, or by any other suitable route of administration known in the art. In some embodiments the agents may be administered systemically. In other embodiments the agents may be administered locally.
The amount of halogenated cytosine and/or thymine that is used should be an “effective amount.” As used herein the term “effective amount” refers to an amount of one of these agents that is sufficient to achieve the desired outcomes described herein. For example, in the case of the halogenated cytosine, the amount should be sufficient to achieve the desired outcome of “tagging” newly synthesized RNA in the cells of interest—as described herein. Similarly, in the case of thymine, the amount should be sufficient to act as a competitive inhibitor of halogenated uracil/halogenated uridine export from CD-expressing cells—as described herein. An appropriate “effective” amount of a halogenated cytosine and/or thymine may be determined using standard techniques known in the art, such as in vitro and/or in vivo dose escalation studies, and may be determined taking into account such factors as the desired route of administration, desired frequency of administration, etc. In some embodiments an “effective amount” may be determined using assays such as those described in the Examples section of this patent application. In some embodiments an “effective amount” may be calculated or determined based on studies performed in vitro and/or in vivo in various animal models, and may be determined taking into account various factors such as the form of the agent, the route of administration, the body weight of the animal, etc.
In some embodiments the halogenated cytosine used in the methods of the present invention is 5-fluoro-cytosine. The 5-fluoro-cytosine is converted in CD-expressing cells to various metabolites, including 5-fluoro-uracil (5-FU). 5-FU can be transported across cell membranes based on its concentration gradient (9,10). Furthermore, 5-FU is highly cytotoxic, and is in fact used extensively used as a chemotherapeutic agent for the treatment of solid tumors—based on its ability to induce cell death by inhibiting thymidylate synthase and by causing DNA and RNA damage (12-14). Importantly, the present invention provides various means for minimizing any such cytotoxicity while still allowing for effective RNA labeling. For example, and as described in more detail in the Examples section of this patent disclosure, the present invention provides amounts of 5-fluoro-cytosine, and timing of 5-fluoro-cytosine administration, that have been optimized to minimize cytotoxicity while facilitating effective RNA labeling. The present patent disclosure also provides specific assays that can be performed to measure such cytotoxicity.
In some embodiments the halogenated cytosine is used at approximately the maximum amount/dose at which it can be used without inducing cytotoxicity, for example as determined using the assays described in the Examples section of this patent disclosure. In some embodiments the halogenated cytosine is used at about 90% of the maximum amount/dose at which it can be used without inducing cytotoxicity. In some embodiments the halogenated cytosine is used at about 80% of the maximum amount/dose at which it can be used without inducing cytotoxicity. In some embodiments the halogenated cytosine is used at about 75% of the maximum amount/dose at which it can be used without inducing cytotoxicity. In some embodiments the halogenated cytosine is used at about 70% of the maximum amount/dose at which it can be used without inducing cytotoxicity. In some embodiments the halogenated cytosine is used at about 60% of the maximum amount/dose at which it can be used without inducing cytotoxicity. In some embodiments the halogenated cytosine is used at about 50% of the maximum amount/dose at which it can be used without inducing cytotoxicity. In some embodiments the halogenated cytosine is used at about 40% of the maximum amount/dose at which it can be used without inducing cytotoxicity. In some embodiments the halogenated cytosine is used at about 30% of the maximum amount/dose at which it can be used without inducing cytotoxicity. In some embodiments the halogenated cytosine is used at about 25% of the maximum amount/dose at which it can be used without inducing cytotoxicity. In some embodiments the halogenated cytosine is used at about 20% of the maximum amount/dose at which it can be used without inducing cytotoxicity. In some embodiments the halogenated cytosine is used at about 10% of the maximum amount/dose at which it can be used without inducing cytotoxicity. In some embodiments the halogenated cytosine is used at about 5% of the maximum amount/dose at which it can be used without inducing cytotoxicity. In some embodiments the halogenated cytosine is used at about 1% of the maximum amount/dose at which it can be used without inducing cytotoxicity.
In some embodiments the cells of interest are contacted with a halogenated cytosine in vitro. In some such embodiments, the effective amount of the halogenated cytosine is up to about 50 micro-molar, or up to about 100 micro-molar, or up to about 150 micro-molar, or up to about 200 micro-molar, or up to about 250 micro molar, or up to about 300 micro molar, or up to about 350 micro molar, or up to about 400 micro molar, or up to about 450 micro molar, or up to about 500 micro molar. In other some such embodiments, the effective amount of the halogenated cytosine is about 50 micro-molar, or about 100 micro-molar, or about 150 micro-molar, or about 200 micro-molar, or about 250 micro molar, or about 300 micro molar, or about 350 micro molar, or about 400 micro molar, or about 450 micro molar, or about 500 micro molar.
In some embodiments the cells of interest are contacted with a halogenated cytosine in vivo. In some such embodiments, the effective amount of the halogenated cytosine is up to about 50 mg/kg, or up to about 100 mg/kg, or up to about 150 mg/kg, or up to about 200 mg/kg, or up to about 250 mg/kg, or up to about 300 mg/kg, or up to about 350 mg/kg, or up to about 400 mg/kg, or up to about 450 mg/kg, or up to about 500 mg/kg. In some such embodiments, the effective amount of the halogenated cytosine is about 50 mg/kg, or about 100 mg/kg, or about 150 mg/kg, or about 200 mg/kg, or about 250 mg/kg, or about 300 mg/kg, or about 350 mg/kg, or about 400 mg/kg, or about 450 mg/kg, or about 500 mg/kg.
In some embodiments the cells of interest are contacted with thymine in vitro. In some such embodiments, the effective amount of the thymine is up to about 50 micro-molar, or up to about 100 micro-molar, or up to about 125 micro-molar, or up to about 150 micro-molar, or up to about 200 micro-molar, or up to about 250 micro molar, or up to about 300 micro molar, or up to about 350 micro molar, or up to about 400 micro molar, or up to about 450 micro molar, or up to about 500 micro molar. In other some such embodiments, the effective amount of the thymine is about 50 micro-molar, or about 100 micro-molar, or about 125 micro-molar, or about 150 micro-molar, or about 200 micro-molar, or about 250 micro molar, or about 300 micro molar, or about 350 micro molar, or about 400 micro molar, or about 450 micro molar, or about 500 micro molar.
In some embodiments the cells of interest are contacted with thymine in vivo. In some such embodiments, the effective amount of the thymine is up to about 50 mg/kg, or up to about 100 mg/kg, or up to about 125 mg/kg, or up to about 150 mg/kg, or up to about 200 mg/kg, or up to about 250 mg/kg, or up to about 300 mg/kg, or up to about 350 mg/kg, or up to about 400 mg/kg, or up to about 450 mg/kg, or up to about 500 mg/kg. In some such embodiments, the effective amount of the thymine is about 50 mg/kg, or about 100 mg/kg, or about 125 mg/kg, or about 150 mg/kg, or about 200 mg/kg, or about 250 mg/kg, or about 300 mg/kg, or about 350 mg/kg, or about 400 mg/kg, or about 450 mg/kg, or about 500 mg/kg.
In some embodiments the tissue or tissue culture is contacted with the halogenated cytosine and/or the thymine in vitro or in vivo for a period of at least about 2 hours, and for a period of up to about 4 hours, or up to about 6 hours, or up to about 8 hours, or up to about 10 hours, or up to about 12 hours, or up to about 24 hours, or up to about 48 hours. In some embodiments the tissue or tissue culture is contacted with a halogenated cytosine and with thymine concurrently.
The CD enzyme is a prokaryotic and fungal enzyme not expressed by mammalian cells. The methods of the present invention involve expression of a recombinant cytosine deaminase (CD) enzyme in cells of interest. In some embodiments the cells of interest are not fungi and are not prokaryotic cells. In some embodiment the cells of interest are mammalian cells.
Any suitable CD enzyme, from any suitable species, can be used as long as the enzyme has cytosine deaminase activity in the cells of interest. Similarly, any naturally occurring or manmade variant of a CD enzyme can be used as long as the enzyme has cytosine deaminase in the cells of interest. In the Examples presented herein the CD enzyme used was from Saccharomyces cerevisiae. However, a CD enzyme from any other organism or species, or a variant thereof, can be used as long as the enzyme has cytosine deaminase activity in the cells of interest. Nucleotide sequences and amino acid sequences of CD enzymes are known in the art and provided in public nucleotide and amino acid sequence databases. Furthermore, vectors comprising nucleotide sequences encoding CD enzymes are available commercially. For example, a vector comprising a nucleotide sequence encoding CD is commercially available from Addgene (catalog number 35102).
Some of the methods of the present invention involve expression of a recombinant uracil phosphoribosyltransferase (UPRT) enzyme in the cells of interest. Any suitable UPRT enzyme can be used as long as the enzyme has uracil phosphoribosyltransferase activity in the cells of interest. Similarly, any naturally occurring or manmade variant of a UPRT enzyme can be used as long as the enzyme has uracil phosphoribosyltransferase activity in the cells of interest. In the Examples presented herein the UPRT enzyme used was from Toxoplasma gondii. However, a UPRT enzyme from any other organism or species, or a variant thereof, can be used as long as the enzyme has UPRT activity in the cells of interest.
Nucleotide sequences and amino acid sequences of UPRT enzymes are known in the art and provided in public nucleotide and amino acid sequence databases. Furthermore, vectors comprising nucleotide sequences encoding UPRT enzymes are available commercially. For example, a vector comprising a nucleotide sequence encoding UPRT is available from Addgene (catalog number 47110).
The recombinant enzymes used in the methods of the present invention can be expressed in the cells of interest using any suitable vector system and any suitable promoter system known in the art. For example, the recombinant CD and/or UPRT enzymes can be expressed in the cells of interest by delivering to the cells, or by obtaining cells that already comprise, a recombinant nucleic acid molecule that comprises a nucleotide sequence encoding the enzyme (i.e. CD or UPRT) operatively linked to a promoter. Some embodiments involve delivering to the cells of interest, or obtaining cells of interest that already comprise, both a recombinant nucleotide sequence encoding a CD enzyme and a recombinant nucleotide sequence encoding a UPRT enzyme. In some such embodiments the nucleotide sequence encoding the CD enzyme and the nucleotide sequence encoding the UPRT enzyme are present on the same nucleic acid molecule. In some such embodiments the nucleic acid molecule comprises an internal ribosome entry site (IRES) sequence or viral 2A peptide encoding sequence located between the nucleotide sequence encoding the CD enzyme and the nucleotide sequence encoding the UPRT enzyme. In other embodiments the nucleotide sequence encoding the CD enzyme and the nucleotide sequence encoding the UPRT enzyme are each present on a separate nucleic acid molecule.
The promoter(s) used to drive expression of the recombinant CD and/or UPRT enzymes can be any promoters known in the art that allow for expression of the enzyme(s) in the cells of interest. In some embodiments the promoter is an inducible promoter. Any suitable inducible promoter system known in the art can be used. Exemplary inducible promoter systems include, but are not limited to, doxycycline (Dox)-inducible promoters.
In some embodiments the promoter is a tissue-specific promoter, or other cell-type specific promoter, or the promoter activity is regulated by cell-type or tissue specific expression of recombinase such as, Cre recombinase, for example to drive expression specifically in the cells of interest. Any suitable tissue-specific promoter system known in the art can be used.
Methods for delivery of vectors or other nucleic acid molecules to cells either in vitro or in vivo are well known in the art, and any suitable methods can be used in conjunction with the present invention. In some embodiments of the present invention the nucleic acid molecules described herein are delivered to cells in vitro. In other embodiments of the present invention the nucleic acid molecules of the invention are delivered to cells in vivo. In some embodiments of the present invention the nucleic acid molecules described herein are delivered to cells in vitro and the cells are then delivered to an animal. In some embodiments the nucleic acid molecules described herein are present and/or expressed in the cells of interest transiently. In some embodiments the nucleic acid molecules described herein are present and/or expressed in the cells of interest stably/permanently. In some embodiments the nucleic acid molecules described herein are used to generate transgenic animals containing the recombinant CD and/or UPRT sequences. The vector to be used, and the means used for delivery of that vector, can be selected based on the situation, e.g. the cell type, whether stable or transient transfection/transduction is required, whether an integrating vector is desired, whether the cells of interest are in vivo or in vitro, and the like.
One of the major advantages of the methods of the present invention is that the tagged halogenated-uridine-containing RNAs can be detected and isolated/purified using simple and widely available technologies—employing antibodies that bind to halogenated-uridine-containing RNAs, but not halogenated-cytidine-containing RNAs, and routine immuno-affinity purification techniques. In some embodiments, the halogenated-uridine-containing RNAs technologies can be detected using any antibody known in the art that binds specifically to halogenated-uridine-containing RNAs but that does not bind to non-halogenated RNAs. One such type of antibody is an anti-BrdU antibody, of which several are known in the art and available from commercial sources, including but not limited to, the anti-BrdU antibody available from Abcam (catalog number ab6326). Any routine immuno-affinity purification techniques can be used to the tagged halogenated-uridine-containing RNAs, including, but not limited to immuno-affinity chromatography methods, including column-based methods, bead-based methods, and the like. For example, standard immno-precipitation techniques can be used. In some embodiments, two or more successive rounds of immuno-affinity purification are used.
Tagged halogenated-uridine-containing RNAs obtained from cells of interest using the methods of the present invention can be analyzed in any way that any other RNA can be analyzed. Such methods include, but are not limited to, performing hybridization-based analysis (such as microarray analysis), RT-PCR analysis, next generation RNA sequencing (“RNA-seq”) analysis, and the like. Methods of performing such analyses are well known in the art and described in the published literature. Furthermore, kits for performing such analysis, including kits for RNA-seq library preparation and sequencing, are commercially available from multiple sources.
Embodiments of the present invention can be further described and understood by reference to the following non-limiting “Example.” It will be apparent to those skilled in the art that many modifications to the specific description provided in the Example can be practiced without undue experimentation and without departing from the spirit of and scope of the present invention.
Numerous biologically and clinically significant cell populations represent only minute fractions of tissues, however, existing techniques lack the sensitivity to faithfully capture rare transcriptomes in situ (1-3). To overcome this problem, we developed fluorouracil-tagged RNA sequencing (Flura-seq) to define the transcriptomes of rare cells of interest from an intact tissue microenvironment. Fluraseq utilizes cytosine deaminase (CD) to convert the non-natural pyrimidine fluorocytosine to fluorouracil. Expression of S. cerevisiae CD and exposure to fluorocytosine generates fluorouracil and metabolically labels newly synthesized RNAs specifically in cells of interest. Fluorouracil-tagged RNAs can then be immunopurified and sequenced. We applied Flura-seq to define the transcriptome of human breast cancer xenografts representing as few as 0.003% of host organ cell population during the early stages of metastatic colonization of mouse lungs. The robustness, simplicity and lack of toxicity of Flura-seq make this tool broadly applicable to many studies in developmental, regenerative, and cancer biology.
Tissues are comprised of different cell types whose interactions elicit distinct gene expression patterns that regulate tissue formation, regeneration, homeostasis and repair. Analysis of these gene expression patterns require methods that can capture as closely as possible the transcriptomes of cells of interest in their tissue microenvironment. Current technologies designed to study in situ transcriptomics are limited by their low sensitivity (requiring more than 1% of the total tissue) (1-3), the involvement of multiple steps after tissue dissociation (3-5), or the requirement for sophisticated tools (6), making it challenging to transcriptionally profile rare cell populations rapidly isolated from their native microenvironment. To address this problem, here we describe the development of Flura-seq and its initial application to the analysis of micrometastatic cell populations representing a tiny fraction of the host organ. Flura-seq is based on the use of cytosine deaminase (CD), a key enzyme of the pyrimidine salvage pathway in fungi and prokaryotes (7)′ CD is absent in mammalian cells, which instead use cytidine deaminase for the same purpose (7). In addition to converting cytosine to uracil, CD can also convert 5-fluorocytosine (5-FC), a non-natural pyrimidine, to 5-flourouracil (5-FU). 5-FU is endogenously converted to fluorouridine triphosphate (F-UTP), which is then incorporated into RNA.
S. cerevisiae CD was exogenously expressed in human embryonic kidney 293T cells, and the cells were treated with 250 μM fluorocytosine (5-FC). This system is predicted to generate intracellular fluorouracil (5-FU), which is then incorporated into newly synthesized RNAs (
Next, we determined whether 5-FU-labeled mRNAs can be specifically and efficiently isolated by immunoprecipitation. Labeled mRNAs of three representative genes with varying levels of expression were detectable as early as 2 h after treatment with 5-FC, and the labeling continued to increase for up to 24 h (
Next, we determined whether this system could be used to isolate RNA specifically from cells of interest that were admixed with a large proportion of other cells. MDA231 cells expressing CD/UPRT were co-cultured with the mouse breast cancer 4T1 cell line at ratios of 10′ to 10′ (100-1000 MDA231 cells to 106 4T1 cells). After 12 h of 5-FC addition, 5-FU-labeled mRNAs were immunoprecipitated, and the proportion of human and mouse mRNA for representative housekeeping genes were determined by reverse transcriptase-polymerase chain reaction (RT-PCR). Human mRNAs were enriched by more than 105 fold compared to mouse mRNAs (
5-FU is extensively used as a chemotherapeutic agent for the treatment of solid tumors, based on its ability to induce cancer cell death by inhibiting thymidylate synthase and by causing DNA and RNA damage (12-14). It was therefore crucial to test whether our 5-FU-tagging method, which is based on low concentrations of 5-FC and short incubation time periods, has significant cytotoxic effects under our experimental conditions. CD/UPRT-expressing cells were treated with 5-FC at concentrations of up to 250 μM for up to 12 h (“Flura-tagged” cells), and the effects of this treatment on RNA damage response and transcriptional regulation were investigated.
A key feature of the RNA damage response is the formation of stress granules (15). Ras GTPase-activating protein-binding protein 1 (G3BP), a key component of stress granules, forms distinct foci during stress granule formation (12). We determined the dose- and time-dependent effect of 5-FC treatment on stress granule formation by immunostaining of the cells with anti-G3BP antibody. Cells treated with sodium arsenite (NaAsO2), which causes stress granule formation (12), formed distinct G3BP foci within 1 h, whereas cells treated with 5-FC did not contain stress granules after 12 h with any of the tested concentration (
For a more sensitive analysis of potential alterations caused by Flura-tagging, we investigated the transcriptional changes occurring under these experimental conditions. We compared the transcriptome of cells treated with two different concentrations of 5-FC (50 μM and 250 μM) at two different time points (4 h and 12 h) with that of control cells, using RNA-seq. Only 113 genes showed statistically significant differences in expression in cells treated with 50 μM of 5-FC for 4 h when compared to the control cells (
Next, we determined whether Flura-seq could be used to characterize transcriptomics in situ from a small number of cancer cells disseminated in an intact organ that would be challenging to achieve using existing technologies. Understanding the gene expression in the early stage micrometastatic cancer cells has a huge clinical significance as it represents the state of cancer cells before they develop into life threatening and difficult to treat macrometastases. MDA231 cells engineered to express a GFP-luciferase fusion protein for imaging and bioluminescence analysis, and DOX-inducible CD/UPRT for Flura-tagging of RNA, were inoculated into the tail vein of Foxn1nu immunodeficient mice to allow colonization of the pulmonary parenchyma (
To establish a 5-FC dose for use in vivo, we used the same dose as thiouracil (250 mg/kg) that has been established to be non-toxic for the treatment up to 12 h in mice used for tagging RNA with thiouracil, which is structurally similar to 5-FU (16). Pharmacokinetics studies have shown that 5-FC has a very short half-life in mice (0.36 to 0.43 h) (17), suggesting that large fraction of 5-FC is cleared from the body rapidly. Furthermore, the expression of several representative human genes was within 2-fold range in mouse lungs inoculated with control MDA231 cells or the cells expressing CD/UPRT and treated with 250 mg/kg 5-FC for 8 h (
Next, we investigated the signal-to-noise ratio of Flura-tagging in vivo by measuring the relative capture of two representative housekeeping human transcripts (signal) and mouse transcripts (noise). Human mRNAs were enriched more than a 1000-fold compared to the equivalent mouse mRNAs (
To determine the sensitivity of Flura-tagging, we estimated the percentage of human cells expressing RFP (along with CD and UPRT) (
Finally, we determined whether Flura-tagged mRNA from micrometastatic lesions could be sequenced to characterize the in situ transcriptomes of cancer cells. Mice were treated with 5-FC for 4 h or 12 h, and Flura-tagged RNAs were immunopurified and sequenced. The sequenced reads were aligned to a hybrid genome containing both human and mouse genomes, so that reads coming from human or mouse cells could be distinctly identified. In mice treated with 5-FC for 4 h, approximately 53% of the aligned reads were mapped to human genome, whereas 74% of the aligned reads were mapped to human genome when the mice were treated with 5-FC for 12 h (
To determine the effect of in vivo growth on MDA231 lung xenografts, we compared the transcriptomic profiles of Flura-tagged MDA231 micrometastases in vivo and in tissue culture with complete growth media. The expression of 1917 genes were found to differ by more than 4-fold in vivo compared to in vitro (
In this Example, we describe a novel method that can define in situ transcriptomes from a very small cell population representing a tiny fraction of an organ. This technique requires the expression of an exogenous enzyme to selectively label RNA in a highly controlled manner, thus minimizing noise drastically and thus making it possible to study cell types that represent a minute fraction of the total tissue. Flura-seq can be easily applied to other areas of biomedical research in model systems. For example, genetically engineered conditional expression of CD/UPRT would allow the analysis of cell-type specific transcriptomics in mice with high sensitivity. Prominent examples of specific cell types that constitute rare subpopulations within their tissues and could be targeted for analysis by Flura-seq include adult stem cells, and specific subtypes of immune and neuronal cells. Flura-seq can be applied to identify gene expression changes in these cells under different physiological, developmental and pathological conditions to uncover mechanisms involved in tissue homeostasis, regeneration and pathophysiology. Another feature of Flura-seq is that it only identifies newly synthesized transcripts, making it a powerful tool for studying changes in transcription under different stimuli, such as cytokines, pharmacologic agonists and antagonists, stress signals, and other inputs that act by rapidly changing the transcriptomic state of target cells both in vivo and in vitro. Further, since Flura-seq involves covalently labeling RNA, it can easily complement other techniques such as single-cell sequencing to combine in situ transcriptomic analysis with profiling of the dissociated cell population with single cell resolution. Thus, Flura-seq can be applied to study gene expression in rare cell populations in their native environments to address a wide range of biological questions.
Human embryonic kidney transformed with T-cell antigen (293T) and human breast cancer MDA-MB-231 (MDA231) cells were cultured in DMEM High Glucose medium (Wheaton) supplemented with 10% Fetal Bovine Serum (Clontech, Catalog number 631106) supplemented with 2 mM L-Glutamine. All cell lines have been routinely tested for mycoplasma contamination. For the induction of CD or CD and UPRT, cells were treated with 1 μg/ml doxycycline for 24 h. For Fluorouracil tagging, cells were treated with 250 μM of 5-FC or 5-FU unless indicated. Where indicated, 125 μM thymine was added together with 5-FC. For the formation of stress granules, cells were treated with 500 μM Sodium Arsenite (Santa Cruz Biotechnologies, Catalog number SC-301816).
Doxycycline (Sigmal-Aldrich, Catalog number D9891), Fluorocytosine (Sigma-Aldrich, Catalog number F7129), Fluorouracil (Sigma-Aldrich, Catalog number F6627), Sodium Arsenite (Santa Cruz Biotechnology, Catalog number SC-301816), Thymine (Sigma-Aldrich, Catalog number T0376). Anti-BrdU antibody (Abcam, Catalog number ab6326), Anti-G3BP antibody (Abcam, Catalog number ab56574), Anti-CD31 antibody (Dianova, Catalog number DIA-310), Anti-GFP antibody (Ayes Labs, Catalog number GFP-1020).
Cells were fixed with 4% paraformaldehyde for 10 min, permeabilized with 0.2% TritonX-100 for 10 min, blocked with 5% BSA for one hour at room temperature, prior to incubation with primary antibodies at 4° C. overnight, and secondary antibodies (Thermo Fisher anti-chicken antibody, Catalog number A-11039, Thermo Fisher anti-rat antibody, Catalog number A-11006, Abcam anti-rat antibody, Catalog number ab150117) incubated for 1 h at room temperature. Mouse lungs were fixed in 4% paraformaldehyde overnight at 4° C., embedded in paraffin and sectioned. The lung sections were deparaffinized and stained with hematolxylin/eosin or immunostained using standard protocols. Automated image analysis was performed using the FIJI software package.
Flura-Tagged and TU-Tagged mRNA Extraction
Cells or tissues were lysed in lysis buffer (20 mM Tris-HCl pH 7.5, 500 mM LiCl, 1% LiDS, 1 mM EDTA, 5 mM DTT), and mRNAs were extracted using Oligo (dT)25 magnetic beads (New England Biolabs, Catalog number S1419S) following the manufacturer's recommended protocol. The isolated mRNAs were immunoprecipitated using anti-BrdU antibody (1-5 μg/sample) conjugated with Protein G Dynabeads (Thermo Fisher Scientific, Catalog number 10004D) by overnight incubation at 4° C. The mRNAs were incubated with the antibody bead complex in 0.8× Binding buffer (0.5×SSPE with 0.025% Tween 20) at room temperature for 1-2 h in a rotator. Subsequently, beads were washed twice with Binding buffer, twice with Wash buffer B (1×SSPE with 0.05% Tween 20), once with Wash buffer C (TE with 0.05% Tween 20), and once with TE buffer. The bound mRNAs were eluted in 200 μl of 100 μg/mL BrdU for 45 min in a shaker at room temperature. The eluted RNAs were purified using the RNeasy MinElute Clean up kit (Qiagen, Catalog number 74204) following the manufacturer's protocol. The RNA was eluted in 100 μl RNAase free water. The Flura-tagged RNA elute were re-precipitated as described above, and eluted in 12.5 μl final volume. The RNA was either reverse-transcribed using cDNA kit-First Strand Transcriptor (Roche, Catalog number 043790-12001) following the manufacturer's protocol, or used for Flura-Seq. TU-tagged mRNAs were purified as described in (20).
Mouse experiments were performed following the protocols approved by the MSKCC Institutional Animal Care and Use Committee (IACUC). 5-6 weeks old female mice (Mus musculus) Hsd:Athymic-Foxnlnu were used in all the experiments. For lung colonization experiments, 50,000 MDA231 cells suspended in 100 μl PBS were injected into the tail vein. Proliferation of injected cancer cells were quantified using bioluminescence imaging following retroorbital injection of luciferin. CD/UPRT were induced by feeding mice with doxycycline containing diet for 2-3 days. For Flura-tagging, mice were injected with 250 mg/kg (500 μl) 5-FC intraperitoneally together with 125 mg/kg (500 μl) thymine subcutaneously. For thiouracil-tagging, mice were injected intraperitoneally with 250 mg/kg (500 μl) of thiouracil. The mice were euthanized 4-12 h post injection, lungs were harvested and processed for downstream experiments. For RNA analysis, lungs were dissociated using the PRO 200 grinder from PRO Scientific Inc. in RNA extraction lysis buffer. The lung lysates were either used immediately for mRNA extraction or stored at −80° C. for later use. Flura-tagged mRNAs were isolated as described above.
Harvested lungs were chopped into small pieces, which were then incubated at 37° C. in 30 mL digestion buffer (5% Fetal Bovine Serum (FBS) 1 mM L-glutamine 0.35 mg/mL Worthington Type III collagenase, 6.25×10-3 U/mL dispase, 100 U/mL penicillin, 100 μg/mL streptomycin, 6.25 ng/mL amphotericin B) containing 10 mL trypsin and 30 μl DNAse for 1 h. The cells were filtered through a 70 μM filter, and cells were collected by centrifugation. The cell pellets were then resuspended in PBS containing 0.1% FBS and 100 μg/ml DAPI, and analyzed using a BD FACS Aria™ IIU Flow cytometer. CD or CD/UPRT expressing stable cell lines were treated with 1 μg/mL doxycycline for 24 h, trypsinized, filtered and sorted for RFP positive cells using a BD LSRFortessa Flow cytometer.
RNA-seq library preparation. Total RNA was purified using Qiagen RNeasy Mini Kit. Quality and quantity was checked by Agilent BioAnalyzer 2000. 10 ng RNA per sample was used for library construction with Sample Prep Kit v2 (Illumina) according to manufacturer's instructions. Libraries were multiplex sequenced libraries on a Hiseq2500 platform, and more than 25 million raw paired-end reads were generated for each sample.
Flura-seq library preparation. RNA was amplified by SMARTer PCR with the number of PCR cycles determined empirically based on the amount of purified Flura-tagged RNA. The Nextera XT kit was used to prepare sequencing libraries following the manufacturer's protocol. In our in vivo experiments, 20 cycles of PCR were used.
In all relevant experiments, mice were chosen randomly for different treatments. Comparisons between samples were done in the gene expression analysis, and each group had 2-3 biological replicates that are indicated in the figure legends for each experiment. The numbers of samples are underpowered but adequate to detect a trend of gene expression for further analysis (21). There was no blinding in any of the experiments.
Reads were quality checked using FastQC v0.11.5 and mapped to a human (hg19) or hybrid human-mouse (hg19-mm10) genome with STAR2.5.2b (22) using standard settings for paired reads. Uniquely mapped reads were assigned to annotated genes with HTSeq v0.6.1p1 (23) with default settings. Read counts were normalized by library size, and differential gene expression analysis based on a negative binomial distribution was performed using DESeq2 v3.4 (24). In general, thresholds for differential expression were set as follows: adjusted p-value<0.05, fold change >2.0 or <0.5, and average normalized read count >10. Genes were considered detectable in the immunoprecipitation samples with a normalized read count >100. Gene set enrichment analysis was performed using GSVA v3.4 (25) and previously curated gene sets (26): HALLMARK_TGF_BETA_SIGNALING, HALLMARK_HEDGEHOG_SIGNALING, HALLMARK_WNT_BETA_CATENIN_SIGNALING, HALLMARK_PI3_K_AKT_MTOR_SIGNALING, HALLMARK_NOTCH_SIGNALING, HALLMARK_KRAS_SIGNALING_UP, CORDENONSI_YAP_CONSERVED_SIGNATURE, HALLMARK_IL6_JAK_STAT3_SIGNALING.
Cytosine Deaminase (CD)(Addgene 35102), Uracil Phospho Ribosyl Transferase (UPRT) (Addgene 47110) and rtTA3 (Addgene 26730) expressing plasmids were purchased from Addgene. Primers used for cloning the constructs described in the manuscript are described in Table S4. CD and UPRT described above were used as template for PCR for subcloning. RFP and IRES were amplified using pTRIPZ (Dharmacon) as a template. The PCR products were either ligated using DNA Ligase after restriction enzyme digestion and/or by Gibson Assembly.
This application claims the benefit of priority of U.S. Provisional Patent Application No. 62/471,264 filed on Mar. 14, 2017, the content of which is hereby incorporated by reference in its entirety.
This invention was made with government support under grant number W81XWH-12-1-0074 awarded by the U.S. Army Medical Research and Materiel Command. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2018/022092 | 3/13/2018 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62471264 | Mar 2017 | US |