The contents of the electronic sequence listing (STAN-1939_S22-045_SEQ_LIST.xml; Size: 12,737 bytes; and Date of Creation: Aug. 12, 2024) is herein incorporated by reference in its entirety.
Single-cell transcriptomics often serve as the de facto way to define cell types and states, but targeting cells based on their RNA profile has remained challenging. RNA sense-response systems would for example enable the identification and destruction of harmful cells (e.g., in the contexts of cancer and autoimmune disorders), or the experimental manipulation of specific cells in a complex environment (e.g., the nervous and the immune systems). Available RNA sensing technologies are limited to miRNAs, or require careful design around functional RNA structures such as ribozymes, guide RNAs or internal ribosome entry sites. For the latter, an additional confounding factor is the cell's natural response to double-stranded RNA (dsRNA). dsRNA editing by adenosine deaminases acting on RNA (ADARs) allows for the editing of specific RNAs.
Provided herein are methods and kits for detecting target RNAs and expressing proteins in target cells utilizing ADAR editing.
The present disclosure provides a method for expressing a protein in a target cell, the method comprising combining the target cell with a sensor RNA comprising the following: (ia) a first nucleotide sequence comprising a sensor nucleotide sequence that is reverse complementary to the target RNA wherein the sensor nucleotide sequence comprises a stem-loop sequence comprising one or more editable codons, or (ib) a first nucleotide sequence comprising a sensor nucleotide sequence that is reverse complementary to the 3′ UTR of the target RNA, wherein the sensor nucleotide sequence comprises one or more editable codons, and (ii) a second nucleotide sequence encoding a first cleavage domain, and (iii) a third nucleotide sequence encoding an output protein; wherein the target RNA is present in the target cell.
The present disclosure provides a method for detecting a target RNA, the method including (a) combining the biological sample with a sensor RNA including the following: (i) a first nucleotide sequence encoding a marker protein, (ii) a second nucleotide sequence including a sensor nucleotide sequence that is reverse complementary to the 3′ UTR of the target RNA, wherein the sensor nucleotide sequence includes a stop codon, (iii) a third nucleotide sequence encoding a second cleavage domain, and (iv) a fourth nucleotide sequence encoding an output protein; (b) assaying for the presence of the output protein in the biological sample.
The present disclosure also provides method for detecting a target RNA in a biological sample, the method including: (a) combining the biological sample with a sensor RNA including the following: (i) a first nucleotide sequence including a stem-loop sequence including one or more stop codons, (ii) a second nucleotide sequence including a sensor nucleotide sequence that is reverse complementary to the target RNA, (iii) a third nucleotide sequence encoding a first cleavage domain, and (iv) a fourth nucleotide sequence encoding an output protein; and (b) assaying for the presence of the output protein in the biological sample.
The present disclosure provides a method for detecting a target RNA in a biological sample, the method comprising: (a) combining the biological sample with a sensor RNA comprising the following: (i) a first nucleotide sequence comprising a sensor nucleotide sequence that is reverse complementary to the target RNA, wherein the sensor nucleotide sequence comprises a start codon and (ii) a second nucleotide sequence encoding an output protein; and (b) assaying for the presence of the output protein in the biological sample.
The present disclosure provides a method for detecting a target RNA in a biological sample, the method comprising: (a) combining the biological sample with a sensor RNA comprising the following: (i) a first nucleotide sequence comprising a sensor nucleotide sequence that is reverse complementary to the target RNA, wherein the sensor nucleotide sequence comprises an AUA sequence and (ii) a second nucleotide sequence encoding an output protein; and (b) assaying for the presence of the output protein in the biological sample.
Kits for practicing the subject methods are also provided.
The invention is best understood from the following detailed description when read in conjunction with the accompanying drawings. It is emphasized that, according to common practice, the various features of the drawings are not to-scale. On the contrary, the dimensions of the various features are arbitrarily expanded or reduced for clarity. Included in the drawings are the following figures.
Before describing exemplary embodiments in greater detail, the following definitions are set forth to illustrate and define the meaning and scope of the terms used in the description.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Singleton, et al., DICTIONARY OF MICROBIOLOGY AND MOLECULAR BIOLOGY, 2D ED., John Wiley and Sons, New York (1994), and Hale & Markham, THE HARPER COLLINS DICTIONARY OF BIOLOGY, Harper Perennial, N.Y. (1991) provide one of skill with the general meaning of many of the terms used herein. Still, certain terms are defined below for the sake of clarity and ease of reference.
Certain ranges are presented herein with numerical values being preceded by the term “about.” The term “about” is used herein to provide literal support for the exact number that it precedes, as well as a number that is near to or approximately the number that the term precedes. In determining whether a number is near to or approximately a specifically recited number, the near or approximating unrecited number may be a number which, in the context in which it is presented, provides the substantial equivalent of the specifically recited number.
It must be noted that as used herein and in the appended claims, the singular forms “a”, “an”, and “the” include plural referents unless the context clearly dictates otherwise. For example, the term “a RNA sensor” refers to one or more RNA sensors, i.e., a single RNA sensor and multiple RNA sensors. It is further noted that the claims can be drafted to exclude any optional element. As such, this statement is intended to serve as antecedent basis for use of such exclusive terminology as “solely,” “only” and the like in connection with the recitation of claim elements, or use of a “negative” limitation.
The terms “polynucleotide” and “nucleic acid,” used interchangeably herein, refer to a polymeric form of nucleotides of any length, either ribonucleotides or deoxynucleotides. Thus, this term includes, but is not limited to, single-, double-, or multi-stranded DNA or RNA, genomic DNA, cDNA, DNA-RNA hybrids, or a polymer including purine and pyrimidine bases or other natural, chemically or biochemically modified, non-natural, or derivatized nucleotide bases. The terms “polynucleotide” and “nucleic acid” should be understood to include, as applicable to the embodiment being described, single-stranded (such as sense or antisense) and double-stranded polynucleotides.
By “hybridizable” or “complementary” or “substantially complementary” it is meant that a nucleic acid (e.g. RNA, DNA) contains a sequence of nucleotides that enables it to non-covalently bind, i.e. form Watson-Crick base pairs and/or G/U base pairs, “anneal”, or “hybridize,” to another nucleic acid in a sequence-specific, antiparallel, manner (i.e., a nucleic acid specifically binds to a complementary nucleic acid) under the appropriate in vitro and/or in vivo conditions of temperature and solution ionic strength. Standard Watson-Crick base-pairing includes: adenine/adenosine) (A) pairing with thymidine/thymidine (T), A pairing with uracil/uridine (U), and guanine/guanosine) (G) pairing with cytosine/cytidine (C). Inosine (I) bases pair with cytosine/cytidine. In addition, for hybridization between two RNA molecules (e.g., dsRNA), and for hybridization of a DNA molecule with an RNA molecule (e.g., when a DNA target nucleic acid base pairs with a guide RNA, etc.): G can also base pair with U. For example, G/U base-pairing is partially responsible for the degeneracy (i.e., redundancy) of the genetic code in the context of tRNA anti-codon base-pairing with codons in mRNA. Thus, in the context of this disclosure, a G (e.g., of a protein-binding segment (e.g., dsRNA duplex) of a guide RNA molecule; of a target nucleic acid (e.g., target DNA or RNA) base pairing with a sensor RNA) is considered complementary to both a U and to C. For example, when a G/U base-pair can be made at a given nucleotide position of a protein-binding segment (e.g., dsRNA duplex) of a sensor RNA molecule, the position is not considered to be non-complementary, but is instead considered to be complementary.
Hybridization requires that the two nucleic acids contain complementary sequences, although mismatches between bases are possible. The conditions appropriate for hybridization between two nucleic acids depend on the length of the nucleic acids and the degree of complementarity, variables well known in the art. The greater the degree of complementarity between two nucleotide sequences, the greater the value of the melting temperature (Tm) for hybrids of nucleic acids having those sequences. Typically, the length for a hybridizable nucleic acid is 8 nucleotides or more (e.g., 10 nucleotides or more, 12 nucleotides or more, 15 nucleotides or more, 20 nucleotides or more, 22 nucleotides or more, 25 nucleotides or more, or 30 nucleotides or more).
It is understood that the sequence of a polynucleotide need not be 100% complementary to that of its target nucleic acid to be specifically hybridizable. Moreover, a polynucleotide may hybridize over one or more segments such that intervening or adjacent segments are not involved in the hybridization event (e.g., a loop structure or hairpin structure, a ‘bulge’, and the like). A polynucleotide can include 60% or more, 65% or more, 70% or more, 75% or more, 80% or more, 85% or more, 90% or more, 95% or more, 98% or more, 99% or more, 99.5% or more, or 100% sequence complementarity to a target region within the target nucleic acid sequence to which it will hybridize. For example, an antisense nucleic acid in which 18 of 20 nucleotides of the antisense compound are complementary to a target region, and would therefore specifically hybridize, would represent 90 percent complementarity. The remaining noncomplementary nucleotides may be clustered or interspersed with complementary nucleotides and need not be contiguous to each other or to complementary nucleotides. Percent complementarity between particular stretches of nucleic acid sequences within nucleic acids can be determined using any convenient method. Example methods include BLAST programs (basic local alignment search tools) and PowerBLAST programs (Altschul et al., J. Mol. Biol., 1990, 215, 403-410; Zhang and Madden, Genome Res., 1997, 7, 649-656) or by using the Gap program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, Madison Wis.), e.g., using default settings, which uses the algorithm of Smith and Waterman (Adv. Appl. Math., 1981, 2, 482-489).
The terms “peptide,” “polypeptide,” and “protein” are used interchangeably herein, and refer to a polymeric form of amino acids of any length, which can include coded and non-coded amino acids, chemically or biochemically modified or derivatized amino acids, and polypeptides having modified peptide backbones.
The term “naturally-occurring” as used herein as applied to a nucleic acid, a protein, a cell, or an organism, refers to a nucleic acid, protein, cell, or organism that is found in nature. For example, a polypeptide or polynucleotide sequence that is present in an organism (including viruses) that can be isolated from a source in nature and which has not been intentionally modified by a human in the laboratory is naturally occurring.
The term “exogenous” as used herein as applied to a nucleic acid or a protein refers to a nucleic acid or protein that is not normally or naturally found in and/or produced by a given bacterium, organism, or cell in nature. As used herein, the term “endogenous nucleic acid” refers to a nucleic acid that is normally found in and/or produced by a given bacterium, organism, or cell in nature. An “endogenous nucleic acid” is also referred to as a “native nucleic acid” or a nucleic acid that is “native” to a given bacterium, organism, or cell. As used herein, the term “endogenous polypeptide” refers to a polypeptide that is normally found in and/or produced by a given bacterium, organism, or cell in nature.
“Recombinant,” as used herein, means that a particular nucleic acid or protein is the product of various combinations of cloning, restriction, and/or ligation steps resulting in a construct having a structural coding or non-coding sequence distinguishable from endogenous nucleic acids found in natural systems. Generally, DNA sequences encoding the structural coding sequence can be assembled from cDNA fragments and short oligonucleotide linkers, or from a series of synthetic oligonucleotides, to provide a synthetic nucleic acid which is capable of being expressed from a recombinant transcriptional unit contained in a cell or in a cell-free transcription and translation system. Such sequences can be provided in the form of an open reading frame uninterrupted by internal non-translated sequences, or introns, which are typically present in eukaryotic genes. Genomic DNA containing the relevant sequences can also be used in the formation of a recombinant gene or transcriptional unit. Sequences of non-translated DNA may be present 5′ or 3′ from the open reading frame, where such sequences do not interfere with manipulation or expression of the coding regions, and may indeed act to modulate production of a desired product by various mechanisms.
Thus, e.g., the term “recombinant” nucleic acid or “recombinant” protein refers to one which is not naturally occurring, e.g., is made by the artificial combination of two otherwise separated segments of sequence through human intervention. This artificial combination is often accomplished by either chemical synthesis means, or by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques. Such is usually done to replace a codon with a redundant codon encoding the same or a conservative amino acid, while typically introducing or removing a sequence recognition site. Alternatively, it is performed to join together nucleic acid segments of desired functions to generate a desired combination of functions. This artificial combination is often accomplished by either chemical synthesis means, or by the artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques.
By “construct” or “vector” is meant a recombinant nucleic acid, generally recombinant DNA, which has been generated for the purpose of the expression and/or propagation of a nucleotide sequence(s) of interest, or is to be used in the construction of other recombinant nucleotide sequences.
The term “transformation” or “transfection” refers to a permanent or transient genetic change induced in a cell following introduction of a nucleic acid (i.e., DNA and/or RNA exogenous to the cell). Genetic change (“modification”) can be accomplished either by incorporation of the new DNA into the genome of the host cell, or by transient or stable maintenance of the new DNA as an episomal element. Where the cell is a eukaryotic cell, a permanent genetic change is generally achieved by introduction of the DNA into the genome of the cell. Suitable methods of genetic modification include viral infection, transfection, conjugation, protoplast fusion, electroporation, particle gun technology, calcium phosphate precipitation, direct microinjection, and the like. The choice of method is generally dependent on the type of cell being transformed and the circumstances under which the transformation is taking place (i.e., in vitro, ex vivo, or in vivo). A general discussion of these methods can be found in Ausubel et al, Short Protocols in Molecular Biology, 3rd ed., Wiley & Sons, 1995.
The terms “regulatory region” and “regulatory elements”, used interchangeably herein, refer to transcriptional and translational control sequences, such as promoters, enhancers, polyadenylation signals, terminators, protein degradation signals, translational start and stop codons, translation initiation sites, splice enhancer/donor/branch/acceptor sites, and the like, that provide for and/or regulate expression of a coding sequence and/or production of an encoded polypeptide in a host cell. As used herein, a “promoter sequence” or “promoter” is a DNA regulatory region capable of binding/recruiting RNA polymerase (e.g., via a transcription initiation complex) and initiating transcription of a downstream (3′ direction) sequence (e.g., a protein coding (“coding”) or non-protein-coding (“non-coding”) sequence. A promoter can be a constitutively active promoter (e.g., a promoter that is constitutively in an active/“ON” state), it may be an inducible promoter (e.g., a promoter whose state, active/“ON” or inactive/“OFF”, is controlled by an external stimulus, e.g., the presence of a particular temperature, compound, or protein), it may be a spatially restricted promoter (e.g., tissue specific promoter, cell type specific promoter, etc.), and/or it may be a temporally restricted promoter (e.g., the promoter is in the “ON” state or “OFF” state during specific stages of embryonic development or during specific stages of a biological process, e.g., hair follicle cycle in mice).
“Operably linked” refers to a juxtaposition wherein the components so described are in a relationship permitting them to function in their intended manner. For instance, a promoter is operably linked to a nucleotide sequence (e.g., a protein coding sequence, e.g., a sequence encoding an mRNA; a non protein coding sequence, e.g., a sequence encoding a Shh protein; and the like) if the promoter affects its transcription and/or expression.
The term “adenosine deaminase acting on RNA” or “ADAR” refers to an enzyme that catalyze the hydrolytic C6 deamination of adenosine (A) to produce inosine (I) in RNA substrates that are double stranded. ADARs preferentially edit double stranded RNAs at sites of mismatches where mismatches containing adenosines and cytosines are editing more efficiently than other mismatches. Editing by ADARs results in nucleotide substitution in RNA, because the purine I generated as the result of the deamination reaction is recognized as G instead of A, both by ribosomes during translational decoding of mRNA and by RNA-dependent polymerases during RNA replication. The term “ADAR” encompasses any know type of ADAR such as ADAR1 (ADAR) or ADAR2 (ADARB2).
As used herein “ADAR1” refers to an adenosine deaminase acting on RNA that catalyze the hydrolytic C6 deamination of adenosine (A) to produce inosine (I) in RNA substrates that are double stranded. ADAR1 has 2 main isoforms, p150 and p110. The term “ADAR1” encompasses ADAR1 from various species. Amino acid sequences of ADAR1 from various species are publicly available. See, e.g., GenBank Accession Nos. NP_001102 (Homo sapiens ADAR1 p150), NP_001180424.1 (Homo sapiens ADAR1 p110), NP_001139768 (Mus musculus ADAR1 p150), NP_001033676 (Mus musculus ADAR1 p110). The term “ADAR1” as used herein also encompasses fragments, fusion proteins, and variants (e.g., variants having one or more amino acid substitutions, addition, deletions, and/or insertions) that retain ADAR1 enzymatic activity.
As used herein “ADAR2” refers to an adenosine deaminase acting on RNA that catalyze the hydrolytic C6 deamination of adenosine (A) to produce inosine (I) in RNA substrates that are double stranded. ADAR2 is exclusively localized to the nucleus. The term “ADAR2” encompasses ADAR2 from various species. Amino acid sequences of ADAR2 from various species are publicly available. See, e.g., GenBank Accession Nos. NP_056648.1 (Homo sapiens ADAR2), NP_001020008.1 (Mus musculus ADAR2), AC052474.1 (Doryteuthis opalescens ADAR2). The term “ADAR2” as used herein also encompasses fragments, fusion proteins, and variants (e.g., variants having one or more amino acid substitutions, addition, deletions, and/or insertions) that retain ADAR2 enzymatic activity.
The term “sample” as used herein relates to a material or mixture of materials, typically, although not necessarily, in fluid, i.e., aqueous, form, containing one or more components of interest. Samples may be derived from a variety of sources such as from food stuffs, environmental materials, a biological sample or solid, such as tissue or fluid isolated from an individual, including but not limited to, for example, plasma, serum, spinal fluid, semen, lymph fluid, the external sections of the skin, respiratory, intestinal, and genitourinary tracts, tears, saliva, milk, blood cells, tumors, organs, and also samples of in vitro cell culture constituents (including but not limited to conditioned medium resulting from the growth of cells in cell culture medium, putatively virally infected cells, recombinant cells, and cell components). In certain embodiments of the method, the sample includes a cell. In some instances of the method, the cell is in vitro. In some instances of the method, the cell is in vivo.
The term “biological sample” encompasses a clinical sample or a non-clinical sample, and also includes tissue obtained by surgical resection, tissue obtained by biopsy, cells in culture, cell supernatants, cell lysates, tissue samples, organs, bone marrow, blood, plasma, serum, and the like. A “biological sample” includes a sample obtained from a patient's sample cell, e.g., a sample containing polynucleotides and/or polypeptides that is obtained from a patient's sample cell (e.g., a cell lysate or other cell extract containing polynucleotides and/or polypeptides); and a sample containing sample cells from a patient. A biological sample containing a sample cell from a patient can also include normal, non-diseased cells. A biological sample may be from a plant or an animal. The biological sample may also be from any species. In certain embodiments of the method, the biological sample includes a cell. In some instances of the method, the cell is in vitro. In some instances of the method, the cell is in vivo.
The term “editable codon” as used herein refers to a 3 nucleotide sequence that is editable by an ADAR protein or a derivative thereof. The codon may be a start codon, a stop codon or an AUA codon. The codon contains a sequence that contains an adenosine base. In general, in the methods disclosed herein, the editable codon is a start codon that is edited to become a non-start codon, a stop codon that is edited to become a non-stop codon, or a non-start codon (i.e., AUA) that is edited to become a start codon.
Before the various embodiments are described, it is to be understood that the teachings of this disclosure are not limited to the particular embodiments described, and as such can, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only, and is not intended to be limiting, since the scope of the present teachings will be limited only by the appended claims.
The section headings used herein are for organizational purposes only and are not to be construed as limiting the subject matter described in any way. While the present teachings are described in conjunction with various embodiments, it is not intended that the present teachings be limited to such embodiments. On the contrary, the present teachings encompass various alternatives, modifications, and equivalents, as will be appreciated by those of skill in the art.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure belongs. Although any methods and materials similar or equivalent to those described herein can also be used in the practice or testing of the present teachings, some exemplary methods and materials are now described.
The citation of any publication is for its disclosure prior to the filing date and should not be construed as an admission that the present claims are not entitled to antedate such publication by virtue of prior invention. Further, the dates of publication provided can be different from the actual publication dates which can be independently confirmed.
As will be apparent to those of skill in the art upon reading this disclosure, each of the individual embodiments described and illustrated herein has discrete components and features which can be readily separated from or combined with the features of any of the other several embodiments without departing from the scope or spirit of the present teachings. Any recited method can be carried out in the order of events recited or in any other order which is logically possible.
All patents and publications, including all sequences disclosed within such patents and publications, referred to herein are expressly incorporated by reference.
Where a range of values is provided, it is understood that each intervening value, to the tenth of the unit of the lower limit unless the context clearly dictates otherwise, between the upper and lower limit of that range and any other stated or intervening value in that stated range, is encompassed within the invention. The upper and lower limits of these smaller ranges may independently be included in the smaller ranges, and are also encompassed within the invention, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in the invention.
It is appreciated that certain features of the invention, which are, for clarity, described in the context of separate embodiments, may also be provided in combination in a single embodiment. Conversely, various features of the invention, which are, for brevity, described in the context of a single embodiment, may also be provided separately or in any suitable sub-combination. All combinations of the embodiments pertaining to the invention are specifically embraced by the present invention and are disclosed herein just as if each and every combination was individually and explicitly disclosed. In addition, all sub-combinations of the various embodiments and elements thereof are also specifically embraced by the present invention and are disclosed herein just as if each and every such sub-combination was individually and explicitly disclosed herein.
In further describing the subject invention, methods for detecting a target RNA described first in greater detail. Next, methods of expressing a target protein of interest are described. Kits are also described.
As summarized above, methods are provided for detecting a target RNA in a biological sample, the method including (a) combining the biological sample with a sensor RNA and (b) assaying for the presence of an output protein associated with the detection with the target RNA. In some embodiments, the biological sample is a cell.
The target RNA may be any RNA. For example, the target RNA includes, without limitation, mRNA, long non-coding RNA, transfer RNA, ribosomal RNA, small RNAs such as microRNA, small interfering RNA, small nucleolar RNAs, etc. In some embodiments, the target RNA may differentially expressed in different tissues cell types, or cell states and the detecting of the target RNA may be used to identify tissue types, cell types or cell states. In some embodiments, the target RNA may be a genetic variant of gene. In these instances, the genetic variant may be predictive of a disease or susceptible to a disease such as an oncogenic mutation or a genetic variant associated with increased susceptibility to a pathogen. In some embodiments, the methods of the present disclosure may be used to detect point mutations that are associated with the development of a disease such as cancer, neurodegenerative disease, an autoimmune disease, etc. In some embodiments, the methods of the present disclosure are capable of detecting small indels, single nucleotide polymorphisms (SNPs) or variant, multi-nucleotide variant or dinucleotide variant, etc. In some embodiments, the methods of the present disclosure are capable of detecting and distinguishing copy number variants within and between biological samples. The target RNA may also be a gene fusion which may be predictive of cancer in general or a specific type of cancer. The target RNA may also be a specific splice variant (isoform) of a gene.
In some embodiments, the sensor RNA includes the following: (i) a first nucleotide sequence encoding a marker protein, (ii) a second nucleotide sequence encoding a first cleavage domain, (iii) a third nucleotide sequence including a sensor nucleotide sequence that is reverse complementary to the target RNA, wherein the sensor nucleotide sequence includes one or more stop codons, (iv) a fourth nucleotide sequence encoding a second cleavage domain, and (v) a fifth nucleotide sequence encoding an output protein.
In some embodiments, the sensor RNA includes the following: (i) a first nucleotide sequence encoding a marker protein, (ii) a second nucleotide sequence encoding a cleavage domain (iii) a third nucleotide sequence including a sensor nucleotide sequence that is reverse complementary to the 3′ UTR of the target RNA, wherein the sensor nucleotide sequence includes one or more stop codons, (iv) a fourth nucleotide sequence encoding a second cleavage domain, and (v) a fifth nucleotide sequence encoding an output protein.
In some embodiments, the sensor RNA includes the following: (i) a first nucleotide sequence containing a stem-loop sequence containing one or more stop codons (ii) a second nucleotide sequence containing a sensor nucleotide sequence that is reverse complementary to the target RNA, (iii) a third nucleotide sequence encoding a cleavage domain, and (iv) a fourth nucleotide sequence encoding an output protein.
In some embodiments, the sensor RNA includes the following: (i) a first nucleotide sequence containing a sensor nucleotide sequence that is reverse complementary to the target RNA wherein the sensor nucleotide sequence contains a stem-loop sequence containing one or more stop codons, (ii) a second nucleotide sequence encoding a cleavage domain, and (iii) a third nucleotide sequence encoding an output protein.
In some embodiments, the sensor RNA includes the following: (i) a first nucleotide sequence containing a sensor nucleotide sequence that is reverse complementary to the target RNA, (ii) a second nucleotide sequence containing a stem-loop sequence containing one or more stop codons (iii) a third nucleotide sequence encoding a cleavage domain, and (iv) a fourth nucleotide sequence encoding an output protein.
In some embodiments, the sensor RNA has one or more stop codons containing at least 1 base that is mismatched with 1) a sequence within the stem loop opposite the stop codon or 2) a sequence in the target RNA. The at least 1 base that is mismatched is generally not more than 2 bases that are mismatched. In an embodiment, the sensor RNA has one or more stop codons containing only 1 base that is mismatched with 1) a sequence within the stem loop opposite the stop codon or 2) a sequence in the target RNA. In some embodiments, the sensor RNA does not have any mismatched bases.
In certain embodiments, the sensor RNA contains the first nucleotide sequence to the third, fourth or fifth nucleotide sequences in order (i.e., the fifth nucleotide sequence follows the fourth nucleotide sequence which follows the third nucleotide sequence which follows the second nucleotide sequence which follows the first nucleotide sequence). In some embodiments, the sensor RNA contains the first nucleotide sequence to the third, fourth or fifth nucleotide sequence that are not in order described above.
The sensor RNA of the present disclosure contains a sensor nucleotide sequence or a stem-loop sequence containing one or more stop codons which is followed by a nucleotide sequence encoding an output protein. In some embodiments, the sensor RNA contains one or more stop codons that contain at least 1 base that is mismatched with the target RNA or the sequence within the stem-loop. In some embodiments, the sensor RNA does not contain any mismatches with the target RNA. In the presence of the target RNA, the sensor nucleotide sequence of the sensor RNA hybridizes to the target RNA thereby forming a double stranded RNA molecule that can recruit an ADAR protein. The double stranded RNA can contain a stop codon with or without mismatches, or a stop codon could be within the stem-loop of the sensor RNA. An ADAR protein then edits the adenosine base within the stop codon(s) of the sensor RNA to an inosine base. This editing removes the stop codon(s) which then allows the output protein to be produced from the sensor RNA within the biological sample.
When the sensor RNA contains a nucleotide sequence containing a stem-loop sequence comprising a stop codon, any stem-loop sequence may be used. In some embodiments, the stem loop contains natural editing sites. Natural editing sites are sites within nucleotide sequences which are edited in nature. Natural editing sites are known in the art and have been described in, for example, Gabay et al. (Nat Commun. 2022 Mar. 4; 13(1):1184) which is specifically incorporated by reference herein. Examples of natural editing sites include, without limitation, editing sites found in GRIA2, GRIA3, IGFBP7, NEIL1, FLNA, GRIK2, CDK13, GABRA3, GLI1, SPEG, HTR2C, GRIA4, CYFIP2, CADPS, CADPS, RICTOR, COG3, GRIK1, COPA, HBE1, SON, FLNB, MAGEL2, NOVAl, PNMT, WASH1, LAT, DACT3, FXYD5, ZNF717, ZNF551 CAPS1, etc. Natural editing sites are also disclosed in Table 1 below. In some embodiments, the stem-loop sequence is a GluR-B stem-loop or a modified variant thereof. In some embodiments, the stem contains a natural editing site while the loop is a synthetic sequence. In some embodiments, the sequence of the stem is altered compared to the natural editing site by the addition or removal of nucleotides in order to add or remove mismatches. In some embodiments, the sequence alteration adds or removes additional stop codons In some embodiments, the stem-loop sequence contains a CAPS1 derived stem-loop according to:
When the sensor RNA contains a nucleotide sequence containing a stem-loop sequence containing a stop codon, the length of the stem-loop may have a limit. For example, the stem-loop may be 50 bp or less, 40 bp or less, 30 bp or less or 20 bp or less. In an embodiment, the length of the stem-loop is 18-50 bps.
Sensor RNAs containing a nucleotide sequence containing a stem-loop sequence containing an editable codon have certain advantages relative to sensor RNAs that do not contain such a stem-loop sequence, such as those disclosed in International Application PCT/US2022/033459. This is due to ADAR having separate domains for RNA editing (catalytic domain) and dsRNA binding. First, sensor RNAs containing a nucleotide sequence containing a stem-loop sequence containing an editable codon decouples the sequence that is being edited (e.g., a stop codon) from the sequence that recruits the ADAR protein (i.e., the dsRNA segment that is formed when the sensor nucleotide sequence hybridizes to the target RNA). Generally, if the editable codon in the sensor RNA is a UAG (stop codon) and there is only one mismatch in the stop codon relative to the target RNA then the target RNA should have a CCA sequence (or a sequence that is reverse complementary to a different stop codon having one mismatch with the stop codon). The presence of the CCA sequence (or an equivalent sequence for a different editable codon) potentially limits the number of possible target RNAs. Requiring a specific sequence (such as CCA or an equivalent sequence) to be present in the target RNA can be limiting because it restricts which subsequence a sensor could be created against; for example, a CCA or equivalent sequence may only present in highly structured parts of the target RNA, may only be present in the coding sequence, or may be present in protein-bound sections of a target RNA, all of which may contribute to lower availability for sensor-target hybridization, reducing efficiency. With a sensor containing a stem-loop, the range of suitable subsequences is greatly increased, so problematic target RNA subsequences can be avoided and efficient ones utilized instead. In some embodiments where gene fusions or splice variants are to be distinguished, flexibility in target RNA subsequence choice is needed, and is provided by the stem-loop design. Second, while ADAR editing is largely sequence-agnostic, there are some minor biases primarily driven by the catalytic domain which extend beyond the editable codon. Biases driven by the catalytic domain are known in the art and have been described by, for example, Kuttan et al. (Proc Natl Acad Sci USA. 2012 Nov. 27; 109(48):E3295-304) which is specifically incorporated by reference herein. Editing sites in the sensor RNA may be dictated by the target RNA which precludes optimization of the editing site (i.e., the stop or non-stop codons of the present disclosure). By separating out the editing site from the sensor nucleotide sequence that hybridizes to the target RNA, to the editing site and the sensor nucleotide sequence can be optimized separately.
In some embodiments, the sensor RNA contains a non-start codon in place of a stop codon. In these embodiments, the sensor RNA contains the following: (i) a first nucleotide sequence containing a sensor nucleotide sequence that is reverse complementary to the target RNA, wherein the sensor nucleotide sequence contains a non-start codon (e.g. AUA) that contains at least 1 base that is mismatched with the target RNA sequence, (ii) a second nucleotide sequence encoding a second cleavage domain, and (iii) a third nucleotide sequence encoding an output protein. In the presence of the target RNA, the sensor RNA hybridizes to the target RNA thereby forming a double stranded RNA molecule containing one or more base mismatches within the non-start codon or elsewhere. An ADAR protein then edits the adenosine base within the non-start codon (e.g., AUA to AUI) of the sensor RNA to an inosine base. This editing converts the non-start codon to a start codon which then allows the output protein to be produced from the sensor RNA within the biological sample.
In some embodiments, the sensor RNA has a start codon in place of a stop codon. In these embodiments, the sensor RNA has the following: (i) a first nucleotide sequence having a sensor nucleotide sequence that is reverse complementary to the target RNA, wherein the sensor nucleotide sequence has a start codon (e.g. AUG) that has at least 1 base that is mismatched with the target RNA sequence and (ii) a second nucleotide sequence encoding an output protein wherein the sequence encoding the output protein has a start codon. In the presence of the target RNA, the sensor RNA hybridizes to the target RNA thereby forming a double stranded RNA molecule having one or more base mismatches within the start codon or elsewhere. An ADAR protein then edits the adenosine base within the start codon (e.g., AUG to IUG) of the sensor RNA to an inosine base. This editing converts the start codon to a non-start codon which then allows the output protein to be produced from the sensor RNA within the biological sample. Prior to editing, the presence of the first start codon within the sensor nucleotide sequence represents an upstream reading frame which suppresses the expression of the downstream reading frame. After editing, the upstream reading frame is removed allowing the downstream reading frame to be expressed which produces the output protein.
When the sensor RNA contains a start codon in place of a stop codon, the upstream reading frame as described above may have particular features. In some embodiments, the length of the upstream reading frame is shorter than the downstream reading frame. In some embodiments, the length of the upstream reading frame is longer than the downstream reading frame. In an embodiment, the length of the upstream reading frame is about the same length of the downstream reading frame.
In some embodiments, the sensor RNA contains a start codon in place of a stop codon. In these embodiments, the sensor RNA contains the following: (i) a first nucleotide sequence containing a sensor nucleotide sequence that is reverse complementary to the target RNA, wherein the sensor nucleotide sequence contains a start codon (e.g., AUG) that contains at least 1 base that is mismatched with the target RNA sequence and (ii) a second nucleotide sequence encoding an output protein. In the presence of the target RNA, the sensor RNA hybridizes to the target RNA thereby forming a double stranded RNA molecule containing one or more base mismatches within the start codon or elsewhere. An ADAR protein then edits the adenosine base within the start codon (e.g., AUG to IUG) of the sensor RNA to an inosine base. This editing converts the start codon to a non-start codon which then prevents the production of the output protein. In this embodiment, the output protein is only produced in the absence of the target RNA.
In some embodiments, the sensor RNA includes splice sites prior to the output protein. In these embodiments, the ADAR protein edits a codon at the splice thereby removing the splice site leading to the production of the output protein. In some embodiments, the ADAR protein edits a non-splice site converting it into a splice site thereby inactivating the production of the output protein.
In some cases, it is desired to reduce the immunogenicity of the sensor RNA. Methods of reducing the immunogenicity of RNAs are known in the art such and have been described by, for example, Starostina et al. (Vaccines (Basel). 2021 May 3; 9(5):452) which is specifically incorporated by reference herein. In general, methods of reducing the immunogenicity of a sensor RNA involves the incorporation of modified ribonucleic acids into the sensor RNA. Modified ribonucleic acids that find use in the present disclosure includes, without limitation, methylcytosine, pseudouridine, methyladenosine, etc. In some embodiments, the methylcytosine is 5-methylcytosine. In some embodiments, the pseudouridine is N1-methyl-pseudouridine. In some embodiments, the methyladenosine is a N6-methyladenosine. In some embodiments, the methyladenosine is a N1-methyladenosine. In some embodiments, a portion of the nucleotides present in the sensor RNA are composed of modified ribonucleic acids. For instance, a portion of the uridines in the sensor RNA are replaced with pseudouridines. When the uridines of the sensor RNA are replaced with pseudouridines, a certain percentage of the uridines are replaced with pseudouridines. For instance, about 1-10%, about 10-20%, about 20-30%, about 30-40%, about 40-50%, about 50-60%, about 60-70%, about 70-80%, 80-90% or greater than 90% of the uridines are replaced with pseudouridines. In an embodiment, 75% or less of the uridines in the sensor RNAs are replaced with pseudouridines. In an embodiment, the sensor sequence does not have pseudouridines.
When a sensor RNA contains pseudouridines, the pseudouridine(s) may be in specific locations. In some embodiments, the pseudouridine(s) are not adjacent to adenosines that are the targets of ADAR editing. In some embodiments, the pseudouridine(s) are not contained in the sensor sequence that hybridizes with a target RNA. When a sensor RNA contains pseudouridines, the sensor may contain a particular stop codon. In some embodiments, the stop codon used is UGA. When the UGA stop codon is used, the adenosine in the UGA may be followed by a specific nucleotide. In some embodiments, the adenosine in the UGA is followed by guanosine such the nucleotide sequence is UGAG.
In some embodiments, the sensor nucleotide sequence includes bases that are mismatched with adenosine bases within the target RNA that are not within a start or stop codon. In some embodiments, the mismatched bases prevent the editing of adenosines that are not within the stop or start codons.
In some embodiments, the sensor nucleotide sequence includes one or more editing inducing elements (EIEs). Suitable EIEs that find use in the present disclosure are disclosed within Uzonyi et al. (Mol Cell. 2021 Jun. 3; 81(11):2374-2387) and Danan-Gotthold et al. (Genome Biol. 2017 Oct. 23; 18(1):196).
A marker protein of the present disclosure may be any marker protein that is useful for the detection of the presence of a sensor mRNA within a biological sample. For instance, the marker protein may be a fluorescent protein or a luminescent protein. Non-limiting examples of useful fluorescent proteins include but are not limited to GFP, EBFP, Azurite, Cerulean, mCFP, Turquoise, ECFP, mKeima-Red, TagCFP, AmCyan, mTFP, TurboGFP, TagGFP, EGFP, TagYFP, EYFP, Topaz, Venus, mCitrine, TurboYFP, mOrange, TurboRFP, tdTomato, TagRFP, dsRed2, mRFP, mCherry, mPlum mRaspberry, mScarlet, etc. Examples of luminescent proteins, include without limitation, Cypridinia luciferase, Gaussia luciferase, Renilla luciferase, Phontinus luciferase, Luciola luciferase, Pyrophorus luciferase, Phrixothrix luciferase, etc. In some embodiments, the marker protein may be the first half of the output protein. In these embodiments, the sequence encoding the marker protein produces the first half of the output which may be non-functional without the second half of the output protein in the absence of the target RNA. In the presence of the target RNA, the second half of the output protein is produced. When the second half of the output protein is produced in the presence of the first half of the output protein, the two halves are then able to form a functional output protein. In these embodiments, the first half of the output protein is the N-terminus of the output protein and the second half of the output protein is the C-terminus of the output protein.
In certain embodiments, the sensor RNA includes a nucleotide sequences that encodes a cleavage domain. Cleavage domains that find use in the present disclosure include without limitation, HIV-1 protease cleavage domain, TEV cleavage domain, preScission protease cleavage domain, HCV protease cleavage domain, RecA cleavage domain, self-cleaving domain, etc. When a self-cleaving domain is used then the self-cleaving domain may be a 2A self-cleaving domain. 2A self-cleaving domains that find use in the present disclosure include T2A, P2A, E2A and F2A which are described in Szymczak-Workman et al. (Cold Spring Harb Protoc. 2012 Feb. 1; 2012(2):199-204). In some embodiments, the sensor RNA includes a first and a second cleavage domain. When the sensor RNA includes a first and a second cleavage domain, the cleavage domains may be of the same type or they may be of a different type. For instance, the first cleavage domain may be a P2A self-cleaving domain and the second cleavage domain may also be a P2A self-cleaving domain or the first cleavage domain may be a P2A self-cleaving domain and the second cleavage domain may also be a T2A self-cleaving domain or any combination thereof.
The sensor nucleotide sequence of the present disclosure may be reverse complementary to any region of the target RNA. In certain embodiments, the sensor nucleotide sequence is reverse complementary to the 3′ UTR of the target RNA. In some embodiments, the sensor nucleotide sequence is reverse complementary to the 5′ UTR of the target RNA. In certain embodiments, the sensor nucleotide sequence is reverse complementary to the coding sequence of the target RNA. In some embodiments, the sensor nucleotide sequence is reverse complementary to an exon of the target RNA. In some embodiments, the sensor nucleotide sequence is reverse complementary to an intron of the target RNA. In some embodiments, the sensor nucleotide sequence is reverse complementary to two separate non-contiguous regions of the same target RNA. For instance, the sensor nucleotide sequence may be reverse complementary to two separate regions of the 5′ UTR of the target RNA, to two separate regions of the coding sequence of the target RNA, to two separate regions of the 5′ UTR of the target RNA, to a region in the 5′ UTR and a region in the coding sequence of the target RNA, to a region in the coding sequence and a region in the 3′ UTR of the target RNA, or to a region in the 5′ UTR and a region in the 3′ UTR of the target RNA. In some embodiments, the sensor RNA is reverse complimentary to two or more distinct target RNAs.
Sensor RNAs that have sensor nucleotide sequences that are reverse complimentary to the 3′ or 5′ UTR have certain advantages relative to sensor RNAs that are reverse complementary to coding sequences (CDS), such as those disclosed in International Application PCT/US2022/033459. First, ADAR editing is more efficient in the UTR when compared CDS because translating ribosomes may destabilize dsRNA. The increased efficiency is shown in
The sensor nucleotide sequence of the present disclosure may be any length determined necessary for sufficient specificity to the target RNA. For instance the sensor nucleotide sequence could be less than about 50 nucleotides, from about 50 to 60, about 60 to 70, about 70 to 80, about 80 to 90, about 90 to 100, about 100 to 110, about 110 to 120, about 120 to 130, about 130 to 140, about 140 to 150, about 150 to 160, about 160 to 170, about 170 to 180, about 180 to 190, about 190 to 200, about 200 to 210, about 210 to 220, about 220 to 230, about 230 to 240, about 240 to 250, about 250 to 260, about 260 to 270, about 270 to 280, about 280 to 290, about 290 to 300, about 300 to 310, about 310 to 320, about 320 to 330, about 330 to 340, about 340 to 350, about 350 to 360, about 360 to 370, about 370 to 380, about 380 to 390, about 390 to 400, about 400 to 410, about 410 to 420, about 420 to 430, about 430 to 440, about 440 to 450, about 450 to 460, about 460 to 470, about 470 to 480, about 480 to 490, about 490 to 500 or greater than 500 nucleotides in length.
When the sensor nucleotide sequence is reverse complementary to two non-contiguous regions within a target RNA, the distance between the two non-contiguous regions of the target may be any length. For instance, the distance between the two non-contiguous regions of the target may be less than about 50 nucleotides, from about 50 to 60, about 60 to 70, about 70 to 80, about 80 to 90, about 90 to 100, about 100 to 150, about 150 to 200, about 200 to 250, about 250 to 300, about 300 to 350, about 350 to 400, about 400 to 450, about 450 to 500 or greater than 500 nucleotides.
When the sensor nucleotide sequence is reverse complimentary to two non-contiguous regions within the target RNA, the nucleotide sequence of the sensor nucleotide that is reverse complementary to the first region of the two non-contiguous regions with the target RNA may be any length. For instance, the nucleotide sequence of the sensor nucleotide that is reverse complementary to the first region of the two non-contiguous regions may be less than about 20 nucleotides, from about 20 to 30, about 30 to 40, about 40 to 50, about 50 to 60, about 60 to 70, about 70 to 80, about 80 to 90, about 90 to 100, about 100 to 110, about 110 to 120, about 120 to 130, about 130 to 140, about 140 to 150, about 150 to 160, about 160 to 170, about 170 to 180, about 180 to 190, about 190 to 200, about 200 to 210, about 210 to 220, about 220 to 230, about 230 to 240, about 240 to 250, about 250 to 260, about 260 to 270, about 270 to 280, about 280 to 290, about 290 to 300, about 300 to 310, about 310 to 320, about 320 to 330, about 330 to 340, about 340 to 350, about 350 to 360, about 360 to 370, about 370 to 380, about 380 to 390, about 390 to 400, about 400 to 410, about 410 to 420, about 420 to 430, about 430 to 440, about 440 to 450, about 450 to 460, about 460 to 470, about 470 to 480, about 480 to 490, about 490 to 500 or greater than 500 nucleotides in length.
When the sensor nucleotide sequence is reverse complimentary to two non-contiguous regions with the target RNA, the nucleotide sequence of the sensor nucleotide that is reverse complementary to the second region of the two non-contiguous regions with the target RNA may be any length. For instance, the nucleotide sequence of the sensor nucleotide that is reverse complementary to the second region of the two non-contiguous regions may be less than about 20 nucleotides, from about 20 to 30, about 30 to 40, about 40 to 50, about 50 to 60, about 60 to 70, about 70 to 80, about 80 to 90, about 90 to 100, about 100 to 110, about 110 to 120, about 120 to 130, about 130 to 140, about 140 to 150, about 150 to 160, about 160 to 170, about 170 to 180, about 180 to 190, about 190 to 200, about 200 to 210, about 210 to 220, about 220 to 230, about 230 to 240, about 240 to 250, about 250 to 260, about 260 to 270, about 270 to 280, about 280 to 290, about 290 to 300, about 300 to 310, about 310 to 320, about 320 to 330, about 330 to 340, about 340 to 350, about 350 to 360, about 360 to 370, about 370 to 380, about 380 to 390, about 390 to 400, about 400 to 410, about 410 to 420, about 420 to 430, about 430 to 440, about 440 to 450, about 450 to 460, about 460 to 470, about 470 to 480, about 480 to 490, about 490 to 500 or greater than 500 nucleotides in length.
The sensor nucleotide sequence or the stem-loops of the present disclosure may include any stop or start codon including an adenosine residue. For example, the stop codon of the sensor nucleotide sequence may be UAG, UAA, or UGA. In general, the stop codons of the present disclosure are in-frame with the coding sequence of the output protein such that the output protein is produced when the stop codon is edited.
The output protein of the present disclosure may be any output protein desired. Examples of the output protein of the present disclosure include, without limitation, a fluorescent protein, a genomic modification protein, a transcription factor, a killing factor, a toxin, an antigen, a T cell receptor, an enzyme, a therapeutic protein, a cytokine, a chemokine, a growth factor, a signaling peptide, a chimeric antigen receptor (CAR), etc. The output proteins may be secreted, transmembrane or membrane-tethered. When output proteins are to be trafficked to specific locations within the biological sample then the coding sequence of the output protein is preceded by a nucleotide sequence encoding the appropriate signal peptide such as those described in Owji et al. (Eur J Cell Biol. 2018 August; 97(6):422-441).
When the output protein is a genomic modification protein, the genomic modification proteins may include, without limitation, CRE recombinase or variants thereof, meganucleases or variants thereof, Zinc-finger nucleases or variants thereof, CRISPR/Cas-9 nuclease or variants thereof, a modified Cas9 nickase fused to a reverse-transcriptase (i.e., genomic modification protein used in prime editing), TAL effector nucleases or variants thereof, etc. Methods of prime editing are known in the art and have been described in, for example, Scholefield et al (Gene Ther. 2021 August; 28(7-8):396-401) which is specifically incorporated by reference herein.
When the output protein is a transcription factor, the transcription factor may include, without limitation, jun, fos, max, mad, serum response factor (SRF), AP-1, AP2, myb, MyoD, myogenin, ETS-box containing proteins, TFE3, E2F, ATF1, ATF2, ATF3, ATF4, ZF5, NFAT, CREB, 5 HNF4, C/EBP, SP1, CCAAT-box binding proteins, interferon regulation factor (IRF-1), Wilms tumor protein, ETS-binding protein, STAT, GATA-box binding proteins, e.g., GAT A-3, and the forkhead family of winged helix proteins.
When the output protein is a killing factor, the killing factor may include, without limitation, tumor necrosis factor alpha (TNFa), Fas ligand (FasL), a caspase such as caspase 1, caspase 2, caspase 3, caspase 4, caspase 5, caspase 6, caspase 7, caspase 8, caspase 9, caspase 10, caspase 11, caspase 12, caspase 13 or a variant thereof, etc.
When the output protein is a therapeutic protein, the therapeutic protein of may include, without limitation, hormones and growth and differentiation factors including, without limitation, insulin, glucagon, growth hormone (GH), parathyroid hormone (PTH), growth hormone releasing factor (GHRF), follicle stimulating hormone (FSH), luteinizing hormone (LH), human chorionic gonadotropin (hCG), vascular endothelial growth factor (VEGF), angioproteinetins, angiostatin, granulocyte colony stimulating factor (GCSF), erythroproteinetin (EPO), connective tissue growth factor (CTGF), basic fibroblast growth factor (bFGF), acidic fibroblast growth factor (aFGF), epidermal growth factor (EGF), transforming growth factor .alpha. (TGFa), platelet-derived growth factor (PDGF), insulin growth factors I and II (IGF-1 and IGF-11), any one of the transforming growth factor 13-superfamily, including TGFI3, activins, inhibins, or any of the bone morphogenic proteins (BMP) including BMPs 1-15, any one of the heregluin/neuregulin/ARIA/neu differentiation factor (NDF) family of growth factors, nerve growth factor (NGF), brain-derived neurotrophic factor (BDNF), neurotrophins NT-3 and NT-4/5, ciliary neurotrophic factor (CNTF), glial cell line derived neurotrophic factor (GDNF), neurturin, agrin, any one of the family of semaphorins/collapsins, netrin-1 and netrin-2, hepatocyte growth factor (HGF), ephrins, noggin, sonic hedgehog and tyrosine hydroxylase.
When the output protein is a cytokine, the cytokine may include, without limitation IL-1-like, IL-1α, IL-1β, IL-1RA, IL-18, CD132, IL-2, IL-4, IL-7, IL-9, IL-13, CD1243, 132, IL-15, CD131, IL-3, IL-5, GM-CSF, IL-6-like, IL-6, IL-11, G-CSF, IL-12, LIF, OSM, IL-10-like, IL-10, IL-20, IL-14, IL-16, IL-17, IFN-α, IFN-β, IFN-γ, CD154, LT-β, TNF-α, TNF-β, 4-1BBL, APRIL, CD70, CD153, CD178, GITRL, LIGHT, OX40L, TALL-1, TRAIL, TWEAK, TRANCE, TGF-β1, TGF-β2, TGF-β3, Epo, Tpo, Flt-3L, SCF, M-CSF, MSP, etc.
When the output protein is a chemokine, the chemokine may include, without limitation XCL1, XCL2, CCL1, CCL2, CCL3, CCL4, CCL5, CCL7, CCL8, CCL11, CCL13, CCL14, CCL15, CCL16, CCL17, CCL18, CCL19, CCL20, CCL21, CCL22, CCL23, CCL24, CCL25, CCL26, CCL27, CXCL1, CXCL2, CXCL3, CXCL4, CXCL5, CXCL6, CXCL7, CXCL8, CXCL9, CXCL10, CXCL11, CXCL12, CXCL13, CXCL14, CX3CL1, etc.
In certain embodiments, when the output polypeptide is a CAR, the extracellular binding domain of the CAR has a single chain antibody. The single-chain antibody may be a monoclonal single-chain antibody, a chimeric single-chain antibody, a humanized single-chain antibody, a fully human single-chain antibody, and/or the like. In one non-limiting example, the single chain antibody is a single chain variable fragment (scFv). Suitable CAR extracellular binding domains include those described in Labanieh et al. (2018 Nature Biomedical Engineering 2:377-391) which is specifically incorporated by reference herein. In some embodiments, the extracellular binding domain of the CAR is a single-chain version (e.g., an scFv version) of an antibody approved by the United States Food and Drug Administration and/or the European Medicines Agency (EMA) for use as a therapeutic antibody, e.g., for inducing antibody-dependent cellular cytotoxicity (ADCC) of certain disease-associated cells in a patient, etc. Non-limiting examples of single-chain antibodies which may be employed when the protein of interest is a CAR include single-chain versions (e.g., scFv versions) of Adecatumumab, Ascrinvacumab, Cixutumumab, Conatumumab, Daratumumab, Drozitumab, Duligotumab, Durvalumab, Dusigitumab, Enfortumab, Enoticumab, Figitumumab, Ganitumab, Glembatumumab, Intetumumab, Ipilimumab, Iratumumab, Icrucumab, Lexatumumab, Lucatumumab, Mapatumumab, Narnatumab, Necitumumab, Nesvacumab, Ofatumumab, Olaratumab, Panitumumab, Patritumab, Pritumumab, Radretumab, Ramucirumab, Rilotumumab, Robatumumab, Seribantumab, Tarextumab, Teprotumumab, Tovetumab, Vantictumab, Vesencumab, Votumumab, Zalutumumab, Flanvotumab, Altumomab, Anatumomab, Arcitumomab, Bectumomab, Blinatumomab, Detumomab, Ibritumomab, Minretumomab, Mitumomab, Moxetumomab, Naptumomab, Nofetumomab, Pemtumomab, Pintumomab, Racotumomab, Satumomab, Solitomab, Taplitumomab, Tenatumomab, Tositumomab, Tremelimumab, Abagovomab, Igovomab, Oregovomab, Capromab, Edrecolomab, Nacolomab, Amatuximab, Bavituximab, Brentuximab, Cetuximab, Derlotuximab, Dinutuximab, Ensituximab, Futuximab, Girentuximab, Indatuximab, Isatuximab, Margetuximab, Rituximab, Siltuximab, Ublituximab, Ecromeximab, Abituzumab, Alemtuzumab, Bevacizumab, Bivatuzumab, Brontictuzumab, Cantuzumab, Cantuzumab, Citatuzumab, Clivatuzumab, Dacetuzumab, Demcizumab, Dalotuzumab, Denintuzumab, Elotuzumab, Emactuzumab, Emibetuzumab, Enoblituzumab, Etaracizumab, Farletuzumab, Ficlatuzumab, Gemtuzumab, Imgatuzumab, Inotuzumab, Labetuzumab, Lifastuzumab, Lintuzumab, Lorvotuzumab, Lumretuzumab, Matuzumab, Milatuzumab, Nimotuzumab, Obinutuzumab, Ocaratuzumab, Otlertuzumab, Onartuzumab, Oportuzumab, Parsatuzumab, Pertuzumab, Pinatuzumab, Polatuzumab, Sibrotuzumab, Simtuzumab, Tacatuzumab, Tigatuzumab, Trastuzumab, Tucotuzumab, Vandortuzumab, Vanucizumab, Veltuzumab, Vorsetuzumab, Sofituzumab, Catumaxomab, Ertumaxomab, Depatuxizumab, Ontuxizumab, Blontuvetmab, Tamtuvetmab, or an antigen-binding variant thereof.
The output protein may further include a tag to be used to detect the protein following its production. For instance, the tag may include, without limitation, a fluorescent protein, e.g., green fluorescent protein (GFP), YFP, RFP, CFP, mCherry, tdTomato, and the like; a histidine tag, e.g., a 6×His tag; a hemagglutinin (HA) tag; a FLAG tag; a Myc tag; and the like.
In some embodiments, the detecting is quantitative or qualitative. For instance, the detecting of the target RNA may be correlated with the quantity of the output protein produced. In some embodiments, the quantity of the output protein relative to the quantitative of target RNA may be linear. In some embodiments, the quantity of the output protein relative to the quantitative of target RNA may be logarithmic. The methods of the present disclosure are capable of quantitatively detecting changes in the expression of specific genes through the detection of the target RNA. For instance, the methods are capable of detecting about a 2 fold, 3 fold, 4 fold, 5 fold, 6 fold, 7 fold, 8 fold, 9 fold, 10 fold, 15 fold, 20 fold, 30 fold, 40 fold, 50 fold, 60 fold, 70 fold, 80 fold, 90 fold, 100 fold, 200 fold, 300 fold, 400 fold, 500 fold, 600 fold, 700 fold, 800 fold, 900 fold, 1000 fold, 5000 fold, 10,000 fold, 50,000 fold, 100,000 fold, or greater than a 100,000 fold change in the target RNA.
Aspects of this disclosure include assaying for the presence of the output protein in a biological sample. In some embodiments, the assaying for the output protein may contain using immunoblotting. In some embodiments, the assaying contains using microscopy. When the assaying contains microscopy the output protein may be conjugated to a fluorescent or luminescent protein or the output protein may be a fluorescent or luminescent protein. In some embodiments, the assaying for the presence of the output protein contains using flow cytometry. When the assaying includes flow cytometry, fluorescence activating cell sorting may be used.
The methods of the present disclosure also contain combining the biological sample with the sensor RNA. The combining can be done using any convenient method known in the art. In some embodiments, the combining includes transfecting the biological sample with a recombinant vector containing the sensor RNA. When the biological sample is transfected with the recombinant vector, the recombinant vector includes, without limitation, a plasmid, a viral vector, a cosmid an artificial chromosome, etc. In some embodiments, the combining contains contacting the biological sample with a lipid nanoparticle containing the sensor RNA. Lipid nanoparticles has been described in the art such as Hou et al. (Nat Rev Mater. 2021; 6(12):1078-1094).
When transfection of a biological sample such as a cell is desired, vectors, such as plasmids viral vectors, cosmids or artificial chromosomes, may be employed to engineer the cell to express the sensor RNA, as desired. Protocols of interest include those described in published PCT application WO1999/041258, the disclosure of which protocols are herein incorporated by reference.
Depending on the nature of the cell and/or expression construct, protocols of interest may include electroporation, particle gun technology, calcium phosphate precipitation, direct microinjection, viral infection and the like. The choice of method is generally dependent on the type of cell being transformed and the circumstances under which the transformation is taking place (i.e., in vitro, ex vivo, or in vivo). A general discussion of these methods can be found in Ausubel, et al, Short Protocols in Molecular Biology, 3rd ed., Wiley & Sons, 1995. In some embodiments, lipofectamine and calcium mediated gene transfer technologies are used. After the subject nucleic acids have been introduced into a cell, the cell may be incubated, normally at 37° C., sometimes under selection, for a period of about 1-24 hours in order to allow for the expression of the sensor RNA. In mammalian target cells, a number of viral-based expression systems may be utilized to express the sensor RNA(s). In cases where an adenovirus is used as an expression vector, the sensor RNA sequence of interest may be ligated to an adenovirus transcription/translation control complex, e.g., the late promoter and tripartite leader sequence. This chimeric gene may then be inserted in the adenovirus genome by in vitro or in vivo recombination. Insertion in a non-essential region of the viral genome (e.g., region E1 or E3) will result in a recombinant virus that is viable and capable of expressing the chimeric protein in infected hosts. (e.g., see Logan & Shenk, Proc. Natl. Acad. Sci. USA 81:355-359 (1984)). The efficiency of expression may be enhanced by the inclusion of appropriate transcription enhancer elements, transcription terminators, etc. (see Bittner et al., Methods in Enzymol. 153:51-544 (1987)).
In some embodiments, the viral vector is a recombinant adeno-associated virus (AAV) vector. AAV vectors are DNA viruses of relatively small size that can integrate, in a stable and site specific manner, into the genome of the cells that they infect. They are able to infect a wide spectrum of cells without inducing any effects on cellular growth, morphology or differentiation, and they do not appear to be involved in human pathologies. The AAV genome has been cloned, sequenced and characterized. It encompasses approximately 4700 bases and contains an inverted terminal repeat (ITR) region of approximately 145 bases at each end, which serves as an origin of replication for the virus. The remainder of the genome is divided into two essential regions that carry the encapsidation functions: the left-hand part of the genome, that contains the rep gene involved in viral replication and expression of the viral genes; and the right-hand part of the genome, that contains the cap gene encoding the capsid proteins of the virus.
The application of AAV as a vector for gene therapy has been rapidly developed in recent years. Wild-type AAV can infect, with a comparatively high titer, dividing or non-dividing cells, or tissues of mammal, including human, and also can integrate into in human cells at specific site (on the long arm of chromosome 19) (Kotin et al, Proc. Natl. Acad. Sci. U.S.A., 1990. 87: 2211-2215; Samulski et al, EMBO J., 1991. 10: 3941-3950 the disclosures of which are hereby incorporated by reference herein in their entireties). AAV vector without the rep and cap genes loses specificity of site-specific integration, but may still mediate long-term stable expression of exogenous genes. AAV vector exists in cells in two forms, wherein one is episomic outside of the chromosome; another is integrated into the chromosome, with the former as the major form. Moreover, AAV has not been found to be associated with any human disease, nor any change of biological characteristics arising from the integration has been observed. There are sixteen serotypes of AAV reported in literature, respectively named AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, AAV13, AAV14, AAV15, and AAV16, wherein AAV5 is originally isolated from humans (Bantel-Schaal, and H. zur Hausen. Virology, 1984. 134: 52-63), while AAV1-4 and AAV6 are all found in the study of adenovirus (Ursula Bantel-Schaal, Hajo Delius and Harald zur Hausen. J. Viral., 1999. 73: 939-947).
AAV vectors may be prepared using any convenient methods. Adeno-associated viruses of any serotype are suitable (See, e.g., Blacklow, pp. 165-174 of “Parvoviruses and Human Disease” J. R. Pattison, ed. (1988); Rose, Comprehensive Virology 3:1, 1974; P. Tattersall “The Evolution of Parvovirus Taxonomy” In Parvoviruses (J R Kerr, S F Cotmore. M E Bloom, RMLinden, C RParrish, Eds.) p 5-14, Rudder Arnold, London, U K (2006); and D E Bowles, J E Rabinowitz, R J Samulski “The Genus Dependovirus” (J R Kerr, S F Cotmore. M E Bloom, R M Linden, C R Parrish, Eds.) p 15-23, Rudder Arnold, London, UK (2006), the disclosures of which are hereby incorporated by reference herein in their entireties). Methods for purifying for vectors may be found in, for example, U.S. Pat. Nos. 6,566,118, 6,989,264, and 6,995,006 and WO/1999/011764 titled “Methods for Generating High Titer Helper-free Preparation of Recombinant AAV Vectors”, the disclosures of which are herein incorporated by reference in their entirety. Preparation of hybrid vectors is described in, for example, PCT Application No. PCTIUS2005/027091, the disclosure of which is herein incorporated by reference in its entirety. The use of viral vectors derived from the AAVs for transferring genes in vitro and in vivo has been described (See e.g., International Patent Application Publication Nos: 91/18088 and WO 93/09239; U.S. Pat. Nos. 4,797,368, 6,596,535, and 5,139,941; and European Patent No: 0488528, all of which are herein incorporated by reference in their entirety). These publications describe various AAV-derived constructs in which the rep and/or cap genes are deleted and replaced by a gene of interest, and the use of these constructs for transferring the gene of interest in vitro (into cultured cells) or in vivo (directly into an organism). The replication defective recombinant AAVs according to the invention can be prepared by co-transfecting a plasmid containing the nucleic acid sequence of interest flanked by two AAV inverted terminal repeat (ITR) regions, and a plasmid carrying the AAV encapsidation genes (rep and cap genes), into a cell line that is infected with a human helper virus (for example an adenovirus). The AAV recombinants that are produced are then purified by standard techniques.
In some embodiments, the vector(s) for use in the methods of the invention are encapsidated into a virus particle (e.g., AAV virus particle including, but not limited to, AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV10, AAV11, AAV12, AAV13, AAV14, AAV15, and AAV16). Accordingly, the invention includes a recombinant virus particle (recombinant because it contains a recombinant polynucleotide) comprising any of the vectors described herein. Methods of producing such particles are known in the art and are described in U.S. Pat. No. 6,596,535.
When the biological sample is transfected with a recombinant vector including the sensor RNA, the sensor RNA is operably linked to a promoter. Suitable promoters of the present disclosure include, without limitation, a SFFV promoter, a hEFla, a CMV promotor or a variant thereof, an inducible promoter, a CMV-tetO promoter, a tissue or cell specific promoter, etc.
In some aspects of the present disclosure, the sensor RNA includes one or more MS2 hairpins. In some embodiments, the sensor RNA includes more than one MS2 hairpin. For example, the sensor RNA may include two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, ten more, or more than ten. In some aspects, the sensor RNA include one or more TAR RNA elements. For example, the sensor RNA may include two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, ten more, or more than ten. In some aspects, the sensor RNA include one or more BoxB stem-loop. For example, the sensor RNA may include two or more, three or more, four or more, five or more, six or more, seven or more, eight or more, nine or more, ten more, or more than ten. In some aspects, the sensor RNA includes MS2 hairpins and BoxB stem loops, MS2 hairpins and TAR RNA elements, or BoxB stem-loops and TAR RNA elements.
In some embodiments, the method of detecting a target RNA further contains combining the biological sample with an ADAR protein or a coding sequence thereof. The ADAR protein may be any ADAR protein from any species. For instance, the ADAR protein may include without limitation, an ADAR (ADAR1), an ADAR p110, an ADAR p150, an ADAR2, an engineered ADAR protein such as a protein containing a deaminase domain of ADAR2 or a variant thereof and a MS2 RNA binding protein (MCP), an engineered ADAR protein that lacks a nuclear localization sequence, an engineered ADAR protein containing a nuclear export sequence, an engineered ADAR protein containing one or more dsRNA binding domains from one or more distinct ADAR proteins, an engineered ADAR protein containing a TAR RNA binding protein, an engineered ADAR protein containing a Lambda N peptide, a split engineered ADAR protein wherein the N and C terminus of the deaminase domain are produced separately and the two halves binding to one another in the presence of the target RNA, etc. Suitable engineered ADAR proteins have been described in Katrekar et al. (Nat Methods. 2019 March; 16(3):239-242.), Biswas et al. (iScience. 2020 Jul. 24; 23(7):101318), Matthews et al. (Nat Struct Mol Biol. 2016 May; 23(5):426-33), Cox et al. (Science. 2017 Nov. 24; 358(6366):1019-1027) or Kuttan et al. (Proc Natl Acad Sci USA. 2012 Nov. 27; 109(48):E3295-304). Split engineered ADAR proteins are described in Katrekar et al. (Elife. 2022 Jan. 19; 11:e75555). When the sensor RNA contains a start codon in place of a stop codon, a particular ADAR protein may be used. In some embodiments, the ADAR protein is ADAR2 when the sensor RNA contains a start codon in place of a stop codon.
In some embodiments, RNA editing proteins other than ADARs are used. For instance, proteins of the apolipoprotein B mRNA editing enzyme, catalytic polypeptide-like (APOBEC) family may be used. Examples of suitable APOBEC proteins include, without limitation, APOBEC1, APOBEC2, APOBEC3A, APOBEC3B, APOBEC3C, APOBEC3D, APOBEC3F, APOBEC3G, APOBEC3H, APOBEC4, etc.
In some embodiments, the sensor RNA further contains a nucleotide sequence containing a cleavage domain followed by a nucleotide sequence encoding any of the ADAR proteins described above wherein the nucleotide sequence containing the cleavage domain is after the nucleotide sequence encoding the output protein. In some embodiments, an ADAR protein is used instead of a marker protein as the first nucleotide sequence.
In some embodiments, the sensor RNA further contains a nucleotide sequence encoding a second sensor nucleotide sequence that is reverse complementary to a second target RNA wherein the sensor nucleotide sequence contains a second stop codon wherein the sequences of the first and second target RNAs are different. In some embodiments, the stop codon that contains at least 1 base that is mismatched with the second target RNA sequence. In some embodiments, the sensor RNA further contains a nucleotide sequence encoding a second sensor nucleotide sequence that is reverse complementary to a second target RNA wherein the sensor nucleotide sequence contains a start codon. In some embodiments, the sensor RNA further contains a nucleotide sequence encoding a second sensor nucleotide sequence that is reverse complementary to a second target RNA wherein the sensor nucleotide sequence contains a non-start codon that can be edited to a start codon. In some embodiments, the stop, start or non-start codon contains at least 1 base that is mismatched with the second target RNA sequence. In some embodiments, the stop, start or non-start codon is contained with a stem-loop sequence contained in the second sensor nucleotide sequence. In some embodiments, the biological sample is combined with two or more sensor RNAs that detect two or more distinct target RNAs.
In some embodiments, the method of detecting a target RNA further contains combining the biological sample with a protein that specifically localizes the sensor RNA to the location of the target RNA. For examples, a protein that specifically localizes the sensor RNA to the location of the target RNA may be a dCas9 or a dCas13 protein that has a guide RNA directed to the genomic locus corresponding to the target RNA (in the case of dCas9) or the target RNA directly (in the case of dCas13). In some embodiments, the dCas9 or dCas13 is engineered to be linked to a MCP, a TAR RNA binding protein or a Lambda N peptide.
As summarized above, methods are provided for expressing a protein in a target cell, the methods including combining a cell with a sensor RNA as described above, wherein the target RNA is present in the target cell.
The target RNA to which the sensor RNA hybridizes is, in some instances, determined by the target cell. In some embodiments, the target cell is a cell that is in a particular disease state. In these instances, the target cell includes a target RNA that is specific to the disease state or is in a higher abundance in cells that are in a particular disease state such as a cancerous cell. The cell may be in any disease state. In some embodiments, the target cell is a particular cell type. In these instances, the target cell includes a target RNA that is specific to the cell type or is in a higher abundance in cells that are a particular cell type. The cell may be any cell type.
Cells of any origin are candidate cells for combining with a sensor RNA of the present disclosure. Non-limiting examples of candidate cell types include connective tissue elements such as fibroblast, skeletal tissue (bone and cartilage), skeletal, cardiac and smooth muscle, epithelial tissues (e.g., liver, lung, breast, skin, bladder and kidney), neural cells (glia and neurons), endocrine cells (adrenal, pituitary, pancreatic islet cells), bone marrow cells, melanocytes, and many different types of hematopoetic cells. Suitable cells can also be cells representative of a specific body tissue from a subject. The types of body tissues include, but are not limited, to blood, muscle, nerve, brain, heart, lung, liver, pancreas, spleen, thymus, esophagus, stomach, intestine, kidney, testis, ovary, hair, skin, bone, breast, uterus, bladder, spinal cord and various kinds of body fluids.
Cells suitable for use in a subject method include cells of a variety of subject hosts. Generally, such subject hosts are “mammals” or “mammalian”, where these terms are used broadly to describe organisms which are within the class mammalia, including the orders carnivore (e.g., dogs and cats), rodentia (e.g., mice, guinea pigs and rats), and primates (e.g., humans, chimpanzees and monkeys). In many aspects, the subject host will be a human. In certain embodiments, the subject host is a plant.
In certain embodiments, the method for expressing a target protein in a target cell further includes combining the biological sample with an ADAR protein or a coding sequence thereof. The ADAR protein may be any ADAR protein from any species. For instance, the ADAR protein may include without limitation, an ADAR (ADAR1), an ADAR p110, an ADAR p150, an ADAR2, an engineered ADAR protein such as a protein including a deaminase domain of ADAR2 or a variant thereof and a MS2 RNA binding protein MCP, etc. Suitable engineered ADAR proteins have been described in Katrekar et al. (Nat Methods. 2019 March; 16(3):239-242.), Biswas et al. (iScience. 2020 Jul. 24; 23(7):101318), Matthews et al. (Nat Struct Mol Biol. 2016 May; 23(5):426-33), Cox et al. (Science. 2017 Nov. 24; 358(6366):1019-1027) or Kuttan et al. (Proc Natl Acad Sci USA. 2012 Nov. 27; 109(48):E3295-304).
The methods for expressing a protein in a target cell include combining the target cell with the sensor RNA. The combining can be done using any convenient method known in the art. In some embodiments, the combining includes transfecting the biological sample with a recombinant vector including the sensor RNA. When the biological sample is transfected with the recombinant vector, the recombinant vector includes, without limitation, a plasmid, a viral vector, a cosmid an artificial chromosome, etc. In some embodiments, the combining includes contacting the biological sample with a lipid nanoparticle including the sensor RNA. Lipid nanoparticles has been described in the art such as Hou et al. (Nat Rev Mater. 2021; 6(12):1078-1094).
When transfection of a biological sample such as a cell is desired, vectors, such as plasmids viral vectors, cosmids or artificial chromosomes, may be employed to engineer the cell to express the sensor RNA, as desired.
The protein expressed in the target cell is the output protein encoded by the sensor RNA. The output protein of the sensor RNA may be any of the output proteins described above. In some embodiments, the output protein treats the disease or condition associated with the target RNA in the target cell.
In some embodiments, the methods of the present disclosure can be used to produce a target protein in the absences of a cell. In these embodiments, a cell-free system includes the biological sample, the sensor RNA and the ADAR protein. The biological sample may include any target RNA. For example, the biological sample may be a sample including viral matter such as viral RNA wherein detection of the viral RNA leads to production of the output protein. Suitable cell-free systems include those described by Kuruma et al. (Nat Protoc. 2015 September; 10(9):1328-44) and Lavickova et al. (ACS Synth Biol. 2019 Feb. 15; 8(2):455-462).
Methods for expressing a protein in a target cell may also be used to treat an individual for a disease or a condition. In the methods disclosed herein, the protein for expression in a target cell may promote the survival of the target cell or may promote the death of the cell. For instance, if the disease or condition is associated with a cell that is infected by a pathogen or a cancer cell then it may be desirable for the promotion of the death of such cells. In embodiments in which the death of the target cell is desired, output protein encoded by the sensor RNA may be any output protein that promotes the death of the cell. Output protein that promote the death of the cell include, without limitation, a toxin, tumor necrosis factor alpha (TNFa), Fas ligand (FasL), a caspase such as caspase 1, caspase 2, caspase 3, caspase 4, caspase 5, caspase 6, caspase 7, caspase 8, caspase 9, caspase 10, caspase 11, caspase 12, caspase 13 or a variant thereof, etc.
In addition, if the disease or condition is associated with a cell that is infected by a pathogen or a cancer cell it may also be desirable to activate an immune cell to target the infected cell or cancer cell. Immune cells generally include white blood cells (leukocytes) which are derived from hematopoietic stem cells (HSC) produced in the bone marrow. Immune cells also include, e.g., lymphocytes (T cells, B cells, natural killer (NK) cells) and myeloid-derived cells (neutrophil, eosinophil, basophil, monocyte, macrophage, dendritic cells). T cells include all types of immune cells expressing CD3 including T-helper cells (CD4+ cells), cytotoxic T-cells (CD8+ cells), T-regulatory cells (Treg) and gamma-delta T cells. Cytotoxic cells include CD8+ T cells, natural-killer (NK) cells, and neutrophils, which cells are capable of mediating cytotoxicity responses.
In embodiments in which the activation of immune cell is desired, the target RNA that the sensor RNA is directed to may be a target RNA that is specifically expressed in an immune cell. In embodiments in which the activation of immune cell is desired, the sensor RNA may contain a sequence that encodes an output protein that activates or modulates the activity of the immune cell. Non-limiting examples of output proteins that activate immune cells include a chimeric antigen receptor, such as those described above, or a cytokine such as IL-1-like, IL-1α, IL-10, IL-1RA, IL-18, CD132, IL-2, IL-4, IL-7, IL-9, IL-13, CD1243, 132, IL-15, CD131, IL-3, IL-5, GM-CSF, IL-6-like, IL-6, IL-11, G-CSF, IL-12, LIF, OSM, IL-10-like, IL-10, IL-20, IL-14, IL-16, IL-17, IFN-α, IFN-β, IFN-γ, CD154, LT-β, TNF-α, TNF-β, 4-1BBL, APRIL, CD70, CD153, CD178, GITRL, LIGHT, OX40L, TALL-1, TRAIL, TWEAK, TRANCE, TGF-β1, TGF-β2, TGF-β3, Epo, Tpo, Flt-3L, SCF, M-CSF, MSP, etc.
If the disease or condition is associated with the expression of a non-functional protein, a reduced functioning protein or a protein that has an aberrant activity in a disease state relative to a non-disease state then it may be desirable to have a sensor RNA that is targeted to the diseased cells where, upon contact with the diseased cell that contains the target RNA, the cell produces the output protein where the output protein is a fully functional form of the protein that is non-functioning, has reduced functionality or has aberrant functions.
If the disease or condition is associated with the degradation of a tissue it may be desirable to promote the growth or regrowth of said tissue. In embodiments in which the disease or condition is associated with tissue degradation it may be desirable to have a sensor RNA that is targeted to the diseased cells where, upon contact with the diseased cell that contains the target RNA, the cell produces the output protein that promotes the growth or regrowth of the tissue. Non-limiting examples of output proteins that promote the growth or regrowth of the tissue include hormones and growth and differentiation factors including, without limitation, insulin, glucagon, growth hormone (GH), parathyroid hormone (PTH), growth hormone releasing factor (GHRF), follicle stimulating hormone (FSH), luteinizing hormone (LH), human chorionic gonadotropin (hCG), vascular endothelial growth factor (VEGF), angioproteinetins, angiostatin, granulocyte colony stimulating factor (GCSF), erythroproteinetin (EPO), connective tissue growth factor (CTGF), basic fibroblast growth factor (bFGF), acidic fibroblast growth factor (aFGF), epidermal growth factor (EGF), transforming growth factor .alpha. (TGFa), platelet-derived growth factor (PDGF), insulin growth factors I and II (IGF-1 and IGF-11), any one of the transforming growth factor 13-superfamily, including TGFI3, activins, inhibins, or any of the bone morphogenic proteins (BMP) including BMPs 1-15, any one of the heregluin/neuregulin/ARIA/neu differentiation factor (NDF) family of growth factors, nerve growth factor (NGF), brain-derived neurotrophic factor (BDNF), neurotrophins NT-3 and NT-4/5, ciliary neurotrophic factor (CNTF), glial cell line derived neurotrophic factor (GDNF), neurturin, agrin, any one of the family of semaphorins/collapsins, netrin-1 and netrin-2, hepatocyte growth factor (HGF), ephrins, noggin, sonic hedgehog and tyrosine hydroxylase.
Given the diversity of cellular activities that may be modulated through the use of the subject sensor RNA, the instant methods of treatment may be utilized for a variety of applications. As non-limiting examples, the instant methods may find use in a treatment directed to a variety of diseases including but not limited to e.g., Acanthamoeba infection, Acinetobacter infection, Adenovirus infection, ADHD (Attention Deficit/Hyperactivity Disorder), AIDS (Acquired Immune Deficiency Syndrome), ALS (Amyotrophic Lateral Sclerosis), Alzheimer's Disease, Amebiasis, Intestinal (Entamoeba histolytica infection), Anaplasmosis, Human, Anemia, Angiostrongylus Infection, Animal-Related Diseases, Anisakis Infection (Anisakiasis), Anthrax, Aortic Aneurysm, Aortic Dissection, Arenavirus Infection, Arthritis (e.g., Childhood Arthritis, Fibromyalgia, Gout, Lupus (SLE) (Systemic lupus erythematosus), Osteoarthritis, Rheumatoid Arthritis, etc.), Ascaris Infection (Ascariasis), Aspergillus Infection (Aspergillosis), Asthma, Attention Deficit/Hyperactivity Disorder, Autism, Avian Influenza, B virus Infection (Herpes B virus), B. cepacia infection (Burkholderia cepacia Infection), Babesiosis (Babesia Infection), Bacterial Meningitis, Bacterial Vaginosis (BV), Balamuthia infection (Balamuthia mandrillaris infection), Balamuthia mandrillaris infection, Balantidiasis, Balantidium Infection (Balantidiasis), Baylisascaris Infection, Bilharzia, Birth Defects, Black Lung (Coal Workers' Pneumoconioses), Blastocystis hominis Infection, Blastocystis Infection, Blastomycosis, Bleeding Disorders, Blood Disorders, Body Lice (Pediculus humanus corporis), Borrelia burgdorferi Infection, Botulism (Clostridium botulinim), Bovine Spongiform Encephalopathy (BSE), Brainerd Diarrhea, Breast Cancer, Bronchiolitis, Bronchitis, Brucella Infection (Brucellosis), Brucellosis, Burkholderia cepacia Infection (B. cepacia infection), Burkholderia mallei, Burkholderia pseudomallei Infection, Campylobacter Infection (Campylobacteriosis), Campylobacteriosis, Cancer (e.g., Colorectal (Colon) Cancer, Gynecologic Cancers, Lung Cancer, Prostate Cancer, Skin Cancer, etc.), Candida Infection (Candidiasis), Candidiasis, Canine Flu, Capillaria Infection (Capillariasis), Capillariasis, Carbapenem resistant Klebsiella pneumonia (CRKP), Cat Flea Tapeworm, Cercarial Dermatitis, Cerebral Palsy, Cervical Cancer, Chagas Disease (Trypanosoma cruzi Infection), Chickenpox (Varicella Disease), Chikungunya Fever (CHIKV), Childhood Arthritis, German Measles (Rubella Virus), Measles, Mumps, Rotavirus Infection, Chlamydia (Chlamydia trachomatis Disease), Chlamydia pneumoniae Infection, Chlamydia trachomatis Disease, Cholera (Vibrio cholerae Infection), Chronic Fatigue Syndrome (CFS), Chronic Obstructive Pulmonary Disease (COPD), Ciguatera Fish Poisoning, Ciguatoxin, Classic Creutzfeldt-Jakob Disease, Clonorchiasis, Clonorchis Infection (Clonorchiasis), Clostridium botulinim, Clostridium difficile Infection, Clostridium perfringens infection, Clostridium tetani Infection, Clotting Disorders, CMV (Cytomegalovirus Infection), Coal Workers' Pneumoconioses, Coccidioidomycosis, Colorectal (Colon) Cancer, Common Cold, Conjunctivitis, Cooleys Anemia, COPD (Chronic Obstructive Pulmonary Disease), Corynebacterium diphtheriae Infection, Coxiella burnetii Infection, Creutzfeldt-Jakob Disease, CRKP (Carbapenem resistant Klebsiella pneumonia), Crohn's Disease, Cryptococcosis, Cryptosporidiosis, Cryptosporidium Infection (Cryptosporidiosis), Cyclospora Infection (Cyclosporiasis), Cyclosporiasis, Cysticercosis, Cystoisospora Infection (Cystoisosporaiasis), Cystoisosporaiasis, Cytomegalovirus Infection (CMV), Dengue Fever (DF), Dengue Hemorrhagic Fever (DHF), Dermatophytes, Dermopathy, Diabetes, Diamond Blackfan Anemia (DBA), Dientamoeba fragilis Infection, Diphtheria (Corynebacterium diphtheriae Infection), Diphyllobothriasis, Diphyllobothrium Infection (Diphyllobothriasis), Dipylidium Infection, Dog Flea Tapeworm, Down Syndrome (Trisomy 21), Dracunculiasis, Dwarf Tapeworm (Hymenolepis Infection), E. coli Infection (Escherichia coli Infection), Ear Infection (Otitis Media), Eastern Equine Encephalitis (EEE), Ebola Hemorrhagic Fever, Echinococcosis, Ehrlichiosis, Elephantiasis, Encephalitis (Mosquito-Borne and Tick-Borne), Entamoeba histolytica infection, Enterobius vermicularis Infection, Enterovirus Infections (Non-Polio), Epidemic Typhus, Epilepsy, Epstein-Barr Virus Infection (EBV Infection), Escherichia coli Infection, Extensively Drug-Resistant TB (XDR TB), Fasciola Infection (Fascioliasis), Fasciolopsis Infection (Fasciolopsiasis), Fibromyalgia, Fifth Disease (Parvovirus B19 Infection), Flavorings-Related Lung Disease, Folliculitis, Food-Related Diseases, Clostridium perfringens infection, Fragile X Syndrome, Francisella tularensis Infection, Genital Candidiasis (Vulvovaginal Candidiasis (VVC)), Genital Herpes (Herpes Simplex Virus Infection), Genital Warts, German Measles (Rubella Virus), Giardia Infection (Giardiasis), Glanders (Burkholderia mallei), Gnathostoma Infection, Gnathostomiasis (Gnathostoma Infection), Gonorrhea (Neisseria gonorrhoeae Infection), Gout, Granulomatous amebic encephalitis (GAE), Group A Strep Infection (GAS) (Group A Streptococcal Infection), Group B Strep Infection (GBS) (Group B Streptococcal Infection), Guinea Worm Disease (Dracunculiasis), Gynecologic Cancers (e.g., Cervical Cancer, Ovarian Cancer, Uterine Cancer, Vaginal and Vulvar Cancers, etc.), H1N1 Flu, Haemophilus influenzae Infection (Hib Infection), Hand, Foot, and Mouth Disease (HFMD), Hansen's Disease, Hantavirus Pulmonary Syndrome (HPS), Head Lice (Pediculus humanus capitis), Heart Disease (Cardiovascular Health), Heat Stress, Hemochromatosis, Hemophilia, Hendra Virus Infection, Herpes B virus, Herpes Simplex Virus Infection, Heterophyes Infection (Heterophyiasis), Hib Infection (Haemophilus influenzae Infection), High Blood Pressure, Histoplasma capsulatum Disease, Histoplasmosis (Histoplasma capsulatum Disease), Hot Tub Rash (Pseudomonas dermatitis Infection), HPV Infection (Human Papillomavirus Infection), Human Ehrlichiosis, Human Immunodeficiency Virus, Human Papillomavirus Infection (HPV Infection), Hymenolepis Infection, Hypertension, Hyperthermia, Hypothermia, Impetigo, Infectious Mononucleosis, Inflammatory Bowel Disease (IBD), Influenza, Avian Influenza, H1N1 Flu, Pandemic Flu, Seasonal Flu, Swine Influenza, Invasive Candidiasis, Iron Overload (Hemochromatosis), Isospora Infection (Isosporiasis), Japanese Encephalitis, Jaundice, K. pneumoniae (Klebsiella pneumoniae), Kala-Azar, Kawasaki Syndrome (KS), Kernicterus, Klebsiella pneumoniae (K. pneumoniae), La Crosse Encephalitis (LAC), La Crosse Encephalitis virus (LACV), Lassa Fever, Latex Allergies, Lead Poisoning, Legionnaires' Disease (Legionellosis), Leishmania Infection (Leishmaniasis), Leprosy, Leptospira Infection (Leptospirosis), Leptospirosis, Leukemia, Lice, Listeria Infection (Listeriosis), Listeriosis, Liver Disease and Hepatitis, Loa Infection, Lockjaw, Lou Gehrig's Disease, Lung Cancer, Lupus (SLE) (Systemic lupus erythematosus), Lyme Disease (Borrelia burgdorferi Infection), Lymphatic Filariasis, Lymphedema, Lymphocytic Choriomeningitis (LCMV), Lymphogranuloma venereum Infection (LGV), Malaria, Marburg Hemorrhagic Fever, Measles, Melioidosis (Burkholderia pseudomallei Infection), Meningitis (Meningococcal Disease), Meningococcal Disease, Methicillin Resistant Staphylococcus aureus (MRSA), Micronutrient Malnutrition, Microsporidia Infection, Molluscum Contagiosum, Monkey B virus, Monkeypox, Morgellons, Mosquito-Borne Diseases, Mucormycosis, Multidrug-Resistant TB (MDR TB), Mumps, Mycobacterium abscessus Infection, Mycobacterium avium Complex (MAC), Mycoplasma pneumoniae Infection, Myiasis, Naegleria Infection (Primary Amebic Meningoencephalitis (PAM)), Necrotizing Fasciitis, Neglected Tropical Diseases (NTD), Neisseria gonorrhoeae Infection, Neurocysticercosis, New Variant Creutzfeldt-Jakob Disease, Newborn Jaundice (Kernicterus), Nipah Virus Encephalitis, Nocardiosis, Non-Polio Enterovirus Infections, Nonpathogenic (Harmless) Intestinal Protozoa, Norovirus Infection, Norwalk-like Viruses (NLV), Novel H1N1 Flu, Onchocerciasis, Opisthorchis Infection, Oral Cancer, Orf Virus, Oropharyngeal Candidiasis (OPC), Osteoarthritis (OA), Osteoporosis, Otitis Media, Ovarian Cancer, Pandemic Flu, Paragonimiasis, Paragonimus Infection (Paragonimiasis), Parasitic Diseases, Parvovirus B19 Infection, Pediculus humanus capitis, Pediculus humanus corporis, Pelvic Inflammatory Disease (PID), Peripheral Arterial Disease (PAD), Pertussis, Phthiriasis, Pink Eye (Conjunctivitis), Pinworm Infection (Enterobius vermicularis Infection), Plague (Yersinia pestis Infection), Pneumocystis jirovecii Pneumonia, Pneumonia, Polio Infection (Poliomyelitis Infection), Pontiac Fever, Prion Diseases (Transmissible spongiform encephalopathies (TSEs)), Prostate Cancer, Pseudomonas dermatitis Infection, Psittacosis, Pubic Lice (Phthiriasis), Pulmonary Hypertension, Q Fever (Coxiella burnetii Infection), Rabies, Raccoon Roundworm Infection (Baylisascaris Infection), Rat-Bite Fever (RBF) (Streptobacillus moniliformis Infection), Recreational Water Illness (RWI), Relapsing Fever, Respiratory Syncytial Virus Infection (RSV), Rheumatoid Arthritis (RA), Rickettsia rickettsii Infection, Rift Valley Fever (RVF), Ringworm (Dermatophytes), Ringworm in Animals, River Blindness (Onchocerciasis), Rocky Mountain Spotted Fever (RMSF) (Rickettsia rickettsii Infection), Rotavirus Infection, RVF (Rift Valley Fever), RWI (Recreational Water Illness), Salmonella Infection (Salmonellosis), Scabies, Scarlet Fever, Schistosomiasis (Schistosoma Infection), Seasonal Flu, Severe Acute Respiratory Syndrome, Sexually Transmitted Diseases (STDs) (e.g., Bacterial Vaginosis (BV), Chlamydia, Genital Herpes, Gonorrhea, Human Papillomavirus Infection, Pelvic Inflammatory Disease, Syphilis, Trichomoniasis, HIV/AIDS, etc.), Shigella Infection (Shigellosis), Shingles (Varicella Zoster Virus (VZV)), Sickle Cell Disease, Single Gene Disorders, Sinus Infection (Sinusitus), Skin Cancer, Sleeping Sickness (African Trypanosomiasis), Smallpox (Variola Major and Variola Minor), Sore Mouth Infection (Orf Virus), Southern Tick-Associated Rash Illness (STARI), Spina Bifida (Myelomeningocele), Sporotrichosis, Spotted Fever Group Rickettsia (SFGR), St. Louis Encephalitis, Staphylococcus aureus Infection, Streptobacillus moniliformis Infection, Streptococcal Diseases, Streptococcus pneumoniae Infection, Stroke, Strongyloides Infection (Strongyloidiasis), Sudden Infant Death Syndrome (SIDS), Swimmer's Itch (Cercarial Dermatitis), Swine Influenza, Syphilis (Treponema pallidum Infection), Systemic lupus erythematosus, Tapeworm Infection (Taenia Infection), Testicular Cancer, Tetanus Disease (Clostridium tetani Infection), Thrush (Oropharyngeal Candidiasis (OPC)), Tick-borne Relapsing Fever, Tickborne Diseases (e.g., Anaplasmosis, Babesiosis, Ehrlichiosis, Lyme Disease, Tourette Syndrome (TS), Toxic Shock Syndrome (TSS), Toxocariasis (Toxocara Infection), Toxoplasmosis (Toxoplasma Infection), Trachoma Infection, Transmissible spongiform encephalopathies (TSEs), Traumatic Brain Injury (TBI), Trichinellosis (Trichinosis), Trichomoniasis (Trichomonas Infection), Tuberculosis (TB) (Mycobacterium tuberculosis Infection), Tularemia (Francisella tularensis Infection), Typhoid Fever (Salmonella typhi Infection), Uterine Cancer, Vaginal and Vulvar Cancers, Vancomycin-Intermediate/Resistant Staphylococcus aureus Infections (VISA/VRSA), Vancomycin-resistant Enterococci Infection (VRE), Variant Creutzfeldt-Jakob Disease (vCJD), Varicella-Zoster Virus Infection, Variola Major and Variola Minor, Vibrio cholerae Infection, Vibrio parahaemolyticus Infection, Vibrio vulnificus Infection, Viral Gastroenteritis, Viral Hemorrhagic Fevers (VHF), Viral Hepatitis, Viral Meningitis (Aseptic Meningitis), Von Willebrand Disease, Vulvovaginal Candidiasis (VVC), West Nile Virus Infection, Western Equine Encephalitis Infection, Whipworm Infection (Trichuriasis), Whitmore's Disease, Whooping Cough, Xenotropic Murine Leukemia Virus-related Virus Infection, Yellow Fever, Yersinia pestis Infection, Yersiniosis (Yersinia enterocolitica Infection), Zoonotic Hookworm, Zygomycosis, and the like.
In some instances, methods of treatment utilizing one or more sensor RNAs of the instant disclosure may find use in treating a cancer. Cancers, the treatment of which may include the use of one or more proteolytically cleavable polypeptides of the instant disclosure, will vary and may include but are not limited to e.g., Acute Lymphoblastic Leukemia (ALL), Acute Myeloid Leukemia (AML), Adrenocortical Carcinoma, AIDS-Related Cancers (e.g., Kaposi Sarcoma, Lymphoma, etc.), Anal Cancer, Appendix Cancer, Astrocytomas, Atypical Teratoid/Rhabdoid Tumor, Basal Cell Carcinoma, Bile Duct Cancer (Extrahepatic), Bladder Cancer, Bone Cancer (e.g., Ewing Sarcoma, Osteosarcoma and Malignant Fibrous Histiocytoma, etc.), Brain Stem Glioma, Brain Tumors (e.g., Astrocytomas, Central Nervous System Embryonal Tumors, Central Nervous System Germ Cell Tumors, Craniopharyngioma, Ependymoma, etc.), Breast Cancer (e.g., female breast cancer, male breast cancer, childhood breast cancer, etc.), Bronchial Tumors, Burkitt Lymphoma, Carcinoid Tumor (e.g., Childhood, Gastrointestinal, etc.), Carcinoma of Unknown Primary, Cardiac (Heart) Tumors, Central Nervous System (e.g., Atypical Teratoid/Rhabdoid Tumor, Embryonal Tumors, Germ Cell Tumor, Lymphoma, etc.), Cervical Cancer, Childhood Cancers, Chordoma, Chronic Lymphocytic Leukemia (CLL), Chronic Myelogenous Leukemia (CML), Chronic Myeloproliferative Neoplasms, Colon Cancer, Colorectal Cancer, Craniopharyngioma, Cutaneous T-Cell Lymphoma, Duct (e.g., Bile Duct, Extrahepatic, etc.), Ductal Carcinoma In Situ (DCIS), Embryonal Tumors, Endometrial Cancer, Ependymoma, Esophageal Cancer, Esthesioneuroblastoma, Ewing Sarcoma, Extracranial Germ Cell Tumor, Extragonadal Germ Cell Tumor, Extrahepatic Bile Duct Cancer, Eye Cancer (e.g., Intraocular Melanoma, Retinoblastoma, etc.), Fibrous Histiocytoma of Bone (e.g., Malignant, Osteosarcoma, ect.), Gallbladder Cancer, Gastric (Stomach) Cancer, Gastrointestinal Carcinoid Tumor, Gastrointestinal Stromal Tumors (GIST), Germ Cell Tumor (e.g., Extracranial, Extragonadal, Ovarian, Testicular, etc.), Gestational Trophoblastic Disease, Glioma, Hairy Cell Leukemia, Head and Neck Cancer, Heart Cancer, Hepatocellular (Liver) Cancer, Histiocytosis (e.g., Langerhans Cell, etc.), Hodgkin Lymphoma, Hypopharyngeal Cancer, Intraocular Melanoma, Islet Cell Tumors (e.g., Pancreatic Neuroendocrine Tumors, etc.), Kaposi Sarcoma, Kidney Cancer (e.g., Renal Cell, Wilms Tumor, Childhood Kidney Tumors, etc.), Langerhans Cell Histiocytosis, Laryngeal Cancer, Leukemia (e.g., Acute Lymphoblastic (ALL), Acute Myeloid (AML), Chronic Lymphocytic (CLL), Chronic Myelogenous (CML), Hairy Cell, etc.), Lip and Oral Cavity Cancer, Liver Cancer (Primary), Lobular Carcinoma In Situ (LCIS), Lung Cancer (e.g., Non-Small Cell, Small Cell, etc.), Lymphoma (e.g., AIDS-Related, Burkitt, Cutaneous T-Cell, Hodgkin, Non-Hodgkin, Primary Central Nervous System (CNS), etc.), Macroglobulinemia (e.g., Waldenström, etc.), Male Breast Cancer, Malignant Fibrous Histiocytoma of Bone and Osteosarcoma, Melanoma, Merkel Cell Carcinoma, Mesothelioma, Metastatic Squamous Neck Cancer with Occult Primary, Midline Tract Carcinoma Involving NUT Gene, Mouth Cancer, Multiple Endocrine Neoplasia Syndromes, Multiple Myeloma/Plasma Cell Neoplasm, Mycosis Fungoides, Myelodysplastic Syndromes, Myelodysplastic/Myeloproliferative Neoplasms, Myelogenous Leukemia (e.g., Chronic (CML), etc.), Myeloid Leukemia (e.g., Acute (AML), etc.), Myeloproliferative Neoplasms (e.g., Chronic, etc.), Nasal Cavity and Paranasal Sinus Cancer, Nasopharyngeal Cancer, Neuroblastoma, Non-Hodgkin Lymphoma, Non-Small Cell Lung Cancer, Oral Cancer, Oral Cavity Cancer (e.g., Lip, etc.), Oropharyngeal Cancer, Osteosarcoma and Malignant Fibrous Histiocytoma of Bone, Ovarian Cancer (e.g., Epithelial, Germ Cell Tumor, Low Malignant Potential Tumor, etc.), Pancreatic Cancer, Pancreatic Neuroendocrine Tumors (Islet Cell Tumors), Papillomatosis, Paraganglioma, Paranasal Sinus and Nasal Cavity Cancer, Parathyroid Cancer, Penile Cancer, Pharyngeal Cancer, Pheochromocytoma, Pituitary Tumor, Pleuropulmonary Blastoma, Primary Central Nervous System (CNS) Lymphoma, Prostate Cancer, Rectal Cancer, Renal Cell (Kidney) Cancer, Renal Pelvis and Ureter, Transitional Cell Cancer, Retinoblastoma, Rhabdomyosarcoma, Salivary Gland Cancer, Sarcoma (e.g., Ewing, Kaposi, Osteosarcoma, Rhabdomyosarcoma, Soft Tissue, Uterine, etc.), Sézary Syndrome, Skin Cancer (e.g., Childhood, Melanoma, Merkel Cell Carcinoma, Nonmelanoma, etc.), Small Cell Lung Cancer, Small Intestine Cancer, Soft Tissue Sarcoma, Squamous Cell Carcinoma, Squamous Neck Cancer (e.g., with Occult Primary, Metastatic, etc.), Stomach (Gastric) Cancer, T-Cell Lymphoma, Testicular Cancer, Throat Cancer, Thymoma and Thymic Carcinoma, Thyroid Cancer, Transitional Cell Cancer of the Renal Pelvis and Ureter, Ureter and Renal Pelvis Cancer, Urethral Cancer, Uterine Cancer (e.g., Endometrial, etc.), Uterine Sarcoma, Vaginal Cancer, Vulvar Cancer, Waldenström Macroglobulinemia, Wilms Tumor, and the like.
Also compositions for practicing the methods are described in the present disclosure. In general, subject compositions may have sensor RNA as described above in addition to a pharmaceutically acceptable excipient. In some embodiments, the subject compositions contain a secondary agent for treating any of the diseases or conditions described above.
Compositions of the present disclosure can be administered by any suitable means, including topical, oral, parenteral, intrapulmonary, and intranasal. Parenteral infusions include intramuscular, intravenous (bolus or slow drip), intraarterial, intraperitoneal, intrathecal or subcutaneous administration. An agent can be administered in any manner which is medically acceptable. This may include injections, by parenteral routes such as intravenous, intravascular, intraarterial, subcutaneous, intramuscular, intratumor, intraperitoneal, intraventricular, intraepidural, or others as well as oral, nasal, ophthalmic, rectal, or topical. Sustained release administration is also specifically included in the disclosure, by such means as depot injections or erodible implants.
As noted above, sensor RNA can be formulated with an a pharmaceutically acceptable carrier (one or more organic or inorganic ingredients, natural or synthetic, with which a subject agent is combined to facilitate its application). A suitable carrier includes sterile saline although other aqueous and non-aqueous isotonic sterile solutions and sterile suspensions known to be pharmaceutically acceptable are known to those of ordinary skill in the art. An “effective amount” refers to that amount which is capable of ameliorating or delaying progression of the diseased, degenerative or damaged condition. An effective amount can be determined on an individual basis and will be based, in part, on consideration of the symptoms to be treated and results sought. An effective amount can be determined by one of ordinary skill in the art employing such factors and using no more than routine experimentation.
The composition may be administered in a unit dosage form and may be prepared by any methods well known in the art. Such methods include combining agent with a pharmaceutically acceptable carrier or diluent which constitutes one or more accessory ingredients. A pharmaceutically acceptable carrier is selected on the basis of the chosen route of administration and standard pharmaceutical practice. Each carrier must be “pharmaceutically acceptable” in the sense of being compatible with the other ingredients of the formulation and not injurious to the subject. This carrier can be a solid or liquid and the type is generally chosen based on the type of administration being used.
Depending on the individual and condition being treated and on the administration route, the active agent may be administered in dosages of 0.01 mg to 500 mg/kg body weight per day, e.g., about 20 mg/day for an average person. Dosages will be appropriately adjusted for pediatric formulation.
In some embodiments, the composition is formulated in an aqueous buffer. Suitable aqueous buffers include, but are not limited to, acetate, succinate, citrate, and phosphate buffers varying in strengths from 5 mM to 100 mM. In some embodiments, the aqueous buffer includes reagents that provide for an isotonic solution. Such reagents include, but are not limited to, sodium chloride; and sugars e.g., mannitol, dextrose, sucrose, and the like. In some embodiments, the aqueous buffer further includes a non-ionic surfactant such as polysorbate 20 or 80. Optionally the composition may further include a preservative. Suitable preservatives include, but are not limited to, a benzyl alcohol, phenol, chlorobutanol, benzalkonium chloride, and the like. In many cases, the composition is stored at about 4° C. Pharmaceutical compositions may also be lyophilized, in which case they generally include cryoprotectants such as sucrose, trehalose, lactose, maltose, mannitol, and the like. Lyophilized formulations can be stored over extended periods of time, even at ambient temperatures.
Compositions can be prepared as injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, or suspension in, liquid vehicles prior to injection can also be prepared. The preparation also can be emulsified or encapsulated in liposomes or micro particles such as polylactide, polyglycolide, or copolymer for enhanced adjuvant effect, as discussed above. Langer, Science 249: 1527, 1990 and Hanes, Advanced Drug Delivery Reviews 28: 97-119, 1997. The compositions of this invention can be administered in the form of a depot injection or implant preparation which can be formulated in such a manner as to permit a sustained or pulsatile release of the active ingredient. The pharmaceutical compositions are generally formulated as sterile, substantially isotonic and in full compliance with all Good Manufacturing Practice (GMP) regulations of the U.S. Food and Drug Administration.
As described above, the composition may also contain a secondary agent for treatment of any of the diseases or condition described above. When the disease or condition is cancer, the secondary agent may be a chemotherapeutic agent. Chemotherapeutic agents that find use in the present disclosure include, without limitation, Abitrexate (Methotrexate Injection), Abraxane (Paclitaxel Injection), Adcetris (Brentuximab Vedotin Injection), Adriamycin (Doxorubicin), Adrucil Injection (5-FU (fluorouracil)), Afinitor (Everolimus), Afinitor Disperz (Everolimus), Alimta (PEMET EXED), Alkeran Injection (Melphalan Injection), Alkeran Tablets (Melphalan), Aredia (Pamidronate), Arimidex (Anastrozole), Aromasin (Exemestane), Arranon (Nelarabine), Arzerra (Ofatumumab Injection), Avastin (Bevacizumab), Bexxar (Tositumomab), BiCNU (Carmustine), Blenoxane (Bleomycin), Bosulif (Bosutinib), Busulfex Injection (Busulfan Injection), Campath (Alemtuzumab), Camptosar (Irinotecan), Caprelsa (Vandetanib), Casodex (Bicalutamide), CeeNU (Lomustine), CeeNU Dose Pack (Lomustine), Cerubidine (Daunorubicin), Clolar (Clofarabine Injection), Cometriq (Cabozantinib), Cosmegen (Dactinomycin), CytosarU (Cytarabine), Cytoxan (Cytoxan), Cytoxan Injection (Cyclophosphamide Injection), Dacogen (Decitabine), DaunoXome (Daunorubicin Lipid Complex Injection), Decadron (Dexamethasone), DepoCyt (Cytarabine Lipid Complex Injection), Dexamethasone Intensol (Dexamethasone), Dexpak Taperpak (Dexamethasone), Docefrez (Docetaxel), Doxil (Doxorubicin Lipid Complex Injection), Droxia (Hydroxyurea), DTIC (Decarbazine), Eligard (Leuprolide), Ellence (Ellence (epirubicin)), Eloxatin (Eloxatin (oxaliplatin)), Elspar (Asparaginase), Emcyt (Estramustine), Erbitux (Cetuximab), Erivedge (Vismodegib), Erwinaze (Asparaginase Erwinia chrysanthemi), Ethyol (Amifostine), Etopophos (Etoposide Injection), Eulexin (Flutamide), Fareston (Toremifene), Faslodex (Fulvestrant), Femara (Letrozole), Firmagon (Degarelix Injection), Fludara (Fludarabine), Folex (Methotrexate Injection), Folotyn (Pralatrexate Injection), FUDR (FUDR (floxuridine)), Gemzar (Gemcitabine), Gilotrif (Afatinib), Gleevec (Imatinib Mesylate), Gliadel Wafer (Carmustine wafer), Halaven (Eribulin Injection), Herceptin (Trastuzumab), Hexalen (Altretamine), Hycamtin (Topotecan), Hycamtin (Topotecan), Hydrea (Hydroxyurea), lclusig (Ponatinib), Idamycin PFS (Idarubicin), Ifex (Ifosfamide), Inlyta (Axitinib), Intron A alfab (Interferon alfa-2a), Iressa (Gefitinib), Istodax (Romidepsin Injection), Ixempra (Ixabepilone Injection), Jakafi (Ruxolitinib), Jevtana (Cabazitaxel Injection), Kadcyla (Ado-trastuzumab Emtansine), Kyprolis (Carfilzomib), Leukeran (Chlorambucil), Leukine (Sargramostim), Leustatin (Cladribine), Lupron (Leuprolide), Lupron Depot (Leuprolide), Lupron DepotPED (Leuprolide), Lysodren (Mitotane), Marqibo Kit (Vincristine Lipid Complex Injection), Matulane (Procarbazine), Megace (Megestrol), Mekinist (Trametinib), Mesnex (Mesna), Mesnex (Mesna Injection), Metastron (Strontium-89 Chloride), Mexate (Methotrexate Injection), Mustargen (Mechlorethamine), Mutamycin (Mitomycin), Myleran (Busulfan), Mylotarg (Gemtuzumab Ozogamicin), Navelbine (Vinorelbine), Neosar Injection (Cyclophosphamide Injection), Neulasta (filgrastim), Neulasta (pegfilgrastim), Neupogen (filgrastim), Nexavar (Sorafenib), Nilandron (Nilandron (nilutamide)), Nipent (Pentostatin), Nolvadex (Tamoxifen), Novantrone (Mitoxantrone), Oncaspar (Pegaspargase), Oncovin (Vincristine), Ontak (Denileukin Diftitox), Onxol (Paclitaxel Injection), Panretin (Alitretinoin), Paraplatin (Carboplatin), Perjeta (Pertuzumab Injection), Platinol (Cisplatin), Platinol (Cisplatin Injection), PlatinolAQ (Cisplatin), PlatinolAQ (Cisplatin Injection), Pomalyst (Pomalidomide), Prednisone Intensol (Prednisone), Proleukin (Aldesleukin), Purinethol (Mercaptopurine), Reclast (Zoledronic acid), Revlimid (Lenalidomide), Rheumatrex (Methotrexate), Rituxan (Rituximab), RoferonA alfaa (Interferon alfa-2a), Rubex (Doxorubicin), Sandostatin (Octreotide), Sandostatin LAR Depot (Octreotide), Soltamox (Tamoxifen), Sprycel (Dasatinib), Sterapred (Prednisone), Sterapred DS (Prednisone), Stivarga (Regorafenib), Supprelin LA (Histrelin Implant), Sutent (Sunitinib), Sylatron (Peginterferon Alfa-2b Injection (Sylatron)), Synribo (Omacetaxine Injection), Tabloid (Thioguanine), Taflinar (Dabrafenib), Tarceva (Erlotinib), Targretin Capsules (Bexarotene), Tasigna (Decarbazine), Taxol (Paclitaxel Injection), Taxotere (Docetaxel), Temodar (Temozolomide), Temodar (Temozolomide Injection), Tepadina (Thiotepa), Thalomid (Thalidomide), TheraCys BCG (BCG), Thioplex (Thiotepa), TICE BCG (BCG), Toposar (Etoposide Injection), Torisel (Temsirolimus), Treanda (Bendamustine hydrochloride), Trelstar (Triptorelin Injection), Trexall (Methotrexate), Trisenox (Arsenic trioxide), Tykerb (lapatinib), Valstar (Valrubicin Intravesical), Vantas (Histrelin Implant), Vectibix (Panitumumab), Velban (Vinblastine), Velcade (Bortezomib), Vepesid (Etoposide), Vepesid (Etoposide Injection), Vesanoid (Tretinoin), Vidaza (Azacitidine), Vincasar PFS (Vincristine), Vincrex (Vincristine), Votrient (Pazopanib), Vumon (Teniposide), Wellcovorin IV (Leucovorin Injection), Xalkori (Crizotinib), Xeloda (Capecitabine), Xtandi (Enzalutamide), Yervoy (Ipilimumab Injection), Zaltrap (Ziv-aflibercept Injection), Zanosar (Streptozocin), Zelboraf (Vemurafenib), Zevalin (Ibritumomab Tiuxetan), Zoladex (Goserelin), Zolinza (Vorinostat), Zometa (Zoledronic acid), Zortress (Everolimus), Zytiga (Abiraterone), Nimotuzumab and immune checkpoint inhibitors such as nivolumab, pembrolizumab/MK-3475, pidilizumab and AMP-224 targeting PD-1; and BMS-935559, MEDI4736, MPDL3280A and MSB0010718C targeting PD-L1 and those targeting CTLA-4 such as ipilimumab.
When the disease or condition is associated with an infection, the secondary agent may be an antibiotic. Antibiotics that find use in the present disclosure include, without limitation, antibiotics with the classes of aminoglycosides; carbapenems; and the like; penicillins, e.g. penicillin G, penicillin V, methicillin, oxacillin, carbenicillin, nafcillin, ampicillin, etc. penicillins in combination with β-lactamase inhibitors, cephalosporins, e.g. cefaclor, cefazolin, cefuroxime, moxalactam, etc.; tetracyclines; cephalosporins; quinolones; lincomycins; macrolides; sulfonamides; glycopeptides including the anti-infective antibiotics vancomycin, teicoplanin, telavancin, ramoplanin and decaplanin. Derivatives of vancomycin include, for example, oritavancin and dalbavancin (both lipoglycopeptides). Telavancin is a semi-synthetic lipoglycopeptide derivative of vancomycin (approved by FDA in 2009). Other vancomycin analogs are disclosed, for example, in WO 2015022335 A1 and Chen et al. (2003) PNAS 100(10): 5658-5663, each herein specifically incorporated by reference. Non-limiting examples of antibiotics include vancomycin, linezolid, azithromycin, daptomycin, colistin, eperezolid, fusidic acid, rifampicin, tetracyclin, fidaxomicin, clindamycin, lincomycin, rifalazil, and clarithromycin.
Also provided are kits for practicing the methods described in the present disclosure. In general, subject kits may contain a sensor RNA as described above. The sensor RNA may be contained in a lipid nanoparticle or the sensor RNA may be within a recombinant vector as described above. In some cases, the kit further contains an ADAR protein or a coding sequence thereof. When the kit contains a coding sequence of the ADAR it may be in a recombinant vector as described above. When the kit contains the ADAR protein or the coding sequence thereof, the ADAR protein may be any ADAR protein described above.
In some cases, the kit may further contain a positive and/or negative control. The positive control may be in the form of a biological sample containing the target RNA, a sensor RNA containing an edited codon (i.e., a stop codon that has been edited to be a non-stop codon or a start codon edited to be a non-start codon or a non-start codon edited to be a start codon) or a sensor RNA containing the nucleotide sequence of the target RNA. The negative control may be in the form of a biological sample that does not contain the target RNA.
A subject kit can include any combination of components for performing the methods of the present disclosure. The components of a subject kit can be present as a mixture or can be separate entities. In some cases, components are present as a lyophilized mixture. In some cases, the components are present as a liquid mixture. Components of a subject kit can be in the same or separate containers, in any combination.
The subject kits may further include (in certain embodiments) instructions for practicing the subject methods. These instructions may be present in the subject kits in a variety of forms, one or more of which may be present in the kit. One form in which these instructions may be present is as printed information on a suitable medium or substrate, e.g., a piece or pieces of paper on which the information is printed, in the packaging of the kit, in a package insert, and the like. Yet another form of these instructions is a computer readable medium, e.g., diskette, compact disk (CD), flash drive, and the like, on which the information has been recorded. Yet another form of these instructions that may be present is a website address which may be used via the internet to access the information at a remote site.
The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how to make and use the present invention, and are not intended to limit the scope of what the inventors regard as their invention nor are they intended to represent that the experiments below are all or the only experiments performed. Efforts have been made to ensure accuracy with respect to numbers used (e.g., amounts, temperature, etc.) but some experimental errors and deviations should be accounted for. Unless indicated otherwise, parts are parts by weight, molecular weight is weight average molecular weight, temperature is in degrees Centigrade, and pressure is at or near atmospheric. Standard abbreviations may be used, e.g., bp, base pair(s); kb, kilobase(s); pl, picoliter(s); s or sec, second(s); min, minute(s); h or hr, hour(s); aa, amino acid(s); nt, nucleotide(s); i.m., intramuscular(ly); i.p., intraperitoneal(ly); s.c., subcutaneous(ly); and the like.
A RADAR sensor is inspired by recent advances in RNA editing (Katrekar, D. et al. (2019) Nat. Methods 16, 239-242; Qu, L. et al. (2019) Nat. Biotechnol. 37, 1059-1069; Merkle, T. et al. (2019) Nat. Biotechnol. 37, 133-138; Reautschnig, P. et al. (2022) Nat. Biotechnol. 1-10), and consists of three parts (
We first verified RADAR in human embryonic kidney (HEK) cells using a de novo designed trigger, T1, embedded in the 3′ UTR of a cotransfected gene (
We then validated RADAR in scenarios closer to eventual use cases. First, in addition to transiently delivered trigger plasmids, we saw increased output in response to doxycyclineinduced expression of T1 embedded in the 3′ UTR of a genomically integrated EGFP (
We then explored strategies for expanding the coverage of the human/mouse transcriptome. First, using the 3′ UTR of Bdnf, we observed that RADAR performance was largely maintained down to 72 bp dsRNA (
Combining the ability of shortening the sensor, using a split design, and sensing CDSs, >85% of human (
While ADAR over-expression greatly improves RADAR's dynamic range, it may have detrimental side effects. To ameliorate this potential problem, we utilized an engineered version of ADAR, containing only a mutant deaminase domain of ADAR2 and the MS2 RNA binding protein MCP (“ADAR(DD)-MCP”)(Katrekar, D. et al. (2019) Nat. Methods 16, 239-242; Biswas, J. et al. (2020) iScience 23, 101318). Combining this enzyme with MS2-bearing sensors, we achieved a dynamic range similar to ADAR1p150 over-expression (
RADAR has several unique features and potential applications. For example, RADAR can be used for cell classification (
Cell states, especially in medical contexts, are often defined by not only the expression level of RNAs but also the presence of new RNA sequences. We leveraged RADAR's unique features to sense the latter. First, as ADAR is sensitive to the base identities surrounding the edited adenosine (Qu, L. et al. (2019) Nat. Biotechnol. 37, 1059-1069), RADAR is uniquely suitable for distinguishing certain short genetic variants. As a demonstration, we validated a sensor that distinguishes two common oncogenic mutations of TP53 associated with different invasive traits in cancer cells (Yoshikawa, K. et al. (2010) Biomed. Res. 31, 401-411) (
Finally, to show that RADAR is a self-contained module, we assessed RADAR in plants, organisms lacking endogenous ADAR. Without any optimization other than switching to plant promoters and vectors, we observed elevated fluorescent output in response to the matching T1 trigger compared to a non-matching trigger in a Nicotiana benthamiana model system (
Here we demonstrated leveraging ADAR editing to create a modular, programmable molecular device capable of sensing a broad variety of RNAs. One key direction of future improvements to RADAR is increasing input sensitivity (
The methods herein have numerous applications. RADARs can be used for feedback gene editing where gene editing enzymes may be turned off once a desired mutation is detected to reduce off-target editing. RADARs can be used for feedback gene expression for gene therapy where a transcription factor could be produced in response to the gene that it regulates, either positively or negatively. This enables precise control over the expression levels of that gene. RADARs can be used for markerless cell therapy screening where in cell therapies, cells must be edited/engineered with high fidelity, but this is often hard to achieve without having a selectable marker (which most of the time is undesirable to use). RADAR could be used to transiently select for cells that have the intended. RADARs can be used to detect plant pathogens such as viruses. RADARs can be used for the delivery of oncolytics and senolytics to kill diseased cells. RADARs can be used for manipulating cells based on their type or state: e.g., sensing a marker of T cell exhaustion to then modify T cell therapy behavior; detecting whether a cell is the right dendritic cell to express an antigen for a “tolerogenic vaccine” (antigens expressed by certain dendritic cells will make the body become tolerant to it, so this is kind of treatment for allergies). RADARs can be used for regulating engineered RNA viruses (e.g., alphaviruses and the rabies virus have been engineered for various purposes, including therapeutics such as self-amplifying RNA vaccines or cancer treatments, and our system could be used to control that, e.g. by negative feedback, to regulate dosage and lifetime). Our system could a part of the RNA virus package or a separate, co-delivered module. RADARs can be used for safety devices other than feedback for engineered viruses; e.g., one may not want to express an RNA-based therapy in cells infected with a retrovirus such as HIV, so our system could detect the presence of HIV RNA, and shut down so that the RNA wouldn't get integrated into the genome. (It could of course also try to deliver a treatment/inhibitor to the latent HIV). RADARs can also be used for IVF screening of eggs for certain mutations before fertilization (non-destructive, non-modifying).
Plasmid generation. Plasmids were generated using standard molecular cloning practices, including InFusion of linearized plasmids and PCR fragments and restriction-ligation of linearized fragments and annealed phosphorylated oligonucleotides. A complete list of plasmids and associated maps is found in Table 3. Plasmids are available upon request from the corresponding author and will be deposited to Addgene. Human ADAR plasmids as well as the ADAR1 knockout cell line were generously provided by prof. Billy Li. Cre and Cre reporter plasmids were kindly gifted by prof. Liqun Luo. pUBC_stdMCP_serinemod_E488QADAR_p2A_yGFP (“ADAR(DD)-MCP”) was a gift from Robert Singer (Addgene plasmid #154787).
Tissue culture. Flp-In T-REx Human Embryonic Kidney (HEK) 293 cells were purchased from Thermo Scientific (catalog #R78007). Cells were cultured in a humidity-controlled incubator under standard culture conditions (37° C. with 5% C02) in Dulbecco's Modified Eagle Medium, supplemented with 10% fetal bovine serum (Fisher Scientific catalog #FB 12999102), 1 mM sodium pyruvate (EMD Millipore catalog #TMS-005-C), Ix penicillin-streptomycin (Genesee catalog #25-512), 2 mM Lglutamine (Genesee catalog #25-509) and 1× MEM non-essential amino acids (Genesee catalog #25-536). To induce expression of certain constructs, 100 ng/mL of doxycycline was added at the time of transfection. The inducible-trigger containing cell line was generated by transfecting the construct in a PiggyBac backbone along with PiggyBac integrase (4:1), with 50 μg/mL hygromycin added for selection when reseeding into 10 cm dishes two days after transfection.
Transient transfections. HEK 293 cells were cultured in either 24-well or 96-well tissue culture-treated plates under standard culture conditions. When cells were 70-90% confluent, the cells were transiently transfected with plasmid constructs using the jetOPTIMUSR DNA transfection Reagent (Polyplus catalog #117-15), as per manufacturer's instructions using 0.375 uL of reagent per 50 uL of jetOPTIMUS buffer for 500 ng total DNA transfections in the 24-well format and 0.13 uL of reagent per 12.5 uL of buffer for 130 ng total DNA in the 96-well format. All transfections are detailed in Table 4.
Nicotiana
benthamiana
24-well format: approximately 100,000 cells seeded. 0.375 uL of jetOptimus reagent in 50 uL jetOptimus buffer per condition for transfection on the following day. 500 ng total DNA for each condition, with remainder adjusted with filler plasmid. 96-well format: approximately 25,000 cells seeded. 0.13 uL of jetOptimus reagent in 12.5 uL jetOptimus buffer per condition for transfection on the following day. 130 ng total DNA for each condition, with remainder adjusted with filler plasmid. plant infiltration: Agrobacteria were electroporated with each plasmid separately. Overnight inoculates were verified with colony PCR. Cultures were diluted to an GD of 0.9 and combined in equal volumes for infiltration.
Flow cytometry and data analysis. Cells were harvested approximately 48 hours after transfection by trypsinization and resuspended in flow buffer (HBSS+2.5 mg/mL bovine serum albumin). Post 40 um straining, cells were analyzed by flow cytometry (Biorad ZE5 Cell Analyzer), and data was processed with the cytoflow Python package. An overview of gating and analysis is given in
RNA extraction and reverse-transcription. HEK293T cells grown in 24-plates were spun down and RNA was extracted using the following kits: RNAasy mini kit (Qiagen), RNase-Free DNase Set (Qiagen), and QIAshredder (Qiagen). After extraction RNA quality was assessed by running 500 ng on 1% agarose gel. 500 ng of purified RNA was then reverse-transcribed using iScript cDNA synthesis (Biorad). For Sanger sequencing, cDNA was sent to Genewiez/Azenta with matching primers.
qPCR Measurements. qPCR was carried out on a QuantStudio3 (Applied Biosystems) using SYBR-Green. RNA estimation was calculated based on the calibration curve of purified plasmid and normalized by the Ct threshold. The following primer pair sequences for GFP (Signagen) and normalizing gene (β-actin) were used: GFP-F AAGCAGAAGAACGGCATCAA (SEQ ID NO: 10), GFP-r TCCAGCAGGACCATGTGATC (SEQ ID NO:11), β-actin-F CGTCCACCGCAAATGCTT (SEQ ID NO: 12), β-actin-R GTTTTCTGCGCAAGTTAGGTTTTGT (SEQ ID NO: 13).
Leaf infiltration and analysis. Three leaves in three Nicotiana benthamiana plants were infiltrated with Agrobacterium separately transformed with RADAR components (sensor and human ADAR1p150), and a matching or unrelated trigger, with two infiltrations of either trigger per leaf. Quantification based on averages of average fluorescence intensities from six rectangular regions per infiltration spot. EGFP (output) fluorescence was normalized using the mCherry (marker) fluorescent intensity.
Statistical analysis. Values are reported as the means from at least three biological replicates, representative of two independent biological experiments. For experiments comparing two groups, a Bonferroni-corrected two-tailed Student t-test was used to assess significance.
Bioinformatics. We analyzed all genes with annotated 3′ UTRs in the human genome in search of trigger sequences compatible with the RADAR design. Later, we also considered all genes with a CDS annotation. 3′ UTR and CDS sequences were obtained from Ensembl Biomart. Candidates must have a 5′ CCA 3′ sequence (which is paired with the 5′ UAG 3′ in the sensor) flanked by a sufficiently long stretch of sequence; the reverse complement of the trigger sequence should not create an in-frame stop codon in the sensor. Furthermore, the trigger sequences should be unique (assessed by mapping to the genome), and the sensor sequence easily synthesizable (lacking homopolymer runs and with 35-80% GC content). The hg38 genome build was used for human transcriptome analysis and the GRCm39 build for murine transcriptome analysis. The blastn tool version 2.9.0+ was used with the -task blastn-evalue 1 arguments, and further filtering was done on the alignment length (minimum 30 matches) and position (must overlap the central CCA or alternative sequence). Results from alternative chromosomes duplicative in nature were removed. A single hit—the target sequence—was allowed, with sensors having more hits discarded from analysis; this means that some designs were discarded due to pseudogenes and other repetitive regions that may or may not be expressed as RNA. As sequences other than the UAG:CCA pairing can efficiently be edited, we also analyzed candidate trigger sequences with a central GCA, UCA, or CAA sequence.
The 20211025 release of ClinVar was used to analyze the potential for detecting known and likely pathogenic variants, assuming that UAG can be edited when paired by one of CAA, CUA, CGA, ACA, UCA, GCA, CCA, CCU, CCC (Qu, L. et al. (2019) Nat. Biotechnol. 37, 1059-1069). Alternative and reference alleles were padded based on the hg38 reference genome by one base on either side. A variant was considered distinguishable by RADAR if the padded alternate allele contained one of the edit-triggering variants, but the padded reference did not contain any.
In vitro transcribed mRNA and pseudouridine incorporation. While the RADAR design does not require special modifications (i.e., the mRNA can be made by the cell from a DNA plasmid, or it could be made in vitro with standard nucleotides), it can in some cases be hindered by modifications.
One such modification is pseudouridine (Ψ), particularly N1-methyl-pseudouridine, used for synthetic mRNA to reduce its immunogenicity. Typically, all uridine (U) bases are replaced by pseudouridine in IVT mRNA by supplying the modified nucleoside triphosphate instead of UTP in the reaction. However, Ψ affects ADAR editing negatively. Furthermore, it increases stop-codon readthrough, and is thus particularly not good for RADAR—the off state looks less “off” due to increased readthrough, and the on state looks less “on” due to decreased editing.
If the UAG stop codon is used, ΨAG may be particularly affected in terms of editing, due to the increased base stacking between Ψ and A that prevents the necessary flipping out of the A base. The UGA stop codon, particularly when followed by G (UGAg) may be helpful in this case, as ΨGAg does not directly put the modified base next to A, although a 5′ G also decreases editing of an A. In addition to affecting the catalytic part of ADAR editing, Ψ can affect the dsRNA binding ability of ADAR.
To generate IVT mRNA, a sensor or trigger plasmid containing the T7 promoter, a TEV 5′ leader UTR and a hybrid 3′ UTR was amplified, adding a 120-base poly A tail. The PCR amplicon was purified and used as the template in an IVT reaction with the CleanCap AG reagent, ATP, GTP, CTP, and T7 polymerase in the presence of murine RNase inhibitor; the amount of UTP vs N1 mΨ was varied 0-100%. IVT mRNA fidelity was verified on a gel, DNA removed with DNase I, and mRNA purified using the QIAgen RNeasy mini kit. mRNA was transfected with the TransIT-mRNA kit, with 500 ng of total mRNA per well in 24-well format, with flow cytometry after 20 hours. ADAR was not over-expressed in the mRNA experiments, relying only on the wild-type expression levels of ADAR1 in HEK293 cells. Trigger was in the 3′ UTR of BFP, with a control sequence used in the “no-input” case so that both conditions received 200 ng of sensor and 300 ng of a BFP mRNA (with matching or non-matching trigger sequence).
Full pseudouridine incorporation indeed greatly diminished, but did not fully abrogate sensor functionality, both by increasing baseline and reducing trigger-dependent activation (
UAG:NNN analysis. To understand the capability for nucleotide variant detection (single-nucleotide and multi-nucleotide variants, SNVs and MNVs), all 5′ NNN 3′ trigger sequences opposite the UAG stop codon were interrogated.
The triggering ability of the 64 5′ NNN 3′ sequences vary across around two logs (
Effect of bulges near UAG/CCA. Mismatches near the CCA:UAG pairing on the triggering strand were inserted. None of the tested mismatches, neither 5′ nor 3′ strongly affected sensor performance (
Previously called “Offset RADAR”. This mechanism takes the standard RADAR and separates out the RNA binding from the RNA editing.
Sensor composition: (1) UAG/UGA/UAA stop codon within a stem-loop, sur-rounded by sequence complementary to the trigger RNA, (2) a 2A tag and (3) the output protein. An optional marker+2A tag can precede the sensor RNA.
Mechanism: The sensor and trigger form dsRNA around the stem-loop, which recruits ADAR. ADAR is then able to edit the stop codon within the stem-loop and remove it, allowing translation of the downstream output.
The stem-loop can be selected from naturally occurring ADAR editing sites, or selected from a library screen. Crucially, the stem-loop should not be edited without the extended dsRNA formation. For natural sites, this means the stem loop should likely be shortened, otherwise it is edited by ADAR without the dsRNA formation. The stem-loop is typically placed in the middle of the sequence complementary to the trigger RNA, but this position can be altered (e.g., 5′ of the complementary region, 3′ of the complementary region, or somewhere in between; it should be in the vicinity of the complementary region).
They key aspect is that the editing substrate is the stem-loop formed by the sensor, not the sensor:trigger dsRNA duplex—this is key to its novelty, as such a design has not been envisioned by International Application PCT/US2022/033459 or Qian et al. Nature 2022.
Portions of natural ADAR editing substrate stem-loops have previously been incorporated into guide RNAs for the purposes of editing endogenous RNAs (as opposed to editing an exogenous RNA such as the sensor RNAs described here), for example Fukuda et al., Scientific Reports 2017. However, the stem-loops in those examples have been truncated to contain the portion of the stem-loop that interacts with the ADAR dsRNA binding domains, whereas here the stem-loops are truncated to the portions that interact with and are edited by the ADAR catalytic domain. Other types of stem-loops that are not the targets of ADAR editing have also been introduced to other guide RNAs for the purposes of recruiting engineered ADAR enzymes for editing endogenous RNAs; such stem-loops include MS2 hairpins for recruiting MCP-ADAR(DD) (e.g., Azad et al. Gene Therapy 2017 and International Application PCT/US2022/033459), or Cas13-binding hairpins for recruiting dCas13b-ADAR(DD) (e.g., Cox et al. Science 2017 and WO2019071048). However, in those applications, the stem-loop does not serve as the editing substrate containing an editable codon as it does here, but is used for recruiting an engineered ADAR, while in ModulADAR, the dsRNA formed by the sensor and trigger RNA recruit native ADAR.
A major advantage of changing the editing substrate is that this allows deep optimization of the editing, separate from the binding. ADAR enzymes have dsRNA binding domains that have little substrate specificity and a separate catalytic deaminase domain, which does have some substrate preferences. In the standard RADAR setup, the sensor dsRNA around the editing site is determined by the sequence of the triggering RNA so the optimization of ADAR binding and editing is done jointly; here either can be freely chosen. In the standard setup, the UAG stop codon is chosen because it is the most robust, particularly if immediately paired with CCA; in the ModulADAR design other stop codons can be leveraged, which can be advantageous, e.g. because the UAA stop codon typically allows for less readthrough than UAG, or UGA (particularly if followed by G, so UGAG) may be less affected by uridine modifications than UAG since in the former the A is not surrounded by modified bases while in the latter it is (the commonly used pseudouridine modification inhibits ADAR editing).
Another advantage and aspect of novelty is that now more sequences can be chosen as candidate triggers; in the standard design each sequence should have a CCA (or some small number of alternatives) that can basepair with UAG to efficiently edit it, while here there's no such requirement.
In principle, this kind of approach (separating the ADAR binding and editing) can be applied to the “uORF,” “AUA,” and “AUG” approaches described in the present disclosure as well. Lists of natural editing targets with suitable motifs that could be used in each case are prioritized. By having the editing substrate in a separate, stand-alone and trigger-independent sequence, motifs larger than the three bases of a stop codon could be used, for example a strong Kozak sequence can be included in front of a non-start (AUA) or start codon (AUG), without needing to find the reverse complement of such a sequence within the trigger RNA, as would be required if the editing happened in the duplex formed by the sensor and trigger RNAs.
Stem-loop choice. The stop-codon-containing stem-loop choice is critical. It should be well-edited when ADAR is localized by dsRNA formation, but should not be edited without the presence of additional dsRNA formed by the sensor and trigger.
One source of stem-loops is natural editing sites, which are often in a stem-loop. These stem-loops generally have a stem that is long enough to be bound by the dsRNA binding domains of ADAR enzymes. For use in ModulADAR, the stem should be shortened to just the part bound by the catalytic domain of ADAR.
Natural editing sites often contain UAG sequences; the stem-loop should be inserted into the sensor such that the UAG is in-frame with the coding sequences. Natural editing sites containing UGA or UAA also exist. For sequences containing UAA, both As should be naturally edited. Any other in-frame stop codons should be removed; they can be kept if they are also efficiently edited upon ADAR co-localization.
Natural editing sites in the human genome have been cataloged (Gabay et al., Nature Communications 2022). A handful of these sites are tested; specifically, ones derived from the GLURB, CAPS1, GLI1, GABRA3, and HT2RC genes.
Stem-loops are generated in a library and evaluated for performance. The contact area of ADAR2 deaminase domain based on available structures is around 12 bp, so stems tested in this way should be about 9-30 bp, with the size of the stem referring to the basepaired portion of the stem-loop.
Basic Characterization. When a stem-loop derived from GLURB (aka GLUR2, GRIA2) is placed in the middle of the complementarity region, the ModulADAR design works well (
OR logic. OR logic in the standard case (no editable stem-loop) is achieved by co-delivering two separate RNAs with different inputs. This means that if you have two inputs, A and B, the output dose from just “A” will be ≈50%, just “B” will be ≈50% and from both “A” and “B” together will be 100%.
A single-molecule OR gate can be made with the ModulADAR technique such that the output from all cases (just “A”, just “B”, or both “A” and “B” together) will be approximately the same (
uORF RADAR
Sensor composition: (1) AUG surrounded by sequence complementary to the trigger RNA, (2) output (with an AUG). The first AUG is crucially out of frame from the output AUG so that the first reading frame functions as an “upstream open reading frame,” repressing translation from the output AUG start codon.
Mechanism: The first AUG is set up to be edited to IUG upon dsRNA formation; IUG (GUG) is no longer a start codon and turns off the upstream reading frame, allowing the downstream frame to be translated.
Ideally the upstream reading frame is long, almost as long as the correct downstream reading frame. There should not be any other AUGs within the sensor sequence.
Two uORFs can be placed in series for an AND gate (both inputs have to be present in order to remove both uORFs).
Basic characterization with nuclearly localized trigger RNAs. The uORF mechanism works best with ADAR2 over-expression (
This design is derived from the “uORF” design, but rather than editing an upstream reading frame, the AUG of the output reading frame is edited directly.
Sensor composition: (1) AUG start codon surrounded by sequence complementary to the trigger RNA, (2) 2A tag, (3) output. All of the components are in frame with each other.
Mechanism: The AUG is set up to be edited to IUG upon dsRNA formation; IUG (GUG) is no longer a start codon, disabling the translation of the output protein.
Here the output is turned off in response to an input (“NOT X” type logic) as opposed to all other designs where the output is turned on in response to an input.
There should not be any in-frame AUG sequences within the sensor that could enable functional production of the output protein. The sensor should not have in-frame stop codons downstream of the AUG.
The AUG RADAR is similar to the uORF RADAR, it's just that there is no second, downstream reading frame; the “upstream” reading frame contains the desired output.
Basic Characterization. The AUG mechanism works with a cytosolic triggering mRNA (i.e., the typical mRNA).
Sensor composition: (1) AUA surrounded by sequence complementary to the trigger RNA, (2) 2A tag, (3) output (without AUG). All of the components are in frame with each other.
Mechanism: The AUA is set up to be edited to AUI upon dsRNA formation; AUI (AUG) can function as a start codon and enable translation of the otherwise not translated output protein.
There should not be any in-frame AUG sequences within the sensor that could enable functional production of the output protein. The sensor should not have in-frame stop codons downstream of AUA.
It helps to have a separate, long, overlapping reading frame in a different frame than the main one; this stabilizes the RNA when it is in the “off” state.
When two AUA sensors are in series, one would achieve OR logic on a single RNA strand (as opposed to two RNAs with the standard RADAR design).
Basic characterization. The AUA mechanism works best with ADAR2 over-expression (
It is possible to combine the different varieties together for different logic functions. Below RADAR is either the standard RADAR or the ModulADAR variety.
Notwithstanding the appended claims, the disclosure set forth herein is also described by the following clauses:
In at least some of the previously described embodiments, one or more elements used in an embodiment can interchangeably be used in another embodiment unless such a replacement is not technically feasible. It will be appreciated by those skilled in the art that various other omissions, additions and modifications may be made to the methods and structures described above without departing from the scope of the claimed subject matter. All such modifications and changes are intended to fall within the scope of the subject matter, as defined by the appended claims.
It will be understood by those within the art that, in general, terms used herein, and especially in the appended claims (e.g., bodies of the appended claims) are generally intended as “open” terms (e.g., the term “including” should be interpreted as “including but not limited to,” the term “having” should be interpreted as “having at least,” the term “includes” should be interpreted as “includes but is not limited to,” etc.). It will be further understood by those within the art that if a specific number of an introduced claim recitation is intended, such an intent will be explicitly recited in the claim, and in the absence of such recitation no such intent is present. For example, as an aid to understanding, the following appended claims may contain usage of the introductory phrases “at least one” and “one or more” to introduce claim recitations. However, the use of such phrases should not be construed to imply that the introduction of a claim recitation by the indefinite articles “a” or “an” limits any particular claim containing such introduced claim recitation to embodiments containing only one such recitation, even when the same claim includes the introductory phrases “one or more” or “at least one” and indefinite articles such as “a” or “an” (e.g., “a” and/or “an” should be interpreted to mean “at least one” or “one or more”); the same holds true for the use of definite articles used to introduce claim recitations. In addition, even if a specific number of an introduced claim recitation is explicitly recited, those skilled in the art will recognize that such recitation should be interpreted to mean at least the recited number (e.g., the bare recitation of “two recitations,” without other modifiers, means at least two recitations, or two or more recitations). Furthermore, in those instances where a convention analogous to “at least one of A, B, and C, etc.” is used, in general such a construction is intended in the sense one having skill in the art would understand the convention (e.g., “a system having at least one of A, B, and C” would include but not be limited to systems that have A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, and C together, etc.). In those instances where a convention analogous to “at least one of A, B, or C, etc.” is used, in general such a construction is intended in the sense one having skill in the art would understand the convention (e.g., “a system having at least one of A, B, or C” would include but not be limited to systems that have A alone, B alone, C alone, A and B together, A and C together, B and C together, and/or A, B, and C together, etc.). It will be further understood by those within the art that virtually any disjunctive word and/or phrase presenting two or more alternative terms, whether in the description, claims, or drawings, should be understood to contemplate the possibilities of including one of the terms, either of the terms, or both terms. For example, the phrase “A or B” will be understood to include the possibilities of “A” or “B” or “A and B.”
In addition, where features or aspects of the disclosure are described in terms of Markush groups, those skilled in the art will recognize that the disclosure is also thereby described in terms of any individual member or subgroup of members of the Markush group.
As will be understood by one skilled in the art, for any and all purposes, such as in terms of providing a written description, all ranges disclosed herein also encompass any and all possible sub-ranges and combinations of sub-ranges thereof. Any listed range can be easily recognized as sufficiently describing and enabling the same range being broken down into at least equal halves, thirds, quarters, fifths, tenths, etc. As a non-limiting example, each range discussed herein can be readily broken down into a lower third, middle third and upper third, etc. As will also be understood by one skilled in the art all language such as “up to,” “at least,” “greater than,” “less than,” and the like include the number recited and refer to ranges which can be subsequently broken down into sub-ranges as discussed above. Finally, as will be understood by one skilled in the art, a range includes each individual member. Thus, for example, a group having 1-3 articles refers to groups having 1, 2, or 3 articles. Similarly, a group having 1-5 articles refers to groups having 1, 2, 3, 4, or 5 articles, and so forth.
Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it is readily apparent to those of ordinary skill in the art in light of the teachings of this invention that certain changes and modifications may be made thereto without departing from the spirit or scope of the appended claims.
Accordingly, the preceding merely illustrates the principles of the invention. It will be appreciated that those skilled in the art will be able to devise various arrangements which, although not explicitly described or shown herein, embody the principles of the invention and are included within its spirit and scope. Furthermore, all examples and conditional language recited herein are principally intended to aid the reader in understanding the principles of the invention and the concepts contributed by the inventors to furthering the art, and are to be construed as being without limitation to such specifically recited examples and conditions. Moreover, all statements herein reciting principles, aspects, and embodiments of the invention as well as specific examples thereof, are intended to encompass both structural and functional equivalents thereof. Additionally, it is intended that such equivalents include both currently known equivalents and equivalents developed in the future, i.e., any elements developed that perform the same function, regardless of structure. Moreover, nothing disclosed herein is intended to be dedicated to the public regardless of whether such disclosure is explicitly recited in the claims.
The scope of the present invention, therefore, is not intended to be limited to the exemplary embodiments shown and described herein. Rather, the scope and spirit of present invention is embodied by the appended claims. In the claims, 35 U.S.C. § 112(f) or 35 U.S.C. § 112(6) is expressly defined as being invoked for a limitation in the claim only when the exact phrase “means for” or the exact phrase “step for” is recited at the beginning of such limitation in the claim; if such exact phrase is not used in a limitation in the claim, then 35 U.S.C. § 112 (f) or 35 U.S.C. § 112(6) is not invoked.
This application is a continuation of PCT Application No. PCT/US2023/063245 filed Feb. 24, 2023, which application claims priority pursuant to 35 U.S.C. § 119 (e) to the filing date of U.S. Provisional Application Ser. No. 63/313,423 filed Feb. 24, 2022, the disclosures of which applications are herein incorporated by reference.
This invention was made with Government support under contract 1656518 awarded by the National Science Foundation and under contract EB027723 awarded by the National Institutes of Health. The Government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
63313423 | Feb 2022 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/US2023/063245 | Feb 2023 | WO |
Child | 18814161 | US |