Nucleic acid based constructs for modulating expression of genes can be improved by increasing sensitivity and reducing leakiness.
The present disclosure recognizes a discovery of nucleic acid constructs related to regulatable gene product expression. In some embodiments, the present disclosure provides compositions and methods for the regulation of gene expression using nucleic acid constructs. In some embodiments, the present disclosure recognizes the utility of alternative splicing in regulation of gene expression in a nucleic acid construct. In some embodiments, the present disclosure recognizes the utility of regulating gene expression utilizing ligand-binding aptamers.
In some embodiments, the present disclosure provides a system for modulating gene expression, comprising a polyA aptamer polynucleotide that comprises in a 5′ to 3′ direction: a 5′ splice donor site; an engineered intron; a first 3′ splice acceptor site; a polyA switch comprising two or more ligand-binding aptamers with one or more ligand binding pockets, and at least one polyA cleavage signal therein; a second 3′ splice acceptor site; and a nucleic acid sequence encoding an expressible polypeptide.
In some embodiments, a polyA aptamer polynucleotide of the present disclosure comprises two ligand-binding aptamers. In some embodiments, a polyA aptamer polynucleotide comprises three ligand-binding aptamers. In some embodiments, a polyA aptamer polynucleotide comprises a polyA switch comprising a three way junction. In some embodiments, a three way junction comprises a junction of one or more RNA double stranded stems. In some embodiments, portions of a three way junction are single stranded. In some embodiments, a RNA double stranded stem comprises a ligand-binding aptamer. In some embodiments, a nucleic acid sequence encoding an expressible polypeptide comprises a 5′UTR.
In some embodiments, the present disclosure provides a method for modulating expression of a gene product in a cell. The method comprises the steps of: introducing into the cell a system comprising in a 5′ to 3′ direction: a 5′ splice donor site; an engineered intron; a first 3′ splice acceptor site; a polyA switch comprising two or more ligand-binding aptamers with one or more ligand binding pockets, and at least one polyA cleavage signal therein; a second 3′ splice acceptor site. In some embodiments a gene product expressed by the methods described herein is exogenous to the cell. In some embodiments, a gene product expressed by the methods described herein is endogenous to the cell. In some embodiments, a method provided by the present disclosure occurs in one or more cells of an individual, the ligand is glucose, the individual has diabetes, pre-diabetes, or complications from diabetes, and/or the expressible polynucleotide is insulin. In some embodiments, a method provided by the present disclosure occurs in one or more cells of an individual, the expressible polynucleotide is a therapeutic gene product such as human growth hormone, coagulation factor X, or dystrophin. In some embodiments, a method provided by the present disclosure occurs in one or more cells of an individual, the ligand is the gene product of a cancer biomarker, and the expressible polynucleotide is a suicide gene. In some embodiments, a method provided by the present disclosure occurs in an individual, the expressible polynucleotide is a reporter gene, and the location and/or intensity of the expression of the reporter gene provides information about spatial distribution, temporal fluctuation, or both, of a ligand in one or more cells of the individual. In some embodiments, a method provided by the present disclosure occurs in an individual, tissue, or cell, wherein the expressible polynucleotide encodes a detectable gene product, and wherein the respective individual, tissue, or cell is imaged.
In some embodiments, the present disclosure provides compositions and methods for regulatable gene product expression. In some embodiments, compositions and methods for regulatable gene product expression comprise a polyA aptamer polynucleotide. In some embodiments, a polyA aptamer polynucleotide comprises, amongst other things, one or more splice donor sites, one or more splice acceptor sites, an engineered intron; a polyA switch; and a nucleic acid sequence encoding an expressible polypeptide. In some embodiments, a polyA switch comprises at least one ligand-binding aptamer. In some embodiments, a polyA switch comprises at least one polyA cleavage signal. In some embodiments, a polyA aptamer polynucleotide comprises RNA double strand stems.
Aptamers are short RNA sequences that fold like receptors and bind to specific ligands. Efficient in vitro evolution methods for generating aptamers with high affinity to specific ligands are well established. The binding affinity of aptamers can often reach nanomolar range, comparable to that of antibodies. In this regard, aptamers can be viewed as antibodies made of RNA. What distinguishes an aptamer from an antibody are its small size (often smaller than 50 bases) and its modular nature. These features enable aptamers to integrate with and control other RNA structures without losing its binding function. It has been demonstrated that aptamers can transform the self-cleaving RNA ribozymes to operate in a ligand-dependent manner, and function like a molecular switch in test tubes and in cells.
In some embodiments, a polyA aptamer polynucleotide comprises one or more RNA double stranded stems. In some embodiments, a RNA double stranded stem is a nucleic acid structure formed by intramolecular base pairing of complementary nucleic acids contained within a single polyA aptamer polynucleotide. In some embodiments, a RNA double stranded stem may also be referred to as an arm. In some embodiments, a polyA aptamer polynucleotide comprises one or more RNA double strand stems. In some embodiments, a polyA aptamer polynucleotide comprises two RNA double strand stems. In some embodiments, a polyA aptamer polynucleotide comprises three RNA double strand stems. In some embodiments, a RNA double stranded stem comprises ligand binding aptamer. In some embodiments, a polyA aptamer polynucleotide comprises two ligand binding aptamers. In some embodiments, a polyA aptamer polynucleotide comprises three ligand binding aptamers.
In some embodiments, at least two RNA double stranded stems are joined to form a junction. In some embodiments, a junction of RNA double stranded stems comprises a single stranded region. In some embodiments, three RNA stems meet to form a three way junction. In some embodiments, a three way junction comprises at least one single stranded region. In some embodiments, a three way junction comprises one, two, or three single stranded regions.
In some embodiments the sequence of a double stranded RNA stem is selected from one of the following:
In some embodiments, a single stranded region formed by a junction of RNA double stranded stems comprises at least one nucleic acid. In some embodiments, a single stranded region formed by a junction of RNA double stranded stems comprises one, two, three, four, five, six, seven, eight, nine, ten, eleven, twelve, or more nucleic acids. In some embodiments, a three way junction comprises a first, second, and third single stranded regions. In some embodiments, a first single stranded region comprises at least one base selected from C and A. In some embodiments, a second single stranded region comprises at least one base selected from C and A.
In some embodiments, a RNA double stranded stem is 30, 20, 10, or 5 base pairs in length. In some embodiments, a RNA double stranded stem is 5 to 30, 10 to 30, 20 to 30, 5 to 10, 5 to 20, 5 to 30, or 10 to 20 base pairs in length. In some embodiments, a RNA double stranded stem is up to 30 base pairs in length. In some embodiments, a RNA double stranded stem is less than 30, 20, or 10 base pairs in length.
In some embodiments, a polyA aptamer polynucleotide comprises one or more aptamers. In some embodiments, a polyA aptamer polynucleotide comprises two aptamers. In some embodiments, a polyA aptamer polynucleotide comprises three aptamers.
In some embodiments, an aptamer included in a polyA aptamer polynucleotide described herein comprises at least one single stranded region and at least one aptamer RNA double stranded stem. In some embodiments, an aptamer RNA double stranded stem comprises a single stranded region. In some embodiments, an aptamer RNA has an RNA double stranded stem with a sequence of AATAAGATTACCGAAAGGCAATCTTATT (e.g., arm2-2). In some embodiments, an aptamer RNA has an RNA double stranded stem with a sequence of CCAGATCGAATTCGATCTGG (e.g., are 3-2). In some embodiments, an aptamer RNA has an RNA double stranded stem with a length ranging from 6-10; 7-11; 8-12; 9-13; 10-14 base pairs in length.
In accordance with various embodiments, any of a variety of polyA signals (e.g., encoded by a polyA signal sequence) may be used. By way of non-limiting example, a polyA signal sequences used in mammalian cells include: AAUAAA, AUUAAA, AGUAAA, ACUAAA, UAUAAA, CAUAAA, GAUAAA, AAUAUA, AAUACA, and AAUAGA. In some embodiments, a polyA switch may include two or more polyA signal sequences (e.g., 3, 4, 5, 6, 7, 8, 9, 10 or more).
Polyadenylation is a foundational mRNA processing mechanism that is present in all mammalian cells. Typically, mammalian polyA signals are found in the 3′ untranslated region (UTR). In contrast, the present disclosure provides compositions and methods that comprise a polyA cleavage signal present in an expression construct at a location other than at the 3′ untranslated region (UTR) of an expressible polynucleotide, such as a gene. When a polyA signal is artificially created in the 5′ UTR, where it is not normally found in cells, efficient cleavage of the polyA signal leads to the addition of polyA tail at the site. This results in the removal and degradation of the second half of the associated mRNA with transgene sequence, and therefore a loss of gene expression. In some embodiments, the polyA signal is present upstream of the translation start site of a nucleic acid sequence encoding an expressible polynucleotide (mRNA) encoding an expressed polypeptide. In some embodiments, the polyA signal is located in the 5′ UTR of the mRNA. In some embodiments, a single stranded region of a 3-way junction comprises all or a portion of the polyA cleavage signal. In some embodiments, the third single stranded region of a 3-way junction comprise all or a portion of the polyA cleavage signal. In some embodiments, a RNA double stranded stem comprises all or a portion of the polyA cleavage signal. In some embodiments, the third RNA double stranded stem comprises all or a portion of the polyA cleavage signal. In some embodiments, a portion of the polyA cleavage signal, as used herein, includes one, two, three, or four nucleotides. In some embodiments, a polyA cleavage signal has a sequence of AAUAAA. In some embodiments, a polyA cleavage signal has a sequence of AUUAAA, AGUAAA, ACUAAA, UAUAAA, CAUAAA, GAUAAA, AAUAUA, AAUACA, AAUAGA, AAAAAG, or ACUAAA. In embodiments wherein two or more polyA signals are utilized in the construct, the polyA signals may be the same or may be different. In particular embodiments, the expressible polynucleotide is able to be transcribed by RNA polymerase II.
In some embodiments, the presence of the polyA cleavage signal in the 5′ UTR targets the second half of mRNA after the polyA signal for degradation, and this ability is exploited in the various compositions and methods of the present disclosure. In some embodiments, the presence of the polyA cleavage signal in the 5′ UTR results in cleavage of a pre-mRNA/mRNA encoded by a polyA aptamer polynucleotide. In some embodiments, cleavage of a pre-mRNA/mRNA encoded by a polyA aptamer polynucleotide results in degradation of the second half of pre-mRNA/mRNA. In some embodiments, cleavage of a pre-mRNA/mRNA encoded by a polyA aptamer polynucleotide results in no expression of a polypeptide.
In particular embodiments, the polyA cleavage signal is within a polyA aptamer polynucleotide comprising at least one ligand-binding aptamer to which one or more ligands can bind. In some embodiments, binding of the ligand to the ligand-binding aptamer determines whether or not the polyA cleavage signal is present in the pre-mRNA/mRNA after alternative splicing. In some embodiments, binding of the ligand to the ligand-binding aptamer determines whether or not the pre-mRNA/mRNA is cleaved after alternative splicing. In some embodiments, binding of the ligand to the ligand-binding aptamer determines whether or not an expressible polypeptide is expressed after alternative splicing.
In some embodiments, a polyA aptamer polynucleotide comprises an engineered intron. In some embodiments, an engineered intron comprises one or more splice sites. In some embodiments, a splice site is or comprises a splice donor site (e.g, comprising a GU sequence). In some embodiments a splice site is or comprises a splice acceptor site (e.g., comprising an AG sequence). In some embodiments, splice sites in an engineered intron function (e.g., in conjunction with each other and/or in conjunction with one or more endogenous splice site(s)) to excise an engineered intron from a polyA aptamer polynucleotide.
In some embodiments, an engineered intron is preceded by a 5′ splice donor site. In some embodiments, a polyA aptamer polynucleotide comprises a 5′ splice donor site in the region 5′ of an engineered intron. In some embodiments, a polyA aptamer polynucleotide comprises a first 3′ splice acceptor site 3′ of an engineered intron. In some embodiments, an engineered intron of a polyA aptamer polynucleotide described herein comprises a 5′ splice donor site and a first 3′ splice acceptor site. In some embodiments, a polyA aptamer polynucleotide comprises a nucleic acid sequence encoding an expressible polypeptide. In some embodiments, a polyA aptamer polynucleotide comprises a second 3′splice acceptor site immediately 5′ of a nucleic acid sequence encoding an expressible polypeptide.
In some embodiments, a polyA aptamer polynucleotide comprises a promoter 5′ of the splice donor site. Exemplary promoters include, e.g., CMV, E1F, VAV, TCRvbeta, MCSV, an SV40 promoter, an RSV promoter, and PGK promoter.
In some embodiments, in the absence of a ligand bound to a ligand-binding aptamer, splicing of the pre-mRNA encoded by a polyA aptamer polynucleotide described herein occurs between the 5′ splice donor site and the first 3′ splice acceptor site. In some embodiments, splicing between the 5′ splice donor site and the first 3′ splice acceptor site of a pre-mRNA encoded by a polyA aptamer polynucleotide described herein results in an mRNA comprising a polyA cleavage signal preceding a 5′UTR of a nucleic acid sequence encoding an expressible polypeptide. In some embodiments, presence of a polyA cleavage signal preceding a 5′UTR of a nucleic acid sequence encoding an expressible polypeptide results in cleavage at the polyA cleavage site and degradation of the sequence encoding an expressible polypeptide.
In some embodiments, in the presence of a ligand bound to a ligand-binding aptamer, splicing of the pre-mRNA encoded by a polyA aptamer polynucleotide described herein occurs between the 5′ splice donor site and the second 3′ splice acceptor site. In some embodiments, splicing of the pre-mRNA encoded by a polyA aptamer polynucleotide described herein between the 5′ splice donor site and the second 3′ splice acceptor site results in an mRNA comprising a nucleic acid sequence encoding an expressible polypeptide. In some embodiments, splicing of the pre-mRNA encoded by a polyA aptamer polynucleotide described between the 5′ splice donor site and the second 3′ splice acceptor site results in removal of polyA cleavage signal by splicing it out. In some embodiments, splicing between the 5′ splice donor site and the second 3′ splice acceptor site of the pre-mRNA encoded by a polyA aptamer polynucleotide described herein results in the expression of an expressible polypeptide.
In some embodiments, a polyA aptamer polynucleotide comprises two or more ligand-binding aptamers. In some embodiments, each of two or more ligand binding aptamers binds a different ligand. In some embodiments, a polyA aptamer polynucleotide comprises two or more separate polyA switches. In some embodiments, a first polyA switch comprises a first aptamer that binds a first ligand, and a second polyA switch comprises a second aptamer that binds a second ligand. In some embodiments the first and second aptamers are non-identical and the first and second ligands are non-identical. In some embodiments, the first and second aptamers are non-identical and the first and second ligands are identical.
In some embodiments, an engineered intron is any sequence. In some embodiments, an engineered intron is approximately 100, 200, 300, 400, or 500 nucleotides in length. In some embodiments, an engineered intron is in the range of 100-200; 110-200; 120-200; 130-200; 140-200; 150-200; 160-200; 170-200; or 180-200 bases in length. In some embodiments, an engineered intron is at most 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220 bases in length. In some embodiments, an engineered intron has the following sequence:
In some embodiments, an engineered intron has the following sequence:
As used herein, an intron can refer to either a DNA sequence or its corresponding RNA sequence.
In some embodiments a polyA aptamer polynucleotide comprises additional sequences to facilitate, regulate or assist polyA signal cleavage within a polyA aptamer polynucleotide. In some embodiments, a polyA aptamer polynucleotide comprises a G-U rich region 5′ of the nucleic acid sequence encoding the expressible polypeptide and 3′ of the polyA cleavage signal. In some embodiments a polyA aptamer polynucleotide comprises additional sequences to facilitate, regulate or assist splicing within a polyA aptamer polynucleotide. In some embodiments, a polyA aptamer polynucleotide comprises a nucleic acid triplet sequence capable of modulating the strength of alternative splicing. In some embodiments, a nucleic acid triplet sequence is 3′ relative to the second 3′acceptor site in the 5′UTR. In some embodiments, a nucleic acid triplet sequence is 3′ of an engineered intron. In some embodiments, a sequence of a nucleic acid triplet sequence comprises any three nucleotides. In some embodiments, a sequence of a nucleic acid triplet sequence comprises TAG, TCT, TTC, TTG, TGA, TGC, TCC, ACA, AAC, ACC, AGC, AGG, CCT, CCC, TTT, TGA, TCT, TAC, CAC, or CAT.
In some embodiments, a polyA aptamer polynucleotide comprises a G-U rich region 5′ of the nucleic acid sequence encoding the expressible polypeptide and 3′ of the polyA cleavage signal. In some embodiments, a polyA aptamer polynucleotide comprises a G rich region 5′ of the nucleic acid sequence encoding the expressible polypeptide and 3′ of the G-U rich region. In some embodiments, a G rich region is understood in the art to be a MAZ sequence. In some embodiments, a polyA aptamer polynucleotide comprises one or more G rich regions. In some embodiments, a polyA aptamer polynucleotide comprises one or more consecutive G rich regions. In some embodiments, a polyA aptamer polynucleotide comprises one or more MAZ sequences. In some embodiments, a polyA aptamer polynucleotide comprises one or more consecutive MAZ sequences. In some embodiments, a polyA aptamer polynucleotide comprises one, two, three, four, five, six MAZ sequences. The consecutive MAZ may be separated by one or more spacer sequences. In some embodiments the sequence of a G rich region is
In some embodiments, a polyA aptamer polynucleotide comprises one or more start codons. In some embodiments, a polyA aptamer polynucleotide comprises one or more out of frame start codons. In some embodiments, an out of frame start codon is out of frame relative to the coding sequence of a nucleic acid sequence encoding an expressible polypeptide. In some embodiments, a polyA aptamer polynucleotide comprises at least one out of frame start codon. In some embodiments, a polyA aptamer polynucleotide comprises at least one out of frame start codon 3′ of a first 3′ splice acceptor site 3′ of an engineered intron.
In some embodiments, a polyA aptamer polynucleotide comprises a nucleic acid sequence encoding an expressible polypeptide. In some embodiments, a nucleic acid sequence encoding an expressible polypeptide comprises a 5′UTR. In some embodiments, a 5′ UTR of a nucleic acid sequence encoding an expressible polypeptide comprises a 3′splice acceptor site. In some embodiments, a 5′ UTR of a nucleic acid sequence encoding an expressible polypeptide comprises a branch point and a 3′splice acceptor site. A branch point is understood in the art to comprise a nucleotide or nucleotides involved in initiating a nucleophilic attack on the 5′ donor splice site. In some embodiments, a 5′ UTR of a nucleic acid sequence encoding an expressible polypeptide does not comprise a branch point. In some embodiments, a 5′ UTR of a nucleic acid sequence encoding an expressible polypeptide comprises a spacer sequence. In some embodiments, a spacer sequence comprises at least one CAA repeat. In some embodiments a 5′UTR of a nucleic acid sequence encoding an expressible polypeptide has a sequence of
In some embodiments, a nucleic acid sequence encoding an expressible polypeptide contemplated in the present disclosure can be any nucleic acid sequence or any gene encoding any polypeptide. In some embodiments, a nucleic acid sequence encoding a non-coding RNA. In some embodiments, a nucleic acid sequence encoding an expressible polypeptide contemplated in the present disclosure can be an exogenous nucleic acid. In some embodiments, a nucleic acid sequence encoding an expressible polypeptide contemplated in the present disclosure can be a gene endogenous to a subject to which a polyA aptamer polynucleotide has been introduced. In some embodiments, a polyA aptamer polynucleotide of the present disclosure is introduced into a region of an individual's genome that regulates expression of a gene of interest. Accordingly, in some embodiments, a polyA aptamer polynucleotide of the present disclosure can be used to regulate expression of genes endogenous to an individual. In some embodiments, a nucleic acid sequence encoding an expressible polypeptide of a polyA aptamer polynucleotide of the present disclosure is an endogenous nucleic acid sequence.
In some embodiments, an expressible polypeptide is insulin. In some embodiments, an expressible polypeptide is human growth hormone. In some embodiments, an expressible polypeptide is coagulation factor X. In some embodiments, an expressible polypeptide is dystrophin. In some embodiments, an expressible polypeptide is a suicide protein. In some embodiments, a suicide protein is a protein that induces cell death. Exemplary suicide proteins include Mixed Lineage Kinase Domain Like Pseudokinase (MLKL), Receptor-interacting serine/threonine-protein kinase 3 (RIPK3), Receptor-interacting serine/threonine-protein kinase 1 (RIPK1), Fas-associated protein with death domain (FADD), or gasdermin D (GSDMD), cysteine-aspartic proteases, cysteine aspartases or cysteine-dependent aspartate-directed proteases (CASPASE-1 or CASP-1), CASPASE-4, CASPASE-5, CASPASE-12, PYCARD/ASC (PYD and CARD domain containing/Fas-associated protein with death domain) or variants thereof.
In some embodiments, an expressible polypeptide is a detectable gene product. In some embodiments a detectable gene product is a reporter. In some embodiments a reporter is a protein capable of providing a detectable signal and/or comprise the ability to generate a detectable signal (e.g. by catalyzing reaction converting a compound to a detectable product). Detectable signals can comprise, for example, fluorescence or luminescence. Detectable signals, methods of detecting them, and methods of incorporating them into reagents (e.g. polypeptides comprising a reporter protein) are well known in the art. In some embodiments of any of the aspects, detectable signals can include signals that can be detected by spectroscopic, photochemical, biochemical, immunochemical, electromagnetic, radiochemical, or chemical means, such as fluorescence, chemifluoresence, or chemiluminescence, or any other appropriate means. In some embodiments of any of the aspects, the reporter protein is selected from the group consisting of luciferase, nanoluciferase, beta-lactamase, beta-galactosidase, horseradish peroxidase, alkaline phosphatase, catalase, carbonic anhydrase, green fluorescent protein, red fluorescent protein, cyan fluorescent protein, yellow fluorescent protein, trypsin, a protease, a peptide that complements and activates a truncated reporter protein, a kinase.
In some embodiments, activity or function of a polyA aptamer polynucleotide of the present disclosure is measured by expression of an expressible polypeptide. In some embodiments, activity or function of a polyA aptamer polynucleotide of the present disclosure is measured by fold induction. In some embodiments, fold induction is calculated as the ratio of expressible polypeptide in the presence of a ligand and expressible polypeptide in the absence of a ligand. In some embodiments, fold induction is calculated as the ratio of expressible polypeptide in the presence of an aptamer and expressible polypeptide in the presence of a different aptamer. In some embodiments, fold induction is calculated as the ratio of expressible polypeptide in the presence of an aptamer comprising at least one splice acceptor site and one splice donor site and expressible polypeptide in the presence of a different aptamer with no splice sites. In some embodiments, fold induction is calculated as the ratio of expression of an endogenous gene before introduction of a polyA aptamer polynucleotide and expression of an endogenous gene after introduction of a polyA aptamer polynucleotide regulating expression of the same endogenous gene.
In accordance with various embodiments, a ligand may be selected so as to facilitate a desired end purpose of a provided system. Accordingly, a ligand may be or comprise a polypeptide, nucleic acid, small molecule, drug, metabolite, or combination thereof. In some embodiments, a ligand may be or comprise a cellular metabolite, aberrant cellular protein, or a protein expressed by a pathogenic organisms (e.g., a virus, bacteria, or fungus). For example, in some embodiments, a ligand may be an exogenously administered small molecule so that dosing and function of the system can be modulated easily as desired in a particular therapeutic context. For example, in some embodiments, a ligand is tetracycline or its derivatives. In some embodiments, a ligand may be selected such that expression of an expressible polypeptide occurs in response to a particular biological condition (e.g., infection, tumorigenesis, high or low glucose), for example, as a biosensor system that can detect one or more intracellular “signatures” in a cell, tissue, or subject. Accordingly, in some embodiments, a ligand is endogenous to a subject (e.g., an endogenous protein) In some embodiments, a ligand is neomycin or its derivatives. In some embodiments, a ligand is theophylline or its derivatives. In some embodiments, a ligand is glucose. In some embodiments a ligand is a cancer biomarker.
In some embodiments, a polyA aptamer polynucleotide of the present disclosure can be introduced by a vector. In some embodiments, a vector can be a viral vector. Suitable viral vectors include, without limitation, lentiviral vectors, retroviral vectors, alphaviral, picornal (e.g., polio) vaccinial, adenoviral, adeno-associated viral, herpes viral, and fowl pox viral vectors.
In accordance with the present disclosure, polyA aptamer polynucleotides and/or systems including one or more polyA aptamer polynucleotides, may be used in any of a variety of applications. For example, in some embodiments, a polyA aptamer polynucleotide of the present disclosure is used for treatment of an individual suffering from a disease, for example, by providing controllable expression of a therapeutic protein encoded by an expressible polynucleotide. In some embodiments, a disease is the lack of certain protein(s) caused by a genetic disorder. In some embodiments, a disease is diabetes, pre-diabetes, or complications from diabetes. In some embodiments, a disease is cancer. In some embodiments, a disease is muscular dystrophy. In some embodiments, a disease is hereditary Factor X deficiency. In some embodiments, a polyA aptamer polynucleotide of the present disclosure is provided in combination with other treatments for a disease. In some embodiments, a polyA aptamer polynucleotide of the present disclosure is used for inducing reprogramming of cells into pluripotent stem cells (induced pluripotent stem cells or iPSCs). In some embodiments, a polyA aptamer polynucleotide of the present disclosure is introduced or administered prior to, during, or subsequent to other treatments for a disease. In some embodiments, a therapeutic protein maybe or comprise insulin, growth hormones, dystrophin, albumin, factor IX, Oct4, Sox2, Klf4, cMyc, and any combination thereof.
In some embodiments, a system comprising a polyA aptamer polynucleotide may be used to provide information regarding whether or not a therapy is effective in a particular subject. In some embodiments wherein it is desirable to determine whether one or more therapies are effective in a subject, a system may be employed in the subject before the therapy is provided, such as to detect the presence or absence of a specific indicative compound for the therapy, and then after the therapy is provided one or more times the system may be employed in the subject to detect the presence or absence of the specific indicative compound. In other embodiments, the system is not employed for monitoring therapy until after the therapy is provided one or more times to the subject, such as to identify the presence or absence of a specific compound that is indicative of the efficacy of the therapy.
In some embodiments, polyA aptamer polynucleotides and/or systems including one or more polyA aptamer polynucleotides may be used as a biosensor. In accordance with various embodiments, provided systems may provide spatial and/or temporal information regarding a particular environment (e.g., an intracellular, extracellular, and/or environmental environment). For example, in some embodiments, a system comprising at least one polyA aptamer polynucleotide may be used to detect one or more specific molecular signatures in a subject and to allow for production of a desired expressible polypeptide in order to achieve a desired biological state in response to the presence of the molecular signature(s). In some embodiments, a molecular signature may be or comprise: the presence of a particular endogenous gene product (e.g., a disease-associated gene product/protein), the presence of a toxin, the presence of an exogenous gene product, the presence of a metabolite (e.g., a metabolite from an environmental contaminant), and any combination thereof.
In some embodiments, a polyA aptamer polynucleotide may comprise one or more reporter moieties (e.g., a reporter gene product, for example, an imaging reporter). In some embodiments, an expressible polynucleotide comprised in a polyA aptamer polynucleotide encodes a reporter gene product (e.g., protein). In some embodiments, a reporter gene product may be or comprise luciferase, green fluorescent protein, red fluorescent protein, β-galactosidase, infrared fluorescent proteins, near-infrared fluorescent proteins, opsin, and any combination thereof.
In some embodiments, a system comprising a polyA aptamer polynucleotide may encode both a reporter gene product and a therapeutic gene product. In some such embodiments, expression of the reporter gene product and the therapeutic gene product may be controlled by the same aptamer. In some embodiments, expression of the reporter gene product and the therapeutic gene product may be controlled by different aptamers.
The present examples describe a highly responsive gene regulation mechanism that harnesses the power of drug-inducible alternative splicing to control polyA cleavage.
Location
Constructs were made to test additional Y-shape structures that are configured differently and with the polyA cleavage signal positioned differently. Four different constructs were made: B1-B4 where the polyA signal (in red) is placed near aptamer C and clamped by the 3-way junction (
Number of PolyA Cleavage Signals
Tests were performed to evaluate the optimal number of polyA signal(s) in Y-shape structure.
Modifying the environment of a 3-way junction directly affects the clamping of polyA signal. Therefore, the performance of Y-shape switch is very sensitive to any change in the 3-way junction. Extensive mutation/insertion/deletion studies around the 3-way junction were performed to identify the best sequences.
Y174, Y175, Y176, and Y177 (See
Changing the polyA signal position relative to the pocket of the 3-way junction can alter induction capability (
PolyA aptamer polynucleotide constructs as described herein comprise nucleic acid (e.g., RNA) double strand stems. Such double stranded regions are also referred to in the present disclosure as arms. Modifications of the length, stability, and nucleotide composition can affect the strength and effectiveness of the polyA aptamer polynucleotide.
Earlier results (using constructs Y1 to Y9,
The majority of these modifications significantly reduce induction, and none surpasses Y35. Therefore, the arm3 of Y35 represents the optimal arm3 sequence for the Y shape structure of those tested. Some other parental constructs used for arm3 modification, such as Y175, Y197, and Y210, all share the same arm3 sequence of Y35.
Modifications to the double strand stems that are arm 2 (i.e., arm2-1 and arm2-2) alter the stability of arm 2. The modifications include variations in length, sequences, as well as point mutations that create mismatches in the stem (
Orientation of each of the aptamers relative to the other aptamers may have an effect of the function of polyA aptamer polynucleotide.
We tested the effects of G-quad sequence on induction.
To further optimize the mechanism of Tc-induced alternative splicing, we have extensively probed the effects of IVS2 3′ splice site location and surrounding sequence/structure. The modifications include: embed IVS23′ splice site into the arm1; move IVS2 3′ splice site closer or further away from the aptamers binding site; put IVS2 3′ splice site in a loose bulge in the arm1; change the length or stability of the arm1 that hosts IVS2 3′ splice site; change splicing strength of IVS2 3′ splice site.
Assays were performed to test the effect of modulating the strength second 3′ splice acceptor site in the 5′UTR. The 5′UTR sequence of Y196CAA-4MAZ located after 4MAZ and before the start codon ATG has the following sequence: gcggccgccaacaacaacaacaacaacaacaacaacaacaacaacaacataacagtgttcactagcaacctcaaacagacaccA TG. Adding an additional branch point (S10), ppt (S11), or mutating CAG to CCC (S12) or AAG (S13) all lead to reduced induction (
We tested the effect of shortening the overall size of the hybrid switch by reducing the size of IVS2 intron.
An upstream out-of-frame AUG was introduce to construct S192 to test the effect on reporter gene translation from IVS2-spliced transcript. The modifications include: (1) changing TAC to ATG immediately after IVS2 3′ splice site to create a new start codon (red box), (2) changing the corresponding base on the other side of arm1 to maintain the base paring in the stem, and (3) mutating an in-frame stop codon tga into aga in arm2-1 (red arrow), so the translation from this new ATG can produce fairly long protein. See
The sequence after IVS2 3′ splice site CAG is shown. The new μORF is underlined:
aagattaccgaaaggcaatcttattaaaacataccagatcttgagagggt
gtttgtggcaaaacataccagatcgaattcgatctggggaggtgaagaat
acgaccacctgctacaagtacctaataaaCATtagCGGaGaaacatacca
ctgtgtgttggttttttgtgtgttaacgggggagggggaggaaaggggga
gggggaggaaagggggagggggaggaaagggggagggggagcggccgcca
taa
cagtgttcactagcaaccTcaaacagacacc
ATG. This approach significantly lowers the leakage expression from IVS2-spliced transcript, therefore significantly increases the induction as demonstrated by the result of S206.
This construct is further optimized by fine-tuning the 5′UTR sequence based on 5206 (
In summary, in the process of optimizing Tc effects on splicing choice between IVS2 3′ splice site and the alternative 3′ splice site, we found that the best location for placing IVS2 3′ splice site is to embed it inside the arm1 of Y structure. In order to place IVS2 3′ splice site in that location, the aptamer A is deleted from the Y structure. Creating an upstream out-of-frame AUG (μORF) which eliminates reporter gene translation from IVS2-spliced transcript decreases leakage expression. Compared to Y196CAA-4MAZ, 5222 (
To avoid potential immunogenicity generated by the protein translation of upstream open reading frames (μORF), we built another hybrid switch without the μORF aimed at surpassing the performance of S222. To build this new hybrid switch, we returned to the Y196CAA-4MAZ design as it has 3 aptamers as compared to 2 aptamers in S222. To further improve Y196CAA-4MAZ, we (1) use the mini-IVS2 intron with 120 bases, (2) optimizing the 3′ splice site of mini-IVS2 sequence, (3) optimizing the 5′UTR sequence containing the downstream alternative 3′ splice site. These efforts led to a group of constructs surpassing S222 in performance. The induction by tetracycline is so efficient that they induce gene expression to 50% of the maximal level (EC50) at a drug concentration as low as 0.5 to 1 μg/ml. This concentration of tetracycline can be routinely achieved in human serum using FDA-approved dosage, and is an order of magnitude lower than what has been previously achieved using any RNA-based gene regulation technology.
The Y-shape polyA switch, when combined with CRISPR, creates a powerful technology platform to control the expression of any endogenous gene in mammalian genome.
The data described represent a highly responsive gene regulation mechanism that harnesses the power of drug-inducible alternative splicing to control polyA cleavage. The combination engineered creates a sensitive RNA-based switch that can be controlled by small molecule drugs and enables tight regulation of gene expression in mammalian cells. In contrast to other reported methods, this hybrid switch technology described herein exhibits very low leaky expression, and effectively turns on the transgene expression close to 700-folds in human cells. Furthermore, the induction by tetracycline is so efficient that it induces gene expression to 50% of the maximal level (EC50) at a drug concentration as low as 0.5 to 1 μg/ml. This concentration of tetracycline can be routinely achieved in human serum using FDA-approved dosage, and is an order of magnitude lower than what has been previously achieved using other RNA-based gene regulation technology.
This hybrid switch technology therefore is advantageously safe to use in human patients for controlling the expression of a therapeutic gene or transgene. The present disclosure thus satisfies a long-felt need in the art to provide a highly efficient and non-immunogenic technology to regulate genes of interest in cells at a drug concentration that is safe for human consumption.
A combination of three base changes to the sequence of the Y-shape structure was tested to determine the cumulative effects on induction performance of the poly A aptamer. The three mutations, as noted in
Assays described in the figures filed herewith were performed as follows:
Cells were seeded in 96-well plates at a density of 25000-30000 cells/well. After 24 hours of incubation, each well was transfected with 50 ng of DNA vectors and were incubated with culture medium containing none or various concentration of tetracycline for an additional 18 hours. Luciferase activity was measured in relative light units (RLU) with a Polarstar Omega plate reader (BMG Labtech, USA). To make 36 mL of assay buffer, 144 μL 1M DTT, 108 μL M ATP, 252 μL 0.1M luciferin and 360 μl 0.05M CoA were added to 35 mL of basic buffer (25 mM Tricine, 0.5 mM EDTA-Na2, 0.54 mM Na-triphosphate, 16.3 mM MgSO4.7H2O, and 0.8% Triton X-100). After the cell medium was removed, 40 μL of assay buffer was added to each well, and luciferase activity was read twice with the Polarstar Omega plate reader. Induction in fold is calculated as the ratio of transgene expression in the presence vs absence of tetracycline.
Cells transfected with the respective constructs were grown 18 hours at 37° C. in medium in the absence or presence of tetracycline. Total RNA was isolated according to the protocol supplied with RiboPure™ RNA Purification Kit (Ambion, Austin, Tex.). For RT-PCR, RT was performed using SuperScript III (invitrogen, Carlsbad, Calif.) according to manufacturer's protocol and PCR was performed using the primers targeting the beginning of the transcript and reporter gene.
Cells were seeded in 12-well plates at a density of 1.2×105 cells/well. After 24 hours of incubation, each well was transfected with 500 ng of DNA vectors and were incubated with culture medium containing none or various concentration of tetracycline for an additional 18 hours. Images were taken on a fluorescence microscope (Zeiss Axiovert 40CFL) at a magnification of 200×.
The following sequences are additional examples of embodiments of components of the system described herein. The sequences are provided as DNA sequences that when transcribed components of form RNA aptamers:
ATGGGACCCTTGATGTTTTCTTTCCCCTTCTTTTCTATGGTTAAGTTCATGTCATA
GGAAGGGGAGAAGTAACAGGGTACACATATTGACCAAATCAGGGTAATTTTGC
ATTTGTAATTTTAAAAAATGCTTTCTTCTTTTAATATACTTTTTTGTTTATCTTATT
TCTAATACTTTCCCTAATCTCTTTCTTTCAGGGCAATAATGATACAATGTATCAT
GCCTCTTTGCACCATTCTAAAGAATAACAGTGATAATTTCTGGGTTAAGGCAAT
AGCAATATTTCTGCATATAAATATTTCTGCATATAAATTGTAACTGATGTAAGAG
GTTTCATATTGCTAATAGCAGCTACAATCCAGCTACCATTCTGCTTTTATTTTAT
GGTTGGGATAAGGCTGGATTATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCAT
GTTCATACCTCTTATCTTCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTG
AGGTGAAGAATACGACCACCTGCGTTTTATACTTCCACGAGATCTGGGGAG
GTGAAGAATACGACCACCTAATAAGATTACCGAAAGGCAATCTTATTAAAA
CATACCAGATCTTGTGAGGGTGTTTGTGGCAAAACATACCAGATCGAATTC
GATCTGGGGAGGTGAAGAATACGACCACCTGCTACAAGTACCTAATAAAGT
ATAAAGTGCAAAACATACCAGATCTGTGTGTTGGTTTTTTGTGTGTTAACG
GGGGAGGGGGAGGAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGGA
AAGGGGGAGGGGGA
GCGGCCGCCAACAACAACAACAACAACAACAACAACAACAA
CAACAACATAACAGTGTTCACTAGCAACCTCAAACAGACACC
ATG
TTGATGTTTTCTTTCCCCTTCTTTTCTATGGTTAAGTTCATGTCATAGGAAGGGGA
GAAGTAACAGGGTACACATATTGACCAAATCAGGGTAATTTTGCATTTGTAATT
TTAAAAAATGCTTTCTTCTTTTAATATACTTTTTTGTTTATCTTATTTCTAATACTT
TCCCTAATCTCTTTCTTTCAGGGCAATAATGATACAATGTATCATGCCTCTTTGC
ACCATTCTAAAGAATAACAGTGATAATTTCTGGGTTAAGGCAATAGCAATATTT
CTGCATATAAATATTTCTGCATATAAATTGTAACTGATGTAAGAGGTTTCATATT
GCTAATAGCAGCTACAATCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATA
AGGCTGGATTATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCATGTTCATACCT
CTTATCTTCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTGCTGGCCCATC
ACGACCACCTAATAAGATTACCGAAAGGCAATCTTATTAAAACATACCAGA
AGGTGAAGAATACGACCACCTGCTACAAGTACCTAATAAAGTATAAAGTGC
AAAACATACCAGATCTGTGTGTTGGTTTTTTGTGTGTTAACGGGGGAGGGG
GAGGAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGGAAAGGGGGAG
GGGGA
GCGGCCGCCAACAACAACAACAACAACAACAACAACAACAACAACAACATAA
CAGTGTTCACTAGCAACCTCAACAGACACC
ATG
ATGGGACCCTTGATGTTTTCTTTCCCCTTCTTTTCTATGGTTAAGTTCATGTCATA
GGAAGGGGAGAAGTAACAGGGTACACATATTGACCAAATCAGGGTAATTTTGC
ATTTGTAATTTTAAAAAATGCTTTCTTCTTTTAATATACTTTTTTGTTTATCTTATT
TCTAATACTTTCCCTAATCTCTTTCTTTCAGGGCAATAATGATACAATGTATCAT
GCCTCTTTGCACCATTCTAAAGAATAACAGTGATAATTTCTGGGTTAAGGCAAT
AGCAATATTTCTGCATATAAATATTTCTGCATATAAATTGTAACTGATGTAAGAG
GTTTCATATTGCTAATAGCAGCTACAATCCAGCTACCATTCTGCTTTTATTTTAT
GGTTGGGATAAGGCTGGATTATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCAT
GTTCATACCTCTTATCTTCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTG
GTGAAGAATACGACCACCTAATAAGATTACCGAAAGGCAATCTTATTAAAA
TATAAAGTGCAAAACATACCAGATCTGTGTGTTGGTTTTTTGTGTGTTAACG
GGGGAGGGGGAGGAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGGA
AAGGGGGAGGGGGA
GCGGCCGCCAACAACAACAACAACAACAACAACAACAACAA
CAACAACATAACAGTGTTCACTAGCAACCTCAAACAGACACC
ATG
ATGGGACCCTTGATGTTTTCTTTCCCCTTCTTTTCTATGGTTAAGTTCATGTCATA
GGAAGGGGAGAAGTAACAGGGTACACATATTGACCAAATCAGGGTAATTTTGC
ATTTGTAATTTTAAAAAATGCTTTCTTCTTTTAATATACTTTTTTGTTTATCTTATT
TCTAATACTTTCCCTAATCTCTTTCTTTCAGGGCAATAATGATACAATGTATCAT
GCCTCTTTGCACCATTCTAAAGAATAACAGTGATAATTTCTGGGTTAAGGCAAT
AGCAATATTTCTGCATATAAATATTTCTGCATATAAATTGTAACTGATGTAAGAG
GTTTCATATTGCTAATAGCAGCTACAATCCAGCTACCATTCTGCTTTTATTTTAT
GGTTGGGATAAGGCTGGATTATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCAT
GTTCATACCTCTTATCTTCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTG
AGGTGAAGAATACGACCACCTGCGTTTTATACTTCCAcGAGATCTGGGGAG
GTGAAGAATACGACCACCTAATAAGATTACCGAAAGGCAATCTTATTAAAA
CATACCAGATCTTgTGAGGGTGTTTGTGGCAAAACATACCAGATCGAATTC
TATAAAGTGCAAAACATACCAGATCTGTGTGTTGGTTTTTTGTGTGTTAACG
GGGGAGGGGGAGGAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGGA
AAGGGGGAGGGGGA
GCGGCCGCCAACAACAACAACAACAACAACAACAACAACAA
CAACAACATAACAGTGTTCACTAGCAACCTCAAACAGACACC
ATG
ATGGGACCCTTGATGTTTTCTTTCCCCTTCTTTTCTATGGTTAAGTTCATGTCATA
GGAAGGGGAGAAGTAACAGGGTACACATATTGACCAAATCAGGGTAATTTTGC
ATTTGTAATTTTAAAAAATGCTTTCTTCTTTTAATATACTTTTTTGTTTATCTTATT
TCTAATACTTTCCCTAATCTCTTTCTTTCAGGGCAATAATGATACAATGTATCAT
GCCTCTTTGCACCATTCTAAAGAATAACAGTGATAATTTCTGGGTTAAGGCAAT
AGCAATATTTCTGCATATAAATATTTCTGCATATAAATTGTAACTGATGTAAGAG
GTTTCATATTGCTAATAGCAGCTACAATCCAGCTACCATTCTGCTTTTATTTTAT
GGTTGGGATAAGGCTGGATTATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCAT
GTTCATACCTCTTATCTTCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTG
AGGTGAAGAATACGACCACCTGCGTTTTATACTTCCACGAGATCTGGGGAG
GTGAAGAATACGACCACCTAATAAGATTACCGAAAGGCAATCTTATTAAAA
CATACCAGATCTTGTGAGGGTGTTTGTGGCAAAACATACCAGATCGAATTC
GATCTGGGGAGGTGAAGAATACGACCACCTGCTACAAGTACCTAATAAAGT
GGGGAGGGGGAGGAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGGA
AAGGGGGAGGGGGA
GCGGCCGCCAACAACAACAACAACAACAACAACAACAACAA
CAACAACATAACAGTGTTCACTAGCAACCTCAAACAGACACC
ATG
ATGGGACCCTTGATGTTTTCTTTCCCCTTCTTTTCTATGGTTAAGTTCATGTCATA
GGAAGGGGAGAAGTAACAGGGTACACATATTGACCAAATCAGGGTAATTTTGC
ATTTGTAATTTTAAAAAATGCTTTCTTCTTTTAATATACTTTTTTGTTTATCTTATT
TCTAATACTTTCCCTAATCTCTTTCTTTCAGGGCAATAATGATACAATGTATCAT
GCCTCTTTGCACCATTCTAAAGAATAACAGTGATAATTTCTGGGTTAAGGCAAT
AGCAATATTTCTGCATATAAATATTTCTGCATATAAATTGTAACTGATGTAAGAG
GTTTCATATTGCTAATAGCAGCTACAATCCAGCTACCATTCTGCTTTTATTTTAT
GGTTGGGATAAGGCTGGATTATTCTGAGTCCAAGCTAGGCCCTTTTGCTAATCAT
GTTCATACCTCTTATCTTCCTCCCACAGCTCCTGGGCAACGTGCTGGTCTGTGTG
AGGTGAAGAATACGACCACCTGCGTTTTATACTTCCAcGAGATCTGGGGAG
GTGAAGAATACGACCACCTAATAAGATTACCGAAAGGCAATCTTATTAAAA
CATACCAGATCTTgTGAGGGTGTTTGTGGCAAAACATACCAGATCGAATTC
GGGGGAGGGGGAGGAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGG
AAGGGGGAGGGGGA
GCGGCCGCCAACAACAACAACAACAACAACAACAACAACAA
ACAACAACATAACAGTGTTCACTAGCAACCTCAAACAGACACC
ATG
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
CACACACACAAATCTGGGGAGGTGAAGAATACGACCACCTGCGTTTTAT
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTG
TTCACTAGCATCCCCAAACAGACACC
ATG
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AGCACACACACAAATCTGGGGAGGTGAAGAATACGACCACCTGCGTTTTAT
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTG
TTCACTAGCATCCCCCAGACCATCTACCACCGACACC
ATG
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
CACACACACAAATCTGGGGAGGTGAAGAATACGACCACCTGCGTTTTAT
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCTTAATTAAC
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
CACACACACAAATCTGGGGAGGTGAAGAATACGACCACCTGCGTTTTAT
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCTTAATTAAC
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
CACACACACAAATCTGGGGAGGTGAAGAATACGACCACCTGCGTTTTAT
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
CACACACACAAATCTGGGGAGGTGAAGAATACGACCACCTGCGTTTTAT
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCTTAATTAAC
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
CACACACACAAATCTGGGGAGGTGAAGAATACGACCACCTGCGTTTTAT
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCTTAATTAAC
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
CACACACACAAATCTGGGGAGGTGAAGAATACGACCACCTGCGTTTTAT
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCTTAATTAAC
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
CACACACACAAATCTGGGGAGGTGAAGAATACGACCACCTGCGTTTTAT
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCTTAATTAAC
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
CACACACACAAATCTGGGGAGGTGAAGAATACGACCACCTGCGTTTTAT
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCTTAATTAAC
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
CACACACACAAATCTGGGGAGGTGAAGAATACGACCACCTGCGTTTTAT
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGCGGCCTTAATTAACAGT
GTTCACTAGGACACCATG
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
CACACACACAAATCTGGGGAGGTGAAGAATACGACCACCTGCGTTTTAT
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCTTAATTAAC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
CACACACACAAATCTGGGGAGGTGAAGAATACGACCACCTGCGTTTTAT
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCTTAATTAAC
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
CACACACACAAATCTGGGGAGGTGAAGAATACGACCACCTGCGTTTTAT
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCTTAATTAAC
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
CACACACACAAATCTGGGGAGGTGAAGAATACGACCACCTGCGTTTTAT
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCTTAATTAAC
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
CACACACACAAATCTGGGGAGGTGAAGAATACGACCACCTGCGTTTTAT
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCTTAATTAAC
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTG
TTCACTAGCCCCCCCCAGACCATCTACCACCGACACC
ATG
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTG
TTCACTAGCCCCCCCCAGACCATCTACCACCGACACC
ATG
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTG
TTCACTAGCCCCCCCCAGACCATCTACCACCGACACC
ATG
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTG
TTCACTAGCCCCCCCCAGACCATCTACCACCGACACC
ATG
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
CTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTACC
GAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGCA
AAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCTG
CTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTGT
TGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGG
AAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTGT
TCACTAGCCCCCCCCAGACCATCTACCACCGACACC
ATG
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCTTAATTAAC
AGTGTTCACTAGAGCCAACAACAACAACAACAACAACAACAACAACGACACC
ATG
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCTTAATTAAC
AGTGTTCACTAGAGCCAACAACAACAACAACAACAACAACAACAACGACACC
ATG
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCTTAATTAAC
AGTGTTCACTAGAGCCAACAACAACAACAACAACAACAACAACAACGACACC
ATG
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCTTAATTAAC
AGTGTTCACTAGAGCCAACAACAACAACAACAACAACAACAACAACGACACC
ATG
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
ACTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTAC
CGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGC
AAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCT
GCTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTG
TTGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAG
GAAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTG
TTCACTAGCCCCCCCCAGACCATCTACCACCGACACC
ATG
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
CTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTACC
GAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGCA
AAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCTG
CTACAAGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTGT
TGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGG
AAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCTTAATTAACA
GTGTTCACTAGAGCCAACAACAACAACAACAACAACAACAACAACGACACC
ATG
ATGCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATAAGGCTGGATTATTCT
AG
ATGTTCCTCGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGAT
TACCGAAAGGCAATCTTATTAAAACATACCAGATCTTGAGAGGGTGTTTGT
GGCAAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCA
CCTGCTACAAGTACCTAATAAACATTAGCGGAGAAACATACCACTGTGTGT
TGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGG
AAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTGT
TCACTAGCAACCTCAAACAGACACC
ATG
ATGCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATAAGGCTGGATTATTCT
GAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
ATGTTCCTCGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGAT
TACCGAAAGGCAATCTTATTAAAACATACCAGATCTTGAGAGGGTGTTTGT
GGCAAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCA
CCTGCTACAAGTACCTAATAAACATTAGCGGAGAAACATACCACTGTGTGT
TGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGG
AAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTGT
ATGCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATAAGGCTGGATTATTCT
GAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
ATGTTCCTCGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGAT
TACCGAAAGGCAATCTTATTAAAACATACCAGATCTTGAGAGGGTGTTTGT
GGCAAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCA
CCTGCTACAAGTACCTAATAAACATTAGCGGAGAAACATACCACTGTGTGT
TGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGG
ATGCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATAAGGCTGGATTATTCT
GAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
ATGTTCCTCGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGAT
TACCGAAAGGCAATCTTATTAAAACATACCAGATCTTGAGAGGGTGTTTGT
GGCAAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCA
CCTGCTACAAGTACCTAATAAACATTAGCGGAGAAACATACCACTGTGTGT
TGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGG
ATGCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATAAGGCTGGATTATTCT
GAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
TACCGAAAGGCAATCTTATTAAAACATACCAGATCTTGAGAGGGTGTTTGT
GGCAAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCA
CCTGCTACAAGTACCTAATAAACATTAGCGGAGAAACATACCACTGTGTGT
TGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGG
ATGCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATAAGGCTGGATTATTCT
GAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
ATGTTCCTCGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGAT
TACCGAAAGGCAATCTTATTAAAACATACCAGATCTTGAGAGGGTGTTTGT
GGCAAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCA
CCTGCTACAAGTACCTAATAAACATTAGCGGAGAAACATACCACTGTGTGT
TGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGG
ATGCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATAAGGCTGGATTATTCT
GAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
ATGTTCCTCGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGAT
TACCGAAAGGCAATCTTATTAAAACATACCAGATCTTGAGAGGGTGTTTGT
GGCAAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCA
CCTGCTACAAGTACCTAATAAACATTAGCGGAGAAACATACCACTGTGTGT
TGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGG
ATGCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATAAGGCTGGATTATTCT
GAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
ATGTTCCTCGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGAT
TACCGAAAGGCAATCTTATTAAAACATACCAGATCTTGAGAGGGTGTTTGT
GGCAAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCA
CCTGCTACAAGTACCTAATAAACATTAGCGGAGAAACATACCACTGTGTGT
TGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGG
AAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTGT
ATGCCAGCTACCATTCTGCTTTTATTTTATGGTTGGGATAAGGCTGGATTATTCT
GAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
ATGTTCCTCGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGAT
GGCAAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCA
CCTGCTACAAGTACCTAATAAACATTAGCGGAGAAACATACCACTGTGTGT
TGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGG
AAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTGT
TAAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTC
TGAGTCCAAGCTAGGCCCTTTTGCTAATCATCTTCATACCTCTTATCTTCCTCTGC
AG
ATTTTCCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGAT
TACCGAAAGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGT
GGCAAAACATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCA
CCTGCTACAAGTACCTAATAAAAATTAGCGGAGAAACATACCACTGTGTGT
TGGTTTTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGG
AAAGGGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTGT
TCACTAGCATCCCCAAACAGACACC
ATG
AAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTCTGA
GGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGCAAAACA
TACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCTGCTACA
TTTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGGAAAG
GGGGAGGGGGAGGAAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTGTTCACT
AGCCCCCCCCAGACCATCTACCACCGACACC
ATG
AAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTCTGA
CACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTACCGAAA
GGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGCAAAACA
TACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCTGCTACA
AGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTGTTGGTTTT
TTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGGAAAGGG
GGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTGTTCACTAGC
CCCCCCCAGACCATCTACCACCGACACC
ATG
AAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTCTGA
ACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTACCGAAAG
GCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGCAAAACAT
GTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTGTTGGTTTTT
TGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGGAAAGGGG
GAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTGTTCACTAGCC
CCCCCCAGACCATCTACCACCGACACCATG
AAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTCTGA
CCACGAGATCTGGGGAGGTGAAGAATACGACCACCTAATAAGATTACCGAA
AGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGCAAAAC
ATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCTGCTACA
AGTACCTAATAAAGTATAAAGTGCAAAACATACCAGATCTGTGTGTTGGTTTT
TTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGGAAAGGG
GGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTGTTCACTAGC
CCCCCCCAGACCATCTACCACCGACACC
ATG
AAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTCTGA
GCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGCAAAACAT
ACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCTGCTACAA
TTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGGAAAGG
GGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTGTTCACTA
GCCCCCCCCAGACCATCTACCACCGACACC
ATG
AAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTCTGA
GCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGCAAAACAT
ACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCTGCTACAA
TTTGTGTGTTAACGGGGGAGGGGGAGGAAAGGGGGAGGGGGAGGAAAGG
GGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGGCCGCCATAACAGTGTTCACTA
GCCCCCCCCAGACCATCTACCACCGACACC
ATG
AAGCCAGCTACCATTCTGCTTTTATTTTATCGTTGGGATAAGGCTGGATTATTCTGA
AGGCAATCTTATTAAAACATACCAGATCTTGTGAGGGTGTTTGTGGCAAAAC
ATACCAGATCGAATTCGATCTGGGGAGGTGAAGAATACGACCACCTGCTACA
GGGGAGGGGGAGGAAAGGGGGAGGGGGA
GCGCCGCCATAACAGTGTTCACT
AGCCCCCCCCAGACCATCTACCACCGACACC
ATG
Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. The scope of the present invention is not intended to be limited to the above Description, but rather is as set forth in the following claims:
This application claims priority to U.S. Provisional Application No. 62/894,611, filed on Aug. 30, 2019, U.S. Provisional Application No. 62/904,635, filed on Sep. 23, 2019, and U.S. Provisional Application No. 63/043,504, filed Jun. 24, 2020, the contents of each of which are incorporated herein by reference in their entirety.
This invention was made with government support under EB013584 awarded by the National Institutes of Health. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2020/048561 | 8/28/2020 | WO |
Number | Date | Country | |
---|---|---|---|
63043504 | Jun 2020 | US | |
62904635 | Sep 2019 | US | |
62894611 | Aug 2019 | US |