A wide variety of diseases and disorders are caused by mutations, deletions, or altered expression of genes. Many of these genes are tightly regulated in healthy individuals such that over-expression or under-expression of the gene may result in detrimental side effects. Additionally, some diseases and disorders are characterized by different cell genotypes of healthy and diseased cells within a subject. As a result, expression of a gene, such as a transgene, may be therapeutic in one cell type but detrimental in another cell type. While substantial progress is being made toward delivery of transgenes into individuals for treatment of genetic disorders, there remains a need for gene therapies that can regulate transgene expression in a cell-type or cell state dependent manner.
In various aspects, the present disclosure provides an engineered core promoter, wherein the engineered core promoter comprises a sequence having at least 80% sequence identity to SEQ ID NO: 4 and including nucleotide substitutions T15G and A40T relative to SEQ ID NO: 5.
In some aspects, the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 4 and including the nucleotide substitutions T15G and A40T relative to SEQ ID NO: 5. In some aspects, the engineered core promoter comprises a sequence of SEQ ID NO: 4.
In some aspects, the engineered core promoter further comprises a transcriptional pause site. In some aspects, the transcriptional pause site comprises a sequence having at least 90% sequence identity to any one of SEQ ID NO: 196-SEQ ID NO: 198, SEQ ID NO: 211, or SEQ ID NO: 212. In some aspects, the transcriptional pause site comprises a sequence of any one of SEQ ID NO: 196-SEQ ID NO: 198, SEQ ID NO: 211, or SEQ ID NO: 212. In some aspects, the engineered core promoter further comprises a YY1 motif. In some aspects, the YY1 motif comprises a sequence having at least 90% sequence identity to any one of SEQ ID NO: 205-SEQ ID NO: 207. In some aspects, the YY1 motif comprises a sequence having a sequence of any one of SEQ ID NO: 205-SEQ ID NO: 207. In some aspects, the engineered core promoter comprises a spacer of 30 to 34 nucleotides.
In some aspects, a transcriptional activity produced by the engineered core promoter paired with an activated response element is higher than a transcriptional activity produced by a core promoter of SEQ ID NO: 6 when paired with the activated response element, a basal transcriptional activity produced by the engineered core promoter paired with an activated response element is lower than a basal transcriptional activity produced by a core promoter of SEQ ID NO: 6 when paired with an inactive response element, or both. In some aspects, a transcriptional activity produced by the engineered core promoter paired with an activated response element is higher than a transcriptional activity produced by a core promoter of SEQ ID NO: 9 when paired with the activated response element, a basal transcriptional activity produce by the engineered core promoter paired with an activated response element is lower than a basal transcriptional activity produced by a core promoter of SEQ ID NO: 9 when paired with an inactive response element, or both.
In various aspects, the present disclosure provides an engineered promoter comprising a response element and an engineered core promoter as described herein.
In some aspects, the response element confers bone marrow-specific transcription, liver-specific transcription, neuron-specific transcription, muscle-specific transcription, or kidney-specific transcription of a payload under transcriptional control of the engineered promoter. In some aspects, the response element comprises a sequence having at least 90% sequence identity to SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194. In some aspects, the response element comprises a sequence of SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194.
In various aspects, the present disclosure provides a recombinant polynucleotide comprising an engineered core promoter as described herein or an engineered promoter as described herein and a payload comprising a coding sequence under transcriptional control of the engineered core promoter.
In some aspects, the payload encodes a protein. In some aspects, the protein is a neuronal protein, a kidney protein, a retinal protein, a muscle protein, or an apoptosis-inducing protein. In some aspects, the protein is associated with a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. In some aspects, the protein is progranulin, MeCP2, polycystin-1, or polycystin-2.
In some aspects, the payload encodes a therapeutic polynucleotide. In some aspects, the therapeutic polynucleotide is a guide RNA or a suppressor tRNA. In some aspects, the therapeutic polynucleotide targets a gene. In some aspects, the gene is associated with a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. In some aspects, the gene is GRN, MECP2, PKD2, or PKD2.
In various aspects, the present disclosure provides an engineered viral vector comprising an engineered core promoter as described herein, an engineered promoter as described herein, or a recombinant polynucleotide as described herein in a viral vector.
In some aspects, the viral vector is an adenoviral vector, an adeno-associated viral vector, or a lentivector. In some aspects, the adeno-associated viral vector is selected from the group consisting of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV 10, AAV11, AAV12, AAV13, AAV14, AAV15, AAV16, AAV-DJ, AAV-DJ/8, AAV-DJ/9, AAV1/2, AAV.rh8, AAV.rh10, AAV.rh20, AAV.rh39, AAV.Rh43, AAV.Rh74, AAV.v66, AAV.Oligo001, AAV.SCH9, AAV.r3.45, AAV.RHM4-1, AAV.hu37, AAV.Anc80, AAV.Anc80L65, AAV.7m8, AAV.PhP.eB, AAV.PhP.V1, AAV.PHP.B, AAV.PhB.C1, AAV.PhB.C2, AAV.PhB.C3, AAV.PhB.C6, AAV.cy5, AAV2.5, AAV2tYF, AAV3B, AAV.LK03, AAV.HSC1, AAV.HSC2, AAV.HSC3, AAV.HSC4, AAV.HSC5, AAV.HSC6, AAV.HSC7, AAV.HSC8, AAV.HSC9, AAV.HSC10, AAV.HSC11, AAV.HSC12, AAV.HSC13, AAV.HSC14, AAV.HSC15, AAV.HSC16, AAV.HSC17, AAVhu68, and combinations thereof.
In various aspects, the present disclosure provides a pharmaceutical composition comprising an engineered core promoter as described herein, an engineered promoter as described herein, a recombinant polynucleotide as described herein, or a viral vector as described herein, and a pharmaceutically acceptable carrier.
In various aspects, the present disclosure provides a method of treating a disorder in a subject in need thereof, the method comprising: administering to the subject a composition comprising a recombinant polynucleotide as described herein, a viral vector as described herein, or a pharmaceutical composition as described herein; and expressing a therapeutic sequence encoded by a payload of the recombinant polynucleotide in a target cell of the subject, thereby treating the disorder.
In some aspects, the target cell is a cell type or cell state associated with the disorder. In some aspects, the disorder is a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. In some aspects, the disorder is Rett syndrome, MECP2 duplication syndrome, frontotemporal dementia, neuronal ceroid lipofuscinosis, cancer, atherosclerosis, Alzheimer's disease, amyotrophic lateral sclerosis, limbic predominant age-related TDP-43 encephalopathy, or polycystic kidney disease. In some aspects, the therapeutic sequence encodes a therapeutic protein. In some aspects, the therapeutic protein is a neuronal protein, a kidney protein, a retinal protein, a muscle protein, or an apoptosis-inducing protein. In some aspects, the therapeutic protein is associated with a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. In some aspects, the therapeutic protein is MECP2, progranulin, polycystin-1, or polycystin-2. In some aspects, the payload encodes a therapeutic polynucleotide. In some aspects, the therapeutic polynucleotide is a guide RNA or a suppressor tRNA.
In various aspects, the present disclosure provides a method of identifying a switchable core promoter, the method comprising: introducing a core promoter library comprising a first sub-library and a second sub-library to a population of cells; wherein the first sub-library comprises a plurality of core promoters, wherein a core promoter of the plurality of core promoters is linked to a first response element, and a unique barcode sequence; wherein the second sub-library comprises the plurality of core promoters, wherein the core promoter of the plurality of promoters is linked to a second response element and, a unique barcode sequence; and identifying a switchable core promoter as the core promoter that promotes higher transcription of the unique barcode when paired with the first response element than when paired with the second response element.
In some aspects, the method further comprises activating the first response element. In some aspects, the second response element is not activated. In some aspects, the first response element is an activated response element and the second enhancer sequence is an inactive response element or an unactivated response element.
In various aspects, the present disclosure provides an engineered core promoter comprising a TATA box, an initiator element, a spacer separating the TATA box and the initiator element, and one or more features selected from: a) the TATA box comprises a sequence having at least 85% sequence identity to any one of SEQ ID NO: 199-SEQ ID NO: 201 or SEQ ID NO: 213-SEQ ID NO: 215; b) the initiator element comprises a sequence having at least 75% sequence identity to any one of SEQ ID NO: 202-SEQ ID NO: 204, SEQ ID NO: 216, or SEQ ID NO: 217; c) the spacer is a short spacer comprising no less than 25 and no more than 29 nucleotide residues from the TATA box to the initiator element; d) the spacer is a long spacer comprising no less than 31 and no more than 48 nucleotide residues from the TATA box to the initiator element; e) a YY1 motif comprising a sequence having at least 80% sequence identity to any one of SEQ ID NO: 205-SEQ ID NO: 207; f) a transcriptional pause site comprising a sequence having at least 90% sequence identity to any one of SEQ ID NO: 196-SEQ ID NO: 198, SEQ ID NO: 211, or SEQ ID NO: 212; g) at least 70% sequence identity and at least one nucleotide substitution relative to SEQ ID NO: 5, SEQ ID NO: 9, or SEQ ID NO: 210; and h) combinations thereof.
In some aspects, the engineered core promoter comprises at least 70% sequence identity and nucleotide substitutions T15G and A40C relative to SEQ ID NO: 5. In some aspects, the engineered core promoter comprises at least 70% sequence identity and nucleotide substitutions T15A and A40T relative to SEQ ID NO: 5.
In some aspects, the spacer comprises from 29 to 31 nucleotides from the TATA box to the initiator element. In some aspects, the spacer comprises 30 nucleotides from the TATA box to the initiator element. In some aspects, the short spacer comprises 28 nucleotide residues from the TATA box to the initiator element. In some aspects, the long spacer comprises 32 nucleotide residues from the TATA box to the initiator element.
In some aspects, the TATA box comprises a sequence having at least 90% sequence identity to any one of SEQ ID NO: 199-SEQ ID NO: 201 or SEQ ID NO: 213-SEQ ID NO: 215. In some aspects, the TATA box comprises a sequence of any one of SEQ ID NO: 199-SEQ ID NO: 201 or SEQ ID NO: 213-SEQ ID NO: 215.
In some aspects, the initiator element comprises a sequence having at least 85% sequence identity to any one of SEQ ID NO: 202-SEQ ID NO: 204, SEQ ID NO: 216, or SEQ ID NO: 217. In some aspects, the initiator element comprises a sequence of any one of SEQ ID NO: 202-SEQ ID NO: 204, SEQ ID NO: 216, or SEQ ID NO: 217.
In some aspects, the YY1 motif comprises a sequence having at least 90% sequence identity to any one of SEQ ID NO: 205-SEQ ID NO: 207. In some aspects, the YY1 motif comprises a sequence of any one of SEQ ID NO: 205-SEQ ID NO: 207.
In some aspects, the transcriptional pause site comprises a sequence having at least 95% sequence identity to any one of SEQ ID NO: 196-SEQ ID NO: 198, SEQ ID NO: 211, or SEQ ID NO: 212. In some aspects, the transcriptional pause site comprises a sequence of any one of SEQ ID NO: 196-SEQ ID NO: 198, SEQ ID NO: 211, or SEQ ID NO: 212.
In some aspects, the engineered core promoter comprises two or more of the features. In some aspects, the engineered core promoter comprises features a) and b). In some aspects, the engineered core promoter comprises three or more of the features. In some aspects, the engineered core promoter comprises features a) and b) and at least one more of the features. In some aspects, the engineered core promoter comprises features a), b), and c). In some aspects, the engineered core promoter comprises features a), b), and d). In some aspects, the engineered core promoter comprises features a), b), and e). In some aspects, the engineered core promoter comprises features a), b), and f).
In various aspects, the present disclosure provides an engineered core promoter comprising a sequence having at least 80%, at least 85%, at least 90%, at least 93%, at least 95%, at least 97%, at least 99%, or 100% sequence identity to any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144.
In some aspects, the engineered core promoter comprises a sequence having at least 90% sequence identity to any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144. In some aspects, the engineered core promoter comprises a sequence having at least 80%, at least 85%, at least 90%, at least 93%, at least 95%, at least 97%, at least 99%, or 100% sequence identity to any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 10-SEQ ID NO: 15, or SEQ ID NO: 110. In some aspects, the engineered core promoter comprises a sequence having at least 90% sequence identity to any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 10-SEQ ID NO: 15, or SEQ ID NO: 110.
In some aspects, the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 7. In some aspects, the engineered core promoter comprises a sequence of SEQ ID NO: 7. In some aspects, the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 1. In some aspects, the engineered core promoter comprises a sequence of SEQ ID NO: 1. In some aspects, the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 2. In some aspects, the engineered core promoter comprises a sequence of SEQ ID NO: 2. In some aspects, the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 3. In some aspects, the engineered core promoter comprises a sequence of SEQ ID NO: 3. In some aspects, the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 4 and including nucleotide substitutions T15G and A40T relative to SEQ ID NO: 5. In some aspects, the engineered core promoter comprises a sequence of SEQ ID NO: 4. In some aspects, the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 10. In some aspects, the engineered core promoter comprises a sequence of SEQ ID NO: 10. In some aspects, the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 11. In some aspects, the engineered core promoter comprises a sequence of SEQ ID NO: 11. In some aspects, the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 12. In some aspects, the engineered core promoter comprises a sequence of SEQ ID NO: 12. In some aspects, the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 13. In some aspects, the engineered core promoter comprises a sequence of SEQ ID NO: 13. In some aspects, the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 14. In some aspects, the engineered core promoter comprises a sequence of SEQ ID NO: 14. In some aspects, the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 15 and including nucleotide substitutions T15G and A40C relative to SEQ ID NO: 5. In some aspects, the engineered core promoter comprises a sequence of SEQ ID NO: 15. In some aspects, the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 110. In some aspects, the engineered core promoter comprises a sequence of SEQ ID NO: 110.
In some aspects, the engineered core promoter comprises a TATA box, an RNA polymerase binding sequence, a B recognition element, a CCAAT box, or a Pribnow box. In some aspects, the engineered core promoter is capable of recruiting a polymerase. In some aspects, the polymerase is an RNA polymerase II.
In some aspects, the engineered core promoter has a dynamic range that is higher than a dynamic range of SEQ ID NO: 6. In some aspects, the engineered core promoter has a dynamic range that is higher than a dynamic range of SEQ ID NO: 9. In some aspects, the engineered core promoter has an activity that is higher than an activity of SEQ ID NO: 6. In some aspects, the engineered core promoter has an activity that is higher than an activity of SEQ ID NO: 9. In some aspects, a transcriptional activity produced by the engineered core promoter paired with an activated response element is higher than a transcriptional activity produced by a core promoter of SEQ ID NO: 6 when paired with the activated response element, a basal transcriptional activity produced by the engineered core promoter paired with an activated response element is lower than a basal transcriptional activity produced by a core promoter of SEQ ID NO: 6 when paired with an inactive response element, or both. In some aspects, a transcriptional activity produced by the engineered core promoter paired with an activated response element is higher than a transcriptional activity produced by a core promoter of SEQ ID NO: 9 when paired with the activated response element, a basal transcriptional activity produced by the engineered core promoter paired with an activated response element is lower than a basal transcriptional activity produced by a core promoter of SEQ ID NO: 9 when paired with an inactive response element, or both.
In various aspects, the present disclosure provides an engineered response element comprising a sequence having at least 80%, at least 85%, at least 90%, at least 93%, at least 95%, at least 97%, at least 99%, or 100% sequence identity to SEQ ID NO: 145 or SEQ ID NO: 146.
In some aspects, the engineered response element comprises a sequence having at least 90% sequence identity to SEQ ID NO: 145 or SEQ ID NO: 146. In some aspects, the engineered response element comprises a sequence of SEQ ID NO: 145 or SEQ ID NO: 146. In some aspects, the engineered response element is capable of binding to a cognate ligand, a coactivator, or a corepressor. In some aspects, the engineered response element is an enhancer. In some aspects, binding of the cognate ligand or the coactivator to the engineered response element increases transcription of a payload under transcriptional control of the engineered response element paired with a core promoter in a target cell type, a target cell state, or a target tissue. In some aspects, binding of the corepressor to the engineered response element decreases transcription of a payload under transcriptional control of the engineered response element paired with a core promoter in a non-target cell type, a non-target cell state, or a non-target tissue. In some aspects, the engineered response element confers bone marrow-specific transcription of a payload under transcriptional control of the engineered response element paired with a core promoter. In some aspects, the engineered response element comprises a sequence of SEQ ID NO: 145. In some aspects, the engineered response element confers liver-specific transcription of a payload under transcriptional control of the engineered response element paired with a core promoter. In some aspects, the engineered response element comprises a sequence of SEQ ID NO: 146.
In various aspects, the present disclosure provides an engineered promoter comprising a response element and a core promoter; wherein the response element comprises a sequence having at least 80%, at least 85%, at least 90%, at least 93%, at least 95%, at least 97%, at least 99%, or 100% sequence identity to SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194, and wherein the core promoter comprises a sequence having at least 80%, at least 85%, at least 90%, at least 93%, at least 95%, at least 97%, at least 99%, or 100% sequence identity to of any one of SEQ ID NO: 1-SEQ ID NO: 144 or SEQ ID NO: 210.
In some aspects, the core promoter comprises a sequence having at least 90% sequence identity to any one of SEQ ID NO: 1-SEQ ID NO: 144 or SEQ ID NO: 210. In some aspects, the core promoter comprises a sequence of SEQ ID NO: 1-SEQ ID NO: 144 or SEQ ID NO: 210. In some aspects, the engineered promoter comprises a sequence of any one of SEQ ID NO: 148-SEQ ID NO: 177 or SEQ ID NO: 218-SEQ ID NO: 255.
In various aspects, the present disclosure provides an engineered promoter comprising a response element and an engineered core promoter as described herein.
In some aspects, the response element is capable of binding to a cognate ligand, a coactivator, or a corepressor. In some aspects, the response element is an enhancer. In some aspects, binding of the cognate ligand or the coactivator to the response element increases transcription of a payload under transcriptional control of the engineered promoter in a target cell type, a target cell state, or a target tissue. In some aspects, binding of the corepressor to the response element decreases transcription of a payload under transcriptional control of the engineered promoter in a non-target cell type, a non-target cell state, or a non-target tissue. In some aspects, the response element confers bone marrow-specific transcription of a payload under transcriptional control of the engineered promoter. In some aspects, the response element comprises a sequence having at least 90% sequence identity to SEQ ID NO: 145 or SEQ ID NO: 193. In some aspects, the response element comprises a sequence of SEQ ID NO: 145 or SEQ ID NO: 193. In some aspects, the response element confers liver-specific transcription of a payload under transcriptional control of the engineered promoter. In some aspects, the response element comprises a sequence having at least 90% sequence identity to SEQ ID NO: 146 or SEQ ID NO: 194. In some aspects, the response element comprises a sequence of SEQ ID NO: 146 or SEQ ID NO: 194. In some aspects, the response element confers neuron-specific transcription of a payload under transcriptional control of the engineered promoter. In some aspects, the response element confers muscle-specific transcription of a payload under transcriptional control of the engineered promoter. In some aspects, the response element confers kidney-specific transcription of a payload under transcriptional control of the engineered promoter.
In various aspects, the present disclosure provides a recombinant polynucleotide comprising an engineered core promoter as described herein and a payload comprising a coding sequence under transcriptional control of the engineered core promoter.
In various aspects, the present disclosure provides a recombinant polynucleotide comprising a response element comprising a sequence having at least 80%, at least 85%, at least 90%, at least 93%, at least 95%, at least 97%, at least 99%, or 100% sequence identity to SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194 and a payload comprising a coding sequence under transcriptional control of the response element.
In some aspects, the response element comprises a sequence having at least 90% sequence identity to SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194. In some aspects, the response element comprises a sequence of SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194.
In various aspects, the present disclosure provides a recombinant polynucleotide comprising a promoter and a payload, wherein the promoter comprises: a response element capable of binding to a cognate ligand, a coactivator, or a corepressor; and the engineered core promoter of any one of claims 1-60 capable of recruiting a polymerase; wherein the payload comprises a coding sequence under transcriptional control of the promoter.
In some aspects, the response element is an enhancer. In some aspects, binding of the cognate ligand or the coactivator to the response element increases transcription of the payload in a target cell type, a target cell state, or a target tissue. In some aspects, binding of the corepressor to the response element decreases transcription of the payload in a non-target cell type, a non-target cell state, or a non-target tissue. In some aspects, the response element confers neuron-specific transcription of the payload. In some aspects, the response element confers bone marrow-specific transcription of the payload. In some aspects, the response element comprises a sequence having at least 90% sequence identity to SEQ ID NO: 145 or SEQ ID NO: 193. In some aspects, the response element comprises a sequence of SEQ ID NO: 145 or SEQ ID NO: 193. In some aspects, the response element confers liver-specific transcription of the payload. In some aspects, the response element comprises a sequence having at least 90% sequence identity to SEQ ID NO: 146 or SEQ ID NO: 194. In some aspects, the response element comprises a sequence of SEQ ID NO: 146 or SEQ ID NO: 194. In some aspects, the response element confers muscle-specific transcription of the payload. In some aspects, the response element confers kidney-specific transcription of the payload.
In various aspects, the present disclosure provides a recombinant polynucleotide comprising a promoter and a payload, wherein the promoter comprises: a response element capable of binding to a cognate ligand, a coactivator, or a corepressor, wherein the response element comprises a sequence having at least 90% sequence identity to SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194; and an engineered core promoter capable of recruiting a polymerase; wherein the payload comprises a coding sequence under transcriptional control of the promoter.
In some aspects, the payload encodes a protein. In some aspects, the protein is a neuronal protein, a kidney protein, a retinal protein, a muscle protein, or an apoptosis-inducing protein. In some aspects, the protein is associated with a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. In some aspects, the protein is progranulin, MeCP2, polycystin-1, or polycystin-2.
In some aspects, the payload encodes a therapeutic polynucleotide. In some aspects, the therapeutic polynucleotide is a guide RNA or a suppressor tRNA. In some aspects, the therapeutic polynucleotide targets a gene. In some aspects, the gene is associated with a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. In some aspects, the gene is GRN, MECP2, PKD2, or PKD2.
In various aspects, the present disclosure provides an engineered viral vector comprising the engineered core promoter of any one of claims 1-60 in a viral vector.
In various aspects, the present disclosure provides an engineered viral vector comprising an engineered response element as described herein in a viral vector.
In various aspects, the present disclosure provides an engineered viral vector comprising an engineered promoter as described herein in a viral vector.
In various aspects, the present disclosure provides an engineered viral vector comprising a recombinant polynucleotide as described herein in a viral vector.
In some aspects, the viral vector is an adenoviral vector, an adeno-associated viral vector, or a lentivector. In some aspects, the adeno-associated viral vector is selected from the group consisting of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV 10, AAV11, AAV12, AAV13, AAV14, AAV15, AAV16, AAV-DJ, AAV-DJ/8, AAV-DJ/9, AAV1/2, AAV.rh8, AAV.rh10, AAV.rh20, AAV.rh39, AAV.Rh43, AAV.Rh74, AAV.v66, AAV.Oligo001, AAV.SCH9, AAV.r3.45, AAV.RHM4-1, AAV.hu37, AAV.Anc80, AAV.Anc80L65, AAV.7m8, AAV.PhP.eB, AAV.PhP.V1, AAV.PHP.B, AAV.PhB.C1, AAV.PhB.C2, AAV.PhB.C3, AAV.PhB.C6, AAV.cy5, AAV2.5, AAV2tYF, AAV3B, AAV.LK03, AAV.HSC1, AAV.HSC2, AAV.HSC3, AAV.HSC4, AAV.HSC5, AAV.HSC6, AAV.HSC7, AAV.HSC8, AAV.HSC9, AAV.HSC10, AAV.HSC11, AAV.HSC12, AAV.HSC13, AAV.HSC14, AAV.HSC15, AAV.HSC16, AAV.HSC17, AAVhu68, and combinations thereof.
In various aspects, the present disclosure provides a cell comprising an engineered core promoter as described herein.
In various aspects, the present disclosure provides a cell comprising an engineered response element as described herein.
In various aspects, the present disclosure provides a cell comprising an engineered promoter as described herein.
In various aspects, the present disclosure provides a cell comprising a recombinant polynucleotide as described herein.
In various aspects, the present disclosure provides a cell comprising an engineered viral vector as described herein.
In some aspects, the cell is a target cell. In some aspects, the cell is a diseased cell. In some aspects, the cell is a central nervous system cell, a neuron, a renal cell, a retinal cell, a hepatocyte, an epithelial cell, a muscle cell, an erythrocyte, a platelet, a bone marrow cell, an endothelial cell, an epidermal cell, a lymphocyte, a glial cell, an interstitial cell, an adipocyte, or a fibroblast.
In various aspects, the present disclosure provides a pharmaceutical composition comprising an engineered core promoter as described herein and a pharmaceutically acceptable carrier.
In various aspects, the present disclosure provides a pharmaceutical composition comprising an engineered response element as described herein and a pharmaceutically acceptable carrier.
In various aspects, the present disclosure provides a pharmaceutical composition comprising an engineered promoter as described herein and a pharmaceutically acceptable carrier.
In various aspects, the present disclosure provides a pharmaceutical composition comprising a recombinant polynucleotide as described herein and a pharmaceutically acceptable carrier.
In various aspects, the present disclosure provides a pharmaceutical composition comprising an engineered viral vector as described herein and a pharmaceutically acceptable carrier.
In various aspects, the present disclosure provides a method of expressing a payload in a target cell of a subject, the method comprising: administering to the subject a recombinant polynucleotide as described herein a viral vector as described herein, or a pharmaceutical composition as described herein to the subject; binding a cognate ligand, a coactivator, or a corepressor to the response element of the recombinant polynucleotide, wherein the cognate ligand or the coactivator is specific to the target cell, or wherein the corepressor is specific to a non-target cell; initiating transcription of the payload by recruiting a polymerase to the core promoter; and transcribing the payload, thereby expressing the payload in the target cell.
In some aspects, the method comprises expressing the payload at a higher level in the target cell than in the non-target cell. In some aspects, the method comprises initiating transcription at a higher rate in the target cell than in the non-target cell. In some aspects, the cognate ligand or the coactivator is present at a higher level in the target cell than in the non-target cell. In some aspects, the cognate ligand is a transcription factor. In some aspects, the corepressor is present at a higher level in the non-target cell than in the target cell. In some aspects, the target cell is a target cell state, and wherein the non-target cell is a non-target cell state. In some aspects, the target cell state is a diseased cell. In some aspects, the non-target cell state is a healthy cell. In some aspects, the target cell is a target cell type, and wherein the non-target cell is a non-target cell type. In some aspects, the target cell type is a central nervous system cell, a neuron, a renal cell, a retinal cell, a hepatocyte, an epithelial cell, a muscle cell, an erythrocyte, a platelet, a bone marrow cell, an endothelial cell, an epidermal cell, a lymphocyte, a glial cell, an interstitial cell, an adipocyte, or a fibroblast. In some aspects, the non-target cell type is a central nervous system cell, a neuron, a renal cell, a retinal cell, a hepatocyte, an epithelial cell, a muscle cell, an erythrocyte, a platelet, a bone marrow cell, an endothelial cell, an epidermal cell, a lymphocyte, a glial cell, an interstitial cell, an adipocyte, a fibroblast, or combinations thereof.
In various aspects, the present disclosure provides a method of treating a disorder in a subject in need thereof, the method comprising: administering to the subject a composition comprising a recombinant polynucleotide as described herein, a viral vector as described herein, or a pharmaceutical composition as described herein; and expressing a therapeutic sequence encoded by a payload of the recombinant polynucleotide in a target cell of the subject, thereby treating the disorder.
In some aspects, the target cell is a cell type associated with the disorder. In some aspects, the target cell is a cell state associated with the disorder. In some aspects, the disorder is a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. In some aspects, the disorder is Rett syndrome, MECP2 duplication syndrome, frontotemporal dementia, neuronal ceroid lipofuscinosis, cancer, atherosclerosis, Alzheimer's disease, amyotrophic lateral sclerosis, limbic predominant age-related TDP-43 encephalopathy, or polycystic kidney disease. In some aspects, the disorder is any one of the disorders provided in TABLE 5. In some aspects, the therapeutic sequence encodes a therapeutic protein. In some aspects, the therapeutic protein is a neuronal protein, a kidney protein, a retinal protein, a muscle protein, or an apoptosis-inducing protein. In some aspects, the therapeutic protein is associated with a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. In some aspects, the therapeutic protein is MECP2, progranulin, polycystin-1, or polycystin-2. In some aspects, the therapeutic protein is encoded by a gene provided in TABLE 5. In some aspects, the payload encodes a therapeutic polynucleotide. In some aspects, the therapeutic polynucleotide is a guide RNA or a suppressor tRNA. In some aspects, the therapeutic polynucleotide targets a gene provided in TABLE 5.
In various aspects, the present disclosure provides a method of identifying a switchable core promoter, the method comprising: introducing a core promoter library comprising a first sub-library and a second sub-library to a population of cells; wherein the first sub-library comprises a plurality of core promoters, wherein a core promoter of the plurality of core promoters is linked to a first response element, and a unique barcode sequence; wherein the second sub-library comprises the plurality of core promoters, wherein the core promoter of the plurality of promoters is linked to a second response element and, a unique barcode sequence; and identifying a switchable core promoter as the core promoter that promotes higher transcription of the unique barcode when paired with the first response element than when paired with the second response element.
In some aspects, the method further comprises activating the first response element. In some aspects, the second response element is not activated. In some aspects, the first response element is an activated response element and the second enhancer sequence is an inactive response element or an unactivated response element. In some aspects, the first response element is specific for the population of cells. In some aspects, the population of cells are neurons, kidney cells, liver cells, muscle cells, or cancer cells. In some aspects, the first response element is specific for neurons, kidney cells, liver cells, muscle cells, or cancer cells. In some aspects, the second response element is specific for neurons, kidney cells, liver cells, muscle cells, or cancer cells. In some aspects, the plurality of core promoters comprises engineered core promoters, synthetic core promoters, wild type core promoters, variant core promoters, or combinations thereof.
In various aspects, the present disclosure provides a recombinant polynucleotide as described herein, a viral vector as described herein, or a pharmaceutical composition as described herein; for use in a method of treating a disorder, the method comprising administering to a subject a composition comprising the recombinant polynucleotide, viral vector or pharmaceutical composition and expressing a therapeutic sequence encoded by a payload of the recombinant polynucleotide in a target cell of the subject.
In some aspects, the disorder treated is a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. In some aspects, the disorder is Rett syndrome, MECP2 duplication syndrome, frontotemporal dementia, neuronal ceroid lipofuscinosis, cancer, atherosclerosis, Alzheimer's disease, amyotrophic lateral sclerosis, limbic predominant age-related TDP-43 encephalopathy, or polycystic kidney disease. In some aspects, the disorder is any one of the disorders provided in TABLE 5.
All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference.
The novel features of the invention are set forth with particularity in the appended claims. A better understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the invention are utilized, and the accompanying drawings of which:
5), is denoted with a white box.
While gene therapy has the potential to treat a wide variety of diseases associated with genetic mutations or altered gene expression, there may be risks associated with such therapies due, in part, to imprecise targeting of cells or tissues with the therapeutic payload. Limiting payload expression to a target cell type, cell state, or tissue type, such as a diseased cell or tissue, may reduce the side effects of gene therapy caused by imprecise targeting. Described herein are polynucleotide sequences that encode for cell type- or cell state-specific expression of a payload. The polynucleotide sequences may comprise a payload, such as a transgene or therapeutic polynucleotide, under transcriptional control of a promoter. The promoter may facilitate cell type- or cell state-specific transcription initiation, leading to cell type- or cell state-specific expression of the payload. In some embodiments, the promoter may comprise a response element that is capable of binding to cognate ligands, coactivators, or corepressors to modulate payload transcription and a core promoter that is capable of recruiting transcriptional machinery to initiate transcription of the payload when in combination with the response element bound to cognate ligands, coactivators, or corepressors. As described herein, the core promoter may be a switchable core promoter that promotes high levels of transcription in the presence of an activated response element, such as an enhancer bound to a cognate ligand or coactivator, and low or no transcription in the presence of an unactivated response element, such as an enhancer not bound to the cognate ligand or coactivator or an enhancer bound to a corepressor. The cognate ligand may be a transcription factor or sequence-specific DNA binding factor.
The polynucleotides of the present disclosure (e.g., promoters, core promoters, response elements, payloads, or combinations thereof) may comprise a recombinant polynucleotide sequence. The recombinant polynucleotide sequence may be engineered to encode for tissue type-, cell type, or cell state-specific transcription of a payload. In some embodiments, the level of transcription of the payload may depend on a cell type (e.g., neuron, renal cell, hepatocyte, podocyte, retinal cell, epithelial cell, muscle cell, erythrocyte, platelet, bone marrow cell, endothelial cell, epidermal cell, lymphocyte, glial cell, interstitial cell, adipocyte, or fibroblast), cell state (e.g., diseased cell or healthy cell; activated or unactivated engineered cells, such as an activated or unactivated CAR T cell), tissue type (e.g., diseased tissue, healthy tissue, nervous tissue (e.g., central nervous system, peripheral nervous system), kidney tissue, eye tissue, muscle tissue, blood, skin, fat, bone, cancerous tissue, thymus tissue, gastrointestinal tissue (e.g., stomach, intestine), spleen tissue, placenta tissue, pancreatic tissue, lung tissue, liver tissue, cardiac tissue, or brain tissue (e.g., cerebrum, cerebellum, adrenal)), or a combination thereof. In some embodiments, a recombinant polynucleotide of the present disclosure may comprise a core promoter (e.g., an engineered core promoter) as described herein, a response element (e.g., an engineered response element) as described herein, a payload (e.g., transgene or therapeutic polynucleotide) as described herein, or combinations thereof.
The promoter may be selected for enhanced transcription in a cell type, cell state, or tissue type of interest, reduced transcription in other cell types, cell states, or tissue types, or combinations thereof. In some embodiments, the promoter may be selected or engineered to tune the level of payload transcription (e.g., to a therapeutic level, such as about the same level as a wild type version of a transgene in the target cell type or cell state). In some embodiments, the promoter may be selected or engineered to tune the cell state-dependence of payload transcription. Tuning a transcription level may comprise adjusting transcription to a desired level. The transcription level of the payload may control the expression level of the protein encoded by the payload. For example, a high level of transcription of a transgene may lead to a high level of expression of the protein encoded by the transgene. A core promoter of the present disclosure (e.g., an engineered core promoter) may be engineered or selected for both strength (e.g., strong transcriptional activation) and specificity (e.g., specific activation when paired with a response element in a transcriptionally active state compared to when paired with a response element in a transcriptionally inactive state).
Also described herein are methods of delivering a polynucleotide sequence of the present disclosure to a subject. In some embodiments, the polynucleotide may be part of a viral vector capable of delivering the polynucleotide to a cell of the subject. The viral vector may comprise a viral inverted terminal repeat sequence that includes a viral origin of replication, enabling viral replication of the polynucleotide sequence. The viral vector may further comprise a viral capsid encapsulating the polynucleotide and facilitating delivery of the polynucleotide into the cell. A method of delivering a polynucleotide composition to a subject may comprise administering a viral vector comprising the polynucleotide to the subject. Upon delivery of the polynucleotide to the subject, a payload encoded by the polynucleotide may be transcribed in a cell of the subject in a cell type-, cell state-, or tissue type-dependent manner.
Further described herein are methods of treating a disease or condition by delivering a polynucleotide composition of the present disclosure to a subject and expressing a therapeutic protein or therapeutic polynucleotide encoded by the polynucleotide in the subject in a cell type-, cell state-, or tissue type-dependent manner. The therapeutic protein may be a wild type version of a protein mutated or under-expressed in the disease or condition. The therapeutic polynucleotide may be a polynucleotide (e.g., a gRNA or a tRNA) that targets a gene associated with the disease or condition. The polynucleotide composition may be delivered to the subject as part of a viral vector. The subject may have a disease or condition, for example a disease or condition caused by mutation or altered expression of a protein. In some embodiments, the polynucleotide composition may comprise a transgene encoding a wild type copy of the protein having the mutation or altered expression. The transgene may be selectively transcribed in a target cell type, cell state, or tissue type of the subject upon delivery of the polynucleotide composition to the subject. In some embodiments, a protein encoded by the transgene is expressed in the subject at a level dependent on the level of transcription of the transgene. Transcription of the transgene, expression of the protein encoded by the transgene, or both, in a cell type-, cell state-, or tissue type-dependent manner may treat the disease or condition in the subject.
Further described herein are methods of identifying a switchable core promoter as disclosed herein. The switchable core promoter may be an engineered core promoter as disclosed herein. A method of identifying a switchable core promoter may comprise: introducing a core promoter library comprising a first sub-library and a second sub-library to a population of cells; wherein the first sub-library comprises a plurality of core promoters, wherein a core promoter of the plurality of core promoters is linked to a first response element, and a unique barcode sequence; wherein the second sub-library comprises the plurality of core promoters, wherein the core promoter of the plurality of promoters is linked to a second response element and, a unique barcode sequence; and identifying a switchable core promoter as the core promoter that promotes higher transcription of the unique barcode when paired with the first response element than when paired with the second response element. The method may further comprise activating the first response element. The second response element may not be activated. The first response element may be an activated response element and the second enhancer sequence may be an inactive response element or an unactivated response element. The first response element may be specific for the population of cells. The population of cells may be neurons, kidney cells, liver cells, muscle cells, or cancer cells. The first response element may be specific for neurons, kidney cells, liver cells, muscle cells, or cancer cells. The second response element may be specific for neurons, kidney cells, liver cells, muscle cells, or cancer cells. The plurality of core promoters may comprise engineered core promoters, synthetic core promoters, wild type core promoters, variant core promoters, or combinations thereof.
As used herein, “switchable” may refer to the ability of a core promoter to be combined with various response elements (e.g., cell type-specific enhancers, tissue type-specific enhancers, or cell state-specific enhancers) to promote high levels of transcription in the presence of a response element in a transcriptionally active state (e.g., an activated response element) and low or no transcription in the presence of a response element in a transcriptionally inactive or repressed state (e.g., an unactivated response element or an inactive response element).
As used herein, “core promoter” or “core promoter sequence” may refer to a polynucleotide encoding a sequence that recruits transcriptional machinery (e.g., an RNA polymerase) to initiate transcription of a downstream polynucleotide coding for a sequence (e.g., a payload sequence), for example by binding general transcription factors (GTFs) that recruit an RNA polymerase. As used herein, “engineered core promoter” or “engineered core promoter sequence” may refer to a non-naturally occurring core promoter or core promoter sequence. In some embodiments, the “engineered core promoter” or “engineered core promoter sequence” may refer to a core promoter that has been mutated, altered, or engineered to have a moderate affinity for molecules (e.g., general transcription factors (GTFs)) that recruit transcriptional machinery (e.g., an RNA polymerase). More specifically, the moderate affinity is such that the concentration of these molecules (e.g., GTFs) in the nucleus results in little or no recruitment of transcriptional machinery to the engineered core promoter when in the absence of an activated response element or in the presence of an inactive response element. However, when in the presence of an activated response element, the activated response element causes an increase in the concentration of these molecules (e.g., GTFs) at the core promoter and the resulting increased localized concentration of these molecules (e.g., GTFs) at the core promoter overcomes the moderate affinity, allowing for higher recruitment of transcriptional machinery (e.g., an RNA polymerase) to initiate higher transcription of a downstream polynucleotide coding for a sequence (e.g., a payload sequence) compared the recruitment of transcription machinery and transcription level in the absence of the activated response element.
As used herein, “response element” may refer to a polynucleotide encoding a sequence that binds to a cognate ligand (e.g., a transcription factor), coactivator, corepressor, or combinations thereof in a sequence-dependent manner. An “active response element” may refer to a polynucleotide encoding a sequence that is capable of binding to a cognate ligand (e.g., a transcription factor), coactivator, corepressor, or combinations thereof in a sequence-dependent manner to facilitate binding of transcriptional machinery to a nearby core promoter, e.g., an engineered core promoter. An “inactive response element” may refer to a polynucleotide encoding a sequence that is not capable of binding to a cognate ligand (e.g., a transcription factor), coactivator, or combinations thereof in a sequence-dependent manner to facilitate binding of transcriptional machinery to a nearby core promoter, e.g., an engineered core promoter. An “activated response element” may be a response element bound to a cognate ligand (e.g., a transcription factor) or coactivator, or having decreased binding to a corepressor (as compared to when unactivated), or combinations thereof, that facilitates high binding of transcriptional machinery to a nearby core promoter, e.g., an engineered core promoter. An “unactivated response element” may be an active response element that is not bound to a cognate ligand (e.g., a transcription factor) or coactivator, or having increased binding to a corepressor (as compared to when activated), or combinations thereof that has low or does not facilitate binding of transcriptional machinery to a nearby core promoter, e.g., an engineered core promoter. For example, an activated response element may facilitate increased binding of transcriptional machinery to the nearby core promoter than an unactivated response element to the same core promoter.
As used herein, “enhancer” or “enhancer region” may refer to a response element that binds to a cognate ligand or coactivator, or has low binding to a corepressor to increase binding of transcriptional machinery to a nearby core promoter and, therefore, increase transcription of a nearby polynucleotide encoding a sequence of a payload, such as a transgene or therapeutic polynucleotide, wherein this binding is cell type-, cell state-, or tissue type-dependent. An “active enhancer” may refer to a polynucleotide encoding a sequence that is capable of cell type-, cell state-, or tissue type-dependent binding to a cognate ligand (e.g., a transcription factor), coactivator, or having decreased binding to a corepressor (as compared to when unactivated), or combinations thereof in a sequence-dependent manner that facilitates binding of transcriptional machinery to a nearby core promoter, e.g., an engineered core promoter, when in that cell type, cell state, or tissue type, respectively. An “inactive enhancer” may refer to a polynucleotide encoding a sequence that is not capable of cell type-, cell state-, or tissue type-dependent binding to a cognate ligand (e.g., a transcription factor), coactivator, or combinations thereof, in a sequence-dependent manner to facilitate binding of transcriptional machinery to a nearby core promoter, e.g., an engineered core promoter, when in that cell type, cell state, or tissue type, respectively. An “activated enhancer” may be an enhancer in its specific cell type, cell state, or tissue type so that it is bound to a cognate ligand (e.g., a transcription factor) or coactivator, or having decreased binding to a corepressor (as compared to when unactivated), or combinations thereof, that facilitates binding of transcriptional machinery to a nearby core promoter, e.g., an engineered core promoter in that specific cell type, cell state, or tissue type. An “unactivated enhancer” may be an active enhancer that is not in its specific cell type, cell state, or tissue type so that it is not bound to a cognate ligand (e.g., a transcription factor) or coactivator, or has increased binding to a corepressor (as compared to when activated), or combinations thereof, that has low or does not facilitate binding of transcriptional machinery to a nearby core promoter, e.g., an engineered core promoter.
A polynucleotide (e.g., an RNA or a DNA polynucleotide) may comprise a promoter to regulate or enhance transcription of a payload, such as a transgene or therapeutic polynucleotide. The promoter may be an engineered promoter. In some embodiments, the promoter may comprise a response element (e.g., an enhancer region) and a core promoter that functions as a site for preinitiation complex formation, or combinations thereof. The core promoter may be an engineered core promoter. In some embodiments, the sequence of the response element may comprise a protein binding sequence that binds one or more cognate ligands, coactivators, or corepressors in a sequence-dependent manner. For example, a response element may bind a transcription factor (TF) that enhances transcription of a nearby polynucleotide encoding a sequence of a payload, such as a transgene or therapeutic polynucleotide. In some embodiments, the core promoter may comprise an engineered core promoter to promote high levels of transcription in the presence of an active response element (e.g., an active enhancer) when activated and low or no transcription in the presence of an inactive response element (e.g., an inactive enhancer). In some embodiments, the core promoter may comprise an engineered core promoter to promote high levels of transcription in the presence of an activated response element (e.g., an activated enhancer) and low or no transcription in the presence of an unactivated response element (e.g., an unactivated enhancer). In some embodiments, the core promoter may comprise a synthetic promoter engineered to promote high levels of transcription in the presence of an activated response element (e.g., an activated enhancer) and low or no transcription in the presence of an inactive response element (e.g., an inactive enhancer).
The elements within the promoter (e.g., the response element or the core promoter) may be selected or engineered to alter transcription rates of a downstream polynucleotide encoding for a payload. In some embodiments, the promoter may be selected for ubiquitous transcription. For example, the promoter may be selected to promote high levels of transcription in any cell type, tissue type, or cell state. In some embodiments, the promoter may be selected for cell type-specific transcription. In some embodiments, the promoter may be selected for tissue type-specific transcription. In some embodiments, the promoter may be selected for cell state-specific transcription. For example, the promoter may be selected to promote high levels of transcription in a target cell or tissue type (e.g., excitatory neurons or kidney tissue) or a target cell state (e.g., a diseased cell state) and low levels or no transcription in non-target cell types or states (e.g., non-neuronal cells, non-kidney tissue, or non-diseased cells). In some embodiments, the target tissue may be nervous tissue, kidney tissue, eye tissue, muscle tissue, blood, skin, fat, bone, cancerous tissue, thymus tissue, gastrointestinal tissue (e.g., stomach, intestine), spleen tissue, placenta tissue, pancreatic tissue, lung tissue, liver tissue, cardiac tissue, or brain tissue (e.g., cerebrum, cerebellum, adrenal). In some embodiments, the target cell type may be neuron, renal cell, hepatocyte, podocyte, retinal cell, epithelial cell, muscle cell, erythrocyte, platelet, bone marrow cell, endothelial cell, epidermal cell, lymphocyte, glial cell, interstitial cell, adipocyte, or fibroblast. In some embodiments, the target cell state may be a diseased cell, an actively dividing cell, a quiescent cell, or a mutated cell. In some embodiments, the target cell may be a specific state of a cell. For example, for a CAR T cell, the target state of a cell may be activated and a non-target state may be inactive (e.g., naïve).
In some embodiments, a promoter may promote cell type-specific transcription if it promotes transcription of polynucleotides encoding for a payload (i.e., a payload sequence) in a target cell type at a level that is at least about 0.1-fold, at least about 0.25-fold, at least about 0.5-fold, at least about 0.75-fold, at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold a transcription level of the payload sequence in a non-target cell type. In some embodiments, a promoter may promote cell state-specific transcription if it promotes transcription of a payload sequence in a target cell state at a level that is at least about 0.1-fold, at least about 0.25-fold, at least about 0.5-fold, at least about 0.75-fold, at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold a transcription level of the payload sequence in a non-target cell state.
In some embodiments, a promoter may promote a desired level of transcription if it promotes transcription of polynucleotides encoding for a payload (i.e., a payload sequence) in a target cell type at a level that is at least about-0.75 fold, at least about-0.5 fold, at least about-0.25-fold, at least about-0.1-fold, at least about 0-fold, at least about 0.1-fold, at least about 0.25-fold, at least about 0.5-fold, at least about 0.75-fold, at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold a transcription level of an endogenous version of the payload sequence in a target cell type. In some embodiments, a promoter may promote a desired level of transcription of a payload sequence if it promotes transcription of the payload sequence in a target cell state at a level that is at least about-0.75 fold, at least about-0.5 fold, at least about-0.25-fold, at least about-0.1-fold, at least about 0-fold, at least about 0.1-fold, at least about 0.25-fold, at least about 0.5-fold, at least about 0.75-fold, at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold a transcription level of an endogenous version of the payload sequence in a target cell state.
The promoter of a polynucleotide may comprise a response element, such as an enhancer region. The response element may bind to a sequence-specific protein, such as a cognate ligand (e.g., a transcription factor), a coactivator, and/or a corepressor to modulate transcription. For example, the response element may recruit a cognate ligand (e.g., a transcription factor), a coactivator, and/or a corepressor that enhances, represses, or alters transcription of a downstream polynucleotide encoding a sequence of a payload, such as a transgene or therapeutic polynucleotide. In some embodiments, the response element may function to enhance the rate of transcription upon binding of a cognate ligand or coactivator and/or upon decreased binding of a corepressor. For example, a response element may comprise a transcription factor binding site that binds transcription factors that enhance transcription of nearby polynucleotides comprising coding sequences. In some embodiments, a response element may be an activated response element or inactivated response element, in which an activated response element is a response element bound to a transcription factor that enhances transcription and that may enhance transcription initiation by a core promoter relative to a response element without a transcription factor bound and an inactivated response element is a response element that is not bound to a transcription factor and therefore it does not enhance transcription or enhance transcription initiation by a core promoter. In some embodiments, the response element may function to alter the rate of transcription upon binding of a cognate ligand a coactivator, and/or a corepressor. An enhancer region may be a response element that functions to enhance the rate of transcription upon binding of a cognate ligand or coactivator or decreased binding of a corepressor, wherein this binding of a cognate ligand or coactivator or decreased binding of a corepressor and subsequent enhancement of the rate of transcription is cell type-, cell state-, or tissue type-dependent. For example, an enhancer region may comprise a transcription factor binding site that binds one or more transcription factors that enhance transcription of nearby polynucleotides comprising coding sequences, wherein the presence of the one or more transcription factors is cell type-, cell state-, or tissue type-dependent. In some embodiments, transcriptional enhancement by the enhancer region may be regulated by one or more transcription factors, such as cell type-, cell state-, or tissue type-specific transcription factors, and therefore be cell type-, cell state-, or tissue type-dependent. For example, the enhancer region may be unactivated when in an unbound state and may be activated upon binding of one or more transcription factors, wherein this binding is dependent on the presence of the one or more transcription factors and wherein the presence of the one or more transcription factors is cell type-, cell state-, or tissue type-dependent. Alternatively or in addition, the enhancer region may be unactivated when in a bound state to one or more corepressors and may be activated upon decreased or no binding of the one or more corepressors, wherein this binding is dependent on cell type-, cell state-, or tissue type-dependent. The activated enhancer may enhance recruitment of a polymerase to a core promoter relative to the unactivated enhancer. The activated enhancer may enhance recruitment of a polymerase to a core promoter relative to the inactive enhancer.
A response element (e.g., an enhancer region) may be paired with a core promoter, such as an engineered core promoter described herein, to promote transcription of a polynucleotide encoding a payload. The sequence of the response element may be positioned in a polynucleotide sequence such that it affects recruitment of a polymerase to the core promoter. For example, the response element (e.g., an enhancer region) may be positioned 5′ of the core promoter, or the response element may be positioned 3′ of the polynucleotide encoding the payload.
A response element (e.g., an enhancer region) may comprise a sequence that binds to cognate ligands or coactivators expressed in a target cell type (e.g., neurons, renal cells, hepatocytes, podocytes, retinal cells, epithelial cells, muscle cells, erythrocytes, platelets, bone marrow cells, endothelial cells, epidermal cells, lymphocytes, glial cells, interstitial cells, adipocytes, or fibroblasts) or cell state (e.g., diseased cells or healthy cells) at higher levels than a non-target cell type or non-target cell state. Alternatively or in addition, a response element (e.g., an enhancer region) may comprise a sequence that binds to corepressors expressed in a target cell type or cell state at lower levels than a non-target cell type or non-target cell state. For example, an enhancer region of a promoter for expressing a payload in a kidney cell may comprise a transcription factor binding sequence that binds to a kidney-specific transcription factor. The kidney-specific enhancer region may enhance transcription initiation when bound to the kidney-specific transcription factor in the kidney cells, increasing transcription of the payload in kidney cells relative to non-kidney cells. In another example, an enhancer region of a promoter for expressing a payload in a central nervous system (CNS) cell may comprise a transcription factor binding sequence that binds to a CNS-specific transcription factor. The CNS-specific enhancer region may enhance transcription initiation when bound to the CNS-specific transcription factor in CNS cells, increasing transcription of the payload in CNS cells relative to non-CNS cells. As a further example, an enhancer region for expressing a payload in a liver cell may comprise a transcription factor binding sequence that binds to a liver-specific transcription factor. The liver-specific enhancer region may enhance transcription initiation when bound to the liver-specific transcription factor in liver cells, increasing transcription of the payload in liver cells relative to non-liver cells. Examples of response elements are provided in TABLE 1.
A response element may comprise a sequence having at least 80%, at least 85%, at least 90%, at least 93%, at least 95%, at least 97%, at least 99%, or 100% sequence identity to SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194. The response element may comprise a sequence having at least 90% sequence identity to SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194. The response element may comprise a sequence of SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194. In some embodiments, a response element may be an engineered response element. For example, an engineered response element may comprise a sequence of SEQ ID NO: 145 or SEQ ID NO: 146. In some embodiments, the response element may be an enhancer region. In some embodiments, the response element may be a liver-specific enhancer region, wherein the enhancer region is activated in liver cells. For example, a liver-specific enhancer region may comprise a sequence of SEQ ID NO: 146 or SEQ ID NO: 197. In some embodiments, a liver-specific enhancer/promoter may comprise a sequence of SEQ ID NO: 163-SEQ ID NO: 177. In some embodiments, the response element may be a bone marrow-specific enhancer, wherein the enhancer region is activated in bone marrow cells. For example, a bone marrow-specific enhancer region may comprise a sequence of SEQ ID NO: 145 or SEQ ID NO: 193. In some embodiments, a bone marrow-specific enhancer region/promoter may comprise a sequence of SEQ ID NO: 148-SEQ ID NO: 162.
The response elements (e.g., a response element of SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194) may be appended to core promoter or promoter for expression of a polynucleotide comprising a payload sequence (e.g., a transgene encoding a therapeutic protein or a sequence encoding a therapeutic polynucleotide) to promote cell type-, tissue type-, or cell state-specific transcription of a polynucleotide encoding for a payload (i.e., a payload sequence). In some embodiments, response elements may be combined or used in combination with other response elements to tune transcriptional levels of the payload sequence. In some embodiments, the 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more response elements may be combined to tune transcriptional levels of the payload sequence. The combined 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more response elements may all be the same response element, different response elements, or any combination thereof. For example, a promoter may comprise a combination of sequences for transcription enhancement and sequences for transcription repression (e.g., via increased binding to a corepressor) to tune payload expression in the cell type or cell state of interest. The response element (e.g., a response element of SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194) may be paired with a core promoter (e.g., a core promoter of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) to form an engineered promoter. The engineered promoter may facilitate cell-specific payload expression under transcriptional control of the engineered promoter.
The promoter of a polynucleotide may comprise a core promoter that facilitates recruitment of transcription machinery and initiation of transcription. The core promoter may be an engineered core promoter. In some embodiments, the core promoter may be engineered for switchable activity when paired with a cell-specific response element (e.g., a cell-specific enhancer). A switchable core promoter may exhibit low or no activity (low or no transcriptional activation) when paired with a cell-specific response element (e.g., a cell-specific enhancer) that is in a cell that is not its specific cell type (e.g., a cell type that is not the specific cell type of the cell-specific response element), and may exhibit high activity (high transcriptional activation) when paired with that same cell-specific response element (e.g., that same cell-specific enhancer), but in a cell that is its specific cell type (e.g., a cell type that is the specific cell type of that same cell-specific response element), and then that same switchable core promoter may exhibit low or no activity (low or no transcriptional activation) when paired with a different cell-specific response element (e.g., a different cell-specific enhancer) that is in a cell that is not its specific cell type (e.g., a cell type that is not the specific cell type of the different cell-specific response element), and may exhibit high activity (high transcriptional activation) when paired with that same different cell-specific response element (e.g., the same different cell-specific enhancer), but in a cell that is its specific cell type (e.g., a cell type that is the specific cell type of the same different cell-specific response element). For example, a switchable core promoter (e.g., any of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) paired with a neuron-specific enhancer may exhibit high transcriptional activity in neurons and low transcriptional activity in non-neuronal cells (e.g., kidney cells, liver cells, bone marrow cells, etc.). In another example, that switchable core promoter (e.g., any of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) paired with a hepatocyte-specific enhancer may exhibit high transcriptional activity in hepatocytes and low transcriptional activity in non-hepatocytes (e.g., kidney cells, neurons, bone marrow cells, etc.). In some embodiments, the core promoter may be engineered for switchable activity when paired with a tissue-specific response element (e.g., a tissue-specific enhancer). A switchable core promoter may exhibit low or no activity (low or no transcriptional activation) when paired with a tissue-specific response element (e.g., a tissue-specific enhancer) that is in a cell that is not its specific tissue type (e.g., a cell that is not the specific tissue type of the tissue-specific response element), and may exhibit high activity (high transcriptional activation) when paired with that same tissue-specific response element (e.g., that same tissue-specific enhancer), but in a cell that is its specific tissue type (e.g., a cell that is the specific tissue type of that same tissue-specific response element), and then that same switchable core promoter may exhibit low or no activity (low or no transcriptional activation) when paired with a different tissue-specific response element (e.g., a different tissue-specific enhancer) that is in a cell that is not its specific tissue type (e.g., a cell that is not the specific tissue type of the different tissue-specific response element), and may exhibit high activity (high transcriptional activation) when paired with that same different tissue-specific response element (e.g., that same different tissue-specific enhancer), but in a cell that is its specific tissue type (e.g., a cell that is the specific tissue type of that same different tissue-specific response element). For example, a switchable core promoter (e.g., any of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) paired with a CNS-specific enhancer may exhibit high transcriptional activity in cells of the CNS and low transcriptional activity in non-CNS cells (e.g., cells of the kidney, liver, bone marrow, etc.). In another example, that switchable core promoter (e.g., any of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) paired with a liver-specific enhancer may exhibit high transcriptional activity in liver cells and low transcriptional activity in non-liver cells (e.g., cells of the kidney, CNS, bone marrow, etc.). A switchable core promoter may exhibit low or no activity (low or no transcriptional activation) when paired with a cell state-specific response element (e.g., a cell state-specific enhancer) that is in a cell in a specific cell state that is not its specific cell state (e.g., a cell not in a specific cell state of the cell state-specific response element), and may exhibit high activity (high transcriptional activation) when paired with that same cell state-specific response element (e.g., that same cell state-specific enhancer), but in a cell that is in its specific cell state (e.g., a cell that is in the specific cell state of that same cell state-specific response element), and then that same switchable core promoter may exhibit low or no activity (low or no transcriptional activation) when paired with a different cell state-specific response element (e.g., a different cell state-specific enhancer) that is in a cell in a specific cell state that is not its different specific cell state (e.g., a cell not in a specific cell state of the different cell state-specific response element), and may exhibit high activity (high transcriptional activation) when paired with that same different cell state-specific response element (e.g., that same different cell state-specific enhancer), but in a cell that is in its specific different cell state (e.g., a cell that is in the specific cell state of that same different cell state-specific response element). For example, a switchable core promoter (e.g., any of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) paired with a disease-specific enhancer may exhibit high transcriptional activity in cells exhibiting that disease and low transcriptional activity in cells not exhibiting that disease. In another example, that switchable core promoter (e.g., any of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) paired with a Rett-specific enhancer may exhibit high transcriptional activity in neurons expressing mutant MeCP2 associated with Rett Syndrome and low transcriptional activity in neurons expressing functional MeCP2 associated with a normal neuronal phenotype. A switchable core promoter may exhibit high activity when paired with a enhancer that is cell type-, cell state-, or tissue type-dependent and in that specific cell type, cell state, or tissue type. A switchable core promoter may exhibit a wide dynamic range of activity when paired with a cell type-, cell state-, or tissue type-specific enhancer when in that cell type, cell state, or tissue type as compared to when not in that cell type, cell state, or tissue type, respectively.
In some embodiments, the core promoter may be positioned downstream (i.e., 3′) of a response element. In some embodiments, the core promoter may be positioned upstream (i.e., 5′) of a polynucleotide encoding a payload sequence (e.g., a transgene or sequence encoding a therapeutic polynucleotide). The core promoter may recruit polymerase or proteins that bind to polymerases to initiate transcription of a sequence downstream of the core promoter. In some embodiments, the core promoter may recruit an RNA polymerase (e.g., RNA polymerase II) or a TATA binding protein (TBP) that recruits an RNA polymerase when in combination with a response element bound to cognate ligands, coactivators, or corepressors. For example, the core promoter may bind to general transcription factors (GTFs) which recruit RNA polymerase II (Pol II) to initiate transcription. The ability of the core promoter to recruit transcription machinery (e.g., an RNA polymerase) or the affinity of the core promoter for the transcription machinery may affect transcription levels. In some embodiments, the core promoter may be altered to tune transcription levels by altering recruitment of and/or affinity for transcription machinery. In some embodiments, an engineered core promoter is a mutated or altered core promoter that has a decreased ability to recruit transcription machinery (e.g., an RNA polymerase) and/or decreased affinity for the transcription machinery as compared to a corresponding unmutated or unaltered core promoter. In some embodiments, an engineered core promoter is a de novo synthetic core promoter that has a decreased ability to recruit transcription machinery (e.g., an RNA polymerase) and/or decreased affinity for the transcription machinery as compared to other core promoters, such as a core promoter of SEQ ID NO: 5 or SEQ ID NO: 9. In some embodiments, the ability of this engineered core promoter to initiate transcriptional activity is dependent on being paired with a response element, wherein the initiation of transcriptional activity occurs when the response element binds to cognate ligands and/or coactivators, and is not initiated or initiated at a low basal level when the response element is not bound to cognate ligands and/or coactivators, or optionally, is bound to corepressors.
In some embodiments, the core promoter is a minimal synthetic core promoter, such as a minP core promoter (SEQ ID NO: 5) or a variant of a minP core promoter (e.g., SEQ ID NO: 4 or SEQ ID NO: 15). Core promoters may be selected or engineered for one or more desired transcriptional properties, such as transcription level, cell type specificity, cell state specificity, tissue specificity, or cell genotype specificity. In some embodiments, a core promoter may be engineered to selectively promote transcription initiation when paired with a cell type-, cell state-, or tissue type-specific response element (e.g., a cell type-, cell state-, or tissue type-specific enhancer). Engineering a core promoter may comprise screening variants of a core promoter for transcription level, cell type specificity, cell state specificity, tissue specificity, or cell genotype specificity.
The core promoters described herein may comprise engineered sequences that serve as a promoter switch that can be selectively activated in a target cell type, target tissue type, or target cell state. A switchable core promoter may readily promote transcription initiation when paired with a cell type-, tissue type-, or cell state-specific response element in the target cell type, target tissue type, or target cell state, respectively, while exhibiting low background transcription initiation in non-target cell types, non-target tissue types, or non-target cell states, respectively. The switchable core promoter may be used in combination with a variety of response elements or other response elements to promote cell type-, tissue type-, or cell state-specific transcription in a variety of cell types, tissue types, or cell states.
A switchable core promoter may promote transcription initiation in a target cell type, tissue type, or target cell state at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation in a non-target cell type, non-target tissue type, or non-target cell state, when paired with a response element specific for the target cell type, target tissue type, or target cell state. In some embodiments, a switchable core promoter may promote transcription initiation in a target cell type, tissue type, or target cell state when paired with a response element specific for the target cell type, target tissue type, or target cell state at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation in the target cell type, target tissue type, or target cell state, when paired with a response element not specific for the target cell type, target tissue type, or target cell state.
In some embodiments, a switchable core promoter may promote transcription initiation when paired with an active response element at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation compared to the switchable core promoter when paired with an inactive response element. In some embodiments, a switchable core promoter may promote transcription initiation in a target cell type, target tissue type, or target cell state when paired with a response element specific for the target cell type, target tissue type, or target cell state at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation in a target cell type, tissue type, or target cell state compared to the switchable core promoter when paired with an inactive response element. A switchable core promoter may promote transcription initiation when paired with a response element bound to a cognate ligand or coactivator (e.g., an enhancer bound to a cognate ligand or coactivator) (an activated response element) at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation when paired with that response element not bound to a cognate ligand or coactivator (e.g., an enhancer not bound to a cognate ligand or coactivator) (an unactivated response element). The activated response element may be an enhancer specific for a target cell type, target tissue type, or target cell state in that target cell type, target tissue type, or target cell state, respectively. In some embodiments, the unactivated response element may be an enhancer that is not specific for the cell type, tissue type, or cell state that it is in.
A switchable core promoter may promote transcription initiation when paired with a response element bound to a cognate ligand or coactivator (e.g., an enhancer bound to a cognate ligand or coactivator) and/or having decreased binding to a corepressor at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation when paired with that response element not bound to a cognate ligand or coactivator (e.g., an enhancer not bound to a cognate ligand or coactivator) and/or having increased binding to a corepressor.
A switchable core promoter may promote transcription initiation when paired with a response element capable of being bound to a cognate ligand or coactivator (e.g., an enhancer bound to a cognate ligand or coactivator) (an active response element) that is bound to a cognate ligand or coactivator (e.g., an enhancer bound to a cognate ligand or coactivator) (an activated response element) at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation when paired with a response element not capable of being bound to a cognate ligand or coactivator (e.g., an enhancer not capable of being bound to a cognate ligand or coactivator) (an inactive response element), and therefore which may not be bound to a cognate ligand or coactivator (e.g., an enhancer not bound to a cognate ligand or coactivator) (an unactivated response element). The active response element may be specific for a target cell type, target tissue type, or target cell state, and when in that target cell type, target tissue type, or target cell state, respectively, may be an activated response element. The unactivated response element may be specific for a target cell type, target tissue type, or target cell state, but that may not be in that specific target cell type, target tissue type, or target cell state, respectively. The inactive response element may be a response element that may not be capable of activation regardless of the cell type, tissue type, or cell state it is in, and therefore, may always be an unactivated response element in a cell.
A switchable core promoter may promote transcription initiation in a target cell type, tissue type, or target cell state at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation in a non-target cell type, non-target tissue type, or non-target cell state, when paired with an enhancer specific for the target cell type, target tissue type, or target cell state. In some embodiments, a switchable core promoter may promote transcription initiation in a target cell type, tissue type, or target cell state when paired with an enhancer specific for the target cell type, target tissue type, or target cell state at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation in the target cell type, target tissue type, or target cell state, when paired with an enhancer not specific for the target cell type, target tissue type, or target cell state.
In some embodiments, a switchable core promoter may promote transcription initiation when paired with an active enhancer at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation compared to the switchable core promoter when paired with an inactive enhancer. In some embodiments, a switchable core promoter may promote transcription initiation in a target cell type, tissue type, or target cell state when paired with an enhancer specific for the target cell type, target tissue type, or target cell state at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation in a target cell type, tissue type, or target cell state compared to the switchable core promoter when paired with an inactive enhancer.
A switchable core promoter may promote transcription initiation when paired with an enhancer bound to a cognate ligand or coactivator (an activated enhancer) at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation when paired with that enhancer not bound to a cognate ligand or coactivator (an unactivated enhancer). The activated enhancer may be specific for a target cell type, target tissue type, or target cell state in that target cell type, target tissue type, or target cell state, respectively. In some embodiments, the unactivated enhancer may be not specific for the cell type, tissue type, or cell state that it is in.
A switchable core promoter may promote transcription initiation when paired with an enhancer bound to a cognate ligand or coactivator and/or having decreased binding to a corepressor at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation when paired with that enhancer not bound to a cognate ligand or coactivator and/or having increased binding to a corepressor.
A switchable core promoter may promote transcription initiation when paired with an enhancer capable of being bound to a cognate ligand or coactivator (an active enhancer) that is bound to a cognate ligand or coactivator (an activated enhancer) at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation when paired with an enhancer not capable of being bound to a cognate ligand or coactivator (an inactive enhancer), and therefore which is not bound to a cognate ligand or coactivator (an unactivated enhancer). The active enhancer may be specific for a target cell type, target tissue type, or target cell state, and when in that target cell type, target tissue type, or target cell state, respectively, may be an activated enhancer. The unactivated enhancer may be specific for a target cell type, target tissue type, or target cell state, but that may not be in that specific target cell type, target tissue type, or target cell state, respectively. The inactive enhancer may not be capable of activation regardless of the cell type, tissue type, or cell state it is in, and therefore, may always an unactivated enhancer in a cell.
The engineered core promoters of the presence disclosure provide high dynamic range, also referred to as fold activation, relative to endogenous core promoters. In some embodiments, dynamic range may describe the transcriptional activity of a core promoter when paired with an activated response element relative to the transcriptional activity of the core promoter when paired with an unactivated response element. A core promoter with a high dynamic range may promote transcription initiation in a cell at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation in the cell having an endogenous core promoter, paired with the same response element. In some embodiments, a core promoter with a high dynamic range may promote transcription initiation when paired with an activated response element at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation when paired with an inactive response element.
The engineered core promoters of the presence disclosure provide high dynamic range, also referred to as fold activation, for improved cell type-, tissue type-, or cell state-specificity relative to endogenous core promoters. In some embodiments, dynamic range may describe the transcriptional activity of a core promoter when paired with an activated response element (e.g., an enhancer bound to a cognate ligand or coactivator) relative to the transcriptional activity of the core promoter when paired with an inactive response element. A core promoter with a high dynamic range may promote transcription initiation in a target cell type, target tissue type, or target cell state at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation in a non-target cell type, non-target tissue type, or non-target cell state. In some embodiments, a core promoter with a high dynamic range may promote transcription initiation when paired with an activated response element at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation when paired with an inactive response element. In some embodiments, a core promoter with a high dynamic range may promote transcription initiation when paired with an activated enhancer at a rate that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold that of the rate of transcription initiation when paired with an unactivated enhancer.
A core promoter may comprise one or more sequence elements. In some embodiments, a core promoter may comprise, from 5′ to 3′, a polynucleotide encoding a TATA box sequence, a spacer sequence, and an initiator element sequence. In some embodiments, a core promoter may comprise, from 5′ to 3′, a TATA box, a spacer, and an initiator element. In some embodiments, a core promoter may further comprise a transcriptional pause site, a YY1 transcription factor binding motif, or a combination thereof. In some embodiments, a core promoter may comprise a transcription factor binding motif. For example, the spacer sequence may comprise a transcription factor binding sequence. In some embodiments, sequence elements within a core promoter may be configured as shown in
A core promoter may comprise one or more sequence elements, such as a TATA box (e.g., TATAAA), an initiator element, an RNA polymerase binding sequence, a B recognition element (BRE, e.g., G/C,G/C,G/A,CGCC), a CCAAT box or CAT box (e.g., GGCCAATCT), or a Pribnow box (e.g., TATAAT). In some embodiments, a core promoter may comprise a TATA box (e.g., an engineered TATA box of SEQ ID NO: 199-SEQ ID NO: 201 or SEQ ID NO: 213-SEQ ID NO: 215), for example as shown in SEQ ID NO: 4 and SEQ ID NO: 15. In some embodiments, a core promoter may comprise a TATA box sequence having at least about 70%, at least about 72%, at least about 75%, at least about 78%, at least about 80%, at least about 82%, at least about 85%, at least about 87%, at least about 90%, at least about 92%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 199-SEQ ID NO: 201 or SEQ ID NO: 213-SEQ ID NO: 215.
In some embodiments, a core promoter may comprise an initiator element (e.g., an engineered initiator element of SEQ ID NO: 202-SEQ ID NO: 204, SEQ ID NO: 216, or SEQ ID NO: 217). In some embodiments, a core promoter may comprise an initiator element having at least about 70%, at least about 72%, at least about 75%, at least about 78%, at least about 80%, at least about 82%, at least about 85%, at least about 87%, at least about 90%, at least about 92%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 202-SEQ ID NO: 204, SEQ ID NO: 216, or SEQ ID NO: 217.
In some embodiments, a core promoter may comprise a long spacer (e.g., an engineered long spacer between a TATA box and an initiator element, for example as shown in SEQ ID NO: 3 and SEQ ID NO: 69-SEQ ID NO: 108. A long spacer may have a length of no less than 31 nucleotide residues from the TATA box to the initiator element, wherein the number of nucleotide residues from the TATA box to the initiator element are counted from the first T in the TATA box to the A in the CAG in the initiator element (or the corresponding CAG in the parent sequence if the initiator element is mutated), inclusive. In some embodiments, a long spacer may have a length of from 31 to 48 nucleotide residues from the TATA box to the initiator element. In some embodiments, a long spacer may have a length of from 31 to 44 nucleotide residues from the TATA box to the initiator element. In some embodiments, a long spacer may have a length of from 31 to 42 nucleotide residues from the TATA box to the initiator element. In some embodiments, a long spacer may have a length of from 31 to 37 nucleotide residues or from 31 to 35 nucleotide residues from the TATA box to the initiator element. For example, a long spacer may have a length of 32 nucleotide residues from the TATA box to the initiator element. In some embodiments, a core promoter may comprise an engineered a short spacer (e.g., an engineered short spacer) between a TATA box and an initiator element, wherein the number of nucleotide residues from the TATA box to the initiator element are counted from the first T in the TATA box to the A in the CAG in the initiator element (or the corresponding CAG in the parent sequence if the initiator element is mutated), inclusive, for example as shown in SEQ ID NO: 7, SEQ ID NO: 10, and SEQ ID NO: 109-SEQ ID NO: 125. A short spacer may have a length of no more than 28 nucleotide residues from the TATA box to the initiator element. In some embodiments, a short spacer may have a length of from 22 to 29 nucleotide residues or from 24 to 29 nucleotide residues from the TATA box to the initiator element. For example, a short spacer may have a length of 28 nucleotide residues from the TATA box to the initiator element. In some embodiments, a core promoter may comprise an engineered spacer between a TATA box and an initiator element, for example as shown in SEQ ID NO: 13, and SEQ ID NO: 126-SEQ ID NO: 144. A spacer may have a length of about 30 nucleotide residues from the TATA box to the initiator element. Spacing between TATA and Inr elements of select core promoters are illustrated in
In some embodiments, a core promoter may comprise a transcription pause site (e.g., SEQ ID NO: 196-SEQ ID NO: 198, SEQ ID NO: 211, or SEQ ID NO: 212), for example as shown in SEQ ID NO: 1, SEQ ID NO: 11, SEQ ID NO: 14, and SEQ ID NO: 16-SEQ ID NO: 21. In some embodiments, a core promoter may comprise a transcription pause site having at least about 70%, at least about 72%, at least about 75%, at least about 78%, at least about 80%, at least about 82%, at least about 85%, at least about 87%, at least about 90%, at least about 92%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 196-SEQ ID NO: 198, SEQ ID NO: 211, or SEQ ID NO: 212.
In some embodiments, a core promoter may comprise a YY1 motif (e.g., SEQ ID NO: 205-SEQ ID NO: 207), for example as shown in SEQ ID NO: 2, SEQ ID NO: 12, and SEQ ID NO: 31-SEQ ID NO: 68. In some embodiments, a core promoter may comprise a YY1 motif having at least about 70%, at least about 72%, at least about 75%, at least about 78%, at least about 80%, at least about 82%, at least about 85%, at least about 87%, at least about 90%, at least about 92%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 205-SEQ ID NO: 207. In some embodiments, a core promoter may comprise a YY1 motif (e.g., SEQ ID NO: 205-SEQ ID NO: 207), for example as shown in SEQ ID NO: 2, SEQ ID NO: 12, and SEQ ID NO: 31-SEQ ID NO: 68. In some embodiments, a core promoter may comprise a YY1 motif having at least about 70%, at least about 72%, at least about 75%, at least about 78%, at least about 80%, at least about 82%, at least about 85%, at least about 87%, at least about 90%, at least about 92%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 205-SEQ ID NO: 207 and is capable of binding to YY1. In some embodiments, an engineered core promoter sequence may comprise one or more mutations relative to a founder core promoter sequence. In some embodiments, an engineered core promoter sequence may comprise one or more point mutations (e.g., T15A, T15G, A40T, or A40C) relative to SEQ ID NO: 5 (TAGAGGGTATATAATGGAAGCTCGACTTCCAGCTTGGCAATCCGG), for example as shown in SEQ ID NO: 4 and SEQ ID NO: 15. In some embodiments, an engineered core promoter sequence may comprise one or more point mutations relative to SEQ ID NO: 9 (TCTAGAGGGTATATAATGGGGGCCACTAGTCTACTACCAGAAAG). In some embodiments, an engineered core promoter sequence may comprise one or more point mutations relative to SEQ ID NO: 210
In an embodiment, a core promoter may comprise a TATA box sequence having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 213, a spacer, an initiator element having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 202, and a pause site having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 198, optionally the spacer may be of 29 to 33 nucleotides.
In an embodiment, a core promoter may comprise a TATA box sequence having the sequence of SEQ ID NO: 213, a spacer of 31 nucleotides, an initiator element having the sequence of SEQ ID NO: 202, and a pause site of SEQ ID NO: 198.
In an embodiment, a core promoter may comprise a TATA box sequence having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 200, a spacer, an initiator element having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 203, and a YY1 motif having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 205, optionally the spacer may be of 29 to 33 nucleotides.
In an embodiment, a core promoter may comprise a TATA box sequence having the sequence of SEQ ID NO: 200, a spacer of 31 nucleotides, an initiator element having the sequence of SEQ ID NO: 203, and a YY1 motif of SEQ ID NO: 205.
In an embodiment, a core promoter may comprise a TATA box sequence having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 201, a spacer, and an initiator element having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 202, optionally the spacer may be of 31 to 35 nucleotides.
In an embodiment, a core promoter may comprise a TATA box sequence having the sequence of SEQ ID NO: 201, a spacer of 33 nucleotides, and an initiator element having the sequence of SEQ ID NO: 202.
In an embodiment, a core promoter may comprise a TATA box sequence having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 214, a spacer, and an initiator element having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 216, optionally the spacer may be of 30 to 34 nucleotides.
In an embodiment, a core promoter may comprise a TATA box sequence having the sequence of SEQ ID NO: 214, a spacer of 32 nucleotides, and an initiator element having the sequence of SEQ ID NO: 216.
In an embodiment, a core promoter may comprise a TATA box sequence having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 213, a spacer, and an initiator element having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 204, optionally the spacer may be of 27 to 31 nucleotides.
In an embodiment, a core promoter may comprise a TATA box sequence having the sequence of SEQ ID NO: 213, a spacer of 29 nucleotides, and an initiator element having the sequence of SEQ ID NO: 204.
In an embodiment, a core promoter may comprise a TATA box sequence having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 201, a spacer, and an initiator element having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 202, optionally the spacer may be of 27 to 31 nucleotides.
In an embodiment, a core promoter may comprise a TATA box sequence having the sequence of SEQ ID NO: 201, a spacer of 29 nucleotides, and an initiator element having the sequence of SEQ ID NO: 202.
In an embodiment, a core promoter may comprise a TATA box sequence having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 213, a spacer, an initiator element having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 203, and a pause site having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 211, optionally the spacer may be of 29 to 33 nucleotides.
In an embodiment, a core promoter may comprise a TATA box sequence having the sequence of SEQ ID NO: 213, a spacer of 31 nucleotides, an initiator element having the sequence of SEQ ID NO: 203, and a pause site having the sequence of SEQ ID NO: 211.
In an embodiment, a core promoter may comprise a TATA box sequence having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 200, a spacer, an initiator element having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 203, and a YY1 motif having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 206, optionally the spacer may be of 29 to 33 nucleotides.
In an embodiment, a core promoter may comprise a TATA box sequence having the sequence of SEQ ID NO: 200, a spacer of 31 nucleotides, an initiator element having the sequence of SEQ ID NO: 203, and a YY1 motif having the sequence of SEQ ID NO: 206.
In an embodiment, a core promoter may comprise a TATA box sequence having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 201, a spacer, and an initiator element having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 204, optionally the spacer may be of 27 to 31 nucleotides.
In an embodiment, a core promoter may comprise a TATA box sequence having the sequence of SEQ ID NO: 201, a spacer of 29 nucleotides, and an initiator element having the sequence of SEQ ID NO: 204.
In an embodiment, a core promoter may comprise a TATA box sequence having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 200, a spacer, an initiator element having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 203, and a pause site having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 212, optionally the spacer may be of 29 to 33 nucleotides.
In an embodiment, a core promoter may comprise a TATA box sequence having the sequence of SEQ ID NO: 200, a spacer of 31 nucleotides, an initiator element having the sequence of SEQ ID NO: 203, and a pause site having the sequence of SEQ ID NO: 212.
In an embodiment, a core promoter may comprise a TATA box sequence having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 215, a spacer, and an initiator element having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 217, optionally the spacer may be of 30 to 34 nucleotides.
In an embodiment, a core promoter may comprise a TATA box sequence having the sequence of SEQ ID NO: 215, a spacer of 32 nucleotides, and an initiator element having the sequence of SEQ ID NO: 217.
In an embodiment, a core promoter may comprise TATA box sequence having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 200, a spacer, and an initiator element having at least about 70%, at least about 80%, at least about 90%, at least about 95%, at least about 97%, or about 100% sequence identity to SEQ ID NO: 202, optionally the spacer may be of 27 to 31 nucleotides.
In an embodiment, a core promoter may comprise TATA box sequence having the sequence of SEQ ID NO: 200, a spacer of 29 nucleotides, and an initiator element having the sequence of SEQ ID NO: 202.
In some embodiments, a switchable core promoter sequence may comprise a sequence of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144. In some embodiments, a switchable core promoter sequence may comprise a sequence having at least about 70%, at least about 72%, at least about 75%, at least about 78%, at least about 80%, at least about 82%, at least about 85%, at least about 87%, at least about 90%, at least about 92%, at least about 93%, at least about 95%, at least about 97%, at least about 98%, at least about 99%, or about 100% sequence identity to SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144. Examples of switchable core promoter sequences are provided in TABLE 3.
In some embodiments, a core promoter of the present disclosure may function as a switchable core promoter in multiple cell types or cell states. Cell type- or cell state-specificity may be conferred by a response element (e.g., an enhancer sequence that binds to cell type-specific transcription factors) paired with the switchable core promoter. For example, a switchable core promoter of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144 may be paired with a CNS-specific enhancer to promote CNS-specific transcription initiation and may be paired with a kidney-specific enhancer to promote kidney-specific transcription initiation. The core promoters described herein may have low background transcriptional activation (e.g., low levels of transcriptional activation in the absence of an enhancer, in the presence of an inactive enhancer, or in the presence of an unactivated enhancer) and high activity (e.g., high levels of transcriptional activation) when paired with a response element in the presence of cell type-, tissue type-, or cell state-specific cognate ligands or coactivators (e.g., when paired with an activated enhancer). For example, a core promoter may have low transcriptional activation in the presence of an inactive enhancer. The core promoter may have high transcriptional initiation when paired with a cell type-, tissue type-, or cell state-specific response element in a cell type, tissue type, or cell state of interest (e.g., in the presence of, or at high levels of, transcription factors that bind to the transcription factor binding sequence of an enhancer region sequence). The core promoter may have low transcriptional activation when paired with a cell type-, tissue type-, or cell state-specific response element not in a cell type, tissue type, or cell state of interest (e.g., in the absence of, or at low levels of, transcription factors that bind to the transcription factor binding sequence of an enhancer region sequence). In some embodiments, a core promoter may be engineered to have low background transcriptional activation and high transcriptional activation when paired with a cell type-, tissue type, or cell state-specific response element in a cell type, tissue type, or cell state of interest. The sequence of the core promoter may be varied or engineered to tune the transcription level, tissue specificity, or cell type- or cell state-specificity. In some embodiments, a core promoter may comprise an engineered version of an endogenous core sequence. For example, a switchable core promoter of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144 may be paired with a response element of SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194.
A promoter (e.g., an enhancer/promoter) may comprise a switchable core promoter (e.g., SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) paired with a response element (e.g., an enhancer comprising a transcription factor binding site or a coactivator binding site). In some embodiments, the response element of the promoter may confer cell type-, tissue type, or cell state-specificity. For example, an enhancer of SEQ ID NO: 146 or SEQ ID NO: 194 may confer liver specificity. In some embodiments, an enhancer of SEQ ID NO: 145 or SEQ ID NO: 193 may confer bone marrow specificity. In some embodiments, the switchable core promoter of the promoter may readily promote transcription initiation in the presence of an activated response element (e.g., an activated enhancer) and exhibit low background transcription initiation in the presence of an inactive response element (e.g., an inactive enhancer). In some embodiments, the switchable core promoter of the promoter may readily promote transcription initiation in the presence of an activated response element (e.g., an activated enhancer) and exhibit low background transcription initiation in the presence of the unactivated response element (e.g., the unactivated enhancer). A switchable core promoter of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144 may be paired with a response element of SEQ ID NO: 145 or SEQ ID NO: 146.
In some embodiments, a liver-specific enhancer/promoter may comprise a sequence of SEQ ID NO: 163-SEQ ID NO: 166, SEQ ID NO: 169, or SEQ ID NO: 172-SEQ ID NO: 177. In some embodiments, a bone marrow-specific enhancer/promoter may comprise a sequence of SEQ ID NO: 148-SEQ ID NO: 151, SEQ ID NO: 154, or SEQ ID NO: 157-SEQ ID NO: 162. Examples of enhancer/promoter sequences are provided in TABLE 4.
A payload of the present disclosure may comprise a coding sequence under transcriptional control of a promoter (e.g., a promoter comprising a response element and a core promoter). In some embodiments, the payload may encode a transgene or therapeutic polynucleotide for delivery to a cell (e.g., a cell of a human or non-human subject). In some embodiments, the payload may comprise a coding sequence encoding a protein (e.g., a protein without a mutation associated with a disease or condition). In some embodiments, the payload may comprise a coding sequence encoding a therapeutic polynucleotide (e.g., a gRNA targeting a gene associated with the disease or condition). Upon delivery of the payload to a cell, the coding sequence may be transcribed in the cell, thereby expressing the encoded protein or therapeutic polynucleotide. In some embodiments, expression of a protein or therapeutic polynucleotide encoded by the coding sequence may treat, prevent, or alleviate symptoms of a disease or disorder. In some embodiments, the payload may comprise a transgene that encodes a wild type copy of a protein that is mutated or dysregulated in the disease or condition.
In some embodiments, cell state specific transcription of a payload sequence (e.g., a transgene) is desired. For example, a transgene lacking a mutation may be specifically transcribed in neurons having a gene comprising the mutation or having a phenotype associated with the mutation. In another example, a transgene lacking a mutation may be specifically transcribed in retinal tissue having gene comprising the mutation or having a phenotype associated with the mutation. In another example, a transgene lacking a genetic variation may be specifically transcribed in cells having the genetic variation or having a phenotype associated with the genetic variation. In some embodiments, cell specific transcription of a payload sequence (e.g., a transgene) is desired. For example, a transgene may be specifically transcribed in neurons of a subject. In another example, a transgene may be specifically transcribed in hepatocytes of a subject. In some embodiments, tissue specific transcription of a payload sequence (e.g., a transgene) is desired. For example, a transgene may be specifically transcribed in cells of the CNS. In another example, a transgene may be specifically transcribed in liver cells. Examples of genes that may be encoded in the payload sequence (e.g., the transgene) are provided in TABLE 5. In some embodiments, the genes may be delivered as transgenes to a cell of a subject to treat a disease or condition in the subject. Alternatively or in addition, a payload may encode a therapeutic polynucleotide (e.g., a gRNA or a tRNA) that targets a gene associated with a disease or condition. For example, a payload may encode a gRNA for gene editing a targeting a gene in a target cell type, tissue type, or cell state. Examples of genes that may be targeted by a therapeutic polynucleotide encoded by a payload are provided in TABLE 5.
Examples of genes that may be encoded by a payload sequence (e.g., the transgene) and delivered to a cell of a subject to treat, prevent, or alleviate symptoms of a disease or condition include MECP2, GRN, PRPH2, RHO, UBE3A, DYRKIA, MEF2C, NSD1, ATRX, RPS6KA3, TCF4, ZEB2, FOXG1, CDKL5, a partial piece of chromosome 2, SLC6A1, DMD, SERPINA1, ABCA4, CFTR, HEXA, RAB7A, ATP7B, HFE, LIPA, SCNNIA, PKD1, PKD2, PKHD1, ACE, ALB, VHL, EPO, FH, ACE, TNF, SPP1, IL6, MYH9, TSC2, ADIPOQ, IL2, CCL2, TGFB1, UMOD, BCOR, FLCN, FLCN, TP53, CRP, PTEN, IFT88, CLDN14, AGT, MET, MYH9, YWHAE, HAMP, EPO, MUC1, BAP1, APOE, CYBA, GSTT1, IFNG, IGF1, IL2, ABCB1, SDHB, TSC2, BRAF, CDKNIB, GLA, KRT7, PPARG, RET, TRPC6, NDRG1, GANAB, NOX4, ADIPOR1, GREBIL, ANKS6, NUTM2B, CAT, CYBA, CYBB, EGFR, HMOX1, LRP2, SERPINE1, PAX2, ABCB1, PPARA, PPARG, PTGS2, RELA, RET, TLR4, UMOD, BAP1, RETN, GREBIL, FRAS1, CRB2, APRT, AXL, CCND1, CBR1, CPTIA, CYP1A1, CYP2B6, EDN1, ERBB2, HMGCR, MME, NFKB1, NGF, MAPK1, MAPK3, PTGS2, PTGS2, HLTF, SOD1, SOD2, SREBF2, HNF1B, TERT, TNFSF10, NDRG1, MBTPS2, WNT4, BCOR, INF2, ALG9, BICC1, TMEM67, IRX2, FREM1, ANKS6, FREM2, CD46, COL4A3, COL4A4, COL4A5, TTC21B, NPHP4, CD2AP, CFI, LAMB2, LMX1B, or MYH9. Alternately or in addition, the payload sequence may encode a therapeutic polynucleotide that targets a gene, such as MECP2, GRN, PRPH2, RHO, UBE3A, DYRKIA, MEF2C, NSD1, ATRX, RPS6KA3, TCF4, ZEB2, FOXG1, CDKL5, a partial piece of chromosome 2, SLC6A1, DMD, SERPINA1, ABCA4, CFTR, HEXA, RAB7A, ATP7B, HFE, LIPA, SCNNIA, PKD1, PKD2, PKHD1, ACE, ALB, VHL, EPO, FH, ACE, TNF, SPP1, IL6, MYH9, TSC2, ADIPOQ, IL2, CCL2, TGFB1, UMOD, BCOR, FLCN, FLCN, TP53, CRP, PTEN, IFT88, CLDN14, AGT, MET, MYH9, YWHAE, HAMP, EPO, MUC1, BAP1, APOE, CYBA, GSTT1, IFNG, IGF1, IL2, ABCB1, SDHB, TSC2, BRAF, CDKNIB, GLA, KRT7, PPARG, RET, TRPC6, NDRG1, GANAB, NOX4, ADIPOR1, GREBIL, ANKS6, NUTM2B, CAT, CYBA, CYBB, EGFR, HMOX1, LRP2, SERPINE1, PAX2, ABCB1, PPARA, PPARG, PTGS2, RELA, RET, TLR4, UMOD, BAP1, RETN, GREBIL, FRAS1, CRB2, APRT, AXL, CCND1, CBR1, CPTIA, CYP1A1, CYP2B6, EDN1, ERBB2, HMGCR, MME, NFKB1, NGF, MAPK1, MAPK3, PTGS2, PTGS2, HLTF, SOD1, SOD2, SREBF2, HNF1B, TERT, TNFSF10, NDRG1, MBTPS2, WNT4, BCOR, INF2, ALG9, BICC1, TMEM67, IRX2, FREM1, ANKS6, FREM2, CD46, COL4A3, COL4A4, COL4A5, TTC21B, NPHP4, CD2AP, CFI, LAMB2, LMX1B, or MYH9, to treat, prevent, or alleviate symptoms of a disease or condition associated with the gene.
In some embodiments, the genes that may be encoded by a payload sequence, or targeted by a therapeutic polynucleotide encoded by the payload sequence, and delivered to a cell of a subject to treat, prevent, or alleviate symptoms of a disease or condition may be associated with a disease or disorder. For example, the genes and associated conditions may include PKD1 or PDK2 associated with Autosomal Dominant Polycystic Kidney Disease; MECP2 associated with Rett Syndrome; CNGA3 or CNGB3 associated with Achromatopsia; ABCDI associated with Adrenomyeloneuropathy; UBE3A associated with Angelman Syndrome; Tafazzin associated with Barth Syndrome; CLN1, CLN2, CLN3, CLN4, CLN5, or CLN6 associated with Batten Disease; ASPA associated with Canavan Disease; PKD1 or PDK2 associated with Autosomal Dominant Polycistic Kidney Disease; CYP21A2 associated with Congenital Adrenal Hyperplasia; PMM2 associated with Congenital Disorder of Glycosylation 1a; LAMP2 associated with Danon Disease; GLA associated with Fabry Disease; GBA associated with Gaucher Disease; GLB1 associated with GM1 Gangliosidosis; G6PC associated with Glycogen SD 1a; F9 associated with Hemophilia; Serping1 associated with Hereditary Angioedema; GALC associated with Krabbe Disease; GUCY2D, ND1, ND6, ND4, RPE65, or AIPL1 associated with Leber's Disease; MIMI associated with Myotubular Myopathy; I, II, IIA, IIC, IIID, IVA, VI, VII, and IXA associated with Mucopolysaccharidosis; OTC associated with Ornithine Transcarbamylase Deficiency; GAA associated with Pompe Disease; HEXB associated with Sandhoff Disease; SMNI associated with Spinal Muscular Atrophy; HEXA associated with Tay-Sachs; USH2D-WHRN and USH3A-CLN1 associated with Usher Syndrome; SOD1 associated with ALS; HBB associated with Beta-thalassemia or Sickle Cell disease; BESTI associated with Bestrophinopathy; CHM associated with Chorioderemia; FXN associated with Friedreich's Ataxia; SLC37A4 associated with GSD1b; IGF1 associated with Osteoporosis; RPGR or RHO associated with Retinitis Pigmentosa; USHIC or CIB2 associated with Usher 1C, 1F; SERPINA1 associated with Alpha-1 Antitrypsin Deficiency; MECP2 associated with Rett Syndrome; AC6, Serca2, VEGF-B, or PPI associated with Heart Failure; GAD associated with Parkinson's Disease; MBTPS2 associated with Brain Anomalies, Retardation, Ectodermal Dysplasia, Skeletal Malformations, Hirschsprung Disease, Ear-Eye Anomalies, Cleft Palate-Cryptorchidism, And Kidney Dysplasia-Hypoplasia; CD46, COL4A3, COL4A4, COL4A5, TTC21B, NPHP4, CD2AP, CFI, LAMB2, LMX1B, or MYH9 associated with Chronic kidney disease/disorder with a monogenetic origin; TERT or IRX2 associated with Clear cell sarcoma of kidney; ERBB2 associated with Collecting Duct Carcinoma of the Kidney; FREM1 associated with Congenital absence of kidneys syndrome; TMEM67 associated with Cystic kidney; SOD1 associated with Kidney Calculi; EDN1 or MME associated with Kidney Failure; CPTIA, CYP2B6, HMGCR, NFKB1, NGF, or SREBF2 associated with Kidney Failure, Chronic; INF2 associated with Kidney Failure or Chronic kidney disease/disorder with a monogenetic origin; PTGS2, HLTF, NDRG1, or BCOR associated with Kidney Neoplasm; CYP1A1, MAPK1, MAPK3, PTGS2, SOD2, or TNFSF10 associated with Malignant neoplasm of kidney; ALG9 associated with Polycystic Kidney Disease, Potter Type I, with Microbrachycephaly, Hypertelorism, and Brachymelia; BICC1 or ANKS6 associated with Polycystic Kidney, Autosomal Dominant; WNT4 associated with Sex Reversal, Female, With Dysgenesis Of Kidneys, Adrenals, And Lungs; FREM2 associated with Unilateral agenesis of kidney; or HNF1B associated with Unilateral Multicystic Dysplastic Kidney.
In some embodiments, a gene (e.g., a transgene) or a therapeutic polynucleotide encoded by a payload may be transcribed in target cell type, cell state, or tissue type at a level that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold a transcription level of the payload in a non-target cell type, cell state, or tissue type.
In some embodiments, a gene (e.g., a transgene) or a therapeutic polynucleotide encoded by a payload may be transcribed in target cell type, cell state, or tissue type at a level that is at least about-0.75-fold, at least about-0.5-fold, at least about-0.25-fold, at least about-0.1-fold, at least about 0-fold, at least about 0.1-fold, at least about 0.25-fold, at least about 0.5-fold, at least about 0.75-fold, at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold a transcription level of a wild type version of the payload in a target cell type, cell state, or tissue type.
In some embodiments, a gene (e.g., a transgene) or a therapeutic polynucleotide encoded by a payload may be transcribed using any core promoter as disclosed herein (e.g., any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) paired with a response element for a target cell type, cell state, or tissue type, in target cell type, cell state, or tissue type at a level that is at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold a transcription level of the payload in a non-target cell type, cell state, or tissue type.
In some embodiments, a gene (e.g., a transgene) or a therapeutic polynucleotide encoded by a payload may be transcribed using any core promoter as disclosed herein (e.g., any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) paired with a response element for a target cell type, cell state, or tissue type, in target cell type, cell state, or tissue type at a level that is at least about-0.75-fold, at least about-0.5-fold, at least about-0.25-fold, at least about-0.1-fold, at least about 0-fold, at least about 0.1-fold, at least about 0.25-fold, at least about 0.5-fold, at least about 0.75-fold, at least about 1-fold, at least about 1.1-fold, at least about 1.2-fold, at least about 1.3-fold, at least about 1.4-fold, at least about 1.5-fold, at least about 2-fold, at least about 3-fold, at least about 4-fold, at least about 5-fold, at least about 10-fold, at least about 20-fold, at least about 30-fold, at least about 40-fold, at least about 50-fold, at least about 100-fold, at least about 150-fold, or at least about 200-fold a transcription level of a wild type version of the payload in a target cell type, cell state, or tissue type.
In some embodiments, a polynucleotide of the present disclosure is introduced into a subject via a recombinant vector. In some embodiments, an engineered core promoter of the present disclosure (e.g., any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) is introduced into a subject via a recombinant vector. In some embodiments, an engineered core promoter of the present disclosure paired with a response element is introduced into a subject via a recombinant vector. In some embodiments, an engineered core promoter of the present disclosure paired with a response element that is upstream of a polynucleotide encoding a payload is introduced into a subject via a recombinant vector. In some embodiments the vector is a plasmid, a viral vector, an expression cassette, or a transformed cell.
In some embodiments, the viral vector is an adenoviral vector, an adeno-associated viral vector, or a lentiviral vector. Adeno-associated virus (AAV) vectors include vectors derived from any AAV serotype, including, but not limited to AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV 10, AAV11, AAV12, AAV13, AAV14, AAV15, AAV16, AAV-DJ, AAV-DJ/8, AAV-DJ/9, AAV1/2, AAV.rh8, AAV.rh10, AAV.rh20, AAV.rh39, AAV.Rh43, AAV.Rh74, AAV.v66, AAV.Oligo001, AAV.SCH9, AAV.r3.45, AAV.RHM4-1, AAV.hu37, AAV.Anc80, AAV.Anc80L65, AAV.7m8, AAV.PhP.eB, AAV.PhP. V1, AAV.PHP.B, AAV.PhB.C1, AAV.PhB.C2, AAV.PhB.C3, AAV.PhB.C6, AAV.cy5, AAV2.5, AAV2YF, AAV3B, AAV.LK03, AAV.HSC1, AAV.HSC2, AAV.HSC3, AAV.HSC4, AAV.HSC5, AAV.HSC6, AAV.HSC7, AAV.HSC8, AAV.HSC9, AAV.HSC10, AAV.HSC11, AAV.HSC12, AAV.HSC13, AAV.HSC14, AAV.HSC15, AAV.HSC16, AAV.HSC17, and AAVhu68.
In some embodiments, a polynucleotide is introduced into a subject by non-viral vector systems. In some embodiments, an engineered core promoter of the present disclosure (e.g., any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) is introduced into a subject by non-viral vector systems. In some embodiments, an engineered core promoter of the present disclosure paired with a response element is introduced into a subject by non-viral vector systems. In some embodiments, an engineered core promoter of the present disclosure paired with a response element that is upstream of a polynucleotide encoding a payload is introduced into a subject by non-viral vector systems. In some embodiments, cationic lipids, polymers, hydrodynamic injection and/or ultrasound may be used in delivering a polynucleotide to a subject in the absence of virus. In some embodiments, cationic lipids, polymers, hydrodynamic injection and/or ultrasound may be used in delivering an engineered core promoter of the present disclosure (e.g., any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) to a subject in the absence of virus. In some embodiments, cationic lipids, polymers, hydrodynamic injection and/or ultrasound may be used in delivering an engineered core promoter of the present disclosure paired with a response element to a subject in the absence of virus. In some embodiments, cationic lipids, polymers, hydrodynamic injection and/or ultrasound may be used in delivering an engineered core promoter of the present disclosure paired with a response element that is upstream of a polynucleotide encoding a payload to a subject in the absence of virus.
In some examples, the vector may be a eukaryotic vector, a prokaryotic vector (e.g., a bacterial vector) a viral vector, or any combination thereof. In some examples, the vector may be a viral vector. In some embodiments, the viral vector may be a retroviral vector, an adenoviral vector, an adeno-associated viral (AAV) vector, an alphavirus vector, a lentivirus vector (e.g., human or porcine), a Herpes virus vector, an Epstein-Barr virus vector, an SV40 virus vectors, a pox virus vector, or a combination thereof. In some embodiments, the viral vector may be a recombinant vector, a hybrid vector, a chimeric vector, a self-complementary vector, a single-stranded vector, or any combination thereof.
In some embodiments, the viral vector may be an adeno-associated virus (AAV). In some embodiments, the AAV may be any AAV known in the art. In some embodiments, the viral vector may be of a specific serotype. In some embodiments, the viral vector may be an AAV1 serotype, AAV2 serotype, AAV3 serotype, AAV4 serotype, AAV5 serotype, AAV6 serotype, AAV7 serotype, AAV8 serotype, AAV9 serotype, AAV10 serotype, AAV11 serotype, AAV 12 serotype, AAV13 serotype, AAV14 serotype, AAV15 serotype, AAV16 serotype, AAV-DJ serotype, AAV-DJ/8 serotype, AAV-DJ/9 serotype, AAV1/2 serotype, AAV.rh8 serotype, AAV.rh10 serotype, AAV.rh20 serotype, AAV.rh39 serotype, AAV.Rh43 serotype, AAV.Rh74 serotype, AAV.v66 serotype, AAV.Oligo001 serotype, AAV.SCH9 serotype, AAV.r3.45 serotype, AAV.RHM4-1 serotype, AAV.hu37 serotype, AAV.Anc80 serotype, AAV.Anc80L65 serotype, AAV.7m8 serotype, AAV.PhP.eB serotype, AAV.PhP.VI serotype, AAV.PHP.B serotype, AAV.PhB.C1 serotype, AAV.PhB.C2 serotype, AAV.PhB.C3 serotype, AAV.PhB.C6 serotype, AAV.cy5 serotype, AAV2.5 serotype, AAV2tYF serotype, AAV3B serotype, AAV.LK03 serotype, AAV.HSC1 serotype, AAV.HSC2 serotype, AAV.HSC3 serotype, AAV.HSC4 serotype, AAV.HSC5 serotype, AAV.HSC6 serotype, AAV.HSC7 serotype, AAV.HSC8 serotype, AAV.HSC9 serotype, AAV.HSC10 serotype, AAV.HSC11 serotype, AAV.HSC12 serotype, AAV.HSC13 serotype, AAV.HSC14 serotype, AAV.HSC15 serotype, AAV.HSC16 serotype, AAV.HSC17 serotype, or AAVhu68 serotype, a derivative of any of these serotypes, or any combination thereof.
In some embodiments, the AAV vector may be a recombinant vector, a hybrid AAV vector, a chimeric AAV vector, a self-complementary AAV (scAAV) vector, a single-stranded AAV, or any combination thereof.
In some embodiments, the AAV vector may be a recombinant AAV (rAAV) vector. Methods of producing recombinant AAV vectors may be known in the art and generally involve, in some cases, introducing into a producer cell line: (1) DNA necessary for AAV replication and synthesis of an AAV capsid, (b) one or more helper constructs comprising the viral functions missing from the AAV vector, (c) a helper virus, and (d) the plasmid construct containing the genome of the AAV vector, e.g., ITRs, promoter and payload sequences, etc. In some examples, the viral vectors described herein may be engineered through synthetic or other suitable means by references to published sequences, such as those that may be available in the literature. For example, the genomic and protein sequences of various serotypes of AAV, as well as the sequences of the native terminal repeats (TRs), Rep proteins, and capsid subunits may be known in the art and may be found in the literature or in public databases such as GenBank or Protein Data Bank (PDB).
In some examples, methods of producing delivery vectors herein comprising packaging a polynucleotide of the present disclosure in an AAV vector. In some examples, methods of producing delivery vectors herein comprising packaging an engineered core promoter of the present disclosure (e.g., any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) in an AAV vector. In some examples, methods of producing delivery vectors herein comprising packaging an engineered core promoter of the present disclosure paired with a response element in an AAV vector. In some examples, methods of producing delivery vectors herein comprising packaging an engineered core promoter of the present disclosure paired with a response element that is upstream of a polynucleotide encoding a payload in an AAV vector. In some examples, methods of producing the delivery vectors described herein comprise, (a) introducing into a cell: (i) a polynucleotide disclosed herein (e.g., an engineered core promoter of the present disclosure; an engineered core promoter of the present disclosure paired with a response element; or an engineered core promoter of the present disclosure paired with a response element that is upstream of a polynucleotide encoding a payload); and (ii) a viral genome comprising a Replication (Rep) gene and Capsid (Cap) gene that encodes a wild type AAV capsid protein or modified version thereof; (b) expressing in the cell the wild type AAV capsid protein or modified version thereof; (c) assembling an AAV particle; and (d) packaging the polynucleotide disclosed herein in the AAV particle, thereby generating an AAV delivery vector. In some examples, any polynucleotide disclosed herein may be packaged in the AAV vector. In some examples, the recombinant vectors comprise one or more inverted terminal repeats and the inverted terminal repeats comprise a 5′ inverted terminal repeat, a 3′ inverted terminal repeat, and a mutated inverted terminal repeat. In some examples, the mutated terminal repeat lacks a terminal resolution site, thereby enabling formation of a self-complementary AAV.
In some examples, a hybrid AAV vector may be produced by transcapsidation, e.g., packaging an inverted terminal repeat (ITR) from a first serotype into a capsid of a second serotype, wherein the first and second serotypes may be not the same. In some examples, the Rep gene and ITR from a first AAV serotype (e.g., AAV2) may be used in a capsid from a second AAV serotype (e.g., AAV5 or AAV9), wherein the first and second AAV serotypes may not be the same. As a non-limiting example, a hybrid AAV serotype comprising the AAV2 ITRs and AAV9 capsid protein may be indicated AAV2/9. In some examples, the hybrid AAV delivery vector comprises an AAV2/1, AAV2/2, AAV 2/4, AAV2/5, AAV2/8, or AAV2/9 vector.
In some examples, the AAV vector may be a chimeric AAV vector. In some examples, the chimeric AAV vector comprises an exogenous amino acid or an amino acid substitution, or capsid proteins from two or more serotypes. In some examples, a chimeric AAV vector may be genetically engineered to increase transduction efficiency, selectivity, or a combination thereof.
In some examples, the AAV vector comprises a self-complementary AAV genome. Self-complementary AAV genomes may be generally known in the art and contain both DNA strands which can anneal together to form double-stranded DNA.
In some examples, the delivery vector may be a retroviral vector. In some examples, the retroviral vector may be a Moloney Murine Leukemia Virus vector, a spleen necrosis virus vector, or a vector derived from the Rous Sarcoma Virus, Harvey Sarcoma Virus, avian leukosis virus, human immunodeficiency virus, myeloproliferative sarcoma virus, or mammary tumor virus, or a combination thereof. In some examples, the retroviral vector may be transfected such that the majority of sequences coding for the structural genes of the virus (e.g., gag, pol, and env) may be deleted and replaced by the gene(s) of interest (e.g., the payload).
In some examples, the delivery vehicle may be a non-viral vector. Examples of non-viral vectors may include plasmids, lipid nanoparticles, lipoplexes, polymersomes, polyplexes, dendrimers, nanoparticles, and cell-penetrating peptides. The non-viral vector may comprise a polynucleotide, such as a plasmid, encoding for a promoter (e.g., comprising a cell type- or cell state-specific response element and a switchable core promoter) and a payload sequence. The non-viral vector may comprise an engineered core promoter of the present disclosure (e.g., any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144). The non-viral vector may comprise an engineered core promoter of the present disclosure paired with a response element. The non-viral vector may comprise an engineered core promoter of the present disclosure paired with a response element that is upstream of a polynucleotide encoding a payload. In some examples, the delivery vehicle may be a plasmid. In some examples, the plasmid may be a minicircle plasmid. In some embodiments, a vector may comprise naked DNA (e.g., a naked DNA plasmid). In some embodiments, the non-viral vector comprises DNA. In some embodiments, the non-viral vector comprises RNA. In some examples, the non-viral vector comprises circular double-stranded DNA. In some examples, the non-viral vector may comprise a linear polynucleotide. In some examples, the non-viral vector comprises a polynucleotide encoding one or more genes of interest and one or more regulatory elements. In some examples, the non-viral vector comprises a bacterial backbone containing an origin of replication and an antibiotic resistance gene or other selectable marker for plasmid amplification in bacteria. In some examples, the non-viral vector contains one or more genes that provide a selective marker to induce a target cell to retain a polynucleotide (e.g., a plasmid) of the non-viral vector. In some examples, the non-viral vector may be formulated for delivery through injection by a needle carrying syringe. In some examples, the non-viral vector may be formulated for delivery via electroporation. In some examples, a polynucleotide of the non-viral vector may be engineered through synthetic or other suitable means known in the art. For example, in some cases, the genetic elements may be assembled by restriction digest of the desired genetic sequence from a donor plasmid or organism to produce ends of the DNA which may then be readily ligated to another genetic sequence.
In some embodiments, the present disclosure provides a method of inserting a polynucleotide comprising the promoter (e.g., comprising an engineered core promoter, such as any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144, paired with a response element) as described herein and the payload into a recombinant polynucleotide cassette. The recombinant polynucleotide cassette may further modulate expression of the payload (e.g., by modulating translation). In some embodiments, the recombinant polynucleotide cassette modulates stability of the payload RNA. In some embodiments, the recombinant polynucleotide cassette comprises a 5′UTR effector region. In some embodiments, the recombinant polynucleotide cassette comprises a 3′UTR effector region. In some embodiments, the payload is codon optimized in the recombinant polynucleotide cassette. In some embodiments, an intron is inserted into the 5′UTR effector region or the sequence of the payload. In some embodiments, the intron is a natural intron or a synthetic intron.
In some embodiments, the recombinant polynucleotide cassette comprises the promoter as described herein (e.g., comprising an engineered core promoter, such as any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144, paired with a response element), the payload sequence, and one or more of: a 5′UTR effector region; a 3′UTR effector region; a codon optimized sequence of the payload; and an intron in the sequence of the payload. In some embodiments, translation of the payload increases from the recombinant polynucleotide cassette comprising one or more of the 5′UTR effector region, the 3′UTR effector region, a codon optimized sequence of the payload, and the intron in the sequence of the payload compared to from a recombinant polynucleotide cassette lacking the one or more of the 5′UTR effector region, the 3′UTR effector region, a codon optimized sequence of the payload, and the intron in the sequence of the payload. In some embodiments, translation of the payload decreases from the recombinant polynucleotide cassette comprising one or more of the 5′UTR effector region, the 3′UTR effector region, a codon optimized sequence of the payload, and the intron in the sequence of the payload compared to from a recombinant polynucleotide cassette lacking the one or more of the 5′UTR effector region, the 3′UTR effector region, a codon optimized sequence of the payload, and the intron in the sequence of the payload.
In some embodiments, the 5′ UTR effector region comprises one or more of: a structural element; a sequence motif; a nucleotide base content comprising a G/C content of at least 1%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 97%, 99%, or 100%; and a 5′ UTR intron. In some embodiments, the structural element is a site that is in a conformation with the 5′cap so that the 5′cap is inaccessible or has low accessibility to the translation machinery (e.g., also referred to as “a cap-burying site”), resulting in decreased or no translation compared to a 5′UTR lacking this structural element. In some embodiments, the structural element is an Internal Ribosome Entry Site (IRES). In some embodiments, the structural element is an RNA pseudoknot. In some embodiments, the structural element is an Iron Responsive Element (IRE). In some embodiments, the structural element is a non-coding translation modulatory structure. In some embodiments, the structural element is a hairpin. In some embodiments, the structural element is a sequence (e.g., a cap-burying site sequence, an IRES, an RNA pseudoknot, an IRE, or a non-coding translation modulatory structure), that changes conformation when a sequence element contacts a target RNA with which it has at least partial complementarity, such that the rate of translation of the payload downstream of the structural element is increased or decreased compared to translation of the payload prior to the sequence element contacting the target RNA. In some embodiments, the 5′UTR effector region further comprises the sequence element, wherein a nucleic acid sequence of the sequence element is at least partially complementary to a sequence of a target RNA and wherein conformation of the structural element changes when the sequence element contacts the target RNA with which it has at least partial complementarity. In some embodiments, the 5′ UTR intron is a natural intron, synthetic intron, or a fragment thereof.
In some embodiments, the 3′ UTR comprises one or more of: a site that recruits polyA tail machinery; an miRNA binding site; or a sequence motif. In some embodiments, the poly (A) tail recruitment machinery comprises an enzyme. In some embodiments, the length of the poly (A) tail modulates protein expression from the polynucleotide. In some embodiments, the 3′UTR effector region comprises one, two, three, four, five, six, seven, eight, nine, ten, or more miRNA binding sites. In some embodiments, the miRNA binding sites are for the same miRNA. In some embodiments, the miRNA binding sites are for different miRNA. In some embodiments, the sequence motif is an AC-rich motif. In some embodiments, the sequence motif is an AU-rich element (ARE).
In some embodiments, the codon optimized sequence of the therapeutic polynucleotide is a least 1%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 97%, 99%, or 100% codon optimized. In some embodiments, the intron in the sequence of the therapeutic polynucleotide is a natural intron, synthetic intron, or a fragment thereof. In some embodiments, the recombinant polynucleotide cassette is encoded by a DNA vector.
In some embodiments, the combination of the promoter sequence and payload sequence as described herein in the recombinant polynucleotide cassette results in the payload being expressed at a therapeutic level for reducing or alleviating at least one symptom of the disease or disorder. The therapeutic level can be-0.25-fold,-0.5-fold, 0-fold, 0.25-fold, 0.5-fold, 0.75-fold, 1-fold, 1.5-fold, 2-fold, or 4-fold greater than the biological level of the payload.
Methods for treatment of diseases or disorders characterized by aberrant gene expression are also encompassed by the present disclosure. Said methods include administering a therapeutically effective amount of a payload sequence as part of a recombinant polynucleotide cassette. The recombinant polynucleotide cassette of the disclosure can be formulated in pharmaceutical compositions. These compositions can comprise, in addition to one or more of the recombinant polynucleotide cassettes (e.g., comprising an engineered core promoter, such as any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144), a pharmaceutically acceptable excipient, carrier, buffer, stabilizer or other materials well known to those skilled in the art. Such materials should be non-toxic and should not interfere with the efficacy of the active ingredient. The precise nature of the carrier or other material can depend on the route of administration, e.g., oral, intravenous, cutaneous or subcutaneous, nasal, intramuscular, intraperitoneal routes.
Pharmaceutical compositions for oral administration can be in tablet, capsule, powder, or liquid form. A tablet can include a solid carrier such as gelatin or an adjuvant. Liquid pharmaceutical compositions generally include a liquid carrier such as water, petroleum, animal or vegetable oils, mineral oil, or synthetic oil. Physiological saline solution, dextrose or other saccharide solution or glycols such as ethylene glycol, propylene glycol or polyethylene glycol can be included.
For intravenous, cutaneous, or subcutaneous injection, or injection at the site of affliction, the active ingredient will be in the form of a parenterally acceptable aqueous solution which is pyrogen-free and has suitable pH, isotonicity and stability. Those of relevant skill in the art are well able to prepare suitable solutions using, for example, isotonic vehicles such as Sodium Chloride Injection, Ringer's Injection, Lactated Ringer's Injection. Preservatives, stabilizers, buffers, antioxidants and/or other additives can be included, as required.
In some embodiments, the polynucleotide of the present disclosure or recombinant polynucleotide cassette of the present disclosure (e.g., a polynucleotide or recombinant polynucleotide cassette comprising an engineered core promoter, such as any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) may be administered to cells via a lipid nanoparticle. In some embodiments, the lipid nanoparticle may be administered at the appropriate concentration according to standard methods appropriate for the target cells.
In some embodiments, the polynucleotide of the present disclosure or recombinant polynucleotide cassette of the present disclosure (e.g., a polynucleotide or recombinant polynucleotide cassette comprising an engineered core promoter, such as any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) may be administered to cells via a viral vector. In some embodiments, the viral vector may be administered at the appropriate multiplicity of infection according to standard transduction methods appropriate for the target cells. Titers of the virus vector or capsid to administer can vary depending on the target cell type or cell state and number and can be determined by those of skill in the art. In some embodiments, at least about 102 infections units are administered. In some embodiments, at least about 103, 104, 105, 106, 107, 108, 109, 1010, 1011, 1012, or 1013 infectious units are administered.
In some embodiments, the polynucleotide or recombinant polynucleotide cassette (e.g., a polynucleotide or recombinant polynucleotide cassette comprising an engineered core promoter, such as any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) is introduced to cells of any type or state, including, but not limited to neural cells, cells of the eye (including retinal cells, retinal pigment epithelium, and corneal cells), lung cells, epithelial cells, skeletal muscle cells, dendritic cells, hepatic cells, pancreatic cells, bone cells, hematopoietic stem cells, spleen cells, keratinocytes, fibroblasts, endothelial cells, prostate cells, and heart cells.
In some embodiments, the polynucleotide or the disclosure or the recombinant polynucleotide cassette of the disclosure (e.g., a polynucleotide or recombinant polynucleotide cassette comprising an engineered core promoter, such as any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) may be introduced to cells in vitro via a viral vector for administration of modified cells to a subject. In some embodiments, a viral vector encoding the polynucleotide of the disclosure or the recombinant polynucleotide cassette of the disclosure is introduced to cells that have been removed from a subject. In some embodiments, the modified cells are placed back in the subject following introduction of the viral vector.
In some embodiments, a dose of modified cells is administered to a subject according to the age and species of the subject, disease or disorder to be treated, as well as the cell type or state and mode of administration. In some embodiments, at least about 102-108 cells are administered per dose. In some embodiments, cells transduced with viral vector are administered to a subject in an effective amount.
In some embodiments, the dose of viral vector administered to a subject will vary according to the age of the subject, the disease or disorder to be treated, and mode of administration. In some embodiments, the dose for achieving a therapeutic effect is a virus titer of at least about 102, 103, 104, 105, 106, 107, 108, 109, 1010, 1011, 1012, 1013, 1014, 1015, 1016 or more transducing units.
Administration of the pharmaceutically useful polynucleotide of the present disclosure or the polynucleotide cassette of the present disclosure is preferably in a “therapeutically effective amount” or “prophylactically effective amount” (as the case can be, although prophylaxis can be considered therapy), this being sufficient to show benefit to the individual. The actual amount administered, and rate and time-course of administration, will depend on the nature and severity of protein aggregation disease being treated. Prescription of treatment, e.g., decisions on dosage etc., is within the responsibility of general practitioners and other medical doctors, and typically takes account of the disorder to be treated, the condition of the individual patient, the site of delivery, the method of administration and other factors known to practitioners. Examples of the techniques and protocols mentioned above can be found in Remington's Pharmaceutical Sciences, 16th edition, Osol, A. (ed), 1980.
A composition can be administered alone or in combination with other treatments, either simultaneously or sequentially dependent upon the condition to be treated.
In some embodiments, the recombinant polynucleotide of the disclosure or the recombinant polynucleotide cassette of the disclosure (e.g., a polynucleotide, recombinant polynucleotide, or recombinant polynucleotide cassette comprising an engineered core promoter, such as any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) is used for treating a disease or disorder associated with abnormal expression of a gene or protein. In some embodiments the disease or disorder is Rett syndrome, MECP2 duplication syndrome, frontotemporal dementia, neuronal ceroid lipofuscinosis, amyotrophic lateral sclerosis (ALS), limbic predominant age-related TDP-43 encephalopathy (LATE), autosomal recessive polycystic kidney disease, kidney failure, chronic, malignant neoplasm of kidney, polycystic kidney disease, type 2, kidney failure, kidney neoplasm, medullary cystic kidney disease type 2, clear cell sarcoma of kidney, kidney calculi, collecting duct carcinoma of the kidney, polycystic kidney, medullary cystic kidney disease 1, congenital absence of kidneys syndrome, cystic kidney, chronic kidney disease/disorder with a monogenetic origin, unilateral agenesis of kidney, glomerulocystic kidney disease with hyperuricemia and isosthenuria, cystic kidney disease with ventriculomegaly, unilateral multicystic dysplastic kidney, brain anomalies, retardation, ectodermal dysplasia, skeletal malformations, Hirschsprung disease, ear-eye anomalies, cleft palate-cryptorchidism, and kidney dysplasia-hypoplasia, sex reversal, female, with dysgenesis of kidneys, adrenals, and lungs, polycystic kidney disease, potter type I, with microbrachycephaly, hypertelorism, brachymelia, retinitis pigmentosa 7, macular degeneration, retinitis pigmentosa 4, Angelman syndrome, DYRKIA haploinsufficiency, MEF2C haploinsufficiency syndrome, Sotos syndrome, reverse Sotos syndrome, alpha-thalassemia X-linked intellectual disability syndrome, Xp22.12 duplication, Coffin-Lowry syndrome, Pitt Hopkins syndrome, Mowat-Wilson syndrome, FOXG1 syndrome, CDKL5 deficiency disorder, West syndrome, 2q23.1 microdeletion syndrome, Doose syndrome, SLC6A1 epileptic encephalopathy, Duchenne's muscular dystrophy, Becker muscular dystrophy, alpha-1 antitrypsin deficiency (AATD), macular degeneration/Stargardt disease, cystic fibrosis, Tay-Sachs, Charcot-Marie-Tooth neuropathy, Wilson's disease, hereditary hemochromatosis, Wolman disease, cholesteryl ester storage disease, psueodhypoaldosteronism type 1, autosomal dominant polycystic kidney disease, achromatopsia, adrenomyeloneuropathy, Barth syndrome, Batten disease, Canavan disease, congenital adrenal hyperplasia, congenital disorder of glycosylation 1a, Danon disease, Fabry disease, Gaucher disease, GM1 gangliosidosis, glycogen SD 1a, hemophilia, hereditary angioedema, Krabbe disease, Leber's disease, myotubular myopathy, mucopolysaccharidosis, ornithine transcarbamylase deficiency, Pompe disease, Sandhoff disease, spinal muscular atrophy, Usher syndrome, beta-thalassemia, bestrophinopathy, chorioderemia, Friedreich's ataxia, GSD1b, osteoporosis, sickle cell disease, heart failure, or Parkinson's disease. In some embodiments, the disease or disorder is Rett syndrome. In some embodiments, the disease or disorder is frontotemporal dementia. In some embodiments, the disease or disorder is a retinal disorder. In some embodiments, the retinal disorder comprises Retinitis Pigmentosa 7, Retinitis Pigmentosa 4, or macular degeneration.
In some embodiments, a polynucleotide of the present disclosure or the recombinant polynucleotide cassette of the present disclosure (e.g., a polynucleotide or recombinant polynucleotide cassette comprising an engineered core promoter, such as any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144) may be administered using a viral vector (e.g., an AAV vector) or a non-viral vector to a subject in need thereof. For example, the subject in need thereof may have or be at risk for frontotemporal dementia. Upon administration of said polynucleotide or said recombinant polynucleotide cassette, a payload sequence (e.g., encoding progranulin, polycystin-1, or polycystin-2) may be expressed in a cell type, cell state, or tissue type of interest (e.g., a diseased cell, a neuronal cell, CNS tissue, renal cell, or renal tissue). The payload sequence may be selectively transcribed in the cell type, cell state, or tissue type of interest, thereby preventing unwanted adverse effects in the subject due to expression in non-target tissues. The subject can be a human or a non-human animal. Thus, the polynucleotides disclosed herein or the recombinant polynucleotide cassettes disclosed herein can serve as a therapeutically effective vector replacement therapy that senses endogenous nucleic acids to regulate expression of a payload sequence and prevent or minimize adverse side effects from overexpression of a payload sequence.
As used herein, the term “therapeutic polynucleotide” may to a polynucleotide that is introduced into a cell and is capable of being expressed in the cell or to a polynucleotide that may, in itself, have a therapeutic activity, such as a gRNA or a tRNA.
As used herein, the term “polynucleotide” refers to a single or double-stranded polymer of deoxyribonucleotide (DNA) or ribonucleotide (RNA) bases read from the 5′ to the 3′ end. The term “RNA” is inclusive of dsRNA (double stranded RNA), snRNA (small nuclear RNA), lncRNA (long non-coding RNA), mRNA (messenger RNA), miRNA (microRNA) RNAi (inhibitory RNA), siRNA (small interfering RNA), shRNA (short hairpin RNA), tRNA (transfer RNA), IRNA (ribosomal RNA), snoRNA (small nucleolar RNA), and cRNA (complementary RNA). The term DNA is inclusive of cDNA, genomic DNA, and DNA-RNA hybrids.
The term “ameliorating” refers to any therapeutically beneficial result in the treatment of a disease state, e.g., Rett syndrome, including prophylaxis, lessening in the severity or progression, remission, or cure thereof.
The term “mammal” as used herein includes both humans and non-humans and include but is not limited to humans, non-human primates, canines, felines, murines, bovines, equines, and porcines.
The term percent “identity,” in the context of two or more nucleic acid or polypeptide sequences, refers to two or more sequences or subsequences that have a specified percentage of nucleotides or amino acid residues that are the same, when compared and aligned for maximum correspondence, as measured using one of the sequence comparison algorithms described below (e.g., BLASTP and BLASTN or other algorithms available to persons of skill) or by visual inspection. Depending on the application, the percent “identity” can exist over a region of the sequence being compared, e.g., over a functional domain, or, alternatively, exist over the full length of the two sequences to be compared.
For sequence comparison, typically one sequence acts as a reference sequence (also called the subject sequence) to which test sequences (also called query sequences) are compared. The percent sequence identity is defined as a test sequence's percent identity to a reference sequence. For example, when stated “Sequence A having a sequence identity of 50% to Sequence B,” Sequence A is the test sequence and Sequence B is the reference sequence. When using a sequence comparison algorithm, test and reference sequences are input into a computer program, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. The sequence comparison algorithm then aligns the sequences to achieve the maximum alignment, based on the designated program parameters, introducing gaps in the alignment if necessary. The percent sequence identity for the test sequence(s) relative to the reference sequence can then be determined from the alignment of the test sequence to the reference sequence. The equation for percent sequence identity from the aligned sequence is as follows:
[(Number of Identical Positions)/(Total Number of Positions in the Test Sequence)]×100%
For purposes herein, percent identity and sequence similarity calculations are performed using the BLAST algorithm for sequence alignment, which is described in Altschul et al., J. Mol. Biol. 215:403-410 (1990). Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (www.ncbi.nlm.nih.gov/). The BLAST algorithm uses a test sequence (also called a query sequence) and a reference sequence (also called a subject sequence) to search against, or in some cases, a database of multiple reference sequences to search against. The BLAST algorithm performs sequence alignment by finding high-scoring alignment regions between the test and the reference sequences by scoring alignment of short regions of the test sequence (termed “words”) to the reference sequence. The scoring of each alignment is determined by the BLAST algorithm and takes factors into account, such as the number of aligned positions, as well as whether introduction of gaps between the test and the reference sequences would improve the alignment. The alignment scores for nucleic acids can be scored by set match/mismatch scores. For protein sequences, the alignment scores can be scored using a substitution matrix to evaluate the significance of the sequence alignment, for example, the similarity between aligned amino acids based on their evolutionary probability of substitution. For purposes herein, the substitution matrix used is the BLOSUM62 matrix. For purposes herein, the public default values of Apr. 6, 2023 are used when using the BLASTN and BLASTP algorithms. The BLASTN and BLASTP algorithms then output a “Percent Identity” output value and a “Query Coverage” output value. The overall percent sequence identity as used herein can then be calculated from the BLASTN or BLASTP output values as follows:
Percent Sequence Identity=(“Percent Identity” output value)×(“Query Coverage” output value)
The following non-limiting examples illustrate the calculation of percent identity between two nucleic acids sequences. The percent identity is calculated as follows: [(number of identical nucleotide positions)/(total number of nucleotides in the test sequence)]×100%. Percent identity is calculated to compare test sequence 1: AAAAAGGGGG (length=10 nucleotides) to reference sequence 2: AAAAAAAAAA (length=10 nucleotides). The percent identity between test sequence 1 and reference sequence 2 would be [(5)/(10)]×100%=50%. Test sequence 1 has 50% sequence identity to reference sequence 2. In another example, percent identity is calculated to compare test sequence 3: CCCCCGGGGGGGGGGCCCCC (length=20 nucleotides) to reference sequence 4: GGGGGGGGGG (length=10 nucleotides). The percent identity between test sequence 3 and reference sequence 4 would be [(10)/(20)]×100%=50%. Test sequence 3 has 50% sequence identity to reference sequence 4. In another example, percent identity is calculated to compare test sequence 5: GGGGGGGGGG (length=10 nucleotides) to reference sequence 6: CCCCCGGGGGGGGGGCCCCC (length=20 nucleotides). The percent identity between test sequence 5 and reference sequence 6 would be [(10)/(10)]×100%=100%. Test sequence 5 has 100% sequence identity to reference sequence 6.
The following non-limiting examples illustrate the calculation of percent identity between two protein sequences. The percent identity is calculated as follows: [(number of identical amino acid positions)/(total number of amino acids in the test sequence)]×100%. Percent identity is calculated to compare test sequence 7: FFFFFYYYYY (length=10 amino acids) to reference sequence 8: YYYYYYYYYY (length=10 amino acids). The percent identity between test sequence 7 and reference sequence 8 would be [(5)/(10)]×100%=50%. Test sequence 7 has 50% sequence identity to reference sequence 8. In another example, percent identity is calculated to compare test sequence 9: LLLLLFFFFFYYYYYLLLLL (length=20 amino acids) to reference sequence 10: FFFFFYYYYY (length=10 amino acids). The percent identity between test sequence 9 and reference sequence 10 would be [(10)/(20)]×100%=50%. Test sequence 9 has 50% sequence identity to reference sequence 10. In another example, percent identity is calculated to compare test sequence 11: FFFFFYYYYY (length=10 amino acids) to reference sequence 12: LLLLLFFFFFYYYYYLLLLL (length=20 amino acids). The percent identity between test sequence 11 and reference sequence 12 would be [(10)/(10)]×100%=100%. Test sequence 11 has 100% sequence identity to reference sequence 12.
As used herein, the term “subject” broadly refers to any animal, including but not limited to, human and non-human animals (e.g., dogs, cats, cows, horses, sheep, pigs, poultry, fish, crustaceans, etc.).
As used herein, the term “effective amount” refers to the amount of a composition (e.g., a synthetic peptide) sufficient to effect beneficial or desired results. An effective amount can be administered in one or more administrations, applications or dosages and is not intended to be limited to a particular formulation or administration route.
As used herein, the term “therapeutically effective amount” is an amount that is effective to ameliorate a symptom of a disease. A therapeutically effective amount can be a “prophylactically effective amount” as prophylaxis can be considered therapy.
As used herein, the terms “administration” and “administering” refer to the act of giving a drug, prodrug, or other agent, or therapeutic treatment (e.g., peptide) to a subject or in vivo, in vitro, or ex vivo cells, tissues, and organs. Exemplary routes of administration to the human body can be through space under the arachnoid membrane of the brain or spinal cord (intrathecal), the eyes (ophthalmic), mouth (oral), skin (topical or transdermal), nose (nasal), lungs (inhalant), oral mucosa (buccal or lingual), ear, rectal, vaginal, by injection (e.g., intravenously, subcutaneously, intratumorally, intraperitoneally, etc.) and the like.
As used herein, the term “treatment” means an approach to obtaining a beneficial or intended clinical result. The beneficial or intended clinical result can include alleviation of symptoms, a reduction in the severity of the disease, inhibiting an underlying cause of a disease or condition, steadying diseases in a non-advanced state, delaying the progress of a disease, and/or improvement or alleviation of disease conditions.
As used herein, the term “pharmaceutical composition” refers to the combination of an active ingredient with a carrier, inert or active, making the composition especially suitable for therapeutic or diagnostic use in vitro, in vivo or ex vivo.
The terms “pharmaceutically acceptable” or “pharmacologically acceptable,” as used herein, refer to compositions that do not substantially produce adverse reactions, e.g., toxic, allergic, or immunological reactions, when administered to a subject.
As used herein, the term “pharmaceutically acceptable carrier” refers to any of the standard pharmaceutical carriers including, but not limited to, phosphate buffered saline solution, water, emulsions (e.g., such as an oil/water or water/oil emulsions), glycerol, liquid polyethylene glycols, aprotic solvents such as dimethylsulfoxide, N-methylpyrrolidone and mixtures thereof, and various types of wetting agents, solubilizing agents, anti-oxidants, bulking agents, protein carriers such as albumins, any and all solvents, dispersion media, coatings, sodium lauryl sulfate, isotonic and absorption delaying agents, disintegrants (e.g., potato starch or sodium starch glycolate), and the like. The compositions also can include stabilizers and preservatives. For examples of carriers, stabilizers and adjuvants, see, e.g., Martin, Remington's Pharmaceutical Sciences, 21th Ed., Mack Publ. Co., Easton, Pa. (2005), incorporated herein by reference in its entirety.
As used herein, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise.
As used herein, the terms “about” and “approximately,” in reference to a number, is used herein to include numbers that fall within a range of 10%, 5%, or 1% in either direction (greater than or less than) the number unless otherwise stated or otherwise evident from the context (except where such number would exceed 100% of a possible value).
The following embodiments recite non-limiting permutations of combinations of features disclosed herein. Other permutations of combinations of features are also contemplated. In particular, each of these numbered embodiments is contemplated as depending from or relating to every previous or subsequent numbered embodiment, independent of their order as listed. 1. An engineered core promoter sequence comprising a TATA box, an initiator element, a spacer separating the TATA box and the initiator element, and one or more features selected from: a) the TATA box comprises a sequence having at least 85% sequence identity to any one of SEQ ID NO: 199-SEQ ID NO: 201; b) the initiator element comprises a sequence having at least 75% sequence identity to any one of SEQ ID NO: 202-SEQ ID NO: 204; c) the spacer is a short spacer comprising no less than 25 and no more than 29 nucleotide residues between the TATA box and the initiator element; d) the spacer is a long spacer comprising no less than 31 and no more than 48 nucleotide residues between the TATA box and the initiator element; e) a YY1 motif comprising a sequence having at least 80% sequence identity to any one of SEQ ID NO: 205-SEQ ID NO: 207; f) a transcriptional pause site comprising a sequence having at least 90% sequence identity to any one of SEQ ID NO: 196-SEQ ID NO: 198; g) at least 70% sequence identity and at least one nucleotide substitution relative to SEQ ID NO: 5, SEQ ID NO: 9, or SEQ ID NO: 210; and h) combinations thereof. 2. The engineered core promoter sequence of embodiment 1, comprising at least 70% sequence identity and nucleotide substitutions T15G and A40C relative to SEQ ID NO: 5. 3. The engineered core promoter sequence of embodiment 1, comprising at least 70% sequence identity and nucleotide substitutions T15A and A40T relative to SEQ ID NO: 5. 4. The engineered core promoter sequence of any one of embodiments 1-3, wherein the spacer comprises from 29 to 31 nucleotides between the TATA box and the initiator element. 5. The engineered core promoter sequence of any one of embodiments 1-4, wherein the spacer comprises 30 nucleotides between the TATA box and the initiator element. 6. The engineered core promoter sequence of any one of embodiments 1-3, wherein the short spacer comprises 28 nucleotide residues between the TATA box and the initiator element. 7. The engineered core promoter sequence of any one of embodiments 1-3, wherein the long spacer comprises 32 nucleotide residues between the TATA box and the initiator element. 8. The engineered core promoter sequence of any one of embodiments 1-7, wherein the TATA box comprises a sequence having at least 90% sequence identity to any one of SEQ ID NO: 199-SEQ ID NO: 201. 9. The engineered core promoter sequence of any one of embodiments 1-8, wherein the TATA box comprises a sequence of any one of SEQ ID NO: 199-SEQ ID NO: 201. 10. The engineered core promoter sequence of any one of embodiments 1-9, wherein the initiator element comprises a sequence having at least 85% sequence identity to any one of SEQ ID NO: 202-SEQ ID NO: 204. 11. The engineered core promoter sequence of any one of embodiments 1-10, wherein the initiator element comprises a sequence of any one of SEQ ID NO: 202-SEQ ID NO: 204. 12. The engineered core promoter sequence of any one of embodiments 1-11, wherein the YY1 motif comprises a sequence having at least 90% sequence identity to any one of SEQ ID NO: 205-SEQ ID NO: 207. 13. The engineered core promoter sequence of any one of embodiments 1-12, wherein the YY1 motif comprises a sequence of any one of SEQ ID NO: 205-SEQ ID NO: 207. 14. The engineered core promoter sequence of any one of embodiments 1-11, wherein the transcriptional pause site comprises a sequence having at least 95% sequence identity to any one of SEQ ID NO: 196-SEQ ID NO: 198. 15. The engineered core promoter sequence of any one of embodiments 1-11, wherein the transcriptional pause site comprises a sequence of any one of SEQ ID NO: 196-SEQ ID NO: 198. 16. The engineered core promoter of any one of embodiments 1-15, wherein the engineered core promoter comprises two or more of the features. 17. The engineered core promoter sequence of any one of embodiments 1-16, wherein the engineered core promoter comprises features a) and b). 18. The engineered core promoter of any one of embodiments 1-17, wherein the engineered core promoter comprises three or more of the features. 19. The engineered core promoter sequence of any one of embodiments 1-18, wherein the engineered core promoter comprises features a) and b) and at least one more of the features. 20. The engineered core promoter sequence of any one of embodiments 1-19, wherein the engineered core promoter comprises features a), b), and c). 21. The engineered core promoter sequence of any one of embodiments 1-19, wherein the engineered core promoter comprises features a), b), and d). 22. The engineered core promoter sequence of any one of embodiments 1-19, wherein the engineered core promoter comprises features a), b), and e). 23. The engineered core promoter sequence of any one of embodiments 1-19, wherein the engineered core promoter comprises features a), b), and f). 24. An engineered core promoter sequence comprising a sequence having at least 80%, 85%, 90%, 93%, 95%, 97%, 99%, or 100% sequence identity to any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144. 25. The engineered core promoter sequence of embodiment 23, wherein the engineered core promoter sequence comprises a sequence having at least 80%, 85%, 90%, 93%, 95%, 97%, 99%, or 100% sequence identity to any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 10-SEQ ID NO: 15, or SEQ ID NO: 110. 26. The engineered core promoter sequence of embodiment 23 or embodiment 24, wherein the engineered core promoter sequence comprises a sequence of SEQ ID NO: 7. 27. The engineered core promoter sequence of embodiment 23 or embodiment 24, wherein the engineered core promoter sequence comprises a sequence of SEQ ID NO: 1. 28. The engineered core promoter sequence of embodiment 23 or embodiment 24, wherein the engineered core promoter sequence comprises a sequence of SEQ ID NO: 2. 29. The engineered core promoter sequence of embodiment 23 or embodiment 24, wherein the engineered core promoter sequence comprises a sequence of SEQ ID NO: 3. 30. The engineered core promoter sequence of embodiment 23 or embodiment 24, wherein the engineered core promoter sequence comprises a sequence of SEQ ID NO: 4. 31. The engineered core promoter sequence of embodiment 23 or embodiment 24, wherein the engineered core promoter sequence comprises a sequence of SEQ ID NO: 10. 32. The engineered core promoter sequence of embodiment 23 or embodiment 24, wherein the engineered core promoter sequence comprises a sequence of SEQ ID NO: 11. 33. The engineered core promoter sequence of embodiment 23 or embodiment 24, wherein the engineered core promoter sequence comprises a sequence of SEQ ID NO: 12. 34. The engineered core promoter sequence of embodiment 23 or embodiment 24, wherein the engineered core promoter sequence comprises a sequence of SEQ ID NO: 13. 35. The engineered core promoter sequence of embodiment 23 or embodiment 24, wherein the engineered core promoter sequence comprises a sequence of SEQ ID NO: 14. 36. The engineered core promoter sequence of embodiment 23 or embodiment 24, wherein the engineered core promoter sequence comprises a sequence of SEQ ID NO: 15. 37. The engineered core promoter sequence of embodiment 23 or embodiment 24, wherein the engineered core promoter sequence comprises a sequence of SEQ ID NO: 110. 38. The engineered core promoter sequence of any one of embodiments 1-37, wherein the engineered core promoter sequence comprises a TATA box, an RNA polymerase binding sequence, a B recognition element, a CCAAT box, or a Pribnow box. 39. The engineered core promoter sequence of any one of embodiments 1-38, wherein the engineered core promoter sequence is capable of recruiting a polymerase. 40. The engineered core promoter sequence of embodiment 39, wherein the polymerase is an RNA polymerase II. 41. The engineered core promoter sequence of any one of embodiments 1-40, wherein the engineered core promoter has a dynamic range that is higher than a dynamic range of SEQ ID NO: 6. 42. The engineered core promoter sequence of any one of embodiments 1-41, wherein the engineered core promoter has a dynamic range that is higher than a dynamic range of SEQ ID NO: 9. 43. The engineered core promoter sequence of any one of embodiments 1-42, wherein the engineered core promoter has an activity that is higher than an activity of SEQ ID NO: 6. 44. The engineered core promoter sequence of any one of embodiments 1-43, wherein the engineered core promoter has an activity that is higher than an activity of SEQ ID NO: 9. 45. The engineered core promoter sequence of any one of embodiments 1-44, wherein the engineered core promoter has an activity that is higher than an activity of SEQ ID NO: 6 when paired with an active response element, a basal activity that is lower than a basal activity of SEQ ID NO: 6 when paired with an inactive response element, or both. 46. The engineered core promoter sequence of any one of embodiments 1-45, wherein the engineered core promoter has an activity that is higher than an activity of SEQ ID NO: 9 when paired with an active response element, a basal activity that is lower than a basal activity of SEQ ID NO: 9 when paired with an inactive response element, or both. 47. A recombinant polynucleotide comprising a promoter and a payload, wherein the promoter comprises: a response element capable of binding to a cognate ligand, a coactivator, or a corepressor; and the engineered core promoter sequence of any one of embodiments 1-46 capable of recruiting a polymerase; wherein the payload comprises a coding sequence. 48. The recombinant polynucleotide of embodiment 47, wherein the response element is an enhancer. 49. The recombinant polynucleotide of embodiment 47 or embodiment 48, wherein binding of the cognate ligand or the coactivator to the response element increases transcription of the payload in a target cell type, a target cell state, or a target tissue. 50. The recombinant polynucleotide of embodiment 47, wherein binding of the corepressor to the response element decreases transcription of the payload in a non-target cell type, a non-target cell state, or a non-target tissue. 51. The recombinant polynucleotide of any one of embodiments 47-50, wherein the response element confers neuron-specific transcription of the payload. 52. The recombinant polynucleotide of any one of embodiments 47-50, wherein the response element confers muscle-specific transcription of the payload. 53. The recombinant polynucleotide of any one of embodiments 47-50, wherein the response element confers kidney-specific transcription of the payload. 54. The recombinant polynucleotide of any one of embodiments 47-53, wherein the payload encodes a protein. 55. The recombinant polynucleotide of embodiment 54, wherein the protein is a neuronal protein, a kidney protein, a retinal protein, a muscle protein, or an apoptosis-inducing protein. 56. The recombinant polynucleotide of embodiment 54 or embodiment 55, wherein the protein is associated with a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. 57. The recombinant polynucleotide of any one of embodiments 54-56, wherein the protein is progranulin, MeCP2, polycystin-1, or polycystin-2. 58. The recombinant polynucleotide of any one of embodiments 47-53, wherein the payload encodes a therapeutic polynucleotide. 59. The recombinant polynucleotide of embodiment 58, wherein the therapeutic polynucleotide is a guide RNA or a suppressor tRNA. 60. The recombinant polynucleotide of embodiment 58 or embodiment 59, wherein the therapeutic polynucleotide targets a gene. 61. The recombinant polynucleotide of embodiment 60, wherein the gene is associated with a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. 62. The recombinant polynucleotide of embodiment 60 or embodiment 61, wherein the gene is GRN, MECP2, PKD2, or PKD2. 63. An engineered viral vector comprising the recombinant polynucleotide of any one of embodiments 47-62 in a viral vector. 64. The engineered viral vector of embodiment 63, wherein the viral vector is an adenoviral vector, an adeno-associated viral vector, or a lentivector. 65. The engineered viral vector of embodiment 64, wherein the adeno-associated viral vector is selected from the group consisting of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV 10, AAV11, AAV12, AAV13, AAV14, AAV15, AAV16, AAV-DJ, AAV-DJ/8, AAV-DJ/9, AAV1/2, AAV.rh8, AAV.rh10, AAV.rh20, AAV.rh39, AAV.Rh43, AAV.Rh74, AAV.v66, AAV.Oligo001, AAV.SCH9, AAV.r3.45, AAV.RHM4-1, AAV.hu37, AAV.Anc80, AAV.Anc80L65, AAV.7m8, AAV.PhP.eB, AAV.PhP. V1, AAV.PHP.B, AAV.PhB.C1, AAV.PhB.C2, AAV.PhB.C3, AAV.PhB.C6, AAV.cy5, AAV2.5, AAV2tYF, AAV3B, AAV.LK03, AAV.HSC1, AAV.HSC2, AAV.HSC3, AAV.HSC4, AAV.HSC5, AAV.HSC6, AAV.HSC7, AAV.HSC8, AAV.HSC9, AAV.HSC10, AAV.HSC11, AAV.HSC12, AAV.HSC13, AAV.HSC14, AAV.HSC15, AAV.HSC16, AAV.HSC17, AAVhu68, and combinations thereof. 66. A pharmaceutical composition comprising the recombinant polynucleotide of any one of embodiments 47-62 or the viral vector of any one of embodiments 63-65 and a pharmaceutically acceptable carrier. 67. A method of expressing a payload in a target cell of a subject, the method comprising: administering to the subject the recombinant polynucleotide of any one of embodiments 47-62, the viral vector of any one of embodiments 63-65, or the pharmaceutical composition of embodiment 66 to the subject; binding a cognate ligand, a coactivator, or a corepressor to the response element of the recombinant polynucleotide, wherein the cognate ligand or the coactivator is specific to the target cell, or wherein the corepressor is specific to a non-target cell; initiating transcription of the payload by recruiting a polymerase to the core promoter sequence; and transcribing the payload, thereby expressing the payload in the target cell. 68. The method of embodiment 67, comprising expressing the payload at a higher level in the cell type of interest than in the non-target cell. 69. The method of embodiment 67 or embodiment 68, comprising initiating transcription at a higher rate in the cell type of interest than in the non-target cell. 70. The method of any one of embodiments 67-69, wherein the cognate ligand or the coactivator is present at a higher level in the target cell than in the non-target cell. 71. The method of any one of embodiments 67-70, wherein the cognate ligand is a transcription factor. 72. The method of any one of embodiments 67-69, wherein the corepressor is present at a higher level in the non-target cell than in the target cell. 73. The method of any one of embodiments 67-72, wherein the target cell is a target cell state, and wherein the non-target cell is a non-target cell state. 74. The method of embodiment 73, wherein the target cell state is a diseased cell. 75. The method of embodiment 73 or embodiment 74, wherein the non-target cell state is a healthy cell. 76. The method of any one of embodiments 67-71, wherein the target cell is a target cell type, and wherein the non-target cell is a non-target cell type. 77. The method of embodiment 76, wherein the target cell type is a central nervous system cell, a neuron, a renal cell, a retinal cell, a hepatocyte, an epithelial cell, a muscle cell, an erythrocyte, a platelet, a bone marrow cell, an endothelial cell, an epidermal cell, a lymphocyte, a glial cell, an interstitial cell, an adipocyte, or a fibroblast. 78. The method of embodiment 76 or embodiment 77, wherein the non-target cell type is a central nervous system cell, a neuron, a renal cell, a retinal cell, a hepatocyte, an epithelial cell, a muscle cell, an erythrocyte, a platelet, a bone marrow cell, an endothelial cell, an epidermal cell, a lymphocyte, a glial cell, an interstitial cell, an adipocyte, a fibroblast, or combinations thereof. 79. A method of treating a disorder in a subject in need thereof, the method comprising: administering to the subject a composition comprising the recombinant polynucleotide of any one of embodiments 47-62, the viral vector of any one of embodiments 63-65, or the pharmaceutical composition of embodiment 66; and expressing a therapeutic sequence encoded by a payload of the recombinant polynucleotide in a target cell of the subject, thereby treating the disorder. 80. The method of embodiment 79, wherein the target cell is cell type associated with the disorder. 81. The method of embodiment 79, wherein the target cell is cell state associated with the disorder. 82. The method of any one of embodiments 79-81, wherein the disorder is a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. 83. The method of any one of embodiments 79-82, wherein the disorder is Rett syndrome, MECP2 duplication syndrome, frontotemporal dementia, neuronal ceroid lipofuscinosis, cancer, atherosclerosis, Alzheimer's disease, amyotrophic lateral sclerosis, limbic predominant age-related TDP-43 encephalopathy, or polycystic kidney disease. 84. The method of any one of embodiments 79-83, wherein the disorder is any one of the disorders provided in TABLE 5. 85. The method of any one of embodiments 79-84, wherein the therapeutic sequence encodes a therapeutic protein. 86. The method of embodiment 85, wherein the therapeutic protein is a neuronal protein, a kidney protein, a retinal protein, a muscle protein, or an apoptosis-inducing protein. 87. The method of embodiment 85 or embodiment 86, wherein the therapeutic protein is associated with a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. 88. The method of any one of embodiments 85-87, wherein the therapeutic protein is MECP2, progranulin, polycystin-1, or polycystin-2. 89. The method of any one of embodiments 85-88, wherein the therapeutic protein is encoded by a gene provided in TABLE 5. 90. The method of any one of embodiments 79-84, wherein the payload encodes a therapeutic polynucleotide. 91. The method of embodiment 90, wherein the therapeutic polynucleotide is a guide RNA or a suppressor tRNA. 92. The method of embodiment 90 or embodiment 91, wherein the therapeutic polynucleotide targets a gene provided in TABLE 5. 93. A method of identifying a switchable core promoter, the method comprising: introducing a core promoter library comprising a first sub-library and a second sub-library to a population of cells; wherein the first sub-library comprises a plurality of core promoter sequences, wherein a core promoter sequence of the plurality of core promoter sequences is linked to a first enhancer sequence, and a unique barcode sequence; wherein the second sub-library comprises the plurality of core promoter sequences, wherein the core promoter sequence of the plurality of promoter sequences is linked to a second enhancer sequence and, a unique barcode sequence; and identifying a switchable core promoter as the core promoter sequence that promotes higher transcription of the unique barcode when paired with the first enhancer sequence than when paired with the second enhancer sequence. 94. The method of embodiment 93, further comprising activating the first enhancer sequence. 95. The method of embodiment 93 or embodiment 94, wherein the second enhancer sequence is not activated. 96. The method of any one of embodiments 93-95, wherein the first enhancer sequence is an active enhancer and the second enhancer sequence is an inactive enhancer. 97. The method of any one of embodiments 93-96, wherein the first enhancer sequence is specific for the population of cells. 98. The method of any one of embodiments 93-97, wherein the population of cells are neurons, kidney cells, liver cells, muscle cells, or cancer cells. 99. The method of any one of embodiments 93-98, wherein the first enhancer sequence is specific for neurons, kidney cells, liver cells, muscle cells, or cancer cells. 100. The method of any one of embodiments 93-99, wherein the second enhancer sequence is specific for neurons, kidney cells, liver cells, muscle cells, or cancer cells. 101. The method of any one of embodiments 93-100, wherein the plurality of core promoter sequences comprises engineered core promoter sequences, synthetic core promoter sequences, wild type core promoter sequences, variant core promoter sequences, or combinations thereof.
The following embodiments recite non-limiting permutations of combinations of features disclosed herein. Other permutations of combinations of features are also contemplated. In particular, each of these numbered embodiments is contemplated as depending from or relating to every previous or subsequent numbered embodiment, independent of their order as listed. 1. An engineered core promoter comprising a TATA box, an initiator element, a spacer separating the TATA box and the initiator element, and one or more features selected from: a) the TATA box comprises a sequence having at least 85% sequence identity to any one of SEQ ID NO: 199-SEQ ID NO: 201 or SEQ ID NO: 213-SEQ ID NO: 215; b) the initiator element comprises a sequence having at least 75% sequence identity to any one of SEQ ID NO: 202-SEQ ID NO: 204, SEQ ID NO: 216, or SEQ ID NO: 217; c) the spacer is a short spacer comprising no less than 25 and no more than 29 nucleotide residues from the TATA box to the initiator element; d) the spacer is a long spacer comprising no less than 31 and no more than 48 nucleotide residues from the TATA box to the initiator element; e) a YY1 motif comprising a sequence having at least 80% sequence identity to any one of SEQ ID NO: 205-SEQ ID NO: 207; f) a transcriptional pause site comprising a sequence having at least 90% sequence identity to any one of SEQ ID NO: 196-SEQ ID NO: 198, SEQ ID NO: 211, or SEQ ID NO: 212; g) at least 70% sequence identity and at least one nucleotide substitution relative to SEQ ID NO: 5, SEQ ID NO: 9, or SEQ ID NO: 210; and h) combinations thereof. 2. The engineered core promoter of embodiment 1, comprising at least 70% sequence identity and nucleotide substitutions T15G and A40C relative to SEQ ID NO: 5. 3. The engineered core promoter of embodiment 1, comprising at least 70% sequence identity and nucleotide substitutions T15A and A40T relative to SEQ ID NO: 5. 4. The engineered core promoter of any one of embodiments 1-3, wherein the spacer comprises from 29 to 31 nucleotides from the TATA box to the initiator element. 5. The engineered core promoter of any one of embodiments 1-4, wherein the spacer comprises 30 nucleotides from the TATA box to the initiator element. 6. The engineered core promoter of any one of embodiments 1-3, wherein the short spacer comprises 28 nucleotide residues from the TATA box to the initiator element. 7. The engineered core promoter of any one of embodiments 1-3, wherein the long spacer comprises 32 nucleotide residues from the TATA box to the initiator element. 8. The engineered core promoter of any one of embodiments 1-7, wherein the TATA box comprises a sequence having at least 90% sequence identity to any one of SEQ ID NO: 199-SEQ ID NO: 201 or SEQ ID NO: 213-SEQ ID NO: 215. 9. The engineered core promoter of any one of embodiments 1-8, wherein the TATA box comprises a sequence of any one of SEQ ID NO: 199-SEQ ID NO: 201 or SEQ ID NO: 213-SEQ ID NO: 215. 10. The engineered core promoter of any one of embodiments 1-9, wherein the initiator element comprises a sequence having at least 85% sequence identity to any one of SEQ ID NO: 202-SEQ ID NO: 204, SEQ ID NO: 216, or SEQ ID NO: 217. 11. The engineered core promoter of any one of embodiments 1-10, wherein the initiator element comprises a sequence of any one of SEQ ID NO: 202-SEQ ID NO: 204, SEQ ID NO: 216, or SEQ ID NO: 217. 12. The engineered core promoter of any one of embodiments 1-11, wherein the YY1 motif comprises a sequence having at least 90% sequence identity to any one of SEQ ID NO: 205-SEQ ID NO: 207. 13. The engineered core promoter of any one of embodiments 1-12, wherein the YY1 motif comprises a sequence of any one of SEQ ID NO: 205-SEQ ID NO: 207. 14. The engineered core promoter of any one of embodiments 1-11, wherein the transcriptional pause site comprises a sequence having at least 95% sequence identity to any one of SEQ ID NO: 196-SEQ ID NO: 198, SEQ ID NO: 211, or SEQ ID NO: 212. 15. The engineered core promoter of any one of embodiments 1-11, wherein the transcriptional pause site comprises a sequence of any one of SEQ ID NO: 196-SEQ ID NO: 198, SEQ ID NO: 211, or SEQ ID NO: 212. 16. The engineered core promoter of any one of embodiments 1-15, wherein the engineered core promoter comprises two or more of the features. 17. The engineered core promoter of any one of embodiments 1-16, wherein the engineered core promoter comprises features a) and b). 18. The engineered core promoter of any one of embodiments 1-17, wherein the engineered core promoter comprises three or more of the features. 19. The engineered core promoter of any one of embodiments 1-18, wherein the engineered core promoter comprises features a) and b) and at least one more of the features. 20. The engineered core promoter of any one of embodiments 1-19, wherein the engineered core promoter comprises features a), b), and c). 21. The engineered core promoter of any one of embodiments 1-19, wherein the engineered core promoter comprises features a), b), and d). 22. The engineered core promoter of any one of embodiments 1-19, wherein the engineered core promoter comprises features a), b), and e). 23. The engineered core promoter of any one of embodiments 1-19, wherein the engineered core promoter comprises features a), b), and f). 24. An engineered core promoter comprising a sequence having at least 80%, at least 85%, at least 90%, at least 93%, at least 95%, at least 97%, at least 99%, or 100% sequence identity to any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144. 25. The engineered core promoter of embodiment 24, wherein the engineered core promoter comprises a sequence having at least 90% sequence identity to any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, or SEQ ID NO: 10-SEQ ID NO: 144. 26. The engineered core promoter of embodiment 23, wherein the engineered core promoter comprises a sequence having at least 80%, at least 85%, at least 90%, at least 93%, at least 95%, at least 97%, at least 99%, or 100% sequence identity to any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 10-SEQ ID NO: 15, or SEQ ID NO: 110. 27. The engineered core promoter of embodiment 26, wherein the engineered core promoter comprises a sequence having at least 90% sequence identity to any one of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 10-SEQ ID NO: 15, or SEQ ID NO: 110. 28. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 7. 29. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence of SEQ ID NO: 7. 30. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 1. 31. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence of SEQ ID NO: 1. 32. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 2. 33. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence of SEQ ID NO: 2. 34. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 3. 35. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence of SEQ ID NO: 3. 36. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 4 and including nucleotide substitutions T15G and A40T relative to SEQ ID NO: 5. 37. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence of SEQ ID NO: 4. 38. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 10. 39. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence of SEQ ID NO: 10. 40. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 11. 41. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence of SEQ ID NO: 11. 42. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 12. 43. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence of SEQ ID NO: 12. 44. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 13. 45. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence of SEQ ID NO: 13. 46. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 14. 47. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence of SEQ ID NO: 14. 48. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 15 and including nucleotide substitutions T15G and A40C relative to SEQ ID NO: 5. 49. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence of SEQ ID NO: 15. 50. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence having at least 90% sequence identity to SEQ ID NO: 110. 51. The engineered core promoter of any one of embodiments 23-27, wherein the engineered core promoter comprises a sequence of SEQ ID NO: 110. 52. The engineered core promoter of any one of embodiments 1-51, wherein the engineered core promoter comprises a TATA box, an RNA polymerase binding sequence, a B recognition element, a CCAAT box, or a Pribnow box. 53. The engineered core promoter of any one of embodiments 1-52, wherein the engineered core promoter is capable of recruiting a polymerase. 54. The engineered core promoter of embodiment 53, wherein the polymerase is an RNA polymerase II. 55. The engineered core promoter of any one of embodiments 1-54, wherein the engineered core promoter has a dynamic range that is higher than a dynamic range of SEQ ID NO: 6. 56. The engineered core promoter of any one of embodiments 1-55, wherein the engineered core promoter has a dynamic range that is higher than a dynamic range of SEQ ID NO: 9. 57. The engineered core promoter of any one of embodiments 1-56, wherein the engineered core promoter has an activity that is higher than an activity of SEQ ID NO: 6. 58. The engineered core promoter of any one of embodiments 1-57, wherein the engineered core promoter has an activity that is higher than an activity of SEQ ID NO: 9. 59. The engineered core promoter of any one of embodiments 1-58, wherein a transcriptional activity produced by the engineered core promoter paired with an activated response element is higher than a transcriptional activity produced by a core promoter of SEQ ID NO: 6 when paired with the activated response element, a basal transcriptional activity produced by the engineered core promoter paired with an activated response element is lower than a basal transcriptional activity produced by a core promoter of SEQ ID NO: 6 when paired with an inactive response element, or both. 60. The engineered core promoter of any one of embodiments 1-59, wherein a transcriptional activity produced by the engineered core promoter paired with an activated response element is higher than a transcriptional activity produced by a core promoter of SEQ ID NO: 9 when paired with the activated response element, a basal transcriptional activity produced by the engineered core promoter paired with an activated response element is lower than a basal transcriptional activity produced by a core promoter of SEQ ID NO: 9 when paired with an inactive response element, or both. 61. An engineered response element comprising a sequence having at least 80%, at least 85%, at least 90%, at least 93%, at least 95%, at least 97%, at least 99%, or 100% sequence identity to SEQ ID NO: 145 or SEQ ID NO: 146. 62. The engineered response element of embodiment 61, wherein the engineered response element comprises a sequence having at least 90% sequence identity to SEQ ID NO: 145 or SEQ ID NO: 146. 63. The engineered response element of embodiment 61 or embodiment 62, wherein the engineered response element comprises a sequence of SEQ ID NO: 145 or SEQ ID NO: 146. 64. The engineered response element of any one of embodiments 61-63, wherein the engineered response element is capable of binding to a cognate ligand, a coactivator, or a corepressor. 65. The engineered response element of embodiment 64, wherein the engineered response element is an enhancer. 66. The engineered response element of embodiment 64 or embodiment 65, wherein binding of the cognate ligand or the coactivator to the engineered response element increases transcription of a payload under transcriptional control of the engineered response element paired with a core promoter in a target cell type, a target cell state, or a target tissue. 67. The engineered response element of embodiment 64, wherein binding of the corepressor to the engineered response element decreases transcription of a payload under transcriptional control of the engineered response element paired with a core promoter in a non-target cell type, a non-target cell state, or a non-target tissue. 68. The engineered response element of any one of embodiments 61-67, wherein the engineered response element confers bone marrow-specific transcription of a payload under transcriptional control of the engineered response element paired with a core promoter. 69. The engineered response element of embodiment 68, comprising a sequence of SEQ ID NO: 145. 70. The engineered response element of any one of embodiments 61-67, wherein the engineered response element confers liver-specific transcription of a payload under transcriptional control of the engineered response element paired with a core promoter. 71. The engineered response element of embodiment 70, comprising a sequence of SEQ ID NO: 146. 72. An engineered promoter comprising a response element and a core promoter; wherein the response element comprises a sequence having at least 80%, at least 85%, at least 90%, at least 93%, at least 95%, at least 97%, at least 99%, or 100% sequence identity to SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194, and wherein the core promoter comprises a sequence having at least 80%, at least 85%, at least 90%, at least 93%, at least 95%, at least 97%, at least 99%, or 100% sequence identity to of any one of SEQ ID NO: 1-SEQ ID NO: 144 or SEQ ID NO: 210. 73. The engineered promoter of embodiment 72, wherein the core promoter comprises a sequence having at least 90% sequence identity to any one of SEQ ID NO: 1-SEQ ID NO: 144 or SEQ ID NO: 210. 74. The engineered promoter of embodiment 72 or embodiment 73, wherein the core promoter comprises a sequence of SEQ ID NO: 1-SEQ ID NO: 144 or SEQ ID NO: 210. 75. The engineered promoter of any one of embodiments 72-74, wherein the engineered promoter comprises a sequence of any one of SEQ ID NO: 148-SEQ ID NO: 177 or SEQ ID NO: 218-SEQ ID NO: 255. 76. An engineered promoter comprising a response element and the engineered core promoter of any one of embodiments 1-60. 77. The engineered promoter of any one of embodiments 72-76, wherein the response element is capable of binding to a cognate ligand, a coactivator, or a corepressor. 78. The engineered promoter of embodiment 77, wherein the response element is an enhancer. 79. The engineered promoter of embodiment 77 or embodiment 78, wherein binding of the cognate ligand or the coactivator to the response element increases transcription of a payload under transcriptional control of the engineered promoter in a target cell type, a target cell state, or a target tissue. 80. The engineered promoter of embodiment 77, wherein binding of the corepressor to the response element decreases transcription of a payload under transcriptional control of the engineered promoter in a non-target cell type, a non-target cell state, or a non-target tissue. 81. The engineered promoter of any one of embodiments 72-80, wherein the response element confers bone marrow-specific transcription of a payload under transcriptional control of the engineered promoter. 82. The engineered promoter of embodiment 81, wherein the response element comprises a sequence having at least 90% sequence identity to SEQ ID NO: 145 or SEQ ID NO: 193. 83. The engineered promoter of embodiment 81 or embodiment 82, wherein the response element comprises a sequence of SEQ ID NO: 145 or SEQ ID NO: 193. 84. The engineered promoter of any one of embodiments 72-80, wherein the response element confers liver-specific transcription of a payload under transcriptional control of the engineered promoter. 85. The engineered promoter of embodiment 84, wherein the response element comprises a sequence having at least 90% sequence identity to SEQ ID NO: 146 or SEQ ID NO: 194. 86. The engineered promoter of embodiment 84 or embodiment 85, wherein the response element comprises a sequence of SEQ ID NO: 146 or SEQ ID NO: 194. 87. The engineered promoter of any one of embodiments 72-80, wherein the response element confers neuron-specific transcription of a payload under transcriptional control of the engineered promoter. 88. The engineered promoter of any one of embodiments 72-80, wherein the response element confers muscle-specific transcription of a payload under transcriptional control of the engineered promoter. 89. The engineered promoter of any one of embodiments 72-80, wherein the response element confers kidney-specific transcription of a payload under transcriptional control of the engineered promoter. 90. A recombinant polynucleotide comprising the engineered core promoter of any one of embodiments 1-60 and a payload comprising a coding sequence under transcriptional control of the engineered core promoter. 91. A recombinant polynucleotide comprising a response element comprising a sequence having at least 80%, at least 85%, at least 90%, at least 93%, at least 95%, at least 97%, at least 99%, or 100% sequence identity to SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194 and a payload comprising a coding sequence under transcriptional control of the response element. 92. The recombinant polynucleotide of embodiment 91, wherein the response element comprises a sequence having at least 90% sequence identity to SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194. 93. The recombinant polynucleotide of embodiment 91 or embodiment 92, wherein the response element comprises a sequence of SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194. 94. A recombinant polynucleotide comprising a promoter and a payload, wherein the promoter comprises: a response element capable of binding to a cognate ligand, a coactivator, or a corepressor; and the engineered core promoter of any one of embodiments 1-60 capable of recruiting a polymerase; wherein the payload comprises a coding sequence under transcriptional control of the promoter. 95. The recombinant polynucleotide of embodiment 94, wherein the response element is an enhancer. 96. The recombinant polynucleotide of embodiment 94 or embodiment 95, wherein binding of the cognate ligand or the coactivator to the response element increases transcription of the payload in a target cell type, a target cell state, or a target tissue. 97. The recombinant polynucleotide of embodiment 94, wherein binding of the corepressor to the response element decreases transcription of the payload in a non-target cell type, a non-target cell state, or a non-target tissue. 98. The recombinant polynucleotide of any one of embodiments 94-97, wherein the response element confers neuron-specific transcription of the payload. 99. The recombinant polynucleotide of any one of embodiments 94-97, wherein the response element confers bone marrow-specific transcription of the payload. 100. The recombinant polynucleotide of embodiment 99, wherein the response element comprises a sequence having at least 90% sequence identity to SEQ ID NO: 145 or SEQ ID NO: 193. 101. The recombinant polynucleotide of embodiment 99 or embodiment 11, wherein the response element comprises a sequence of SEQ ID NO: 145 or SEQ ID NO: 193. 102. The recombinant polynucleotide of any one of embodiments 94-97, wherein the response element confers liver-specific transcription of the payload. 103. The recombinant polynucleotide of embodiment 102, wherein the response element comprises a sequence having at least 90% sequence identity to SEQ ID NO: 146 or SEQ ID NO: 194. 104. The recombinant polynucleotide of embodiment 102 or embodiment 103, wherein the response element comprises a sequence of SEQ ID NO: 146 or SEQ ID NO: 194. 105. The recombinant polynucleotide of any one of embodiments 94-97, wherein the response element confers muscle-specific transcription of the payload. 106. The recombinant polynucleotide of any one of embodiments 94-97, wherein the response element confers kidney-specific transcription of the payload. 107. A recombinant polynucleotide comprising a promoter and a payload, wherein the promoter comprises: a response element capable of binding to a cognate ligand, a coactivator, or a corepressor, wherein the response element comprises a sequence having at least 90% sequence identity to SEQ ID NO: 145, SEQ ID NO: 146, SEQ ID NO: 193, or SEQ ID NO: 194; and an engineered core promoter capable of recruiting a polymerase; wherein the payload comprises a coding sequence under transcriptional control of the promoter. 108. The recombinant polynucleotide of any one of embodiments 90-107, wherein the payload encodes a protein. 109. The recombinant polynucleotide of embodiment 108, wherein the protein is a neuronal protein, a kidney protein, a retinal protein, a muscle protein, or an apoptosis-inducing protein. 110. The recombinant polynucleotide of embodiment 108 or embodiment 109, wherein the protein is associated with a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. 111. The recombinant polynucleotide of any one of embodiments 108-110, wherein the protein is progranulin, MeCP2, polycystin-1, or polycystin-2. 112. The recombinant polynucleotide of any one of embodiments 90-107, wherein the payload encodes a therapeutic polynucleotide. 113. The recombinant polynucleotide of embodiment 112, wherein the therapeutic polynucleotide is a guide RNA or a suppressor tRNA. 114. The recombinant polynucleotide of embodiment 112 or embodiment 113, wherein the therapeutic polynucleotide targets a gene. 115. The recombinant polynucleotide of embodiment 114, wherein the gene is associated with a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. 116. The recombinant polynucleotide of embodiment 114 or embodiment 115, wherein the gene is GRN, MECP2, PKD2, or PKD2. 117. An engineered viral vector comprising the engineered core promoter of any one of embodiments 1-60 in a viral vector. 118. An engineered viral vector comprising the engineered response element of any one of embodiments 61-71 in a viral vector. 119. An engineered viral vector comprising the engineered promoter of any one of embodiments 72-89 in a viral vector. 120. An engineered viral vector comprising the recombinant polynucleotide of any one of embodiments 90-116 in a viral vector. 121. The engineered viral vector of any one of embodiments 117-120, wherein the viral vector is an adenoviral vector, an adeno-associated viral vector, or a lentivector. 122. The engineered viral vector of embodiment 121, wherein the adeno-associated viral vector is selected from the group consisting of AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, AAV 10, AAV11, AAV12, AAV13, AAV14, AAV15, AAV16, AAV-DJ, AAV-DJ/8, AAV-DJ/9, AAV1/2, AAV.rh8, AAV.rh10, AAV.rh20, AAV.rh39, AAV.Rh43, AAV.Rh74, AAV.v66, AAV.Oligo001, AAV.SCH9, AAV.r3.45, AAV.RHM4-1, AAV.hu37, AAV.Anc80, AAV.Anc80L65, AAV.7m8, AAV.PhP.eB, AAV.PhP. V1, AAV.PHP.B, AAV.PhB.C1, AAV.PhB.C2, AAV.PhB.C3, AAV.PhB.C6, AAV.cy5, AAV2.5, AAV2tYF, AAV3B, AAV.LK03, AAV.HSC1, AAV.HSC2, AAV.HSC3, AAV.HSC4, AAV.HSC5, AAV.HSC6, AAV.HSC7, AAV.HSC8, AAV.HSC9, AAV.HSC10, AAV.HSC11, AAV.HSC12, AAV.HSC13, AAV.HSC14, AAV.HSC15, AAV.HSC16, AAV.HSC17, AAVhu68, and combinations thereof. 123. A cell comprising the engineered core promoter of any one of embodiments 1-60. 124. A cell comprising the engineered response element of any one of embodiments 61-71. 125. A cell comprising the engineered promoter of any one of embodiments 72-89. 126. A cell comprising the recombinant polynucleotide of any one of embodiments 90-116. 127. A cell comprising the engineered viral vector of any one of embodiments 117-122. 128. The cell of any one of embodiments 123-127, wherein the cell is a target cell. 129. The cell of any one of embodiments 123-128, wherein the cell is a diseased cell. 130. The cell of any one of embodiments 123-129, wherein the cell is a central nervous system cell, a neuron, a renal cell, a retinal cell, a hepatocyte, an epithelial cell, a muscle cell, an erythrocyte, a platelet, a bone marrow cell, an endothelial cell, an epidermal cell, a lymphocyte, a glial cell, an interstitial cell, an adipocyte, or a fibroblast. 131. A pharmaceutical composition comprising the engineered core promoter of any one of embodiments 1-60 and a pharmaceutically acceptable carrier. 132. A pharmaceutical composition comprising the engineered response element of any one of embodiments 61-71 and a pharmaceutically acceptable carrier. 133. A pharmaceutical composition comprising the engineered promoter of any one of embodiments 72-89 and a pharmaceutically acceptable carrier. 134. A pharmaceutical composition comprising the recombinant polynucleotide of any one of embodiments 90-116 and a pharmaceutically acceptable carrier. 135. A pharmaceutical composition comprising the engineered viral vector of any one of embodiments 117-122 and a pharmaceutically acceptable carrier. 136. A method of expressing a payload in a target cell of a subject, the method comprising: administering to the subject the recombinant polynucleotide of any one of embodiments 90-116, the viral vector of any one of embodiments 117-122, or the pharmaceutical composition of embodiment 134 or embodiment 135 to the subject; binding a cognate ligand, a coactivator, or a corepressor to the response element of the recombinant polynucleotide, wherein the cognate ligand or the coactivator is specific to the target cell, or wherein the corepressor is specific to a non-target cell; initiating transcription of the payload by recruiting a polymerase to the core promoter; and transcribing the payload, thereby expressing the payload in the target cell. 137. The method of embodiment 136, comprising expressing the payload at a higher level in the target cell than in the non-target cell. 138. The method of embodiment 136 or embodiment 137, comprising initiating transcription at a higher rate in the target cell than in the non-target cell. 139. The method of any one of embodiments 136-138, wherein the cognate ligand or the coactivator is present at a higher level in the target cell than in the non-target cell. 140. The method of any one of embodiments 136-139, wherein the cognate ligand is a transcription factor. 141. The method of any one of embodiments 136-140, wherein the corepressor is present at a higher level in the non-target cell than in the target cell. 142. The method of any one of embodiments 136-141, wherein the target cell is a target cell state, and wherein the non-target cell is a non-target cell state. 143. The method of embodiment 142, wherein the target cell state is a diseased cell. 144. The method of embodiment 142 or embodiment 143, wherein the non-target cell state is a healthy cell. 145. The method of any one of embodiments 136-140, wherein the target cell is a target cell type, and wherein the non-target cell is a non-target cell type. 146. The method of embodiment 145, wherein the target cell type is a central nervous system cell, a neuron, a renal cell, a retinal cell, a hepatocyte, an epithelial cell, a muscle cell, an erythrocyte, a platelet, a bone marrow cell, an endothelial cell, an epidermal cell, a lymphocyte, a glial cell, an interstitial cell, an adipocyte, or a fibroblast. 147. The method of embodiment 145 or embodiment 146, wherein the non-target cell type is a central nervous system cell, a neuron, a renal cell, a retinal cell, a hepatocyte, an epithelial cell, a muscle cell, an erythrocyte, a platelet, a bone marrow cell, an endothelial cell, an epidermal cell, a lymphocyte, a glial cell, an interstitial cell, an adipocyte, a fibroblast, or combinations thereof. 148. A method of treating a disorder in a subject in need thereof, the method comprising: administering to the subject a composition comprising the recombinant polynucleotide of any one of embodiments 90-116, the viral vector of any one of embodiments 117-122, or the pharmaceutical composition of embodiment 134 or embodiment 135; and expressing a therapeutic sequence encoded by a payload of the recombinant polynucleotide in a target cell of the subject, thereby treating the disorder. 149. The method of embodiment 148, wherein the target cell is a cell type associated with the disorder. 150. The method of embodiment 148, wherein the target cell is a cell state associated with the disorder. 151. The method of any one of embodiments 148-150, wherein the disorder is a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. 152. The method of any one of embodiments 148-151, wherein the disorder is Rett syndrome, MECP2 duplication syndrome, frontotemporal dementia, neuronal ceroid lipofuscinosis, cancer, atherosclerosis, Alzheimer's disease, amyotrophic lateral sclerosis, limbic predominant age-related TDP-43 encephalopathy, or polycystic kidney disease. 153. The method of any one of embodiments 148-152, wherein the disorder is any one of the disorders provided in TABLE 5. 154. The method of any one of embodiments 148-153, wherein the therapeutic sequence encodes a therapeutic protein. 155. The method of embodiment 154, wherein the therapeutic protein is a neuronal protein, a kidney protein, a retinal protein, a muscle protein, or an apoptosis-inducing protein. 156. The method of embodiment 154 or embodiment 155, wherein the therapeutic protein is associated with a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. 157. The method of any one of embodiments 154-156, wherein the therapeutic protein is MECP2, progranulin, polycystin-1, or polycystin-2. 158. The method of any one of embodiments 154-157, wherein the therapeutic protein is encoded by a gene provided in TABLE 5. 159. The method of any one of embodiments 148-158, wherein the payload encodes a therapeutic polynucleotide. 160. The method of embodiment 159, wherein the therapeutic polynucleotide is a guide RNA or a suppressor tRNA. 161. The method of embodiment 159 or embodiment 160, wherein the therapeutic polynucleotide targets a gene provided in TABLE 5. 162. A method of identifying a switchable core promoter, the method comprising: introducing a core promoter library comprising a first sub-library and a second sub-library to a population of cells; wherein the first sub-library comprises a plurality of core promoters, wherein a core promoter of the plurality of core promoters is linked to a first response element, and a unique barcode sequence; wherein the second sub-library comprises the plurality of core promoters, wherein the core promoter of the plurality of promoters is linked to a second response element and, a unique barcode sequence; and identifying a switchable core promoter as the core promoter that promotes higher transcription of the unique barcode when paired with the first response element than when paired with the second response element. 163. The method of embodiment 162, further comprising activating the first response element. 164. The method of embodiment 162 or embodiment 163, wherein the second response element is not activated. 165. The method of any one of embodiments 162-164, wherein the first response element is an activated response element and the second enhancer sequence is an inactive response element or an unactivated response element. 166. The method of any one of embodiments 162-165, wherein the first response element is specific for the population of cells. 167. The method of any one of embodiments 162-166, wherein the population of cells are neurons, kidney cells, liver cells, muscle cells, or cancer cells. 168. The method of any one of embodiments 162-167, wherein the first response element is specific for neurons, kidney cells, liver cells, muscle cells, or cancer cells. 169. The method of any one of embodiments 162-168, wherein the second response element is specific for neurons, kidney cells, liver cells, muscle cells, or cancer cells. 170. The method of any one of embodiments 162-169, wherein the plurality of core promoters comprises engineered core promoters, synthetic core promoters, wild type core promoters, variant core promoters, or combinations thereof. 171. The recombinant polynucleotide of any one of embodiments 90-116, the viral vector of any one of embodiments 117-122, or the pharmaceutical composition of embodiment 134 or embodiment 135; for use in a method of treating a disorder, the method comprising administering to a subject a composition comprising the recombinant polynucleotide, viral vector or pharmaceutical composition and expressing a therapeutic sequence encoded by a payload of the recombinant polynucleotide in a target cell of the subject. 172. The recombinant polynucleotide, viral vector, or pharmaceutical composition for use of embodiment 171, wherein the disorder treated is a genetic disorder, a neuronal disorder, a kidney disorder, an eye disorder, a muscular disorder, or a cancer. 173. The recombinant polynucleotide, viral vector, or pharmaceutical composition for use of any one of embodiments 171-172, wherein the disorder is Rett syndrome, MECP2 duplication syndrome, frontotemporal dementia, neuronal ceroid lipofuscinosis, cancer, atherosclerosis, Alzheimer's disease, amyotrophic lateral sclerosis, limbic predominant age-related TDP-43 encephalopathy, or polycystic kidney disease. 174. The recombinant polynucleotide, viral vector, or pharmaceutical composition for use of any one of embodiments 171-173, wherein the disorder is any one of the disorders provided in TABLE 5.
The invention is further illustrated by the following non-limiting examples.
This example describes a massively parallel reporter assay (MPRA) screen to identify switchable core promoters. A goal of the screen was to identify core promoters that promote high levels of transcription when paired with an activated cell type-specific enhancer sequence but exhibit low basal transcriptional initiation.
A library of 3,649 core promoters was designed, containing variants of existing activatable core promoter sequences as well as fully de novo synthetic core promoter sequences. The variants of existing core promoters were designed by mutagenizing existing synthetic core promoters, including ybTATA (SEQ ID NO: 9) and minP (SEQ ID NO: 5) from pGL4. Synthetic core promoters were developed by randomly generating matches to TATA and the initiator element, Inr across a range of predicted similarity scores. The final library contained 2157 sequence variants and 1484 fully de novo synthetic sequences.
Enhancer sequences were selected to pair with core promoter library members to screen for high levels of cell state-specific transcriptional activation and low levels of basal transcriptional activation in a hepatoblast cell line (HepG2). Enhancers were selected based on cell type-specific activity in HepG2 cells and a myeloid cell line (K562). Literature MPRA results were used to identify endogenous enhancers with cell type-specific activity by comparing activation levels of different enhancers in HepG2 cells (a cancer cell model of hepatocytes) and in K562 cells (a cancer cell model of myeloid cells), as shown in
Each member of the core promoter library was cloned into six different constructs containing one of each the five different enhancer sequences identified in the enhancer screens (“HepG2 Syn” (SEQ ID NO: 146), “K562 Syn” (SEQ ID NO: 145), “HepG2 End” (SEQ ID NO: 194), or “K562 End” (SEQ ID NO: 193)), a negative control enhancer (“Negative,” SEQ ID NO: 147) or a random sequence with no enhancer activity (“Random”). The resulting constructs generated for each core promoter library member are illustrated in
The resulting 21,849 enhancer-promoter constructs were screened in HepG2 and K562 cell lines for cell type-specific transcriptional activation. Transcriptional activity of each enhancer-promoter construct was measured using a massively parallel reporter assay, in HepG2 cells and K562 cells, as shown in
An orthogonal measurement of fluorescent marker expression from a dual reporter construct assessed by flow cytometry in HepG2 cells, as shown in
Cell-specific activation of the 3,649 core promoters in HepG2 cells was assessed by comparing transcriptional activity of each core promoter when paired with a HepG2-specific enhancer (HepEndo or HepSyn) or a K562-specific enhancer (KEndo or KSyn) relative to either a negative control enhancer or no enhancer. Fold activation in HepG2 cells with different synthetic enhancers (KSyn or HepSyn) relative to no enhancer or a negative control enhancer is shown in
To further validate the MPRA screen, core promoter activity was compared for core promoters with different random sequences inserted into the spacing between the TATA and Inr sequences of the core promoter.
Members of the core promoter library were evaluated based on level of activation in HepG2 cells when paired with a HepG2-specific enhancer (core promoter strength) as well as relative activation levels when paired with a HepG2-specific enhancer as compared to when paired with either no enhancer, a negative control enhancer, or a different cell type-specific enhancer (core promoter dynamic range). SEQ ID NO: 1-SEQ ID NO: 3, SEQ ID NO: 7, SEQ ID NO: 10-SEQ ID NO: 14, and SEQ ID NO: 16-SEQ ID NO: 144 were identified as the top core promoter candidates in from the MPRA screen based on strength and dynamic range. Core promoters of SEQ ID NO: 1-SEQ ID NO: 3, SEQ ID NO: 7, and SEQ ID NO: 10-SEQ ID NO: 14 were selected for further evaluation. Switchable core promoter candidates retained the ability to act as a switch with a wide dynamic range across different enhancers and different cell lines. A switchable core promoter when paired with SEQ ID NO: 145, would display low activity in HepG2 cells but high activity in K562 and, when paired with SEQ ID NO: 146, would display high activity in HepG2 but low activity in K562 cells.
This example describes specific transcriptional activation by core promoters identified in the MPRA screen described in EXAMPLE 1. Specific activation of core promoters was determined by comparing transcriptional activation of each core promoter when paired with an active, cell-type specific enhancer relative to transcriptional activation when paired with an inactive enhancer (SEQ ID NO: 147; GCCACTGCACTCCGTGCTGGGCAACTGAGTGAGACCCCATCTCAAAAAACTCAAAA AAGAGTTAAAATAAATCACTGTCTATTGTGCCAAAGAGCTGTTGAAGTCATCATTCT TAGAAGAATAGACAGATGGTATTTCAAGGCTT) or a random sequence with no enhancer activity (SEQ ID NO: 195; AAAACGACGGCCAGTGAATTGACGCGTATTGGGATGGAACGGCCTCCACGGCCACT AGT). The active enhancers were selected from a synthetic K562 cell type-specific enhancer (SEQ ID NO: 145; TGCTATTTTTAGCGGGCTTTTTTTGACGGGAGGAAGGAAGGAGGGAGAGGGACGGG AAGTAAAGAGAAAAAGAGGAAGTGAAAGCTAAGAAGGAAGTGACGGCTGGCGGG GACAGATAAGAGTTGTCTAGTTGTGATAATGGAACTGCTGAGTCATGGATTGCAGA GTCAC), a synthetic HepG2 cell-type specific enhancer (SEQ ID NO: 146; GGTTAATGATTAACCGTGTAAATAATTAGCGGATCACGTGATGGTCACGTGTTGGTT CAAGGCCAGAGTCAAGGTCGGGAGTCCAAAGTCCAGAAGTGCAAGGTCCGGTGTTT ACTTTGGGTGTTTACCTTCC), an endogenous K562 cell type-specific enhancer (SEQ ID NO: 193; GGTTAATGATTAACCGTGTAAATAATTAGCGGATCACGTGATGGTCACGTGTTGGTT CAAGGCCAGAGTCAAGGTCGGGAGTCCAAAGTCCAGAAGTGCAAGGTCCGGTGTTT ACTTTGGGTGTTTACCTTCC), or an endogenous HepG2 cell-type specific enhancer (SEQ ID NO: 194; AGAAACACGGCACAGATGACGGCTTTAAAAGAAAAGAGTCCATTGACTGGAAAAG CAAACAGTATGATGGTCAAAGGCCAAGAATCAGAAAACAGAGATTTTTCCTCTTCT CTTGTTTCACAAAGTAAATAAACATGAGACATAT).
Core promoter expression constructs were generated by piecing together an enhancer (e.g., any one of SEQ ID NO: 145-SEQ ID NO: 147, SEQ ID NO: 193, SEQ ID NO: 194, or a random sequence with no enhancer activity) and a core promoter (e.g., any one of SEQ ID NO: 1-SEQ ID NO: 3, SEQ ID NO: 7, SEQ ID NO: 10-SEQ ID NO: 13, and SEQ ID NO: 16-SEQ ID NO: 144), such that the construct sequence included, from 5′ to 3′, the Enhancer followed by the Core Promoter. Transcriptional activation for each of the enhancer/promoter constructs was measured in HepG2 cells, and fold activation was determined by measuring transcriptional activation when paired with an active enhancer divided by transcriptional activation when paired with an inactive enhancer or no enhancer.
Fold activation with select core promoters (SEQ ID NO: 1-SEQ ID NO: 3, SEQ ID NO: 7, SEQ ID NO: 10-SEQ ID NO: 13, and SEQ ID NO: 16-SEQ ID NO: 144) identified from the MPRA screen are provided in TABLE 6.
As shown in TABLE 6, fold activation of transcription was higher with HepG2 cell type-specific enhancers in HepG2 cells than with K562 cell-type specific enhancers in HepG2 cells, suggesting that the identified core promoters selectively initiated transcription in the presence of enhancers specific to the given cell type while exhibiting low transcriptional activation in the presence of enhancers specific for other cell types.
This example describes further analysis of specific transcriptional activation by core promoters identified using the MPRA screening methods described in EXAMPLE 1.
Specific activation of select core promoters (SEQ ID NO: 1-SEQ ID NO: 15) was determined by comparing transcriptional activation of each core promoter when paired with an active, cell-type specific enhancer relative to transcriptional activation when paired with an inactive enhancer (SEQ ID NO: 147) or no enhancer (SEQ ID NO: 195). The active enhancers were selected from a synthetic K562 cell type-specific enhancer (SEQ ID NO: 145), a synthetic HepG2 cell-type specific enhancer (SEQ ID NO: 146), an endogenous K562 cell type-specific enhancer (SEQ ID NO: 193), or an endogenous HepG2 cell-type specific enhancer (SEQ ID NO: 194). The core promoter of SEQ ID NO: 8 (GGGTACGCCCCTTTTTATGCGCGTGATTACTGCACAGGAATTGG) was included as a negative control, corresponding to a core promoter that exhibited low transcriptional activation.
Core promoter expression constructs were generated by piecing together an enhancer (e.g., any one of SEQ ID NO: 145-SEQ ID NO: 147, SEQ ID NO: 193, SEQ ID NO: 194, or a random sequence with no enhancer activity) and a core promoter (e.g., any one of SEQ ID NO: 1-SEQ ID NO: 15), such that the construct sequence included, from 5′ to 3′, Enhancer followed by the Core Promoter. Select enhancer+core promoter sequences that were assayed are provided in TABLE 7.
Transcriptional activation for each of the enhancer/promoter constructs was measured in HepG2 cells, and fold activation was determined by measuring transcriptional activation when paired with an active enhancer divided by transcriptional activation when paired with an inactive enhancer or a random sequence with no enhancer activity (no enhancer).
Fold activation with select core promoters (SEQ ID NO: 1-SEQ ID NO: 15) is provided in TABLE 8.
As shown in TABLE 8, fold activation of transcription was higher with HepG2 cell type-specific enhancers in HepG2 cells than with K562 cell-type specific enhancers in HepG2 cells, suggesting that the identified core promoters selectively initiated transcription in the presence of enhancers specific to the given cell type while exhibiting low transcriptional activation in the presence of enhancers specific for other cell types.
This example describes the effect of spacing between TATA and Inr in the core promoter on transcriptional activation. Transcriptional activation of core promoters with a short spacing, corresponding to a deletion of two bases in between TATA and Inr, or a longer spacing was compared, corresponding a standard 30 nucleotide spacing from TATA to Inr wherein the number of nucleotide residues from the TATA box to the initiator element are counted from the first T in the TATA box to the A in the CAG in the initiator element (or the corresponding CAG in the parent sequence if the initiator element is mutated), inclusive. Examples of core promoters with the short, 28-nucleotide spacing include SEQ ID NO: 7, SEQ ID NO: 10, and SEQ ID NO: 109-SEQ ID NO: 125. Examples of core promoters with standard, 30-nucleotide spacing include SEQ ID NO: 13 and SEQ ID NO: 126-SEQ ID NO: 144.
The activity of core promoters with a short spacing, having a length of 28 nucleotides, or a standard spacing, having a length of 30 nucleotides, was compared when paired with a synthetic HepG2 cell type-specific enhancer (SEQ ID NO: 146) in HepG2 cells, as shown in
This example describes the effect of a pause site in the core promoter on transcriptional activation. Transcriptional activity and fold activation of core promoters with and without a transcriptional pause site was compared. Ten different transcriptional pause sites were tested in the context of 20 different core promoters. Examples of core promoter sequences with a pause site include SEQ ID NO: 1, SEQ ID NO: 11, and SEQ ID NO: 16-SEQ ID NO: 21. Examples of pause site sequences include SEQ ID NO: 196-SEQ ID NO: 198, SEQ ID NO: 211, or SEQ ID NO: 212. Examples of core promoter sequences without the pause site include SEQ ID NO: 13 and SEQ ID NO: 126-SEQ ID NO: 144.
This example describes the effect of a YY1 transcription factor motif in the core promoter on transcription activation. Transcriptional activity of core promoters with and without a YY1 motif after transfection into HepG2 cells was compared. Examples of core promoter sequences with a YY1 motif include SEQ ID NO: 2, SEQ ID NO: 12, and SEQ ID NO: 31-SEQ ID NO: 68. Examples of a YY1 transcription factor motif include SEQ ID NO: 205-SEQ ID NO: 208.
Canonically, a YY1 motif located immediately downstream of the Inr in a core promoter sequence is associated with active promoters. To test the effect of YY1 motifs on cell type-specific activation, the transcriptional activity of 288 fully synthetic promoters with a YY1 motif was compared to core promoters lacking a YY1 motif after transfection into HepG2 cells. As shown in
Furthermore, as shown in
This example describes transcriptional activation of ybTATA and minP variants, including variations in the TATA sequence and the Inr sequence. Transcriptional activity of core promoters with variations in the TATA sequence after transfection into HepG2 cells was compared. Variants of a ybTATA founder sequence (SEQ ID NO: 9) and a minP founder sequence (SEQ ID NO: 5) were paired with a synthetic HepG2 enhancer (SEQ ID NO: 146) screened for transcriptional activation in HepG2. Transcriptional activity of ybTATA variants and minP variants are shown in
Double mutants were screened in the context of the minP founder sequence (SEQ ID NO: 5), as shown in
This example describes characterization of candidate switchable core promoters identified using the assays described in EXAMPLE 1-EXAMPLE 8. Candidate switchable core promoters, corresponding to SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, and SEQ ID NO: 10-SEQ ID NO: 15 were selected based on high transcriptional activation activity and low background transcriptional activity, corresponding to high fold activation.
SEQ ID NO: 1, SEQ ID NO: 11, and SEQ ID NO: 14 included a pause site; SEQ ID NO: 2 and SEQ ID NO: 12 included a YY1 motif, SEQ ID NO: 3 included a long spacing between TATA and Inr; SEQ ID NO: 4 included T15A and A40T substitutions relative to minP (SEQ ID NO: 5); SEQ ID NO: 6 (GTACTTATATAAGGGGGTGGGGGCGCGTTCGTCCTCAGTCGCGATCGAACACTCGA GCCGAGCAGACGTGCCTACGGACCG) showed high transcriptional activation but also high background; SEQ ID NO: 7 and SEQ ID NO: 10 included a short spacing between TATA and Inr; SEQ ID NO: 13 was a fully synthetic promoter with strong transcriptional activation; and SEQ ID NO: 15 included T15G and A40C substitutions relative to minP. SEQ ID NO: 8 showed low transcriptional activation and was included as a negative control. SEQ ID NO: 5 (minP) and SEQ ID NO: 9 (ybTATA) were included for comparison.
Transcriptional activation properties of SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, and SEQ ID NO: 10-SEQ ID NO: 15 were evaluated in HepG2 cells.
The core promoter of SEQ ID NO: 7 was identified as having high transcriptional activation and high dynamic range. As shown in
This example describes the validation of several core promoters in both K562 lymphoblast and HepG2 liver cancer cell lines with either a K562-specific enhancer or a HepG2-specific enhancer (“HSyn”). Screened core promoters included SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, and SEQ ID NO: 15. The core promoters were each cloned into constructs containing either a synthetic K562-specific enhancer or a synthetic HepG2-specific enhancer, and the constructs were tested for transcriptional activity. The core promoter activity was determined using an MPRA screen described in EXAMPLE 1 and using a dual reporter quantitative PCR (qPCR) assay (the dual reporter flow assay as described in EXAMPLE 1, except using qPCR for the readout instead of MFI)
A comparison of promoter activity determined by the MPRA screen (“Screen Activity”) and by dual reporter qPCR assay (“qPCR Activity”) for promoters with a K562-specific enhancer in HepG2 cells is shown in
This example describes the activity of a minP variant core promoter relative to a ybTATA core promoter in both H4 neuroglioma and HepG2 liver cancer cell lines. The minP variant core promoter (SEQ ID NO: 4, “minP_T14A_A39T”) and the ybTATA core promoter (SEQ ID NO: 9) were each cloned into three constructs containing one of two different H4-specific enhancers (“H4-specific enhancer 1” and “H4-specific enhancer 2”) or a liver-specific enhancer. Activity of the core promoters in both the H4 and HepG2 cell lines was measured by a dual reporter assay to measure activity. A comparison of the transcriptional activity of the minP variant of SEQ ID NO: 4 (“minP Activity”) and the ybTATA promoter of SEQ ID NO: 9 (“yb_TATA Activity”) in H4 cells when paired with either the H4-specific enhancers or the liver-specific enhancer is shown in
This example describes transcription of an excitatory neuron-specific exogenous progranulin in the central nervous system (CNS). A CNS-specific progranulin DNA construct containing a progranulin coding sequence, a CNS-specific enhancer region, and a switchable core promoter is constructed. The switchable core promoter is SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 10-SEQ ID NO: 15, or SEQ ID NO: 110. The enhancer region sequence contains a transcription factor binding site that binds a CNS-specific transcription factor. The progranulin construct is encapsidated in an adeno-associated virus and delivered to cells of a subject. Optionally, the progranulin construct is integrated into a recombinant polynucleotide cassette comprising one or more of a 5′UTR effector region, 3′UTR effector region, codon optimized progranulin sequence, or introns (natural and/or synthetic) for modulating translation of the RNA coding for progranulin before being encapsidated for AAV delivery. The exogenous progranulin encoded by the progranulin construct is transcribed at higher levels in excitatory neurons than in other non-neuronal cell types or non-CNS tissue, including hepatocytes or liver tissue. CNS-specific transcription of the progranulin transgene results in CNS-specific expression of the exogenous progranulin.
This example describes kidneys-specific transcription of exogenous PKD2 in renal tissue. A kidney-specific PKD2 DNA construct containing a protein coding sequence, a kidney-specific enhancer, and a switchable core promoter is constructed. The switchable core promoter is SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 10-SEQ ID NO: 15, or SEQ ID NO: 110. The enhancer region sequence contains a transcription factor binding site that binds a kidney-specific transcription factor. The PKD2 construct is encapsidated in an adeno-associated virus and delivered to cells of a subject. Optionally, the polycystin-2 construct is integrated into a recombinant polynucleotide cassette comprising one or more of a 5′UTR effector region, 3′UTR effector region, codon optimized polycystin-2 sequence, or introns (natural and/or synthetic) for modulating translation of the RNA coding for polycystin-2 before being encapsidated for AAV delivery. The exogenous polycystin-2 protein encoded by the PKD2 construct is transcribed at higher levels in renal tissue than in other non-kidney cell types or non-kidney tissue, including liver tissue. Kidney-specific transcription of the delivered PKD2 transgene results in kidney-specific expression of exogenous polycystin-2.
This example describes transcription of an excitatory neuron-specific exogenous MECP2 in the central nervous system (CNS). A CNS-specific MECP2 DNA construct containing an MECP2 coding sequence, a CNS-specific enhancer region, and a switchable core promoter is constructed. The switchable core promoter is SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 10-SEQ ID NO: 15, or SEQ ID NO: 110. The enhancer region sequence contains a transcription factor binding site that binds a CNS-specific transcription factor. The MECP2 construct is encapsidated in an adeno-associated virus and delivered to cells of a subject. Optionally, the MECP2 construct is integrated into a recombinant polynucleotide cassette comprising one or more of a 5′UTR effector region, 3′UTR effector region, codon optimized progranulin sequence, or introns (natural and/or synthetic) for modulating translation of the RNA coding for MECP2 before being encapsidated for AAV delivery. The exogenous MECP2 encoded by the MECP2 construct is transcribed at higher levels in excitatory neurons than in other non-neuronal cell types or non-CNS tissue, including hepatocytes or liver tissue. CNS-specific transcription of the MECP2 transgene results in CNS-specific expression of the exogenous MECP2.
This example describes transcription of a Rett-specific exogenous MECP2 in cells. A Rett-specific MECP2 DNA construct containing a MECP2 coding sequence, a Rett-specific enhancer region, and a switchable core promoter is constructed. The switchable core promoter is SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 10-SEQ ID NO: 15, or SEQ ID NO: 110. The enhancer region sequence contains a transcription factor binding site that binds a transcription factor that is only expressed in cells having a Rett phenotype (e.g., cells expressing a non-functional MeCP2 protein) and not in normal celles (e.g., cells expressing functional MeCP2 protein). The Rett-specific MECP2 DNA construct is encapsidated in an adeno-associated virus and delivered to cells of a subject. Optionally, the MECP2 construct is integrated into a recombinant polynucleotide cassette comprising one or more of a 5′UTR effector region, 3′UTR effector region, codon optimized progranulin sequence, or introns (natural and/or synthetic) for modulating translation of the RNA coding for MECP2 before being encapsidated for AAV delivery. The exogenous MECP2 encoded by the MECP2 construct is transcribed at higher levels in Rett cells than in normal cells. Rett-specific transcription of the MECP2 transgene results in Rett-specific expression of the exogenous MECP2.
This example describes treatment of frontotemporal dementia in a subject by selectively expressing exogenous progranulin in CNS cells. A CNS-specific progranulin DNA construct containing a progranulin coding sequence, a CNS-specific enhancer region, and a switchable core promoter is constructed. The switchable core promoter is SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 10-SEQ ID NO: 15, or SEQ ID NO: 110. The enhancer region sequence contains a transcription factor binding site that binds a CNS-specific transcription factor. The progranulin construct is encapsidated in an adeno-associated virus (AAV).
The AAV containing the construct is intravenously administered to the subject. Upon administration, the exogenous progranulin encoded by the progranulin construct is transcribed at higher levels in excitatory neurons than in other non-neuronal cell types or non-CNS tissue, including hepatocytes or liver tissue. CNS-specific transcription of the progranulin transgene results in CNS-specific expression of the exogenous progranulin. CNS-specific transcription of the exogenous progranulin alleviates at least one symptom of the frontotemporal dementia or delays the onset of the frontotemporal dementia, thereby treating the frontotemporal dementia in the subject.
This example describes treatment of polycystic kidney disease in a subject by selectively expressing exogenous PKD2 in kidney cells. A kidney-specific PKD2 DNA construct containing a PKD2 coding sequence, a kidney-specific enhancer region, and a switchable core promoter is constructed. The switchable core promoter is SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 10-SEQ ID NO: 15, or SEQ ID NO: 110. The enhancer region sequence contains a transcription factor binding site that binds a kidney-specific transcription factor. The MECP2 construct is encapsidated in an adeno-associated virus (AAV).
The AAV containing the construct is intravenously administered to the subject. Upon administration, the exogenous PKD2 encoded by the MECP2 construct is transcribed at higher levels in kidney cells than in other non-kidney cell types or non-renal tissue, including CNS or liver tissue. Kidney-specific transcription of the PKD2 transgene results in kidney-specific expression of the exogenous PKD2. Kidney-specific transcription of the exogenous PKD2 alleviates at least one symptom of the polycystic kidney disease or cures the polycystic kidney disease, thereby treating the polycystic kidney disease in the subject.
This example describes treatment of Rett syndrome in a subject by selectively expressing exogenous MECP2 in CNS cells. A CNS-specific MECP2 DNA construct containing a MECP2 coding sequence, a CNS-specific enhancer region, and a switchable core promoter is constructed. The switchable core promoter is SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 10-SEQ ID NO: 15, or SEQ ID NO: 110. The enhancer region sequence contains a transcription factor binding site that binds a CNS-specific transcription factor. The MECP2 construct is encapsidated in an adeno-associated virus (AAV).
The AAV containing the construct is intravenously administered to the subject. Upon administration, the exogenous MECP2 encoded by the MECP2 construct is transcribed at higher levels in excitatory neurons than in other non-neuronal cell types or non-CNS tissue, including hepatocytes or liver tissue. CNS-specific transcription of the MECP2 transgene results in CNS-specific expression of the exogenous MECP2. CNS-specific transcription of the exogenous MECP2 alleviates at least one symptom of the Rett syndrome or cures the Rett syndrome, thereby treating the Rett syndrome in the subject.
This example describes treatment of Rett syndrome in a subject by selectively expressing exogenous MECP2 in Rett cells. A Rett-specific MECP2 DNA construct containing a MECP2 coding sequence, a Rett-specific enhancer region, and a switchable core promoter is constructed. The switchable core promoter is SEQ ID NO: 1-SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 10-SEQ ID NO: 15, or SEQ ID NO: 110. The enhancer region sequence contains a transcription factor binding site that binds a transcription factor that is only expressed in cells having a Rett phenotype (e.g., cells expressing a non-functional MeCP2 protein) and not in normal celles (e.g., cells expressing functional MeCP2 protein). The Rett-specific MECP2 DNA construct is encapsidated in an adeno-associated virus (AAV). Optionally, the MECP2 construct is integrated into a recombinant polynucleotide cassette comprising one or more of a 5′UTR effector region, 3′UTR effector region, codon optimized progranulin sequence, or introns (natural and/or synthetic) for modulating translation of the RNA coding for MECP2 before being encapsidated for AAV delivery.
The AAV containing the construct is intravenously administered to the subject. Upon administration, the exogenous MECP2 encoded by the MECP2 construct is transcribed at higher levels in Rett cells than in normal cells. Rett-specific transcription of the MECP2 transgene results in Rett-specific expression of the exogenous MECP2. Rett-specific transcription of the MECP2 transgene results in Rett-specific expression of the exogenous MECP2. Rett-specific transcription of the exogenous MECP2 alleviates at least one symptom of the Rett syndrome or cures the Rett syndrome, thereby treating the Rett syndrome in the subject.
While preferred embodiments of the present invention have been shown and described herein, it will be apparent to those skilled in the art that such embodiments are provided by way of example only. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the invention. It should be understood that various alternatives to the embodiments of the invention described herein may be employed in practicing the invention. It is intended that the following claims define the scope of the invention and that methods and structures within the scope of these claims and their equivalents be covered thereby.
The present application claims the benefit of U.S. Provisional Application No. 63/329,211, entitled “PROMOTER SWITCHES FOR TISSUE-SPECIFIC EXPRESSION,” filed on Apr. 8, 2022, and U.S. Provisional Application No. 63/423,977, entitled “PROMOTER SWITCHES FOR TISSUE-SPECIFIC EXPRESSION,” filed on Nov. 9, 2022, each of which applications are herein incorporated by reference in their entireties for all purposes.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2023/017923 | 4/7/2023 | WO |
Number | Date | Country | |
---|---|---|---|
63329211 | Apr 2022 | US | |
63423977 | Nov 2022 | US |