A Sequence Listing is provided herewith as a text file, “3730037WO1SEQ LIST.txt” created on Sep. 30, 2020 and having a size of 143,894 bytes. The contents of the text file are incorporated by reference herein in their entirety.
Although strides have been made in the treatment of cancer, treatment options for many types of cancer are not optimal. For example, glioblastoma (GBM) is the most common and lethal primary brain tumor in adults. Despite aggressive treatment regimens including surgical resection, radiotherapy, and chemotherapy, the median survival remains only 12-15 months. Glioblastomas are highly diffuse and infiltrate the normal brain, rendering complete resection complicated or impossible. The growth of residual tumor often results in therapy resistance and ultimately death. Additionally, recent genomic studies have revealed that glioblastomas exhibit extensive intratumoral heterogeneity, with various subpopulations of cells harboring distinct mutations and displaying diverse epigenetic states. Similar issues exist for other types of cancer.
Therefore, a need exists to establish innovative treatment strategies that can target and efficiently eliminate cancer cells in vivo irrespective of their mutational and epigenetic profile.
Described herein are methods and compositions for depleting or eliminating cells that involve CRISPR-Cas mediated targeting and cutting of repetitive or highly repetitive sequences in the genomes of cancer cells, also referred to herein as “Genome Shredding.” The methods and compositions result in the fragmentation of a target cell's genome and DNA damage-induced cell death, hence providing a genotype/mutation-agnostic treatment paradigm. For example, by introducing Cas enzymes into cancer cells, an adaptive immune response is stimulated that create a pro-inflammatory/anti-tumor immune microenvironment that further assists tumor clearance and remission. The methods can be performed in vitro and in vivo.
Described herein are methods of shredding the genomes of selected cell types, for example, selected cancer cell types.
Described herein are genomic shredding can be used to selectively deplete or eliminate selected cell types such as specific cancer cell types. For example, a guide RNA (gRNA) or single guide RNA (sgRNA) can be used to recognize to target repetitive or highly repetitive sequences in the target genome, and a Cas nuclease can act as a pair of scissors to cleave genomic DNA. As shown in the Examples, cell depletion is greater when repetitive sequences are targeted than when essential gene sequences are targeted. The specificity of targeting can be increased by use of deactivated Cas proteins that can be activated by selected proteases.
The Cas system can recognize any sequence in the genome that matches 20 bases of a gRNA. However, each gRNA also has or is adjacent to a “Protospacer Adjacent Motif” (PAM), which is invariant for each type of Cas protein, because the PAM binds directly to the Cas protein. See Doudna et al., Science 346(6213): 1077, 1258096 (2014); and Jinek et al., Science 337:816-21 (2012). Hence, the guide RNAs can have a PAM site sequence that can be bound by a Cas protein.
When the Cas system was first described for Cas9, with a “NGG” PAM site, the PAM was somewhat limiting in that it required a GG in the right orientation to the site to be targeted. Different Cas9 species have now been described with different PAM sites. See Jinek et al., Science 337:816-21 (2012); Ran et al., Nature 520:186-91 (2015); and Zetsche et al., Cell 163:759-71 (2015). In addition, mutations in the PAM recognition domain (Table 1) have increased the diversity of PAM sites for SpCas9 and SaCas9. See Kleinstiver et al., Nat Biotechnol 33:1293-1298 (2015); and Kleinstiver et al., Nature 523:481-5 (2015).
Table 1 summarizes information about PAM sites that can be used with the guide RNAs.
Some examples of the specific guide RNA sequences provided herein are shown below in Table 2.
The specific guide RNA sequences can also be selected from the sequences of highly amplified loci that can be present in particular types of cancer cells. Such highly amplified loci are useful for in vivo targeting of cancer cells without killing other cells. For example, the EGFR, PDGFRA, MDM2, CDK4, or combinations thereof loci can be amplified in certain glioblastomas, and sgRNA guide RNA sequences can be selected from such EGFR, PDGFRA, MDM2, and/or CDK4 sequences.
There are a number of different types of nucleases and systems that can be used for gene shredding. The nuclease employed can in some cases be any DNA binding protein can complex with a selected guide RNA and has nuclease activity. Examples of nuclease include Streptococcus pyogenes Cas (SpCas9) nucleases, Staphylococcus aureus Cas9 (SpCas9) nucleases, Francisella novicida Cas2 (FnCas2, also called dFnCpf1) nucleases, or any combination thereof. The CRISPR-Cas systems are generally the most widely used. In some cases, the nuclease is a Cas protein. The term “protein” is used with reference to the nuclease to embrace a deactivated nuclease and an active nuclease.
CRISPR-Cas systems are generally divided into two classes. The class 1 system contains types I, III and IV, and the class 2 system contains types II, V, and VI. The class 1 CRISPR-Cas system uses a complex of several Cas proteins, whereas the class 2 system only uses a single Cas protein with multiple domains. The class 2 CRISPR-Cas system is usually preferable for gene-engineering applications because of its simplicity and ease of use.
A variety of Cas proteins can be employed in the methods described herein. Three species that have been best characterized are provided as examples. The most commonly used Cas protein is a Streptococcus pyogenes Cas9, (SpCas9). More recently described forms of Cas include Staphylococcus aureus Cas9 (SaCas9) and Francisella novicida Cas2 (FnCas2, also called FnCpf1). Jinek et al., Science 337:816-21 (2012); Qi et al., Cell 152:1173-83 (2013); Ran et al., Nature 520:186-91 (2015); Zetsche et al., Cell 163:759-71 (2015).
One example of an amino acid sequence for Streptococcus pyogenes Cas9 (SpCas9) nuclease is provided below (SEQ ID NO:38).
A cDNA that encodes the Streptococcus pyogenes Cas9 (SpCas9) is provided below (SEQ ID NO:39).
An amino acid sequence for a Francisella novicida Cas2 (FnCas2, also called FnCpf1) is shown below (SEQ ID NO:40).
A cDNA that encodes the foregoing Francisella novicida Cas2 (FnCas2, also called dFnCpf1) polypeptide is shown below (SEQ ID NO:41).
The Cas proteins can be modified to improve their utility. For example, one Cas protein that can be used is the SpyCas9 amino acid sequence with a nuclear localization sequence (pCF823 vector; Streptococcus pyogenes Cas9-NLS) shown below as SEQ ID NO:42.
Another Cas protein that can be used is the SauCas9 amino acid sequence with a nuclear localization sequence (pCF825 vector; NLS-Staphylococcus aureus Cas9-NLS) shown below as SEQ ID NO:43.
In some cases, the Cas protein is circularly permuted. Circularly permutation involves removal and in-frame fusion of a N-terminal portion of a selected Cas protein downstream of the selected Cas protein's C-terminus (as is shown in
For example, one circularly permuted Cas protein that can be used is the Cas9-CP-199 circular permutant amino acid sequence (CP2, NLS-Cas9-CP-199-NLS, QLFEE|NPINA) shown below as SEQ ID NO:44.
As shown, the original N-terminal amino acids (MDKK) are now at position 1202 of the SEQ ID NO:44 Cas9-CP-199 circular permutant.
Another Cas protein that can be used is the Cas9-CP-230 circular permutant amino acid sequence (CP3, NLS-Cas9-CP-230-NLS, cleavage at LIAQL|PGEKK) shown below as SEQ ID NO:45.
Another Cas protein that can be used is the Cas9-CP-1010 circular permutant amino acid sequence (CP6, NLS-Cas9-CP-1010-NLS, cleavage at ESEFV|YGDYK) shown below as SEQ ID NO:46.
Another Cas protein that can be used is the Cas9-CP-1029 circular permutant amino acid sequence (CP9, NLS-Cas9-CP-1029-NLS, cleavage at KSEQE|IGKAT) shown below as SEQ ID NO:47.
Another Cas protein that can be used is the Cas9-CP-1249 circular permutant amino acid sequence (CP15, NLS-Cas9-CP-1249-NLS, cleavage at KLKGS|PEDNE) shown below as SEQ ID NO:48.
Another Cas protein that can be used is the Cas9-CP-1282 circular permutant amino acid sequence (CP16, NLS-Cas9-CP-1282-NLS, cleavage at SKRVI|LADAN), shown below as SEQ ID NO:49.
Another Cas protein that can be used is the ProCas9 amino acid sequence (pCF712 ProCas9-Flavi vector; NLS-Flavivirus protease-sensitive caged ProCas9-NLS) shown below as SEQ ID NO:50.
In some cases, the protein is or is encoded by any one of SEQ ID NO: 38-50. In some embodiments, the protein or nucleic acid has about 70%, about 75%, about 80%, about 85%, about 90%, about 95%, about 96%, about 97%, about 98%, about 99%, or more sequence identity to SEQ ID NO: 38-50.
The guide RNAs and/or proteins can be locally administered or systemically delivered. There are different ways to deliver guide RNAs and Cas proteins. The first approach is to use a vector-based CRISPR-Cas9 system encoding the Cas protein and guide RNA (e.g., sgRNA) from the same vector, thus avoiding multiple transfections or transductions of different components. The second is to deliver the mixture of the Cas9 protein mRNA and the sgRNA, and the third strategy is to deliver the mixture of the Cas9 protein and the sgRNA.
In some cases, the guide RNAs can be delivered to cells or administered to subjects in the form of an expression cassette or vector that can express one or more of the guide RNAs. Cas proteins can also be delivered to cells or administered to the subjects in the form of an expression cassette or vector that can express one or more Cas proteins. The Cas nucleases (e.g. as proteins) can also be combined with their respective gRNAs and delivered as RNA-protein complexes (RNPs). Hence, the RNPs can be pre-assembled outside of the cell and introduced into the cell.
The guide RNAs and/or the Cas proteins/nucleases can include a targeting agent that can restricts the activity of the guide RNAs/nuclease complex to specific targeted cell types (e.g., to specific cancer cell types). The targeting agent can be a protease that is expressed and/or is functional only in the targeted cell type, where the protease activates the Cas protein to have nuclease activity. The targeting agent can be a guide RNA that recognizes only cellular sequences that are unique to the targeted cells. The targeting agent can also be a sequence that localizes a protein within a particular cell type. The targeting agent can, for example, be an antibody or other binding agent that specifically binds to specific cancer cell types and that facilitates delivery of the guide RNAs and the Cas protein (or vector(s) encoding the guide RNAs and the Cas protein/nuclease) to specific targeted cell types.
When the targeting agent is a target cell protease that is functional only in the targeted cell type, the guide RNAs and the Cas protein can be systemically administered. However, in some cases, local delivery may facilitate more rapid uptake and may help avoid non-targeted cellular injury. The target cell protease activates the Cas protein only in the targeted cells (e.g., the targeted cancer cells). The Cas protein can have a modified structure such as the Cas9 circular permutants or ProCas9 enzymes described in the Examples (see also Oakes, Fellmann, et al., Cell 176: 254-267 (2019), which is incorporated by reference herein in its entirety). Such Cas9 circular permutants or ProCas9 enzymes are only activated when cleaved by particular proteases, for example, one or more proteases that are unique to specific cancer cell types. The Cas9 circular permutants or ProCas9 enzymes are therefore selectively activated in presence of a matching cell type specific protease such as a cancer cell specific protease.
Examples of proteases that can activate Cas9 circular permutants include serine proteases, matrix metalloproteinases, aspartic proteases, cysteine proteases, asparaginyl proteases, viral proteases, bacterial proteases, and proteases expressed in a tissue-specific or cell-specific manner. Examples of proteases that can be used also include those listed, for example, in Table 4.
When the targeting agent is a guide RNA that recognizes only cellular sequences that are unique to the targeted cells, the guide RNAs and Cas protein can be systemically delivered. However, in some cases, local delivery may facilitate more rapid uptake and may help avoid non-targeted cellular injury. For example, the guide RNAs can recognize target endogenous cellular sequences that are specific and/or more common in cancer cells compared to the non-cancer cells. Such cancer-cell specific sequences can include specific (somatic) repeat expansions, loci showing cancer-specific copy number amplifications, and/or other repeat sequences that only occur in cancer cells (e.g. due to viral integrations, chromosomal fusion, chromosomal breakpoints, specific somatic mutations, hypermutations following primary treatment, etc.). In such cases, the guide RNAs will only activate the Cas protein in the cell types that have the target endogenous cellular sequences.
Targeting agents that localize a protein (or other molecule) within a cell can, for example, be nuclear localization signal (NLS). Such a nuclear localization sequence has an amino acid sequence that ‘tags’ a protein for import into the cell nucleus by nuclear transport. Typically, this signal consists of one or more short sequences of positively charged lysines or arginines exposed on the protein surface. The nuclear localization sequences can be classified as either monopartite or bipartite. The major structural differences between the two is that the two basic amino acid clusters in bipartite NLSs are separated by a relatively short spacer sequence (hence bipartite—2 parts), while monopartite NLSs are not. The first nuclear localization sequence to be discovered was the sequence PKKKRKV (SEQ ID NO:81) in the SV40 Large T-antigen (a monopartite NLS) (Kalderon et al. Cell. 39: 499-509 (1984)). The NLS of nucleoplasmin, KR[PAATKKAGQA]KKKK (SEQ ID NO:82), is a prototypical bipartite signal: two clusters of basic amino acids, separated by a spacer of about 10 amino acids. Both are recognized by importin α. Importin α contains a bipartite NLS itself, which is specifically recognized by importin β. The importin β may be the actual import mediator.
A comparison of the nuclear localization efficiencies of eGFP fused NLSs of SV40 Large T-Antigen, nucleoplasmin (AVKRPAATKKAGQAKKKKLD, SEQ ID NO:83), EGL-13 (MSRRRKANPTKLSENAKKLAKEVEN, SEQ ID NO:84), c-Myc (PAAKRVKLD, SEQ ID NO:85) and TUS-protein (KLKIKRPVK SEQ ID NO:86) indicated that the c-Myc NLS has higher nuclear localization efficiency compared to that of SV40 NLS (Ray et al., Bioconjug. Chem. 26 (6): 1004-7 (2015)).
When a targeting agent is used that specifically binds to specific cancer cell types. The targeting agent can facilitate delivery of the guide RNAs and the Cas protein (or vector(s) encoding the guide RNAs and the Cas protein) to specific targeted cell types, the combination of the binding agent, the guide RNA(s), and the Cas protein/nuclease (or one or more vectors encoding the guide RNA(s) and the Cas protein/nuclease) can be administered systemically. However, in some cases, local delivery may facilitate more rapid uptake and may help avoid non-targeted cellular injury. The binding agent, the guide RNAs, and the Cas protein/nuclease (or vector(s) encoding the guide RNAs and the Cas protein/nuclease) can be incorporated within a carrier that displays the binding agent. Such a carrier can protect the guide RNAs and the nuclease (or vector(s) encoding the guide RNAs and the Cas protein/nuclease) from degradation and can also protect non-targeted tissues from off-target genomic shredding.
Targeted delivery of the Cas-sgRNA complex to specific cancer cells can include targeted Cas-sgRNA ribonucleoprotein (RNP) delivery using targeting or binding agents that are coupled to the Cas protein or sgRNA; targeted delivery of expression vector(s) encoding the Cas protein/nuclease and/or the gRNA, or a combination thereof. The binding (or targeting) agent can be selective viral vectors, viral particles, or virus like particles (VLPs); or potentially delivery vehicles that are targeted specifically to cancer cells; or nanoparticles that are targeted to cancer cells; or lipid carriers that are targeted to cancer cells. Such nanoparticles, or lipid carriers (e.g., liposomes) can include a binding agent that binds to the targeted cells.
The binding agent can specifically recognize and specifically bind to a cancer marker. A “cancer marker” is a molecule that is differentially expressed or processed in cancer, for example, on a cancer cell or in the cancer milieu. Exemplary cancer markers are cell surface proteins such as cancer cell adhesion molecules, cancer cell receptors, intracellular receptors, hormones, and molecules such as proteases that are secreted by cells into the cancer milieu. Examples include programmed cell death 1 (PD-1; also called CD279), C type Lectin Like molecule 1 (CLL-1), interleukin-1 receptor accessory protein (IL1-RAP, aka IL-1R3). Markers for specific cancers can include CD45 for acute myeloid leukemia, CD34+CD38− for acute myeloid leukemia cancer stem cells, MUC1 expression on colon and colorectal cancers, bombesin receptors in lung cancer, S100A10 protein as a renal cancer marker, and prostate specific membrane antigen (PSMA) on prostate cancer.
The guide RNAs and Cas proteins/nucleases can be recombinantly expressed in the cells. The guide RNAs and Cas protein/nucleases can be introduced in form of a nucleic acid molecules encoding the guide RNAs and/or Cas protein/nucleases. The nucleic acid molecules encoding the guide RNAs and/or Cas protein proteins can be provided in expression cassettes or expression vectors.
The expression cassettes can be within vectors. Vectors can, for example, be expression vectors such as viruses or other vectors that is readily taken up by the cells. Examples of vectors that can be used include, for example, adeno-associated virus (AAV) gene transfer vectors, lentiviral vectors, retroviral vectors, herpes virus vectors, e.g., cytomegalovirus vectors, herpes simplex virus vectors, varicella zoster virus vectors, adenovirus vectors, e.g., helper-dependent adenovirus vectors, adenovirus-AAV hybrids, rabies virus vectors, vesicular stomatitis virus (VSV) vectors, coronavirus vectors, poxvirus vectors and the like. Non-viral vectors may be employed to deliver the expression vectors, e.g., liposomes, nanoparticles, microparticles, lipoplexes, polyplexes, nanotubes, and the like. In one embodiment, two or more expression vectors are administered, for instance, each encoding a distinct guide RNA, a distinct Cas protein, or a combination thereof.
The expression cassettes or expression vectors include promoter sequences that are operably linked to the nucleic acid segment encoding the guide RNAs, Cas proteins, or combinations thereof. The promoter can be heterologous to the nucleic acid segment that includes a guide RNA, a Cas protein, or a combination thereof.
As used herein, the term “heterologous” when used in reference to an expression cassette, expression vector, regulatory sequence, promoter, or nucleic acid refers to an expression cassette, expression vector, regulatory sequence, or nucleic acid that has been manipulated in some way. For example, a heterologous promoter can be a promoter that is not naturally linked to a nucleic acid segment of interest, or that has been introduced into cells by cell transformation procedures. A heterologous nucleic acid or promoter also includes a nucleic acid or promoter that is native to an organism but that has been altered in some way (e.g., placed in a different chromosomal location, mutated, added in multiple copies, linked to a non-native promoter or enhancer sequence, etc.).
Heterologous nucleic acids may comprise sequences that comprise cDNA forms; the cDNA sequences may be expressed in either a sense (to produce mRNA) or anti-sense orientation (to produce an anti-sense RNA transcript that is complementary to the mRNA transcript). Heterologous coding regions can be distinguished from endogenous coding regions, for example, when the heterologous coding regions are joined to nucleotide sequences comprising regulatory elements such as promoters that are not found naturally associated with the coding region, or when the heterologous coding regions are associated with portions of a chromosome not found in nature (e.g., genes expressed in loci where the protein encoded by the coding region is not normally expressed). Similarly, heterologous promoters can be promoters that at linked to a coding region to which they are not linked in nature.
Methods for ensuring expression of a functional guide RNA, Cas protein, or combinations thereof can involve expression from a transgene, expression cassette, or expression vector. For example, the nucleic acid segments encoding the selected guide RNAs, or combinations thereof can be present in a vector, such as for example a plasmid, cosmid, virus, bacteriophage or another vector available for genetic engineering. The coding sequences inserted in the vector can be synthesized by standard methods or isolated from natural sources. The coding sequences may further be ligated to transcriptional regulatory elements, termination sequences, and/or to other amino acid encoding sequences. Such regulatory sequences can provide initiation of transcription, internal ribosomal entry sites (IRES) (Owens, Proc. Natl. Acad. Sci. USA 98: 1471-1476 (2001)) and optionally regulatory elements ensuring termination of transcription and stabilization of the transcript.
Non-limiting examples for regulatory elements ensuring the initiation of transcription comprise a translation initiation codon, transcriptional enhancers such as e.g. the SV40-enhancer, insulators and/or promoters. The promoter can be a constitutive promoter, and inducible promoter, or a tissue-specific promoter. Examples of promoters that can be used include the cytomegalovirus (CMV) promoter, SV40-promoter, RSV-promoter (Rous sarcoma virus), the lacZ promoter, chicken beta-actin promoter, CAG-promoter (a combination of chicken beta-actin promoter and cytomegalovirus immediate-early enhancer), the gai10 promoter, human elongation factor 1α-promoter, AOX1 promoter, GAL1 promoter CaM-kinase promoter, the lac, trp or tac promoter, the lacUV5 promoter, the Autographa californica multiple nuclear polyhedrosis virus (AcMNPV) polyhedral promoter, or a globin intron in mammalian and other animal cells. Non-limiting examples for regulatory elements ensuring transcription termination include the V40-poly-A site, the tk-poly-A site or the SV40, lacZ or AcMNPV polyhedral polyadenylation signals, which are to be included downstream of the nucleic acid sequence of the invention. Additional regulatory elements may include translational enhancers, Kozak sequences and intervening sequences flanked by donor and acceptor sites for RNA splicing. Moreover, elements such as origin of replication, drug resistance gene or regulators (as part of an inducible promoter) may also be included.
One straightforward approach is to use a vector-based system encoding the Cas protein and guide RNA (e.g., sgRNA) from the same vector, thus avoiding multiple transfections of different components. The second is to deliver the mixture of the Cas9 mRNA and the sgRNA, and the third strategy is to deliver the mixture of the Cas9 protein and the sgRNA.
Also described herein are methods that include administering to a patient or subject:
In some embodiments, the patient or subject suffers from or it is suspected that the patient or subject suffers from a disease or disorder. Such a disease or disorder can be a cell proliferative disease including, but not limited to, one or more leukemias (e.g., acute leukemia, acute lymphocytic leukemia, acute myelocytic leukemia, acute myeloblastic leukemia, acute promyelocytic leukemia, acute myelomonocytic leukemia, acute monocytic leukemia, acute erythroleukemia, chronic leukemia, chronic myelocytic leukemia, chronic lymphocytic leukemia), polycythemia vera, lymphomas (Hodgkin's disease, non-Hodgkin's disease), Waldenstrom's macroglobulinemia, heavy chain disease, and solid tumors such as sarcomas and carcinomas (e.g., fibrosarcoma, myxosarcoma, liposarcoma, chondrosarcoma, osteogenic sarcoma, chordoma, angiosarcoma, endotheliosarcoma, lymphangiosarcoma, lymphangioendothelio sarcoma, synovioma, mesothelioma, Ewing's tumor, leiomyosarcoma, rhabdomyosarcoma, colon carcinoma, pancreatic cancer, breast cancer, ovarian cancer, prostate cancer, squamous cell carcinoma, basal cell carcinoma, adenocarcinoma, sweat gland carcinoma, sebaceous gland carcinoma, papillary carcinoma, papillary adenocarcinomas, cystadenocarcinoma, medullary carcinoma, bronchogenic carcinoma, renal cell carcinoma, hepatoma, bile duct carcinoma, choriocarcinoma, seminoma, embryonal carcinoma, Wilms tumor, cervical cancer, uterine cancer, testicular cancer, lung carcinoma, small cell lung carcinoma, bladder carcinoma, epithelial carcinoma, glioma, astrocytoma, medulloblastoma, craniopharyngioma, ependymoma, pinealoma, hemangioblastoma, acoustic neuroma, oligodenroglioma, schwannoma, meningioma, melanoma, neuroblastoma, and retinoblastoma), or a combination thereof.
For example, in some case the disease or disorder is a glioblastoma.
The methods, compositions, and/or kits described herein can reduce the incidence or progression of such diseases by 1% or more, 2% or more, 3% or more, 5% or more, 7% or more, 10% or more, 15% or more, 20% or more, 25% or more, 30% or more, 40% or more, or 50% or more compared to a control. Such a control can be the initial frequency or previous rate of progression of the disease of the subject. The control can also be an average frequency or rate of progression of the disease. For example, when treating cancer, the compositions and/or methods described herein can reduce tumor volume in the treated subject by 1% or more, 2% or more, 3% or more, 5% or more, 7% or more, 10% or more, 15% or more, 20% or more, 25% or more, 30% or more, 40% or more, or 50% or more compared to a control. Such a control can be the initial tumor volume. In some cases, the compositions and/or methods described herein can reduce the incidence or progression of such diseases by at least 2-fold, or at least 3-fold, or at least 5-fold, or at least 10-fold compared to a control.
The disclosed methods of treatment can be accomplished via any mode of administration for therapeutic agents. These modes include systemic or local administration such as oral, nasal, parenteral, transdermal, subcutaneous, vaginal, buccal, rectal or topical administration modes.
Guide RNAs, Cas proteins, or a combination thereof can be administered to subjects. Expression systems that include one or more expression cassettes or expression vectors that can express the guide RNAs, the Cas proteins, or a combination thereof can be administered to subjects. The expression cassettes, expression vectors, and cells are administered in a manner that permits them to be incorporated into, graft or migrate to a specific tissue site, or to specific cell types.
Depending on the intended mode of administration, the disclosed compositions can be in solid, semi-solid or liquid dosage form, such as, for example, injectables, tablets, suppositories, pills, time-release capsules, elixirs, tinctures, emulsions, syrups, powders, liquids, suspensions, or the like, sometimes in unit dosages and consistent with conventional pharmaceutical practices. Likewise, the compositions can also be administered in intravenous (both bolus and infusion), intraperitoneal, subcutaneous or intramuscular form, and all using forms well known to those skilled in the pharmaceutical arts.
For therapy, expression systems that include one or more expression cassettes or expression vectors can be administered locally or systemically. The expression systems are administered in a manner that permits them to be incorporated into, graft, migrate to a specific tissue site, or migrate to specific cell types. Administration can be by injection, catheter, implantable device, or the like. The expression cassettes, expression vectors, and cells can be administered in any physiologically acceptable excipient or carrier that does not adversely affect the subject. For example, the expression cassettes, expression vectors, and cells can be administered intravenously.
Methods of administering the guide RNAs, Cas proteins, expression systems, or combinations thereof to subjects, particularly human subjects, include injection or implantation of the guide RNAs, Cas proteins, expression systems, or combinations thereof into target sites within a delivery device which facilitates their introduction, uptake, incorporation, targeting, or implantation. Such delivery devices include tubes, e.g., catheters, for introducing cells, expression vectors, and fluids into the body of a recipient subject. The tubes can additionally include a needle, e.g., a syringe, through which the cells of the invention can be introduced into the subject at a desired location. Multiple injections may be made using this procedure.
As used herein, the term “solution” includes a carrier or diluent in which the expression cassettes, expression vectors, and cells of the invention remain viable. Carriers and diluents that can be used include saline, aqueous buffer solutions, solvents and/or dispersion media. The use of such carriers and diluents are available in the art. The solution is preferably sterile and fluid to the extent that easy syringability exists.
The administering the guide RNAs, Cas proteins, expression systems, or combinations thereof can also be embedded in a support matrix. Suitable ingredients include targeting agents, matrix proteins, carriers that support or promote the incorporation of the guide RNAs, Cas proteins, expression systems, or combinations thereof. In another embodiment, the composition may include physiologically acceptable matrix scaffolds. Such physiologically acceptable matrix scaffolds can be resorbable and/or biodegradable.
Liquid, particularly injectable, compositions can, for example, be prepared by dissolution, dispersion, etc. For example, the guide RNAs, Cas proteins, expression systems, or combinations thereof can be dissolved in or mixed with a pharmaceutically acceptable solvent such as, for example, water, saline, aqueous dextrose, glycerol, ethanol, and the like, to thereby form an injectable isotonic solution or suspension.
Carriers, liposomes, nanoparticles, proteins such as albumin, chylomicron particles, or serum proteins can be used to stabilize the guide RNAs, Cas proteins, expression systems, or combinations thereof. Such carriers can also include or display a targeting agent to facilitate delivery to a specific cell type.
The disclosed guide RNAs, Cas proteins, expression systems, or combinations thereof can also be administered in the form of liposome delivery systems, such as small unilamellar vesicles, large unilamellar vesicles and multilamellar vesicles. Liposomes can be formed from a variety of phospholipids, containing cholesterol, stearylamine or phosphatidylcholines. In some embodiments, a film of lipid components is hydrated with an aqueous solution of drug to a form lipid layer encapsulating the pathway inhibitor and/or modulator of glucose metabolism, as described in U.S. Pat. No. 5,262,564 which is hereby incorporated by reference in its entirety.
Disclosed pharmaceutical compositions can also be delivered by the use of monoclonal antibodies as individual carriers to which the guide RNAs, Cas proteins, expression systems, or combinations thereof are coupled. For example, the monoclonal antibodies can be specific for a selected cell marker, such as a cell surface protein that is unique to a selected target cell. The guide RNAs, Cas proteins, expression systems, or combinations thereof can also be coupled with soluble polymers as targetable drug carriers. Such polymers can include polyvinylpyrrolidone, pyran copolymer, poly(hydroxypropyl)methacrylamide-phenol, poly(hydroxyethyl)-aspanamide phenol, or poly(ethyleneoxide)-polylysine substituted with palmitoyl residues. Furthermore, the guide RNAs, Cas proteins, expression systems, or combinations thereof can be coupled to a class of biodegradable polymers useful in achieving controlled release of a drug, for example, polylactic acid, polyepsilon caprolactone, polyhydroxy butyric acid, polyorthoesters, polyacetals, polydihydropyrans, polycyanoacrylates and cross-linked or amphipathic block copolymers of hydrogels.
Parental injectable administration is generally used for subcutaneous, intramuscular or intravenous injections and infusions. Injectables can be prepared in conventional forms, either as liquid solutions or suspensions or solid forms suitable for dissolving in liquid prior to injection.
Pharmaceutical compositions can be prepared according to mixing, granulating or coating methods, and the compositions can contain from about 0.1% to about 99%, from about 5% to about 90%, or from about 1% to about 20% of guide RNAs, Cas proteins, expression systems, or combinations thereof by weight or volume.
The dosage regimen is selected in accordance with a variety of factors including type, species, age, weight, sex and medical condition of the subject; the severity of the condition to be treated; the route of administration; the renal or hepatic function of the subject; and the particular guide RNAs, Cas proteins, expression systems, or combinations thereof employed. A physician or veterinarian of ordinary skill in the art can readily determine and prescribe the effective amount of the guide RNAs, Cas proteins, expression systems, or combinations thereof required to prevent, counter or arrest the progress of the disease or disorder.
The guide RNAs, Cas proteins, expression systems, or combination thereof may be administered in a composition as a single dose, in multiple doses, in a continuous or intermittent manner, depending, for example, upon the recipient's physiological condition, whether the purpose of the administration is for more sustained therapeutic purposes, and other factors known to skilled practitioners. The administration of the compositions of the invention may be provided as a single dose, or essentially continuous over a preselected period of time, or it may be in a series of spaced doses. Both local and systemic administration is contemplated.
In some cases, effective dosage amounts of the guide RNAs, Cas proteins, expression systems, or combinations thereof when used for the indicated effects, range from about 0.5 mg to about 5000 mg as needed to treat the disease or disorder. Compositions for in vivo or in vitro use can contain about 0.5, 5, 20, 50, 75, 100, 150, 250, 500, 750, 1000, 1250, 2500, 3500, or 5000 mg of the guide RNAs, Cas proteins, expression systems, or combinations thereof, or, in a range of from one amount to another amount in the list of doses.
Hence, the disclosure provides a pharmaceutical composition that include any of the guide RNAs, Cas proteins, expression systems, or combinations thereof described herein.
The compositions can also contain other ingredients such as chemotherapeutic agents, anti-viral agents, antibacterial agents, antimicrobial agents and/or preservatives. Examples of additional therapeutic agents that may be used include, but are not limited to: anti-PD-L1 antibodies, alkylating agents, such as nitrogen mustards, alkyl sulfonates, nitrosoureas, ethylenimines, and triazenes; antimetabolites, such as folate antagonists, purine analogues, and pyrimidine analogues; antibiotics, such as anthracyclines, bleomycins, mitomycin, dactinomycin, and plicamycin; enzymes, such as L-asparaginase; farnesyl-protein transferase inhibitors; hormonal agents, such as glucocorticoids, estrogens/antiestrogens, androgens/antiandrogens, progestins, and luteinizing hormone-releasing hormone anatagonists, octreotide acetate; microtubule-disruptor agents, such as ecteinascidins or their analogs and derivatives; microtubule-stabilizing agents such as paclitaxel (Taxol®), nab-paclitaxel, docetaxel (Taxotere®), and epothilones A-F or their analogs or derivatives; plant-derived products, such as vinca alkaloids, epipodophyllotoxins, taxanes; and topoisomerase inhibitors; prenyl-protein transferase inhibitors; and miscellaneous agents such as, hydroxyurea, procarbazine, mitotane, hexamethylmelamine, platinum coordination complexes such as cisplatin and carboplatin; and other agents used as anti-cancer and cytotoxic agents such as biological response modifiers, growth factors; immune modulators, and monoclonal antibodies. The compositions can also be used in conjunction with radiation therapy.
Also described herein is a kit that includes a packaged composition for controlling, preventing or treating a cell proliferative disease or cell proliferation disease.
In one embodiment, the kit or container holds at least one guide RNA described herein and instructions for using the guide RNA. Such a kit can also include at least one Cas protein. The instructions can include a description for using at least one Cas protein with at least one guide RNA. The guide RNA and the Cas protein can be packaged either separately in different containers, or together in a single container.
In some cases, the kit can include an expression system that includes at least one expression cassette having a promoter operably linked to a nucleic acid segment that includes a guide RNA, a Cas protein, or a combination thereof. The promoter can be heterologous to the nucleic acid segment that includes a guide RNA, a Cas protein, or a combination thereof. The expression system can be encapsulated in a liposome, nanoparticle, or other carrier. Similarly, the kit can include a liposome, nanoparticle, or carrier with at least one guide RNA, at least one Cas protein, or a combination thereof.
The kit can also hold instructions for administering the at least one guide RNA, at least one a Cas protein, or a combination thereof. The kit can also include instructions for administering an expression system that includes at least one expression cassette having a promoter operably linked to a nucleic acid segment that includes a guide RNA, a Cas protein, or a combination thereof.
The kits of the invention can also include containers with tools useful for administering the compositions and maintaining a ketogenic diet as described herein. Such tools include syringes, swabs, catheters, antiseptic solutions, package opening devices, forks, spoons, straws, and the like.
The compositions, kits, and/or methods described herein are useful for treatment of cell proliferative diseases such as cancer or cell-proliferative disorder.
For example, the compositions, kits, and/or methods described herein can reduce the incidence or progression of such diseases by 1% or more, 2% or more, 3% or more, 5% or more, 7% or more, 10% or more, 15% or more, 20% or more, 25% or more, 30% or more, 40% or more, or 50% or more compared to a control. Such a control can be the initial frequency or previous rate of progression of the disease of the subject. The control can also be an average frequency or rate of progression of the disease. For example, when treating cancer, the compositions and/or methods described herein can reduce tumor volume in the treated subject by 1% or more, 2% or more, 3% or more, 5% or more, 7% or more, 10% or more, 15% or more, 20% or more, 25% or more, 30% or more, 40% or more, or 50% or more compared to a control. Such a control can be the initial tumor volume. In some cases, the compositions and/or methods described herein can reduce the incidence or progression of such diseases by at least 2-fold, or at least 3-fold, or at least 5-fold, or at least 10-fold compared to a control.
The following Examples illustrate experiments and experimental results performed during development of the invention.
This Example illustrates some of the materials and methods that were used in the development of the invention.
For in-vivo E. coli screening, fluorescence measurements, and cell proliferation assays, MG1655 was used with a chromosomally integrated and constitutively expressed green fluorescent protein (GFP) and red fluorescent protein (RFP) (Oakes et al., 2014; Qi et al., 2013). EZ-rich defined growth medium (EZ-RDM, Teknoka) was used for all liquid culture assays and plates were made using 2×YT. Plasmids used were based on a 2-plasmid system as reported previously (Oakes et al., 2014, 2016; Qi et al., 2013) containing Cas9 and variants on a selectable chloramphenicol-resistant (CmR) marker and plasmids with sgRNAs and proteases with AmpR markers. The antibiotics were used to verify transformation and to maintain plasmid stocks. No blinding or randomization was done for any of the experiments reported.
All mammalian cell cultures were maintained in a 37° C. incubator, at 5% carbon dioxide. HEK293T (293FT; Thermo Fisher Scientific, #R70007) human kidney cells and derivatives thereof were grown in Dulbecco's Modified Eagle Medium (DMEM; Corning Cellgro, #10-013-CV) supplemented with 10% fetal bovine serum (FBS; Seradigm, #1500-500), and 100 Units/ml penicillin and 100 μg/ml streptomycin (100-Pen-Strep; GIBCO #15140-122). HepG2 human liver cells (ATCC, #HB-8065) and derivatives thereof were cultured in Eagle's Minimum Essential Medium (EMEM; ATCC, #30-2003) supplemented with 10% FBS and 100-Pen-Strep. A549 human lung cells (ATCC, #CCL-185) and derivatives thereof were grown in Ham's F-12K Nutrient Mixture, Kaighn's Modification (F-12K; Corning Cellgro, #10-025-CV) supplemented with 10% FBS and 100-Pen-Strep. HAP1 cells (kind gift from Jan Carette, Stanford) and derivatives thereof were grown in Iscove's Modified Dulbecco's Medium (IMDM; GIBCO #12440-053 or HyClone #SH30228.01) supplemented with 10% FBS and 100-Pen-Strep. HAP1 cells had been derived from the near-haploid chronic myeloid leukemia cell line KBM7 (Carette et al., 2011). Karyotyping analysis demonstrated that most cells (27 of 39) were fully haploid, while a smaller population (9 of 39) was haploid for all chromosomes except chromosome 8, like the parental KBM7 cells. Less than 10% (3 of 39) were diploid for all chromosomes except for chromosome 8, which was tetraploid.
A549 cells were authenticated using short tandem repeat DNA profiling (STR profiling; UC Berkeley Cell Culture/DNA Sequencing facility). STR profiling was carried out by PCR amplification of nine STR loci plus amelogenin (GenePrint 10 System; Promega #B9510), fragment analysis (3730XL DNA Analyzer; Applied Biosystems), comprehensive data analysis (GeneMapper software; Applied Biosystems), and final verification using supplier databases including American Type Culture Collection (ATCC) and Deutsche Sammlung von Mikroorganismen und Zellkulturen (DSMZ).
HEK293T, HEK-RT1, HEK-RT6, HepG2, A549, and HAP1 cells were tested for absence of Mycoplasma contamination (UC Berkeley Cell Culture facility) by fluorescence microscopy of methanol fixed and Hoechst 33258 (Polysciences #09460) stained samples.
U-251 human glioblastoma cells (Sigma-Aldrich, #09063001;
RRID:CVCL_0021), LN-229 human glioblastoma cells (ATCC, #CRL-2611;
RRID:CVCL_0393), T98G human glioblastoma cells (ATCC, #CRL-1690;
RRID:CVCL_0556), LN-18 human glioblastoma cells (ATCC, #CRL-2610;
RRID:CVCL_0392), and derivatives thereof were cultured in Dulbecco's Modified Eagle Medium/Nutrient Mixture F-12 (DMEM/F-12; Gibco, #11320-033 or Corning Cellgro, #10-090-CV) supplemented with 10% FBS and 100-Pen-Strep. U-251, LN-229, T98G, LN-18, and HEK293T cells were authenticated using short tandem repeat DNA profiling (STR profiling; UC Berkeley Cell Culture/DNA Sequencing facility). STR profiling was carried out by PCR amplification of nine STR loci plus amelogenin (GenePrint 10 System; Promega, #B9510), fragment analysis (3730XL DNA Analyzer; Applied Biosystems), comprehensive data analysis (GeneMapper software; Applied Biosystems), and final verification using supplier databases including American Type Culture Collection (ATCC) and Deutsche Sammlung von Mikroorganismen und Zellkulturen (DSMZ). U-251, LN-229, T98G, LN-18, and HEK293T cells were tested for absence of Mycoplasma contamination (UC Berkeley Cell Culture facility) by fluorescence microscopy of methanol fixed and Hoechst 33258 (Polysciences, #09460) stained samples.
The plasmid vector pCF153, expressing the Gag-Pol polyprotein from Friend murine leukemia virus FB29 (GenBank: Z11128.1), was derived from the pGagPol insert and pVSV-G backbone (a kind gift from Philippe Mangeot, Inserm) (Mangeot et al., 2019) to optimize vector size and expression efficiency. The plasmid vector pCF160, expressing the vesicular stomatitis virus glycoprotein (VSV-G), was derived from pVSV-G to optimize the Kozak sequence. The lentiviral vector pCF226, expressing Streptococcus pyogenes Cas9 and a puromycin selection marker, was described previously (Oakes et al., 2019). The lentiviral vector pCF821, encoding a U6-sgRNA cassette and an EF1a driven mNeonGreen marker, was derived from the pCF525 backbone (Watters et al., 2018) and the pCF221-based U6-sgRNA-EF1a-mCherry insert (Oakes et al., 2019). The mCherry fluorescence marker was replaced with a human codon optimized version of mNeonGreen (gBlock, Integrated DNA Technologies). Analogously, the lentiviral vector pCF820, encoding a U6-sgRNA-EF1a-mCherry2 cassette, was derived from pCF821 by replacing the mNeonGreen marker with a human codon optimized version of mCherry2 (gBlock, Integrated DNA Technologies). Of note, both the pCF820 (mCherry2) and pCF821 (mNeonGreen) sgRNA vectors yield higher viral titers than the otherwise comparable sgRNA vector pCF221 (mCherry). The all-in-one lentiviral vector pCF826, featuring a U6-sgRNA and EFS-Cas9-mCherry2 cassette, was derived from pCF820 with an EFS-Cas9 insert from pCF226 (Oakes et al., 2019). The all-in-one retroviral vector pCF841, encoding a U6-sgRNA and EFS-Cas9-mNeonGreen cassette, was derived from pCF826 by replacing mCherry2 with mNeonGreen from pCF821 and by replacing the lentiviral LTR elements (5′ LTR, packaging signal, RRE, cPPT/CTS, self-inactivating 3′ LTR; human immunodeficiency virus-derived) with retroviral LTR elements (5′ LTR, packaging signal, truncated gag, self-inactivating 3′ LTR; murine leukemia virus-derived) from the RT3GEPIR vector (Fellmann et al., 2013).
To begin, a defective Cas9 (dCas9) coding region flanked by BsaI restriction enzyme sites was inserted into a pUC19 based plasmid. A modified transposon with R1 and R2 sites (Jones et al., 2016), containing a chloramphenicol antibiotic resistance marker, p15A origin of replication, TetR and TetR/A promoter, was built using custom oligos and standard molecular biology techniques. The modified transposon was then cleaved from a plasmid using HindIII and gel purified. This linear transposon product was used in overnight in vitro reactions (0.5 molar ratio transposon to 100 ng dCas9-Puc19 plasmid) with 1 mL of MuA Transposase (F-750, Thermo Fisher) in 10 replicates. The transposed DNA was purified and recovered. Plasmids were electroporated into custom made electrocompetent MG1655 E. coli (Oakes et al., 2014) using a BTX Harvard apparatus ECM630 High Throughput Electroporation System and titered on carbenicillin (Carb) and chloramphenicol (CM) to ensure greater than 100× coverage of the library size (13,614). These cells were then outgrown for 12 hours and selected for via Carb and CM markers to ensure growth of transposed members. After isolating transposed plasmids via miniprep (QIAGEN), the original Puc19 backbone was removed via BsaI cleavage and dCas9 proteins transposed with a new plasmid backbone were selected via a 0.7% TAE agarose gel. The linear fragments were then ligated overnight with annealed and phosphorylated oligos coding for GGS linkers encoding 5, 10, 15 and 20 amino acids using a BsaI Golden Gate reaction. Completed libraries were purified, electroporated into the E. coli Mg1655 RFP and GFP screening strain containing an RFP-repressing sgRNA, and the electroporated cells were titered on carbenicillin (Carb) and chloramphenicol (CM) to ensure >5× coverage of the library size (8,216).
Screens were performed in a similar manner to previous reports (Oakes et al., 2014, 2016). Briefly, biological duplicates of Cas9-CP libraries with an RFP guide RNA were transformed (at greater than 5× library size) into E. coli MG1655 with genetically integrated and constitutively expressed GFP and RFP. Cells were grown overnight in EZ-RDM+Carb, CM, and 200 nM Anhydrotetracycline (aTc) inducer. E. coli were then sorted based on gates for RFP repression but not GFP repression, the RFP-repressed, GFP-expressing cells were collected, and the cells were resorted immediately to further enrich for functional Cas9-CPs. Double sorted libraries were then grown out and DNA was collected for sequencing. This DNA was also retransformed onto plates and individual clones were picked for further analysis.
This method was modified from previous Tnseq protocols (e.g., Coradetti et al., 2018). Briefly, the transposed plasmids were sheared to about 300 bp using a S220 Focused-ultrasonicator (Covaris) and purified in between each of the following steps using Agencourt AMPure XP beads (Beckman Coulter). Following shearing, fragments were end-repaired and A-tailed according to NEB manufacturers protocols, and then universal adapters were ligated onto the fragments in a 50 ul quick ligase reaction at room temperature. Fragments from each library were then amplified in a 20-cycle reaction with Indexed Illumina primers that annealed upstream of the new CP start codon and in the universal adaptor. PCR products were cleaned again and analyzed for primer dimers via an Agilent Bioanalyzer DNA 1000 chip. Sequencing was performed at the QB3 Vincent J. Coates Genomics Sequencing Laboratory on a HiSeq2500 in a 100 bp run.
Demultiplexed reads from the HiSeq2500 were assessed using FastQC to check basic quality metrics. Reads for each sample were then trimmed using a custom python script. The trimmed sequences were mapped to the dCas9 nucleic acid sequence using BWA via a custom python wrapper script to determine the amino acid position in dCas9 corresponding to the starting amino acid position in the dCas9-CP permutant. The resulting alignment files were then processed using a custom python script to calculate the abundance of each dCas9-CP permutant in a given library sample. Fold-changes for each dCas9-CP permutant between pre-library and post-library sorts along with significance values for each enrichment were calculated using the DESeq package in R (Anders and Huber, 2010). Due to ambiguity in transposon sequence, insertion site calls were one greater (sites: n+1) than the variants named in Table 3. As per the DESeq guidelines, count data from technical sequencing replicates were summed to create one unique replicate before running through the DESeq pipeline. All relevant sequencing data and Cas9-CP analysis scripts are available in a website at github.com/SavageLab/cpCas9.
E. coli CRISPRi GFP Repression Assay
Assays were performed using methods like those described by Oakes et al. (2016). To measure the ability of a circular permutant to bind to and repress DNA expression, cells were co-transformed with a Cas9 permutant plasmid with aTc inducible promoter and a single guide RNA plasmid for RFP or GFP that, in the case of the ProCas9 assays, also contained the active or inactive proteases on an IPTG-inducible promoter.
Endpoint Assay: Cells were picked in biological triplicate into 96 well plates containing 500 μL EZ MOPS plus Carb and CM. Plates were grown in 37° C. shakers for twelve hours. Next, cells were diluted 1:1000 in 500 μL EZ MOPS plus Carb, CM, IPTG and aTc. Two hundred nM aTc was used to induce Cas9-CPs or ProCas9s and 50 μM IPTG levels was used to induce the proteases in a 2 mL deep well blocks and shaken at 750 rpm at 37° C. After an eight-twelve-hour induction and growth period, 20 μL of cells were added to 80 μL of water and put into a 96-well microplate reader (Tecan M1000) at 37° C. and read immediately. Each well was measured for optical density at 600 nm and GFP or RFP fluorescence. GFP expression was normalized by dividing it with OD600. In the case of the time course assays, 150 μL of the 1:1000 dilution was used and placed into a black walled clear bottom plate (3631-Corning) and directly into the Tecan M1000 for a 130× 600 s kinetic cycle of reading. For E. coli single cell analysis, cells from the endpoint time course were run on a Sony SH800 to capture 100,000 events per sample.
E. coli Genomic Cleavage Assay
Assays were performed as previously described (Oakes et al., 2016) E. coli containing sgRNA plasmids targeting a genomically integrated GFP were made electrocompetent and transformed with 10 ng of the various Cas9-CP plasmids or controls using electroporation. After recovery in 1 mL SOC media for 1 hour, cells were plated in technical triplicate of tenfold serial dilutions onto 2×YT agar plates with antibiotics selection for both plasmids and aTc induction at 200 nM. Plates were grown at 37° C. overnight and CFU/mL was determined. A reduction in CFUs indicated genomic cleavage and cell death.
E. coli Western Blotting
After CRISPRi repression assays for TEV linker Pro-Cas9s, 40 μL of cell culture was pelleted and resuspended in SDS loading buffer for further analysis. SDS samples were loaded into 4%-20% acrylamide gels (BioRad) for electrophoresis. After transfer to membranes (Trans-Blot Turbo-BioRad), blots were washed three times with 1×TBS+0.01% Tween 20, blocked with 5% milk for 1.5 hour and then a 1:1000 of HRP-conjugated DYKDDDDK (SEQ ID NO:51) Tag (Anti-Flag) antibody (Cell Signaling Technology, #2044) was incubated for twenty-four hours at 4° C. Antibodies were washed away with 3×TBST and detected using Pierce ECL Western Blotting Substrate (Thermo Fisher).
NIa protease cleavage sites—i.e., the CP linkers—were identified from previous reports (TuMV, 7 aa; Kim et al., 2016), by using the sequence between the P3 and 6KI genes annotated in NCBI (PPV, PVY, CBSV), or from previously identified Potyvirus protease consensus sequences (Seon Han et al., 2013).
A lentiviral vector referred to as pCF204, expressing a U6 driven sgRNA and an EFS driven Cas9-P2A-Puro cassette, was based on the lenti-CRISPR-V2 plasmid (Sanjana et al., 2014), by replacing the sgRNA with an enhanced Streptococcus pyogenes Cas9 sgRNA scaffold (Chen et al., 2013). The pCF704 and pCF711 lentiviral vectors, expressing a U6-sgRNA and an EFS driven ProCas9 variant, were derived from pCF204 by swapping wild-type Cas9 for the respective ProCas9 variant. The pCF712 and pCF713 vectors were derived from pCF704 and pCF711, respectively, be replacing the EF1a-short promoter (EFS) with the full-length EF1a promoter. The lentiviral vector pCF732 was derived from pCF712 by removal of the ProCas9's nuclear localization sequences (NLSs). Vectors not containing a guide RNA, including pCF226 (Cas9-wt) and pCF730 (ProCas9Flavi), were derived from pCF204 and pCF712, respectively, through KpnI/NheI-based removal of the U6-sgRNA cassette and blunt ligation. The guide RNA-only vector pCF221, encoding a U6-sgRNA cassette and an EF1a driven mCherry marker, is loosely based on the pCF204 backbone and guide RNA cassette. Lentiviral vectors expressing viral proteases, including pCF708 expressing an EF1a driven mTagBFP2-tagged dTEV protease, pCF709 expressing an EF1a driven mTagBFP2-tagged ZIKV NS2B-NS3 protease, and pCF710 expressing an EF1a driven mTagBFP2-tagged WNV protease, are all based on the pCF226 backbone. The GFP-tagged protease vectors pCF736 and pCF738 are derived from pCF708 and pCF710, respectively, by swapping mTagBFP2 with GFP. All vectors were generated using custom oligonucleotides (IDT), gBlocks (IDT), standard cloning methods, and Gibson assembly techniques and reagents (NEB).
Design of sgRNAs
Standard sgRNA sequences were either designed manually, using CRISPR Design (crispr.mit.edu), or using GuideScan (Perez et al., 2017). When editing endogenous genes, sgRNAs were often designed to target evolutionarily conserved regions in the 50 proximal third of the gene of interest. The following sequences were used: sgGFP1 (CCTCGaaCTTCACCTCGGCG, SEQ ID NO:52), sgGFP2 (CaaCTACaa GACCCGCGCCG, SEQ ID NO:53), sgGFP9 (CCGGCaaGCTGCCCGTGCCC, SEQ ID NO:54), sgOR2B6-1 (CATTATTCTAGTGTCACGCC, SEQ ID NO:55), sgOR2B6-2 (GGGTATGaaGTTTGGTGTCC, SEQ ID NO:56), sgPCSK9-4 (CCGGTGGTCACT CTGTATGC, SEQ ID NO:57), sgPuro5 (TGTCGAGCCCGACGCGCGTG, SEQ ID NO:58), sgPuro6 (GCTCGGTGACCCGCTCGATG, SEQ ID NO:59), sgRPA1-1 (ACaaaaGTCAGATCCGTACC, SEQ ID NO:60), sgRPA1-2 (TACCTGGAGCaa CTCCCGAG, SEQ ID NO:62). All sgRNAs were designed with a G preceding the 20-nucleotide guide for better expression from U6 promoters.
To enable rapid CRISPR-Cas controlled cell depletion, through a strategy that was termed Cas-induced death by editing or ‘CIDE’, several sgRNAs (sgCIDEs) were designed directed again highly repetitive sequences in the human genome. In brief, using GuideScan (Perez et al., 2017) the most frequently occurring Streptococcus pyogenes Cas9 sgRNA target sites (50-NGG-30 PAM) were identified in the hg38 assembly (Genome Reference Consortium Human Build 38) of the human genome. Sequences were eliminated from this list that contained extended homomeric stretches (greater than four A/T/C/or G). Two sequences (sgCIDE-4, CGCCTGTaaTCCCAGCACTT (SEQ ID NO:63); sgCIDE-5, CCTCGGCCTCCCaaAGTGCT (SEQ ID NO:64) were empirically validated with slightly over 125,000 target loci. Two additional sequences (sgCIDE-1, TGTaaTCCCAGCACTTTGGG (SEQ ID NO:65); sgCIDE-2, TCCCaaAGT GCTGGGATTAC (SEQ ID NO:66) were empirically validated with approximately 300,000 target loci. All four sgCIDEs led to rapid cell depletion when expressed in presence of active Cas9.
All sgRNA sequences provided in Table 2 were cloned into the pCF820, pCF821, and pCF826 vectors using Esp3I restriction sites and enzymes (New England Biolabs). Because the pCF841 vector contains additional Esp3I sites, U6-sgRNA cassettes were PCR amplified from other vectors and inserted into XhoI/EcoRI-HF digested pCF841 using Gibson assembly (New England Biolabs).
To prevent viral packaging cells from dying when transfecting all-in-one Cas9-sgRNA vectors expressing sgCIDEs, HEK293T human embryonic kidney cells (293FT; Thermo Fisher Scientific, #R70007; RRID:CVCL_6911) were transduced with the lentiviral vector pCF525-AcrIIA4 (Watters et al., 2018, 2020) to stably express the anti-CRISPR protein AcrIIA4, a potent inhibitor of Streptococcus pyogenes Cas9 (Rauch et al., 2017). Transduced cells were selected on Hygromycin B (400 μg/ml; Thermo Fisher Scientific, #10687010) and the resulting cell line termed “CRISPR-Safe” packaging cells.
Lentiviral particles were produced in HEK293T cells using polyethylenimine (PEI; Polysciences #23966) based transfection of plasmids. HEK293T cells were split to reach a confluency of 70%-90% at time of transfection. Lentiviral vectors were co-transfected with the lentiviral packaging plasmid psPAX2 (Addgene #12260) and the VSV-G envelope plasmid pMD2.G (Addgene #12259). Transfection reactions were assembled in reduced serum media (Opti-MEM; GIBCO #31985-070). For lentiviral particle production on 10 cm plates, 8 μg lentiviral vector, 4 μg psPAX2 and 2 μg pMD2.G were mixed in 2 mL Opti-MEM, followed by addition of 42 μg PEI. After 20-30 min incubation at room temperature, the transfection reactions were dispersed over the HEK293T cells. Media was changed 12-hour post-transfection, and virus harvested at 36-48-hour post-transfection. Viral supernatants were filtered using 0.45 μm cellulose acetate or polyethersulfone (PES) membrane filters, diluted in cell culture media if appropriate, and added to target cells. Polybrene (5 μg/ml; Sigma-Aldrich) was supplemented to enhance transduction efficiency, if necessary.
Transduced target cell populations (HEK293T, A549, HAP1, HepG2 and derivatives thereof) were usually selected 24-48-hour post-transduction using puromycin (InvivoGen #ant-pr-1; HEK293T, A549 and HepG2: 1.0 μg/ml, HAP1: 0.5 μg/ml) or hygromycin B (Thermo Fisher Scientific #10687010; 200-400 μg/ml).
In general, to enable high viral titers, both lentiviral and retroviral all-in-one particles encoding Cas9-sgRNA (sgCIDE) were produced using the established CRISPR-Safe packaging cell line described herein. Generally, lentiviral particles were produced in HEK293T cells or derivatives thereof using polyethylenimine (PEI; Polysciences #23966) mediated transfection of plasmids, as previously described (Oakes et al., 2019). In brief, lentiviral transfer vectors were co-transfected with the lentiviral helper plasmid psPAX2 (Addgene #12260) and the VSV-G envelope plasmid pMD2.G (Addgene, #12259). Transfection reactions were assembled in reduced serum media (Opti-MEM; Gibco, #31985-070). For lentiviral particle production on 6-well plates, 1 μg lentiviral vector, 0.5 μg psPAX2 and 0.25 μg pMD2.G were mixed in 0.4 ml Opti-MEM, followed by addition of 5.25 μg PEI. After 20-30 min incubation at room temperature, the transfection reactions were dispersed over the HEK293T cells. Media was changed 12-14 h post-transfection, and virus harvested at 42-48 h post-transfection. Viral supernatants were filtered using 0.45 μm polyethersulfone (PES) membrane filters, diluted in cell culture media as appropriate, and added to target cells. Polybrene (5 μg/ml; Sigma-Aldrich) was supplemented to enhance transduction efficiency, if necessary. Similarly, retroviral particles were also produced in HEK293T cells or derivatives thereof using polyethylenimine (PEI; Polysciences #23966) mediated transfection of plasmids. Specifically, retroviral transfer vectors were co-transfected with the retroviral helper plasmids pCF153 (expressing Gag-Pol from FMLV) and pCF160 (expressing the envelope protein VSV-G). Transfection reactions were assembled in reduced serum media (Opti-MEM; Gibco, #31985-070). For retroviral particle production on 6-well plates, 1 μg retroviral transfer vector, 0.5 μg pCF153 and 0.25 μg pCF160 were mixed in 0.4 ml Opti-MEM, followed by addition of 5.25 μg PEI. After 20-30 min incubation at room temperature, the transfection reactions were dispersed over the HEK293T cells. Media was changed 12-14 h post-transfection, and virus harvested at 42-48 h post-transfection. Viral supernatants were filtered using 0.45 μm polyethersulfone (PES) membrane filters, diluted in cell culture media as appropriate, and added to target cells. Polybrene (5 μg/ml; Sigma-Aldrich) was supplemented to enhance transduction efficiency, if necessary.
To establish a rapid and quantitative way to reliably assess genome editing efficiency from various CRISPR-Cas constructs in mammalian cells, a fluorescence-based reporter assay was built. Assays leveraging editing-based disruption of a constitutively expressed fluorescence marker have been built before. However, such assays show a long detection lag time as the genetic disruption of a locus coding for the fluorescent marker would not immediately lead to a reduction in the fluorescence signal, due to the remaining presence of intact transcripts and protein half-life. To quantify this effect, HEK293T cells were stably transduced with a retroviral vector (LMP-Pten.1524) constitutively expressing GFP (Fellmann et al., 2013), and established monoclonal derivatives. The best performing cell line was termed HEK-LMP-10. When editing this reporter line with a vector (pX459, Addgene #48139) expressing wild-type Streptococcus pyogenes Cas9 and guide RNAs targeting the reporter (sgGFP1, sgGFP2), or a non-targeting control (sgNT), the editing detection lag—defined as the time between introduction of an editing reagent and complete loss of fluorescence signal in edited cells—was up to eight days. Hence, this type of assay is inconvenient for rapid quantification of editing efficiency. Conversely, assays relying on frameshift mutations to activate a fluorescence reporter often require specific guide RNA sequences and only get activated with the faction of edits that lead to the required frameshift, thus introducing a quantification bias.
To overcome this limitation, an inducible genome editing reporter cell line was built that had a fluorescence marker that is not expressed in the default state but can be induced following a defined time of potential genome editing. In this scenario, unedited cells rapidly turn positive, while non-edited cells remain fluorophore negative. Specifically, inducible monoclonal HEK293T-based genome editing reporter cells, referred to as “HEK-RT1,” were established in a two-step procedure. In the first step, puromycin resistant monoclonal HEK-RT3-4 reporter cells were generated (Park et al., 2018). In brief, HEK293T human embryonic kidney cells were transduced at low-copy with the amphotropic pseudotyped RT3GEPIR-Ren.713 retroviral vector (Fellmann et al., 2013), comprising an all-in-one Tet-On system enabling doxycycline-controlled GFP expression. After puromycin (2.0 μg/ml) selection of transduced HEK239 Ts, 36 clones were isolated and individually assessed for i) growth characteristics, ii) homogeneous morphology, iii) sharp fluorescence peaks of doxycycline (1 μg/ml) inducible GFP expression, iv) relatively low fluorescence intensity to favor clones with single-copy reporter integration, and v) high transfectability. HEK-RT3-4 cells are derived from the clone that performed best in these tests.
Since HEK-RT3-4 are puromycin resistant, in the second step, monoclonal HEK-RT1 and analogous sister reporter cell lines were derived by transient transfection of HEK-RT3-4 cells with a pair of vectors encoding Cas9 and guide RNAs targeting puromycin (sgPuro5, sgPuro6), followed by identification of monoclonal derivatives that are puromycin sensitive. In total, eight clones were isolated and individually assessed for i) growth characteristics, ii) homogeneous morphology, iii) doxycycline (1 μg/ml) inducible and reversible GFP fluorescence, and v) puromycin and hygromycin B sensitivity. The monoclonal HEK-RT1 and HEK-RT6 cell lines performed best in these tests and were further evaluated in a doxycycline titration experiment, showing that both reporter lines enable doxycycline concentration-dependent induction of the fluorescence marker in as little as 24-48 hours. The HEK-RT1 cell line was chosen as rapid mammalian genome editing reporter system for all further assays.
When employing the HEK-RT1 genome editing reporter assay to quantify WT Cas9 (Cas9-wt) and ProCas9 variant activity following stable genomic integration, HEK-RT1 reporter cells were transduced with the indicated Cas-wt/ProCas9 and sgRNA lentiviral vectors and selected on puromycin. A guide RNA targeting the GFP fluorescence reporter (sgGFP9) was compared to a non-targeting control (sgNT). A non-targeting control was used in all assays for normalization, in case not all non-edited cells turned GFP positive upon doxycycline treatment, though usual reporter induction rates were above 95%. GFP expression in HEK-RT1 reporter cells was induced for 24-48 hour using doxycycline (1 μg/ml; Sigma-Aldrich), at the indicated days post-editing. Percentages of GFP-positive cells were quantified by flow cytometry (Attune NxT, Thermo Fisher Scientific), routinely acquiring 10,000-30,000 events per sample. When quantifying ProCas9 activation by mTagBFP2-tagged proteases, GFP fluorescence was quantified in mTagBFP2-positive cells. In all cases, editing efficiency was reported as the difference in percentage of GFP-positive cells between samples expressing a non-targeting guide (sgNT) and samples expressing the sgGFP9 guide targeting the GFP reporter. For ProCas9 GFP disruption assays following transfection of the tested components (
Flow cytometry (Attune Nxt Flow Cytometer, Thermo Fisher Scientific) was used to quantify the expression levels of fluorophores (mTagBFP2, GFP/EGFP, mCherry) as well as the percentage of transfected or transduced cells. For the HEK-RT1 genome editing reporter cell line, flow cytometry was used to quantify the percentage of GFP-negative (edited) cells, 24-48 hour after doxycycline (1 μg/mL) treatment to induce GFP expression. Phase contrast and fluorescence microscopy was carried out following standard procedures (EVOS FL Cell Imaging System, Thermo Fisher Scientific), routinely at least 48-hour post-transfection or post-transduction of target cells with fluorophore expressing constructs.
HEK293T (293FT; Thermo Fisher Scientific) were co-transfected with the indicated plasmids expressing Cas9-wt or ProCas9-Flavi and plasmids expressing dTEV or WNV protease. HEK293T cells were split to reach a confluency of 70%-90% at time of transfection. For transfections in 6-well plates, 1 μg Cas9-sgRNA vector and 0.75 μg protease vector (if applicable) were mixed in 0.4 mL Opti-MEM, followed by addition of 5.25 μg polyethylenimine (PEI; Polysciences #23966). After 20-30 min incubation at room temperature, the transfection reactions were dispersed over the HEK293T cells. Media was changed 12-hour post-transfection. At 36-hour post-transfection, HEK293T were washed in ice-cold PBS and scraped from the plates. Cell pellets were lysed in Laemmli buffer (62.5 mMTris-HCl pH 6.8, 10% glycerol, 2% SDS, 5% 2-mercaptoethanol). Equal amounts of protein were separated on 4%-20% Mini-PROTEAN TGX gels (Bio-Rad, #456-1095) and transferred to 0.2 μm PVDF membranes (Bio-Rad, #162-0177). Blots were blocked in 5% milk in TBST 0.1% (TBS+0.01% Tween 20) for 1 hour; all antibodies were incubated in 5% milk in TBST 0.1% at 4° C. overnight; blots were washed in TBST 0.1%. The abundance of b-actin (ACTB) was monitored to ensure equal loading. Immunoblotting was performed using the antibodies: mouse monoclonal Anti-Flag-M2 (Sigma-Aldrich, #1804, clone M2, 1:500; sigmaaldrich.com/content/dam/sigma-aldrich/docs/Sigma/Bulletin/f1804bul.pdf), mouse monoclonal C-Cas9 Anti-SpyCas9 (Sigma-Aldrich, #SAB4200751, clone 10C11-A12, 1:500; sigmaaldrich.com/content/dam/sigma-aldrich/docs/Sigma/Datasheet/10/sab4200751dat.pdf), mouse monoclonal N-Cas9 Anti-SpyCas9 (Novus Biologicals, #NBP2-36440, clone 7A9-3A3, 1:500; novusbio.com/PDFs2/NBP2-36440.pdf), HRP-conjugated mouse monoclonal Anti-Beta-Actin (Santa Cruz Biotechnology, #sc-47778 HRP, clone C4, 1:250; datasheets.scbt.com/sc-47778.pdf), and HRP-conjugated sheep Anti-Mouse (GE Healthcare Amersham ECL, #NXA931; 1:5000; see website es.vwr.com/assetsvc/asset/es_ES/id/9458958/contents). Blots were exposed using Amersham ECL Western Blotting Detection Reagent (GE Healthcare Amersham ECL, #RPN2209) and imaged using a ChemiDoc MP imaging system (Bio-Rad). Protein ladders were used as molecular weight reference (Bio-Rad, #161-0374).
For assessment of CRISPR-Cas programmed cell depletion using guide RNAs targeting an essential gene (RPA1) or sgCIDEs targeting hundreds of thousands of loci within the genome, cells were stably transduced with a lentiviral vector expressing Cas9-wt (pCF226) or ProCas9Flavi (pCF730) and selected on puromycin. Subsequently, these cell lines were further stably transduced with vectors expressing various mCherry-tagged sgRNAs and analyzed as follows: 1) After mixing sgRNA expressing populations with parental cells, the fraction of mCherry-positive cells was quantified over time. Different sgRNAs targeting a neutral gene (sgOR2B6), an essential gene (sgRPA1), >100,000 genomic loci (sgCIDE) and a non-targeting control (sgNT) were compared. 2) Alternatively, the cell lines were partially transduced with lentiviral vectors expressing a GFP-tagged dTEV (pCF736) or WNV (pCF738) protease, and cell depletion quantified by flow cytometry. Depletion of protease-expressing (GFP+) cells was quantified among the sgRNA-positive (mCherry+) population.
Specific statistical tests used are indicated in all cases. Propagation of uncertainty was taken into consideration when reporting data and their uncertainty (standard deviation) as functions of measurement variables. Unless otherwise noted, error bars indicate the standard deviation of triplicates, and significance was assessed by comparing samples to their respective controls using unpaired, two-tailed t tests (alpha=0.05). Genome editing quantification using TIDE was carried out as recommended (Brinkman et al., 2014). In brief, indels ranging from −10 to +10 nucleotides were quantified. Parental cells were used as reference for normalization. When reporting TIDE editing efficiencies, only indels with p values <0.01 in at least one replicate were considered true.
To identify functional Cas9 circular permutants (Cas9-CPs), fold-changes for each dCas9-CP between pre- and post-library sorts along with significance values for each enrichment were calculated. Cas9-CP analysis scripts are available at website github.com/SavageLab/cpCas9, which is incorporated by reference herein in its entirety. All relevant sequencing data have been deposited in the National Institutes of Health (NIH) Sequencing Read Archive (SRA) at website ncbi.nlm.nih.gov/bioproject/PRJNA505363 under ID code 505363, Accession code PRJNA505363.
This Example demonstrates how circular permutation can be used to re-engineer the molecular sequence of Cas9 to both better control its activity and create a more optimal DNA binding scaffold for fusion proteins.
To investigate the topological malleability of Streptococcus pyogenes Cas9 (hereafter Cas9), a random transposon insertion library was generated in vitro by adapting an engineered transposon from Jones et al. (2016) to contain a plasmid backbone, inducible promoter, and stop codon.
Circular permutation (CP) libraries, constructed around dCas9, were screened for function in an E. coli-based repression (i.e., CRISPRi) assay targeting the expression of either RFP or GFP (Qi et al., 2013; Oakes et al., 2014, 2016). In brief, dCas9-CP libraries were targeted to repress RFP expression while GFP was used as a control for cell viability. Functional dCas9-CP library members were isolated through a sequential double-sorting procedure that enriched functional clones 100-fold to 10,000-fold (
The majority of functional clones were found in the 20-amino acid linker library. Deep sequencing of this library was performed to generate an enrichment profile of permutation across Cas9. Seventy-seven sites were identified as highly enriched (>100-fold) following the double sorting procedure (
The isolated Cas9-CPs were next tested for their cleavage activity relative to wild-type (WT) Cas9. Briefly, two variants from each of the three hotspots (specifically, CP sites 199, 230, 1010, 1029, 1249, and 1282) were constructed with a 20-amino acid linker between the original N and C termini and recoded with functional nuclease active sites (Table 3). Testing of these constructs for genomic cleavage and killing activity in E. coli demonstrated that all possessed similar activity as WT Cas9 (
Characterization of the libraries described above revealed that circular permutation is highly sensitive to the linker length connecting the original N and C terminus. PCR analysis of pooled libraries indicated that a linker length of 5 aa or 10 aa was not sufficient to generate Cas9-CP diversity. Conversely, libraries of 15 or 20 aa linkers qualitatively possessed extensive permutable diversity. Therefore, the inventors decided to test the importance of linker length on confirmed sites identified above (
In agreement with the pooled libraries, we found that all Cas9-CPs with linkers of 5 and 10 aa in length were markedly disrupted in activity, while those with longer linkers were active. Notably, activity did not increase with linker length beyond 15 aa (
The sensitivity of CPs to linker length led us to hypothesize that Cas9-CPs could be made into “caged” variants that could switch from an inactive form to an active one upon post-translational modification (
To test the possibility of turning Cas9-CPs into activatable switches using a well-studied protease, the six representative CP variants were engineered to include the 7-amino acid cleavage site (ENLYFQ/S) of the tobacco etch virus (TEV) nuclear inclusion antigen (NIa) protease as the linker sequence (Seon Han et al., 2013). This 7-amino acid linker was able to fully disrupt Cas9-CP activity in the E. coli CRISPRi GFP repression assay (
This Example illustrates that the uncaging mechanism for releasing Cas9-CP activities can be used with a variety of proteases.
The human rhinovirus 3C is responsible for about 30% of cases of the common cold and contains a well-studied protease, human rhinovirus 3C protease (3Cpro), unrelated to that from tobacco etch virus (TEV) (Skern, 2013). The eight-amino acid linker with the TEV recognition site was replaced in the six Cas9-CPs with the linker sequence with the for 3Cpro (LEVLFQ/GP SEQ ID NO:87). The six Cas9-CPs with the 3Cpro linker were then tested for bacterial CRISPRi activity with and without active protease.
Protease-dependent activation of Cas9-CPs was observed, with varying amounts of turn-on in activity, thus demonstrating that the deactivation-reactivation mechanism can be extended to other proteases (
Next, the protease sensing Cas9-CPs (hereafter ProCas9s) were tested on agriculturally and medically relevant viruses.
The Potyvirus proteases from turnip mosaic virus (TuMV), plum pox virus (PPV), potato virus Y (PVY), and cassava brown streak virus (CBSV) were tested, all of which are plant viruses responsible for significant crop losses each year (Seon Han et al., 2013; Tomlinson et al., 2018). The nuclear inclusion antigen (NIa) protease genes from these viruses were also cloned.
These protease constructs were evaluated for co-expression in conjunction with ProCas9s having linkers from a set of proteases of a medically important Flavivirus genus. Briefly, the capsid protein C cleavage sequences from Zika virus (ZIKV), West Nile virus (WNV, Kunjin strain), Dengue virus 2 (DENV2), and yellow fever virus (YFV) (Bera et al., 2007; Kummerer et al., 2013) were used as the CP linker sequence to generate a set of flavivirus-specific ProCas9s. In the viral life cycle, these cleavage sequences are cut by the NS2B-NS3 protease from the respective virus to mature the polyprotein (Kummerer et al., 2013).
Cognate protease cleavage sites (STAR Methods) were used as the CP linker in Cas9-CP199, yielding the respective ProCas9s that were systematically tested against all co-expressed N1a proteases. The following Table 4 shows sequences for the protease-specific linkers used with the Cas9-CP199 protein to provide protease-activated Cas9 activity by the Zika virus (ZIKV), yellow fever virus (YFV), Dengue virus 2 (DENV2), West Nile virus (WNV, Kunjin strain), and Flavi virus (consensus).
CRISPRi experiments revealed a general trend of proteases activating their respective ProCas9 (
Screening of these Flavivirus ProCas9 variants against their cognate proteases revealed a variant—hereafter called Pro-Cas9Flavi—that possesses a WNV linker sequence (KQKKR/GGK, SEQ ID NO:80) and was activated by NS2B-NS3 proteases from both Zika and WNV (
Next, the function of ProCas9s was validated and optimized in eukaryotic cells using a transient transfection system in the HEK293T-based GFP disruption assay (
A small amount of leaky activation (about 5%) was also observed in the absence of protease activity, so the distance between the original N and C termini was tested by progressively shortening by 2, 4, or 6 amino acids to evaluate whether such shortening would reduce unwanted background activity. While removing two amino acids from ProCas9Flavi had no apparent effect, removing six amino acids (ProCas9Flavi-S6) significantly reduced activity levels for nonactive or non-corresponding active proteases while still enabling a response, albeit weaker, to both ZIKV and WNV (corresponding) proteases (
A prerequisite for using activatable genome editors in sensing or molecular recording applications is that they possess low background activity under stable expression conditions. To confirm that ProCas9s function accordingly, lentiviral vectors were built that expressed ProCas9 from either a weak EF1a core promoter (EFS) or strong full-length EF1a promoter, along with single guide RNAs (sgRNAs) driven from a U6 promoter. The lentiviral vectors were tested for ProCas9Flavi and ProCas9Flavi-S6 activity in HEK-RT1 reporter cells (
When measured 6 to 10 days post-transduction, none of the four tested ProCas9 constructs showed any background activity (
TIDE analysis (Brinkman et al., 2014) was used to quantify editing outcome (
An activatable switch for molecular sensing must display repeatable induction upon stimulation. In an initial test, HEK-RT1 reporter lines (
To mimic a viral infection more closely, we next evaluated whether a stably integrated viral vector expressing Flavivirus proteases could also activate ProCas9Flavi enzymes. To generate viral particles, HEK293T packaging cell lines were transfected with dTEV, ZIKV, or WNV protease-encoding lentiviral vectors. Expressing the NS2B-NS3 or NS3 protease is known to be toxic (Ramanathan et al., 2006), and a similar effect was observed with ZIKV and WNV proteases, which led to reduced viral titers and target cell transduction efficiency. Nevertheless, we were able to stably transduce the HEK-RT1-ProCas9 reporter cell lines with protease constructs and followed the effects of dTEV, ZIKV, and WNV protease expression (
To assess the dynamic range of ProCas9Flavi induction, the above experiments were repeated out to 8 days (
As with background activity testing, the activation of ProCas9s by proteases was further validated by targeting the endogenous PCSK9 locus (
Conceptually, the underlying idea of ProCas9s is that they are present in cells in an inactive, or “vigilant,” state due to the linker sterically inhibiting activity (FIG. 4I). The presence of a cognate protease recognizing the peptide linker relieves inhibition through target cleavage, and leads to an “active” ProCas9 composed of two distinct subunits. To explore this hypothesis, HEK239T cells were co-transfected with vectors expressing either Cas9 WT or ProCas9Flavi and the dTEV or WNV protease. Immunoblotting with antibodies for the full-length Cas9 WT and vigilant ProCas9Flavi—as well as both the small (about 29 kDa) and large (about 137 kDa) subunit of active ProCas9Flavi—showed that Cas9 WT and ProCas9Flavi are expressed to comparable extents in the absence of a cognate protease (
A molecular sensor, such as ProCas9, could actuate many types of outputs. One unique effect would be to induce cell death upon sensing viral infection, as a form of altruistic defense. Since activated ProCas9 is capable of inducing DNA double-strand breaks, we sought to identify sgRNAs that could induce rapid cell death. As Flaviviruses replicate rapidly upon target cell infection, such sgRNAs would have to kill their host cells in less time. Targeting essential genes such as the single-stranded DNA binding protein RPA1, which is involved in DNA replication, could be one option. Alternatively, targeting highly repetitive sequences within a cell's genome to induce massive DNA damage and cellular toxicity could be another avenue. Indeed, sgRNAs targeting even only moderately amplified loci have been shown to lead to cell depletion under certain conditions (Wang et al., 2015), independent of whether the sgRNA targets a gene or intergenic region. While these effects have been observed over long assay periods, targeting highly repetitive sequences might provide sufficient DNA damage to trigger rapid cell death.
To compare the two strategies, both HEK293T and HAP1 cells were stably transduced to express WT Cas9 and an sgRNA coupled to an mCherry fluorescence marker (
Cas-induced death by editing or ‘CIDE, as an output constrains the performance of ProCas9. The system remains off to minimize genomic damage yet is vigilant to respond to a stimulus. To develop this protease-induced altruistic defense platform, stable expression of the best CIDE guide RNAs (sgCIDE-2, sgCIDE-4) was assessed in conjunction with a genomically integrated ProCas9Flavi cassette to determine cell viability in the absence of a stimulus (
To investigate the ability of CRISPR-Cas9 to eliminate glioblastoma cells through targeting of repetitive sequence elements in their genomes, ten of the most common repetitive single-guide RNA (sgRNA) target loci in the human genome were identified as 20-mers with adjacent 5′-NGG-3′ protospacer adjacent motifs (PAMs). Single guide RNAs (referred to as sgCIDE RNAs for CRISPR-Cas induced death by editing) were designed to target repetitive or highly repetitive sequences in the target genome. The number of off-target sites was further determined with a Hamming distance (mismatches) of up to three and allowing for NGG or NAG PAMs. Specific examples include, but are not limited to, the following sgCIDE RNAs targeting the human and/or mouse genome shown in Table 2.
The sgCIDEs examined could target about 3,000-300,000 sites per haploid genome. For example, as shown in Table 5 sgCIDEs with SEQ ID Nos: 1-3 could target approximately up to 300,000 sites per haploid genome.
To evaluate cell depletion by genomic shredding, U-251 glioblastoma cells that expressed Cas9 were transduced with a vector coding for mCherry and a single guide RNA targeting a selected repetitive genomic sequence or selected essential genes. After an eight-twelve hours incubation, mCherry expression was measured.
HEK293, HAP1, A549, and U-251 cells were stably transduced with a lentiviral vector (pCF226) to express Cas9 (HEK-pCF226, HAP1-pCF226, A549-pCF226, and U251-pCF226). These cells were also stably transduced to express mCherry fluorescence marker.
HEK-pCF226 cells are cells from the human embryonic kidney HEK293T cell line that express Cas9. HAP1-pCF226 cells are cells derived from the human KBM7 cell line (Carette et al., Ebola virus entry requires the cholesterol transporter Niemann-Pick C1. Nature (2011)) that express Cas9. A549-pCF226 cells are cells from the human lung cancer A549 cell line that express Cas9. U251-pCF226 cells are cells from the human glioblastoma cell line U-251 that express Cas9.
The effect of guide RNA expression on cell viability was assessed using a competitive proliferation assay in which cells expressing a specific sgRNA (Table 2), coupled to mCherry expression from the same vector, were mixed with parental cells expressing only Cas9 WT, and the mCherry-positive population was followed over time. The sgRNAs used targeted a neutral gene (sgOR2B6), an essential gene (sgRPA1), greater, and a non-targeting control (sgNT) were compared
To assess timing and dynamic effects of genome shredding on glioblastoma cells in more detail, fluorescence time-lapse video microscopy was used to monitor Cas9-expressing U-251 cells stably transduced with lentivirus that expressed GFP-coupled sgCIDEs (sgCIDE-1/2/3/6/8/10) or negative controls (sgNT-1/2/3) over seven days. A schematic diagram of this system is shown in
Cell confluency quantification and propidium iodide (PI) staining revealed that sgCIDEs induced growth inhibition starting at day one (1) post-transduction, and cell death started as early as day two. To look at the genomic effects of repetitive loci targeting, DNA from lysed targeted cells was separated on agarose-coated slides. Single-cell analysis of Cas9 expressing U-251 and LN-229 using comet assays showed that the DNA from sgCIDE-1/2/3 expressing cells exhibited very long tails at 24 hours post-transduction compared to control (sgNT-1/2/3). These results indicated that extensive genomic fragmentation had occurred even at this early timepoint (24 hours).
Competitive proliferation assays were performed with Cas9-expressing U251 and LN229 glioblastoma cell lines. Wild type cells not expressing Cas9 were used for normalization. The cell lines were stably transduced with the guide RNAs inducing genome shredding (sgCIDE1-10, Table 2), guide RNAs targeting an essential gene (sgRPA1), or a control non-targeting guide RNA (sgNT). The changes in ratios of sgRNA-transduced cells (mNeonGreen+) were monitored by flow cytometry over seven days.
Cell lines (U-251, LN-18) were stably transduced with a lentiviral vector expressing Cas9 (pCF226) and selected on puromycin (1.0-2.0 μg/ml). Subsequently, Cas9 expressing cell lines were further stably transduced with pairs of lentiviral vectors (pCF221) expressing various mNeonGreen-tagged sgRNAs. Volume of virus was adjusted as appropriate between cell lines to establish similar levels of infectivity, with ˜2× more virus used in LN-18 cells than U-251 cells. At day two post-transduction, sgRNA expressing populations were mixed approximately 80:20 with parental cells and the fraction of mNeonGreen-positive cells was quantified over time by flow cytometry (Attune NxT flow cytometer, Thermo Fisher Scientific). The changes in ratios of sgRNA-transduced cells (mNeonGreen+) were monitored by flow cytometry over seven days.
As illustrated in
Hence CRISPR-Cas genome shredding through targeting of highly repetitive sequences in the genome is a robust strategy for rapid and efficient elimination of cancer cells such as glioblastoma cells. Notably, targeting of repetitive sequences largely surpassed the efficacy of CRISPR-Cas9 methods directed at targeting of a key essential gene, highlighting the power of this approach.
Given the efficiency of genome shredding-based cell elimination, the origin and distribution of repetitive and highly repetitive CRISPR-Cas9 target loci in the genome was examines. To distinguish genome-specific versus general sequences, the inventors compared repetitive element from the human (Homo sapiens, hg38), mouse (Mus musculus, mm10), and chicken (Gallus gallus, galGal6) genomes, and annotated each sequence with over a thousand repeats in either of the three genomes. Genomic mapping of repeat elements demonstrated nearly uniform distribution throughout each genome, with the exception of a few regions that were devoid of repetitive guide RNA targets. When compared to annotated databases, the most common repeat sequences in the human genome mapped to retrotransposons and other mobile genetic elements (MGEs). While these MGE-targeting guide RNAs are species-specific, as is common for retrotransposons, a second class of highly repetitive target loci was represented by repeat expansion motifs. Repeat expansions can accumulate and expand in genomes because of replication errors in regions with specific repeat k-mer motifs. Not surprising due to the simplicity of these motifs, matching repeat expansion targets were identified across all three genomes. Parallel competitive proliferation assays in Cas9 expressing human U-251 glioblastoma, mouse GL261 glioblastoma, and chicken DF-1 fibroblast cells confirmed that repeat expansion targeting pan-vertebrate sgCIDEs rapidly induce depletion of transduced cells independent of their genetic origin.
The alkylating agent temozolomide (TMZ) is the current frontline chemotherapy for GBM but is only effective in cells when promoter methylation of O-6-methylguanine-DNA methyltransferase (MGMT) silences its expression. This is because active MGMT removes the TMZ-added methyl group from the O6 position of guanine, rendering the treatment ineffective. In sensitive glioblastoma cells, TMZ leads to a prolonged G2/M arrest followed by a p53-dependent cell death. This Example illustrates CRISPR-Cas9 genome shredding compared to chemotherapy in TMZ-sensitive and TMZ-resistant glioblastoma cells.
To investigate the speed of cell elimination by either method, Cas9 expressing TMZ-sensitive U-251 and LN-229, and TMZ-resistant T98G and LN-18, glioblastoma cells were treated with TMZ or these cells were transduced with lentiviral vectors expressing sgCIDEs.
Luminescence-based quantification of cell viability over five days showed that lethality observed only in U-251 and LN-229 that were sensitive to TMZ (
The effects of genome shredding on cell cycle progression were then assessed. Cells were treated with TMZ or sgCIDEs for one to five days and then stained with PI after fixation for analysis by flow cytometry. Control DMSO and sgNT-1/2 treatments, as well as guide RNAs targeting an olfactory receptor (sgOR2B6-1/2), showed comparable normal cell cycle profiles in Cas9 expressing U-251, LN-229, T98G, and LN-18 glioblastoma cells. TMZ-sensitive glioblastoma cells treated with TMZ (50 μM or 100 μM) exhibited G2/M arrest with initial increase of the G2 peak, loss of G1, and slow increase of the Sub-G1 (apoptotic) population starting at day two. Increases of the Sub-G1 population was more prominent in TP53-mutant U-251 cells compared LN-229 with wild-type TP53, consistent with previous observations that TP53 status affects resolution of the G2/M arrest. Treatment with guide RNAs targeting the essential gene RPA1 (sgRPA1-2/3) resulted in an accumulation in S-phase starting at day three, accompanied by increase of the Sub-G1 population, in all four glioblastoma cell lines. See
In contrast, genome shredding with sgCIDE-1/2/3/6/8/10 led to a rapid increase of the Sub-G1 population starting at day one post-transduction, combined with a drastic depletion of the G1 peak and slight increase of the S-phase population, in all four tested glioblastoma cell lines. Noteworthy, this change in cell cycle profile was consistent across all six sgCIDEs, for all four tested GBM cell lines independent of MGMT promoter methylation and TERT promoter or TP53 mutational status, indicating a characteristic path to cell death. At day two post-transduction, the Sub-G1 population of sgCIDE transduced samples already represented approximately 20-40% of cells, and by day 3 the Sub-G1 population was 30-60%. See
Together, CRISPR-Cas genome shredding was both more rapid than TMZ at inducing cell death and it was effective independent of the GBM cells' genetic and epigenetic makeup. Hence, genome shredding can be more versatile when addressing intratumoral cellular heterogeneity issues.
Because recurrent tumors develop from cells that escape treatment, either by avoiding exposure, tolerating the effects, or developing resistance, colony formation assays were performed to evaluate the robustness of CRISPR-Cas genome shredding in eliminating target cells.
TMZ-resistant LN-229 cell lines were isolated to determine which types of treatments could overcome such resistance. Cas9 expressing U-251, LN-229, T98G, and LN-18 cells were stably transduced with lentiviral vectors expressing sgNT-1/2 or sgCIDE-1/2/3/6/8/10 (Table 2), and seeded at 100, 1,000, and 10,000 cells per 6-well plate. Control cells were treated with DMSO or TMZ (50 μM).
Crystal violet staining two weeks later revealed that TMZ treatment reduced colony numbers by about two log-scales compared to DMSO in U-251 and LN-229 cells only, while T98G and LN-18 cells were unaffected as expected. Treatment with sgNT-1/2 had little effect on colony formation. Conversely, genome shredding by sgCIDE-1/2/3/6/8/10 expression led to an over three log-scales reduction in colony count across all four tested GBM cell lines. Hence, under the tested conditions, CRISPR-Cas genome shredding was more than 10-fold efficient at eliminating GBM cells compared to TMZ in chemotherapy-sensitive cell lines.
A small percentage of Cas-9 glioblastoma cells appeared to escape genome shredding when transduced with the sgRNA expression cassette shown in
The proof-of-concept studies described above were all carried out with pre-engineered cell lines stably expressing Cas9 and guide RNAs from lentiviral vectors. To assess the therapeutic potential of CRISPR-Cas genome shredding, orthotopic intracranial glioblastoma xenograft models were established that provided local delivery of CRISPR-Cas9 after establishment of tumors. Direct delivery of Cas9-sgRNA ribonucleoprotein (RNP) complexes, rather than viral vectors encoding those components, can reduce toxicities of persistent viral transductions and integrational mutagenesis, but may suffer low efficacy.
To leverage high viral delivery efficiencies, virus-like particles (VLPs) can be used as Cas9 RNP carriers. Hence, a murine leukemia virus (MLV)-based system of VLPs was adopted for local Cas9 RNP delivery (Mangeot et al., Nat. Commun. 10, 45 (2019)). Vector-based improvements in guide RNA and Cas9 expression so that both are expressed in target cells (
Genome shredding efficiency was then assessed in wild-type U-251 and LN-229 glioblastoma cells upon VLP-based delivery of Cas9 and negative control sgNT-1/3 or sgCIDE-1/3. Parental U251 cells (U251-pCF226-pCF821-sgNT-1 #1) and U251 cells that stably expressed AcrIIA4 (pCF525-AcrIIA4) were transduced with all-in-one lentiviral vectors (pCF826) expressing an mCherry-tagged Cas9 and sgCIDE1, sgCIDE2 or control non-targeting sgNT-1 sgRNAs. Viral particles were produced using either standard HEK293T packaging cells or the CRISPR-Safe packaging cell line. Viral titers were assessed by flow cytometry-based quantification of mCherry expression at day two post-transduction.
As illustrated in
All patents and publications referenced or mentioned herein are indicative of the levels of skill of those skilled in the art to which the invention pertains, and each such referenced patent or publication is hereby specifically incorporated by reference to the same extent as if it had been incorporated by reference in its entirety individually or set forth herein in its entirety. Applicants reserve the right to physically incorporate into this specification any and all materials and information from any such cited patents or publications.
The following statements are intended to describe and summarize various embodiments of the invention according to the foregoing description in the specification.
This application claims benefit of priority to the filing of U.S. Provisional Application Ser. No. 62/910,558, filed Oct. 4, 2019, the contents of which are specifically incorporated herein by reference in their entirety.
This invention was made with government support under R00GM118909 awarded by the National Institutes of Health. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2020/053896 | 10/2/2020 | WO |
Number | Date | Country | |
---|---|---|---|
62910558 | Oct 2019 | US |