The contents of the text file submitted electronically herewith are incorporated herein by reference in their entirety: A computer readable format copy of the Sequence Listing (filename: AIBS_005_07WO_ST25.txt; date recorded: Feb. 9, 2018; file size 326 kilobytes)
The present disclosure relates to the fields of stem cell biology, genetics, and genetic engineering. In particular aspects, the present disclosure relates to methods of genetically engineering stem cells to express one or more fluorescently-tagged structural or other proteins. In further embodiments, the methods described herein allow for the generation of genetically-engineered, fluorescently-tagged stem cells, wherein the endogenous functions of the stem cells remain un-altered (e.g., pluripotency and genomic stability). In further embodiments, the methods allow for three-dimensional live cell imaging of intracellular proteins. In further embodiments, the methods allow for use of the cells for screening, observing cellular dysplasia, disease staging, monitoring disease progression or improvement, or cellular stress in response to a test agent
Conventional methods of live-cell protein imaging utilize protein fusion constructs, wherein a detectable marker (e.g., a fluorescent protein) is fused to the protein of interest, transduced or transfected into a cell. As such, these systems result essentially in the production of a cell that overexpresses the transduced protein. Although these systems have enabled the probing and analysis of protein localization and cellular dynamics in a wide range of cell types and assays, they fail to allow for the analysis and characterization of a target protein in an un-altered, endogenous state. For example, fusion constructs often result in unpredictable and artificial expression levels of the tagged protein, either as a result of transient expression of transfected constructs, or as a result of copy number variation with transduced constructs. These realities hinder the interpretation of experiments and in turn the study of pathogenesis and drug discovery.
The limitations of exogenous fusion construct systems are further exacerbated in the context of cells that are difficult to transfect or transduce, such as stem cells. In such cells, variation in the expression level of the construct may be especially problematic, as levels of transduction/transfection efficiency may be particularly low to begin with. Accordingly, there is a need in the art for methods that enable tagging of endogenous proteins such that the endogenous expression levels, function, and localization of the protein remain unaltered.
In stem cells, and other cells that are particularly difficult to transfect or transduce, engineering of the endogenous genomic sequence to insert a protein tag overcomes the challenges of variable expression and allows for dynamic study of the endogenously-regulated, targeted gene product. These systems are enabled by the CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)/Cas (CRISPR Associated) nuclease system, which allows for the precise targeting of a genomic locus, and with the insertion of a fluorescent protein (FP) tag under the endogenous regulatory control of the target locus.
CRISPR/Cas9 eliminates many of the challenges associated with genetic engineering and an ever-growing number of studies illuminate the power of this approach. The system is most commonly used in loss-of-function studies, wherein one or more genes are mutated or deleted to generate genetic knock-outs. Less common is the use of the system to introduce exogenous genetic sequences into a target locus. In this instance, homology-directed repair (HDR) mediates the insertion of a repair template into the target locus and can be used to correct an existing mutation in the genomic sequence or to insert exogenous nucleic acid sequences (e.g., a nucleic acid sequence encoding a fluorescent protein). Although HDR has a low error-rate, it is an inherently inefficient process, with rates of less than 10% in normal cells. As such, until now it has been difficult to reproduce HDR-mediated protein tagging across multiple targets to enable systematic use of this process in the study of endogenous protein dynamics particularly in view of the unpredictability of how the introduction of large fluorescent tags may affect endogenous gene function as well as stem cell viability, pluripotency, and chromosomal stability.
The methods provided herein utilize CRISPR/Cas9-mediated gene editing to introduce fluorescent tags via HDR into the genomic loci of target proteins, into the genomic safe harbor location, or other locations in the genome. These methods result in the production of isogenic hiPSC clones expressing detectable endogenously-regulated fusion proteins unique to each cell line, and do not substantially modify or alter stem cell pluripotency or function.
In some embodiments, the present invention provides a method for producing a stem cell comprising at least one tagged endogenous protein comprising: (a) providing a ribonucleoprotein (RNP) complex comprising a Cas protein, a CRISPR RNA (crRNA) and a trans-activating RNA (tracrRNA), wherein the crRNA is specific for a target genomic locus; (b) providing a donor plasmid comprising a first polynucleotide sequence encoding a detectable tag, a second polynucleotide sequence encoding a 5′ homology arm, and a third polynucleotide sequence encoding a 3′ homology arm, wherein the 5′ homology arm and 3′ homology arms are at least about 1 kb in length; and (c) transfecting the complex of (a) and the donor plasmid of (b) into a stem cell such that the polynucleotide sequence encoding the detectable tag is inserted into a target genomic locus to generate a tagged endogenous protein, thereby producing a stem cell comprising at least one tagged endogenous protein.
In some embodiments, the polynucleotide sequence encoding the detectable tag further comprises a polynucleotide sequence encoding a flexible linker. In some embodiments, the polynucleotide sequence encoding the detectable tag comprises at least about 20 nucleotides in length. In some embodiments, the encoded detectable tag comprises at least about 8 amino acids in length. In some embodiments, the polynucleotide sequence encoding the detectable tag comprises at least about 300 nucleotides in length. In some embodiments, the encoded detectable tag comprises at least about 100 amino acids in length. In some embodiments, the detectable tag is a fluorescent protein, a luminescent protein, a photoactivatable protein, a FLAG tag, a SNAP tag, or a Halo tag. In some embodiments, the fluorescent protein is selected from the group comprising green fluorescent protein (GFP), blue fluorescent protein, cyan fluorescent protein, yellow fluorescent protein, or red fluorescent protein.
In some embodiments, the RNP comprises a crRNA, tracrRNA, and Cas9 protein complexed at a ratio of 1:1:1. In some embodiments, the Cas protein is a wild-type Cas9 protein or a Cas9-nickase protein. In some embodiments, the crRNA sequence is selected to minimize off-target cleavage of genomic DNA sequences and/or insertion of the detectable tag. In some embodiments, the off-target cleavage of genomic DNA sequences and/or insertion of the detectable tag is less than 1.0%. In some embodiments, transfecting the CRISPR/Cas9 RNP and the donor plasmid into a stem cell results in a double stranded break at the target genomic locus. In some embodiments, the double stranded break is repaired by homology directed repair (HDR). In some embodiments, the polynucleotides encoding 5′ homology arm, the detectable tag, and the 3′ homology arm act as a repair template during HDR In some embodiments, protospacer adjacent motif (PAM) sequences are removed from the polynucleotide backbone of the donor plasmid. In some embodiments, the donor plasmid further comprises an antibiotic-resistance gene. In some embodiments, the antibiotic-resistance gene confers resistance to ampicillin and/or kanamycin.
In some embodiments, the stem cell is an induced pluripotent stem cell (iPSC) derived from a healthy donor. In some embodiments, the iPSC is a WTC cell or a WTB cell.
In some embodiments, transfecting the CRISPR/Cas9 RNP and the donor plasmid occurs by electroporating the stem cells. In some embodiments, the stem cells are electroporated using a Neon® transfection system or an Amaxa Nucleofector® system. In some embodiments, the stem cells are electroporated for at least 1 pulse. In some embodiments, the pulse is at least about 15 ms at a voltage of at least about 1300 V. In some embodiments, the stem cells are electroporated for 1-5 pulses. In some embodiments, the stem cells are electroporated for at least 2 pulses.
In some embodiments, the target genomic locus is a locus within a gene encoding a structural protein. In some embodiments, the structural protein is selected from paxillin, alpha tubulin, lamin B1, Tom20, desmoplakin, beta actin, Sec61B, fibrillarin, myosin, centrin2, ZO-1, Safe-harbor-GFP, ST6Gal1, vimentin, LAMP1, LC3, Safe harbor-CAAX, and PMP34.
In some embodiments, a plurality of detectable tags are inserted into a plurality of target loci. In some embodiments, a plurality polynucleotides encoding a plurality of detectable tags are inserted into one donor plasmid. In some embodiments, two or more polynucleotides encoding two or more detectable tags are inserted into one donor plasmid. In some embodiments, a first plurality of polynucleotides encoding two or more detectable tags are inserted into a first donor plasmid and a second plurality of polynucleotides encoding two or more detectable tags are inserted into a second donor plasmid. In some embodiments, a first polynucleotide encoding a first detectable tag is inserted into a first donor plasmid and a second polynucleotide encoding a second detectable tag is inserted into a second donor plasmid. In some embodiments, the first and second donor plasmid are introduced to the cell at the same time. In some embodiments, the first and second donor plasmid are introduced to the cell sequentially.
In some embodiments, 10 polynucleotides each encoding a unique detectable tag and each inserted into one of about 10 different donor plasmids. In some embodiments, the 10 different donor plasmids are introduced to the cell at the same time. In some embodiments, the 10 different donor plasmids are introduced to the cell sequentially.
In some embodiments, between 2 and 10 detectable tags are inserted into between 2 and 10 target loci. In some embodiments, between 3 and 5 detectable tags are inserted into between 3 and 5 target loci.
In some embodiments, the methods described herein further comprise selecting the stem cells that comprise at least one tagged protein. In some embodiments, selecting the stem cells comprises selecting the stem cells that are positive for the detectable tag using fluorescence activated cell sorting (FACS). In some embodiments, at least about 0.1% of the stem cells are positive for the detectable tag.
In some embodiments, the methods described herein further comprise screening of the stem cells comprises genetic screening to determine at least two or more of the following: (a) insertion of the detectable tag sequence; (b) stable integration of the plasmid backbone; and/or (c) relative copy number of the detectable tag sequence. In some embodiments, the genetic screen is performed by droplet digital PCR (ddPCR), by tile junction PCR, or both.
In some embodiments, selecting clones comprising an insertion of the detectable tag comprises selecting clones that have the detectable tag sequence inserted into one or both alleles of the target genomic locus and do not have stable integration of the plasmid backbone.
In some embodiments, the methods described herein further comprise sequencing clones comprising an insertion of the detectable tag to identify clones comprising a precise insertion of the detectable tag. In some embodiments, clones comprising a precise insertion are identified by: (a) amplifying the genomic sequences across the junction between the inserted detectable tag and the 5′ and 3′ distal genomic regions to generate tiled-junction amplification products. (b) sequencing the tiled-junction amplification products of (a); and (c) comparing the sequence of the tiled-junction amplification products with a reference sequence.
In some embodiments, the stem cell comprising at least one tagged endogenous protein expresses at least one protein associated with pluripotency. In some embodiments, the protein associated with pluripotency is selected from the group comprising Oct3/4, Sox2, Nanog, Tra-160, and Tra-181, SSEA3/4. In some embodiments, expression level of the at least one protein associated with pluripotency is comparable to the expression level of the same protein in an unmodified stem cell. In some embodiments, the stem cell comprising at least one tagged protein maintains a differentiation potential that is comparable to an unmodified stem cell. In some embodiments, the stem cell comprising at least one tagged protein is capable of differentiating into mesoderm, endoderm, or ectoderm.
In some embodiments, the expression of the at least one tagged protein is maintained in a differentiated cell derived from the stem cell comprising at least one tagged protein. In some embodiments, the morphology, viability, potency, and endogenous cellular functions of the stem cells comprising at least one tagged protein and/or differentiated cells derived from stem cells comprising at least one tagged protein are not substantially changed compared to unmodified stem cells and differentiated cells thereof.
In some embodiments, the present invention provides a method for screening the effects of one or more test agents on one or more cellular structures in one or more cell types comprising: providing one or more cultures of one or more stem cells and/or differentiated cells derived therefrom produced by the methods described herein, wherein the stem cells or differentiated cells derived therefrom comprise a tagged endogenous protein; adding one or more test agent to one or more of the cultures; assaying the culture at one or more time points before and/or after the addition of the one or more test agent, and determining the effects of the one or more test agent on one or more cellular structures in the one or more cell types.
In some embodiments, the effect of the one or more test agents are determined by visualization of the cells. In some embodiments, the tagged endogenous protein comprises at least about 100 amino acids in length. In some embodiments, the tagged endogenous protein is a fluorescent protein, a luminescent protein, a photoactivatable protein, a FLAG tag, a SNAP tag or a Halo tag. In some embodiments, the fluorescent protein is selected from the group comprising green fluorescent protein (GFP), blue fluorescent protein, cyan fluorescent protein, yellow fluorescent protein or red fluorescent protein. In some embodiments, the tagged endogenous protein is a structural protein. In some embodiments, the structural protein is selected from paxillin, alpha tubulin, lamin B1, Tom20, desmoplakin, beta actin, Sec61B, fibrillarin, myosin, centrin2, ZO-1, Safe-harbor-GFP, ST6Gal1, vimentin, LAMP, LC3, Safe harbor-CAAX, and PMP34.
In some embodiments, the methods provided herein for determining the effect of one or more test agents comprises providing two or more cultures of stem cells and/or one or more differentiated cells derived therefrom. In some embodiments, the two or more cultures each comprise a different differentiated cell type and/or a different tagged endogenous structure. In some embodiments, the two or more cultures each comprise a different differentiated cell type and a different tagged endogenous structure.
In some embodiments, the methods described herein comprise microscopy of the one or more cultures of one or more stem cells and/or one or more differentiated cells derived therefrom at one or more time points before and/or after addition of the one or more test agent. In some embodiments, the microscopy is confocal microscopy.
In some embodiments, determining effects on one or more cellular structures comprises comparing one or more variables selected from subcellular morphology, localization and/or dynamics of tagged structure(s), viability and cellular morphology from one or more cultures of one or more stem cells and/or one or more differentiated cells derived therefrom at one or more time points after treatment with the same variable prior to treatment.
In some embodiments, the determining effects on one or more cellular structures comprises comparing one or more variables selected from subcellular morphology, localization and/or dynamics of tagged structure(s), viability and cellular morphology from one or more cultures of one or more stem cells and/or one or more differentiated cells derived therefrom at one or more time points after treatment with one or more cultures of one or more stem cells and/or one or more differentiated cells derived therefrom treated with a control agent.
In some embodiments, the present invention provides kits comprising an array of stem cells or differentiated cells derived therefrom comprising at least one tagged endogenous protein. In some embodiments, the kit comprises stem cells or differentiated cells derived therefrom comprising at least one tagged endogenous protein made according to the methods described herein. In some embodiments of the kits, the detectable tag comprises at least about 100 amino acids in length. In some embodiments of the kits, the detectable tag is a fluorescent protein, a luminescent protein, a photoactivatable protein, a FLAG tag, a SNAP tag or a Halo tag. In some embodiments of the kits, the fluorescent protein is selected from the group comprising green fluorescent protein (GFP), blue fluorescent protein, cyan fluorescent protein, yellow fluorescent protein or red fluorescent protein. In some embodiments of the kits, the tagged protein is a structural protein. In some embodiments of the kits, the structural protein is selected from paxillin, alpha tubulin, lamin B1, Tom20, desmoplakin, beta actin, Sec61B, fibrillarin, myosin, centrin2, ZO-1, Safe-harbor-GFP, ST6Gal1, vimentin, LAMP1, LC3, Safe harbor-CAAX, and PMP34.
In some embodiments, the present invention provides a method for visualizing a stem cell produced by the method of claim 1, comprising: (a) plating the stem cells on plates; and (b) imaging the cells by microscope. In some embodiments, the imaging is live-cell imaging. In some embodiments, the imaging is in three dimensions. In some embodiments, the imaging involves co-localization with antibodies.
In some embodiments, the present invention provides a donor polynucleotide comprising a first polynucleotide sequence encoding a detectable tag, a second polynucleotide sequence encoding a 5′ homology arm, and a third polynucleotide sequence encoding a 3′ homology arm, wherein the 5′ homology arm and 3′ homology arm are each about 1 kb in length. In some embodiments, the donor polynucleotide further comprises a flexible linker sequence. In some embodiments, the polynucleotide sequence encoding the detectable tag comprises at least about 20 nucleotides in length. In some embodiments, the polynucleotide sequence encoding the detectable tag comprises between about 300 nucleotides in length and 3,000 nucleotides in length. In some embodiments, the polynucleotide sequence encoding the detectable tag is greater than 3000 nucleotides. In some embodiments, the polynucleotide sequence encoding the detectable tag encodes a detectable tag that comprises at least about 8 amino acids in length. In some embodiments, the polynucleotide sequence encoding the detectable tag encodes a detectable tag that comprises between about 8 and about 100 amino acids in length.
In some embodiments, at least two detectable tags are encoded by the donor polynucleotide. In some embodiments, the detectable tag is selected from the group consisting of a fluorescent protein, a luminescent protein, a photoactivatable protein, a FLAG tag, a SNAP tag, and a Halo tag. In some embodiments, the fluorescent protein is selected from the group consisting of green fluorescent protein (GFP), blue fluorescent protein, cyan fluorescent protein, yellow fluorescent protein, and red fluorescent protein. In some embodiments, the fluorescent protein is selected from the group consisting of mCherry, tdTomato, mNeonGreen, and mTagRFPt. In some embodiments, n the donor polynucleotide is a plasmid.
In some embodiments, the present invention provides a use of a donor polynucleotide of any of claims 91 to 92 to produce a stem cell using a gene editing system selected from the group consisting of: (a) a CRISPR/Cas9 ribonucleoprotein (RNP) complex comprising a Cas9 protein, a CRISPR RNA (crRNA) and a trans-activating RNA (tracrRNA), wherein the crRNA is specific for a target genomic locus; (b) a polynucleotide encoding a Cas nuclease, a CRISPR RNA (crRNA) and a trans-activating RNA (tracrRNA), wherein the crRNA is specific for a target genomic locus; (c) a TALEN; and (d) a zinc finger nuclease.
In some embodiments, the present invention provides use a of the donor polynucleotide described herein for imaging one or more proteins in one or more cells. In some embodiments, the one or more cells are tissue. In some embodiments, the one or more cells are living. In some embodiments, the imaging is three dimensional imaging.
In some embodiments, the present invention provides a stably tagged stem cell clone produced by the methods described herein.
In some embodiments, the present invention provides a purified preparation of the stably tagged stem cell clones described herein.
In some embodiments, the present invention provides a method of generating a signature for a test agent comprising: (a) admixing the test agent with one or more stably tagged stem cell clones produced by the methods described herein; (b) detecting a response in the one or more stem cell clone; (c) detecting a response in a control stem cell; (d) detecting a difference in the response in the one or more stem cell clones from the control stem cell; and (e) generating a data set of the difference in the response.
In some embodiments, the present invention provides a stably tagged stem cell clone produced by the methods described herein in an activity selected from the group consisting of: (a) determining toxicity of a test agent on the stably tagged stem cell clone; (b) determining the stage of disease in a stably tagged stem cell clone; (c) determining the dose of a test agent or drug for treatment of disease; (d) monitoring disease progression in a stably tagged stem cell clone; and (e) monitoring effects of treatment of a test agent or drug on the stably tagged stem cell clone.
In some embodiments, the present invention provides use of a stably tagged stem cell clone produced by the methods described herein for monitoring progression of disease or effect of a test agent on a disease wherein the disease is selected from the group consisting of aberrant cell growth, wound healing, inflammation, and neurodegeneration.
In some embodiments, the present invention provides a differentiated cell or group of differentiated cells derived from a stably tagged stem cell clone described herein. In some embodiments, the differentiated cell or group of differentiated cells are selected from the group consisting of cardiomyocytes, differentiated kidney cells, and differentiated fibroblasts.
In some embodiments, the present invention provides a stably tagged stem cell clone comprising a CRISPR/Cas9 ribonucleoprotein (RNP) complex. In some embodiments, the stably tagged stem cell clone comprises a donor polynucleotide, wherein in the donor polynucleotide comprises a first polynucleotide sequence encoding a detectable tag, a second polynucleotide sequence encoding a 5′ homology arm, and a third polynucleotide sequence encoding a 3′ homology arm, wherein the 5′ homology arm and 3′ homology arm are about 1 kb in length.
In some embodiments, the present invention provides a stably tagged stem cell clone comprising a donor polynucleotide, wherein in the donor polynucleotide comprises a first polynucleotide sequence encoding a detectable tag, a second polynucleotide sequence encoding a 5′ homology arm, and a third polynucleotide sequence encoding a 3′ homology arm, wherein the 5′ homology arm and 3′ homology arm are about 1 kb in length.
In some embodiments, the methods described here further comprise microscopy of the one or more cultures of one or more stem cells and/or one or more differentiated cells derived therefrom at one or more time points before and/or after addition of the one or more test agent. In some embodiments, the microscopy is confocal microscopy.
In some embodiments, the present invention provides a kit comprising an array of stem cells or differentiated cells derived therefrom for visualizing or screening the effects of one or more test agents on one or more cellular structures in one or more cell types comprising at least one tagged protein.
The present invention provides methods for producing stem cells comprising one or more tagged proteins using the CRISPR/Cas9 gene editing system. The methods described herein enable the insertion of fluorescent tags into a target genomic loci or plurality of target genomic loci to generate stem cells that are phenotypically and functional similar to the un-modified parent population. Stem cells produced by the methods described herein additionally retain the capacity to self-renew and differentiate into specialized cell types.
The section headings used herein are for organizational purposes only and are not to be construed as limiting the subject matter described. All documents, or portions of documents, cited herein, including but not limited to patents, patent applications, articles, books, and treatises, are hereby expressly incorporated by reference in their entirety for any purpose. In the event that one or more of the incorporated documents or portions of documents define a term that contradicts that term's definition in the application, the definition that appears in this application controls. However, mention of any reference, article, publication, patent, patent publication, and patent application cited herein is not, and should not be taken as an acknowledgment, or any form of suggestion, that they constitute valid prior art or form part of the common general knowledge in any country in the world.
In the present description, any concentration range, percentage range, ratio range, or integer range is to be understood to include the value of any integer within the recited range and, when appropriate, fractions thereof (such as one tenth and one hundredth of an integer), unless otherwise indicated. As used in this application, the terms “about” and “approximately” are used as equivalents. Any numerals used in this application with or without about/approximately are meant to cover any normal fluctuations appreciated by one of ordinary skill in the relevant art. In certain embodiments, the term “approximately” or “about” refers to a range of values that fall within 25%, 20%, 19%, 18%, 17%, 16%, 15%, 14%, 13%, 12%, 11%, 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, or less in either direction (greater than or less than) of the stated reference value unless otherwise stated or otherwise evident from the context (except where such number would exceed 100% of a possible value).
It should be understood that the terms “a” and “an” as used herein refer to “one or more” of the enumerated components unless otherwise indicated. The use of the alternative (e.g., “or”) should be understood to mean either one, both, or any combination thereof of the alternatives. As used herein, the terms “include” and “comprise” are used synonymously. As used herein, “plurality” may refer to one or more components (e.g., one or more detectable tags).
In some embodiments, the present invention provides for methods of producing a stem cell comprising at least one tagged endogenous protein. In certain embodiments, the endogenous protein is a wild-type protein, whereas in other embodiments, the endogenous protein comprises one or more naturally-occurring mutations and/or one or more introduced mutations. Examples of mutations include but are not limited to amino acid insertions, deletions and substitutions.
The term “stem cell,” as used herein, refers to a multipotent, non-specialized cell with the capacity to self-renew and to differentiate into at least one differentiated cell lineage (e.g., potency). The “stemness” of a stem cell include the characteristics of self-renewal and multipotency. Self-renewal refers to the proliferation of a stem cell to generate one (asymmetric division) or two (symmetric division) daughter cells with development potentials that are indistinguishable from those of the mother cell. Self-renewal results in an expanded population of stem cells, each of which maintains an undifferentiated state and the ability to differentiate into specialized cells. Typically, an expanded population of stem cells retains the stemness characteristics of the parent cell.
Potency refers to the ability of a stem cell to differentiate into at least one type of specialized cell. The greater the number of different specialized cell types a stem cell can differentiate into, the greater its potency. In some embodiments, a stem cell may be a totipotent cell, and able to differentiate into any specialized cell type (e.g., a zygote). In some embodiments, a stem cell may be pluripotent and able to differentiate into cell types of any of the three germ layers (endoderm, mesoderm, or ectoderm) (e.g., an embryonic stem cell or an induced pluriopotent stem cell (iPSC)). In some embodiments, the stem cell may be multipotent and have the capacity to differentiate into multiple cell types of a particular cell lineage (e.g., a hematopoietic stem cell). Multipotent stem cells may also be referred to as progenitor cells. In certain embodiments, stem cells may be obtained from a donor, or they may be generated from a non-stem cell. Non-limiting examples of stem cells include embryonic stem cells and adult stem cells. Stem cells include, but are not limited to, mesenchymal stem cells, adipose tissue-derived stem cells, hematopoietic stem cells, and umbilical cord-derived stem cells.
In some embodiments, the stem cells described herein are human iPSCs. iPSCs are derived from differentiated adult cells and have been modified to express transcription factors and proteins responsible for the induction and/or maintenance of a pluripotent state (e.g., Oct 3/4, Sox family transcription factors, Klf family transcription factors, and Nanog). In some embodiments, the iPSCs described herein are derived from a normal, healthy human donor. In some embodiments, the iPSC is a WTC or a WTB cell line (Kreitzer et al, American Journal of Stem Cells, 2:119-31, 2013; Miyaoka et al., Nature Methods, 11:291-3, 2013). In some embodiments, the iPSC is derived from a human donor that has been diagnosed with a disease or disorder. For example, in some embodiments the iPSC may be derived from a patient diagnosed with a cardiomyopathy (e.g., arrhythmogenic right ventricular cardiomyopathy, dialated cardiomyopathy, hypertrophic cardiomyopathy, left ventricular non-compaction cardiomyopathy, or restrictive cardiomyopathy), a heritable disease (e.g., deficiency of acyl-CoA dehydrogenase, very long chain (ACADVL), Barth syndrome (BTHS), carnitine-acylcarnitine translocase deficiency (CACTD), congenital disorder of deglycosylation (CDDG), muscular dystrophies (including Emery-Dreifuss muscular dystrophy (EDMD1), autosomal dominant Emery-Dreifuss muscular dystrophy (EDMD2), Duchenne's muscular dystrophy, and chronic granulomatous disease), Friedreich ataxia 1 (FRDA), glycogen storage disease II, Hurler-Scheie syndrome, isobutyryl-CoA dehydrogenase deficiency, Kearn-Sayre syndrome (KSS), Leigh syndrome, leprechaunism, long chan 3-hydroxyacyl-CoA dehydrogenase deficiency, mitochondrial DNA depletion syndrome 12 (cardiomyopathic type), mucolipidosis mIa, myoclonus epilepsy associated with ragged-red fibers (MERFF), centronuclear myopathy 1 (CNMI), Preader-Willi syndrome (PWS), adult-onset progeria, propionic academia, Vici syndrome (VICIS), or Werner syndrome), or a disease caused by or associated with a chromosomal abnormality (e.g., chromosome 1P36 deletion syndrome, Duchenne's muscular dystrophy, and Prader-Willi syndrome).
“Stem cell markers” as used herein are defined as gene products (e.g. protein, RNA, glycans, glycoproteins, etc.) that are specifically or predominantly expressed by stem cells. Cells may be identified as a particular type of stem cell based on their expression of one or more of the stem cell markers using techniques commonly available in the art including, but not limited to, analysis of gene expression signatures of cell populations by microarray, qPCR, RNA-sequencing (RNA-Seq), Next-generation sequencing (NGS), serial analysis of gene expression (SAGE), and/or analysis of protein expression by immunohistochemistry, western blot, and flow cytometry. Stem cell markers may be present in the nucleus (e.g., transcription factors), in the cytosol, and/or on the cell membrane (e.g., cell-surface markers). In some embodiments, a stem cell marker is a gene product that directly and specifically supports the maintenance of stem cell identity and/or stem cell function. In some embodiments, a stem cell marker is gene that is expressed specifically or predominantly by stem cells but does not necessarily have a specific function in the maintenance of stem cell identity and/or stem cell function. Examples of stem cell markers include, but are not limited to, Oct 3/4, Sox2, Nanog, Tra-160, Tra-181, and SSEA3.
In some embodiments, the present invention provides genetically engineered stem cells. Herein, the terms “genetically engineered stem cells” or “modified stem cells” or “edited stem cells” refer to stem cells that comprise one or more genetic modifications, such as one or more tags inserted into a locus of one or more endogenous target genes. “Genetic engineering” refers to the process of manipulating a genomic DNA sequence to mutate or delete one or more nucleic acids of the endogenous sequence or to introduce an exogenous nucleic acid sequence into the genomic locus. The genetically-engineered or modified stem cells described herein comprise a genomic DNA sequence that is altered (e.g., genetically engineered to express a tag) compared to an un-modified stem cell or control stem cell. As used herein, an un-modified or control stem cell refers to a cell or population of cells wherein the genomes have not been experimentally manipulated (e.g., stem cells that have not been genetically engineered to express a tag).
In some embodiments, the stem cells described herein are derived from a donor (e.g., a healthy donor) and comprise one or more genetic mutations associated with a particular disease or disorder introduced into the iPSC genome. Such embodiments are referred to herein as “mutant stem cells.” Introduction of mutations into an iPSC derived from a health donor can mimic the genetic state of a particular disease or disorder, while maintaining the isogenic relationship between the mutant stem cell and the normal iPSC from which it is derived. This allows direct comparisons between the two cell types to be made when assessing the effect of a particular mutation on cellular structure, cellular function, protein localization, protein function, and/or protein expression. For example, mutations may be introduced into the PKD1 and/or PKD2 genes of an iPSC derived from a healthy donor to produce a PC 1-mutant stem cell, a PC2-mutant stem cell, or a PC1/PC2-mutant stem cell. These mutant stem cells and the corresponding normal stem cells from which they are derived can then be further engineered to express one or more detectable markers in one or more endogenous target genomic loci. In some embodiments, these cells are assayed according to the methods described herein to determine the effect of a particular mutation on cellular structure, cellular function, protein localization, protein function, and/or protein expression, and can elucidate the role of a protein in different diseases, such as polycystic kidney disease.
In some embodiments, the present invention provides populations of genetically engineered stem cells that have been modified to express one or more tagged endogenous proteins. Herein, a “population” of cells (e.g., stem cells) refers to any number of cells greater than 1, e.g., at least 1×103 cells, at least 1×104 cells, at least 1×105 cells, at least 1×106 cells, at least 1×107 cells, at least 1×108 cells, at least 1×109 cells, or at least 1×1010 or more cells.
In some embodiments, the present invention provides methods of producing genetically-engineered stem cells comprising at least one tagged endogenous protein. In some embodiments, the method comprises (a) providing a gene-editing system capable of producing double or single stranded DNA breaks at a target endogenous locus; (b) providing a repair template comprising a polynucleotide sequence encoding a detectable tag: (c) introducing the gene-editing system and the repair template into a stem cell such that the polynucleotide sequence encoding the detectable tag is inserted into an endogenous target genomic locus to generate the tagged endogenous protein. In certain embodiments, during step (c), the cells are cultured under conditions that allow insertion of the sequence encoding the detectable tag into the target genomic locus, such as any of those disclosed herein. In particular embodiments, the cells produced in step (c) are cultured under conditions suitable for expression of the tagged endogenous protein. In various embodiments of any of the methods disclosed herein, the stem cell is an iPSC, and the methods further comprise generating the iPSC. In particular embodiments, the iPSCs are generated from cells obtained from a donor, such as a normal, healthy donor or a diseased donor.
In some embodiments, the methods described herein are used to produce a genetically-engineered stem cell comprising one tagged endogenous protein. In some embodiments, the methods described herein are used to produce a genetically-engineered stem cell comprising two, three, four, five, six, seven, eight, nine, ten, or more tagged endogenous proteins. In some embodiments, the repair template comprises a 5′ homology arm and a 3′ homology arm, each of about 1 kb in length, or each more than 1 kb in length.
Herein, the term “gene-editing system” refers to a protein, nucleic acid, or combination thereof that is capable of modifying a target locus of an endogenous DNA sequence when introduced into a cell. Numerous gene editing systems suitable for use in the methods of the present invention are known in the art including, but not limited to, zinc-finger nuclease systems, TALEN systems, and CRISPR/Cas systems.
In some embodiments, the gene editing system used in the methods described herein is a CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats)/Cas (CRISPR Associated) nuclease system, which is an engineered nuclease system based on a bacterial system that can be used for mammalian genome engineering. Generally, the system comprises a CRISPR-associated endonuclease (for example, a Cas endonuclease) and a guide RNA (gRNA). The gRNA is comprised of two parts; a crispr-RNA (crRNA) that is specific for a target genomic DNA sequence, and a trans-activating RNA (tracrRNA) that facilitates endonuclease binding to the DNA at the targeted insertion site. In some embodiments, the crRNA and tracrRNA may be present in the same RNA oligonucleotide, referred to as a single guide-RNA (sgRNA). In some embodiments, the crRNA and tracrRNA may be present as separate RNA oligonucleotides. In such embodiments, the gRNA is comprised of a crRNA oligonucleotide and a tracrRNA oligonucleotide that associate to form a crRNA:tracrRNA duplex. As used herein, the term “guide RNA” or “gRNA” refers to the combination of a tracrRNA and a crRNA, present as either an sgRNA or a crRNA:tracrRNA duplex.
In some embodiments, the CRISPR/Cas systems described herein comprise a Cas protein, a crRNA, and a tracrRNA. In some embodiments, the crRNA and tracrRNA are combined as a duplex RNA molecule to form a gRNA. In some embodiments, the crRNA:tracrRNA duplex is formed in vitro prior to introduction to a cell. In some embodiments, the crRNA and tracrRNA are introduced into a cell as separate RNA molecules and crRNA:tracrRNA duplex is then formed intracellularly. In some embodiments, polynucleotides encoding the crRNA and tracrRNA are provided. In such embodiments, the polynucleotides encoding the crRNA and tracrRNA are introduced into a cell and the crRNA and tracrRNA molecules are then transcribed intracellularly. In some embodiments, the crRNA and tracrRNA are encoded by a single polynucleotides. In some embodiments, the crRNA and tracrRNA are encoded by separate polynucleotides.
In some embodiments, a detectable tag is inserted into a target locus of an endogenous gene mediated by Cas-mediated DNA cleavage at or near a target insertion site. As such, the term “target insertion site” refers to a specific location within a target locus, wherein a polynucleotide sequence encoding a detectable tag can be inserted. In some embodiments, a Cas endonuclease is directed to the target insertion site by the sequence specificity of the crRNA portion of the gRNA, which requires the presence of a protospacer motif (PAM) sequence near the target insertion site. A variety of PAM sequences suitable for use with a particular endonuclease (e.g., a Cas9 endonuclease) are known in the art (See e.g., Nat Methods. 2013 November; 10(11): 1116-1121 and Sci Rep. 2014; 4: 5405). Exemplary PAM sequences suitable for use in the present invention are shown in Table 5. In some embodiments, the target locus comprises a PAM sequence within 50 base pairs of the target insertion site. In some embodiments, the target locus comprises a PAM sequence within 10 base pairs of the target insertion site. The genomic loci that can be targeted by this method are limited only by the relative distance of the PAM sequence to the target insertion site and the presence of a unique 20 base pair sequence to mediate sequence-specific, gRNA-mediated Cas9 binding. In some embodiments, the target insertion site is located at the 5′ terminus of the target locus. In some embodiments, the target insertion site is located at the 3′ end of the target locus. In some embodiments, the target insertion site is located within an intron or an exon of the target locus.
The specificity of a gRNA for a target loci is mediated by the crRNA sequence, which comprises a sequence of about 20 nucleotides that are complementary to the DNA sequence at a target locus. In some embodiments, the crRNA sequences used in the methods of the present invention are at least 90% complementary to a DNA sequence of a target locus. In some embodiments, the crRNA sequences used in the methods of the present invention are at least 95%, 96%, 97%, 98%, or 99% complementary to a DNA sequence of a target locus. In some embodiments, the crRNA sequences used in the methods of the present invention are 100% complementary to a DNA sequence of a target locus. In some embodiments, the crRNA sequences described herein are designed to minimize off-target binding using algorithms known in the art (e.g., Cas-OFF finder) to identify target sequences that are unique to a particular target locus or target gene. In some embodiments, the crRNA sequences used in the methods of the present invention are at least 90% identical to one of SEQ ID NOs: 85-140. In some embodiments, the crRNA sequences used in the methods of the present invention are at least 95%, 96%, 97%, 98%, or 99% identical to one of SEQ ID NOs: 85-140. In some embodiments, the crRNA sequences used in the methods of the present invention are 100% identical to one of SEQ ID NOs: 85-140. Exemplary crRNA sequences are shown in Table 5.
In some embodiments, the endonuclease is a Cas protein. In some embodiments, the endonuclease is a Cas9 protein. In some embodiments, the Cas9 protein is derived from Streptococcus pyogenes (e.g., SpCas9), Staphylococcus aureus (e.g., SaCas9), or Neisseria meningitides (NmeCas9). In some embodiments, the Cas endonuclease is a Cas9 protein or a Cas9 ortholog and is selected from the group consisting of SpCas9, SpCas9-HF1, SpCas9-HF2, SpCas9-HF3, SpCas9-HF4, SaCas9, FnCpf, FnCas9, eSpCas9, and NmeCas9. In some embodiments, the endonuclease is selected from the group consisting of C2C1, C2C3, Cpf1 (also referred to as Cas12a), Cas1, Cas1B, Cas2, Cas3, Cas4, Cas5, Cas6, Cas7, Cas8, Cas9 (also known as Csn1 and Csx12), Cas10, Csy1, Csy2, Csy3, Cse1, Cse2, Csc1, Csc2, Csa5, Csn2, Csm2, Csm3, Csm4, Csm5, Csm6, Cmr1, Cmr3, Cmr4, Cmr5, Cmr6, Csb1, Csb2, Csb3, Csx17, Csx14, Csx10, Csx16, CsaX, Csx3, Csx1, Csx15, Csf1, Csf2, Csf3, and Csf4.
In some embodiments, the Cas9 is a wildtype (WT) Cas9 protein or ortholog. WT Cas9 comprises two catalytically active domains (HNH and RuvC). Binding of WT Cas9 to DNA based on gRNA specificity results in double-stranded DNA breaks that can be repaired by non-homologous end joining (NHEJ) or homology-directed repair (HDR). In some embodiments, Cas9 is fused to proteins that recruit DNA-damage signaling proteins, exonucleases, or phosphatases to further increase the likelihood or the rate of repair of the target sequence by one repair mechanism or another. In some embodiments, a WT Cas9 is co-expressed with a nucleic acid repair template to facilitate the incorporation of an exogenous nucleic acid sequence by homology-directed repair. In some embodiments, a WT Cas9 is co-expressed with an exogenous nucleic acid sequence encoding a detectable tag to facilitate the incorporation of the nucleic acid encoding the detectable tag into an endogenous target loci by homology-directed repair.
In some embodiments, the Cas9 is a Cas9 nickase mutant. Cas9 nickase mutants comprise only one catalytically active domain (either the HNH domain or the RuvC domain). The Cas9 nickase mutants retain DNA binding based on gRNA specificity, but are capable of cutting only one strand of DNA resulting in a single-strand break (e.g. a “nick”). In some embodiments, two complementary Cas9 nickase mutants (e.g., one Cas9 nickase mutant with an inactivated RuvC domain, and one Cas9 nickase mutant with an inactivated HNH domain) are expressed in the same cell with two gRNAs corresponding to two respective target sequences; one target sequence on the sense DNA strand, and one on the antisense DNA strand. This dual-nickase system results in staggered double stranded breaks and can increase target specificity, as it is unlikely that two off-target nicks will be generated close enough to generate a double stranded break. In some embodiments, a Cas9 nickase mutant is co-expressed with a nucleic acid repair template to facilitate the incorporation of an exogenous nucleic acid sequence by homology-directed repair. In some embodiments, a Cas9 nickase mutant is co-expressed with an exogenous nucleic acid sequence encoding a detectable tag to facilitate the incorporation of the nucleic acid encoding the detectable tag into an endogenous target loci by homology-directed repair.
In some embodiments, the components of a gene editing system (e.g., one or more gRNAs and a Cas9 protein, or nucleic acids encoding the same) are introduced into a population of stem cells with a repair template. In some embodiments, the repair template comprises a polynucleotide sequence encoding a detectable tag flanked on both the 5′ and 3′ ends by homology arm polynucleotide sequences. In such embodiments, the homology arm sequences and detectable tag sequences comprised within a repair template facilitate the repair of the Cas9-induced double-stranded DNA breaks at an endogenous target loci by homology-directed repair (HDR). In such embodiments, repair of the double-stranded breaks by HDR results in the insertion of the polynucleotide sequence encoding the detectable tag into the endogenous target locus. In some embodiments, the repair template comprises a nucleic acid sequence that is at least about 90% identical to a sequence selected from SEQ ID NOs: 31-84. In some embodiments, the repair template comprises a nucleic acid sequence that is at least about 95%, 96%, 97%, 98%, or 99% identical to a sequence selected from SEQ ID NOs: 31-84. In some embodiments, the repair template comprises a nucleic acid sequence that is 100% identical to a sequence selected from SEQ ID NOs: 31-84.
1. Homology Arms
In some embodiments, each of the 5′ and 3′ homology arms is at least about 500 base pairs long. For example, the homology arm sequences may be at least 500, 510, 520, 530, 540, 550, 560, 570, 580, 590, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 2000, 3000 or more base pairs long. In some embodiments, the homology arm sequences are at least about 1000 base pairs long. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90% identical to an endogenous nucleic acid sequence located 5′ to a particular endogenous target locus. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to an endogenous nucleic acid sequence located 5′ to a particular endogenous target locus. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to an endogenous nucleic acid sequence located 5′ to a particular endogenous target locus. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90% identical to a sequence selected from SEQ ID NOs: 1-15. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to a sequence selected from SEQ ID NOs: 1-15. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to a sequence selected from SEQ ID NOs: 1-15.
In some embodiments, the 3′ homology arm polynucleotide sequence is at least about 90% identical to an endogenous nucleic acid sequence located 3′ to a particular endogenous target locus. In some embodiments, the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to an endogenous nucleic acid sequence located 3′ to a particular endogenous target locus. In some embodiments, the 3′ homology arm polynucleotide sequence is 100% identical to an endogenous nucleic acid sequence located 3′ to a particular endogenous target locus. In some embodiments, the 3′ homology arm polynucleotide sequence is at least about 90% identical to a sequence selected from SEQ ID NOs: 16-30. In some embodiments, the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to a sequence selected from SEQ ID NOs: 16-30. In some embodiments, the 3′ homology arm polynucleotide sequence is 100% identical to a sequence selected from SEQ ID NOs: 16-30.
In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90% identical to a sequence selected from SEQ ID NOs: 1-15 and the 3′ homology arm polynucleotide sequence is at least about 90% identical to a sequence selected from SEQ ID NOs: 16-30. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to a sequence selected from SEQ ID NOs: 1-15 and the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to a sequence selected from SEQ ID NOs: 16-30. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to a sequence selected from SEQ ID NOs: 1-15 and the 3′ homology arm polynucleotide sequence is 100% identical to a sequence selected from SEQ ID NOs: 16-30.
In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 1 and the 3′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 16. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 1 and the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 16. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 1 and the 3′ homology arm polynucleotide sequence is 1000% identical to SEQ ID NO: 16.
In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90 identical to SEQ ID NO: 2 and the 3′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 17. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 2 and the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 17. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 2 and the 3′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 17.
In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 3 and the 3′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 18. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 3 and the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 18. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 3 and the 3′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 18.
In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 4 and the 3′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 19. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 4 and the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 19. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 4 and the 3′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 19.
In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 5 and the 3′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 20. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 5 and the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 20. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 5 and the 3′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 20.
In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 6 and the 3′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 21. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 6 and the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 21. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 6 and the 3′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 21.
In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 7 and the 3′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 22. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 7 and the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 2. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 7 and the 3′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 22.
In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 8 and the 3′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 23. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 8 and the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 23. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 8 and the 3′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 23.
In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 9 and the 3′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 24. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 9 and the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 24. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 9 and the 3′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 24.
In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 10 and the 3′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 25. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 10 and the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 25. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 10 and the 3′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 25.
In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 11 and the 3′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 26. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 11 and the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 26. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 11 and the 3′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 26.
In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 12 and the 3′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 27. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 12 and the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 27. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 12 and the 3′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 27.
In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 13 and the 3′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 28. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 13 and the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 28. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 13 and the 3′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 28.
In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 14 and the 3′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 29. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 14 and the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 29. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 14 and the 3′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 29.
In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 15 and the 3′ homology arm polynucleotide sequence is at least about 90% identical to SEQ ID NO: 30. In some embodiments, the 5′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 15 and the 3′ homology arm polynucleotide sequence is at least about 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 30. In some embodiments, the 5′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 15 and the 3′ homology arm polynucleotide sequence is 100% identical to SEQ ID NO: 30.
The components of the gene-editing system (e.g., a CRISPR/Cas system comprising a Cas, tracrRNA, and crRNA) can be intracellularly delivered to a population of cells by any means known in the art. In some embodiments, the Cas component of a CRISPR/Cas gene editing system is provided as a protein. In some embodiments, the Cas protein may be complexed with a crRNA:tracrRNA duplex in vitro to form an CRISPR/Cas RNP (crRNP) complex. In some embodiments, the crRNP complex is introduced to a cell by transfection. In some embodiments, the Cas protein may be introduced to a cell before or after a gRNA is introduced to the cell. In some embodiments, the Cas protein is introduced to a cell by transfection before or after a gRNA is introduced to the cell.
In some embodiments, a nucleic acid encoding a Cas protein is provided. In some embodiments, the nucleic acid encoding the Cas protein is an DNA nucleic acid and is introduced to the cell by transduction. In some embodiments, the Cas9 and gRNA components of a CRISPR/Cas gene editing system are encoded by a single polynucleotide molecule. In some embodiments, the polynucleotide encoding the Cas protein and gRNA component are comprised in a viral vector and introduced to the cell by viral transduction. In some embodiments, the Cas9 and gRNA components of a CRISPR/Cas gene editing system are encoded by a different polynucleotide molecules. In some embodiments, the polynucleotide encoding the Cas protein is comprised in a first viral vector and the polynucleotide encoding the gRNA is comprised in a second viral vector. In some aspects of this embodiment, the first viral vector is introduced to a cell prior to the second viral vector. In some aspects of this embodiment, the second viral vector is introduced to a cell prior to the first viral vector. In such embodiments, integration of the vectors results in sustained expression of the Cas9 and gRNA components. However, sustained expression of Cas9 may lead to increased off-target mutations and cutting in some cell types. Therefore, in some embodiments, an mRNA nucleic acid sequence encoding the Cas protein may be introduced to the population of cells by transfection. In such embodiments, the expression of Cas9 will decrease over time, and may reduce the number of off target mutations or cutting sites.
In some embodiments, each of the Cas9, tracrRNA, crRNA, and repair template components are introduced to a cell by transfection alone or in combination (e.g., transfection of a crRNP). Transfection may be performed by any means known in the art, including but not limited to lipofection, electroporation (e.g., Neon® transfection system or an Amaxa Nucleofector®), sonication, or nucleofection. In such embodiments, the gRNA components can be transfected into a population of cells with a plasmid encoding the Cas9 nuclease. In such embodiments, the expression of Cas9 will decrease over time, and may reduce the number of off target mutations or cutting sites.
In some embodiments, the repair templates described herein comprise a polynucleotide sequence encoding a “detectable tag”, “tag,” or “label.” These terms are used interchangeably herein and refer to a protein that is capable of being detected and is linked or fused to a heterologous protein (e.g., an endogenous protein). Herein, the detectable tag serves to identify the presence of the heterologous protein. Insertion of a polynucleotide sequence encoding a detectable tag into an endogenous target loci results in the expression of a tagged version of the endogenous protein. Examples of detectable tags include but are not limited to, FLAG tags, poly-histidine tags (e.g. 6×His), SNAP tags, Halo tags, cMyc tags, glutathione-S-transferase tags, avidin, enzymes, fluorescent molecules, luminescent proteins, chemiluminescent proteins, bioluminescent proteins, and phosphorescent proteins.
In some embodiments, the detectable tag is a fluorescent protein such as green fluorescent protein (GFP), blue fluorescent protein, cyan fluorescent protein, yellow fluorescent protein, or red fluorescent protein. In some embodiments, the detectable tag is GFP. Additional examples of detectable tags suitable for use in the present methods and compositions include mCherry, tdTomato, mNeonGreen, eGFP, Emerald, mEGFP (A208K mutation), mKate, and mTagRFPt. In some embodiments the fluorescent protein is selected from the group consisting ofbBlueiUV proteins (such as TagBFP, mTagBFP2, Azurite, EBFP2, mKalamal, Sirius, Sapphire, and T-Sapphire); cyan proteins (such as ECFP, Cerulean, SCFP3A, mTurquoise, mTurquoise2, monomeric Midoriishi-Cyan, TagCFP, and mTFP1); green proteins (such as: EGFP, Emerald, Superfolder GFP, Monomeric Azami Green, TagGFP2, mUKG, mWasabi, Clover, and mNeonGreen); yellow proteins (such as EYFP, Citrine, Venus, SYFP2, and TagYFP); orange proteins (such as Monomeric Kusabira-Orange, mKOκ, mKO2, mOrange, and mOrange2); red proteins (such as mRaspberry, mCherry, mStrawberry, mTangerine, tdTomato, TagRFP, TagRFP-T, mApple, mRuby, and mRuby2); far-red proteins (such as mPlum, HcRed-Tandem, mKate2, mNeptune, and NirFP); near-infrared proteins (such as TagRFP657, IFP1.4, and iRFP): long stokes shift proteins (such as mKeima Red, LSS-mKate1, LSS-mKate2, and mBeRFP); photoactivatible pProteins (such as PA-GFP, PAmCherryl, and PATagRFP); photoconvertible proteins (such as Kaede (green), Kaede (red), KikGRI (green), KikGRI (red), PS-CFP2, PS-CFP2, mEos2 (green), mEos2 (red), mEos3.2 (green), mEos3.2 (red), PSmOrange, and PSmOrange); and photoswitchable proteins (such as Dronpa). In some embodiments, the detectable tag can be selected from AmCyan, AsRed, DsRed2, DsRed Express, E2-Crimson, HcRed, ZsGreen, ZsYellow, mCherry, mStrawberry, mOrange, mBanana, mPlum, mRasberry, tdTomato, DsRed Monomer, and/or AcGFP, all of which are available from Clontech.
In some embodiments, the polynucleotide sequence encoding the detectable tag is at least about 20 base pairs long. In some embodiments, the polynucleotide sequence encoding the detectable tag is at least 100 base pairs long. For example, the polynucleotide sequence encoding the detectable tag may be about 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 250, 300, 350, 400, 450, 500, 550, 600, 650, 700, 750, 800, 850, 900, 950, 1000, 2000, 3000, 4000, 5000 or more base pairs long. In some embodiments, the polynucleotide sequence encoding the detectable tag comprises at least about 300 base pairs long. In some embodiments, the polynucleotide sequence encoding the detectable tag comprises at least about 500 base pairs long. In further embodiments, the polynucleotide sequence encoding the detectable tag is about 700 to about 750 base pairs long. For example, the polynucleotide sequence encoding the detectable tag may be about 701, 702, 703, 704, 705, 706, 707, 708, 709, 710, 711, 712, 713, 7114, 715, 716, 717, 718, 719, 720, 721, 722, 723, 724, 725, 726, 727, 728, 729, 730, 740, or about 750 base pairs long. In some embodiments, the polynucleotide sequence encoding the detectable tag is between 710 and 730 base pairs long. The polynucleotide sequence can encode a full-length detectable tag or a portion or fragment thereof. In some embodiments, the polynucleotide sequence encodes a full-length detectable tag. In some embodiments, insertion of the detectable tag into the target locus does not significantly alter the expression or function of either the endogenous protein or the encoded detectable tag.
The insertion of the detectable tag sequence into an endogenous gene results in the production of a tagged endogenous protein. In some embodiments, the tag is directly fused to the endogenous protein. The term “directly fused” refers to two or more amino acid sequences connected to each other (e.g., by peptide bonds) without intervening or extraneous sequences (e.g., two or more amino acid sequences that are not connected by a linker sequence). In some embodiments, the polynucleotide sequence encoding the detectable tag further comprises a linker sequence such that the detectable tag is attached (or linked) to the endogenous protein by a linker sequence. In such embodiments, the attachment may be by covalent or non-covalent linkage. In some embodiments, the attachment is covalent. In some embodiments, the linker sequence is a flexible linker sequence. In some embodiments, the tag is directly fused, or attached by a linker, to the C-terminal or N-terminal end of an endogenous protein. In some embodiments, the linker sequence is selected from the group consisting of sequences shown in Tables 3 and 4.
In some embodiments, the donor polynucleotide further comprises a polynucleotide sequence encoding a selectable marker that allows for the selection of cells comprising the donor polynucleotide. Selectable markers are known in the art and include antibiotic resistance genes. In some embodiments, the antibiotic resistance gene confers resistance to gentamycin, thymidine kinase, ampicillin, and/or kanamycin.
In some embodiments, the donor polynucleotide is a plasmid, referred to herein as a “donor plasmid.” In some embodiments, the donor plasmid comprises a repair template comprising (i) a 5′ homology arm sequence, (ii) a nucleic acid sequence encoding a detectable tag; and (iii) a 3′ homology arm sequence. In some embodiments, the repair template comprised within the donor plasmid further comprises a linker sequence located at the 5′ end or the 3′ end of the nucleic acid sequence encoding the detectable tag. In some embodiments, the repair template comprised within the donor plasmid further comprises an antibiotic resistance cassette located between the 5′ and 3′ homology arm sequences. In such embodiments, the antibiotic resistance cassette may be located 3′ to the 5′ homology arm sequence and 5′ to the nucleic acid sequence encoding the detectable tag. Alternatively, the antibiotic resistance cassette may be located 5′ to the 3′ homology arm sequence and 3′ to the nucleic acid sequence encoding the detectable tag. In some embodiments, the donor plasmid does not comprise a promoter. In such embodiments, the donor plasmid functions as a vehicle to deliver the tag sequence intracellularly to a cell and does not mediate transcription and/or translation of the tag sequence or any polynucleotide sequence comprised therein.
In some embodiments, the present invention provides for methods of inserting one or more detectable tags into one or more endogenous target loci. In some embodiments, the target locus is located within an endogenous gene encoding a structural protein or a non-structural protein. Exemplary target genes are shown below in Tables 1 and 2. In some embodiments, the structural protein is selected from paxillin (PXN), tubulin-alpha 1b (TUBA1B), lamin B1 (LMNB1), actinin alpha 1 (ACTN1), translocase of outer mitochondrial membrane 20 (TOMM20), desmoplakin (DSP), Sec61 translocon beta subunit (SEC61B), fibrillarin (FBL), actin beta (ACTB), myosin heavy chain 10 (MYH10), vimentin (VIM), tight junction protein 1 (TJPI, also known as ZO-1), safe harbor locus, CAGGS promoter (AAVS1), microtubule-associated protein 1 light chain 3 beta (MAP1LC3B, also known as LC3), ST6 beta-galactoside alpha-2,6-sialyltransferase 1 (ST6GAL1), lysosomal associated membrane protein 1 (LAMP1), centrin 2 (CETN2), solute carrier family 25 member 17 (SLC25A17), RAB5A, member RAS oncogene family (RAB5A), gap junction protein alpha 1 (also known as connexin 43 (CX43)) (GJA1), mitogen-activated protein kinase 1 (MAPK1), ATPase sarcoplasmiciendoplasmic reticulum Ca2+ transporting 2 (ATP2A2), AKT serine/threonine kinase 1 (AKT1), catenin beta 1 (CTNNB1), nucleophosmin (NPM1), histone cluster 1 H2B family member j (HIST1H2BJ), Histone cluster 1 H2B family member j:2A:CAAX (CAGGS:HIST1H2BJ:2A:CAAX), polycystin 2, transient receptor potential cation channel (PKD2), dystrophin (DMD), desmin (DES), solute carrier family 25 member 17 (SLC25A17, also known as PMP34), Structural maintenance of chromosomes 1A (SMCIA), Nucleoporin 153 (NUP153), CCCTC-binding factor (CTCF), Chromobox 1 (CBXI), POU class 5 homeobox 1 (Oct4), Sex-determining region-box 2 (Sox2), and Nanog homeobox (Nanog). In certain embodiments, any of these target loci are tagged with a detectable tag, e.g., a fluorescent tag, such as GFP.
In some embodiments, the one or more detectable tags are inserted into an endogenous target locus in a gene encoding a structural protein or a non-structural protein, wherein the expression of the gene and/or the encoded protein is associated with a particular cell type or tissue type. For example, in some embodiments, the expression of the gene and/or the encoded protein is associated with cardiomyocytes, hepatocytes, renal cells, epithelial cells, endothelial cells, neurons, mucosal cells of the gut, lung, or nasal passages. In some embodiments, the expression of the gene and/or the encoded protein is associated with cardiac tissue including, but not limited to, troponin II, slow skeletal type (TNNII), actinin alpha 2 (ACTN2), troponin 13, cardiac type (TNN13), myosin light chain 2 (MYL2), myosin light chain 7 (MYL7), titin (TTN), SMAD family member 2 (SMAD), SMAD family member 5 (SMAD5), NK2 homeobox 5 (NKX2-5), Mesoderm posterior bHLH transcription factor 1 (MESP1), Mix paired-like homeobox (MIXL1), and ISL LIM homeobox 1 (ISL1).
In some embodiments, the expression of the gene and/or the encoded protein is associated with liver tissue including, but not limited to Cytochrome P450E1 (CYP2E1), Transferrin (TF), hemopexin (HPX), and albumin (ALB). In some embodiments, the expression of the gene and/or the encoded protein is associated with kidney tissue including, but not limited to Polycystic kidney disease 1 (PKD1) and Polycystic kidney disease 2 (PKD2). In some embodiments, the expression of the gene and/or the encoded protein is associated with epithelial tissue including, but not limited to keratin 5 (KRT5) and lamanin subunit gamma 2 (LAMC2). Exemplary genes associated with specific tissue and cell types are shown below in Table 2.
In some embodiments, a plurality of detectable labels is inserted into a plurality of target loci. For example, one detectable label is inserted at one endogenous loci and a different detectable label is inserted at a different endogenous loci. In such embodiments, each of the individual detectable labels is selected such that the detection of one does not interfere, or minimally interferes with, the detection of another. In such embodiments, a unique crRNA is generated for each target locus. In further embodiments, a CRISPR ribonucleoprotein (crRNP), comprising a Cas protein complexed with a crRNA:tracrRNA duplex, is produced for each target locus. In some embodiments, the plurality of nucleic acid sequences encoding the plurality of detectable labels are comprised in a single donor plasmid and are flanked on the 5′ and 3′ ends by homology arms corresponding to genomic sequences within the target locus. For example, at least 2, 3, 4, 5, 6, 7, 8, 9, 10 or more detectable labels and their corresponding homology arms may be comprised within one donor polynucleotide.
In some embodiments, the plurality of nucleic acid sequences encoding the plurality of detectable labels and their corresponding homology arms are comprised within at least two different donor plasmids. For example, at least 2, 3, 4, 5, 6, 7, 8, 9, 10 or more donor plasmids may be used in the present methods. In some embodiments, a plurality of donor plasmids (e.g., at least 2, 3, 4, 5, 6, 7, 8, 9, 10 or more) each comprising one sequence encoding a detectable label and the corresponding homology arms may be used in the present methods. In some embodiments, a plurality of donor plasmids (e.g., at least 2, 3, 4, 5, 6, 7, 8, 9, 10 or more) each comprising a plurality of sequences encoding two or more detectable labels (e.g., at least 2, 3, 4, 5, 6, 7, 8, 9, 10 or more) and the corresponding homology arms may be used in the present methods. In some embodiments, the plurality of donor plasmids are introduced to a stem cell at the same time. In some embodiments, the plurality of donor plasmids are introduced to a stem cell sequentially.
In some embodiments, the present disclosure provides edited stem cell clones that stably express one or more tagged endogenous proteins. In some embodiments, the stably tagged stem cell clones of the current invention are characterized by (i) mono- or biallelic insertion of a nucleic acid sequence encoding a detectable tag (e.g. GFP) into one or more endogenous proteins (e.g., structural, non-structural, or non-expressed proteins of the stem cell); (ii) pluripotency (e.g., the ability to differentiate into all three germ layers); and (iii) the lack of additional mutations or alternations in the endogenous stem cell genome. Such edited stem cell clones are herein referred to as “stably tagged stem cell clones.”
The stably tagged stem cell clones described herein phenotypically differ from non-engineered stem cell clones only by the expression of one or more endogenous proteins that have been tagged with a detectable tag and the incorporation of one or more antibiotic resistance cassettes into the one or more tagged endogenous loci. In some embodiments, the stably tagged stem cell clones of the current invention are characterized by (i) mono- or biallelic insertion of a nucleic acid sequence encoding a detectable tag (e.g. GFP) into one or more endogenous proteins (e.g., structural, non-structural, or non-expressed proteins of the stem cell); (ii) pluripotency (e.g., the ability to differentiate into all three germ layers); and (iii) the presence of one or more additional mutations or alternations in the endogenous stem cell genome. Such edited stem cell clones are herein referred to as “stably tagged mutant stem cell clones.” In some embodiments, the stably tagged mutant stem cell clones comprise one or more one or more additional mutations or alternations in the endogenous stem cell genome that are associated with a particular disease or disorder. Thus, the stably tagged mutant stem cell clones described herein phenotypically differ from non-engineered stem cell clones by the expression of one or more endogenous proteins that have been tagged with a detectable tag, the incorporation of one or more antibiotic resistance cassettes into the one or more tagged endogenous loci, and the presence of one or more mutations additional not found in the non-engineered stem cell clones. The stably tagged mutant stem cell clones described herein phenotypically differ from the corresponding stably tagged stem cell clones only by the presence of one or more additional mutations.
Provided herein are compositions comprising stably tagged stem cell clones made by the methods described herein. In some embodiments, the compositions comprise a stably tagged stem cell clone wherein one endogenous protein is tagged. For example, a composition may comprise a stably tagged stem cell clone expressing a tagged endogenous protein wherein the endogenous protein is one selected from Tables 1 and/or 2 (e.g., one of PXN, TUBA1B, LMNB1, ACTN1, TOMM20, DSP, SEC61B, FBL, ACTB, MYH10, VIM. TJPI (also known as ZO-1), AAVS1, MAP1LC3B (also known as LC3), ST6GAL1, LAMP, CETN2, SLC25A17 (also known as PMP34), RAB5A, GJA1 (also known as connexin 43 (CX43)), MAPK1, ATP2A2, AKT1, CTNNB1, NPM1, HIST1H2BJ, CAGGS:HIST1H2BJ:2A:CAAX, PKD2, DMD, DES, SLC25A17 (also known as PMP34), SMC1A, NUP153, CTCF, CBX1, Oct4, Sox2, Nanog, TNNI1, ACTN2, TNN13, MYL2, MYL7, TTN, SMAD, SMAD5, NKX2-5, MESP1, MIXL1, ISL1, CYP2E1, TF, HPX, ALB, PKD1, PKD2, KRT5, and LAMC2.
In some embodiments, the compositions described herein comprise a stably tagged stem cell clone wherein at least two endogenous proteins are tagged. For example, a composition may comprise a stably tagged stem cell clone wherein one endogenous loci is tagged with a detectable tag and wherein another endogenous loci is tagged with a different detectable tag. In such embodiments, either of the endogenous loci may be selected from Tables 1 and/or 2. For example, the endogenous proteins may be two or more of those listed in Tables 1 and 2 (e.g., two or more of PXN, TUBA1B, LMNB1, ACTN1, TOMM20, DSP, SEC61B, FBL, ACTB, MYH10, VIM, TJP1 (also known as ZO-1), AAVS1, MAP1LC3B (also known as LC3), ST6GAL1, LAMP1, CETN2, SLC25A17 (also known as PMP34), RAB5A, GJA1 (also known as connexin 43 (CX43)), MAPK1, ATP2A2, AKT1, CTNNB1, NPM1, HIST1H2BJ, CAGGS:HIST1H2BJ:2A:CAAX, PKD2, DMD, DES, SLC25A17 (also known as PMP34), SMCIA, NUP153, CTCF, CBX1, Oct4, Sox2, Nanog, TNNII, ACTN2, TNN13, MYL2, MYL7, TTN, SMAD, SMAD5, NKX2-5, MESP1, MIXL1, ISL1, CYP2E1, TF, HPX, ALB, PKD1, PKD2, KRT5, and LAMC2. In some embodiments, one detectable tag may be inserted into a target loci in TUBABI and a different detectable tag may be inserted into a target loci in LMNB1. In some embodiments, one detectable tag may be inserted into a target loci in SEC61B and a different detectable tag may be inserted into a target loci in LMNB1. In some embodiments, one detectable tag may be inserted into a target loci in TOMM20 and a different detectable tag may be inserted into a target loci in TUBABI. In some embodiments, one detectable tag may be inserted into a target loci in SEC61B and a different detectable tag may be inserted into a target loci in TUBAB1. In some embodiments, one detectable tag may be inserted into a target loci in TUBABI and a different detectable tag may be inserted into a target loci in CETN2. In some embodiments, one detectable tag may be inserted into a target loci in SEC61B and a different detectable tag may be inserted into a target loci in LMNB1. In some embodiments, one detectable tag may be inserted into a target loci in AAVS1 and a different detectable tag may be inserted into a target loci in CAGGS:HIST1H2BJ:2A:CAAX. In some embodiments, one detectable tag may be inserted into a target loci in TOMM20 and a different detectable tag may be inserted into a target loci in TUBAB1.
In some embodiments, the compositions described herein comprise a stably tagged stem cell clone wherein at least three endogenous proteins are tagged. For example, a composition may comprise a stably tagged stem cell clone wherein a first endogenous loci is tagged with a first detectable tag, a second endogenous loci is tagged with a second detectable tag, and a third endogenous loci is tagged with a third detectable tag. In such embodiments, any of the endogenous loci may be selected from Tables 1 and/or 2. For example, the endogenous proteins may be three or more of those listed in Tables 1 and 2 (e.g., three or more of PXN, TUBAlB, LMNB1, ACTN1, TOMM20, DSP, SEC61B, FBL, ACTB, MYH10, VIM, TJP1 (also known as ZO-1), AAVS1, MAP1LC3B (also known as LC3), ST6GAL1, LAMP1, CETN2, SLC25A17 (also known as PMP34), RAB5A, GJA1 (also known as connexin 43 (CX43)), MAPK1, ATP2A2, AKT1, CTNNB,I NPM1, HIST1H2BJ, CAGGS:HIST1H2BJ:2A:CAAX, PKD2, DMD, DES, SLC25A17 (also known as PMP34), SMCIA, NUP153, CTCF, CBX1, Oct4, Sox2, Nanog, TNNI1, ACTN2, TNN13, MYL2, MYL7, TTN, SMAD, SMAD5, NKX2-5, MESP1, MIXL1, ISL1, CYP2E1, TF, HPX, ALB, PKD1, PKD2, KRT5, and LAMC2.
In some embodiments, the compositions described herein comprise a stably tagged stem cell clone wherein at least four or five or more endogenous proteins are tagged. In such embodiments, the endogenous proteins may be three or more of those listed in Tables 1 and 2 (e.g., four, five, or more of PXN, TUBA1B, LMNB1, ACTN 1, TOMM20, DSP, SEC61B, FBL, ACTB, MYH10, VIM, TJP1 (also known as ZO-1), AAVS1, MAP1LC3B (also known as LC3), ST6GAL1, LAMP, CETN2, SLC25A17 (also known as PMP34), RAB5A, GJA1 (also known as connexin 43 (CX43)), MAPK1, ATP2A2, AKT1, CTNNB1, NPM1, HIST1H2BJ, CAGGS:HIST1H2BJ:2A:CAAX, PKD2, DMD, DES, SLC25A17 (also known as PMP34), SMC1A, NUP153, CTCF, CBX1, Oct4, Sox2, Nanog, TNNI1, ACTN2, TNNI3, MYL2, MYL7, TTN, SMAD, SMAD5, NKX2-5, MESP1, MIXL1, ISL1, CYP2E1, TF, HPX, ALB, PKD1, PKD2, KRT5, and LAMC2.
In some embodiments, the compositions described herein comprise two or more stably tagged stem cell clones, wherein each stably tagged stem cell clone expresses a tagged endogenous protein. In some embodiments, each stably tagged stem cell clone express a different tagged endogenous protein. In some embodiments, the compositions described herein comprise 2, 3, 4, 5, 6, 7, 8, 9, 10 or more stably tagged stem cell clones, wherein each stably tagged stem cell clone expresses a different tagged endogenous protein. In some embodiments, the compositions described herein comprise two or more stably tagged stem cell clones, wherein each stably tagged stem cell clone expresses two or more tagged endogenous proteins. In some embodiments, the compositions described herein comprise 2, 3, 4, 5, 6, 7, 8, 9, 10 or more stably tagged stem cell clones, wherein each stably tagged stem cell clone expresses two or more tagged endogenous proteins. In some embodiments, the compositions described herein comprise two or more stably tagged stem cell clones, wherein each stably tagged stem cell clone expresses 2, 3, 4, 5, 6, 7, 8, 9, 10 or more tagged endogenous proteins. In some embodiments, the compositions described herein comprise 2, 3, 4, 5, 6, 7, 8, 9, 10 or more stably tagged stem cell clones, wherein each stably tagged stem cell clone expresses 2, 3, 4, 5, 6, 7, 8, 9, 10 or more tagged endogenous proteins. In some embodiments, each stably tagged stem cell clone express a group of tagged endogenous proteins that are different from the tagged endogenous proteins expressed by another stem cell clone in the same composition. Exemplary endogenous proteins that can be tagged in these embodiments are shown in Tables 1 and 2, including but not limited to PXN, TUBAlB, LMNB1, ACTN1, TOMM20, DSP, SEC61B, FBL, ACTB, MYH10, VIM, TJPI (also known as ZO-1), AAVS1, MAP1LC3B (also known as LC3), ST6GAL1, LAMP1, CETN2, SLC25A17 (also known as PMP34), RAB5A, GJA1 (also known as connexin 43 (CX43)), MAPK1, ATP2A2, AKT1, CTNNB1, NPM1, HIST1H2BJ, CAGGS:HIST1H2BJ:2A:CAAX, PKD2, DMD, DES, SLC25A17 (also known as PMP34), SMCIA, NUP153, CTCF, CBX1, Oct4, Sox2, Nanog, TNNI1, ACTN2, TNNI3, MYL2, MYL7, TTN, SMAD, SMAD5, NKX2-5, MESP1, MIXL1, ISL1, CYP2E1, TF, HPX, ALB, PKD1, PKD2, KRT5, and LAMC2.
Exemplary stably tagged stem cell clones that can be produced by the methods and techniques are shown below in Tables 3 and 4. The association of any tag in the table with any structural protein in the table is for illustrative purposes only. In this regard, any tag (or fluorescent protein) in the Table can be associated with any structural gene in the table.
In some embodiments, the present invention provides methods for selecting a stem cell that has been modified by the methods described herein to express a tagged endogenous protein. In some embodiments, the insertion of the tag sequence into the endogenous target loci does not result in additional genetic mutations or alterations in the endogenous target locus, or any other heterologous locus in the endogenous genome. In further embodiments, the insertion of the tag sequence into the endogenous target loci does not modify or alter the expression, function, or localization of the endogenous protein. In some embodiments, methods are provided herein for selecting stem cells modified by the methods described herein, wherein the identified stem cells comprise one or more of precise insertion of the nucleic acid sequence encoding a tag; pluripotency; maintained cell viability and function as compared to a non-modified stem cell; maintained levels of expression of the tagged endogenous protein as compared to a non-modified stem cell; maintained protein localization of the tagged endogenous protein as compared to a non-modified stem cell; maintained protein function of the tagged endogenous protein as compared to a non-modified stem cell; maintained expression of stem cell markers as compared to a non-modified stem cell; and/or maintained differentiation potential. In some embodiments, the properties of a selected stem cell are validated by one or more of several downstream assays.
In some embodiments, a population of edited stem cells (e.g., wherein a crRNP and a donor plasmid have been transfected into the cells) are sorted based on their relative expression of the detectable tag. In some embodiments, cells are sorted by fluorescence activated cell sorting (FACS). Cells that are positive for the inserted tag (e.g., express the tag at levels that are increased compared to non-edited population) are selected for further analysis. In some embodiments, the selected cells are expanded in a single colony expansion assay to produce individual clones of edited stem cells.
In some embodiments, edited clones are further analyzed by digital droplet PCR (ddPCR) to identify clones that have an inserted tag sequence and that do not have stable genomic incorporation of the plasmid backbone. In some embodiments, the clones are further analyzed to determine the copy number of the inserted tag sequence. In some embodiments, identified clones have monoallelic or biallelic insertion of the tag sequence.
In further embodiments, the modified cells are assessed for the functional expression of the one or more detectable tags. For example, live cell imaging may be used to observe localization, expression intensity, and persistence of expression of the tagged endogenous protein in the modified stem cells described herein. In some embodiments, the expression of one or more detectable tags does not substantially or does not significantly alter the endogenous expression, localization, or function of the tagged protein. In some embodiments, the precise insertion of the tag sequence is analyzed by sequencing the edited target locus or a portion thereof. In some embodiments, the junctions between the endogenous genomic sequence and the 5′ and 3′ ends of the tag sequence are amplified. The amplification products derived from the population of edited cells are sequenced and compared with sequences of the corresponding target locus derived from a population of non-edited cells. In some embodiments, potential off-target sites for the crRNA sequences are determined using algorithms known in the art (e.g., Cas-OFF finder). To determine the presence of off-target cutting or insertions, these predicted off-target sites and the surrounding genomic sequences can be amplified and sequenced to determine the presence of any mutations or inserted tag sequences. Sequencing can be performed by a number of methods known in the art, e.g., Sanger sequencing and Next-generation, high-throughput sequencing. In some embodiments, the edited populations of cells can be assessed for the expression of transcription factors, cell surface markers, and other proteins or genes associated with stem cells (e.g. Oct 3/4, Sox2, Nanog, Tra-160, Tra-181, and SSEA3). Protein expression can be determined by a number of means known in the art including flow cytometry, ELISA, Western blots, immunohistochemistry, or co-immunoprecipication. Gene expression can be determined by qPCR, microarray, and/or sequencing techniques (e.g., NGS, RNA-Seq, or CHIP-Seq). In some embodiments the edited populations of cells can be assessed for the presence of the CRISPR/Cas9 ribonucleoprotein (RNP) complex and/or the donor polynucleotide. In some embodiments, the edited stem cells are determined to be pluripotent according to the methods outlined above may be cryopreserved for later differentiation or use.
In some embodiments, the invention provides for methods of live-cell imaging in three dimensions using the stably tagged stem cell clones and methods described herein. In some embodiments, the present invention provides methods of assaying the differentiation potential of the edited stem cells and stably tagged clones thereof described herein. Such assays typically involve culturing edited stem cells or stably tagged clones thereof in media comprising one or more factors required for differentiation. Factors required for differentiation are referred to herein as “differentiation agents” and will vary according to the desired differentiated cell type. In some embodiments, the ability of the edited stem cells or stably tagged clones thereof described herein to differentiate into specialized cells is substantially similar to the ability of un-modified stem cells to differentiate into specialized cells. For example, in some embodiments, the edited stem cells and/or stably tagged clones thereof described herein are able to differentiate into substantially the same number of different types of specialized cells, differentiate at substantially the same rate (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more days to differentiated), and produce differentiated cells that are as viable and as function as un-modified stem cells.
In some embodiments, the methods of assaying the differentiation potential of the edited stem cells and stably tagged clones thereof described herein includes the addition of one or more test agents to a culture of edited stem cells or stably tagged clones thereof prior to, during, or after the addition of one or more differentiation agents. The edited stem cells or stably tagged clones thereof can then be visualized for changes in cellular morphology associated with the individual structural proteins tagged within each edited stem cells or stably tagged clones thereof. In some embodiments, these methods may be used to identify agents that promote differentiation into one or more cell lineages and therefore may be useful as differentiation agents. In some embodiments, these methods may be used to identify agents that disrupt or inhibit differentiation. In some embodiments, the stably tagged stem cells may be differentiated into any cell type, including but not limited to hematopoietic cells, neurons, astrocytes, dendritic cells, hepatocytes, cardiomyocytes, kidney cells, smooth muscle cells, skeletal muscle cells, epithelial cells, or endothelial cells.
C. Screening Assays with Stably-Tagged Stem Cells and Cells Derived Therefrom
In some aspects, the present invention provides methods for drug screening to identify candidate therapeutic agents, and methods of screening agents to determine the effects of agents on the stably-tagged stem cell clones described herein and cells derived therefrom produced by the methods of the present invention. The methods may be employed to identify an agent having a desired effect on the cells. The stably-tagged stems cells of the present invention enable changes across multiple cell types to be assayed with the built in control of the cell types all being derived from the same progenitor clone.
In some embodiments, methods are provided for determining the effect of agents including small molecules, proteins, nucleic acids, lipids or even physical or mechanical stress (i.e. UV light, temperature shifts, mechanical sheer, etc.) by culturing a population of the stably-tagged stem cell clones described herein and cells derived therefrom in the presence and absence of the test agent(s). In some embodiments, agents that disrupt, alter, or modulate various key cellular structures and processes, including but not limited to cell division, microtubule organization, actin dynamics, vesicle trafficking, cell signaling, DNA replication, calcium regulation, ion channel regulators, and/or statins are assayed by the present methods. In some embodiments, the agent exerts a biological effect on the cells, such as increased cell growth or differentiation, increased or reduced expression of one or more genes, or increased or reduced cell death or apoptosis, etc. In particular embodiments, the stably-tagged stem cell clones used to screen for agents having a particular effect comprise a tagged protein associated with the cellular structure, process or biological activity being examined, such as any of the combinations of genes and structures shown in tables 3 and 4. Exemplary agents are shown in
In a further embodiment, the method provides assaying the cells after the exposure period by any known method, including confocal microscopy in order to determine changes in the content, orientation or cellular composition of the tagged structural protein contained within the given cell population. In one embodiment, a comparison can be made between the treated cells and untreated controls. In a further embodiment, a positive control may also be utilized in such methods. In some embodiments, one or more positive control agents with known effects on targeted structures may be applied to differentiated cell cultures derived from stably tagged stem cell clones and imaged, for example by confocal microscopy. The data obtained from these positive control experiments may be used as a training set for data that would allow for the automated assaying of different cellular structures in different cell types based on machine learning.
In some embodiments, the data obtained from these experiments are used to generate a signature for a test agent. In some embodiments, the method of generating a signature for a test agent comprises (a) admixing the test agent with one or more stably tagged stem cell clones; (b) detecting a response in the one or more stem cell clones; (c) detecting a response in a control stem cell; (d) detecting a difference in the response in the one or more stem cell clones from the control stem cell; and (e) generating a data set of the difference in the response. In some embodiment the detected response in the stem cell clones and/or control cells is one or more of cell proliferation, microtubule organization, actin dynamics, vesicle trafficking, cell-surface protein expression, DNA replication, cytokine or chemokine production, changes in gene expression, and/or cell migration. In some embodiments, the control cell is a stably tagged stem cell clone that has not been exposed to the test agent or a control agent (e.g., a vehicle control). In some embodiments, the control cell is a stably tagged stem cell clone that has been exposed a control agent (e.g., a vehicle control). In some embodiments, these methods are used to determine the toxicity of a test agent and/or to determine the optimal dose of a test agent required to induce or inhibit a particular cell function or cell response. In such embodiments, the difference in the response in the one or more stem cell clones from the control stem cell are quantified and used to generate a data set of the difference in the response. This data-set can then be used as a training set for an algorithm to predict the effect of a related agent on a particular cellular function.
In some embodiments, stably tagged stem cell clones derived from diseased patients or stably tagged mutant stem cell clones can be differentiated into one or more differentiated cell types assayed by the methods described herein to generate a cell-type specific data-set related to a particular disease. In such embodiments, the cell proliferation, microtubule organization, actin dynamics, vesicle trafficking, cell-surface protein expression, DNA replication, cytokine or chemokine production, changes in gene expression, and/or cell migration of the differentiated cells can be determined at one or more time points during differentiation and maturation. Data sets derived from such assays can then be used as a training set for one or more disease-specific algorithms that can be applied to a cell sample derived from a patient to determine whether the patient has a disease, the stage of disease, and/or used to monitor the effects of a particular disease treatment. In some embodiments, the disease is selected from a disease characterized by aberrant cell growth, wound healing, inflammation, and/or neurodegeneration.
In some embodiments, methods are provided for live-cell imaging to observe intracellular protein localization, expression intensity, and persistence of expression in the modified stem cells or stably transfected stem cell clones described herein. In some embodiments, the expression of one or more detectable tags does not substantially or does not significantly alter the endogenous expression or localization of the tagged protein. In some embodiments, the invention provides for methods of live-cell imaging in three dimensions using the stably tagged stem cell clones and the cell culturing and plating and microscopy methods described herein.
In some embodiments, provided herein are kits comprising the stably tagged stem cell clones described herein. In some embodiments, the kits described herein comprise two or more stably tagged stem cell clones, wherein each stably tagged stem cell clone expresses a tagged endogenous protein. In some embodiments, each stably tagged stem cell clone express a different tagged endogenous protein. In some embodiments, the kits described herein comprise 2, 3, 4, 5, 6, 7, 8, 9, 10 or more stably tagged stem cell clones, wherein each stably tagged stem cell clone expresses a different tagged endogenous protein. In some embodiments, the kits described herein comprise two or more stably tagged stem cell clones, wherein each stably tagged stem cell clone expresses two or more tagged endogenous proteins. In some embodiments, the kits described herein comprise 2, 3, 4, 5, 6, 7, 8, 9, 10 or more stably tagged stem cell clones, wherein each stably tagged stem cell clone expresses two or more tagged endogenous proteins. In some embodiments, the kits described herein comprise two or more stably tagged stem cell clones, wherein each stably tagged stem cell clone expresses 2, 3, 4, 5, 6, 7, 8, 9, 10 or more tagged endogenous proteins. In some embodiments, the kits described herein comprise 2, 3, 4, 5, 6, 7, 8, 9, 10 or more stably tagged stem cell clones, wherein each stably tagged stem cell clone expresses 2, 3, 4, 5, 6, 7, 8, 9, 10 or more tagged endogenous proteins. In some embodiments, each stably tagged stem cell clone express a group of tagged endogenous proteins that are different from the tagged endogenous proteins expressed by another stem cell clone in the same composition. Exemplary endogenous proteins that can be tagged in these embodiments are shown in Tables 1 and 2, including but not limited to PXN, TUBA1B, LMNB1, ACTN1, TOMM20, DSP, SEC61B, FBL, ACTB, MYH10, VIM, TJP1 (also known as ZO-1), AAVS1, MAP1LC3B (also known as LC3), ST6GAL1, LAMP1, CETN2, SLC25A17 (also known as PMP34), RABSA, GJA1 (also known as connexin 43 (CX43)), MAPK1, ATP2A2, AKT1, CTNNB1, NPM1, HIST1H2BJ, CAGGS:HIST1H2BJ:2A:CAAX, PKD2, DMD, DES, SLC25A17 (also known as PMP34), SMC1A, NUP153, CTCF, CBX1, Oct4, Sox2, Nanog, TNNI1, ACTN2, TNNI3, MYL2, MYL7, TTN, SMAD, SMAD5, NKX2-5, MESP1, MIXL1, ISL1, CYP2E1, TF, HPX, ALB, PKD1, PKD2, KRT5, and LAMC2. In some embodiments, the kits also allow for building an entire “cell clinic” or reference set that comprises cell types from every major organ system, or those of interest, that allows for the interrogation of likely function of new genes and assaying of cellular toxicity.
In some embodiments, the present disclosure provides kits for assessing differentiation agents and/or the effect of compounds or drugs on the differentiation of stem cells. In some embodiments, the present disclosure provides a kit comprising one or more stably tagged stem cell clones expressing one or more tagged endogenous proteins. In some embodiments, the present disclosure provides a kit comprising a plurality of stably tagged stem cell clones expressing one or more tagged endogenous proteins. In some embodiments, the cells are provided as an array such that all cellular structures are tagged among a plurality of stably tagged stem cell clones.
In some embodiments, the kits described herein further comprise one or more agents known to elicit stem cell differentiation into one or more cell types. One of skill in the art would understand the appropriate media and agents for differentiation into various cell types. For example, a kit may include stably tagged stem cells and media containing Activin A for cardiomyocyte differentiation. Alternatively, a kit may include stably tagged stem cells and media containing factors described in Methods Mol Biol. 2014; 1210:131-41 or Biomed Rep. 2017 April; 6(4): 367-373 for hepatocyte differentiation. Alternatively, a kit may include stably tagged stem cells and media containing factors described in Methods Mol Biol. 2017, 1597:195-206 or Nat Commun. 2015 Oct 23; 6:8715 for renal cell differentiation. Alternatively, a kit may include stably tagged stem cells and media containing factors described in Mol Psychiatry. 2017 Apr. 18. doi: 10.1038/mp.2017.56 or Scientific Reports volume 7, Article number: 42367 (2017) for neuronal cell differentiation. Additional exemplary factors for producing differentiated cell types from human iPSCs are shown in
In some embodiments, provided herein are kits comprising one or more differentiated cell types derived from one or more stably tagged stem cell clones. As used herein “derived from,” for example, one or more stably tagged stem cell clones refers to cells that are differentiated, from the stably tagged stem cell clones. In some embodiments, cells that are derived from stably tagged stem cell clones are terminally differentiated cells that are direct progeny of the stably tagged stem cell clones. Therefore, the differentiated cell types, like their stably tagged stem cell clone progenitors also express tagged (e.g. with a detectable marker, such as, for example, GFP and the like) structural or non-structural proteins. In one embodiment, the kits provided herein comprise one or more differentiated cell types. In some embodiments, kits provided herein contain differentiated cell types from all three germ layers. In some embodiments, kits are provided containing differentiated cells of substantially all major cell types of the body derived from stably tagged stem cell clones. In some embodiments, the kits are provided on multi-well plates in assay ready format. In some embodiments, the cells are provided in a form that requires thawing, culturing and/or expanding the cells. In some embodiments, the differentiated cells derived from stably tagged stem cells are provided in an array such that for each cell type member in the array, a tagged protein member is provided such that every structure being studied is tagged in each cell type being assayed.
The following examples are for the purpose of illustrating various exemplary embodiments of the invention and are not meant to limit the scope of the present invention in any fashion. Alterations, modifications, and other changes to the described embodiments which are encompassed within the spirit of the invention as defined by the scope of the claims are specifically contemplated.
The CRISPR/Cas9 system was used to introduce a GFP tag into the genomic loci of various proteins by HDR-mediated incorporation. Exemplary proteins tagged by the methods described herein are shown in Tables 1 and 2 above. Experiments were designed to introduce GFP at the N- or C-terminus along with a short linker using a CRISPR/Cas9 RNP and a donor plasmid encoding the full length GFP protein (
crRNA Design
Custom synthetic crRNAs and their corresponding tracrRNAs were ordered from either IDT or Dharmacon.
(a) exon: inside exon or within 50 bp of exon;
(b) genic: in intron (but >50 bp from an exon) or within 200 bp of an annotated gene;
(c) non-genic: >200 bp from an annotated gene.
When possible, crRNAs targeting Cas9 to within 50 bp of the intended GFP integration site were used, with a strong preference for any crRNAs with binding sites within 1 Obp. A subset of CRISPR/Cas9 alternative binding sites identified by Cas-OFFinder were selected for sequencing and
Only crRNAs unique within the human genome were used with one unavoidable exception (TOMM20, where the locus sequence restricted crRNA choice), and crRNAs whose alternative binding sites include mismatches in the “seed” region and are in non-genic regions were prioritized whenever possible. Table 5 below shows exemplary polynucleotide sequences of the crRNA sequences.
Donor plasmids were designed for each target locus and contained design features specific to each target and a GFP-encoding nucleic acid sequence (See e.g.,
(a) 5′ paxillin homology arm (SEQ ID NO: 6)—linker (SEQ ID NO: 278)—EGFP—3′ paxillin homology arm (SEQ ID NO: 21);
(b) 5′ SEC61B homology arm (SEQ ID NO: 7)—mEGFP—linker (SEQ ID NO: 279)—3′ SEC61B homology arm (SEQ ID NO: 22);
(c) 5′ TOMM20 homology arm (SEQ ID NO: 9)—linker (SEQ ID NO: 281)—mEGFP—3′ TOMM20 homology arm (SEQ ID NO: 24);
(d) 5′ TUBA1B homology arm (SEQ ID NO: 10)—mEGFP—linker (SEQ ID NO: 282)—3′ TUBA1B homology arm (SEQ ID NO: 25);
(e) 5′ LMNB1 homology arm (SEQ ID NO: 4)—mEGFP—linker (SEQ ID NO: 276)—3′ LMNB1 homology arm (SEQ ID NO: 19);
(f) 5′ FBL homology arm (SEQ ID NO: 3)—linker (SEQ ID NO: 275)—mEGFP—3′ FBL homology arm (SEQ ID NO: 18);
(g) 5′ ACTB homology arm (SEQ ID NO: 1)—mEGFP—linker (SEQ ID NO: 273)—3′ ACTB homology arm (SEQ ID NO: 16);
(h) 5′ DSP homology arm (SEQ ID NO: 2)—linker (SEQ ID NO: 274)—mEGFP—3′ DSP homology arm (SEQ ID NO: 17);
(i) 5′ TJP1 homology arm (SEQ ID NO: 8)—mEGFP—linker (SEQ ID NO: 280)—3′ TJPI homology arm (SEQ ID NO: 23); and
(j) 5′ MYH10 homology arm (SEQ ID NO: 5)—mEGFP—linker (SEQ ID NO: 277)—3′ MYH10 homology arm (SEQ ID NO: 20).
5′ homology arm sequences are shown in underlined text, linker sequences are shown in italic text, tag sequences are shown in regular text, and 3′ homology arm sequences are shown in bold text. Additional plasmid insert sequences are provided in SEQ ID NOs: 31-84.
CCTCTGCCTGCTGAGTTCCAGTGATTCTCCCGCCTCAGCCTCCCAAGTAGCTGAGATTACA
GGCACACGCCCCCATGCCTGGCTAATTTTTTGTATTTTCAGTAGAGACGGGGTTTCACCAT
GTTGGCCAAGCTGGTCTTGAACTCCTGACCTCAGGTGATCCGCCTCCCTTGGCCTCCCAAA
GTGCTGGGATTACAAGTGTGAGCCACCTCACCCGGCCCCTCTCAGAGCCTTTTCTACCTAT
ATGTGATGTGAATCTCCAATGAGAATCTAGGAGGCAGAGTTTGACTACAGACCAGTGTCAC
ACCTGTGTTTCGGGGAACACTGTTACAGCCACCTGGCTAAGTGCTCAGGAGTCAGAGCTGT
GTATGAATCCAGGCTGTGACCTCAGTAGCTGCATGACCCTGGGCAAGTTACTTCACCTGTG
TGCTTCAGTTGCCTCCCCTGTTGGGAGAACTAAATAATCCCAGCCCTGTGGGAGGCGGAGG
TGGGAGGATTGCAGGAGGCCACATTTGACCAGCATGGGCAATATAGTGAGACCCCCATCTC
TACAAAAAAAATTTATTTAATAAAATAAAAATGAAAAATGAGCGTTTAGGACAACAGGGCA
CATGGGAAACGCCTAGCAAGTAGGAGGCACTCCGAGCGTGCCGACTAGGCCCACCGCGGCC
CCATCACACAGGGTGCAGCTCTAGCCCGAGGGGCAGCTCCCTGAGCCCCTCTCTCCGCCTG
GCAGGAATGCTTCACGCCATTCGTGAACGGCAGCTTCTTCGAGCACGACGGGCAGCCCTAC
TGTGAGGGGCACTACCACGAGCGGCGCGGCTCGCTGTGTTCTGGCTGCCAGAAGCCCATCA
CCGGCCGCTGCATCACCGCCATGGCCAAGAAGTTCCACCCCGAGCACTTCGTCTGTGCCTT
CTGCCTCAAGCAGCTCAACAAGGGCAACTTCAAGGAGCAGAACGACAAGCCTTACTGTCAG
AACTGCTTACTCAAGCTCTTCTGC
GGTACCAGCGGCGGAAGCGTGAGCAAGGGCGAGGAGC
CCCCAGCCAGCATCACCAACTGCGACTGTGACCTAGAGACTTCACCCGGGGGTGAAGGGGT
AAACCCGACTGAAACTGGAACCCTTGTCCTCCGCTGGTGCGGGATGGACAGAGGGCCGTGA
GGGGTCCCCCTGCTTGTCTTCACCCCTGCCAGAGCCTCTGGGCCCCCTCCTCCCTCCTGTA
GCTCTCCCTAGGCTGCCCACTCTCCATCCTCCCCAGGGGTAGAGGCTGGGGGCTCCACCCC
AGCCCATGTACGTCCCCACGAACTGGCCTGGCCAGCACCCCACACTGGAGCCATCTCTTCC
TCATATTTCAGCAGTGCAGCCGGGGGGCAGGGAAGGGCAGGCAGGGTCTGTTGGGGTCTCT
TTTTATCCTTATTCCTCCCCCGACCTAATTGTCTTTGTTCTGTGATTATTGGGGGACACCC
GGCTCCCTCCAGACAATGCCAGCATAAATCCATCCATCCAAAGGCAGAGAACCAAAGGGGC
CATGGAAGGTTCTCTGTGCTCCTCCTACCCTTCCAGTGCCCTAGGCCTGGCGACTGCCCCT
GCCTTTTAGACCCGCCCTCCCCTTTTATACCTGCTCTTGTTCTACTGAGAAAAGCCTCTCC
AGCAATAATGTTTTCTAGTCACTTCCTCCGTCTCCGTGACGGCGTGCCTGGACACTGTACC
GACTTTGATAGATTTCTACACTGAGGTTTGAATTCATATCGCCTGAGTTGCTTTTACTTCT
CTATACAAAATGATTTTGAAGAGATTTTAAAGACGTTCCCTTTTGTATTCTCTTCCTCATC
CACCGCCACTGGGCCTGTCACTGATGGTGGCTCTGGTGTGAAGTTTGCTTTGTACTGAGGG
TTGGGGTGGGGAAGCAATTTGTATTTTATTGTTTCTTAGCACAAGCAGGTGAACTGGGAGC
AGCTCTGTGACTCCCCCTCTTTCACTTCATAGCTCACCAGGACTGTTTTATAAACT
TTTAAATGGGCCCACACTAAAGTTAGAGAACCACAGGCTCGCTCACAACCCTGACTTCTCC
ATGTCAGTTCCGATCTTTGCGAACCGCAGACAGGGAAGGTCTTCTCTCAGGGGTCATGCCC
GCGGCCGCCCTCCACGGCGAGGTCCGCACTCGCGCAGCCGGCCCCGCGGCCGCCTCACCTG
GTCGCACACTACCACGTCGAACTCCTCGTCGGCGAGGAACAGCACGTAGAGCGCCAGGAAA
ACCATGCGCACGTAGGCGCAGACGGCGGCGCCGCGGCCGCCCCAGCCCAGGCCTCGCGGCA
GCCAGTCCCCGGCACAGCGCACCGGTAGCTCGCGGCTCTCGGCGAAACAGTGGCCCGGGTC
GTAGTGCGCTGTCCAGATCTTCACGCTACACCCGCGCGCCTGCAGCGCCAGCGCCGCGTCC
AACACCAGCCGCTCAGCGCCGCCCACGCCCAGGTCTGGGTGGAGGAACAGCACCGACGGCT
TGGGAACCGAGTCCCGTTCCCGGCCCTGCTCCTCCGCCATGGCCCTGGAGCCGCAACTGCA
CCCCGCACCCTGATGGGGGTCTTCTGCGCAAGCTCCGCGCTCGTAGCTCCCAGCTGGCCAC
TGCGGGCCGACCCCGCCCTGCCGTACGTGCGTCAGTTAGGCCACATCAGCGCAAATCTGTG
AGGGTCTAGTAACTGCCTGAGAAAATATCTTGTCTGACCCCGGTTATATTTTTCCTTCGGT
AGGGATTGGACTTTCTGAAGGACGTTGTGATCCAAAGGAAGGAGGCCGGAGGTCTCTACTT
CCCATACAGCAGGTAACTAAGTTGTCTGTAGCAGACTGTCTACAGGCATATCGTGAGACGA
CCCAGGCGTCCCTGGGGTCAGAGAGGACCTTGCCTGCAAGTCCGGGGGCGGGGCCTGAGTC
AGTCTCGCCAGCTGCCGGTCTTTCGGGGGCTCCGTAACTTTCTATCCGTCCGCGTCAGCGC
CTTGCCACACTCATCTCCAATATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGC
AGCCCTGACTCCTCCTGCTTTGCGCCGTGCTTTTCCTCTGTAGCTCCCTTGCTTCCCCCAG
CCTCGGGTGTGGGTGTCTAGGCCGGGGTTCTGGGGCAGGCCTGCCGCGCTCACCCGTCTGT
CTGCTTGTCTCCCTCTACAGCCTGGTCCGACCCCCAGTGGCACTAACGTGGGATCCTCAGG
GCGCTCTCCCAGCAAAGCAGTGGCCGCCCGGGCGGCGGGATCCACTGTCCGGCAGAGGTAA
GGAACCCTGCAGTTCGTTCGCTTCCAGACTCGGAGATAGGACCCAGAACCTCGCTGATTCT
GGGGTGGAGACCCTAGCATGTGAAGATTGACAAAGGCAAAATGAGCTTCTAGTGACGTGGC
CGTGGGAGTAGTTAAAGGCCTTTTGGGAGGAAGGCGACATTTTTTTTCTCGTTGCTCAGTT
TAGGGCACTACTCTTAAAAAAGGAAAGTTAACAAACTGGAATAGAGTCAGAGATAACTTTG
AGAAAACCGATGTCATTAAACTGGTGTCTCTGGACCTGAGGTTTGCACTCACATTTCCATC
TGGCGGCCCCATAAGCAATCTGTCCTACAGATAACTCGTCCTACACAAAACTTAGTCTCTT
TTCAGCTCAGCTCTCTCACTCTCAATTATATCTCCTTACTTCCATATGGCACTGTTGTACA
CTCATTTACTCAGAGCCAGAAACGTCAGCGTCATCTTGGATTTTTCTTATGCTCTTTCTCT
CTCTAGTCATATGCCAGACTTTAAACTCTGCTTGAAAGCTTTCTCATAAGCTCTTTCCTTT
TCCCTTTCTACTGCTTTGCATTTGCTACTTAACCCTTTTCTTCAGGCTGTTTGCTTTCCAG
TCCATCGTTCGCTCTGCTGTTACTCTTCTGCGTAGTTTCTGTTACTTGTTGCTGAAC
GATGATAGGAAGTATTTACAGAACTTTATAGTTAGTAACTGACTGGTTAATTTTTCAAACT
GATTTTTACTCAACTGAATTAGAAAAGGACTGGAAAGAAAGTAAAGATCCCAACGACTTGA
GGGAACAAGTTGGACAACCAAGGACTTTGTCTAAATTGTTTTTATTTAGACTAATGTGGTT
CTAGTTCTAGAGGATTCATACTGGAATCATCGGTTTAATATTACGCTATTTGAAAGGCAGC
ATAGTATAGTACTTTTGGAAAATTGGCCTGAGGGTGATGTCTTTTGGAATATTTGTGAATT
CACTAGGAAGCCTAATTCCTTAAAAATGACCTCCTTCACTCAATTATCAGTGTTCTTGGTT
TGCCTGGGAGTGAAAAGAGATCTTAAAATCTTTTTGGTTTTAGTTACATAATTGACTGATG
TAATATTATGTAATGATGGCTGTACACAGTGTCTCATGCCCTATAATCCTAGCACTCATTT
GAGCTCAGGAGTTCAAGGCCAGCCTGGGCAACATGGTAAAACCCTGTCTGCACCAAAAATA
AAAAAAAAATTAGCCGGGCATGGTGGCATGAGTCTGTGGTTCCAGCTACTCAGGAGGCTGA
GGTGGGGGAGGATCGCTTGAGCCCAGGAGTCAGAGGTTGCAGTGAGCTGAGATTGTGCTAC
TGCACTCCAGCCTGGGTTACAGAGACCCCATCTGAATTAAAAACATATATAATGTAATGAT
CTGCCTCCTTTGTTAACTTGACTTTTGAAATGGGATTGTCAGTAGTATGATCATTGTTTTC
TTGGATGCCGACTGTGTGTAAAGTGTTACATTTTGAATTAAATGTCAGAATGGGTGAACTT
TACTAAGATTCAATTCTTTGAATACAAAGAGCATTTTATTTTGAAGTTAGAATACTAATTA
AATGCTTATGACACTTTAAAAAATTATTTTTTTTTTCTTTCAGAGAATTGTAAGTGCACAG
TCCTTGGCTGAAGATGATGTGGAA
GGCGGTAGCGGGGATCCACCGGTCGCCACCGTGAGCA
TAAAATCTCAGTTAAAAATATTTTAAAAATTCTTGGTAGTTGAGCAGCTCTGGGGTAATAA
GGGCAAATATGCTTGTTATGAACTACACTGAAATCTACCAAAGTTAATGTTTACTTTGTGT
AGATCCATTTGTCTATTTTATTTATTTTTCCCAGTGAAAAGTGTATTTTGATAGAGAACTT
TTCATTCTATAAATACACTATGAGTTACTAAAATATCATGGATTTTGTTTATTCCTGAAAC
ATAGTTACATAGTTAAACTGTACATATGACATGGCTTATGTTAAAAATACCCAGTGCTCAG
TTTTGAAAGATAGGCAAAAAAAAAAAAGTATAGGAGAAACTGAAGAATGTACACTTTTTTA
GAGGGCACATTTTGCTGTAAATCTGGAAATTTGATAGACTTGACTGTGTTTGTGAAAACTG
AGCATTAAAGGTTTTGATTGATCCTTTCTTTCCATTTAATCTCTGAGACGTAAATATGTGA
GGTGTGCTGCTGTGCTGGGTTAACAGCTTCCTTCCCTTTCTGTGTAGCAGTCTTGAAATGT
TCTGTTTAAATCAGTAGGCTTAATGTGTTCTGGGTATTTATCTCCTTGTATTTTAAATATA
TGTAGTTGCAAATAGCACCAGGAATTAGATTTCTGTACACCCCTAATCTAGCCTTGTGAGC
TTCGCTAGTTAATGTGTGCTCACTTTCCCTCCATTTGTTACGTGAGAGAATGCGTCTGCTG
ATCACTGAAGTGTCCCTTTTAGCTTCTGATTCATTGGGTTCTGTTGGGCATCTTTAAATCC
ACCTTAACCTGAGGAATGTATGTGGGCAACCAGGCCCTGCATTTTTTTATATTCTGAATTT
TGCATGCTTGCCTGACTTAGTATTTCTGAATTGATGTTTTTTTTAATGGTATAACTATCTT
GATTTTCACTGAAATTATATGGTTCTGTCACTACTCTGTAAATTAATCCGAAACTTTTAAG
GT
GAGTGTTCTTTTTTTGATGAAAGCAATAAGAGGACTGCGGAAGAGCTCCCTGTCAATGTAC
CGCTCTACACCAGTGTATTACGACAGTTCGTACACAACAGTCTGTAGAGGCCACCTGTCTC
TCCCTGCTGCGTTAGGAATTCAGGGGAGCAGGTGGTGGCAGTAAGGGATTTTGAGGGAACG
GAAATCGGATCTTGACCCAGATCTGGGCCGCCGATAATCTCCTACTGCGCTCAGACTGCTG
TGGAGGTGTTAGGCTGAGCCCGATGCCGGCAGGCAAGGGAGGATGGGCGGCTTGGGCAGCG
CCTTTGCAGACGTGGCCATTTCGTGCCTCTGCAGCACCGCCGGGGGGCGCAAGAGCGCGCG
CCCGGAATTGCTCATTCATCCTGTGCCGCAGAGCCCCGCCCCTTGTCCCTGCGGACAGACA
TTTCTTCTGCGCTGGTCTGGCCACGTGCTTCCTGTGCTAGGAGCTGCCCGGAAATGTGACC
ACCTAGTCTAAAGTGGGCTTCTGGGGCCTGAGCGCTGGATGGATGCCCACCTTCCTGTCTT
GGTCCTCCAAAGGAGGAAGCTGTGACTGAGCTGTCTTGGTCTGGAAGGAGGCCTTCCCGGT
TTAGGATGGGAAGGTAACATTCATTAAAAGCAACGTAGACTATAGTGTAGCTGTTCTCAAA
AGTAGTACATCTTAGAAAAGGATCTTTAGAAAAGATCGCTTTAGAAAAGGAAATTCGTTTT
CAGATTACGTGAGTAGCCTAGGTAACACAGCCAGACCTCATCTCCACAAAAAAAATGAAAA
AATTAGCCAGCTTGGTGGTCTGTGCCTGTGGTCCCAGCTGCTCCAGAGGCTGAGGTGGGGG
GATGACTGGAGCCTAGGCTGCAGTGAGCCTAGATGGCATCACTGCACTCAAGACTGGGCGA
CAGACCTTATCTCTAAAAAAATAAAGATTGCATGAGTATTTTGTTCCACTTGACAGTCATC
AATAGATTGGTTTAAATTGTGATATCTTTTTTACTTACCGCAGGTGAGCAAGGGCGAGGAG
GTTGGCCAGGCTGGTGTCCAGATTGGCAATGCCTGCTGGGAGCTCTACTGCCTGGAACACG
GCATCCAGCCCGATGGCCAGATGCCAAGTGACAAGACCATTGGGGGAGGAGATGACTCCTT
CAACACCTTCTTCAGTGAGACGGGCGCTGGCAAGCACGTGCCCCGGGCTGTGTTTGTAGAC
TTGGAACCCACAGTCATTGGTGAGTTGACCTCAGTAACCTGAGATCCCAGGATGCTGGGAC
AGGAGGTCTGTCCAGGGGCTTCTCTTGTCACTCACTCACTCCCTCCGTCCTTCTCTCCCTC
CTCCAGATGAAGTTCGCACTGGCACCTACCGCCAGCTCTTCCACCCTGAGCAGCTCATCAC
AGGCAAGGAAGATGCTGCCAATAACTATGCCCGAGGGCACTACACCATTGGCAAGGAGATC
ATTGACCTTGTGTTGGACCGAATTCGCAAGCTGGTAAGCACCACATATAAATATGCATTTA
ATGTGGTGTGATAGTTCCAGTGCAAGTTGGGTGGAGTGACTGACATCATTCATTCTTTGGC
ACCTACCAAAATGTGGAATAGGCTGCTTGCTATATTAATTGGACTTCTAAATCAGATAGTC
CCTAGGTTATGGACAGTTTGTGGATATGTCTGTTTTGCCAATTCCTTGTGCTTACATCAGT
GAGATATGGTTCGTAATCTAAAAAGTTGAAATAGAAATTCTAAGATAATGTGTCCTGGCAT
TAAAATATTACATTTTTTTATTCCCCTACAGGCTGACCAGTGCACCGGTCTTCAGGGCTTC
TTGGTTTTCCACAGCTTTGGTGGGGGAACTGGTTCTGGGTTCACCTCCCTGCTCATGGAAC
GTCTCTCAGTTGATTATGGCAAGAAGTCCAAGCTGGAGTTCTCCATTTACCCAGCACCCCA
GGTTTCCACAGCTGTAGTTGAGCCCTACAACTCCATCCTCACCACCCACACCACCCTGGAG
CACTC
TAAAGGCTGGTACTTGGAACCTGCAAGCCGTGCATTTGGAACCTCGGACTCAAGTGCCTAT
TACGTAATTCCACAGCGTCCCGGCCTCCAGGCCGTTTCCCGAGCCCTCCAGCGGAGCGGGG
GATAAGGTTACCACGCCCGCGGTGGCCGGGGACACTCTGAGTTTCGCGTGTGGCTTTTAGG
GACGTTTATATTTGAATTTCCCTGAACCGCCGAGTGTGGGCGGTGGCGCAGATCCGTCCCG
GAAACCTCCGGGCTCCTTCCCGCCTTTCTCAGGCCCGGCCCCTCCAAGGGGTCCCCGCGGG
GCGGCGGGAGGGCCCTGGGCCCAGAGCCGCGCGGGTGGGCAGTCCCAGGCGTCCTTCCTTA
CAGCCCTGAGCCTGGTCCGGGAACCGCCCAGCCGGGAGGGCCGAGCTGACGGTTGCCCAAG
GGCCAGATTTTAAATTTACAGGCCCGGCCCCCGAACCGCCGAAGCGCGCTGCCTGCTCCCC
ATTGGCCCATGGTAGTCACGTGGAGGCGCCGGGGCGTGCCGGCCATGTTGGGGAGTGCGGC
GCCGCGGCCCGCGCCACCTCCGCCCCCCGCGGCTTGCCTCCAGCCCGCCCCTCCCGGCCCT
CCTCCCCCCGCCCGCCGCTCCGTGCAGCCTGAGAGGAAACAAAGTGCTGCGAGCAGGAGAC
GGCGGCGGCGCGAACCCTGCTGGGCCTCCAGTCACCCTCGTCTTGCATTTTCCCGCGTGCG
TGTGTGAGTGGGTGTGTGTGTTTTCTTACAAAGGGTATTTCGCGATCGATCGATTGATTCG
TAGTTCCCCCCCGCGCGCCTTTGCCCTTTGTGCTGTAATCGAGCTCCCGCCATCCCAGGTG
CTTCTCCGTTCCTCTAAACGCCAGCGTCTGGACGTGAGCGCAGGTCGCCGGTTTGTGCCTT
CGGTCCCCGCTTCGCCCCCTGCCGTCCCCTCCTTATCACGGTCCCGCTCGCGGCCTCGCCG
CCCCGCTGTCTCCGCCGCCCGCCATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGT
GCGGATGGGCAGCCGCGCTGGCGGCCCCACCACGCCGCTGAGCCCCACGCGCCTGTCGCGG
CTCCAGGAGAAGGAGGAGCTGCGCGAGCTCAATGACCGGCTGGCGGTGTACATCGACAAGG
TGCGCAGCCTGGAGACGGAGAACAGCGCGCTGCAGCTGCAGGTGACGGAGCGCGAGGAGGT
GCGCGGCCGTGAGCTCACCGGCCTCAAGGCGCTCTACGAGACCGAGCTGGCCGACGCGCGA
CGCGCGCTCGACGACACGGCCCGCGAGCGCGCCAAGCTGCAGATCGAGCTGGGCAAGTGCA
AGGCGGAACACGACCAGCTGCTCCTCAAGTGAGTGCTAGCTGGCGGCCGCGTTAGCGCCAA
GGAGGGGCGGGGGCGCAACCGCGGCGACCAGCTCACCGGGTTCTGCCGTGGGGAGGGAGCA
GAGGCCAGGATGCACGCGTCCTTCTGAAGGAACAGGGTCTCGGTCTCCGGAAAGGAGAAAG
AATCTAGAGTTCATAGCGGAGCAGGGGTCGCGGAGGGGGCTCGAGCTGTAGCGCTGGGGGG
CCGTGATGCCCATTTCTAGATTTTGGATACCCGCTGGGACGTGGTAAGTGCGCGCCTGGGA
CTGCCGAGAAGGAGCTCCCGCTTTCGCACTCGAATCCGGGGAGCCGGCGCGGAGAGGCGGC
CCCTCAGGCCCCAGGTGCGGGGAGCTGGAGCGCGAGCGCGCGCTCGCGTGCGCGCCCCAGT
TTCCGGCCGGCGCGAGACAAAGCGTCTAGCGGATTTGCAGTGCCGGGATGGGCGGCCGGGG
AGGACTGGCAGCCCGCCTCTAGAATGAATGAGCTTCGCGCGGGCAGAGAGAGGAAGGGGAG
GGACCTTCCCGCAGCATCCGCGTCTCCTGGGGGTGGGTCCCGCTTTGGCGCGCTCAGTCTT
GGCCCTGTGACGTTTTGCGAAGATTCTACGCCTGCTTTAGGCGGGAGAGAGAGGCGGAG
TTTATTTTTATTTTTATTTTTTTGAGATGGAGTCTTACTCTGTCACCAGGTTGGAGTGCAG
TGGTACGATCTCGGCTCACTGCAACCTCCAGTTCCTGGGTTCAAGCGATTCTCCTGCGTCA
GCTTCCCGAGTAGGTGGGACTACAGGTGTGCGCCACCACACCCGACTAATTTTTGTATTTT
TAGTAGAGATAGGGTTTCACCGTGTTGGCCAGGATGGTCTCAATCTCCTGATTTCGTGATT
GAGCCACCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCACGCCCAGCCTTA
GACTGGGTAATTTATAATGAATGGAAATTTATTTGGCTCCCAGTTCCAAAGGCTGGAAAGT
CCAAGATTGGAGGTCTGAATCTGGCGAGGGCCTTCTTGCTGTCATCCATTGGCAGAAGGGT
GAGAGCAAGATAGAAAGGGGGCATAATCATCCTTTTAATCAGCAACCCACTCTTGTGATAA
TAGCATTACTCTATTCAGGAAGGCAGAGGCCTCATGACCTGAATCATCTCTCGAAGGTCCC
ACCTCTCAACTCTTGCATTTAAGGGTTACGTTTCCAACACATGAACTTTGGGGGACACACT
AGAACCATAGCACTGAGTTTTACTTGAATTAATAATGAAAACATCTGGTTTAAAGAGCACA
CAAGAGAAAAACAGCCCAAAGCCCTGTTGTAGACATTAGTCCTTTCTCCTCTTTAGGCCAA
CTGCATTGACTCCACAGCCTCAGCCGAGGCCGTGTTTGCCTCCGAAGTGAAAAAGATGCAA
CAGGAGAACATGAAGCCGCAGGAGCAGTTGACCCTTGAGCCATATGAAAGAGACCATGCCG
TGGTCGTGGGAGTGTACAGGTGAGCAGGGGCCCAGCAATACACCAAGACAGACATCTCTGT
CCCTTGCACCCCGAGTGCCATGATCCTGGGGACCCTCCTTCATCACCTATCTTCCTCTCAC
AGGCCACCTCCTAAAGTGAAGAAC
AAGCCCAACAGCGCCGTGGACGGCACCGCCGGCCCCG
GCGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCCCATCCTGGTCGAGCTGGACGG
TGAGCGGATTGCGAGAGATGTGTGTTGATACTGTTGCACGTGTGTTTTTCTATTAAAAGAC
TCATCCGTCTCCCATGTCTGCTGCTCATTCCTCCCCTTGACCTGCTGACACAGGGAGCACG
CACCCTTGGTCAATTTTGCGGGGTTGGGTAAATTCTCACTCGGTCACAGAGCGCATGCTCC
GTTTCTAGCTGCCTTTGCGCAGCGGCAGCCTGGATTTCGGTTCTTGGGTGGGATTGGTAGC
TCGCTGCGCATGCGTGCAGGTAAGCGGCCATCTCGCGCAGGCGGAGTGTCAGTGTGGGTCA
CGTGAGGGGAGCGGAGAGGGAGGGATGGGGGCGGAGTCCAGGGCGTGGGGGGGCCGGTTTG
TTGTGGTCGCCATTTTGCTGGTTGCATTACTGGGTAATCGGGGCCCTGGCTTGCCGCGTCC
GCCGGATACCCTCAGCCAGTGGGCAGGTCTGAGCTCGGGCTCCCCGAGCAGTTTGAGTCCC
CTTGCCCGCTCCTTCAGGTAACGGCGCGGGGACGGGTGGGGCGGCAAGCGGTCGCAGGGAG
GTGGGCAGGACGGGATCCGCCCTGCTCCCGTCGCCGTGAGACTTAGCACGAGGCCAAGGGA
GGAGAGGAGGGGGGTGGCAGGCAGGTGCGGGCCCTGCCTGGCTATTCATAGTTGAATTCCT
GGAACCGGCCAAGCCCGAGGAAGCAGTTGCAGGAGGGAGGCTGGGAGGGGGTAGCCGGGCC
CCACTCCCGCCCTTTGTTTGGGCTCAGCTCCGCGGGCCGCTTCTTCGTCGCCTAGCAACAG
CTGCCCTAGGCTGTGATTGGCTGAGCTCTTGGCACCAGCGACCAATGGTACAGTTGTTGCC
ATGGCAGGTGCCGATTGCCAAGCTCAGTCGGGCCCCGCCTTCCGGTCTCAGCAGGCCCAGG
AGGGCCTCCTGGGTGGGGGGCGGGACGCCGGGTCCCTAGGGGCTGGTGGTCACTCAGGGTG
GGGCGTGTCGC
GCTCGAGCGGCCGCGGCGGCGCCCTATAAAACCCAGCGGCGCGACGCGCCACCACCGCCGA
GACCGCGTCCGCCCCGCGAGCACAGAGCCTCGCCTTTGCCGATCCGCCGCCCGTCCACACC
CGCCGCCAGGTAAGCCCGGCCAGCCGACCGGGGCAGGCGGCTCACGGCCCGGCCGCAGGCG
GCCGCGGCCCCTTCGCCCGTGCAGAGCCGCCGTCTGGGCCGCAGCGGGGGGCGCATGGGGG
GGGAACCGGACCGCCGTGGGGGGCGCGGGAGAAGCCCCTGGGCCTCCGGAGATGGGGGACA
CCCCACGCCAGTTCGGAGGCGCGAGGCCGCGCTCGGGAGGCGCGCTCCGGGGGTGCCGCTC
TCGGGGCGGGGGCAACCGGCGGGGTCTTTGTCTGAGCCGGGCTCTTGCCAATGGGGATCGC
AGGGTGGGCGCGGCGGAGCCCCCGCCAGGCCCGGTGGGGGCTGGGGCGCCATTGCGCGTGC
GCGCTGGTCCTTGGGGCGCTAACTGCGTGCGCGCTGGGAATTGGCGCTAATTGCGCGTGCG
CGCTGGGACTCAAGGCGCTAACTGCGCGTGCGTTCTGGGGCCCGGGGTGCCGCGGCCTGGG
CTGGGGCGAAGGCGGGCTCGGCCGGAAGGGGTGGGGTCGCCGCGGCTCCCGGGCGCTTGCG
CGCACTTCCTGCCCGAGCCGCTGGCCGCCCGAGGGTGTGGCCGCTGCGTGCGCGCGCGCCG
ACCCGGCGCTGTTTGAACCGGGCGGAGGCGGGGCTGGCGCCCGGTTGGGAGGGGGTTGGGG
CCTGGCTTCCTGCCGCGCGCCGCGGGGACGCCTCCGACCAGTGTTTGCCTTTTATGGTAAT
AACGCGGCCGGCCCGGCTTCCTTTGTCCCCAATCTGGGCGCGCGCCGGCGCCCCCTGGCGG
CCTAAGGACTCGGCGCGCCGGAAGTGGCCAGGGCGGGGGCGACCTCGGCTCACAGCGCGCC
CGGCTATTCTCGCAACTGACAATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGC
GCATGTGCAAGGCCGGCTTCGCGGGCGACGATGCCCCCCGGGCCGTCTTCCCCTCCATCGT
GGGGCGCCCCAGGCACCAGGTAGGGGAGCTGGCTGGGTGGGGCAGCCCCGGGAGCGGGCGG
GAGGCAAGGGCGCTTTCTCTGCACAGGAGCCTCCCGGTTTCCGGGGTGGGGGCTGCGCCCG
TGCTCAGGGCTTCTTGTCCTTTCCTTCCCAGGGCGTGATGGTGGGCATGGGTCAGAAGGAT
TCCTATGTGGGCGACGAGGCCCAGAGCAAGAGAGGCATCCTCACCCTGAAGTACCCCATCG
AGCACGGCATCGTCACCAACTGGGACGACATGGAGAAAATCTGGCACCACACCTTCTACAA
TGAGCTGCGTGTGGCTCCCGAGGAGCACCCCGTGCTGCTGACCGAGGCCCCCCTGAACCCC
AAGGCCAACCGCGAGAAGATGACCCAGGTGAGTGGCCCGCTACCTCTTCTGGTGGCCGCCT
CCCTCCTTCCTGGCCTCCCGGAGCTGCGCCCTTTCTCACTGGTTCTCTCTTCTGCCGTTTT
CCGTAGGACTCTCTTCTCTGACCTGAGTCTCCTTTGGAACTCTGCAGGTTCTATTTGCTTT
TTCCCAGATGAGCTCTTTTTCTGGTGTTTGTCTCTCTGACTAGGTGTCTAAGACAGTGTTG
TGGGTGTAGGTACTAACACTGGCTCGTGTGACAAGGCCATGAGGCTGGTGTAAAGCGGCCT
TGGAGTGTGTATTAAGTAGGCGCACAGTAGGTCTGAACAGACTCCCCATCCCAAGACCCCA
GCACACTTAGCCGTGTTCTTTGCACTTTCTGCATGTCCCCCGTCTGGCCTGGCTGTCCCCA
GTGGCTTCCCCAGTGTGACATGGTGTATCTCTGCCTTACAGATCATGTTTGAGACCTTCAA
CACCCCAGCCATGTACGTTGCTATCCAGGCTGTGCTATCCCTGTA
TGTTGACAGGAAGTTCTTTGATCAGTACCGATCCGGCAGCCTCAGCCTCACTCAATTTGCT
GACATGATCTCCTTGAAAAATGGTGTCGGCACCAGCAGCAGCATGGGCAGTGGTGTCAGCG
ATGATGTTTTTAGCAGCTCCCGACATGAATCAGTAAGTAAGATTTCCACCATATCCAGCGT
CAGGAATTTAACCATAAGGAGCAGCTCTTTTTCAGACACCCTGGAAGAATCGAGCCCCATT
GCAGCCATCTTTGACACAGAAAACCTGGAGAAAATCTCCATTACAGAAGGTATAGAGCGGG
GCATCGTTGACAGCATCACGGGTCAGAGGCTTCTGGAGGCTCAGGCCTGCACAGGTGGCAT
CATCCACCCAACCACGGGCCAGAAGCTGTCACTTCAGGACGCAGTCTCCCAGGGTGTGATT
GACCAAGACATGGCCACCAGGCTGAAGCCTGCTCAGAAAGCCTTCATAGGCTTCGAGGGTG
TGAAGGGAAAGAAGAAGATGTCAGCAGCAGAGGCAGTGAAAGAAAAATGGCTCCCGTATGA
GGCTGGCCAGCGCTTCCTGGAGTTCCAGTACCTCACGGGAGGTCTTGTTGACCCGGAAGTG
CATGGGAGGATAAGCACCGAAGAAGCCATCCGGAAGGGGTTCATAGATGGCCGCGCCGCAC
AGAGGCTGCAAGACACCAGCAGCTATGCCAAAATCCTGACCTGCCCCAAAACCAAATTAAA
AATATCCTATAAGGATGCCATAAATCGCTCCATGGTAGAAGATATCACTGGGCTGCGCCTT
CTGGAAGCCGCCTCCGTGTCGTCCAAGGGCTTACCCAGCCCTTACAACATGTCTTCGGCTC
CGGGCTCCCGCTCCGGCTCCCGCTCGGGATCTCGCTCCGGATCTCGCTCCGGGTCCCGCAG
TGGGTCCCGGAGAGGAAGCTTTGACGCCACAGGGAATTCTTCCTACTCTTATTCATACTCA
TTTAGCAGTAGTAGCATTGGGCAC
CACGACCCCCCCGTTGCTACGGTGAGCAAGGGCGAGG
TGACTTCATTTATATGAATTTCCACTTTATTAAATAATAGAAAAGAAAATCCCGGTGCTTG
CAGTAGAGTGATAGGACATTCTATGCTTACAGAAAATATAGCCATGATTGAAATCAAATAG
TAAAGGCTGTTCTGGCTTTTTATCTTCTTAGCTCATCTTAAATAAGCAGTACACTTGGATG
CAGTGCGTCTGAAGTGCTAATCAGTTGTAACAATAGCACAAATCGAACTTAGGATTTGTTT
CTTCTCTTCTGTGTTTCGATTTTTGATCAATTCTTTAATTTTGGAAGCCTATAATACAGTT
TTCTATTCTTGGAGATAAAAATTAAATGGATCACTGATATTTTAGTCATTCTGCTTCTCAT
CTAAATATTTCCATATTCTGTATTAGGAGAAAATTACCCTCCCAGCACCAGCCCCCCTCTC
AAACCCCCAACCCAAAACCAAGCATTTTGGAATGAGTCTCCTTTAGTTTCAGAGTGTGGAT
TGTATAACCCATATACTCTTCGATGTACTTGTTTGGTTTGGTATTAATTTGACTGTGCATG
ACAGCGGCAATCTTTTCTTTGGTCAAAGTTTTCTGTTTATTTTGCTTGTCATATTCGATGT
ACTTTAAGGTGTCTTTATGAAGTTTGCTATTCTGGCAATAAACTTTTAGACTTTTGAAGTG
TTTGTGTTTTAATTTAATATGTTTATAAGCATGTATAAACATTTAGCATATTTTTATCATA
GGTCTAAAAATATTTGTTTACTAAATACCTGTGAAGAAATACCATTAAAAAACTATTTGGT
TCTGAATTCTTACTAGAAGGTGGTCTTTTGAAGTTAGTCCTTTCGGTACTTCTCAGATGCC
TGTCATGTACCCGATGGAGTCCTTGGAAAGAAGGCCTGTGTAAAGAGGCCAGCCTGGAGGT
CAATAACCTGTTCTAGTTTATTCTGGACATTGAGTACCAAGTAGCATTGGCAAA
AGCCTTGGCAGTCGGCGCCGGTGAACGAGAGCAACGCTTCTGACCCTGCCGGAGCTCCTCG
GAGATGAAAGCCATGACGCGCCTTGCAGAAAATGCATTCCGCCTTCCGTGGGAACAACGCC
GAGGCACGCGGTGACAGCCGTGACCATGCTGTTTGCCCAGTGAAGGAAACAACTGTCGGGT
ATCGGCTCTGCCGGCCTTTCCAGCCGCACTCATGCATGGGGCTCACCCCATGATGTGCGTG
GCTTGTCGAGGAGCAAGTGGACAAGTCTCTTAAGGAAAGCTTTGGTGCACAGGCGCTTTCT
CCTTGGGGGCGAATTCTGCCAGACCTTGGATAAAAACAAACAGGAAGACTCGCACGGCAGC
GGAAACTGTCTTCCAAGTTACTTGGGTTACCCGGCTTTTCCTTCCGCGCTTGGGGTCGGGA
CCCCGGCCGCTCGTCCCGCCCCCTCCCCCGCCGCGGCCCCGCCCCCTCCCCGCCTCGCCTC
GCCTCGCCTCGTCCAGCCCCGCCCCCGCCGGGCCGGGCATGCTCAGTGGGCCGGGCCGGCA
GGTTTGCGTGGCCGCTGAGTTGCCGGCGCCGGCTGAGCCAGCGGACGCCGCGTTCCTTGGC
GGCCGCCGGTTCCCGGGAAGTTACGTGGCGAAGCCGGCTTCCGAGGAGACGCCGGGAGGCC
ACGGGTGCTGCTGACGGGCGGGCGACCGGGCGAGGCCGACGTGGCCGGGCTGCGAAAGCTG
CGGGAGGCCGAGTGGGTGGCCGCGCTCGGAGGGAGGTGCCGGTCGGGCGCGCCCCGTGGAG
AAGACCCGGGCGGGGCGGGCGCTTCCCGGACTTTTGTCCGAGTTGAATTCCCTCCCCCTGG
GCCGGGCCCTTCCGGCCGCCCCCGCCCGTGCCCCGCTCGCTCTCGGGAGATGTTTATTTGG
GCTGTGGCGTGAGGAGCGGGCGGGCCAGCGCCGCGGAGTTTCGGGTCCGAGGAGTTGCGCG
CGGCGCTGGAGAGAGACAAGATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGCC
AAAGGTGAGAACCTCCGCGGCCGCCAGGGCCAGACCGGGCCGACCGTCGCCGCCCGCCCAC
CGGCATCTGGCCCGCGTCCCGCCCTCCCTCGCTGGCGGCTGTCTGGGCCCCGGGGCGGCGG
GGTGGGCAGGGCTGGCGCGGGGCCGCGGGCCGCGGGCCGCGGGCCGCGGGGAGCCCCTCGG
GCGGGGGCGGCGCGGGCCGCACTGGGGGCGGCCGGGGAGGGGGCTGCGGGCGCCCGGCCGC
CGTACTGGGCAGGTGCATAGCTGCCGGCGCCTGTGCCTGGCTGCGGCTCGCTGAGGGCGGG
GACACGCAACAGGTCCCTCGCGGAGAAACTCGGCTCCAGTGAGGGTTCGGGGGCTGGAAGC
CGGCTCTCAGCGGGTCGGGGCTTGGGGTGCCACCTCCTGCTGGCCGGGAGCTGCTGTCTTT
GGAGGAGTGGTTGGTCCCCGGCGAAACCCTGTAGTTTCGATCTGATGTCACTCCCTGCGGT
ATGCGCACGCCAGCGATAAGGCTTTGAGACTGCAAAACACTCCACTCAGCCTGTGAGGCGT
AGTAGGTCGGGTTTTCTTTCATGCTGTATTACTTATTTAAGGTAACTTTGAAAATAACCTC
TTTAACATTTAATAATTTAACTTGAATTAAACTTTCACAAGTAATACAAAGTATTCCTACG
AATGGACAATAAGATGAGCACTTAAAAATTAGTAAAGGCCGGTGAGTTCAGCCGAAAAAAG
TAACGTTTTTCCTGTTACTTTTCCTATGTGCTCTGAAATATTATTGCATTTTCCCATTGCT
TTGAAACTAACTTGTGTATTACATTAAAAAGCCAAAGTTCCTGAAAAACAGCTAGGATGCT
CCTCCCATTTTGTATATTAATTTTTTCATCATAAAATAGTACTTGTTATTTCAAACAAAGG
AATACAGAAATGTGAGGAGTAAAAAATCTCCCCTTTAAAGAATATCAATTCATTACTTCAA
ATAGT
AAGTGATGGAGCTTCCTCTGCTAGCCCTTTGTGAGCCAATGGTAAATGGGTGCTAAATAAA
ACAACTAGGTCTTGAGATACATTAATTGTAAATGTCACAGAACCAGTACTTTCCTCAATGT
GGCTAAGATAGTTGATGGTTCCTTTTTCTTCTGCACTGGTCAGACCATATCTGGGCTATGA
TGTTTGCTTCTGGGCCACACACTTTAGAGGGAAGACAAGCAGCATGTTGGAGTCTGTTTAG
GCGAGAGATCCGGCAGGGGAAGGAGTCTTGGTGAATGAGGGGTGGAGGAGCTGCAGGATGG
GAATAGAGGCCTGAACTGCTACCATGACATATTCAAAAGGCTGCCGTGTGAAGCCAAGTTA
TGCTTGTCTTTTGTGGTCCCAGTTGATCACATTAAGACCTCATGGGGCCATAGAAAAGCTA
GAGGGAGACTGATTTGGGTTATTCATAAGAAGAACTTTAAGTCTGTTATCTGAGGGTAGAA
TGAGAGGCATGTTTTAGATTCTTTAGATTCTTTACTCTTCTGACAATCATGTGTTTTGGTA
GCTGTTTCCTTGTGGTCATATTAATTCTGGTACCACTTCATGAACCTTTTATTACCCATCT
TTGTTTTCTTTTTTTTTTTCCCTTCTTAACTCCCTGTTTAATTTGTGGTGAGGGTGAAAGA
GGAGATAAAGAAAAAAAAGGGTCAACTTGTAACTTTGCCTTTTCTTTTCTTTTCTTTTCTT
TTTTTTGCCCTCAGTAACTGAGGGCAAACCCATCAGACAACCAGAGCCATAATTTGTGGTC
ACCGCTGAAATTTACCTTGGAAACTCGGTTAGTATGGCTGTGAAGAGGTATACCCCAGCTC
CTTAACACAGAGTTAATGCTTAATCTAAGGTTTTAAGTTTCTTAGAAAAGAAAAACGTGTA
CATTCTTTTGTTTCTTAAACATCTAATTTTGCCCTCCTCCTCTTCTCTTAGAGGCAATTGC
TTTTGGATCGTTCCATTTACAATGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTGC
ATCCAGAGAGGTACCTGTTCGTCGATCGTGCTGTCATCTACAACCCTGCCACTCAAGCTGA
TTGGACAGCTAAAAAGCTAGTGTGGATTCCATCAGAACGCCATGGTTTTGAGGCAGCTAGT
ATCAAAGAAGAACGGGGAGATGAAGTTATGGTGGAGTTGGCAGAGAATGGAAAGAAAGCAA
TGGTCAACAAAGATGATATTCAGAAGATGAACCCACCTAAGTTTTCCAAGGTGGAGGATAT
GGCAGAATTGACATGCTTGAATGAAGCTTCCGTTTTACATAATCTGAAGGATCGCTACTAT
TCAGGACTAATCTATGTAAGTATTTCTTCCAAATAATCATGTGAAGTGGTAGCTAGGAATT
AATGTAAATTATACATCTTGTCATAATCAAATGAGAATGTGGAATACCCAAACTCTCTGTT
TAACATTTCTATTTCTCTTTAAGATAGAAAGATTTGTTGCTTGCTTACCCATGTCTTGCTT
TTCTTTGAATCTTAACACATTAAGTTTAAATAATACAGGCTGCAATTACATATAATAAAAT
GGCATTTGAAGACTTTTGTAGTGGTCTTCTGGAGCATAATAAGGTGGGAGAGAGCATGTAA
CAGGAAGACCAGAAGGTTTAATAAGGTAAAGAGAGTTGCATTAATTGGACGCAGACAGCAA
AACGGTCAAAAATCAAGTGCATACCCAAGAGTAAAGTGGAGGGGCTGTAAGCTGAGAAATT
TCTGTGGACAGCATGAACAGCTTCACTGGATGTAGTAGGGAAGTAGGAAAGATGAATGCTG
AGGTTTTTAAGAGGAAACAATTAGGGTAAGATTGAGGCTGGCTGGGGTCGTCCTGTGGTTA
GCAGCTGACATGAATGTTGGAGTCACCGACTTTGTCACTGACCATGTAGAAGAAGTTATTG
AAAATCATAAGGGATAATGTAGAGAGGGATAATGTAGAGAGGAAAAATGTAAGCCAGATAC
TA
Wild type (WT) S. pyogenes Cas9 (spCas9) protein was purchased from UC Berkeley QB3 Macrolab and was pre-complexed in vitro with synthetic CRISPR RNA (crRNA) and trans-activating crRNA (tracrRNA) duplexes to generate a CRISPR/Cas9 ribonucleoprotein (crRNP). Briefly, the crRNA and tracrRNA oligonucleotides were reconstituted to 100 μM in TE at pH 7.5 (catalog #11-01-02-02, IDT). The crRNA and tracrRNA oligonucleotides were then combined in a sterile PCR at a final concentration of 40 μM in Duplex Buffer (100 mM potassium acetate; 30 mM HEPES, pH 7.5). Using a thermocycler or heat block, the crRNA and tracrRNA mixture was heated to 95° C. for 5 min to generate a crRNA:tracrRNA duplex. After heating, the crRNA:tracrRNA duplex was allowed to cool at room temperature for a minimum of two hours, after which the crRNA:tracrRNA duplex was kept on ice. crRNA:tracrRNA duplexes were then diluted to a working concentration of 10 μM in TE. All dilutions and stocks were kept on ice throughout the protocol. Alternatively, the crRNA:tracrRNA duplexes were stored at −20° C. for later use.
spCas9 was stored at −80C and was thawed on ice or at 4° C. until no ice pellet was visible, approximately 2-5 min. spCas9 was then diluted to a working concentration of 10 μM in TE in preparation for use. Alternatively, working concentrations of Cas9 protein were stored at −20° C. for up to 2 weeks and multiple freeze-thaw cycles were avoided (<3 freeze-thaw cycles recommended).
crRNPs were generated by combining the solution of crRNA:tracrRNA duplexes and Cas9 protein in a 1.5 mL eppendorf tube and gently pipetting up and down three times. A separate crRNP was generated for each reaction to be performed. crRNPs were incubated a room temperature for a minimum of 10 minutes and no longer than 1 hour prior to the addition of the complexes to cells.
WTC iPSCs were cultured according to described methods. Briefly, WTC11 iPSCs were cultured in a feeder free system on tissue culture plates or dishes coated with pheno red-free GFR Matrigel (Corning) diluted 1:30 in DMEM/F12 (Gibco) in mTeSR1 media (StemCell Technologies) supplemented with 1% (v/v) Penicillin-streptomycin (P/S) (Gibco). Cells were not allowed to reach confluency greater than 85% and were passaged every 3-4 days by dissociation into single-cell suspension using StemPro® Accutase® (Gibco). When in single cell suspension, cells were counted using a Vi-CELL® Series Cell Viability Analyzer (Beckman Coulter). After splitting, cells were re-plated in mTeSR1 supplemented with 1% P/S and 10 μM ROCK inhibitor (RI) (Stemolecule Y-27632, Stemgent) for 24 h. Expired media was replenished with fresh mTeSR1 media supplemented with 1% P/S daily at 37° C. and 5% CO2.
Prior to transfection, mTeSR1 media (400 mL basal media with provided 100 mL 5× supplement (catalog #05850, Stem Cell Technologies) with added 5 mL (1% v/v) Penicillin/Streptomycin (catalog #15140-122, Gibco) was prepared and sterile filtered with a 0.22 μm filter prior to use. mTeSR1 media was brought to room temperature on the bench top, and was not warmed in a 37° C. water bath. mTeSR1+ROCK inhibitor (Ri) media was prepared by adding 10 mM Ri to mTeSR1 media at a 1:1000 dilution. Accutase was warmed in a 37° C. water bath. Previously prepared Matrigel-coated vessels (stored at 4′C) were brought to room temperature. 6-well plates were prepared by aspirating and discarding any excess Matrigel liquid, and adding 4 mL of RT mTeSR1+Ri media to each well. Plates with media were kept in an incubator at 37° C. and 5% CO2 until ready to plate cells after the transfection procedure.
Cells were aliquoted in mTeSR+Ri into separate 1.5 mL eppendorf tubes. Cells were pelleted by centrifuging in a micro-centrifuge at 211×g for 3 min at room temperature. Various delivery methods including CrisprMax, GeneJuice, Amaxa and Neon were evaluated before concluding that Neon electroporation resulted in favorable co-introduction of protein, RNA, and plasmid into hiPSCs as measured by transfection of a control reporter plasmid and T7 assays as a read out for Cas9 activity (data not shown). Supernatant was aspirated and discarded and cells were resuspended in Buffer R from the Neon Transfection Kit. 8×105 cells were resuspended in 100 μL Neon Buffer R with 2 μg donor plasmid, 2 μg Cas9 protein duplexed with a crRNA:tracrRNA at a 1:1 molar ratio to Cas9, then electroporated with one pulse at 1300 V for 30 ms, and plated onto Matrigel-coated 6-well dishes with mTeSR1 media supplemented with 1% P/S and 10 μM RI. Transfected cells were cultured as previously described for 3-4 days until the transfected culture had recovered to ˜70% confluent. Transfected cells were incubated for at least 24 hours before changing the media to mTeSR1 without Ri. Successfully transfected cells were identified and harvested by FACS sorting for use in downstream applications after reaching a healthy confluency and maturity (approximately 3-4 days) (
Fluorescence-activated cell sorting (FACS) was used to enrich the population of gene edited cells after transfection and to evaluate rates of HDR (
In all gene targeting experiments, flow-based selection resulted in the recovery and enrichment of GFP-tagged clones, even when HDR was inefficient (<1%). For example, weak GFP signal was observed in some experiments where the target gene transcript was relatively scarce (such as PXN) or where the protein is known to localize to small foci in cells corresponding to cell junctions (DSP) or substrate adhesion sites (PXN). However, enriched populations of cells edited at these loci were able to be obtained, despite the low percentages of GFP+ cells after transfection (
After FACS enrichment, approximately >70% of the cells were GFP+ even after a period of recovery and scale up post sorting, indicating that flow cytometry is an efficient method for isolation of GFP+ cells. To ensure the knock-in of GFP to the targeted genomic locus resulted in appropriate localization of the resulting fusion proteins, the cells were analyzed by live fluorescence imaging prior to generating clonal lines. Each population displayed localization of the GFP signal to the anticipated cellular structure (
Clonal cell lines generated from these edited, enriched cell populations were then generated to identify and isolate precisely edited cells. Briefly, cells from the FACS-enriched population were seeded at a density of 104 cells in a 10 cm Matrigel-coated tissue culture plate. After 5-7 days clones were manually picked with a pipette and transferred into individual wells of 96-well Matrigel-coated tissue culture plates and expanded clonally. Greater than 90% of these clones survived colony picking. After 3-4 days, colonies were dispersed with Accutase and transferred into a fresh 96-well plate. After recovery, the plate was divided into plates for ongoing culture or freezing and gDNA isolation. When cells were 60-85% confluent they were dissociated and pelleted in 96-well V-bottom plates for cryopreservation. Cells were then resuspended in 60 μL mTeSR1 supplemented with 1% P/S and 10 μM RI. Two sister plates were frozen using 30 μL cell suspension per plate, added to 170 μL CryoStor® CS10 (StemCell Technologies) in non-Matrigel coated 96-well tissue culture plates. Plates were sealed with Parafilm and introduced to the −80C freezer in a room temperature Styrofoam box. Plates were stored long term at −80° C. for up to 8 weeks before thawing. Few clones (<5% across experiments) spontaneously differentiated after isolation, splitting, and freezing and a majority of clones were able to be scaled up for genetic and quality control experiments. A schematic of the overall selection and quality control process is shown in
Genetic screening analyses were performed in order to identify clones in which GFP tagging was performed precisely, without damage to endogenous untagged alleles (if present) and without permanent incorporation of the plasmid donor backbone into the genome. A genetic screening strategy was used to rapidly discriminate between precisely and imprecisely edited clones. Criteria for precise editing were as follows:
(a) Incorporation of the GFP tag in-frame with the targeted exon;
(b) The absence of random or on-target donor plasmid backbone integration; and
(c) No unintended mutations in either allele.
An overview of the genetic screening process is shown in
Digital Droplet PCR (ddPCR)
Because primers and probes for GFP, the donor plasmid backbone, and the RPP30 reference gene could be used to analyze all gene edits, a droplet digital PCR (ddPCR) assay was used to rapidly interrogate large sets of clones in parallel without having to optimize parameters specifically for each target gene, a significant advantage for our high throughput platform. During clonal expansion, a sample of cells was pelleted and total gDNA was extracted using the PureLink Pro 96 Genomic DNA Purification Kit (Life Technologies). ddPCR was performed using the Bio-Rad QX200 Droplet Reader, Droplet Generator, and QuantaSoft software.
Assays were designed to measure three DNA sequences common to each experiment: (1) the GFP tag sequence to measure tag incorporation; (2) the ampicillin or kanamycin resistance gene to assess stable integration of the plasmid backbone; and (3) a two-copy genomic reference locus (RPP30) to calculate genomic copy number. These sequences were used to identify clones with a GFP:RPP30 signature of ˜0.5 or ˜1.0, suggesting monoallelic or biallelic stable integration of the GFP sequence into the host cell genome. Clones with an elevated AmpR/KanR:RPP30 ddPCR signature (>0.1) suggested stable integration of the donor plasmid backbone and were rejected.
First, GFP-tagged clones lacking plasmid backbone integration were identified using ddPCR, with equivalently amplifying primer sets and probes corresponding both to the GFP tag and the donor plasmid backbone. The abundance of the GFP tag sequence was quantified (x-axis in
The copy number of a marker sequence in the donor plasmid (AMP or KAN resistance genes) in each clone (y-axis in
The reported final copy number of mEGFP per genome was calculated as the ratio of [(copies/μL mEGFP)—(copies/μL nonintegrated AMP)]/(copies/μL RPP30), where a ratio of 0.5 indicated monoallelic insertion (˜1 copy per genome) and a ratio of 1 indicated biallelic insertion (˜2 copies/genome). The AMP sequence was used to normalize mEGFP signal only when integration into the genome was ruled out during primary screening. For primary screening [(copies/μLmEGFP)/(copies/μLRPP30) was plotted against [(copies/μLAMP)/(copies/μLRPP30) in order to identify cohorts of clones for ongoing analysis.
Clones with a GFP copy number of ˜1.0 (monoallelic) or ˜2.0 (biallelic) and AMP/KAN<0.2 were putatively identified as correctly edited clones. Combining data across all successful editing experiments, 39% of clones were retained as candidates using this assay (
The screening strategy also identified several faulty outcomes in the editing and selection process including unedited clones co-purified during flow cytometry selection, and clones harboring plasmid backbone in the targeted locus and enabled selection of successfully edited clones. These results demonstrate that the addition of the ddPCR assay to the genetic screening process enabled selection of successfully edited clones and eliminated unsuccessful or off-target edits from downstream analyses.
Clones whose ddPCR signature indicated the stable presence of GFP in the genome (GFP:RPP30 values ˜0.5 or 1) and the absence plasmid backbone integration (AmpR/KanR:RPP30<0.1) were further analyzed by tiled-junctional PCR to determine the presence of the predicted tagged alleles and sequences.
Primer sequences used in each PCR reaction are shown in
The untagged allele (for monoallelic GFP-tagged clones) was amplified and sequenced to ensure that no mutations had been introduced via the NHEJ repair pathway at the binding site of the crRNA used for editing). 77% (n=177) of the clones analyzed from all experiments contained a wild type untagged allele (
Clones were frequently rejected due to stable integration of plasmid backbone sequence and these rejected clones were further analyzed. In many cases, clones were derived from FACS-enriched populations in which most cells displayed the correct anticipated subcellular GFP tag localization, but nevertheless harbored the GFP tag and donor plasmid backbone at equivalent copy number. It is possible that non-random HDR-mediated incorporation of both the tag and the donor plasmid backbone at the targeted locus result in this pattern. Such an outcome would result in a tagged protein, but also unintended insertions of exogenous sequence into the locus (Rouet et al., 1994; Hockemeyer et al., 2009). This possibility was evaluated by performing the tiled junctional PCR assay (
It is possible that these categories of clones harbor insertions and/or duplications derived from the donor cassette sequence delivered by HDR to non-coding regions flanking the GFP tag at the target locus. The prevalence of clones with this flawed editing outcome may underlie heterogeneity in the GFP signal intensity observed in some experiments. However, the ddPCR results largely correlated with the presence or absence of appropriate junctions (
The relative rates of putative clonal confirmation and rejection in this assay varied widely based both on the locus and the crRNA used (
Putatively confirmed clones were almost exclusively tagged at one allele, while clones with putative biallelic edits with no plasmid incorporation were rare (
Taken together, confirmation rates of 39% (GFP incorporation with no plasmid), 90% (correct junctions), and 77% (wild type untagged allele) were observed in each of the three screening steps across all gene targeting experiments (
The analyses described above resulted in the identification of a refined set of candidate clones, wherein both tagged and untagged alleles were validated for the correct sequence identity. These candidate clones were further validated in a number of lower throughput downstream assays.
To assess whether the clones that met the above gene editing criteria contained off-target mutations due to non-specific CRISPR/Cas9 activity, several final candidate clones from each experiment were analyzed for mutations at off-target sites predicted by Cas-OFFinder (
Western blot analysis was performed on lysates from each candidate clone in order to confirm that the observed shift in molecular weight of the tagged vs. untagged peptide was consistent with the known molecular weight of the linker and GFP tag (
Semi-quantitative imaging of the blot was also used to determine the relative abundance of protein products derived from each allele. In all cases, immunoblotting with antibodies against the endogenous protein yielded products consistent with both the anticipated molecular weight of the tagged and untagged peptides. Notably, the appropriate Tom20-GFP fusion protein product was obtained despite our inability to identify a precisely edited clone, suggesting that the additional plasmid backbone sequence did not disrupt the coding sequence of the TOMM20 gene. Antibodies used in these experiments are described in
The western blot data was used to quantify the abundance of the GFP-tagged protein copy relative to the total abundance of the targeted protein (
The observation that the tagged allele had reduced expression in some experiments coupled with the rarity of biallelic edits in these experiments raised the possibility that the tagged protein copy has reduced function. The tolerance of biallelic tagging (and thus whether the tagged protein has sufficient function) was tested by introducing a spectrally distinct red fluorescent protein tag (mTagRFP-T) into the unedited allele of two different tagged clonal cell lines, LMNB1-mEGFP and TUBA1B-mEGFP (
Putative biallelically edited cells were FACS-isolated, expanded, and imaged to confirm localization of both tags to the nuclear envelope in the enriched population (
In contrast to LMNB1, mTagRFP-T+/GFP+ cells were not able to be recovered after attempted editing of the TUBA1B-mEGFP clonal cell line with the TUBA1B-mTagRFP-T donor plasmid, nor were mTagRFP-T+/GFP+ cells able to be isolated when both donors were co-delivered to unedited cells, despite the prevalence of both mTagRFP-T+ and GFP+ cells as separate edited populations (
To assess the possibility of allele-specific loss of expression in clonally derived cultures due to perturbed function of the tagged protein copy this, two cultures of the four cell lines displaying unequal tagged/untagged protein copy abundance (and TUBA1B-mEGFP as a control) were maintained for different amounts of time. These two sets of cultures were then imaged. As shown in
Upon validating the expression and localization of the GFP-tagged protein in each of the genome-edited lines, experiments were performed to ensure that each expanded candidate clonal line retained stem cell properties comparable to the unedited WTC cells. Assays included morphology, growth rate, expression of pluripotency markers, and differentiation potential (
Assays were performed to ensure that the clones identified to have precise edits retained stem cell properties during the process of gene editing and expansion. As such, the expression of established stem cell markers, including the transcription factors Oct3/4, Sox2 and Nanog, cell surface pluripotency markers Tra-160 and Tra 181, and the pro-differentiation marker SSEA3 were measured by flow cytometry (
In all candidate clones tested, each nuclear marker was expressed well above the commonly used thresholds of >85%+ for stem cell markers and <15%+ for differentiation markers and comparable to the parental WTC line (
In vitro differentiation assays to confirm the pluripotency of the cell lines were performed. Directed germ layer differentiation was compared between unedited cells and the final selected edited clonal line representing each of the 10 targeted structures. Each cell line was differentiated for 5-7 days under defined conditions to mesoderm, endoderm, and ectoderm using differentiation media specific to each lineage. The cells were stained for early markers of germ layer differentiation (Brachyury, Sox 17, and Pax6) and analyzed by flow cytometry (
Additional experiments were performed to assess whether each clone could robustly differentiate into cardiomyocytes. Each edited clone's differentiation potential was assessed by directing it to a cardiomyocyte fate using established protocols using a combination of growth factors and small molecules (Lian et al., 2015; Palpant et al., 2015) and evaluated cultures for spontaneous beating (days 6-20) and cardiac Troponin T (cTnT) expression (days 20-25), in order to evaluate the robustness of cardiomyocyte differentiation. Briefly, cells were seeded onto Matrigel-coated 6-well tissue culture plates at a density ranging from 0.5-2×106 cells per well in mTeSR1 supplemented with 1% P/S, 10 μM RI, and 1 μM CHIR99021 (Cayman Checmical). The following day (designated day 0), directed cardiac differentiation was initiated by treating the cultures with 100 ng/mL ActivinA (R&D) in RPMI media (Invitrogen) containing 1:60 diluted GFR Matrigel (Corning), and insulin-free B27 supplement (Invitrogen). After 17 hours (day 1), cultures were treated with 10 ng/mL BMP4 (R&D systems) in RPMI media containing 1 μM CHIR99021 and insulin-free B27 supplement. At day 3, cultures were treated with 1 μM XAV 939 (ToCris) in RPMI media supplemented with insulin-free B27 supplement. On day 5, the media was replaced with RPMI media supplemented with insulin-free B27. From day 7 onto about day 20, media was replaced with RPMI media supplemented with B27 with insulin (Invitrogen). Cells were harvested using 0.5% Trypsin-EDTA (Gibco), filtered with a 40 μm cell strainer, fixed with CytoFix Fixation Buffer™, permeabilized with BD PermiWash™ buffer, stained with anti-Cardiac Troponin T AlexaFluor® 647 (BD Bioscience) or isotype control, acquired on a FACSAria Fusion and analyzed using FlowJo software V.10.2.
Clonal lines generally displayed successful cardiomyocyte differentiation, with cTnT expression and qualitative spontaneous contractility comparable to the parental WTC line (
This cardiomyocyte differentiation data combined with pluripotency marker expression and germ layer differentiation data, support the conclusion that fusing GFP with these endogenously expressed proteins via monoallelic tagging does not appear to disrupt pluripotency or differentiation potential of these edited hiPSC cells.
Additional experiments can be performed according to protocols known in the art (e.g., Methods Mol Biol. 2014; 1210:131-41; Biomed Rep. 2017 April; 6(4): 367-373; Methods Mol Biol. 2017; 1597:195-206; Nat Commun. 2015 Oct 23; 6:8715; Mol Psychiatry. 2017 Apr. 18. doi: 10.1038/mp.2017.56; Scientific Reports volume 7, Article number: 42367 (2017)) and illustrated in
Establishing clonal hiPSC lines and culturing them long term is known to carry the risk of fixing somatic mutations and/or chromosomal aneuploidies (Weissbein et al., 2014). The possibility exists that the additional stressors inherent to gene editing heighten this risk. To address this concern, karyotype analysis was performed on each candidate clone. Karyotype analysis was performed by Diagnostic Cytogenetics Inc. (DCI, Seattle Wash.). At minimum of 20 metaphase cells were analyzed per clone. Of the ˜50 candidate clones tested, only two instances where karyotypic abnormalities became fixed in the culture were detected (data not shown). In one instance, a single candidate clone from an experiment was rejected. Further, all clones identified as candidates from the experiment targeting ACTN1 displayed the same aneuploidy event, suggesting that it had become fixed early in the editing process. These data indicate that that aneuploidy occurs at a rate that is non-negligible, and that chromosomal abnormalities must be ruled out in each experiment. However, these data suggest that the rate of aneuploidy is permissively low for high-throughput editing using these methodologies.
Transcriptome-wide analysis of two final candidate clones from a number of experiments was performed. This analysis was performed to determine whether hiPSC clones maintained over 10-15 passages and harboring potentially disruptive tags on key cellular proteins demonstrated similar global gene expression patterns to the unedited reference line, or if they had alternatively evolved into globally distinct cell lines in a manner not distinguishable by the above described quality control assays (data not shown).
In order to further characterize global gene expression changes between each edited clone and the reference line, genes whose expression differs by greater than 2-fold are annotated and compared between experiments and expression from control cell lines. Cluster analysis on these data sets is also performed to determine the most statistically significant GO term categories among edited clones. RNA-seq analysis is also performed to confirm the absence of detectable mutations in expressed sequences due to potential off-target Cas9 activity in the final clones. These findings are further confirmed by next generation exome sequencing. Analysis for additional clones is also be performed.
Phenotypic Characterization of GFP-Tagged iPSC Lines
These results indicate that the stem cells and stably tagged stem cell clones and differentiated cells therefrom of the invention can be used for three-dimensional live cell imaging of intracellular proteins. In further embodiments, the methods allow for use of the cells for screening, observing cellular dysplasia, disease staging, monitoring disease progression or improvement or cellular stress in response to a test agent.
As a final characterization step, live imaging on preferred candidate clones was performed. Cells were maintained with phenol red free mTeSR1 media (STEMCELL Technologies) one day prior to live-cell imaging. Stably tagged stem cell clones can be imaged using spinning disk confocal microscopy. Cells were imaged using spinning disk confocal microscopy at low (10× or 20×) and high (100×) magnification. Microscopes were outfitted with a humidified environmental chamber to maintain cells at 37° C. with 5% CO2 during imaging. Healthy, undifferentiated WTC hiPSCs ranged from 5-20 μm in diameter and 10-20 μm in height and grew in tightly packed colonies (
For example, paxillin was observed in the matrix adhesions formed between substrate contact points and the basal surface of cells, as well as at the dynamic edges of colonies (
Sec61 beta localized to endoplasmic reticulum (
These observations are consistent with the epithelial nature of tightly packed undifferentiated WTC hiPSCs grown on 2D surfaces. All final candidate clones, spanning 10 editing experiments, exhibited predicted subcellular localization of their tagged proteins (
The impact of the tag on correct localization of the targeted protein compared to the localization of the native, unedited protein was also assessed. Edited clones were fixed alongside unedited cells and immunocytochemistry or phalloidin staining was performed. In all 10 experiments, no detectable differences in the pattern of antibody labeling between the unedited cells and the edited cell line were observed (
As endogenously GFP-tagged proteins in live imaging experiments generate more interpretable localization data than that produced in fixed and immunostained cells (Allen Institute for Cell Science, 2017), endogenous localization in edited lines was directly compared to cells transiently transfected with constructs expressing FP-fusion proteins (EGFP or mCherry) (
The collection of the gene-edited hiPS cells described herein was used to develop image-based drug-induced protein signatures. Experiments were conducted with 12 known reference compounds that disrupt various key cellular structures and processes including cell division, microtubule organization, actin dynamics, vesicle trafficking, cell signaling, DNA replication, calcium regulation, ion channel regulators, and statins. Agents used in these experiments are shown in
The pipeline was prototyped using a small suite of well-characterized compounds that include brefeldin A, paclitaxel, rapamycin, wortmannin and staurosporine (
These perturbations showed alterations roughly analogous to those seen in other cell types. For example, the microtubule stabilizing agent paclitaxel increased microtubule bundle thickness and altered the shape and position of the mitotic spindle during hiPS cell division. In addition, paclitaxel, also induced aberrant reorganization of the ER in cells undergoing mitosis, while showing minimal effects on the bulk organization of the actin bundles and cell junctions. Other drugs, such as staurosporine, a broad kinase inhibitor, had major effects on colony and cell morphology, inducing rearrangements in cell packing and shape. It also induced re-localization of desmosomes, indicating that the cell-cell junctions undergo substantial rearrangement.
Fluorescence quantification of the 3D images were used to analyze drug-induced Golgi reorganization, cytoskeleton reorganization, and cell junction reorganization. To quantify the relative abundance of each structure of interest (e.g. Golgi as presented above), the pixel intensities of the GFP channel (488 nm) were summed across the entire z-stack. For each experiment, the same threshold was used to exclude background intensity noise across the control (DMSO) and experimental (perturbation agent) groups. The data were plotted by averaging z-stack data from a time interval (30-minute) and compared to the control DMSO data. Dunnett's multiple comparison test was used to perform one-way ANOVA between the different time intervals against the control group.
As shown in
Relative protein abundance of actin and myosin were also quantified. As shown in
For drug-induced effects on cell junction reorganization, representative maximum intensity projections of a z-stack along the x-z axis are shown in
These data demonstrate that image-based 3D data sets of fluorescently tagged structures in human induced pluripotent stem cells (hiPSC), generated by a scalable and reproducible imaging pipeline, identifies signature profiles for a range of well-characterized small molecules and can be used to generate a predictive model of the dynamic organization and behavior of cells. These unique data can be used to train predictive models to identify the effects of perturbing target pathways, ascertain “off-target” effects and the mode of action of unknown compounds, and identify likely pathways influenced by mutations. By building complete combinations of image-based observations of many structures/lines in the presence of a large number of standardized biochemical perturbations, a comprehensive database of drug signatures on hiPS cells in their normal, pathological and regenerative (developmental) states can be generated.
To generate the predictive model, the resulting imaging data from each compound per stably tagged stem cell clone or differentiated cell derived therefrom, can be compared to the negative controls (untreated and vehicle controls) to determine effect on various criteria including cell and subcellular morphology, localization of tagged structure, and dynamics. By testing each compound in multiple gene edited iPSC lines (where each line has one structure tagged with GFP), the effect of that compound on multiple structures can be assessed within the cell. First, the intended effect of each compound with the relevant gene edited cell line can be confirmed as described in the assays above. The effect of that compound on all other structures can be assessed using the suite of gene edited iPSC lines to create a unique “fingerprint” or signature for that compound in relation to multiple structures. The data generated with these established set of compounds can be used as an initial training set for assays with compounds with unknown function. These profiles can serve as a reference database that can be used for screening novel and previously uncharacterized compound libraries to identify targets, help guide mechanistic studies, and determine specificity. Additionally, the combination of using human, diploid, non-transformed cells with live imaging using these gene edited iPSCs can provide a much better platform for performing toxicology screening. Further, these predictive models based on the stem cells and stably tagged stem cell clones and differentiated cells therefrom of the present invention can be used for screening, observing cellular dysplasia, disease staging, monitoring disease progression or improvement or cellular stress in response to a test agent.
From the foregoing, it will be appreciated that, although specific embodiments of the invention have been described herein for purposes of illustration, various modifications may be made without deviating from the spirit and scope of the invention. Accordingly, the invention is not limited except as by the appended claims.
This application claims priority to U.S. Provisional Application Nos. 62/457,088, filed Feb. 9, 2017; 62/519,045, filed Jun. 13, 2017; 62/546,237, filed Aug. 16, 2017; 62/552,185, filed Aug. 30, 2017; 62/556,115, filed Sep. 8, 2017; 62/570,081, filed Oct. 9, 2017; and 62/582,295, filed Nov. 6, 2017, the contents of which are each incorporated herein by reference in their entireties.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2018/017708 | 2/9/2018 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62457088 | Feb 2017 | US | |
62519045 | Jun 2017 | US | |
62546237 | Aug 2017 | US | |
62552185 | Aug 2017 | US | |
62556115 | Sep 2017 | US | |
62570081 | Oct 2017 | US | |
62582295 | Nov 2017 | US |