ENGINEERING CELLS FOR CELL-BASED THERAPIES, AND ASSOCIATED COMPOSITIONS AND METHODS

Abstract
Provided are methods for genetically modifying cells to knock out or modify one or more genes associated with blood type, e.g., ABO, FUT1, RHD. Also provided are cells and compositions derived therefrom, as well as methods of using the same to treat various human diseases.
Description
SUMMARY

An emerging cell therapy approach called adoptive cell transfer (ACT) is rapidly changing the scene of human disease treatment. ACT involves collecting cells from a patient (autologous) or healthy donors (allogeneic), engineering these cells (e.g., through gene, mRNA, or protein modifications), and transferring the cells into the patient to fight diseases. One of the complications of allogeneic therapies is that it requires blood type matching between a donor and a recipient. Blood types are determined by the presence or absence of certain antigens on the surface of red blood cells (RBCs) and many other cell types in the body. Since the presence of these antigens can trigger a recipient's immune system to attack the infused cells, safe and effective allogeneic therapies depend on blood type matching.


As of 2019, a total of 41 human blood group systems are recognized by the International Society of Blood Transfusion (ISBT). The two most commonly referenced blood group systems are ABO and Rh. Four major blood types are determined by the presence or absence of the A and B antigens (A, B, AB, and O). In addition to the A and B antigens, the Rh factor can be either present (+) or absent (−), creating the eight most common blood types (A+, A−, B+, B−, AB+, AB−, O+, and O−). Several of the blood type-determining antigens are controlled by a single gene.


The presence of the blood type-determining antigens is usually associated with the absence of antibodies against those antigens in the subject's plasma thereby preventing a potential agglutination reaction. Thus, for allogeneic therapies, blood type compatibility is important to avoid graft rejection and other negative immune reactions. Because type O− individuals do not have A, B, or Rh antigens on the surface of their cells, they are usually referred to as universal donors for any recipient having any blood type.


The present technology provides methods for genetically engineering cells to knock out, knock down or otherwise alter one or more genes associated with blood type, e.g., ABO, FUT1, RHD, to improve the efficacy and safety of allogeneic cell therapies. Also provided herein are cells and compositions derived therefrom, as well as methods of using the same to treat various human diseases.


In some aspects, methods are provided for genetically modifying one or more genes associated with blood type in a cell, the methods comprising introducing into the cell a site-directed nuclease or a nucleotide sequence encoding a site-directed nuclease, wherein the one or more genes associated with blood type are selected from the group consisting of ABO, FUT1, and RHD. In some embodiments, the methods further comprise introducing to the cell a guide RNA (gRNA) targeting the ABO, FUT1, or RHD locus.


In some aspects, gRNAs are provided for use in genetically modifying one or more genes associated with blood type in a cell, wherein the one or more genes associated with blood type is selected from the group consisting of ABO, FUT1, and RHD.


In some embodiments, the site-directed nuclease is selected from the group consisting of a zinc finger nuclease (ZFN), a transcription activator-like effector nuclease (TALEN), a meganuclease, a CRISPR-associated transposase, and a CRISPR/Cas nuclease.


In some embodiments, the site-directed nuclease is a CRISPR/Cas nuclease selected from the group consisting of Cas3, Cas4, Cas5, Cas8a, Cas8b, Cas8c, Cas9, Cas10, Cas12, Cas12a (Cpf1), Cas12b (C2c1), Cas12c (C2c3), Cas12d (CasY), Cas12e (CasX), Cas12f (C2c10), Cas12g, Cas12h, Cas12i, Cas12k (C2c5), Cas13, Cas13a (C2c2), Cas13b, Cas13c, Cas13d, C2c4, C2c8, C2c9, Cmr5, Cse1, Cse2, Csf1, Csm2, Csn2, Csx10, Csx11, Csy1, Csy2, Csy3, and Mad7.


In some embodiments, the gRNA comprises a CRISPR RNA (crRNA) and optionally a transactivating CRISPR RNA (tracrRNA). In some embodiments, the gRNA comprises a crRNA and a tracrRNA as two separate molecules. In some embodiments, the gRNA comprises a crRNA and a tracrRNA as a single guide RNA (sgRNA). In some embodiments, the sgRNA comprises a complementary region, a crRNA repeat region, a tetraloop, and a tracrRNA.


In some embodiments, the crRNA repeat region comprises, consists of, or consists essentially of a nucleotide sequence set forth in SEQ ID NO:5, SEQ ID NO:9, SEQ ID NO:13, or SEQ ID NO:18. In some embodiments, the tetraloop comprises, consists of, or consists essentially of a nucleotide sequence set forth in SEQ ID NO:6 or SEQ ID NO:17. In some embodiments, the tracrRNA comprises, consists of, or consists essentially of a nucleotide sequence set forth in SEQ ID NO:7, SEQ ID NO:11, SEQ ID NO:15, or SEQ ID NO:16.


In some embodiments, the crRNA comprises a complementary region specific to a region of the ABO locus, including, for example, a coding sequence (CDS), an exon, an intron, a sequence spanning a portion of an exon and a portion of an adjacent intron, or a regulatory region. In some embodiments, the complementary region comprises, consists of, or consists essentially of a nucleotide sequence complementary to a nucleotide sequence set forth in any of SEQ ID NOs: 20-203.


In some embodiments, the crRNA comprises a complementary region specific to a region of the FUT1 locus, including, for example, a CDS, an exon, an intron, a sequence spanning a portion of an exon and a portion of an adjacent intron, or a regulatory region. In some embodiments, the complementary region comprises, consists of, or consists essentially of a nucleotide sequence complementary to a nucleotide sequence set forth in any of SEQ ID NOs: 204-420.


In some embodiments, the crRNA comprises a complementary region specific to a region of the RHD locus, including, for example, a CDS, an exon, an intron, a sequence spanning a portion of an exon and a portion of an adjacent intron, or a regulatory region. In some embodiments, the complementary region comprises, consists of, or consists essentially of a nucleotide sequence complementary to a nucleotide sequence set forth in any of SEQ ID NOs: 421-580.


In some embodiments, the genetic modification occurs via non-homologous end-joining (NHEJ). In some embodiments, the genetic modification occurs via homology-directed repair (HDR). In some embodiments, the genetic modifications include both HDR- and NHEJ-induced modifications.


In some embodiments, provided are compositions comprising the gRNA according to various embodiments of the present technology. In some embodiments, the compositions further comprise a site-directed nuclease or a nucleotide sequence encoding a site-directed nuclease protein as described herein.


In some embodiments, the compositions comprising the gRNA according to various embodiments of the present technology are formulated for delivery into a cell. In some embodiments, the cell further comprises a site-directed nuclease or a nucleotide sequence encoding a site-directed nuclease protein as described herein.


In some aspects, provided are methods of identifying a new genomic locus for genetically modifying one or more genes associated with blood type in a cell, the methods comprising (a) locating a genomic locus based on a known gRNA; and (b) scanning a region of about 500 to 4000 bp on either side of the genomic locus for a PAM sequence, wherein the one or more genes associated with blood type is selected from the group consisting of ABO, FUT1, and RHD. In some embodiments, the known gRNA targets the ABO, FUT1, or RHD locus. In some embodiments, the gRNA comprises a complementary region comprising, consisting of, or consisting essentially of a nucleotide sequence complementary to a nucleotide sequence set forth in any of SEQ ID NOs: 20-580.


In some aspects, provided are cells having one or more genes associated with blood type genetically modified according to various embodiments of the present technology. In some embodiments, the cell is an autologous cell. In some embodiments, the cell is an allogeneic cell. In some embodiments, the cell is a pluripotent stem cell, an embryonic stem cell (ESC) or an induced pluripotent stem cell (iPSC). In some embodiments, the cell is differentiated from a pluripotent stem cell (e.g., an ESC or an iPSC). In some embodiments, the cell is the cell is a primary cell. In some embodiments, the cell is a blood cell, e.g., a red blood cell, a platelet cell, a mast cell, a basophil, an eosinophil, a neutrophil, a monocyte, a natural killer (NK) cell, a natural killer T (NKT) cell, a macrophage, a T cell, a B cell, or a plasma cell. In some embodiments, the cell is a T cell, an NK cell, or an NKT cell. In some embodiments, the cell is a cardiomyocyte. In some embodiments, the cell is a retinal pigment epithelial cell (RPE). In some embodiments, the cell is an endothelial cell. In some embodiments, the cell is a β islet cell. In some embodiments, the cell is a glial progenitor cell (GPC).


In some embodiments, the cell is modified to have reduced expression of one or more MHC I molecules and/or one or more MHC II molecules, optionally, wherein the one or more MHC I molecules are selected from the group consisting of HLA-A, HLA-B, HLA-C, and optionally, wherein the one or more MHC II molecules are selected from the group consisting of HLA-DR, HLA-DQ, HLA-DP, HLA-DM, and HLA-DO. In some embodiments, the modification is by modulation of the B2M, TAP1, CIITA, MIC-A, and/or MIC-B loci. In some embodiments, the modulation of the B2M, TAP1, CIITA, MIC-A, and/or MIC-B loci comprises B2M, TAP1, CIITA, MIC-A, and/or MIC-B knockout. In some embodiments, the modulation of the B2M, TAP1, CIITA, MIC-A, and/or MIC-B loci comprises knock-in of a transgene at the B2M, TAP1, CIITA, MIC-A, and/or MIC-B loci.


In some embodiments, the transgene encodes one or more tolerogenic factors selected from the group consisting of A20/TNFAIP3, CD16, CD16 Fc receptor, CD24, CD35, CD39, CD46, CD47, CD52, CD55, CD59, CD200, CCL22, CTLA4-Ig, C1 inhibitor, complement receptor (CR1), DUX4, FASL, H2-M3, IDO1, IL15-RF, HLA-C, HLA-E, HLA-E heavy chain, HLA-G, IL-10, IL-35, MANF, PD-1, PD-L1, SERPINB9, CCL21, and MFGE8. In some embodiments, the one or more tolerogenic factors comprise CD47, for example, human CD47. In some embodiments, the human CD47 comprises an amino acid sequence that is at least 80% identical to the amino acid sequence set forth in any one of SEQ ID NOs: 583-588. In some embodiments, the human CD47 comprises an amino acid sequence that is at least 80% identical to the amino acid sequence set forth in SEQ ID NO:584. In some embodiments, the one or more tolerogenic factors comprise HLA-E. In some embodiments, the one or more tolerogenic factors comprise CD24. In some embodiments, the one or more tolerogenic factors comprise PD-L1. In some embodiments, the one or more tolerogenic factors comprise CD24, CD47, and PD-L1. In some embodiments, the one or more tolerogenic factors comprise CD46. In some embodiments, the one or more tolerogenic factors comprise CD55. In some embodiments, the one or more tolerogenic factors comprise CD59. In some embodiments, the one or more tolerogenic factors comprise C1 inhibitor. In some embodiments, the one or more tolerogenic factors comprise CD46, CD55, CD59, and C1 inhibitor. In some embodiments, the one or more tolerogenic factors comprise HLA-E, CD24, CD47, PD-L1, CD46, CD55, CD59, and C1 inhibitor.


In some embodiments, the cell is modified to have reduced expression of one or more MHC I molecules and/or one or more MHC II molecules; increased expression of CD47, and optionally CD24 and PD-L1; and increased expression of CD46, CD55, CD59, and CR1.


In some embodiments, the cell is modified to have reduced expression of one or more MHC I molecules; reduced expression of TXNIP; increased expression of PD-L1 and HLA-E; and optionally increased expression of A20/TNFAIP3, and/or MANE.


In some embodiments, the cell is modified to have increased expression of CCL21, PD-L1, FASL, SERPINB9, HLA-G, CD47, CD200, and MFGE8.


In some aspects, provided are pharmaceutical compositions comprising cells having one or more genes associated with blood type genetically modified according to various embodiments of the present technology.


In some aspects, provided are methods of treating a disease in a subject in need thereof, the methods comprising administering the subject cells having one or more genes associated with blood type genetically modified according to various embodiments of the present technology, or pharmaceutical compositions comprising the same.


In some embodiments, the disease is cancer, e.g., a hematologic malignancy. In some embodiments, the hematologic malignancy is selected from the group consisting of myeloid neoplasm, myelodysplastic syndromes (MDS), myeloproliferative/myelodysplastic syndromes, acute lymphoid leukemia (ALL), chronic lymphocytic leukemia (CLL), acute myeloid leukemia (AML), chronic myelogenous leukemia (CML), B cell acute lymphoid leukemia (B-ALL), T cell acute lymphoid leukemia (T-ALL), T cell lymphoma, and B cell lymphoma.


In some embodiments, the disease is an autoimmune disease, e.g., lupus, systemic lupus erythematosus, rheumatoid arthritis, psoriasis, psoriatic arthritis, multiple sclerosis, Crohn's disease, ulcerative colitis, Addison's disease, Graves' disease, Sjögren's syndrome, Hashimoto's thyroiditis, and celiac disease.


In some embodiments, the disease is diabetes mellitus, e.g., Type I diabetes, Type II diabetes, prediabetes, and gestational diabetes.


In some embodiments, the disease is a neurological disease, e.g., catalepsy, epilepsy, encephalitis, meningitis, migraine, Huntington's, Alzheimer's, Parkinson's, Pelizaeus-Merzbacher disease, and multiple sclerosis.


In some embodiments, the disease is a cardiac disease, e.g., pediatric cardiomyopathy, age-related cardiomyopathy, dilated cardiomyopathy, hypertrophic cardiomyopathy, restrictive cardiomyopathy, chronic ischemic cardiomyopathy, peripartum cardiomyopathy, inflammatory cardiomyopathy, idiopathic cardiomyopathy, other cardiomyopathy, myocardial ischemic reperfusion injury, ventricular dysfunction, heart failure, congestive heart failure, coronary artery disease, end-stage heart disease, atherosclerosis, ischemia, hypertension, restenosis, angina pectoris, rheumatic heart, arterial inflammation, cardiovascular disease, myocardial infarction, myocardial ischemia, congestive heart failure, myocardial infarction, cardiac ischemia, cardiac injury, myocardial ischemia, vascular disease, acquired heart disease, congenital heart disease, atherosclerosis, coronary artery disease, dysfunctional conduction systems, dysfunctional coronary arteries, pulmonary hypertension, cardiac arrhythmias, muscular dystrophy, muscle mass abnormality, muscle degeneration, myocarditis, infective myocarditis, drug- or toxin-induced muscle abnormalities, hypersensitivity myocarditis, cardiomegaly, mitral insufficiency, and autoimmune endocarditis.







DETAILED DESCRIPTION

The present technology provides methods for engineering cells, including through genetic editing, to alter the expression of one or more genes associated with blood type, e.g., ABO, FUT1, and/or RHD. Also provided are site-directed nucleases and guide RNAs, as well as compositions and vectors thereof, for use in these methods. Moreover, the present technology provides genetically modified cells and cell populations generated using these gene editing methods as well as methods of using these cells and cell populations to treat various human diseases.


While the present disclosure is capable of being embodied in various forms, the description below of several embodiments is made with the understanding that the present disclosure is to be considered as an exemplification of the invention and is not intended to limit the invention to the specific embodiments illustrated. Headings are provided for convenience only and are not to be construed to limit the invention in any manner. Embodiments illustrated under any heading may be combined with embodiments illustrated under any other heading.


The use of numerical values in the various quantitative values specified in this application, unless expressly indicated otherwise, are stated as approximations as though the minimum and maximum values within the stated ranges were both preceded by the word “about.” It is to be understood, although not always explicitly stated, that all numerical designations are preceded by the term “about.” It is to be understood that such range format is used for convenience and brevity and should be understood flexibly to include numerical values explicitly specified as limits of a range, but also to include all individual numerical values or sub-ranges encompassed within that range as if each numerical value and sub-range is explicitly specified. For example, a ratio in the range of about 1 to about 200 should be understood to include the explicitly recited limits of about 1 and about 200, but also to include individual ratios, such as about 2, about 3, and about 4, and sub-ranges, such as about 10 to about 50, about 20 to about 100, and so forth. It also is to be understood, although not always explicitly stated, that the reagents described herein are merely exemplary and that equivalents of such are known in the art.


To the extent any materials incorporated by reference herein conflict with the present disclosure, the present disclosure controls.


Definitions

The term “about,” as used herein when referring to a measurable value, such as an amount or concentration and the like, is meant to encompass variations of 20%, 10%, 5%, 1%, 0.5%, or even 0.1% of the specified amount.


The term “antibody” is used to denote, in addition to natural antibodies, genetically engineered or otherwise modified forms of immunoglobulins or portions thereof, including chimeric antibodies, human antibodies, humanized antibodies, or synthetic antibodies. The antibodies may be monoclonal or polyclonal antibodies. In those embodiments wherein an antibody is an immunogenically active portion of an immunoglobulin molecule, the antibody may include, but is not limited to, a single chain variable fragment antibody (scFv), disulfide linked Fv, single domain antibody (sdAb), VHH antibody, antigen-binding fragment (Fab), Fab′, F(ab′)2 fragment, or diabody. An scFv antibody is derived from an antibody by linking the variable regions of the heavy (VH) and light (VL) chains of the immunoglobulin with a short linker peptide. Similarly, a disulfide linked Fv antibody can be generated by linking the VH and VL using an interdomain disulfide bond. On the other hand, sdAbs consist of only the variable region from either the heavy or light chain and usually are the smallest antigen-binding fragments of antibodies. A VHH antibody is the antigen binding fragment of heavy chain only. A diabody is a dimer of scFv fragment that consists of the VH and VL regions noncovalent connected by a small peptide linker or covalently linked to each other. The antibodies disclosed herein, including those that comprise an immunogenically active portion of an immunoglobulin molecule, retain the ability to bind a specific antigen.


The term “antigen” refers to an immunogenic molecule that provokes an immune response. This immune response may involve antibody production, activation of specific immunologically competent cells, or both. An antigen may be, for example, a peptide, glycopeptide, polypeptide, glycopolypeptide, polynucleotide, polysaccharide, lipid, or the like. It is readily apparent that an antigen can be synthesized, produced recombinantly, or derived from a biological sample. Exemplary biological samples that can contain one or more antigens include tissue samples, tumor samples, cells, biological fluids, or combinations thereof. Antigens can also be produced by cells that have been modified or genetically engineered to express an antigen.


The term “autoimmune disease,” “autoimmune disorder,” “inflammatory disease,” or “inflammatory disorder” refers to any disease or disorder in which the subject mounts an immune response against its own tissues and/or cells. Autoimmune disorders can affect almost every organ system in the subject (e.g., human), including, but not limited to, diseases of the nervous, gastrointestinal, and endocrine systems, as well as skin and other connective tissues, eyes, blood and blood vessels. Examples of autoimmune diseases include, but are not limited to Hashimoto's thyroiditis, systemic lupus erythematosus, Sjögren's syndrome, Graves' disease, scleroderma, rheumatoid arthritis, multiple sclerosis, myasthenia gravis and diabetes.


The term “codon-optimized” or “codon optimization” when referring to a nucleotide sequence is based on the discovery that the frequency of occurrence of synonymous codons (i.e., codons that code for the same amino acid) in coding nucleotide is biased in different species. Such codon degeneracy allows an identical polypeptide to be encoded by a variety of nucleotide sequences. Codon optimization refers to the process of substituting certain codons in a coding nucleotide sequence with synonymous codons based on the host cell's preference without changing the resulting polypeptide sequence. A variety of codon optimization methods are known in the art, and include, for example, methods disclosed in at least U.S. Pat. Nos. 5,786,464 and 6,114,148.


The term “construct” refers to any polynucleotide that contains a recombinant nucleic acid molecule. A construct may be present in a vector (e.g., a bacterial vector, a viral vector) or may be integrated into a genome. A “vector” is a nucleic acid molecule that is capable of introducing a specific nucleic acid sequence into a cell or into another nucleic acid sequence, or as a means of transporting another nucleic acid molecule. Vectors may be, for example, plasmids, cosmids, viruses, an RNA vector, or a linear or circular DNA or RNA molecule that may include chromosomal, non-chromosomal, semi-synthetic, or synthetic nucleic acid molecules. Exemplary vectors are those capable of autonomous replication (episomal vector), capable of delivering a polynucleotide to a cell genome (e.g., viral vector), or capable of expressing nucleic acid molecules to which they are linked (expression vectors).


The term “expression” refers to the process by which a polypeptide is produced based on the encoding sequence of a nucleic acid molecule, such as a gene. The process may include transcription, post-transcriptional control, post-transcriptional modification, translation, post-translational control, post-translational modification, or any combination thereof. An expressed nucleic acid molecule is typically operably linked to an expression control sequence (e.g., a promoter).


The term “hypoimmunogenicity,” “hypoimmunogeneic,” “hypoimmunogenic,” “hypoimmunity,” or “hypoimmune” is used interchangeably to describe a cell being less prone to immune rejection by a subject into which such cell is transplanted. For example, relative to an unaltered or unmodified wild-type cell, such a hypoimmunogenic cell may be about 2.5%, 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 97.5%, 99% or more less prone to immune rejection by a subject into which such cell is transplanted. In some examples described herein, genome editing technologies are used to modulate the expression of MHC I and MHC II genes, and thus, to generate a hypoimmunogenic cell. In other examples described herein, a tolerogenic factor is introduced into a cell and when expressed can modulate or affect the ability of the cell to be recognized by host immune system and thus confer hypoimmunogenicity. Hypoimmunogenicity of a cell can be determined by evaluating the cell's ability to elicit adaptive and innate immune responses. Such immune response can be measured using assays recognized by those skilled in the art, for example, by measuring the effect of a hypoimmunogenic cell on T cell proliferation, T cell activation, T cell killing, NK cell proliferation, NK cell activation, and macrophage activity. Hypoimmunogenic cells may undergo decreased killing by T cells and/or NK cells upon administration to a subject or show decreased macrophage engulfment compared to an unmodified or wildtype cell. In some cases, a hypoimmunogenic cell elicits a reduced or diminished immune response in a recipient subject compared to a corresponding unmodified wild-type cell. In some cases, a hypoimmunogenic cell is nonimmunogenic or fails to elicit an immune response in a recipient subject. Detailed descriptions of hypoimmunogenic cells, methods of producing the same, and methods of using the same are found in WO2016183041 filed May 9, 2015; WO2018132783 filed Jan. 14, 2018; WO2018176390 filed Mar. 20, 2018; WO2020018615 filed Jul. 17, 2019; WO2020018620 filed Jul. 17, 2019; WO2021022223 filed Jul. 31, 2020; WO2021022223 filed Jul. 31, 2020; WO2021041316 filed Aug. 24, 2020; WO2021222285 filed Apr. 27, 2021, 2020; and WO2021222285 filed Apr. 27, 2021, the disclosures including the examples, sequence listings and figures are incorporated herein by reference in their entirety.


The term “nucleic acid” or “polynucleotide” refers to a polymeric compound including covalently linked nucleotides comprising natural subunits (e.g., purine or pyrimidine bases). Purine bases include adenine and guanine, and pyrimidine bases include uracil, thymine, and cytosine. Nucleic acid molecules include polyribonucleic acid (RNA) and polydeoxyribonucleic acid (DNA), which includes cDNA, genomic DNA, and synthetic DNA, either of which may be single- or double-stranded. A nucleic acid molecule encoding an amino acid sequence includes all nucleotide sequences that encode the same amino acid sequence.


The term “subject” refers to a mammalian subject, preferably a human. A “subject in need thereof” may refer to a subject who has been diagnosed with a disease or is at an elevated risk of developing a disease. The phrases “subject” and “patient” are used interchangeably herein.


A “therapeutically effective amount” as used herein is an amount that produces a desired effect in a subject for treating a disease. In certain embodiments, the therapeutically effective amount is an amount that yields maximum therapeutic effect. In other embodiments, the therapeutically effective amount yields a therapeutic effect that is less than the maximum therapeutic effect. For example, a therapeutically effective amount may be an amount that produces a therapeutic effect while avoiding one or more side effects associated with a dosage that yields maximum therapeutic effect. A therapeutically effective amount for a particular composition will vary based on a variety of factors, including, but not limited, to the characteristics of the therapeutic composition (e.g., activity, pharmacokinetics, pharmacodynamics, and bioavailability), the physiological condition of the subject (e.g., age, body weight, sex, disease type and stage, medical history, general physical condition, responsiveness to a given dosage, and other present medications), the nature of any pharmaceutically acceptable carriers, excipients, and preservatives in the composition, and the route of administration. One skilled in the clinical and pharmacological arts will be able to determine a therapeutically effective amount through routine experimentation, namely by monitoring a subject's response to administration of the therapeutic composition and adjusting the dosage accordingly. For additional guidance, see, e.g., Remington: The Science and Practice of Pharmacy, 22nd Edition, Pharmaceutical Press, London, 2012, and Goodman & Gilman's The Pharmacological Basis of Therapeutics, 12th Edition, McGraw-Hill, New York, NY, 2011, the entire disclosures of which are incorporated by reference herein.


The term “tolerogenic factor” as used herein includes hypoimmunity factors, complement inhibitors, and other factors that modulate or affect the ability of a cell to be recognized by the immune system of a host or recipient subject upon administration, transplantation, or engraftment.


The terms “treat,” “treating,” and “treatment” as used herein with regard to a disease refers to alleviating one or more symptoms of the disease partially or entirely; preventing the disease; decreasing the likelihood of occurrence or recurrence of the disease; slowing the progression or development of the disease; eliminating, reducing, or slowing the development of one or more symptoms associated with the disease; or increasing progression-free or overall survival of the disease. For example, “treating” may refer to preventing or slowing the existing disease from progressing and/or slowing the development of certain symptoms of the disease. In some embodiments, the term “treat,” “treating,” or “treatment” means that the subject has a lesser degree of the disease comparing to a subject without being administered with the treatment. In some embodiments, the term “treat,” “treating,” or “treatment” means that one or more symptoms of the disease are alleviated in a subject receiving the treatment as disclosed and described herein comparing to a subject who does not receive such treatment.


A “vector” refers to a DNA construct containing a nucleic acid molecule that is operably linked to a suitable control sequence capable of effecting the expression of the nucleic acid molecule in a suitable host. Such control sequences may include a promoter to effect transcription, an optional operator sequence to control such transcription, a sequence encoding suitable mRNA ribosome binding sites, and sequences which control termination of transcription and translation. The vector may be a plasmid, a phage particle, a virus, or simply a potential genomic insert. Once transformed into a suitable host, the vector may replicate and function independently of the host genome, or may, in some instances, integrate into the genome itself.


Methods of Genetically Engineering Cells

Provided herein in certain embodiments are methods of genetically engineering a cell or population of cells to knock out, knock down, or otherwise alter the expression of one or more genes associated with blood type, including but not limited to ABO, FUT1, and RHD. As used herein, “knock out” includes deleting all or a portion of the target nucleotide sequence in a way that interferes with the function of the target gene. For example, a knock out can be achieved by altering a target nucleotide sequence by inducing an indel in a functional domain of the target nucleotide sequence (e.g., a DNA binding domain) or where base editing and prime editing can be used to change single nucleic acid bases to an alternate base in order to alter the genome sequence. “Knock down” refers to genetic modifications that result in reduced expression of the edited gene. As used herein, “indel” refers to a mutation resulting from an insertion, deletion, or a combination thereof, of nucleotide bases in the genome. Thus, an indel typically inserts or deletes nucleotides from a sequence. As will be appreciated by those skilled in the art, an indel in a coding region of a genomic sequence will result in a frameshift mutation, unless the length of the indel is a multiple of three. A gene editing system, e.g., the CRISPR/Cas system, of the present disclosure can be used to induce an indel of any length in a target polynucleotide sequence.


In certain embodiments, the methods provided herein utilize gene editing. Gene editing is a type of genetic engineering in which a nucleotide sequence may be inserted, deleted, modified, or replaced in the genome of a living organism. Current gene editing techniques generally utilize the innate mechanism for cells to repair double-strand breaks (DSBs) in DNA.


Eukaryotic cells repair DSBs by two primary repair pathways: non-homologous end-joining (NHEJ) and homology-directed repair (HDR). HDR typically occurs during late S phase or G2 phase, when a sister chromatid is available to serve as a repair template. NHEJ is more common and can occur during any phase of the cell cycle, but it is more error prone. In gene editing, NHEJ is generally used to produce insertion/deletion mutations (indels), which can produce targeted loss of function in a target gene by shifting the open reading frame (ORF) and producing alterations in the coding region or an associated regulatory region. HDR, on the other hand, is a preferred pathway for producing targeted knock-ins, knockouts, or insertions of specific mutations in the presence of a repair template with homologous sequences. Several methods are known to a skilled artisan to improve HDR efficiency, including, for example, chemical modulation (e.g., treating cells with inhibitors of key enzymes in the NHEJ pathway); timed delivery of the gene editing system at S and G2 phases of the cell cycle; cell cycle arrest at S and G2 phases; and introduction of repair templates with homology sequences. The methods provided herein may utilize HDR-mediated repair, NHEJ-mediated repair, or a combination thereof.


A. Gene Editing Systems

In some embodiments, the methods provided herein for genetically modifying a cell or population of cells to knock out, knock down, or otherwise modify one or more genes utilize a site-directed nuclease, including, for example, prime editing, zinc finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), meganucleases, transposases, and clustered regularly interspaced short palindromic repeat (CRISPR)/Cas systems.


1. Prime and PASTE Editing

Prime editing is a versatile and precise genome editing method that directly writes new genetic information into a specified DNA site using a catalytically impaired Cas9 endonuclease fused to an engineered reverse transcriptase, programmed with a prime editing guide RNA (pegRNA) that both specifies the target site and encodes the desired edit. See, e.g., Anzalone et al., Nature, 576:149-157 (2019); WO2021072328; WO2022067130, all of which are incorporated herein by reference in their entirety. Cas9 and a reverse transcriptase can also be used to insert an integrase site into the genome for insertion of a nucleic acid of interest in a process called Programmable Addition via Site-specific Targeting Elements (PASTE) editing. See, e.g., loannidi et al., bioRxiv 2021.11.01.466786; doi.org/10.1101/2021.11.01.466786, all of which are incorporated herein by reference in their entirety.


2. ZFNs

ZFNs are fusion proteins comprising an array of site-specific DNA binding domains adapted from zinc finger-containing transcription factors attached to the endonuclease domain of the bacterial FokI restriction enzyme. A ZFN may have one or more (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more) of the DNA binding domains or zinc finger domains. See, e.g., Carroll et al., Genetics Society of America (2011) 188:773-782; Kim et al., Proc. Natl. Acad. Sci. USA (1996) 93:1156-1160. Each zinc finger domain is a small protein structural motif stabilized by one or more zinc ions and usually recognizes a 3- to 4-bp DNA sequence. Tandem domains can thus potentially bind to an extended nucleotide sequence that is unique within a cell's genome.


Various zinc fingers of known specificity can be combined to produce multi-finger polypeptides which recognize about 6, 9, 12, 15, or 18-bp sequences. Various selection and modular assembly techniques are available to generate zinc fingers (and combinations thereof) recognizing specific sequences, including phage display, yeast one-hybrid systems, bacterial one-hybrid and two-hybrid systems, and mammalian cells. Zinc fingers can be engineered to bind a predetermined nucleic acid sequence. Criteria to engineer a zinc finger to bind to a predetermined nucleic acid sequence are known in the art. See, e.g., Sera et al., Biochemistry (2002) 41:7074-7081; Liu et al., Bioinformatics (2008) 24:1850-1857.


ZFNs containing FokI nuclease domains or other dimeric nuclease domains function as a dimer. Thus, a pair of ZFNs are required to target non-palindromic DNA sites. The two individual ZFNs must bind opposite strands of the DNA with their nucleases properly spaced apart. See Bitinaite et al., Proc. Natl. Acad. Sci. USA (1998) 95:10570-10575. To cleave a specific site in the genome, a pair of ZFNs are designed to recognize two sequences flanking the site, one on the forward strand and the other on the reverse strand. Upon binding of the ZFNs on either side of the site, the nuclease domains dimerize and cleave the DNA at the site, generating a DSB with 5′ overhangs. HDR can then be utilized to introduce a specific mutation, with the help of a repair template containing the desired mutation flanked by homology arms. The repair template is usually an exogenous double-stranded DNA vector introduced to the cell. See Miller et al., Nat. Biotechnol. (2011) 29:143-148; Hockemeyer et al., Nat. Biotechnol. (2011) 29:731-734.


3. TALENs

TALENs are another example of an artificial nuclease which can be used to edit a target gene. TALENs are derived from DNA binding domains termed TALE repeats, which usually comprise tandem arrays with 10 to 30 repeats that bind and recognize extended DNA sequences. Each repeat is 33 to 35 amino acids in length, with two adjacent amino acids (termed the repeat-variable di-residue, or RVD) conferring specificity for one of the four DNA base pairs. Thus, there is a one-to-one correspondence between the repeats and the base pairs in the target DNA sequences.


TALENs are produced artificially by fusing one or more TALE DNA binding domains (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more) to a nuclease domain, for example, a FokI endonuclease domain. See Zhang, Nature Biotech. (2011) 29:149-153. Several mutations to FokI have been made for its use in TALENs; these, for example, improve cleavage specificity or activity. See Cermak et al., Nucl. Acids Res. (2011) 39:e82; Miller et al., Nature Biotech. (2011) 29:143-148; Hockemeyer et al., Nature Biotech. (2011) 29:731-734; Wood et al., Science (2011) 333:307; Doyon et al., Nature Methods (2010) 8:74-79; Szczepek et al., Nature Biotech (2007) 25:786-793; Guo et al., J. Mol. Biol. (2010) 200:96. The FokI domain functions as a dimer, requiring two constructs with unique DNA binding domains for sites in the target genome with proper orientation and spacing. Both the number of amino acid residues between the TALE DNA binding domain and the FokI nuclease domain and the number of bases between the two individual TALEN binding sites appear to be important parameters for achieving high levels of activity. Miller et al., Nature Biotech. (2011) 29:143-148.


By combining engineered TALE repeats with a nuclease domain, a site-specific nuclease can be produced specific to any desired DNA sequence. Similar to ZFNs, TALENs can be introduced into a cell to generate DSBs at a desired target site in the genome, and so can be used to knock out genes or knock in mutations in similar, HDR-mediated pathways. See Boch, Nature Biotech. (2011) 29:135-136; Boch et al., Science (2009) 326:1509-1512; Moscou et al., Science (2009) 326:3501.


4. Meganucleases

Meganucleases are enzymes in the endonuclease family which are characterized by their capacity to recognize and cut large DNA sequences (from 14 to 40 base pairs). Meganucleases are grouped into families based on their structural motifs which affect nuclease activity and/or DNA recognition. The most widespread and best known meganucleases are the proteins in the LAGLIDADG family, which owe their name to a conserved amino acid sequence. See Chevalier et al., Nucleic Acids Res. (2001) 29(18): 3757-3774. On the other hand, the GIY-YIG family members have a GIY-YIG module, which is 70-100 residues long and includes four or five conserved sequence motifs with four invariant residues, two of which are required for activity. See Van Roey et al., Nature Struct. Biol. (2002) 9:806-811. The His-Cys family meganucleases are characterized by a highly conserved series of histidines and cysteines over a region encompassing several hundred amino acid residues. See Chevalier et al., Nucleic Acids Res. (2001) 29(18):3757-3774. Members of the NHN family are defined by motifs containing two pairs of conserved histidines surrounded by asparagine residues. See Chevalier et al., Nucleic Acids Res. (2001) 29(18):3757-3774.


Because the chance of identifying a natural meganuclease for a particular target DNA sequence is low due to the high specificity requirement, various methods including mutagenesis and high throughput screening methods have been used to create meganuclease variants that recognize unique sequences. Strategies for engineering a meganuclease with altered DNA-binding specificity, e.g., to bind to a predetermined nucleic acid sequence are known in the art. See, e.g., Chevalier et al., Mol. Cell. (2002) 10:895-905; Epinat et al., Nucleic Acids Res (2003) 31:2952-2962; Silva et al., J Mol. Biol. (2006) 361:744-754; Seligman et al., Nucleic Acids Res (2002) 30:3870-3879; Sussman et al., J Mol Biol (2004) 342:31-41; Doyon et al., J Am Chem Soc (2006) 128:2477-2484; Chen et al., Protein Eng Des Sel (2009) 22:249-256; Arnould et al., J Mol Biol. (2006) 355:443-458; Smith et al., Nucleic Acids Res. (2006) 363(2):283-294.


Like ZFNs and TALENs, Meganucleases can create DSBs in the genomic DNA, which can create a frame-shift mutation if improperly repaired, e.g., via NHEJ, leading to a decrease in the expression of a target gene in a cell. Alternatively, foreign DNA can be introduced into the cell along with the meganuclease. Depending on the sequences of the foreign DNA and chromosomal sequence, this process can be used to modify the target gene. See Silva et al., Current Gene Therapy (2011) 11:11-27.


5. Transposases

Transposases are enzymes that bind to the end of a transposon and catalyze its movement to another part of the genome by a cut and paste mechanism or a replicative transposition mechanism. By linking transposases to other systems such as the CRISPR/Cas system, new gene editing tools can be developed to enable site specific insertions or manipulations of the genomic DNA. There are two known DNA integration methods using transposons which use a catalytically inactive Cas effector protein and Tn7-like transposons. The transposase-dependent DNA integration does not provoke DSBs in the genome, which may guarantee safer and more specific DNA integration.


6. CRISPR/Cas

The CRISPR system was originally discovered in prokaryotic organisms (e.g., bacteria and archaea) as a system involved in defense against invading phages and plasmids that provides a form of acquired immunity. Now it has been adapted and used as a popular gene editing tool in research and clinical applications.


CRISPR/Cas systems generally comprise at least two components: one or more guide RNAs (gRNAs) and a Cas protein. The Cas protein is a nuclease that introduces a DSB into the target site. CRISPR-Cas systems fall into two major classes: class 1 systems use a complex of multiple Cas proteins to degrade nucleic acids; class 2 systems use a single large Cas protein for the same purpose. Class 1 is divided into types I, Ill, and IV; class 2 is divided into types II, V, and VI. Different Cas proteins adapted for gene editing applications include, but are not limited to, Cas3, Cas4, Cas5, Cas8a, Cas8b, Cas8c, Cas9, Cas10, Cas12, Cas12a (Cpf1), Cas12b (C2c1), Cas12c (C2c3), Cas12d (CasY), Cas12e (CasX), Cas12f (C2c10), Cas12g, Cas12h, Cas12i, Cas12k (C2c5), Cas13, Cas13a (C2c2), Cas13b, Cas13c, Cas13d, C2c4, C2c8, C2c9, Cmr5, Cse1, Cse2, Csf1, Csm2, Csn2, Csx10, Csx11, Csy1, Csy2, Csy3, and Mad7. See, e.g., Jinek et al., Science (2012) 337 (6096):816-821; Dang et al., Genome Biology (2015) 16:280; Ran et al., Nature (2015) 520:186-191; Zetsche et al., Cell (2015) 163:759-771; Strecker et al., Nature Comm. (2019) 10:212; Yan et al., Science (2019) 363:88-91. The most widely used Cas9 is a type II Cas protein and is described herein as illustrative. These Cas proteins may be originated from different source species. For example, Cas9 can be derived from S. pyogenes or S. aureus.


In the original microbial genome, the type II CRISPR system incorporates sequences from invading DNA between CRISPR repeat sequences encoded as arrays within the host genome. Transcripts from the CRISPR repeat arrays are processed into CRISPR RNAs (crRNAs) each harboring a variable sequence transcribed from the invading DNA, known as the “protospacer” sequence, as well as part of the CRISPR repeat. Each crRNA hybridizes with a second transactivating CRISPR RNA (tracrRNA), and these two RNAs form a complex with the Cas9 nuclease. The protospacer-encoded portion of the crRNA directs the Cas9 complex to cleave complementary target DNA sequences, provided that they are adjacent to short sequences known as “protospacer adjacent motifs” (PAMs).


While the foregoing description has focused on Cas9 nuclease, it should be appreciated that other RNA-guided nucleases exist which utilize gRNAs that differ in some ways from those described to this point. For instance, Cpf1 (CRISPR from Prevotella and Franciscella 1; also known as Cas12a) is an RNA-guided nuclease that only requires a crRNA and does not need a tracrRNA to function.


Since its discovery, the CRISPR system has been adapted for inducing sequence specific DSBs and targeted genome editing in a wide range of cells and organisms spanning from bacteria to eukaryotic cells including human cells. In its use in gene editing applications, artificially designed, synthetic gRNAs have replaced the original crRNA:tracrRNA complexes, including in certain embodiments via a single gRNA. For example, the gRNAs can be single guide RNAs (sgRNAs) composed of a crRNA, a tetraloop, and a tracrRNA. The crRNA usually comprises a complementary region (also called a spacer, usually about 20 nucleotides in length) that is user-designed to recognize a target DNA of interest. The tracrRNA sequence comprises a scaffold region for Cas nuclease binding. The crRNA sequence and the tracrRNA sequence are linked by the tetraloop and each have a short repeat sequence for hybridization with each other, thus generating a chimeric sgRNA. One can change the genomic target of the Cas nuclease by simply changing the spacer or complementary region sequence present in the gRNA. The complementary region will direct the Cas nuclease to the target DNA site through standard RNA-DNA complementary base pairing rules.


In order for the Cas nuclease to function, there must be a PAM immediately downstream of the target sequence in the genomic DNA. Recognition of the PAM by the Cas protein is thought to destabilize the adjacent genomic sequence, allowing interrogation of the sequence by the gRNA and resulting in gRNA-DNA pairing when a matching sequence is present. The specific sequence of PAM varies depending on the species of the Cas gene. For example, the most commonly used Cas9 nuclease derived from S. pyogenes recognizes a PAM sequence of 5′-NGG-3′ or, at less efficient rates, 5′-NAG-3′, where “N” can be any nucleotide. Other Cas nuclease variants with alternative PAMs have also been characterized and successfully used for genome editing, which are summarized in Table 1 below.









TABLE 1







Exemplary Cas nuclease variants and their PAM sequences









CRISPR Nuclease
Source Organism
PAM Sequence (5′→3′)





SpCas9

Streptococcus pyogenes

ngg or nag


SaCas9

Staphylococcus aureus

ngrrt or ngrrn


NmeCas9

Neisseria meningitidis

nnnngatt


CjCas9

Campylobacter jejuni

nnnnryac


StCas9

Streptococcus thermophilus

nnagaaw


TdCas9

Treponema denticola

naaaac


LbCas12a (Cpf1)

Lachnospiraceae bacterium

tttv


AsCas12a (Cpf1)

Acidaminococcus sp.

tttv


AacCas12b

Alicyclobacillus acidiphilus

ttn


BhCas12b v4

Bacillus hisashii

attn, tttn, or gttn





r = a or g;


y = c or t;


w = a or t;


v = a or c or g;


n = any base






In some embodiments, Cas nucleases may comprise one or more mutations to alter their activity, specificity, recognition, and/or other characteristics. For example, the Cas nuclease may have one or more mutations that alter its fidelity to mitigate off-target effects (e.g., eSpCas9, SpCas9-HF1, HypaSpCas9, HeFSpCas9, and evoSpCas9 high-fidelity variants of SpCas9). For another example, the Cas nuclease may have one or more mutations that alter its PAM specificity.


B. Genomic Loci for Gene Editing

In some embodiments of the methods provided herein, the genomic locus for site-directed knockout, knockdown, or other modification is a gene associated with blood type. In some embodiments, the one or more genes associated with blood types are selected from the group consisting of ABO, FUT1, and RHD. In some embodiments, two or more locations in a gene are modified. In some embodiments, two or more genes are modified.


The specific site for editing within a gene may be located within any suitable region of the gene, including but not limited to a gene coding region (also known as a coding sequence or “CDS”), an exon, an intron, a sequence spanning a portion of an exon and a portion of an adjacent intron, or a regulatory region (e.g., promoter, enhancer). In some embodiments, the gene editing occurs in one allele of the specific genomic locus. In some embodiments, the gene editing occurs in both alleles of the specific genomic locus.


1. ABO

The ABO gene encodes blood group ABO system transferase, which is an enzyme with glycosyltransferase activity, and determines the ABO blood type of an individual by modifying the oligosaccharides on cell surface glycoproteins. The ABO gene locus encodes three alleles. The A allele produces α-1,3-N-acetylgalactosamine transferase (A-transferase), which catalyzes the transfer of GaINAc residues from the UDP-GaINAc donor nucleotide to the Gal residues of the acceptor H antigen, converting the H antigen into A antigen in A and AB individuals. The B allele encodes α-1,3-galactosyl transferase (B-transferase), which catalyzes the transfer of Gal residues from the UDP-Gal donor nucleotide to the Gal residues of the acceptor H antigen, converting the H antigen into B antigen in B and AB individuals. The O allele lacks both enzymatic activities because of a frame shift near the N-terminus resulting in translation of an almost entirely different protein. Thus, neither A nor B antigen is found in O individuals.


The human ABO gene resides on chromosome 9 at the band 9q34.2 (chromosome 9: 133,233,278-133,276,024, reverse strand). The ABO genomic sequence as set forth in Ensembl ID ENSG00000175164.16 is set forth in SEQ ID NO:1.


2. FUT1

The FUT1 gene encodes galactoside 2-alpha-L-fucosyltransferase 1, which is involved in the synthesis of the H antigen (determinant of blood type O).


The human FUT1 gene resides on chromosome 19 at the band 19q13.33 (chromosome 19:48,748,011-48,755,390, reverse strand). The FUT1 genomic sequence as set forth in Ensembl ID: ENSG00000174951.12 is set forth in SEQ ID NO:2.


3. RHD

The Rh system is the second most important blood-group system with currently 50 antigens, of which the D antigen is the most significant Rh antigen for its likelihood to provoke an immune system response. The Rh D antigen is encoded by the RHD gene. Other Rh antigen encoding genes include RHCE, RhAG, RhBG, and RhCG.


The human RHD gene resides on chromosome 1 at the band 1p36.11 (chromosome 1: 25,272,393-25,330,445, forward strand). The RHD genomic sequence as set forth in Ensembl ID: ENSG00000187010.21 is set forth in SEQ ID NO:3.


In some embodiments, the modifications to the specific genomic loci may prevent expression of the genes entirely (i.e., knockout). In some embodiments, the modifications to the specific genomic loci may result in reduced expression of the genes (i.e., knockdown). In certain of these embodiments, gene knockout or knockdown may be achieved by any of the site-directed nuclease-based gene editing systems described, including, for example, the CRISPR/Cas system. In some embodiments, the gene knockout or knockdown occurs through insertion-deletion (indel) mutations at the target loci (e.g., through the NHEJ pathway) including, for example, frameshift-inducing indels or indels in a protein-coding region that result in loss-of-function mutations. In some embodiments, the gene knockout or knockdown occurs through deletions of the genes or portions thereof through either the NHEJ or HDR pathway, although HDR is a preferred pathway for introducing specific deletions. In some embodiments, the gene knockout or knockdown occurs through introduction of silencing or loss-of-function mutations at the target loci through the HDR pathway.


In some embodiments, the modifications to the specific genomic loci may involve insertion of exogenous genes to be expressed in place of the genes being edited (i.e., knock-in). In certain of these embodiments, gene knock-in may be achieved by the introduction of a site-directed nuclease and a transgene to be inserted through homologous recombination. The transgene can be flanked by homology arms (e.g., left homology arm (LHA) and right homology arm (RHA), respectively) and delivered to the cell for insertion into specified loci by HDR-based approaches as described. The homology arms are specifically designed for the target loci to serve as a template for HDR. The length of each homology arm is generally dependent on the size of the transgene being inserted, with larger insertions requiring longer homology arms. Any of the gene editing systems described, including, for example, the CRISPR/Cas system, may be employed for gene knock-in. In addition to expression of transgenes, gene knock-in may result in lost or reduced expression of the original genes at the target loci.


In some embodiments, the gene editing occurs at one or more genomic loci associated with blood types including, for example, ABO, FUT1, and RHD, and results in reduced or no expression of one or more of these genes. By modulating (e.g., reducing or deleting) expression of the ABO, FUT1, and/or RHD gene, the blood type of the cells may be modified. For example, the blood type of the cell may be changed from A, B, or AB to O by knocking out the A and/or B alleles of the ABO gene. For another example, the blood type of the cell may be changed from Rh+ to Rh− by knocking out the RHD gene.


C. Guide RNAs (gRNAs) for Gene Editing


In some embodiments, gRNAs are provided for use in targeted gene editing as described herein, especially in association with the CRISPR/Cas system. The gRNAs comprise a crRNA sequence, which in turn comprises a complementary region (also called a spacer) that recognizes and binds a complementary target DNA of interest. The length of the spacer or complementary region is generally between 15 and 30 nucleotides, usually about 20 nucleotides in length, although will vary based on the requirements of the specific CRISPR/Cas system. In certain embodiments, the spacer or complementary region is fully complementary to the target DNA sequence. In other embodiments, the spacer is partially complementary to the target DNA sequence, for example at least 80%, 85%, 90%, 95%, 98%, or 99% complementary.


In certain embodiments, the gRNAs provided herein further comprise a tracrRNA sequence, which comprises a scaffold region for binding to a nuclease. The length and/or sequence of the tracrRNA may vary depending on the specific nuclease being used for editing. In certain embodiments, nuclease binding by the gRNA does not require a tracrRNA sequence. In those embodiments where the gRNA comprises a tracrRNA, the crRNA sequence may further comprise a repeat region for hybridization with complementary sequences of the tracrRNA.


In some embodiments, the gRNAs provided herein comprise two or more gRNA molecules, for example a crRNA and a tracrRNA as two separate molecules. In other embodiments, the gRNAs are single guide RNAs (sgRNAs), including sgRNAs comprising a crRNA and a tracrRNA on a single RNA molecule. In certain of these embodiments, the crRNA and tracrRNA are linked by an intervening tetraloop.


In some embodiments, one gRNA can be used in association with a site-directed nuclease for targeted editing of a gene locus of interest. In other embodiments, two or more gRNAs targeting the same gene locus of interest can be used in association with a site-directed nuclease.


In some embodiments, exemplary gRNAs (e.g., sgRNAs) for use with various common Cas nucleases that require both a crRNA and tracrRNA, including Cas9 and Cas12b (C2c1), are provided in Table 2. See, e.g., Jinek et al., Science (2012) 337 (6096):816-821; Dang et al., Genome Biology (2015) 16:280; Ran et al., Nature (2015) 520:186-191; Strecker et al., Nature Comm. (2019) 10:212. For each exemplary gRNA, sequences for different portions of the gRNA, including the complementary region or spacer, crRNA repeat region, tetraloop, and tracrRNA, are shown. In some embodiments, the gRNA comprises all or a portion of the nucleotide sequences set forth in SEQ ID NOs: 4-7. In some embodiments, the gRNA comprises all or a portion of the nucleotide sequences set forth in SEQ ID NOs: 8-11. In some embodiments, the gRNA comprises all or a portion of the nucleotide sequences set forth in SEQ ID NOs: 12-15. In some embodiments, the gRNA comprises all or a portion of the nucleotide sequences set forth in SEQ ID NOs: 16-19.


In some embodiments, the gRNA comprises a crRNA repeat region comprising, consisting of, or consisting essentially of the nucleotide sequence set forth in SEQ ID NO:5, SEQ ID NO:9, SEQ ID NO:13, or SEQ ID NO:18. In some embodiments, the gRNA comprises a tetraloop comprising, consisting of, or consisting essentially of the nucleotide sequence set forth in SEQ ID NO:6 or SEQ ID NO:17. In some embodiments, the gRNA comprises a tracrRNA comprising, consisting of, or consisting essentially of the nucleotide sequence set forth in SEQ ID NO:7, SEQ ID NO:11, SEQ ID NO:15, or SEQ ID NO:16.









TABLE 2







Exemplary gRNA structure and sequence for CRISPR/Cas









SEQ ID NO:
Sequence (5′→3′)
Description





 4
nnnnnnnnnnnnnnnnnnnn
Exemplary spCas9 1




Complementary region




(spacer)





 5
guuuuagagcua
Exemplary spCas9 1




crRNA repeat region





 6
gaaa
Exemplary spCas9 1




tetraloop





 7
uagcaaguuaaaauaaggcuaguccguuaucaacuug
Exemplary spCas9 1



aaaaaguggcaccgagucggugcuuuuuu
tracrRNA





 8
nnnnnnnnnnnnnnnnnnnn
Exemplary spCas9 2




Complementary region




(spacer)





 9
guuusagagcuaugcug
Exemplary spCas9 2




crRNA repeat region





10
gaaa
Exemplary spCas9 2




tetraloop





11
cagcauagcaaguusaaauaaggcuaguccguuauca
Exemplary spCas9 2



acuugaaaaaguggcaccgagucggugcuuuuuu
tracrRNA





12
nnnnnnnnnnnnnnnnnnnn
Exemplary saCas9




Complementary region




(spacer)





13
guuuuaguacucug
Exemplary saCas9




crRNA repeat region





14
gaaa
Exemplary saCas9




tetraloop





15
cagaaucuacuaaaacaaggcaaaaugccguguuuau
Exemplary saCas9



cucgucaacuuguuggcgagauuuuuu
tracrRNA





16
gucgucuauaggacggcgaggacaacgggaagugcca
Exemplary AkCas12b



augugcucuuuccaagagcaaacaccccguuggcuuca
tracrRNA



agaugaccgcucg






17
aaaa
Exemplary AkCas12b




tetraloop





18
cgagcggucugagaaguggcacu
Exemplary AkCas12b




crRNA repeat region





19
nnnnnnnnnnnnnnnnnnnn
Exemplary AkCas12b




Complementary region




(spacer)





S = c or g; n = any base






In some embodiments, the gRNA comprises a complementary region specific to a blood type gene locus, for example, the ABO locus, the FUT1 locus, or the RHD locus. The complementary region may bind a target sequence in any region of the blood type gene locus, including for example, a CDS, an exon, an intron, a sequence spanning a portion of an exon and a portion of an adjacent intron, or a regulatory region (e.g., promoter, enhancer). Where the target sequence is a CDS, exon, intron, or sequence spanning portions of an exon and intron, the CDS, exon, intron, or exon/intron boundary may be defined according to any splice variant of the target gene. In some embodiments, the genomic locus targeted by the gRNA is located within 4000 bp, within 3500 bp, within 3000 bp, within 2500 bp, within 2000 bp, within 1500 bp, within 1000 bp, or within 500 bp of any of the loci or regions thereof as described.


In some embodiments, the gRNA used herein for targeted gene editing comprises a complementary region that recognizes a target genomic sequence of the ABO gene locus. In certain of these embodiments, the target sequence is located in a CDS, an exon, an intron, a sequence spanning a portion of an exon and a portion of an adjacent intron, or regulatory regions of the ABO gene. In certain embodiments, the gRNA comprises a complementary region that recognizes a target genomic sequence located entirely within an exon of the ABO gene, for example an exon as identified in Ensembl ENSG00000175164.4, ENST00000611156.4, or ENST00000538324.2, or in NCBI NC_000009.11. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 1-63, 806-863, 13857-13926, 14651-14707, 16159-16206, 17893-17983, 18483-18616, 19669-26168, or 42416-42747 of SEQ ID NO:1. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 824-863, 13857-13926, 14651-14707, 16159-16206, 17893-17928, 18483-18616, or 19669-20849 of SEQ ID NO:1. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 810-863, 13857-13926, 14651-14707, 16159-16206, 17893-17928, 18483-18501, 18504-18616, or 19669-20849 of SEQ ID NO:1. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 810-863, 13857-13926, 14651-14707, 16159-16206, 17893-17928, 18483-18501, 18504-18616, 19669-20350, or 20355-20423 of SEQ ID NO:1.


Exemplary target genomic sequences, the strand in which they are located, their associated PAM sequences, and cut sites of gRNAs targeting the ABO gene are provided in Table 4. In some embodiments, the gRNA targeting the ABO gene comprises a complementary region comprising, consisting of, or consisting essentially of a nucleotide sequence complementary to a nucleotide sequence set forth in any of SEQ ID NOs: 20-203. In some embodiments, the gRNA targeting the ABO gene comprises a complementary region comprising, consisting of, or consisting essentially of a nucleotide sequence complementary to the reverse complement of any of SEQ ID NOs: 20-203.


In some embodiments, the gRNA used herein for targeted gene editing comprises a complementary region that recognizes a target genomic sequence of the FUT1 gene locus. In certain of these embodiments, the target sequence is located in a CDS, an exon, an intron, a sequence spanning a portion of an exon and a portion of an adjacent intron, or regulatory region of the FUT1 gene. In certain embodiments, the gRNA comprises a complementary region that recognizes a target genomic sequence located entirely within an exon of the FUT1 gene, for example an exon as identified in Ensembl ENSG00000174951.12 or ENST00000645652.2, or in NCBI NG_007510. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 1-101, 1731-2066, 2269-2901, or 4108-7380 of SEQ ID NO:2. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 2750-2901 or 4108-7380 of SEQ ID NO:2. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 33-101, 1731-1743, 1828-2066, 2269-2901, or 4108-7380 of SEQ ID NO:2. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 33-101, 1731-1743, 1828-2066, 2269-2499, or 4108-7380 of SEQ ID NO:2. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 33-101, 1731-1743, 1828-2066, 2666-2901, or 4108-7380 of SEQ ID NO:2. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 33-101, 1731-1743, 1828-2066, 2750-2901, or 4108-7380 of SEQ ID NO:2.


Exemplary target genomic sequences, the strand in which they are located, their associated PAM sequences, and cut sites of gRNAs targeting the FUT1 gene are provided in Table 4. In some embodiments, the gRNA targeting the FUT1 gene comprises a complementary region comprising, consisting of, or consisting essentially of a nucleotide sequence complementary to a nucleotide sequence set forth in any of SEQ ID NOs: 204-420. In some embodiments, the gRNA targeting the FUT1 gene comprises a complementary region comprising, consisting of, or consisting essentially of a nucleotide sequence complementary to the reverse complement of any of SEQ ID NOs: 204-420.


In some embodiments, the gRNA used herein for targeted gene editing comprises a complementary region that recognizes a target genomic sequence of the RHD gene locus. In certain of these embodiments, the target sequence is located in a CDS, an exon, an intron, a sequence spanning a portion of an exon and a portion of an adjacent intron, or regulatory regions of the RHD gene. In certain embodiments, the gRNA comprises a complementary region that recognizes a target genomic sequence located entirely within an exon of the RHD gene, for example an exon as identified in Ensembl ENSG00000187010.21, ENST00000328664.9, ENST00000622561.4, ENST00000423810.6, ENST00000342055.9, ENST00000568195.5, ENST00000357542.8, ENST00000417538.6, ENST00000454452.6, or ENST00000648012.1, or in NCBI NG_007494.1. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 1-303, 12181-12367, 18249-18399, 28554-28701, 29128-29294, 30930-31067, 34204-34550, 35255-35424, 44608-44687, 49497-49570, or 56506-58053 of SEQ ID NO:3. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 117-303, 12181-12367, 18249-18399, 28554-28701, 29128-29294, 30930-31067, 34204-34337, 44608-44687, 49497-49570, or 56506-58053 of SEQ ID NO:3. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 94-303, 12181-12367, 18249-18399, 28554-28701, 29128-29294, 30930-31067, 34204-34337, 35255-35424, 44608-44687, 49497-49570, or 56506-58053 of SEQ ID NO:3. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 156-303, 12181-12367, 18249-18399, 28554-28701, 29128-29294, 30930-31067, 34204-34337, 35255-35424, 44608-44687, or 56506-56819 of SEQ ID NO:3. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 156-303, 12181-12367, 18249-18399, 28554-28701, 29128-29294, 30930-31067, 34204-34337, 35255-35424, or 56506-56819 of SEQ ID NO:3. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 156-303, 12181-12367, 18249-18399, 28554-28701, 29128-29294, 30930-31067, 34204-34337, 44608-44687, or 56506-56819 of SEQ ID NO:3. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 156-303, 12181-12367, 18249-18399, 28554-28701, 29128-29294, 30930-31067, 34204-34337, or 56506-56819 of SEQ ID NO:3. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 156-303, 12181-12367, 18249-18399, 28554-28701, 29128-29294, 30930-31067, 44608-44687, or 56506-56819 of SEQ ID NO:3. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 106-303, 12181-12367, 18249-18399, 28554-28701, 29128-29294, 30930-31067, or 56506-56819 of SEQ ID NO:3. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 47-303, 12181-12367, 18249-18399, 28554-28701, 29128-29294, 30930-31067, 44608-44687, 49497-49570, or 56506-56769 of SEQ ID NO:3. In certain embodiments, the gRNA comprises a complementary region that recognizes a 15-30 nucleotide target sequence located within nucleotides 117-303, 12181-12367, 18249-18399, 28554-28701, 29128-29294, 30930-31067, 34204-34337, 44608-44687, 49497-49570, or 56506-58053 of SEQ ID NO:3.


Exemplary target genomic sequences, the strand in which they are located, their associated PAM sequences, and cut sites of gRNAs targeting the RHD gene are provided in Table 4. In some embodiments, the gRNA targeting the RHD gene comprises a complementary region comprising, consisting of, or consisting essentially of a nucleotide sequence complementary to a nucleotide sequence set forth in any of SEQ ID NOs: 421-580. In some embodiments, the gRNA targeting the RHD gene comprises a complementary region comprising, consisting of, or consisting essentially of a nucleotide sequence complementary to the reverse complement of any of SEQ ID NOs: 421-580.


In some embodiments, provided are methods of identifying new loci and/or gRNA sequences for use in the gene editing systems as described. For example, for CRISPR/Cas systems, when an existing gRNA for a particular locus (e.g., any of the exemplary gRNAs provided) is known, an “inch worming” approach can be used to identify additional loci by scanning the flanking regions on either side of the known locus for PAM sequences, which usually occurs about every 100 base pairs (bp) across the genome. The PAM sequence will depend on the particular Cas nuclease used because different nucleases usually have different corresponding PAM sequences. The flanking regions on either side of the locus can be between about 500 to 4000 bp long, for example, about 500 bp, about 1000 bp, about 1500 bp, about 2000 bp, about 2500 bp, about 3000 bp, about 3500 bp, or about 4000 bp long. When a PAM sequence is identified within the search range, a new guide can be designed according to the sequence of that locus for use in association with any of the gene editing system described. In certain embodiments, the new gRNAs identified using this approach can target a genomic locus within 4000 bp, within 3500 bp, within 3000 bp, within 2500 bp, within 2000 bp, within 1500 bp, within 1000 bp, within 500 bp, within 400 bp, within 300 bp, within 200 bp, within 100 bp, or within 50 bp of any of the genomic cut sites provided in Table 4. In certain embodiments, the gRNA is configured to produce a cut site at a position within 5, 10, 15, 20, 30, 40, or 50 nucleotides of any of the genomic cut sites provided in Table 4.


In some embodiments, the activity, stability, and/or other characteristics of gRNAs can be altered through the incorporation of chemical and/or sequential modifications. As one example, transiently expressed or delivered nucleic acids can be prone to degradation by, e.g., cellular nucleases. Accordingly, the gRNAs described herein can contain one or more modified nucleosides or nucleotides which introduce stability toward nucleases. While not being bound by a particular theory, it is believed that certain modified gRNAs described herein can exhibit a reduced innate immune response when introduced into a population of cells, particularly the cells of the present technology. As used herein, the term “innate immune response” includes a cellular response to exogenous nucleic acids, including single stranded nucleic acids, generally of viral or bacterial origin, which involves the induction of cytokine expression and release, particularly the interferons, and cell death. Other common chemical modifications of gRNAs to improve stabilities, increase nuclease resistance, and/or reduce immune response include 2′-O-methyl modification, 2′-fluoro modification, 2′-O-methyl phosphorothioate linkage modification, and 2′-O-methyl 3′ thioPACE modification.


One common 3′ end modification is the addition of a poly A tract comprising one or more (and typically 5-200) adenine (A) residues. The poly A tract can be contained in the nucleic acid sequence encoding the gRNA or can be added to the gRNA during chemical synthesis, or following in vitro transcription using a polyadenosine polymerase (e.g., E. coli Poly(A)Polymerase). In vivo, poly-A tracts can be added to sequences transcribed from DNA vectors through the use of polyadenylation signals. Other suitable gRNA modifications include, without limitations, those described in U.S. Patent Application No. US 2017/0073674 A1 and International Publication No. WO 2017/165862 A1, the entire contents of each of which are incorporated by reference herein.


D. Delivery of Gene Editing Systems into a Cell


In some embodiments, provided are compositions comprising one or more components of a gene editing system described herein, including one or more gRNAs, a site-directed nuclease (e.g., a Cas nuclease) or a nucleotide sequence encoding a site-directed nuclease protein, and optionally a transgene for targeted insertion. In some embodiments, the compositions are formulated for delivery into a cell.


In some embodiments, components of a gene editing system provided herein, including one or more gRNAs, a site-directed nuclease (e.g., a Cas nuclease) or a nucleotide sequence encoding a site-directed nuclease protein, and optionally a transgene for targeted insertion, may be delivered into a cell in the form of a vector. The delivery vector can be any type of vector suitable for introduction of nucleotide sequences into a cell, including, for example, plasmids, adenoviral vectors, adeno-associated viral (AAV) vectors, retroviral vectors, lentiviral vectors, phages, and HDR-based donor vectors. The different components may be introduced into a cell together or separately, and may be delivered in a single vector or multiple vectors.


In some embodiments, the vector may be introduced into a cell by any known method in the field, including, for example, viral transformation, calcium phosphate transfection, lipid-mediated transfection, DEAE-dextran, electroporation, microinjection, nucleoporation, liposomes, nanoparticles, or other methods.


In some embodiments, the present technology provides compositions comprising a vector according to various embodiments disclosed herein. In some embodiments, the compositions may further comprise one or more pharmaceutically acceptable carriers, excipients, preservatives, or a combination thereof. A “pharmaceutically acceptable carrier or excipient” refers to a pharmaceutically acceptable material, composition, or vehicle that is involved in carrying or transporting a compound of interest from one tissue, organ, or portion of the body to another tissue, organ, or portion of the body. For example, the carrier or excipient may be a liquid or solid filler, diluent, excipient, solvent, or encapsulating material, or some combination thereof. Each component of the carrier or excipient must be “pharmaceutically acceptable,” in that it must be compatible with the other ingredients of the formulation. It also must be suitable for contact with any tissue, organ, or portion of the body that it may encounter, meaning that it must not carry a risk of toxicity, irritation, allergic response, immunogenicity, or any other complication that excessively outweighs its therapeutic benefits. Suitable excipients include water, saline, dextrose, glycerol, or the like and combinations thereof. In some embodiments, compositions comprising cells as disclosed herein further comprise a suitable infusion media.


In some embodiments, provided are cells or compositions thereof comprising one or more components of a gene editing system described herein, including one or more gRNAs, a site-directed nuclease (e.g., a Cas nuclease) or a nucleotide sequence encoding a site-directed nuclease protein, and optionally a transgene for targeted insertion.


E. Additional Genetic Modifications for Hypoimmunity

In some embodiments, in addition to the gene editing methods as described to knock out, knock down, or otherwise alter the expression of one or more genes associated with blood type (e.g., ABO, FUT1, and RHD) in a cell, the cell, for example, in cases of allogeneic cells, may have additional genetic modifications, to further reduce potential graft-versus-host risks after infusion into the recipient or risks of being eliminated by the recipient's innate immune system. These additional modifications may include, for example, reducing or eliminating the expression of major histocompatibility complex (MHC) class I and/or MHC class II (MHC I and/or MHC II) genes, which encode cell surface molecules specialized to present antigenic peptides to immune cells. Reduced expression of MHC I and/or MHC II molecules in allogeneic cells may prevent recognition of these cells by the immune cells of the recipient and thus rejection of the graft. The step of modifying (e.g., reducing or eliminating) MHC I and/or MHC II molecules may occur before, at the same time as, or after the step of modifying the expression of one or more blood type genes. The MHC in humans is called human leukocyte antigen (HLA). Class I HLA (HLA I) corresponding to MHC I include the HLA-A, HLA-B, and HLA-C genes, and Class II HLA (HLA I) corresponding to MHC II include the HLA-DR, HLA-DQ, HLA-DP, HLA-DM, and HLA-DO genes.


In some embodiments, the additional modifications to a cell to reduce the immunogenicity of the cell comprise genetically modifying the cell to reduce the expression of one or more immune factors, including, for example, class II transactivator (CIITA), P2 microglobulin (B2M), NLRC5, CTLA-4, PD-1, HLA-A, HLA-BM, HLA-C, RFX-ANK, NFY-A, RFX5, RFX-AP, NFY-B, NFY-C, IRF1, MIC-A, MIC-B, TXNIP, CD142, CD38, PCDH11Y, NLGN4Y, and TAP1.


In some embodiments, the cell may be modified to have reduced expression of MHC I genes by targeting and modulating one or more of the HLA loci individually, such as HLA-A, HLA-B, and/or HLA-C, or collectively with HLA-Razor. In some embodiments, the modulation occurs through insertion-deletion (indel) modifications of one of more of the HLA loci, including HLA-A, HLA-B, and/or HLA-C, for example, by using the CRISPR/Cas system as described. By modulating (e.g., reducing or deleting) expression of any of the HLA genes, the cell can be rendered hypoimmunogenic and have a reduced ability to induce an immune response in a recipient subject. In some embodiments, reduced expression of any of the HLA loci reduces or eliminates expression of one or more of the HLA-A, HLA-B, and HLA-C genes. In some embodiments, the cell has HLA-A, HLA-B, and/or HLA-C knockout. In some embodiments, the genetic modification targeting any of the HLA loci comprises inserting an exogenous nucleic acid or transgene encoding a polypeptide (e.g., a tolerogenic factor) as described herein at the HLA locus. In certain of these embodiments, insertion of the transgene into any of the HLA loci results in HLA-A, HLA-B, and/or HLA-C knockout.


In some embodiments, the cell may be modified to have reduced expression of MHC I genes by targeting and modulating the B2M locus. The B2M gene encodes a component of MHC I molecules. In some embodiments, the modulation occurs through insertion-deletion (indel) modifications or targeted mutations of the B2M locus, for example, by using the CRISPR/Cas system as described. By modulating (e.g., reducing or deleting) expression of B2M, surface trafficking of MHC I molecules is blocked, and the cell is thus rendered hypoimmunogenic. In some embodiments, the allogeneic cell modified to have reduced expression of MHC I genes has a reduced ability to induce an immune response in a recipient subject. In some embodiments, reduced expression of B2M reduces or eliminates expression of one or more of the HLA-A, HLA-B, and HLA-C genes. In some embodiments, the cell has B2M knockout. In some embodiments, the genetic modification targeting the B2M locus comprises inserting an exogenous nucleic acid or transgene encoding a polypeptide (e.g., a tolerogenic factor) as described herein at the B2M locus. In certain of these embodiments, insertion of the transgene into the B2M locus results in B2M knockout.


In some embodiments, the cell may be modified to have reduced expression of MHC I genes by targeting and modulating the TAP1 locus. TAP1 encoded by the TAP1 gene assembles with TAP2 encoded by the TAP2 gene to form the transporter associated with antigen processing (TAP) complex, which is found in the endoplasmic reticulum (ER) and transports peptides of foreign origin into the ER to be attached to MHC class I proteins for presentation on the cell surface to the immune system. In some embodiments, the modulation occurs through insertion-deletion (indel) modifications of the TAP1 locus, for example, by using the CRISPR/Cas system as described. By modulating (e.g., reducing or deleting) expression of TAP1, surface trafficking of MHC I molecules is blocked, and the cell is thus rendered hypoimmunogenic. In some embodiments, reduced expression of TAP1 reduces or eliminates expression of one or more of the HLA-A, HLA-B, and HLA-C genes. In some embodiments, the cell has TAP1 knockout. In some embodiments, the genetic modification targeting the TAP1 locus comprises inserting an exogenous nucleic acid or transgene encoding a polypeptide (e.g., a tolerogenic factor) as disclosed herein at the TAP1 locus. In certain of these embodiments, insertion of the transgene into the TAP1 locus results in TAP1 knockout.


In some embodiments, the cell may be modified to have reduced expression of MHC II genes by overexpression of CD74.


In some embodiments, the cell may be modified to have reduced expression of MHC II genes by targeting and modulating the CIITA locus. CIITA is a member of the nucleotide binding domain (NBD) leucine-rich repeat (LRR) family of proteins and regulates the transcription of MHC II by associating with the MHC enhanceosome. In some embodiments, the modulation occurs through insertion-deletion (indel) modifications of the CIITA locus, for example, by using the CRISPR/Cas system as described. In some embodiments, reduced expression of CIITA reduces or eliminates expression of one or more of the HLA-DR, HLA-DQ, HLA-DP, HLA-DM, and HLA-DO genes. In some embodiments, the cell has CIITA knockout. In some embodiments, the genetic modification targeting the CIITA locus comprises inserting an exogenous nucleic acid or transgene encoding a polypeptide (e.g., a tolerogenic factor) as disclosed herein at the CIITA locus. In certain of these embodiments, insertion of the transgene into the CIITA locus results in CIITA knockout.


In certain embodiments, the cell comprises a modification, such as a genetic modification, targeting the MIC-A gene. MIC-A is a protein having known isoforms and variants (see, e.g., UniProt Q29983, accessed Jul. 18, 2022); all such forms of MIC-A are encompassed by the disclosure provided herein. In some embodiments, the genetic modification targeting the MIC-A gene is by using a targeted nuclease system that comprises a Cas protein or a polynucleotide encoding a Cas protein, and at least one guide ribonucleic acid sequence for specifically targeting the MIC-A gene. In some embodiments, the genetic modification occurs using a CRISPR/Cas system as described. For example, in some embodiments, a gRNA with a targeting sequence GATGACCCTGGCTCATATCA (SEQ ID NO:581) can be used. In some embodiments, methods of gene editing with a CRISPR/Cas system and gRNA targeting MIC-A, such as with a targeting sequence GATGACCCTGGCTCATATCA (SEQ ID NO:581), knocks out all alleles of MIC-A in a cell. In some embodiments, the cell has MIC-A knockout. In some embodiments, the genetic modification targeting the MIC-A locus comprises inserting an exogenous nucleic acid or transgene encoding a polypeptide (e.g., a tolerogenic factor) as disclosed herein at the MIC-A locus. In certain of these embodiments, insertion of the transgene into the MIC-A locus results in MIC-A knockout.


In certain embodiments, the cell comprises a modification, such as a genetic modification, targeting the MIC-B gene. MIC-B is a protein having known isoforms and variants (see, e.g., UniProt Q29980, accessed Jul. 18, 2022); all such forms of MIC-B are encompassed by the disclosure provided herein. In some embodiments, the genetic modification targeting the MIC-B gene is by using a targeted nuclease system that comprises a Cas protein or a polynucleotide encoding a Cas protein, and at least one guide ribonucleic acid sequence for specifically targeting the MIC-B gene. In some embodiments, the genetic modification occurs using a CRISPR/Cas system as described. For example, in some embodiments, a gRNA with a targeting sequence GTTTCTGCCTGTCATAGCGC (SEQ ID NO:582) can be used. In some embodiments, methods of gene editing with a CRISPR/Cas system and gRNA targeting MIC-B, such as with a targeting sequence GTTTCTGCCTGTCATAGCGC (SEQ ID NO:582) knocks out all alleles of MIC-B in a cell. In some embodiments, the cell has MIC-B knockout. In some embodiments, the genetic modification targeting the MIC-B locus comprises inserting an exogenous nucleic acid or transgene encoding a polypeptide (e.g., a tolerogenic factor) as disclosed herein at the MIC-B locus. In certain of these embodiments, insertion of the transgene into the MIC-B locus results in MIC-B knockout.


In some embodiments, the cell has genetic modifications at the B2M, TAP1, CIITA, MIC-A, and/or MIC-B loci, have B2M, TAP1, CIITA, MIC-A, and/or MIC-B knockout, or have CD74 overexpression. The B2M, TAP1, CIITA, MIC-A, and/or MIC-B knockout can occur at one allele, or both alleles, of the respective gene locus. In some embodiments, the B2M, TAP1, CIITA, MIC-A, and/or MIC-B loci are modified so that the cell has reduced or no expression of B2M, TAP1, CIITA, MIC-A, and/or MIC-B, respectively. In these embodiments, the cell has reduced expression of MHC I and/or MHC II genes (HLA I and/or HLA II in humans) as a result of B2M, TAP1, CIITA, MIC-A, and/or MIC-B deletion or knockout, or overexpression of CD74.


In some embodiments, the transgene for targeted insertion (i.e., knock-in) at a genomic locus for genetic modification as described (e.g., the ABO, FUT1, RHD, B2M, TAP1, CIITA, MIC-A, and/or MIC-B loci) may encode a tolerogenic factor that can improve the hypoimmunogenicity of the resulting cells so that they will not be subject to immune rejection when transplanted into a recipient and thus increasing the effectiveness of cell-based therapies. Examples of a tolerogenic factor include, but are not limited to, A20/TNFAIP3, CD16, CD16 Fc receptor, CD24, CD35, CD39, CD46, CD47, CD52, CD55, CD59, CD200, CCL22, CTLA4-Ig, C1 inhibitor, complement receptor (CR1), DUX4, FASL, H2-M3, IDO1, IL15-RF, HLA-C, HLA-E, HLA-E heavy chain, HLA-G, IL-10, IL-35, MANF, PD-1, PD-L1, SERPINB9, CCL21, MFGE8, and truncations, modifications, or fusions of any of the above.


In some embodiments, the tolerogenic factor is CD47, which is a leukocyte surface antigen and has a role in cell adhesion and modulation of integrins. It is expressed on the surface of a cell (e.g., a T cell) and signals to circulating macrophages not to phagocytize the cell. Overexpression of CD47 thus can reduce the immunogenicity of the cell when grafted and improve immune protection in allogeneic recipients.


CD47 is a transmembrane protein that, in humans, is encoded by the CD47 gene. It is a member of the immunoglobulin (Ig) superfamily. CD47 has a molecular weight of about −50 kDa. It is glycosylated and ubiquitously expressed by virtually all cells in the human body. It has a single IgV-like domain at its N-terminus, a highly hydrophobic stretch with five membrane-spanning segments, and an alternatively spliced cytoplasmic tail at its C-terminus. In addition, it has two extracellular regions and two intracellular regions between neighboring membrane-spanning segments. A signal peptide, when it exists on a CD47 isoform, is located at the N-terminus of the IgV-like domain.


CD47 is involved in a range of cellular processes, including apoptosis, proliferation, adhesion, and migration. CD47 interacts with multiple extracellular ligands, such as TSP-1, integrins, other CD47 proteins, and SIRPa. The CD47/SIRPa interaction regulates a multitude of intercellular interactions in many body systems, such as the immune system where it regulates lymphocyte homeostasis, dendritic cell (DC) maturation and activation, proper localization of certain DC subsets in secondary lymphoid organs, and cellular transmigration. CD47 on cells, including on donor cells in the context of transplantation or cell therapy applications, can function as a “marker of self” and regulate phagocytosis by binding to SIRPα on the surface of circulating immune cells to deliver an inhibitory “don't kill me” signal. CD47-SIRPα binding results in phosphorylation of immunoreceptor tyrosine-based inhibition motifs (ITIMs) on SIRPα, which triggers recruitment of the SHP1 and SHP2 Src homology phosphatases. These phosphatases, in turn, inhibit accumulation of myosin II at the phagocytic synapse, preventing phagocytosis (Fujioka et al., Mol. Cell. Biol., 16:6887-6899 (1996)). Phagocytosis of target cells by macrophages is ultimately regulated by a balance of activating signals (e.g., FcγR, CRT, LRP-1) and inhibitory signals (e.g., SIRPα-CD47). Elevated expression of CD47 can help the cell evade immune surveillance, subsequent destruction, and innate immune cell killing. Thus, CD47 can be used as a tolerogenic factor to induce immune tolerance, for example, when there is pathological or undesirable activation of an otherwise normal immune response. This can occur, for example, when a patient develops an immune reaction to donor antigens after receiving an allogeneic transplantation or an allogeneic cell therapy, or when the body responds inappropriately to self-antigens implicated in autoimmune diseases.


The human CD47 gene has six naturally occurring transcripts, five of which each encode a protein isoform of CD47 (Ensembl, Gene: CD47, ENSG00000196776). The six transcripts are named CD47-201, CD47-202, CD47-203, CD47-204, CD47-205, and CD47-206. The coding DNA sequence (CDS) of the six transcripts are as set forth in SEQ ID NOs: 589-594, respectively. The amino acid sequences of the five protein isoforms are as set forth in SEQ ID NOs: 583-588 respectively (see Table 3).


Transcript CD47-201 (SEQ ID NO:589) encodes isoform CD47-201 (SEQ ID NO:583), which has 305 amino acids. Isoform CD47-201 has a C-terminal truncation of 18 amino acids from isoform CD47-202. All splice junctions of the CD47-201 transcript are supported by at least one non-suspect mRNA.


Transcript CD47-202 (SEQ ID NO:590) encodes isoform CD47-202 (SEQ ID NO:584), which has 323 amino acids. CD47-202 is the longest transcript of the human CD47 gene. It is designated as the representative transcript in the Ensembl database. In identifying the representative transcript, Ensembl aims to identity the transcript that, on balance, has the highest coverage of conserved exons, highest expression, longest coding sequence and is represented in other key resources, such as NCBI and UniProt. All splice junctions of the CD47-202 transcript are supported by at least one non-suspect mRNA. Amino acids 1-18 are the signal peptide. The amino acid sequence of CD47-202 without the signal peptide is set forth in SEQ ID NO:585.


Transcript CD47-203 (SEQ ID NO:591) encodes isoform CD47-203 (SEQ ID NO:586), which has 86 amino acids. The only support for the transcript model is from a single expressed sequence tag (EST).


Transcript CD47-204 (SEQ ID NO:592) does not encode any protein. All splice junctions of this transcript are supported by at least one non-suspect mRNA


Transcript CD47-205 (SEQ ID NO:593) encodes isoform CD47-205 (SEQ ID NO:587), which has 109 amino acids. Isoform 205 comprises 3 transmembrane domains and a truncated intracellular domain from isoform CD47-202. The best supporting mRNA for the transcript model is flagged as suspect or the support is from multiple ESTs.


Transcript CD47-206 (SEQ ID NO:594) encodes isoform CD47-206 (SEQ ID NO:588), which has 183 amino acids. Isoform 206 comprises a truncated extracellular domain and 5 transmembrane domains from isoform CD47-202.


In some embodiments, the transgene for targeted insertion (i.e., knock-in) at a genomic locus for genetic modification as described (e.g., the B2M, TAP1, CIITA, MIC-A, and/or MIC-B loci) may encode CD47, for example, human CD47. In certain of these embodiments, the human CD47 comprises or consists of an amino acid sequence set forth in any one of SEQ ID NOs: 583-588, or is at least 80% identical (e.g., at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical) to the amino acid sequence set forth in any one of SEQ ID NOs: 583-588. In some embodiments, the human CD47 comprises or consists of an amino acid sequence set forth in SEQ ID NO:584 or is at least 80% identical (e.g., at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical) to the amino acid sequence set forth in SEQ ID NO:584. In some embodiments, the nucleotide sequence encoding CD47 corresponds to an mRNA sequence of human CD47. In some embodiments, the nucleotide sequence encoding CD47 is at least 80% identical (e.g., at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical) to the nucleotide sequence set forth in any one of SEQ ID NOs: 589-594. In some embodiments, the nucleotide sequence encoding CD47 is at least 80% identical (e.g., at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical) to the nucleotide sequence set forth in SEQ ID NO:590.


In some embodiments, the nucleotide sequence encoding CD47 is codon-optimized for expression in a mammalian cell, for example, a human cell. In some embodiments, the codon-optimized nucleotide sequence encoding CD47 is at least 80% identical (e.g., at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100% identical) to the nucleotide sequence set forth in SEQ ID NO:595.









TABLE 3







Exemplary sequences of CD47









SEQ ID NO:
Sequence
Description





583
MWPLVAALLLGSACCGSAQLLFNKTKSVEFTFCNDTV
CD47-201



VIPCFVTNMEAQNTTEVYVKWKFKGRDIYTFDGALNK
amino acid



STVPTDFSSAKIEVSQLLKGDASLKMDKSDAVSHTGN
sequence



YTCEVTELTREGETIIELKYRVVSWFSPNENILIVIFPIFA




ILLFWGQFGIKTLKYRSGGMDEKTIALLVAGLVITVIVIV




GAILFVPGEYSLKNATGLGLIVTSTGILILLHYYVFSTAI




GLTSFVIAILVIQVIAYILAVVGLSLCIAACIPMHGPLLISG




LSILALAQLLGLVYMKFVASNQKTIQPPRNN






584
MWPLVAALLLGSACCGSAQLLFNKTKSVEFTFCNDTV
CD47-202



VIPCFVTNMEAQNTTEVYVKWKFKGRDIYTFDGALNK
amino acid



STVPTDFSSAKIEVSQLLKGDASLKMDKSDAVSHTGN
sequence



YTCEVTELTREGETIIELKYRVVSWFSPNENILIVIFPIFA




ILLFWGQFGIKTLKYRSGGMDEKTIALLVAGLVITVIVIV




GAILFVPGEYSLKNATGLGLIVTSTGILILLHYYVFSTAI




GLTSFVIAILVIQVIAYILAVVGLSLCIAACIPMHGPLLISG




LSILALAQLLGLVYMKFVASNQKTIQPPRKAVEEPLNA




FKESKGMMNDE






585
QLLFNKTKSVEFTFCNDTVVIPCFVTNMEAQNTTEVYV
CD47-202



KWKFKGRDIYTFDGALNKSTVPTDFSSAKIEVSQLLKG
amino acid



DASLKMDKSDAVSHTGNYTCEVTELTREGETIIELKYR
sequence



VVSWFSPNENILIVIFPIFAILLFWGQFGIKTLKYRSGG
without



MDEKTIALLVAGLVITVIVIVGAILFVPGEYSLKNATGLG
signal



LIVTSTGILILLHYYVFSTAIGLTSFVIAILVIQVIAYILAVV
peptide



GLSLCIAACIPMHGPLLISGLSILALAQLLGLVYMKFVA




SNQKTIQPPRKAVEEPLNAFKESKGMMNDE






586
XWTESLYCGVYTNAWPSSDFRFEYLSSSTITWTSLYE
CD47-203



ICGVQRDTYSQLKGKKRQASNQKTIQPPRKAVEEPLN
amino acid



AFKESKGMMNDE
sequence





587
XKNATGLGLIVTSTGILILLHYYVFSTAIGLTSFVIAILVIQ
CD47-205



VIAYILAVVGLSLCIAACIPMHGPLLISGLSILALAQLLGL
amino acid



VYMKFVASNQKTIQPPRKAVEEPLNE
sequence





588
MEAQNTTEVYVKWKFKGRDIYTFDGALNKSTVPTDFS
CD47-206



SAKIEVSQLLKGDASLKMDKSDAVSHTGNYTCEVTEL
amino acid



TREGETIIELKYRVVSWFSPNENILIVIFPIFAILLFWGQF
sequence



GIKTLKYRSGGMDEKTIALLVAGLVITVIVIVGAILFVPG




EYSLKNATGLGLIVTSTGILILLHYYVF






589
atgtggcccctggtagcggcgctgttgctgggctcggcgtgctgcggatcag
CD47-201



ctcagctactatttaataaaacaaaatctgtagaattcacgttttgtaatgaca
CDS



ctgtcgtcattccatgctttgttactaatatggaggcacaaaacactactgaag
nucleotide



tatacgtaaagtggaaatttaaaggaagagatatttacacctttgatggagct
sequence



ctaaacaagtccactgtccccactgactttagtagtgcaaaaattgaagtctc




acaattactaaaaggagatgcctctttgaagatggataagagtgatgctgtct




cacacacaggaaactacacttgtgaagtaacagaattaaccagagaagg




tgaaacgatcatcgagctaaaatatcgtgttgtttcatggttttctccaaatgaa




aatattcttattgttattttcccaatttttgctatactcctgttctggggacagtttggt




attaaaacacttaaatatagatccggtggtatggatgagaaaacaattgcttt




acttgttgctggactagtgatcactgtcattgtcattgttggagccattcttttcgtc




ccaggtgaatattcattaaagaatgctactggccttggtttaattgtgacttcta




cagggatattaatattacttcactactatgtgtttagtacagcgattggattaac




ctccttcgtcattgccatattggttattcaggtgatagcctatatcctcgctgtggt




tggactgagtctctgtattgcggcgtgtataccaatgcatggccctcttctgattt




caggtttgagtatcttagctctagcacaattacttggactagtttatatgaaattt




gtggcttccaatcagaagactatacaacctcctaggaataactga






590
atgtggcccctggtagcggcgctgttgctgggctcggcgtgctgcggatcag
CD47-202



ctcagctactatttaataaaacaaaatctgtagaattcacgttttgtaatgaca
CDS



ctgtcgtcattccatgctttgttactaatatggaggcacaaaacactactgaag
nucleotide



tatacgtaaagtggaaatttaaaggaagagatatttacacctttgatggagct
sequence



ctaaacaagtccactgtccccactgactttagtagtgcaaaaattgaagtctc




acaattactaaaaggagatgcctctttgaagatggataagagtgatgctgtct




cacacacaggaaactacacttgtgaagtaacagaattaaccagagaagg




tgaaacgatcatcgagctaaaatatcgtgttgtttcatggttttctccaaatgaa




aatattcttattgttattttcccaatttttgctatactcctgttctggggacagtttggt




attaaaacacttaaatatagatccggtggtatggatgagaaaacaattgcttt




acttgttgctggactagtgatcactgtcattgtcattgttggagccattcttttcgtc




ccaggtgaatattcattaaagaatgctactggccttggtttaattgtgacttcta




cagggatattaatattacttcactactatgtgtttagtacagcgattggattaac




ctccttcgtcattgccatattggttattcaggtgatagcctatatcctcgctgtggt




tggactgagtctctgtattgcggcgtgtataccaatgcatggccctcttctgattt




caggtttgagtatcttagctctagcacaattacttggactagtttatatgaaattt




gtggcttccaatcagaagactatacaacctcctaggaaagctgtagaggaa




ccccttaatgcattcaaagaatcaaaaggaatgatgaatgatgaataa






591
tggactgagtctctgtattgcggcgtgtataccaatgcatggccctcttctgattt
CD47-203



caggtttgagtatcttagctctagcacaattacttggactagtttatatgaaattt
CDS



gtggggttcagagggacacctacagtcagttgaaaggcaagaagagaca
nucleotide



agcttccaatcagaagactatacaacctcctaggaaagctgtagaggaac
sequence



cccttaatgcattcaaagaatcaaaaggaatgatgaatgatgaataa






592
agatattttcttgttcaatttaaggagaggtaaatttggtatcaatagaaaaaat
CD47-204



gtttctgaaaaatttaaaccctggaaatgtatttatggcatggagtcagatgttt
CDS



cagggagagaagaacaaatcaagaagcattgcaagtatgctcatatgga
nucleotide



atgcttaaggcttgtggttaaaaaatatatatatatggctgtcaatgtcttaggct
sequence



catggtagcagcagaaatcgtaataattcttttgtcacatgggttatatccatat




tggagagaattaactcaggtgaaattaacttgtacactgtttggttttataatatt




tagagggatcacaactgactgatgtccctttgaagtaccattcttcataaatctt




tttttttcagaatgggccagccaactgtgacatcccttggatcggagatttaga




actagaaagtattctttctacattattagggaagaaaaggagttacttggcggt




tagcaatattctattttgttttgttttgtttttagagacagggtctcattatgttgacca




ggctggcctcgagctcctgggctcaagcaatgctcccacctcagcctccca




agtagctgggactacaggcatgtgccactacacctggcagtgtttattctgat




aaatacatttatgagctcaaaaatgtaactctaaaaccttatctctgaacttcc




atattaccatcagaaatttagatagttgtttagttctctttttctttgtagaacatag




atataaggcatggtttcattgaagtcagttgtatatacatgtaactatcctgatgt




tcccaaataaagctctgtatttctgcttagtttattggggaggctgctaaatgta




gtgcatcccaacccattttaccctgttctactttaaaaagaggttggcttcttgttt




ggatacaaggaccaagtcactcccccaggttcctccacagtaagggaggc




ctatttaaagccgcccatggcactaacagaaactggactcctatgagctca




gatacataactgggcctcacagggggggacagtatgtagtctaggaattg




gaaggatccattccatatcaaagaactgaagcatcgtgttgccctctcagca




gcaagagtaaggtgatgcccctgtcagttatagttcctgagttcctctgtctttg




attctttgcctattagccagctagctcaccctcttgtttatgccactgttttttatcct




attcatgccttctcacagacaacttttcttacctacagctttggactcatccttgtc




tcctttctgtttctttttcactttcccttcccatcaccaactttctgggtttttttctgtttct




tcttagagtccagtggcagggagaaacttgtcagtccagtctgttgccatttttc




ctgtttgagaaagactcaccagcttttggctggctcacagattggctttccttgg




gtcaggacccacccttttccctgccagctttggaagcttgacagaattcgagt




gtgcagtggtggtaaataaatagtaaggaacacagagcagtcctggaggc




gtgcctccatctgctgatgagaaaatccagtgctgtcatccagcccaggtcc




cagcggaatgggcctctctgttcagtaggatccccctcctgctgagtggttcat




ggcatgtttctgttcaacgcttttccatctgtaggattcttattctgtatttatttgttttt




ttgggtttttttattttttgagatggagtctcgctctgtcgcccaggctggagtgca




gtggcacgaccccagctcgctgcagcctctgcctcccaggacgagggag




atcctcccacctcagccttccacgtagctgggactacaggcatgcaccaca




ggcatgcaccaccacgccagctaatttttgtatttttggtagagacagggttgc




atcatgttgcccaggctggtcttgaatgcctgagctcaagcaatctatttgcctt




ggcctcccaaagtgctgggattacaggcatgagccaccacggccagcctt




ctcatttgttttttttataaggaagctatctcttcttccctccccaactagggtattct




ttttccctttcgtcactttgctcatgtactgtattccttcaacttcattaatgaatccat




ttggaagcagtgaaaaaggcaactcagaaagctaagaagaaatagata




gaggaatactcagagctatctgagtattttctttagtttgttagctctttggagcttt




gaaactggaaagacccagggagtgatgtggagaaagagactgagcttgt




aagacacaggagcagtgagctaagggagatggagtagtggggacaaat




tctggcacattctgtctacactctgggtagatagaggagggaggatggagc




acccatggtgggggtatgttggtgacagcattttcccaccagccagtgtaac




aagtggctgatttgggggaaagatggcataaacaaatgagagaatgtgttt




actatttgatgtagatgggttatttgcttcatttttcaaatcagtgtatataatcaag




aatattcagcatgtttgaatagactgtcagagctggaactctttcattaacatct




ctggcacctttagttttagccctgaacattttatcttaaaattaaacattaccaaa




tgccttagtttatttcatttattaaatttatattcttatttgttatttatatcagcttccaat




cagaagactatacaacctcctaggaataactgaagtgaagtgatggactcc




gatttggagagtagtaagacgtgaaaggaatacacttgtgtttaagcaccat




ggccttgatgattcactgttggggagaagaaacaagaaaagtaactggttgt




cacctatgagacccttacgtgattgttagttaagtttttattcaaagcagctgta




atttagttaataaaataattatgatctatgttgtttgcccaattgagatccagtttttt




gttgttatttttaatcaattaggggcaatagtagaatggacaatttccaagaat




gatgcctttcaggtcctagggcctctggcctctaggtaaccagtttaaattggtt




cagggtgataactacttagcactgccctggtgattacccagagatatctatga




aaaccagtggcttccatcaaacctttgccaactcaggttcacagcagctttgg




gcagttatggcagtatggcattagctgagaggtgtctgccacttctgggtcaa




tggaataataaattaagtacaggcaggaatttggttgggagcatcttgtatga




tctccgtatgatgtgatattgatggagatagtggtcctcattcttgggggttgcc




attcccacattcccccttcaacaaacagtgtaacaggtccttcccagatttag




ggtacttttattgatggatatgttttccttttattcacataaccccttgaaaccctgt




cttgtcctcctgttacttgcttctgctgtacaagatgtagcaccttttctcctctttga




acatggtctagtgacacggtagcaccagttgcaggaaggagccagacttgt




tctcagagcactgtgttcacacttttcagcaaaaatagctatggttgtaacatat




gtattcccttcctctgatttgaaggcaaaaatctacagtgtttcttcacttcttttct




gatctggggcatgaaaaaagcaagattgaaatttgaactatgagtctcctgc




atggcaacaaaatgtgtgtcaccatcaggccaacaggccagcccttgaat




ggggatttattactgttgtatctatgttgcatgataaacattcatcaccttcctcct




gtagtcctgcctcgtactccccttcccctatgattgaaaagtaaacaaaaccc




acatttcctatcctggttagaagaaaattaatgttctgacagttgtgatcgcctg




gagtacttttagacttttagcattcgttttttacctgtttgtggatgtgtgtttgtatgtg




catacgtatgagataggcacatgcatcttctgtatggacaaaggtggggtac




ctacaggagagcaaaggttaattttgtgcttttagtaaaaacatttaaatacaa




agttctttattgggtggaattatatttgatgcaaatatttgatcacttaaaactttta




aaacttctaggtaatttgccacgctttttgactgctcaccaataccctgtaaaa




atacgtaattcttcctgtttgtgtaataagatattcatatttgtagttgcattaataat




agttatttcttagtccatcagatgttcccgtgtgcctcttttatgccaaattgattgt




catatttcatgttgggaccaagtagtttgcccatggcaaacctaaatttatgac




ctgctgaggcctctcagaaaactgagcatactagcaagacagctcttcttga




aaaaaaaaatatgtatacacaaatatatacgtatatctatatatacgtatgtat




atacacacatgtatattcttccttgattgtgtagctgtccaaaataataacatat




atagagggagctgtattcctttatacaaatctgatggctcctgcagcactttttc




cttctgaaaatatttacattttgctaacctagtttgttactttaaaaatcagttttgat




gaaaggagggaaaagcagatggacttgaaaaagatccaagctcctatta




gaaaaggtatgaaaatctttatagtaaaattttttataaactaaagttgtaccttt




taatatgtagtaaactctcatttatttggggttcgctcttggatctcatccatccatt




gtgttctctttaatgctgcctgccttttgaggcattcactgccctagacaatgcca




ccagagatagtgggggaaatgccagatgaaaccaactcttgctctcactag




ttgtcagcttctctggataagtgaccacagaagcaggagtcctcctgcttggg




catcattgggccagttccttctctttaaatcagatttgtaatggctcccaaattcc




atcacatcacatttaaattgcagacagtgttttgcacatcatgtatctgttttgtcc




cataatatgctttttactccctgatcccagtttctgctgttgactcttccattcagtttt




atttattgtgtgttctcacagtgacaccatttgtccttttctgcaacaacctttcca




gctacttttgccaaattctatttgtcttctccttcaaaacattctcctttgcagttcct




cttcatctgtgtagctgctcttttgtctcttaacttaccattcctatagtactttatgc




atctctgcttagttctattagttttttggccttgctcttctccttgattttaaaattccttc




tatagctagagcttttctttctttcattctctcttcctgcagtgttttgcatacatcag




aagctaggtacataagttaaatgattgagagttggctgtatttagatttatcact




ttttaatagggtgagcttgagagttttctttctttctgttttttttttttgtttttttttttttttt




ttttttttttttttttgactaatttcacatgctctaaaaaccttcaaaggtgattatttttctc




ctggaaactccaggtccattctgtttaaatccctaagaatgtcagaattaaaat




aacagggctatcccgtaattggaaatatttcttttttcaggatgctatagtcaatt




tagtaagtgaccaccaaattgttatttgcactaacaaagctcaaaacacgat




aagtttactcctccatctcagtaataaaaattaagctgtaatcaaccttctaggt




ttctcttgtcttaaaatgggtattcaaaaatggggatctgtggtgtatgtatgga




aacacatactccttaatttacctgttgttggaaactggagaaatgattgtcggg




caaccgtttattttttattgtattttatttggttgagggatttttttataaacagttttact




tgtgtcatattttaaaattactaactgccatcacctgctggggtcctttgttaggtc




attttcagtgactaatagggataatccaggtaactttgaagagatgagcagtg




agtgaccaggcagtttttctgcctttagctttgacagttcttaattaagatcattg




aagaccagctttctcataaatttctctttttgaaaaaaagaaagcatttgtacta




agctcctctgtaagacaacatcttaaatcttaaaagtgttgttatcatgactggt




gagagaagaaaacattttgtttttattaaatggagcattatttacaaaaagcc




attgttgagaattagatcccacatcgtataaatatctattaaccattctaaataa




agagaactccagtgttgctatgtgcaagatcctctcttggagcttttttgcatag




caattaaaggtgtgctatttgtcagtagccatttttttgcagtgatttgaagacca




aagttgttttacagctgtgttaccgttaaaggtttttttttttatatgtattaaatcaat




ttatcactgtttaaagctttgaatatctgcaatctttgccaaggtacttttttatttaa




aaaaaaacataactttgtaaatattaccctgtaatattatatatacttaataaaa




cattttaagctat






593
aagaatgctactggccttggtttaattgtgacttctacagggatattaatattact
CD47-205



tcactactatgtgtttagtacagcgattggattaacctccttcgtcattgccatatt
CDS



ggttattcaggtgatagcctatatcctcgctgtggttggactgagtctctgtattg
nucleotide



cggcgtgtataccaatgcatggccctcttctgatttcaggtttgagtatcttagct
sequence



ctagcacaattacttggactagtttatatgaaatttgtggcttccaatcagaag




actatacaacctcctaggaaagctgtagaggaaccccttaatgaataa






594
atggaggcacaaaacactactgaagtatacgtaaagtggaaatttaaagg
CD47-206



aagagatatttacacctttgatggagctctaaacaagtccactgtccccactg
CDS



actttagtagtgcaaaaattgaagtctcacaattactaaaaggagatgcctct
nucleotide



ttgaagatggataagagtgatgctgtctcacacacaggaaactacacttgtg
sequence



aagtaacagaattaaccagagaaggtgaaacgatcatcgagctaaaatat




cgtgttgtttcatggttttctccaaatgaaaatattcttattgttattttcccaatttttg




ctatactcctgttctggggacagtttggtattaaaacacttaaatatagatccg




gtggtatggatgagaaaacaattgctttacttgttgctggactagtgatcactgt




cattgtcattgttggagccattcttttcgtcccaggtgaatattcattaaagaatg




ctactggccttggtttaattgtgacttctacagggatattaatattacttcactact




atgtgttt






595
atgtggcccctggtcgccgccctgttgctgggctcggcatgctgcggatcag
Codon-



ctcagctactgtttaataaaacaaaatctgtagaattcacgttttgtaacgaca
optimized



ctgtcgtgatcccatgctttgttactaatatggaggcacaaaacaccactgaa
nucleotide



gtgtacgtgaagtggaaattcaaaggcagagacatttacacctttgacggc
sequence



gccctcaacaagtccaccgtgcccactgactttagtagcgcaaaaattgag
encoding



gtcagccaattactaaaaggagatgcctctttgaagatggacaagagcgat
CD47



gctgtcagccacacagggaactacacttgtgaagtaacagagttaacccg




cgaaggtgaaacgatcatcgagctgaagtatcgagtggtgtcctggttttctc




cgaacgagaatatccttatcgtaattttcccaattttcgctatcctcctgttctgg




ggccagtttggtatcaagacactcaaatatcggtccggtgggatggatgag




aagacaattgccctgcttgttgctggactcgtgatcaccgtcatcgtgattgttg




gggccatccttttcgtcccaggggagtacagcctgaagaatgctacgggcc




tgggattaattgtgacctctacagggatactcatcctgcttcactactatgtgttc




agtaccgcgattggactgacctccttcgtcattgccatattggtgattcaggtg




atagcctacatcctcgccgtggttggcctgagtctctgtatcgcggcgtgcata




cccatgcatggccctcttctgatttcagggttgagtatcctcgcactagcacag




ttgctgggactggtttatatgaaatttgtggcctccaaccagaagactataca




gcctcctaggaaggctgtagaggagcccctgaatgcattcaaggaatcaa




aaggcatgatgaatgatgaa









Cells, Cell Populations, and Compositions Thereof

Provided herein in some aspects are cells that have been genetically edited according to various embodiments disclosed herein. In some embodiments, the cells have genetic modifications at one or more genomic loci associated with blood type, including, for example, the ABO, FUT1, and/or RHD loci. In certain of these embodiments, the cells have modified expression (e.g., reduced or no expression) of one or more of these genes. As a result of the modified expression of the ABO, FUT1, and/or RHD gene, the cells may have modified blood type.


In some embodiments, the cell is an autologous cell, i.e., obtained from the subject who will receive the cell after genetic modification. In some embodiments, the cell is an allogeneic cell, i.e., obtained from someone other than the subject who will receive the cell after genetic modification.


In some embodiments, the cell is a mesenchymal stem cell or a hematopoietic stem cell. In some embodiments, the cell is a hematopoietic cell or a blood cell, for example, a red blood cell (erythrocyte), a platelet cell (thrombocyte), a mast cell, a basophil, an eosinophil, a neutrophil, a monocyte, a natural killer (NK) cell, a natural killer T (NKT) cell, a macrophage, a lymphocyte (e.g., a T cell, a B cell), or a plasma cell.


In some embodiments, the cell is a T cell, for example, a naïve T cell, a helper T cell (CD4+), a cytotoxic T cell (CD8+), a regulatory T cell (Treg), a central memory T cell (TCM), an effector memory T cell (TEM), a stem cell memory T cell (TSCM), or any combination thereof. More specifically, the T cell can be naïve (not exposed to antigen; increased expression of CD62L, CCR7, CD28, CD3, CD127, and CD45RA, and decreased expression of CD45RO as compared to TCM), memory T cells (antigen-experienced and long-lived), or effector cells (antigen-experienced, cytotoxic). Memory T cells can be further divided into subsets of TCM (increased expression of CD62L, CCR7, CD28, CD127, CD45RO, and CD95, and decreased expression of CD54RA as compared to naïve T cells) and TEM (decreased expression of CD62L, CCR7, CD28, CD45RA, and increased expression of CD127 as compared to naïve T cells or TCM). Effector T cells refer to antigen-experienced CD8+ cytotoxic T cells that has decreased expression of CD62L, CCR7, CD28, and are positive for granzyme and perforin as compared to TCM. Helper T cells are CD4+ cells that influence the activity of other immune cells by releasing cytokines. CD4+ T cells can activate or suppress an adaptive immune response, and which of those two functions is induced will depend on the presence of other cells and signals. T cells can be collected using known techniques, and the various subpopulations or combinations thereof can be enriched or depleted by known techniques, such as by affinity binding to antibodies, flow cytometry, or immunomagnetic selection. In some embodiments, the T cell can be a primary T cell obtained from a number of sources, including peripheral blood mononuclear cells, bone marrow, lymph node tissue, cord blood, thymus tissue, tissue from a site of infection, ascites, pleural effusion, spleen tissue, and tumors. In other embodiments, the T cell can be derived or differentiated from embryonic stem cells (ESCs) or induced pluripotent cells (iPSCs). In some embodiments, T cells may be modified to delete T cell receptors, e.g., by deletion of the TRAC or TRB locus, and may also be modified to express a chimeric antigen receptor.


In some embodiments, the cell is an NK cell. NK cells (also defined as large granular lymphocytes) represent a cell lineage differentiated from the common lymphoid progenitor (which also gives rise to B lymphocytes and T lymphocytes). Unlike T-cells, NK cells do not naturally express CD3 at the plasma membrane. Importantly, NK cells do not express a TCR and typically also lack other antigen-specific cell surface receptors. NK cells' cytotoxic activity does not require sensitization but is enhanced by activation with a variety of cytokines including IL-2. NK cells are generally thought to lack appropriate or complete signaling pathways necessary for antigen-receptor-mediated signaling, and thus are not thought to be capable of antigen receptor-dependent signaling, activation and expansion. NK cells are cytotoxic, and they balance activating and inhibitory receptor signaling to modulate their cytotoxic activity. For instance, NK cells expressing CD16 may bind to the Fc domain of antibodies bound to an infected cell, resulting in NK cell activation. By contrast, activity is reduced against cells expressing high levels of MHC class I proteins. On contact with a target cell, NK cells release proteins such as perforin, and enzymes such as proteases (granzymes). Perforin can form pores in the cell membrane of a target cell, inducing apoptosis or cell lysis. In some embodiments, the NK cells can be primary NK cells obtained from a number of sources, including peripheral blood mononuclear cells, bone marrow, lymph node tissue, cord blood, thymus tissue, tissue from a site of infection, ascites, pleural effusion, spleen tissue, and tumors. In some embodiments, the NK cells can be derived or differentiated from ESCs or iPSCs. There are a number of techniques that can be used to generate NK cells from pluripotent stem cells (e.g., iPSCs). See, for example, Zhu et al., Methods Mol Biol. 2019; 2048:107-119; Knorr et al., Stem Cells Transl Med. 2013 2(4):274-83. doi: 10.5966/sctm.2012-0084; Zeng et al., Stem Cell Reports. 2017 Dec. 12; 9(6):1796-1812; Ni et al., Methods Mol Biol. 2013; 1029:33-41; Bernareggi et al., Exp Hematol. 2019 71:13-23; Shankar et al., Stem Cell Res Ther. 2020; 11(1):234, all of which are incorporated herein by reference in their entirety and specifically for the methodologies and reagents for differentiation. Differentiation can be assayed as is known in the art, generally by evaluating the presence of NK cell associated and/or specific markers, including, but not limited to, CD56, KIRs, CD16, NKp44, NKp46, NKG2D, TRAIL, CD122, CD27, CD244, NK1.1, NKG2A/C, NCR1, Ly49, CD49b, CD11b, KLRG1, CD43, CD62L, and/or CD226.


In some embodiments, the cell is an NKT cell. NKT cells are a heterogeneous group of T cells that share properties of both T cells and NK cells. Many of these cells recognize the non-polymorphic CD1d molecule, an antigen-presenting molecule that binds self and foreign lipids and glycolipids. They constitute only approximately 1% of all peripheral blood T cells. In some embodiments, the NKT cells can be primary NKT cells obtained from a number of sources, including peripheral blood mononuclear cells, bone marrow, lymph node tissue, cord blood, thymus tissue, tissue from a site of infection, ascites, pleural effusion, spleen tissue, and tumors. In some embodiments, the NKT cells can be derived or differentiated from ESCs or iPSCs.


In some embodiments, the cell is a pancreatic islet cell, including, for example, a β cell (also referred to as beta cell or β islet cell). Exemplary pancreatic islet cell types include, but are not limited to, pancreatic islet progenitor cells, immature pancreatic islet cells, mature pancreatic islet cells, and the like. In some embodiments, the β islet cells can be primary β islet cells. In some embodiments, the β islet cells can be derived or differentiated from ESCs or iPSCs. Useful method for differentiating pluripotent stem cells into pancreatic islet cells are disclosed, for example, in U.S. Pat. Nos. 9,683,215; 9,157,062; and 8,927,280. In some embodiments, the β islet cells genetically engineered by the methods as disclosed herein secretes insulin. In some embodiments, a β islet cell exhibits at least two characteristics of an endogenous pancreatic islet cell, for example, but not limited to, secretion of insulin in response to an increase in glucose, and expression of beta cell markers. In some embodiments, the β islet cells disclosed herein are administered to a subject to treat diabetes. Exemplary p cell markers or p cell progenitor markers include, but are not limited to, c-peptide, Pdxl, glucose transporter 2 (Glut2), HNF6, VEGF, glucokinase (GCK), prohormone convertase (PC 1/3), Cdcpl, NeuroD, Ngn3, Nkx2.2, Nkx6.1, Nkx6.2, Pax4, Pax6, Ptfla, Isll, Sox9, Soxl7, and FoxA2. In some embodiments, the PSCs are differentiated into beta-like cells or islet organoids for transplantation to address type I diabetes mellitus (T1DM). Cell systems are a promising way to address T1DM, see, e.g., Ellis et al., Nat Rev Gastroenterol Hepatol. 2017 October; 14(10):612-628, incorporated herein by reference. Additionally, Pagliuca et al. (Cell, 2014, 159(2):428-39) reports on the successful differentiation of β cells from hiPSCs, the contents incorporated herein by reference in its entirety and in particular for the methods and reagents disclosed there for the large-scale production of functional human β cells from human pluripotent stem cells). Furthermore, Vegas et al. shows the production of human β cells from human pluripotent stem cells followed by encapsulation to avoid immune rejection by the recipient; Vegas et al., Nat Med, 2016, 22(3):306-11, incorporated herein by reference in its entirety and in particular for the methods and reagents disclosed there for the large-scale production of functional human B cells from human pluripotent stem cells. Additional disclosure of pancreatic islet cells including pancreatic β islet cells for use in the present technology are found in WO2020/018615, the disclosure is herein incorporated by reference in its entirety.


In some embodiments, the cell is a pluripotent stem cell, for example, an ESC or iPSC. In some embodiments, the cell is a cell differentiated from pluripotent stem cells, e.g., ESCs or iPSCs. ESCs and iPSCs have the ability to differentiated into any cell type of the body, including, for example, neurons, astrocytes, oligodendrocytes, retinal epithelial cells, epidermal cells, hair cells, keratinocytes, hepatocytes, pancreatic β islet cells, intestinal epithelial cells, lung alveolar cells, hematopoietic cells, endothelial cells, cardiomyocytes, smooth muscle cells, skeletal muscle cells, renal cells, adipocytes, chondrocytes, thyroid cell, NK cells, NKT cells, macrophages, T cells, B cells, and osteocytes.


In some embodiments, the cell is a primary cell, including, for example, neurons, astrocytes, oligodendrocytes, retinal epithelial cells, epidermal cells, hair cells, keratinocytes, hepatocytes, pancreatic β islet cells, intestinal epithelial cells, lung alveolar cells, hematopoietic cells, mesenchymal stem cells, hematopoietic stem cells, endothelial cells, cardiomyocytes, smooth muscle cells, skeletal muscle cells, renal cells, adipocytes, chondrocytes, thyroid cells, NK cells, NKT cells, macrophages, T cells, B cells, and osteocytes. In some embodiments, the cell is a cardiomyocyte, a retinal pigment epithelial cell (RPE), an endothelial cell, a β islet cell, or a glial progenitor cell (GPC).


In some aspects, the present technology provides pharmaceutical compositions comprising a cell according to various embodiments disclosed herein.


In some embodiments, the compositions can have various formulations, for example, injectable formulations, lyophilized formulations, liquid formulations, oral formulations, etc., depending on the suitable routes of administration.


In some embodiments, the compositions can be co-formulated in the same dosage unit or can be individually formulated in separate dosage units. The terms “dose unit” and “dosage unit” herein refer to a portion of a pharmaceutical composition that contains an amount of a therapeutic agent suitable for a single administration to provide a therapeutic effect. Such dosage units may be administered one to a plurality (i.e., 1 to 10, 1 to 8, 1 to 6, 1 to 4, or 1 to 2) of times per day, or as many times as needed to elicit a therapeutic response.


In some embodiments, a single dosage unit includes at least about 1×102, 5×102, 1×103, 5×103, 1×104, 5×104, 1×105, 5×105, 1×106, 5×106, 1×107, 5×107, 1×108, 5×108, 1×109, 5×109, 1×1010, or 5×1010 cells.


Methods of Treatment

In some aspects, provided are methods for treating and/or preventing a disease in a subject in need thereof. The method entails obtaining a cell from the subject (autologous) or from a donor (allogeneic); genetically modifying the cell, including targeted gene editing at one or more genomic loci associated with blood types, according to various embodiments disclosed herein; and administering to the subject a therapeutically effective amount of the genetically modified cell or a pharmaceutical composition containing the same.


In some embodiments, the disease is cancer. In some embodiments, the cancer is a hematologic malignancy. Non-limiting examples of hematologic malignancies include myeloid neoplasm, myelodysplastic syndromes (MDS), myeloproliferative/myelodysplastic syndromes, acute lymphoid leukemia (ALL), chronic lymphocytic leukemia (CLL), acute myeloid leukemia (AML), chronic myelogenous leukemia (CML), B cell acute lymphoid leukemia (B-ALL), T cell acute lymphoid leukemia (T-ALL), T cell lymphoma, and B cell lymphoma.


In some embodiments, the disease is an autoimmune disease, including, for example, lupus, systemic lupus erythematosus, rheumatoid arthritis, psoriasis, psoriatic arthritis, multiple sclerosis, Crohn's disease, ulcerative colitis, Addison's disease, Graves' disease, Sjögren's syndrome, Hashimoto's thyroiditis, and celiac disease.


In some embodiments, the disease is diabetes mellitus, including, for example, Type I diabetes, Type II diabetes, prediabetes, and gestational diabetes.


In some embodiments, the disease is a neurological disease, including, for example, catalepsy, epilepsy, encephalitis, meningitis, migraine, Huntington's, Alzheimer's, Parkinson's, Pelizaeus-Merzbacher disease, and multiple sclerosis.


In some embodiments, the disease is a cardiac disease or disorder, i.e., a condition and/or disorder relating to the heart, including the valves, endothelium, infarcted zones, or other components or structures of the heart. Cardiac diseases or disorders include, for example, pediatric cardiomyopathy, age-related cardiomyopathy, dilated cardiomyopathy, hypertrophic cardiomyopathy, restrictive cardiomyopathy, chronic ischemic cardiomyopathy, peripartum cardiomyopathy, inflammatory cardiomyopathy, idiopathic cardiomyopathy, other cardiomyopathy, myocardial ischemic reperfusion injury, ventricular dysfunction, heart failure, congestive heart failure, coronary artery disease, end-stage heart disease, atherosclerosis, ischemia, hypertension, restenosis, angina pectoris, rheumatic heart, arterial inflammation, cardiovascular disease, myocardial infarction, myocardial ischemia, congestive heart failure, myocardial infarction, cardiac ischemia, cardiac injury, myocardial ischemia, vascular disease, acquired heart disease, congenital heart disease, atherosclerosis, coronary artery disease, dysfunctional conduction systems, dysfunctional coronary arteries, pulmonary hypertension, cardiac arrhythmias, muscular dystrophy, muscle mass abnormality, muscle degeneration, myocarditis, infective myocarditis, drug- or toxin-induced muscle abnormalities, hypersensitivity myocarditis, cardiomegaly, mitral insufficiency, and autoimmune endocarditis.


In some embodiments, the genetically modified cell or pharmaceutical composition containing the same according to the present technology may be administered in a manner appropriate to the disease, condition, or disorder to be treated as determined by persons skilled in the medical art. In any of the above embodiments, the genetically modified cell may be administered intravenously, intraperitoneally, intratumorally, into the bone marrow, into a lymph node, or into the cerebrospinal fluid, so as to encounter the target antigen or cells. An appropriate dose, suitable duration, and frequency of administration of the compositions will be determined by such factors as a condition of the patient; size, type, and severity of the disease, condition, or disorder; the undesired type or level or activity of the tagged cells, the particular form of the active ingredient; and the method of administration.


In some embodiments, the amount of genetically modified cells of the present technology in a pharmaceutical composition is typically greater than 102 cells, for example, about 1×102, 5×102, 1×103, 5×103, 1×104, 5×104, 1×105, 5×105, 1×106, 5×106, 1×107, 5×107, 1×108, 5×108, 1×109, 5×109, 1×1010, 5×1010 cells, or more.


In some embodiments, the methods comprise administering to the subject the genetically modified cells or pharmaceutical composition containing the same once a day, twice a day, three times a day, or four times a day for a period of about 3 days, about 5 days, about 7 days, about 10 days, about 2 weeks, about 3 weeks, about 4 weeks, about 1 month, about 2 months, about 3 months, about 4 months, about 5 months, about 6 months, about 7 months, about 8 months, about 9 months, about 10 months, about 11 months, about 1 year, about 1.25 years, about 1.5 years, about 1.75 years, about 2 years, about 2.25 years, about 2.5 years, about 2.75 years, about 3 years, about 3.25 years, about 3.5 years, about 3.75 years, about 4 years, about 4.25 years, about 4.5 years, about 4.75 years, about 5 years, or more than about 5 years. In some embodiments, the genetically modified cells or pharmaceutical composition containing the same can be administered every day, every other day, every third day, weekly, biweekly (i.e., every other week), every third week, monthly, every other month, or every third month.


In some embodiments, the genetically modified cells or pharmaceutical composition containing the same may be administered over a pre-determined time period. Alternatively, the genetically modified cell or pharmaceutical composition containing the same may be administered until a particular therapeutic benchmark is reached. In some embodiments, the methods provided herein include a step of evaluating one or more therapeutic benchmarks in a biological sample, such as, but not limited to, the level of a disease related biomarker, to determine whether to continue administration of the genetically modified cell or pharmaceutical composition containing the same.


In some embodiments, the methods further comprise administering the subject a pharmaceutically effective amount of one or more additional therapeutic agents to obtain improved or synergistic therapeutic effects. In some embodiments, the one or more additional therapeutic agents are selected from the group consisting of an immunotherapy agent, a chemotherapy agent, and a biologic agent. In some embodiments, the subject was administered the one or more additional therapeutic agents before administration of the genetically modified cells or pharmaceutical composition containing the same. In some embodiments, the subject is co-administered the one or more additional therapeutic agents and the genetically modified cells or pharmaceutical composition containing the same. In some embodiments, the subject was administered the one or more additional therapeutic agents after administration of the genetically modified cells or pharmaceutical composition containing the same.


As one of ordinary skill in the art would understand, the one or more additional therapeutic agents and the genetically modified cells or pharmaceutical composition containing the same can be administered to a subject in need thereof one or more times at the same or different doses, depending on the diagnosis and prognosis of the subject. One skilled in the art would be able to combine one or more of these therapies in different orders to achieve the desired therapeutic results. In some embodiments, the combinational therapy achieves improved or synergistic effects in comparison to any of the treatments administered alone.


The above detailed description of embodiments of the technology are not intended to be exhaustive or to limit the technology to the precise forms disclosed above. Although specific embodiments of, and examples for, the technology are described above for illustrative purposes, various equivalent modifications are possible within the scope of the technology as those skilled in the relevant art will recognize. For example, although steps are presented in a given order, alternative embodiments may perform steps in a different order. The various embodiments described herein may also be combined to provide further embodiments.


From the foregoing, it will be appreciated that specific embodiments of the technology have been described herein for purposes of illustration, but well-known components and functions have not been shown or described in detail to avoid unnecessarily obscuring the description of the embodiments of the technology. Where the context permits, singular or plural terms may also include the plural or singular term, respectively. Further, while advantages associated with some embodiments of the technology have been described in the context of those embodiments, other embodiments may also exhibit such advantages, and not all embodiments need necessarily exhibit such advantages to fall within the scope of the technology. Accordingly, the disclosure and associated technology can encompass other embodiments not expressly shown or described herein.









TABLE 4







Exemplary ABO, FUT1, and RHD target sequences and cut sites for


targeted gene editing












SEQ
Target sequence
Target


Genomic


ID NO:
(5′→3′)
Gene
Strand
PAM
Cut Site















20
ggccagcgtccgcaacacct
ABO
−1
cgg
133,275,179





21
ggccgaggtgttgcggacgc
ABO
1
tgg
133,275,170





22
ggtcgaagtgcgtggcattt
ABO
−1
tgg
133,262,157





23
ggatcataggtcgaagtgcg
ABO
−1
tgg
133,262,149





24
attattaggaaaaggatcat
ABO
−1
agg
133,262,136





25
agacaagcattattaggaaa
ABO
−1
agg
133,262,128





26
agaccaagacaagcattatt
ABO
−1
agg
133,262,122





27
tttcctaataatgcttgtct
ABO
1
tgg
133,262,114





28
atgcttgtcttggtcttgtt
ABO
1
tgg
133,262,104





29
gcattagacttctggggctt
ABO
−1
agg
133,261,359





30
ttcctggcattagacttctg
ABO
−1
ggg
133,261,353





31
cttcctggcattagacttct
ABO
−1
ggg
133,261,352





32
gcttcctggcattagacttc
ABO
−1
tgg
133,261,351





33
agccccagaagtctaatgcc
ABO
1
agg
133,261,344





34
aacccccgttccaggcttcc
ABO
−1
tgg
133,261,337





35
aagtctaatgccaggaagcc
ABO
1
tgg
133,261,336





36
taatgccaggaagcctggaa
ABO
1
cgg
133,261,331





37
aatgccaggaagcctggaac
ABO
1
ggg
133,261,330





38
atgccaggaagcctggaacg
ABO
1
ggg
133,261,329





39
tgccaggaagcctggaacgg
ABO
1
ggg
133,261,328





40
gagacgcgctgcagatggtc
ABO
−1
agg
133,259,844





41
gcaacgagacgcgctgcaga
ABO
−1
tgg
133,259,839





42
gtgtcagcacctttggctgg
ABO
−1
ggg
133,258,117





43
ggtgtcagcacctttggctg
ABO
−1
ggg
133,258,116





44
gatggtctacccccagccaa
ABO
1
agg
133,258,115





45
cggtgtcagcacctttggct
ABO
−1
ggg
133,258,115





46
acggtgtcagcacctttggc
ABO
−1
tgg
133,258,114





47
gacaatgggagccagccaag
ABO
−1
ggg
133,257,514





48
agacaatgggagccagccaa
ABO
−1
ggg
133,257,513





49
cagacaatgggagccagcca
ABO
−1
agg
133,257,512





50
cttggctggctcccattgtc
ABO
1
tgg
133,257,500





51
aatgtgccctcccagacaat
ABO
−1
ggg
133,257,500





52
ttggctggctcccattgtct
ABO
1
ggg
133,257,499





53
gaatgtgccctcccagacaa
ABO
−1
tgg
133,257,499





54
gctggctcccattgtctggg
ABO
1
agg
133,257,496





55
ctggctcccattgtctggga
ABO
1
ggg
133,257,495





56
ggagcctgaactgctcgttg
ABO
−1
agg
133,257,465





57
acatcctcaacgagcagttc
ABO
1
agg
133,257,458





58
ttaacccaatggtggtgttc
ABO
−1
tgg
133,257,444





59
aggctccagaacaccaccat
ABO
1
tgg
133,257,438





60
ggctccagaacaccaccatt
ABO
1
ggg
133,257,437





61
aaacacagttaacccaatgg
ABO
−1
tgg
133,257,436





62
ggcaaacacagttaacccaa
ABO
−1
tgg
133,257,433





63
ccgtctccaggaacagcttc
ABO
−1
agg
133,256,338





64
ggctttcctgaagctgttcc
ABO
1
tgg
133,256,333





65
cctgaagctgttcctggaga
ABO
1
cgg
133,256,327





66
agtgcttctccgccgtctcc
ABO
−1
agg
133,256,326





67
gaagctgttcctggagacgg
ABO
1
cgg
133,256,324





68
gacggcggagaagcacttca
ABO
1
tgg
133,256,309





69
ggcggagaagcacttcatgg
ABO
1
tgg
133,256,306





70
gcggagaagcacttcatggt
ABO
1
ggg
133,256,305





71
agacatagtagtggacacgg
ABO
−1
tgg
133,256,293





72
tgaagacatagtagtggaca
ABO
−1
cgg
133,256,290





73
ggtcggtgaagacatagtag
ABO
−1
tgg
133,256,284





74
ctatgtcttcaccgaccagc
ABO
1
cgg
133,256,267





75
gggcaccgcggccggctggt
ABO
−1
cgg
133,256,267





76
cgcggggcaccgcggccggc
ABO
−1
tgg
133,256,263





77
cttcaccgaccagccggccg
ABO
1
cgg
133,256,261





78
gtcacgcggggcaccgcggc
ABO
−1
cgg
133,256,259





79
cagcgtcacgcggggcaccg
ABO
−1
cgg
133,256,255





80
ccggtccccagcgtcacgcg
ABO
−1
ggg
133,256,247





81
accggtccccagcgtcacgc
ABO
−1
ggg
133,256,246





82
gaccggtccccagcgtcacg
ABO
−1
cgg
133,256,245





83
cgcggtgccccgcgtgacgc
ABO
1
tgg
133,256,243





84
gcggtgccccgcgtgacgct
ABO
1
ggg
133,256,242





85
cggtgccccgcgtgacgctg
ABO
1
ggg
133,256,241





86
ccccgcgtgacgctggggac
ABO
1
cgg
133,256,236





87
gcgtgacgctggggaccggt
ABO
1
cgg
133,256,232





88
cagcactgacagctgccgac
ABO
−1
cgg
133,256,228





89
cggtcggcagctgtcagtgc
ABO
1
tgg
133,256,216





90
tcggcagctgtcagtgctgg
ABO
1
agg
133,256,213





91
cacgtcctgccagcgcttgt
ABO
−1
agg
133,256,195





92
aggtgcgcgcctacaagcgc
ABO
1
tgg
133,256,193





93
gcgcgcctacaagcgctggc
ABO
1
agg
133,256,189





94
gatcatctccatgcggcgca
ABO
−1
tgg
133,256,171





95
ggacgtgtccatgcgccgca
ABO
1
tgg
133,256,168





96
agtcactgatcatctccatg
ABO
−1
cgg
133,256,164





97
tgatcagtgacttctgcgag
ABO
1
cgg
133,256,142





98
cgagcggcgcttcctcagcg
ABO
1
agg
133,256,126





99
ccaggtaatccacctcgctg
ABO
−1
agg
133,256,125





100
gcggcgcttcctcagcgagg
ABO
1
tgg
133,256,123





101
cctcagcgaggtggattacc
ABO
1
tgg
133,256,114





102
tgtccacgtccacgcacacc
ABO
−1
agg
133,256,107





103
ggtggattacctggtgtgcg
ABO
1
tgg
133,256,105





104
ttacctggtgtgcgtggacg
ABO
1
tgg
133,256,099





105
ggtgtgcgtggacgtggaca
ABO
1
tgg
133,256,093





106
tctccacgcccacgtggtcg
ABO
−1
cgg
133,256,077





107
catggagttccgcgaccacg
ABO
1
tgg
133,256,075





108
atggagttccgcgaccacgt
ABO
1
ggg
133,256,074





109
tcaggatctccacgcccacg
ABO
−1
tgg
133,256,071





110
gttccgcgaccacgtgggcg
ABO
1
tgg
133,256,069





111
gggtgccgaacagcggagtc
ABO
−1
agg
133,256,053





112
gagatcctgactccgctgtt
ABO
1
cgg
133,256,047





113
gggtgcagggtgccgaacag
ABO
−1
cgg
133,256,046





114
tccgtagaagccggggtgca
ABO
−1
ggg
133,256,033





115
ctgttcggcaccctgcaccc
ABO
1
cgg
133,256,032





116
ttccgtagaagccggggtgc
ABO
−1
agg
133,256,032





117
ggctgcttccgtagaagccg
ABO
−1
ggg
133,256,026





118
cggctgcttccgtagaagcc
ABO
−1
ggg
133,256,025





119
ccggctgcttccgtagaagc
ABO
−1
cgg
133,256,024





120
accctgcaccccggcttcta
ABO
1
cgg
133,256,023





121
ccggcttctacggaagcagc
ABO
1
cgg
133,256,013





122
cggcttctacggaagcagcc
ABO
1
ggg
133,256,012





123
cttctacggaagcagccggg
ABO
1
agg
133,256,009





124
gctcgtaggtgaaggcctcc
ABO
−1
cgg
133,256,005





125
gggccggcgctcgtaggtga
ABO
−1
agg
133,255,997





126
ggactggggccggcgctcgt
ABO
−1
agg
133,255,991





127
aggccttcacctacgagcgc
ABO
1
cgg
133,255,989





128
tgtaggcctgggactggggc
ABO
−1
cgg
133,255,981





129
gggatgtaggcctgggactg
ABO
−1
ggg
133,255,977





130
cgagcgccggccccagtccc
ABO
1
agg
133,255,976





131
ggggatgtaggcctgggact
ABO
−1
ggg
133,255,976





132
tggggatgtaggcctgggac
ABO
−1
tgg
133,255,975





133
gtccttggggatgtaggcct
ABO
−1
ggg
133,255,970





134
cgtccttggggatgtaggcc
ABO
−1
tgg
133,255,969





135
gccctcgtccttggggatgt
ABO
−1
agg
133,255,964





136
gtcccaggcctacatcccca
ABO
1
agg
133,255,961





137
agaaatcgccctcgtccttg
ABO
−1
ggg
133,255,957





138
tagaaatcgccctcgtcctt
ABO
−1
ggg
133,255,956





139
ggcctacatccccaaggacg
ABO
1
agg
133,255,955





140
gtagaaatcgccctcgtcct
ABO
−1
tgg
133,255,955





141
gcctacatccccaaggacga
ABO
1
ggg
133,255,954





142
cgagggcgatttctactacc
ABO
1
tgg
133,255,937





143
gagggcgatttctactacct
ABO
1
ggg
133,255,936





144
agggcgatttctactacctg
ABO
1
ggg
133,255,935





145
gggcgatttctactacctgg
ABO
1
ggg
133,255,934





146
ggcgatttctactacctggg
ABO
1
ggg
133,255,933





147
gcgatttctactacctgggg
ABO
1
ggg
133,255,932





148
accccccgaagaaccccccc
ABO
−1
agg
133,255,930





149
tactacctgggggggttctt
ABO
1
cgg
133,255,924





150
actacctgggggggttcttc
ABO
1
ggg
133,255,923





151
ctacctgggggggttcttcg
ABO
1
ggg
133,255,922





152
tacctgggggggttcttcgg
ABO
1
ggg
133,255,921





153
acctgggggggttcttcggg
ABO
1
ggg
133,255,920





154
gggggggttcttcggggggt
ABO
1
cgg
133,255,916





155
cttcggggggtcggtgcaag
ABO
1
agg
133,255,907





156
ggtcggtgcaagaggtgcag
ABO
1
cgg
133,255,899





157
aagaggtgcagcggctcacc
ABO
1
agg
133,255,890





158
agaggtgcagcggctcacca
ABO
1
ggg
133,255,889





159
catggcctggtggcaggccc
ABO
−1
tgg
133,255,883





160
gctcaccagggcctgccacc
ABO
1
agg
133,255,877





161
gaccatcatggcctggtggc
ABO
−1
agg
133,255,877





162
ggtcgaccatcatggcctgg
ABO
−1
tgg
133,255,873





163
cctggtcgaccatcatggcc
ABO
−1
tgg
133,255,870





164
ggcctgccaccaggccatga
ABO
1
tgg
133,255,868





165
gttggcctggtcgaccatca
ABO
−1
tgg
133,255,865





166
ccaggccatgatggtcgacc
ABO
1
agg
133,255,859





167
atgatggtcgaccaggccaa
ABO
1
cgg
133,255,852





168
cggcctcgatgccgttggcc
ABO
−1
tgg
133,255,852





169
ccacacggcctcgatgccgt
ABO
−1
tgg
133,255,847





170
cgaccaggccaacggcatcg
ABO
1
agg
133,255,844





171
ccaacggcatcgaggccgtg
ABO
1
tgg
133,255,836





172
gtggctctcgtcgtgccaca
ABO
−1
cgg
133,255,832





173
gcagcaggtacttgttcagg
ABO
−1
tgg
133,255,813





174
ggcgcagcaggtacttgttc
ABO
−1
agg
133,255,810





175
tggtgggtttgtggcgcagc
ABO
−1
agg
133,255,798





176
agagcaccttggtgggtttg
ABO
−1
tgg
133,255,789





177
gctgcgccacaaacccacca
ABO
1
agg
133,255,784





178
tcgggggagagcaccttggt
ABO
−1
ggg
133,255,782





179
ctcgggggagagcaccttgg
ABO
−1
tgg
133,255,781





180
gtactcgggggagagcacct
ABO
−1
tgg
133,255,778





181
ctggtcccacaagtactcgg
ABO
−1
ggg
133,255,766





182
gctggtcccacaagtactcg
ABO
−1
ggg
133,255,765





183
tgctggtcccacaagtactc
ABO
−1
ggg
133,255,764





184
ctgctggtcccacaagtact
ABO
−1
cgg
133,255,763





185
tgctctcccccgagtacttg
ABO
1
tgg
133,255,761





186
gctctcccccgagtacttgt
ABO
1
ggg
133,255,760





187
cgggccagcccagcagctgc
ABO
−1
tgg
133,255,747





188
cttgtgggaccagcagctgc
ABO
1
tgg
133,255,745





189
ttgtgggaccagcagctgct
ABO
1
ggg
133,255,744





190
gggaccagcagctgctgggc
ABO
1
tgg
133,255,740





191
ctcagcttcctcaggacggc
ABO
−1
ggg
133,255,728





192
cctcagcttcctcaggacgg
ABO
−1
cgg
133,255,727





193
tgggctggcccgccgtcctg
ABO
1
agg
133,255,725





194
gaacctcagcttcctcagga
ABO
−1
cgg
133,255,724





195
cagtgaacctcagcttcctc
ABO
−1
agg
133,255,720





196
ccgccgtcctgaggaagctg
ABO
1
agg
133,255,716





197
gaggaagctgaggttcactg
ABO
1
cgg
133,255,706





198
cggaccgcctggtggttctt
ABO
−1
ggg
133,255,692





199
ccggaccgcctggtggttct
ABO
−1
tgg
133,255,691





200
tgcggtgcccaagaaccacc
ABO
1
agg
133,255,688





201
ggtgcccaagaaccaccagg
ABO
1
cgg
133,255,685





202
acgggttccggaccgcctgg
ABO
−1
tgg
133,255,684





203
ccaagaaccaccaggcggtc
ABO
1
cgg
133,255,680





204
ggcagagctgacgatggctc
FUT1
−1
cgg
48,748,518





205
aggccaggcagagctgacga
FUT1
−1
tgg
48,748,512





206
gagccatcgtcagctctgcc
FUT1
1
tgg
48,748,504





207
cacagactagcaggaaggcc
FUT1
−1
agg
48,748,497





208
gaggacacagactagcagga
FUT1
−1
agg
48,748,492





209
cagagaggacacagactagc
FUT1
−1
agg
48,748,488





210
ggaggaagaagattacagag
FUT1
−1
agg
48,748,473





211
agctgtcttgatggatatgg
FUT1
−1
agg
48,748,455





212
gaaagctgtcttgatggata
FUT1
−1
tgg
48,748,452





213
catgtggaaagctgtcttga
FUT1
−1
tgg
48,748,446





214
catcaagacagctttccaca
FUT1
1
tgg
48,748,434





215
atcgacaggcctaggccatg
FUT1
−1
tgg
48,748,430





216
gacagctttccacatggcct
FUT1
1
agg
48,748,428





217
gacacaggatcgacaggcct
FUT1
−1
agg
48,748,422





218
ggtctggacacaggatcgac
FUT1
−1
agg
48,748,416





219
ccaggcggcggtctggacac
FUT1
−1
agg
48,748,407





220
ggtgtcaccaggcggcggtc
FUT1
−1
tgg
48,748,400





221
cctgtgtccagaccgccgcc
FUT1
1
tgg
48,748,396





222
ctgggggtgtcaccaggcgg
FUT1
−1
cgg
48,748,395





223
ccactgggggtgtcaccagg
FUT1
−1
cgg
48,748,392





224
tggccactgggggtgtcacc
FUT1
−1
agg
48,748,389





225
ccgcctggtgacacccccag
FUT1
1
tgg
48,748,381





226
aggcagaagatggccactgg
FUT1
−1
ggg
48,748,379





227
caggcagaagatggccactg
FUT1
−1
ggg
48,748,378





228
gcaggcagaagatggccact
FUT1
−1
ggg
48,748,377





229
ggcaggcagaagatggccac
FUT1
−1
tgg
48,748,376





230
agtacccggcaggcagaaga
FUT1
−1
tgg
48,748,369





231
agtggccatcttctgcctgc
FUT1
1
cgg
48,748,363





232
gtggccatcttctgcctgcc
FUT1
1
ggg
48,748,362





233
ggcccatcgcagtacccggc
FUT1
−1
agg
48,748,359





234
ttggggcccatcgcagtacc
FUT1
−1
cgg
48,748,355





235
ctgcctgccgggtactgcga
FUT1
1
tgg
48,748,351





236
tgcctgccgggtactgcgat
FUT1
1
ggg
48,748,350





237
gacaggaagaggaggcgttg
FUT1
−1
ggg
48,748,338





238
ggacaggaagaggaggcgtt
FUT1
−1
ggg
48,748,337





239
gggacaggaagaggaggcgt
FUT1
−1
tgg
48,748,336





240
gtgctggggacaggaagagg
FUT1
−1
agg
48,748,330





241
agggtgctggggacaggaag
FUT1
−1
agg
48,748,327





242
ggaagcagggtgctggggac
FUT1
−1
agg
48,748,321





243
gagagggaagcagggtgctg
FUT1
−1
ggg
48,748,316





244
ggagagggaagcagggtgct
FUT1
−1
ggg
48,748,315





245
cggagagggaagcagggtgc
FUT1
−1
tgg
48,748,314





246
aggtgccggagagggaagca
FUT1
−1
ggg
48,748,308





247
caggtgccggagagggaagc
FUT1
−1
agg
48,748,307





248
cagcaccctgcttccctctc
FUT1
1
cgg
48,748,302





249
gacagtccaggtgccggaga
FUT1
−1
ggg
48,748,300





250
agacagtccaggtgccggag
FUT1
−1
agg
48,748,299





251
ctgcttccctctccggcacc
FUT1
1
tgg
48,748,295





252
ggggtagacagtccaggtgc
FUT1
−1
cgg
48,748,294





253
gccattggggtagacagtcc
FUT1
−1
agg
48,748,288





254
acctggactgtctaccccaa
FUT1
1
tgg
48,748,278





255
gattaccaaaccggccattg
FUT1
−1
ggg
48,748,275





256
ggactgtctaccccaatggc
FUT1
1
cgg
48,748,274





257
tgattaccaaaccggccatt
FUT1
−1
ggg
48,748,274





258
ctgattaccaaaccggccat
FUT1
−1
tgg
48,748,273





259
gtctaccccaatggccggtt
FUT1
1
tgg
48,748,269





260
gtcccatctgattaccaaac
FUT1
−1
cgg
48,748,266





261
tggccggtttggtaatcaga
FUT1
1
tgg
48,748,258





262
ggccggtttggtaatcagat
FUT1
1
ggg
48,748,257





263
gggacagtatgccacgctgc
FUT1
1
tgg
48,748,237





264
ctgggccagagccagcagcg
FUT1
−1
tgg
48,748,237





265
gtatgccacgctgctggctc
FUT1
1
tgg
48,748,231





266
ggcccggcggccgttgagct
FUT1
−1
ggg
48,748,219





267
ctggctctggcccagctcaa
FUT1
1
cgg
48,748,218





268
aggcccggcggccgttgagc
FUT1
−1
tgg
48,748,218





269
tggcccagctcaacggccgc
FUT1
1
cgg
48,748,211





270
ggcccagctcaacggccgcc
FUT1
1
ggg
48,748,210





271
caggcaggataaaggcccgg
FUT1
−1
cgg
48,748,206





272
tggcaggcaggataaaggcc
FUT1
−1
cgg
48,748,203





273
atgcatggcaggcaggataa
FUT1
−1
agg
48,748,198





274
gggcggcatgcatggcaggc
FUT1
−1
agg
48,748,191





275
gccagggcggcatgcatggc
FUT1
−1
agg
48,748,187





276
cggggccagggcggcatgca
FUT1
−1
tgg
48,748,183





277
gcctgccatgcatgccgccc
FUT1
1
tgg
48,748,177





278
gcggaataccggggccaggg
FUT1
−1
cgg
48,748,174





279
catgcatgccgccctggccc
FUT1
1
cgg
48,748,171





280
gatgcggaataccggggcca
FUT1
−1
ggg
48,748,171





281
tgatgcggaataccggggcc
FUT1
−1
agg
48,748,170





282
cagggtgatgcggaataccg
FUT1
−1
ggg
48,748,165





283
gcagggtgatgcggaatacc
FUT1
−1
ggg
48,748,164





284
ggcagggtgatgcggaatac
FUT1
−1
cgg
48,748,163





285
ccagcacgggcagggtgatg
FUT1
−1
cgg
48,748,155





286
ttctggggccagcacgggca
FUT1
−1
ggg
48,748,147





287
cttctggggccagcacgggc
FUT1
−1
agg
48,748,146





288
ccgcatcaccctgcccgtgc
FUT1
1
tgg
48,748,144





289
tccacttctggggccagcac
FUT1
−1
ggg
48,748,142





290
gtccacttctggggccagca
FUT1
−1
cgg
48,748,141





291
gcccgtgctggccccagaag
FUT1
1
tgg
48,748,132





292
cgtgcggctgtccacttctg
FUT1
−1
ggg
48,748,132





293
gcgtgcggctgtccacttct
FUT1
−1
ggg
48,748,131





294
ggcgtgcggctgtccacttc
FUT1
−1
tgg
48,748,130





295
gcagctcccgccacggcgtg
FUT1
−1
cgg
48,748,116





296
aagtggacagccgcacgccg
FUT1
1
tgg
48,748,115





297
tggacagccgcacgccgtgg
FUT1
1
cgg
48,748,112





298
ggacagccgcacgccgtggc
FUT1
1
ggg
48,748,111





299
tgaagctgcagctcccgcca
FUT1
−1
cgg
48,748,109





300
gggagctgcagcttcacgac
FUT1
1
tgg
48,748,091





301
gcagcttcacgactggatgt
FUT1
1
cgg
48,748,084





302
gcttcacgactggatgtcgg
FUT1
1
agg
48,748,081





303
ctggatgtcggaggagtacg
FUT1
1
cgg
48,748,072





304
aagccagagagcttcaggaa
FUT1
−1
agg
48,748,049





305
aggggaagccagagagcttc
FUT1
−1
agg
48,748,044





306
gatcctttcctgaagctctc
FUT1
1
tgg
48,748,041





307
ggaagaaagtccaagagcag
FUT1
−1
ggg
48,748,026





308
tctctggcttcccctgctct
FUT1
1
tgg
48,748,025





309
tggaagaaagtccaagagca
FUT1
−1
ggg
48,748,025





310
gtggaagaaagtccaagagc
FUT1
−1
agg
48,748,024





311
ggatctgttcccggagatgg
FUT1
−1
tgg
48,748,005





312
ggactttcttccaccatctc
FUT1
1
cgg
48,748,004





313
gactttcttccaccatctcc
FUT1
1
ggg
48,748,003





314
tgcggatctgttcccggaga
FUT1
−1
tgg
48,748,002





315
actctctgcggatctgttcc
FUT1
−1
cgg
48,747,996





316
cgtgcagggtgaactctctg
FUT1
−1
cgg
48,747,984





317
ttcccgaaggtggtcgtgca
FUT1
−1
ggg
48,747,970





318
cttcccgaaggtggtcgtgc
FUT1
−1
agg
48,747,969





319
tcaccctgcacgaccacctt
FUT1
1
cgg
48,747,962





320
caccctgcacgaccaccttc
FUT1
1
ggg
48,747,961





321
tctgcgcctcttcccgaagg
FUT1
−1
tgg
48,747,960





322
cactctgcgcctcttcccga
FUT1
−1
agg
48,747,957





323
gcacgaccaccttcgggaag
FUT1
1
agg
48,747,955





324
ggaagaggcgcagagtgtgc
FUT1
1
tgg
48,747,940





325
gaagaggcgcagagtgtgct
FUT1
1
ggg
48,747,939





326
tgtgctgggtcagctccgcc
FUT1
1
tgg
48,747,925





327
gtgctgggtcagctccgcct
FUT1
1
ggg
48,747,924





328
ggtcccctgtgcggcccagg
FUT1
−1
cgg
48,747,921





329
ggcggtcccctgtgcggccc
FUT1
−1
agg
48,747,918





330
cagctccgcctgggccgcac
FUT1
1
agg
48,747,915





331
agctccgcctgggccgcaca
FUT1
1
ggg
48,747,914





332
gctccgcctgggccgcacag
FUT1
1
ggg
48,747,913





333
tgcgcgggcggtcccctgtg
FUT1
−1
cgg
48,747,912





334
cgccgacaaaggtgcgcggg
FUT1
−1
cgg
48,747,900





335
ggacgccgacaaaggtgcgc
FUT1
−1
ggg
48,747,897





336
tggacgccgacaaaggtgcg
FUT1
−1
cgg
48,747,896





337
gaccgcccgcgcacctttgt
FUT1
1
cgg
48,747,891





338
gcgcacgtggacgccgacaa
FUT1
−1
agg
48,747,889





339
gatagtccccacggcgcacg
FUT1
−1
tgg
48,747,876





340
gtcggcgtccacgtgcgccg
FUT1
1
tgg
48,747,873





341
tcggcgtccacgtgcgccgt
FUT1
1
ggg
48,747,872





342
cggcgtccacgtgcgccgtg
FUT1
1
ggg
48,747,871





343
taacctgcagatagtcccca
FUT1
−1
cgg
48,747,867





344
gcgccgtggggactatctgc
FUT1
1
agg
48,747,859





345
tgcaggttatgcctcagcgc
FUT1
1
tgg
48,747,842





346
accacacccttccagcgctg
FUT1
−1
agg
48,747,842





347
ggttatgcctcagcgctgga
FUT1
1
agg
48,747,838





348
gttatgcctcagcgctggaa
FUT1
1
ggg
48,747,837





349
gcctcagcgctggaagggtg
FUT1
1
tgg
48,747,832





350
tcagcgctggaagggtgtgg
FUT1
1
tgg
48,747,829





351
cagcgctggaagggtgtggt
FUT1
1
ggg
48,747,828





352
tgggcgacagcgcctacctc
FUT1
1
cgg
48,747,809





353
gtccatggcctgccggaggt
FUT1
−1
agg
48,747,808





354
cgacagcgcctacctccggc
FUT1
1
agg
48,747,805





355
accagtccatggcctgccgg
FUT1
−1
agg
48,747,804





356
ggaaccagtccatggcctgc
FUT1
−1
cgg
48,747,801





357
cgcctacctccggcaggcca
FUT1
1
tgg
48,747,799





358
acctccggcaggccatggac
FUT1
1
tgg
48,747,794





359
ccgtgcccggaaccagtcca
FUT1
−1
tgg
48,747,793





360
ggcaggccatggactggttc
FUT1
1
cgg
48,747,788





361
gcaggccatggactggttcc
FUT1
1
ggg
48,747,787





362
ccatggactggttccgggca
FUT1
1
cgg
48,747,782





363
cgggggcttcgtgccgtgcc
FUT1
−1
cgg
48,747,780





364
gctggtgaccacgaaaacgg
FUT1
−1
ggg
48,747,763





365
tgctggtgaccacgaaaacg
FUT1
−1
ggg
48,747,762





366
ttgctggtgaccacgaaaac
FUT1
−1
ggg
48,747,761





367
gcacgaagcccccgttttcg
FUT1
1
tgg
48,747,760





368
gttgctggtgaccacgaaaa
FUT1
−1
cgg
48,747,760





369
gttttcgtggtcaccagcaa
FUT1
1
cgg
48,747,747





370
acaccactccatgccgttgc
FUT1
−1
tgg
48,747,745





371
cgtggtcaccagcaacggca
FUT1
1
tgg
48,747,742





372
tcaccagcaacggcatggag
FUT1
1
tgg
48,747,737





373
agaaaacatcgacacctccc
FUT1
1
agg
48,747,709





374
gaaaacatcgacacctccca
FUT1
1
ggg
48,747,708





375
aaacgtcacatcgccctggg
FUT1
−1
agg
48,747,706





376
agcaaacgtcacatcgccct
FUT1
−1
ggg
48,747,703





377
cagcaaacgtcacatcgccc
FUT1
−1
tgg
48,747,702





378
cagggcgatgtgacgtttgc
FUT1
1
tgg
48,747,690





379
gatgtgacgtttgctggcga
FUT1
1
tgg
48,747,684





380
gacgtttgctggcgatggac
FUT1
1
agg
48,747,679





381
gtttgctggcgatggacagg
FUT1
1
agg
48,747,676





382
atggacaggaggctacaccg
FUT1
1
tgg
48,747,665





383
agcagggcaaagtotttcca
FUT1
−1
cgg
48,747,659





384
gtggttgcactgtgtgagca
FUT1
−1
ggg
48,747,643





385
tgtggttgcactgtgtgagc
FUT1
−1
agg
48,747,642





386
tgccaatggtcataatggtg
FUT1
−1
tgg
48,747,624





387
gaaggtgccaatggtcataa
FUT1
−1
tgg
48,747,619





388
aaccacaccattatgaccat
FUT1
1
tgg
48,747,615





389
ccagaagccgaaggtgccaa
FUT1
−1
tgg
48,747,610





390
attatgaccattggcacctt
FUT1
1
cgg
48,747,606





391
gtaggcagcccagaagccga
FUT1
−1
agg
48,747,601





392
ccattggcaccttcggcttc
FUT1
1
tgg
48,747,599





393
cattggcaccttcggcttct
FUT1
1
ggg
48,747,598





394
cggcttctgggctgcctacc
FUT1
1
tgg
48,747,586





395
agtgtctccgccagccaggt
FUT1
−1
agg
48,747,583





396
ttctgggctgcctacctggc
FUT1
1
tgg
48,747,582





397
tgggctgcctacctggctgg
FUT1
1
cgg
48,747,579





398
agacagtgtctccgccagcc
FUT1
−1
agg
48,747,579





399
tggcggagacactgtctacc
FUT1
1
tgg
48,747,562





400
ctggcagggtgaagttggcc
FUT1
−1
agg
48,747,555





401
agagtctggcagggtgaagt
FUT1
−1
tgg
48,747,550





402
caggaactcagagtctggca
FUT1
−1
ggg
48,747,541





403
tcaggaactcagagtctggc
FUT1
−1
agg
48,747,540





404
atcttcaggaactcagagtc
FUT1
−1
tgg
48,747,536





405
cctccggcttaaagatottc
FUT1
−1
agg
48,747,522





406
gttcctgaagatctttaagc
FUT1
1
cgg
48,747,514





407
cctgaagatctttaagccgg
FUT1
1
agg
48,747,511





408
gaagatctttaagccggagg
FUT1
1
cgg
48,747,508





409
tcgggcaggaaggccgcctc
FUT1
−1
cgg
48,747,506





410
gcccacccactcgggcagga
FUT1
−1
agg
48,747,496





411
taatgcccacccactcgggc
FUT1
−1
agg
48,747,492





412
aggcggccttcctgcccgag
FUT1
1
tgg
48,747,491





413
ggcggccttcctgcccgagt
FUT1
1
ggg
48,747,490





414
gcattaatgcccacccactc
FUT1
−1
ggg
48,747,488





415
ggccttcctgcccgagtggg
FUT1
1
tgg
48,747,487





416
tgcattaatgcccacccact
FUT1
−1
cgg
48,747,487





417
gccttcctgcccgagtgggt
FUT1
1
ggg
48,747,486





418
atgcagacttgtctccactc
FUT1
1
tgg
48,747,458





419
ggcttagccaatgtccagag
FUT1
−1
tgg
48,747,455





420
cttgtctccactctggacat
FUT1
1
tgg
48,747,451





421
ggcagcgccggacagaccgc
RHD
−1
ggg
25,272,568





422
aggcagcgccggacagaccg
RHD
−1
cgg
25,272,569





423
ctaagtacccgcggtctgtc
RHD
1
cgg
25,272,572





424
cccagaggggcaggcagcgc
RHD
−1
cgg
25,272,580





425
gtgttagggcccagaggggc
RHD
−1
agg
25,272,589





426
tccggcgctgcctgcccctc
RHD
1
tgg
25,272,590





427
ccggcgctgcctgcccctct
RHD
1
ggg
25,272,591





428
tccagtgttagggcccagag
RHD
−1
ggg
25,272,593





429
ttccagtgttagggcccaga
RHD
−1
ggg
25,272,594





430
cttccagtgttagggcccag
RHD
−1
agg
25,272,595





431
gcccctctgggccctaacac
RHD
1
tgg
25,272,603





432
gagagctgcttccagtgtta
RHD
−1
ggg
25,272,603





433
tgagagctgcttccagtgtt
RHD
−1
agg
25,272,604





434
agtgggtaaaaaaatagaag
RHD
−1
agg
25,272,631





435
ctctaaggaagcgtcatagt
RHD
−1
ggg
25,272,648





436
cctctaaggaagcgtcatag
RHD
−1
tgg
25,272,649





437
ccactatgacgcttccttag
RHD
1
agg
25,272,660





438
gagccccttttgatcctcta
RHD
−1
agg
25,272,663





439
cgcttccttagaggatcaaa
RHD
1
agg
25,272,669





440
gcttccttagaggatcaaaa
RHD
1
ggg
25,272,670





441
cttccttagaggatcaaaag
RHD
1
ggg
25,272,671





442
agaggatcaaaaggggctcg
RHD
1
tgg
25,272,678





443
ccgccatcacggtcagatct
RHD
−1
tgg
25,284,583





444
tggccaagatctgaccgtga
RHD
1
tgg
25,284,591





445
ccaagatctgaccgtgatgg
RHD
1
cgg
25,284,594





446
caagccaatggccgccatca
RHD
−1
cgg
25,284,594





447
ctgaccgtgatggcggccat
RHD
1
tgg
25,284,601





448
cgtgatggcggccattggct
RHD
1
tgg
25,284,606





449
ggtgaggaagcccaagccaa
RHD
−1
tgg
25,284,606





450
gtgatggcggccattggctt
RHD
1
ggg
25,284,607





451
gtctccggaaactcgaggtg
RHD
−1
agg
25,284,622





452
gctgtgtctccggaaactcg
RHD
−1
agg
25,284,627





453
gcttcctcacctcgagtttc
RHD
1
cgg
25,284,629





454
cactgctccagctgtgtctc
RHD
−1
cgg
25,284,637





455
cgagtttccggagacacagc
RHD
1
tgg
25,284,641





456
gagacacagctggagcagtg
RHD
1
tgg
25,284,651





457
cgccagcatgaagaggttga
RHD
−1
agg
25,284,663





458
caccaagcgccagcatgaag
RHD
−1
agg
25,284,670





459
ggccttcaacctcttcatgc
RHD
1
tgg
25,284,672





460
aacctcttcatgctggcgct
RHD
1
tgg
25,284,679





461
tgctggcgcttggtgtgcag
RHD
1
tgg
25,284,689





462
gctggcgcttggtgtgcagt
RHD
1
ggg
25,284,690





463
tgtgcagtgggcaatcctgc
RHD
1
tgg
25,284,702





464
cagtgggcaatcctgctgga
RHD
1
cgg
25,284,706





465
ggctcaggaagccgtccagc
RHD
−1
agg
25,284,706





466
tcccagaagggaactggctc
RHD
−1
agg
25,284,721





467
ccaccttcccagaagggaac
RHD
−1
tgg
25,284,727





468
ttcctgagccagttcccttc
RHD
1
tgg
25,284,730





469
tcctgagccagttcccttct
RHD
1
ggg
25,284,731





470
tgatgaccaccttcccagaa
RHD
−1
ggg
25,284,733





471
gtgatgaccaccttcccaga
RHD
−1
agg
25,284,734





472
gagccagttcccttctggga
RHD
1
agg
25,284,735





473
ccagttcccttctgggaagg
RHD
1
tgg
25,284,738





474
caccgacaaagcactcatgg
RHD
−1
tgg
25,290,658





475
cagcaccgacaaagcactca
RHD
−1
tgg
25,290,661





476
ggccaccatgagtgctttgt
RHD
1
cgg
25,290,667





477
tttgtcggtgctgatctcag
RHD
1
tgg
25,290,682





478
gatctcagtggatgctgtct
RHD
1
tgg
25,290,694





479
atctcagtggatgctgtctt
RHD
1
ggg
25,290,695





480
tctcagtggatgctgtcttg
RHD
1
ggg
25,290,696





481
agtggatgctgtcttgggga
RHD
1
agg
25,290,700





482
tgtcttggggaaggtcaact
RHD
1
tgg
25,290,709





483
gaaggtcaacttggcgcagt
RHD
1
tgg
25,290,718





484
ggtcaacttggcgcagttgg
RHD
1
tgg
25,290,721





485
cttggcgcagttggtggtga
RHD
1
tgg
25,290,727





486
gcagttggtggtgatggtgc
RHD
1
tgg
25,290,733





487
gttggtggtgatggtgctgg
RHD
1
tgg
25,290,736





488
ggtggtgatggtgctggtgg
RHD
1
agg
25,290,739





489
ctggtggaggtgacagcttt
RHD
1
agg
25,290,752





490
tgacagctttaggcaacctg
RHD
1
agg
25,290,762





491
agctttaggcaacctgagga
RHD
1
tgg
25,290,766





492
tattactgatgaccatcctc
RHD
−1
agg
25,290,767





493
agatgtgcatcatgttcatg
RHD
−1
tgg
25,300,960





494
tacgtgttcgcagcctattt
RHD
1
tgg
25,300,993





495
acgtgttcgcagcctatttt
RHD
1
ggg
25,300,994





496
ggccacagacagcccaaaat
RHD
−1
agg
25,300,995





497
agcctattttgggctgtctg
RHD
1
tgg
25,301,004





498
attttgggctgtctgtggcc
RHD
1
tgg
25,301,009





499
tagaggctttggcaggcacc
RHD
−1
agg
25,301,016





500
cctcgggtagaggctttggc
RHD
−1
agg
25,301,023





501
gttccctcgggtagaggctt
RHD
−1
tgg
25,301,027





502
tcctccgttccctcgggtag
RHD
−1
agg
25,301,033





503
cctgccaaagcctctacccg
RHD
1
agg
25,301,034





504
ctgccaaagcctctacccga
RHD
1
ggg
25,301,035





505
tctttatcctccgttccctc
RHD
−1
ggg
25,301,039





506
aaagcctctacccgagggaa
RHD
1
cgg
25,301,040





507
atctttatcctccgttccct
RHD
−1
cgg
25,301,040





508
gcctctacccgagggaacgg
RHD
1
agg
25,301,043





509
acccagtttgtctgccatgc
RHD
1
tgg
25,301,088





510
ccagaacatccacaagaaga
RHD
−1
ggg
25,301,529





511
gccagaacatccacaagaag
RHD
−1
agg
25,301,530





512
ccctcttcttgtggatgttc
RHD
1
tgg
25,301,540





513
agcagagcagagttgaaact
RHD
−1
tgg
25,301,552





514
tgctgagaagtccaatcgaa
RHD
1
agg
25,301,582





515
acggcattcttcctttcgat
RHD
−1
tgg
25,301,582





516
agcatagtaggtgttgaaca
RHD
−1
cgg
25,301,601





517
gctgactgctacagcatagt
RHD
−1
agg
25,301,613





518
ctatgctgtagcagtcagcg
RHD
1
tgg
25,301,628





519
agcgtggtgacagccatctc
RHD
1
agg
25,301,644





520
gcgtggtgacagccatctca
RHD
1
ggg
25,301,645





521
agccaaggatgaccctgaga
RHD
−1
tgg
25,301,646





522
agccatctcagggtcatcct
RHD
1
tgg
25,301,655





523
cttcccttgggggtgagcca
RHD
−1
agg
25,301,661





524
tcatccttggctcaccccca
RHD
1
agg
25,301,668





525
catccttggctcacccccaa
RHD
1
ggg
25,301,669





526
ttatgtgcacagtgcggtgt
RHD
1
tgg
25,303,341





527
gtgcacagtgcggtgttggc
RHD
1
agg
25,303,345





528
cacagtgcggtgttggcagg
RHD
1
agg
25,303,348





529
tgcggtgttggcaggaggcg
RHD
1
tgg
25,303,353





530
gttggcaggaggcgtggctg
RHD
1
tgg
25,303,359





531
ttggcaggaggcgtggctgt
RHD
1
ggg
25,303,360





532
agaagggatcaggtgacacg
RHD
−1
agg
25,303,374





533
caagccacggagaagggatc
RHD
−1
agg
25,303,384





534
ccatggcaagccacggagaa
RHD
−1
ggg
25,303,390





535
gtcacctgatcccttctccg
RHD
1
tgg
25,303,391





536
accatggcaagccacggaga
RHD
−1
agg
25,303,391





537
cccagcaccatggcaagcca
RHD
−1
cgg
25,303,397





538
cccttctccgtggcttgcca
RHD
1
tgg
25,303,401





539
tccgtggcttgccatggtgc
RHD
1
tgg
25,303,407





540
agccacaagacccagcacca
RHD
−1
tgg
25,303,407





541
ccgtggcttgccatggtgct
RHD
1
ggg
25,303,408





542
tgccatggtgctgggtcttg
RHD
1
tgg
25,303,416





543
atggtgctgggtcttgtggc
RHD
1
tgg
25,303,420





544
tggtgctgggtcttgtggct
RHD
1
ggg
25,303,421





545
gtggctgggctgatctccgt
RHD
1
cgg
25,303,435





546
tggctgggctgatctccgtc
RHD
1
ggg
25,303,436





547
ggctgggctgatctccgtcg
RHD
1
ggg
25,303,437





548
gctgggctgatctccgtcgg
RHD
1
ggg
25,303,438





549
caggtacttggctcccccga
RHD
−1
cgg
25,303,440





550
gggtgttgtaaccgagtgct
RHD
1
ggg
25,306,613





551
tgtggggaatccccagcact
RHD
−1
cgg
25,306,613





552
ggtgttgtaaccgagtgctg
RHD
1
ggg
25,306,614





553
tagcccatgatggagctgtg
RHD
−1
ggg
25,306,629





554
gtagcccatgatggagctgt
RHD
−1
ggg
25,306,630





555
tgtagcccatgatggagctg
RHD
−1
tgg
25,306,631





556
gattccccacagctccatca
RHD
1
tgg
25,306,636





557
attccccacagctccatcat
RHD
1
ggg
25,306,637





558
gctgaagttgtagcccatga
RHD
−1
tgg
25,306,639





559
gggctacaacttcagcttgc
RHD
1
tgg
25,306,657





560
ggctacaacttcagcttgct
RHD
1
ggg
25,306,658





561
ttcagcttgctgggtctgct
RHD
1
tgg
25,306,667





562
gatcatctacattgtgctgc
RHD
1
tgg
25,306,693





563
ctgctggtgcttgataccgt
RHD
1
cgg
25,306,709





564
gtgcttgataccgtcggagc
RHD
1
cgg
25,306,715





565
gataccgtcggagccggcaa
RHD
1
tgg
25,306,721





566
ccccaatgctgaggaggacc
RHD
−1
tgg
25,317,015





567
tgagttccccaatgctgagg
RHD
−1
agg
25,317,021





568
ttccaggtcctcctcagcat
RHD
1
tgg
25,317,024





569
agctgagttccccaatgctg
RHD
−1
agg
25,317,024





570
tccaggtcctcctcagcatt
RHD
1
ggg
25,317,025





571
ccaggtcctcctcagcattg
RHD
1
ggg
25,317,026





572
cagcattggggaactcagct
RHD
1
tgg
25,317,038





573
agacatgagagctatcacga
RHD
−1
tgg
25,317,050





574
atcgtgatagctctcatgtc
RHD
1
tgg
25,317,063





575
ctttccatattttaagattt
RHD
−1
agg
25,321,902





576
tgctcctaaatcttaaaata
RHD
1
tgg
25,321,909





577
aatatggaaagcacctcatg
RHD
1
agg
25,321,925





578
tcaaaatatttagcctcatg
RHD
−1
agg
25,321,927





579
attttgatgaccaagttttc
RHD
1
tgg
25,321,954





580
taaaatccaacagccaaatg
RHD
−1
agg
25,328,907





In the ″Strand″ column, “1” denotes the forward strand; “−1” denotes the reverse strand.














SEQ ID NO: 1 (ABO):



agtgtaaactcctctgagagatatcaggaaaagcaggaagaagcctctgggacccttcgggaggtaactcctcttcg


cagcggggcgcgctctcccagtccctgcagccgccgccgccctctcctgagcttcctcgagcggacgccaggcaag


ggcgggggtcgtagcggggcggagcggggctttgtccacggaccgcgcgaagaggcctcagggcccggcgcgg


gcgccggagggggactcgctcgcagggggaacgcgaaggttcctcagtctgcgggacgcagagctccgtggggc


cgcgagccggggccggggaagcgactctgcctagggggacgtcgcgggcgcggggcacagggtcctgcgggg


ctggagggctacaggctgcggcgcgcgcgagccggaaggccggggatcgtgggttctggggccgcagcttcacg


ggttcgtctcccccgcctccccgggggagcaggatgtcagggggtcgcccccgcccgggagacagggtgtcaagg


ggcccccggggacggggcttcaggggcacccggagccgctcggccccagggcgggatgcggggacagggccc


caaggtaccagggccacgaggggcgcgcgggtcccttggggatgcgcgcgaggaggcgccgtcccttcctagca


ggggtccctggggacccgcggccgcctcccgcgcccctctgtcccctcccgtgttcggcctcgggaagtcggggcg


gcgggcggcgcgggccgggagggggcgcctcgggctcaccccgccccagggccgccgggcggaaggcggag


gccgagaccagacgcggagccatggccgaggtgttgcggacgctggccggtgagtgcaggcctcggccccgggt


gcccgcgagggagccgctaccgcagggaatgcggggtgcacccgacagccgggccggggtgggggcgctcag


ggctgcgaggcttcgggccggccgccgccccagcctccgagaccctgcgtcctggggagccggcgggcaggtgg


gctcggccgcgctgtgggtgcctgggacccgcagggaggatgggcgcggtggcgcggcctggcggggggctcgt


ctccggggtccccgggtcctggtgagagcggggtccctcgacgccgtggcggtctccagcctctcctcgcccctccac


gctccccgccttccatgagctgctattttcagcacctaccgcccgaccctggactaggacaaggctctgggctgccctg


cccgccccccagcccttccctcgggcacggcggccaggcgcccgggttgaccgggaacagcctccataccccaa


acgcggaggcgcctcgggaaggcgaggtgggcaagttcaatgccaagcgtgacgggggaactgtgccccgggc


cctcaggtgatataggagttaagaagaaattattgaggcaaccagatgcggtgactcaggcctgtaaccccagcact


ttgggaggccgagggtggatcacctgtccttaattttcttggcgccagaagatgaattgagtatttacccagacaacaa


cgtcgcttcagagggagggatgcagaacgcagggccacggggtgcaggctgcaggccagtgaaccccaacgcc


aaaggccagggagagccgggtggggtacccagagccagcacacagccctttaatttagaggagtgctgtgtacac


atctggggagagatgttttactttgatttggaatcaggtggcggataaggcatactgaggcctgacttggtgagggctcc


tgccccggaggtgcagccctggaggagcgggaggcagaggagtggaaaattcatgaagaaaaccgggtatggt


gtaggtcgaggccctgccctcagtaatgctcaccatttgtcagtgtttactgtgaggcagcactgtgttcagtatcgctga


gttctcaggaaggaacggtaaatacttcccgggtcattctttcacccacgggaaaacaggtttggagagatctgggac


agtgctctggtcctaggcaggaagggctgagtggggcctgggactcaggtctgactgcaaacacctgcctcctccct


gtgctgccagcgccttccgggttcttccctgtccctcctttgtggtctttgttttcccttttttgtcttaatatgtttcaacggatgtat


acaataaaccgcacatgaaaggtacagctggatacatcttgacccagtcaagatgatgaacacagctgccacccc


aggagtctgtcctgccccacgggttatgctgtcttagttggtctcatgtcagggagccttggaggaccagggtcgggca


gttggtctctatactcctggggttctggcactggctctggcccatgaccgcacccaagacaaacgtcttgaagactaag


aggttaggtctttgagaaacctggcaaatgagtgcccattctcaggttacccacattctgcatgttgatttagtcatccaac


caatgttggttgaacactgatgagaacaagcaggcctgtgctagaaggtgcctgcagccaggagctggtgagctggt


gtccttagggacaccaatggcgagggacccagtgtgtggaaatctgggggacaagcatgccagggagaagagct


cacatggggaaggcccggctccacaaatcagtcaggcttgttggggcgggcaggagagcagggtagtggagtca


gagggagtgatcccccgaaaggcaggaagaggacatgagagagccttggagatgacggtgaggatgtggttggt


gggcggtgagctgggtgtcatgtgctggctccttagagaatgctcagctccttcacacccatcataatccctggaggac


tgagaccacgtgcaggagttttggaagctggcagtgcacccagtcccggctctcctccattctggtgggtctcaccaga


gattggccaagaagagatcaaactgttcctggaccaaactgagggtggggctgctatctctcgcggcccaataacga


gatgcagatgaactggggagaaagagagtttttatttctgtaaccagttacaaggagaagacctggaaattatctcca


gaccaactcaaaattacaaagttttccagagcttatataccttctaagctatatgtctatgtgtaagtgtgcattcatctcaa


gacataagtaattgacttatgttaatctataactaaggtctgagtcctgaagaccttcctctggatcctcagtaaatttactt


aatctaaaacccttatcttgtctcctaaatcatgggggtttgggaagttccttcagacccccagtaaacttatttgtggagtc


ctggggaatttcttcagatccccaataaaacttgtttaatcctaaatgggtcctgttaagaattccttcgttattttgtcatgctt


gaaggcccaggaaaggtctaggcaaaactcttggtgggattttgttatattccagcctttttataagggcactggcttttaa


tatttaatttaaccactcagtcagtactgaaacagttgttagggaggcctgcgttagtgagacctgacctgccacaacac


atcttactcggaatgctgcccataacttcaaaaaatcagctttgacggagccctactgaacacacctagcatctctcttc


cttcagcttagggtcaaggggctggggttgatggcaccattgaaagaaacagctttattgccgtgtcattgatatgccat


aaaattcacctgtttcaaatgaattattttcagttagtttacagagttgtgaaataaattttataacttttccatccccagccca


ccaaaactccctggaactcctccgcagtcattccccattcccacctggcctcagacaatcactttctgtctctccagtcttg


ccttttctggacagttcctatgaatggagtcctgtgttacatggtcttttgcatctgacttccttcacttagaataatgattccga


gattcatgtatgttgtagtatgtatcagtatttaattcctttttattactgaataatccattgtacagatagaccacattttgtttat


ccattcatcagctgaaggacatttcggctgtttctgcttttttagctattttaaactgcacgcagcactgctatgaacatttgtg


tacaagactttgtgtgaacatgttttcatttctcttgggttgatacccagcagaggaattgctgggtcatacagaaagtctg


atttaacattttaagaaactagcaaactgttttccaaagtgactgccccattttacattcccatcagcagtgtataacggttc


taattttcttttcctttttcttttcttgagacaaggttttgctctgtcacccaggctggagtgcagtggcatgatcttggctcactg


cagcctcaatctcttgggctcaattggtcctcccacctcagcctcctgagtagctgggactacaggtgagtaccaccac


atccagctaatttttgtatttttggtagagatggggttttgccatgttgcccaggctgttctcaaactcctgggctcaagtgat


ctgcccacctcggcctccccaagtcctgggattataggtgtgagccgctgcgcctggctgagggttccactgtctgtac


gtctgcagcaatacataccattcttgtgggtaaaaggtggtatctcattatagttttgatttgcatttccccaaggtcaaatg


atggcaagtggcttttcttgtgctttttagccatttgtatatgtttttggtgaaatgcctattgaaatgttttgcctttttaaaaattg


agttgtctcctttgttcagttttgagagttctttacatacacagtatcatatatatatatatatatatatatatatatatatatatata


cacaccagatatatgatttgcaaatattttcaaccatcaatagtttaacacttttttattggtattttgaagtagaaaagtttta


cattttgatccattccaatttattaactttttttttattgtgtatctggtatcatatctaagaaatcttaatccagtgtcacaaaga


tttaatcttatattttcttctaaccgctttctagtttatgtgtaagaatctgtccattttacctaagttgcataatttgttggcaaaca


gttgtacatagtatttccttgaaatccttttaatctctgtaagattggaattgctgtctcctcttttagtcctgattttagttatttgtgt


tctctctctctctggtcaatgtagctaaagctttgtcaattctcttgatcttttcagagaactgccatttgatatttttttactttatctt


tttctttctgttctctagttcattggtttccactcttctcactttgggtttaattcgttctgtcttttcttattgtagttacttacaatggaa


gcttacacacttgatttaagatttttttttctaatgtagacatttacagctataaatttcccttgaaacacagctttagttatatct


cataaattttggtatgttgtgtttacattttcattcagctcagtgtatcttttgattttcttctttgacccattgcttatttagaattatgt


tgtttaatttctatgtatttataggtttcccaaatttcttttgttaatttctgatttcattcccctgtggctagagaagaaactctgtg


gatatcagtcctttcgaatttctcagaattgttttatggtccagcatatgattgaccttggacactgttccatgtgcacttgag


gagaacgtgacttcggctcttgccaggtggagtgttctagagatgtcagttggtgtccagtgatgtcaagtcatttgtatct


actgattttctttctaattattctatccattactgagagtggggtaggattttggtgtccaattattgtttttttgtttttttttttttgtttgtt


tttttttttttttgaggtggagtctcactttgtcacccaggctggagtgcagtggtatgatcacagctcactgcagcccccac


ctcccaaggcttaggtcacctcagcctcctgagtagctgcaactacaggcaagtaccatcatgcctagctgatttttgca


ttttttatagacagggttttgccatgttacttaggctggtcacgaactcctgggctcaagtgacccacccacctcggcctcc


caaagtgctgggattacaagcgtgagtcaccatgcctggcccaaccattactggtgaattgactgtttcacccttcaatt


ctgtgagtttttgcttcatgtattgctgcgggataattaaggaatcagagagaccgatggggttgaggaggaattatttaa


ttatttaggtgcaccgacccaatcagattaacatccaaaggactgggccccaaacaaagagtcaagctaccttttaag


cattttgtggggtggggggagatttgtgcagggggaagagtattacagaagcaagaaacaaagacagttattcagtt


aagacatgcattacattatttcttacttttcaaggaacaacacgttttatgactcaagattatctgtttagtgaccttgcagct


gcacagctagagaaacagagtcttcgcaatgcctgggaaagggagagataaggctcactagccacagaaaaac


agccagttaatttttaaaggactccagccctttctcttcctcaaggggaattggttttttacatacaactgagtttttgcttaca


cagtttttaatttcttttaattcctgttctagtattttggggctaggttgtcaggtatgtatatatttctgtttgttatattttcgtgatgta


ttcactttatatcatggcagaatgtttctctttagtaagatttttgatcttaaaaaagttggccagatggggggctcatgcct


gtgatcccagcactttgggaggccaaggcaggtggatcacctgaggtcaggagttcaagactagcctggcgaacat


ggtgaaaccccgtctctactgaaaatacaaaaaaattagccagctgtgatggtgtgcgcctgtaatccctgctacttgg


gaggctgaggcacgagaattgcttgaacccgggaggctgaggttacagtcagccaagatcgtgccactgcactcca


acctgggtgatagagtgagactctgtctcaaaaaaacaaaaaaattatttagtctgacgttagcatttctcctcaagctc


ccttacggttgtttgcatagcaaatctactatccttttgctttgacctatttgtatctttgtttctaaagtatgtatgtctctcacag


gcagcatatagtcaaagcttaaaaaaaaaatccagtctaacgatctctgccttttgattggcatgttcattctattcccattc


aatgttattattgctgtggttggatttccatctatcagttcacaatttgttttctatgttttatgcctttttgttcttctgttcatcttttact


accttcttttgtattaagtattttctagtgtagcatttaaattcccttttttttttaagtgtatattttaaagttattttgttagtgtttgct


ccagggattacaatatgcattttaatttatcaggatctacttcagattaatactaattttagtaaaatacaggaacttgactc


cagtataactccatttcctccctccttgcttgtggtttgtagtattattgtcgtatatgttcatctatatatgttataaactcaaca


acatggtgttataattattgtttcacacaatcttatttcttttcaattcagtaagacaagtaaggagaaaaacacttttcaagt


cttttatattacactgtatatttatcactgactttactcttgatttcttcctgtatattcaagctattgtctggtgacctttccttgctcc


agtatatataataacttcattgcctccttgctcctttatgctattattgtcatatatattatatatgtttatgctgtgagcccatcag


ctaagtcagcttagcaaggtctcaagatacaaagtcaatgaataaatcaggctgggtgtggtggctcacacctgcaat


cccagcactttgggaggccaagataggtagatcacttgaggccaggagtttgagagcagcctggcccacatggcaa


gactccatctttacaaaaatacaaaaaaaaaaaaaaagttgagtgtggtggtgcgcctgtaatcccacctacttggga


ggctgaggcatgagaattgcttgaacccaggagggagaggttgcagtgactggagatcacaccactgcactccag


actgggcaacagggcaagacacagtctaaaaaaaaaaaaaaaagaaaaaagaaaaacaagaataaatcaatt


atttttctataatacttgccacaatcaattgaaccatgaaaaattttaaataccatttacagtagcatcataaaacatgaaa


tatttagagaataatttaccaaattaggagaaatgtctatacaattaaaactgcaatataaaccaggcaccatggctca


tgcctgtaatcccagcactttgggaggtcaaggtggccagatcacttgagcccagaagtttgagaccagcctgggtga


catagtgagaccctgtctctacaaaaaaaaaaaaaaattagctgggcatggtggcatgcacctgtagcccctgctact


caggaggctgaggtgagaggatgacttgagcccagtaggcagagattgcagtgcattccagcctgggctacacag


caagaccctgacaaaaaaaaaaaaaaaaagaaaaagaaggtcacaaaaaaacagaaaactgcaaaatattg


atgacaaaaattaaagaagacataaataaatggagaaatataccatgctcatggattggaagacttacattgctaaa


ctgtcacttctccccagatggatctacagagtccacataatctcacttaaaactccagaagaaatttttgtaaagttgaca


gctgattctaaaattttacatagtaatcagaataggttgatattaggatagacagatagttcaatggaatagaatgcaga


gtttagaaatagacctacacagaaatagtccattgattttttaagagttgctttttgataagggtgctaaggtaatgtgata


gagaaaggaaagtattttcaattcaaatggctgaaatgactggatatccattgagggaagaaagggactttagcctttc


acacaatacacaaaaattatggaattctgaagaaaataagagaaaatgttcatgaacttggggtaggcaaaaattta


atagatgaggcaaaaaaaaaaaaaggcccaaaccataaaaatggtttcatttttataaggataaattagagtttataa


aaattaacacttcccttcaaaagaaaaattaaggaaaaatgaataaataagccaccgactgaaagaaaatatttgttt


tctaagaagtattttgtttttcattttatgggttcatagtaggtgtatatatttatggggtccctgagatattgtggttcagtcatac


aatggaaaattcacatcatggagaactggtatccatcctcttgagcaattatcctttgtgttacaaacaatccaactatac


tcttttagttatttttattctttttttttttttttttgagatggagtctcgctctgtcacccaggctggagtgtggtggcgccatctcaa


ctcactgcaacctccgcctcccaggtttaagtgattctcctgcctcagtttcctgagtagctgggactacaggcacccac


caccacgcccggctagtttttgtatttttaaactttttttttctttttttttttcttctttttttttttttgagatggagtctcgctctgtcgcc


cgggctggagtgcagtggtgcaatctcggctcactgcaacctccgtctcccaggttcaggagattctcctgcctcagcc


tcccaagtagctgggattacaggcatgcgccaccacacccagctaattttttgtatttttagtagaggcaggatttcaccg


tgttggtcaggctagactcgaatgcctgacctcgtgatccacccacctcggcctcccaaagtgctggggttacgggcgt


gagccaccacgcctggccttagtttttgtatttttagtagagatgaggtttcaccatgttggccaggctggtcttgaactcct


gatctcaggtgatccacctgcctcggcctcccaaagtgctgagattacaggcatgagccactgtgcccgacctctctttt


agttatttttaaatgtgcaattaaattattattgactatagtctccctgttatgctatcaaatactgggtcttattcattctttctattt


atttgtacacattaaccatccctacataccctctcacccctgccactacccttccaagtctctgcgaaccatccttctattct


ctatctccatgagtttaattgtttcgatgttttgatactacagataagtgagaacatgcagtgtctgtctctctgtgcctggcg


agaaaatatttgtaatatttgtatttggcaaaggacttgtatccaaaatatagaaataaattctgctcaataataaaaaga


gaaacaacttaatgagacccagtgtctacaaaaagtagaataattagccgggcatgttggtgcatgtctgtagtccca


cctaatcaggaggctgagggggaaagatcacttaagctcaggagttcgaggttgcagcgagctgtgatcgtgccact


gcactccagcctgggtgacagagtaagatcctgtctaaaaaaaaaaaaatagagggccaggtgaggtggcttatg


cctgtaatcccagcacattgggaggctgaggcaggtggattgcttaagcccaggagttcaagaccagcctgggcaa


catggtgaaaccccatctctaccaaaaccacaaaaaatcagctagatgtggtgcatgcctatagtctcagctactcaa


aaggctgaggtgggaggatcacctgagcccaggaagtcgaggctgtagtgagccatgatcatgccaccacactcc


agcctgggcaactggagtgggactgtgccccccaaaaatatatataagtaaataaataaataaataaataataaaat


gagctaaagatctggacaggcttcataagaagcaagcaaatggccaataaatacatgaaaagatgatttacctcctt


agtcattggacacacttagataccactcctcatccactagcatggctaaaggataaaagagtgaccatcaagtgttgg


caaggactgtggttctcgtacattcctggcgggaatggaaaattcagtcaccactttggaacacagtttcttacaaagtt


aaacatacacttaccatatgacccatcttttccattccttggtatttactcaagggaaatgaaaacacaggtccacaaat


acctgcacatgagtgtttacagaagctttgttgctagtggccccgaacagtgaaaaaaccacagtgtctgtcaatagg


agacagaataaatgaattgtggcggattaacaatggcgtgcttctcagaaatgaaaaagaacgaacgattgataca


cgtgacaatataggtgaatcttcaaaacagcatgcggtgtgatgcggccagacataaaagagcatgtatcagagga


ttccgtgtatgggaaacgccagggaatcagatccatggttgtcagggtcaggagggaccttactggggtgattacatgt


gtacatacattgtgtgaaactcatcaaaaccgtatacctaaagtgggcacattttattggatgtaagttatccctcaaatta


aattaattttcaaagttttaaaaaatggaaccatttttgccacattgaaggaaaattatttccaccaagatttccctacagc


caaacgatctaccaactacaaaaatggaaaaaataatttaggacatgtaaagttcaaatgttttgcctcccacgtttctg


tttcaagaagctattcgagataaatcgctccgtggtcacaggacttagaaaggtggaggtaaacacacacaagcatt


ataagataagaagtaacagatgaattagttgaaagggactgatttcgggggaaggaataggaactgggccaggag


atagacgcctttcaggcttgcccttgtgaaaacgatagtcccctcttacctgcgggggataggttccgagatccccagt


ggatgtctgaaaccacagatagtaccaaaccctacagagagtatgttttctcctgtacacacatacctatgatagggttt


aatttataaattaggcaccgggtgggattaacaaccataattaatagtaaaatagaacaattataacaatatactgtcc


gaggcgggtggatcacttgaagtcaggagatcgagaccagcctggccaacatggtgaaacctgtctctactaaaaa


tacaaaaaattagccaggcgtagtggtgggcgcctgtaaacccagctactcgggaggctgaggcaggagaattgct


tgaacccgggaggcggaggctgtagtgagccgaaggtgcccctttgcactccagcctgggcaacgagcaaaactc


catctcaaaaacaacaacaaaacaacaacaacaacaaaaacaatatactgtaataaaggttacgtgaatgtggtct


ctctaaaaaaaaatgtgtatatatcttagtttgtgggttttttgttgttgtagttttgtttattattaagagacagggtcttgctctgt


tacccaggctggagtgcagtggtgtgatcatggcttactgcagccttcacctcctaggctcaagtgatccttccacctca


gcctcctgggaccagaggcatgcaccactatgcccagctaattgttttttttttttttttcttgatagaaatgagttctcactatg


ttgcccaggctggtctagaactcctggcttccagcaattctgctggctcagcctcccaaagtcctgggattataggcacg


agccacgatgcctgtcctcaaaatattttactgtactgtactcacctattattggcctttggttgaccatgggtaactgaaac


tccagcaagcgaaggtgtggataagggaactactgtacgtgactttaatatacataaaatcaccttaaagacaatgat


tgtttccagaacatgagcctctgagacagcagggagatactaggggaagttggtggctcctctccctggcaggaact


ggtctgggctcctggctcagcctggccatgaggctgtcctggcctccctcggtgggttcgacagtgacctcgacgtgct


catttcagtgtggttcattccggtctcccaggggaagggggtgctgagtggaaggaggtcaatgggaagccggggtg


gctctcagagtctgcaggagcagtcgggctgatgagctgggaggagcagaccgcctccctcttctctgagtgggagg


agggccagatctggactgggtttggagatgctcaggtggggctcagagcatcacctgtggggcagagggaccatctt


ggcagatgaaggcccgtcgcagggtgtgatgcctgaattacaaggcgggacaggtaaagtggggcaggtgagag


aaggagggtgagtgatgtgatttttctactcctgttttccaggaaaaccaaaatgccacgcacttcgacctatgatcctttt


cctaataatgcttgtcttggtcttgtttgggtaagacacatttgaccatcgaggctggcctggtttggggagaagtgacca


cagcagccaatcagacccatggggcctccctgagctccccaagtatcacagttatcagggtcctaaggacagttattg


cctgcgtccagctctggcggagggtgtgcttacttgctcccttattttagcctcacctgggcaacaggctcatctcactccc


atttaaaattttcctaagtgtggagtctggggctgggagagcaagccccttgcccacaattgcgtggctgggggtgggg


aaggcaattctgggtcccaagctgttagtcgcttccagacacagaaggtcccagaaccaagagtgaagtcacctgtc


acctctactggggcatctctggacacggtctggaaacactccctgacgtggcctcagggactgcactgaccaaggca


ctggtggcggggggtgagggagctggggctctggagctccagcaggtgcccatacgtgagcaatatcccagggac


caccctcctgcccacctcccggtgtgggacgtggcgaggcgcctgagctttgctgagaacttgccctacctgcctcga


ggccttgcagcttcaccgggaactcttgtgctcacgctgctggccgcaccatgcactttttggaggaagggaccaaca


ggcagtcttcgttctgtgtcctgagtcttggcacacttcctttctgcagttacggggtcctaagccccagaagtctaatgcc


aggaagcctggaacgggggttctggtgagtgcagggaagagcaggtggagcatccatgctggccggggtgctggc


tgtgggcgggggtcccactctgggaactccccctccccttcctgggcccgctctctatgctctgcccagtgtagacatgc


tcaactctgtggccttgaagggtcacctggatacctctggagtcagccttgacctcccttctgacctccagtccccaactc


caggcttacccagagttcccatgcatggtctctgctcccccatccccacccctcctccaccagccatcctcaactctccc


tttgctctcacccctacactgggttcccaaatcctgccagccctgctcttcagcagctcgcgtgcccactggcctccctgc


cttcgacgtcctgctccagggctccggtggggctgccctcatctctgtgacagcctcagacttgtctccccatccccatc


agccctgtcccccttcttctcacagcagctggagtgatctttccagaacataaggtaggaggctctccatgatctgatcct


gccttcctgacctccccagcctcatctctccctcctctgccctcgccctctgtgctccagccaacgtggcctgtcactcgtc


cacctgccatactgtcctgactccaggcctttgcctgtgctatggcctctgttgggaccactcttgtctctcccctgctgtgtc


tgctaattcccactcgtgtcagtgatccatggctgcagaacacaccactccatccatggaaaacaatcacaatcattga


ttactatctctccctgggcttcctggggggactgggctcagctgggtggttcactttggggcctcagcagtcatgggcag


acagtggctggggtcactgcaaagacttccctgcatctctggcagttgatgctggctgtcatctgagacacctacccag


ggcctctccctggggcctgggctcctgcttagcttggttgggaggctccaagaccaacatcccaagaaagatgagac


agaagccagatcacctttttgggcctggcttcagaagtcacccagcatcacttctgctgcatttatttcttagaaacacag


aatatcagaccccatctcttgatgggagggggcctcatggtttgtagacatgttctgaaactccactctgcccggccttg


gcttcgacgtgtctggtgaatgtgtgggctgtgaggctccccgaacgtagacctcagactgcaacgctggccgttaca


gggtctggcacacgggcccacgtcaggcccatcgccacagtgatggttgttctgtgactgtttctggtggcctctgctcc


acactccaggctgacgctgtgccccttccactgggaccctcgggtggcttccatgcacttgtgccctaaatcctgctcct


agactaaacttcatctcctgtgttctcattctgcagcatggctgttagggaacctgaccatctgcagcgcgtctcgttgcca


aggtataatgtcagtgcctcccttcagtggctcccatgtcacagaattgtcctgcagccctggcacatgtgtgccatgtg


ggagctggggcaggtcctctttcctcctgtggctccgagggagggggccgctccttccccagtctctaccctgacttgg


ccctcgtcctgcagccactcagagagcacgatggagctggagcttcagttttgaccaaatgcgtgtcgccggcttttgt


gtgtgtgtgtgtgtgatggagtctcgctctgtcgcccaggctggagtgcattggcaccatctctgcccactgcaacctctg


cctcctgggctcaagtgattctcctgcctcagcctcctgagtagctgggattacaggcgcccaccaccacacctggcta


atttttgtatttttagcagagacagcgtttcaccatgttggccaggctagtcttgaactcctgacttcaaatgatccgcccgc


ctcagcctcccaaagtgctgggattacaggcatgagccccggcacctggccttcaaattggcttttaaagaagatgac


cagctggtctgttctgtctggggccttggagggctgtttcccaggatgtgggcctttatcagtgctgacctaggcaatcaa


ggccaagctggccacactgcttttgattttttttaaattgtgatgaaatacgcttaacataaaatttgccatcttagtcattttg


gagtgttcagtagctttaagtccattcacattgctgtgcagccatcaccaccatccatctccagaactcttttcatcttgcca


acctgaagctctgtccccataaaatagcaactccattttccctcccccagctcaccattccatggtctgttgttatgagtct


gctactccaactaccttttataagtggagtcaatggtacctgtgtttttatgtcgggctcaaatcactgagcgtgatgtcctc


acggtccatccaggttgtaccaagtgtcagaatttccttcctgttttctgctgactccctgcttttaaaggccatcttggacat


tcaaattgtggccattactgacagttaaaatggccccggctcatgtctcctgagcctctacaatcctttaggtggggaca


gtactgcggcccacacgtccccacaggctgcgtgtgacacctgacagtgctttgtgccactgtggagcaggtgtcact


ctcccacttctcagagaggaagccgggctctggaagccagcagcccacaaggaggcccagcaggcttcgggccc


aggtggggaggtctgcacatctcagggcctttccctgggaacaaccagggacagaggccacacacccagcctcct


ctctgcccacagtgaattgagacacaaccccttacgtccctcccagccttaagtcaagtggaaagcaatggcccctg


aggtcacttagtccctcttgccagtttgtaagtgctgaatcagagactctgagctttggccctttcccgagttcccgggatt


atgagctgacacagcccccaagcaaccacagtgattccagcctgggagtgtggccttggcgggagggtcaatgcc


cagtcatggttcagcggctgtgcacacccctgcttacctgcatcccacgctttccatgcaaactcacgtggggcgcttt


gcgcttgcaggatggtctacccccagccaaaggtgctgacaccgtggtgagtaaagttactgacactgaaactgaac


gcagctcaaggggctgttctgaaggtattagagggcggtttccttgatgtaaatctcagttggggctgcttcgttctctcct


ctcagattctcctgacatctgaattcagaggcccacatggctgtcctctttccctttgctttctctgacttgcgtctcttgtttcct


gtccctttgttctccaaagcccctgcaaaggcctgataggtacctcctacctggggaggggcagcgggggttgggtgc


tggggagggtttgttcctatctctttgccagcaaagctcagcttgctgtgtgttcccacaggtccaatgttgagggagggc


tgggaatgatttgcccggttggagtcgcatttgcctctggttggtttcccggggaagggcggctgcctctggaagggtgg


tcagaggaggcagaagctgagtggagtttccaggtgggggcggccgtgtgccagaggcgcatgtgggtggcaccc


tgccagctccatgtgaccgcacgcctctctccatgtgcagtaggaaggatgtcctcgtggtaccccttggctggctccc


attgtctgggagggcacattcaacatcgacatcctcaacgagcagttcaggctccagaacaccaccattgggttaact


gtgtttgccatcaagaagtaagtcagtgaggtggccgagggtagagacccaggcagtggcgagtgactgtggacat


tgaggtctctccttgtgttcaagacagagtggggggcggccagccttgtcctcccagagggtagatgggaaaggtca


ttcatgcagcatcttactgagctcatgtgggctcgtgggctcgtgggctcgccaggtcggtaaaacccagctccttctcc


agaggctgcgtctcacccagggatggtggcttctgctgccccctcctctctgtaactgtggccggccgtcatgctgagc


caccccctcaatacaaggctccagatgtttcctgctcactgaccagagatagcaggagggggacacctgtttgctgtc


cttggaccctagaaagaggatgctggcagagccgtggtcacttctctgtcagatgtaggtggggcaggcaaagcagt


tggccccagacaccaaaggaagtggctgacccacaaggccctgggactctgggccaggccagagagggagcta


gccaggcaaccgcagacacatacttgacttctcggcagctgtgggcagctgggccagcgacagtggcggaggcc


aggaatgacttactcttaggaataggtgcagttcaagcctggagggaggaagctctagggtgcagaggcgggtgtgt


ggaggcctcgcgtgcagcttataatgagggagcacgtggccggcctggccataagaggggcagctgcgtgggga


ggcgtggctcaggccaggctgagggggagtgagcagacgccagcctgcggcctgctaccagcctccagccacct


gccctcagccctccttagtaagagggggtgctggtggtcccccatcgctgggaagaggatgaagtgaatcgcagcc


cgaggactcgctcaggacagggcaggagaacgtggtgcatctgctgctctaagccttccaatggccgctggcgggc


gggtgcaggacgggcctcctgcagcccaggggtgcacggccggcggctcccccagcccccgtccgcctgccttgc


agatacgtggctttcctgaagctgttcctggagacggcggagaagcacttcatggtgggccaccgtgtccactactatg


tcttcaccgaccagccggccgcggtgccccgcgtgacgctggggaccggtcggcagctgtcagtgctggaggtgcg


cgcctacaagcgctggcaggacgtgtccatgcgccgcatggagatgatcagtgacttctgcgagcggcgcttcctca


gcgaggtggattacctggtgtgcgtggacgtggacatggagttccgcgaccacgtgggcgtggagatcctgactccg


ctgttcggcaccctgcaccccggcttctacggaagcagccgggaggccttcacctacgagcgccggccccagtccc


aggcctacatccccaaggacgagggcgatttctactacctgggggggttcttcggggggtcggtgcaagaggtgca


gcggctcaccagggcctgccaccaggccatgatggtcgaccaggccaacggcatcgaggccgtgtggcacgacg


agagccacctgaacaagtacctgctgcgccacaaacccaccaaggtgctctcccccgagtacttgtgggaccagc


agctgctgggctggcccgccgtcctgaggaagctgaggttcactgcggtgcccaagaaccaccaggcggtccgga


acccgtgagcggctgccaggggctctgggagggctgccggcagccccgtccccctcccgcccttggttttagcagaa


cgggtaaactctgtttcctttgtccgtcctgttgtgagtaactgaagcctaggccccgtccccacctcaaatcacacaca


ccccctccccaccacagagacaccattacatacacagacacacacagaaagacacacacagacacaaaatcac


acacacaccctccccgccacagagacaccattacatacacagacacacacagaaagacacagacacaaaatc


acacacacaccctccccgccacagagacacaccattacatacacagacacgcaatcgcagatacgcccttccgg


ccacagaaacacaccattacacacacatacacagaaagacacacacagacacacaatcacacgcagcccctcc


ccgccacagagacacaccattacatacacagacacacacagaaagacacacacagacacaaaatcacacaca


caccctccccgccacagagacacaccattacatacacagacacacacagacacacaatcacacacagcccctcc


ccgccacagagacacaccattacatacacagacacacacagacacacaatcacagataccccctcccggccata


gagacacaccgttacacacacatacacagaaagacacacacagcccctccccgccacagagacacaccattac


atacacagacacacacacacacacaatcacacacacaccccccgccacagagacacaccgttacatacacaga


catacacacacacacagacatacaccagacacgcaaagacacacagacacagatacacagatacaaagacac


agacatatagacacacagacatgcacagagacacatggagacacatgcaaaaatgcacagagaaagacatac


agaagtgtacacacagacacatagaccacacagacacacagacatgcatgcaaacacacagacatgcagacat


gcacacaaacacagactcacgcacacagacttaggcagcccaaattcagcgcctggggcataagttcctggagg


ggtggccaccttcagcccccacggtaaggtcctgaggaaccttccccttagacaagggatcatggaggaggtctcttc


cggagcctggagggaggcctcaagtggtccttccacctcggcatcccaaagtgctaggattataagcatgagccact


gcacctggccccaacatcatttattgaacagactgtcgtttccacattgtgtttcttggcatttctgtcaaaaatcagttgac


cgtaaatgcatggattcacttccgggctctctattctgtttcattggtctgtttgtttttatgccaataccatgatgttttgattatta


tagctttatagtatatcttgaagttaggtagtgtattgtctccagctttgttctttttgcttaagattgctttggctatttaggggtctt


tgtggccattactgacagttaaaatggccctggctcatgtctcctgggcctctacagtactttaggtggggaagtactgtg


gcccacatgtcccaacaggctgtgtgtggcacctgacagtgctttgtgccactgtggagcaggtgttactccctcatacg


aattttaggattattttatttatttcagtgaaaaatgtcaatgaaattttgatagggattgcattgaatctgtagattgtttggggt


tgtatggacattttaacaatattaattcttccaattcatgaacatgggatgtcttttttttggcacctgatctcagatatgggac


agctttctaattatttgtatcttcaattttttcacaaatgttttatagttttcagcatacagatctttcacttccttggttggatttattt


gcaagtattttttgtagctattgtaaatgacattgttttcttgatttctttgttagggtatagaaatgctacagattgttgtatgtta


attttatatcctgcaactttactgatttcatttattctaacatcttattggtggaatctttagggttttctataagatcatgtcatctgt


aaacagggacaatttaagttcttcctttccaatttggataccttttatttcttcctcttgcctaattattttggctgagggccaact


tgttgggtttgttgttgttggtttttgtctgtgtcttttggtgtttcgtgttgctggcttttccagcacacagtcacagatatatgagg


cagaaagaaaacccagggaactcctgtcatgtcattcctcaggtcccaaggtccctacatggtctgccttcttctctcca


ccttttagtgtttgatatacatacgtttatgtatgtagttatatatgtatgtgtaatacccagggaaatatacagagattttaact


gtccttagtgtagggcgtggggacaggtgcatctattctaccttgcttggaagtggaaaaatcttcaagaaatcaatttat


caccaactggcatgacagaactcctggtttagtgtctagctgtctccactaaaatcaatcaccatgccatcaggcactct


cttccctctggtgtgtactggtacctgaaacagcgtccagcaccctgagagcacagaacgcagcttcttatgtgcccca


cccaataatcccaaatcgagcacagacctgggagctgaagagacaagctggaggtcaggcagttctattacttaag


cctggttcttcctgactccaaaagccaggcgccaccccatgctccttcccagcagaagctggttttcaccgggtcagga


ggacaggatggctattcctgaccgttgtacagcagtgccacatctcctagtttccagaacatttaatgtgttgctgtggag


gcctggcagctctgaagtagccccttgcgcccaggacaaaccgggcctggaggggggggtggggtaagctgaag


cttccagtatcccacgtgggcattctgttgctgagggcaaccgcccctccctgcaagggaggggaaggagcaggca


ggcacacccttcctccctccacatttcactggagttgacctgtgtcctactgtggttgcacctggaccagcccagaggg


actggagtgactcttcccgcaggtagggcacagccagcctgggcacagggcctcactgcaatccctgcgccccgcc


tccctgggggatttccaggggttgtcctgctgcttggtgagctgggggtgtgggggaccacagatgaggcaggcgcct


aaattgcttgggcggagtccggctgctctctccctccaggccctttcctgtgtcctcgggcccctcgggtccctgcacca


ggccagcagcaatcgggtgtcagaggttcaagtccaggctgactcctgggacgcaccgactttgcccaggagggct


ggggagagggcagccccaaagtgtcatctttgattattctttgtccccaaagccagccttgggcatgtgcccagcaca


gggagagctgctgcctctgtgggcttgagtcatttttgcttccttgcccctctctgctctgtttggtttaaaaaacaaaaacc


aacacccatgggtgaggcgaggcgaggcgaggcttgccttgatccacccgcttcctcccctgcagcctcgggcccc


atcctcccgcgtggccgcactaggggtgctgggggcggagtggggaccagacctgagtgggttctgggttcagcttc


gcagcaggagcagctggggcatcatcagttgggcaggtgacccaggatctgagcaggaatcttccaggtggaggt


ggggccgcccctctgcttgccacagctgccttctgctctgggggcagcactggctgagaatccagtgaagtaaggcct


ttgaaatggttttgattgtaaaagtaatgtttatgtttagccattcctttttttattttttattttttgagacgggtctccctctgtcacc


caggctggagtgcagtcgtgaaatgtgggctcactgcaacctccacctcccaggttcaagtgattctcctgcctcagcc


tcctgagtagttgggattacaagtgcctgccaccacacctgactaagttttgtattttttagtagagatggagtttcaccatg


ttggccaggctggtctcaaactcatatcctcaagtgatccaccctcctcgtcctcccaaagtgctgggattttgccatgag


ccaccacgctcagccatgtttagccatttttaaaaggtgtgaacagataattaagtctattagcaacattagaaaatttaa


gttaaaatttaaatgactctagcaagaattagcctttaggtgccctgggtccccctcctcaccgccccccaccccgccg


ccttctgagaagcgttttgtaatttttcttatttataaaatgggtttgattcaaaactgacttttcctcttcacagtctcacaggtc


cttccccaggtccaagaggctcttctgtgtcctgatgacaagtagctgcctagccgtggtggcacctcctatcacatgtta


agggacccctccccagggccacacctggcagaaggtggcttatgatgttcgcagcttgaaagtagtgtaaaccaaa


gataaaattctaagcccactcccccagccatcggaatggacccctcctcttggccagggcactccaaagttaacctg


aaaaaccggttcaggctgtgaagagaaggtggagtggacatgcctcatttatgtcctcctcccttttggaattcagcaa


agctgaccagcatgaacattaacacagaccttaagtctgattagtggcatttacaatctatactctctgaagcgtgctac


ctggagtcttcctttgcatgataaaactttggtctccacaaccccttatcataacctagacactcctttctagtgataataac


tctttcaaccaattgccaataaaaaaattttgaatctacctataacctggaacctccccgctccaccttcgagttgtcctac


ctttctggacagaagcaatgtggatcttgcatgtatttgattgatgtctcatgtctccctaaaatgtatacaattaggctgtgc


ccagatcaccctgggcacatgttctcaggccctcctgaggtctctgtctcgggccattggtcactcagattcggctcaga


ataaatctcttcaaatatttttcagagtttgactctttttgtcgacaataagctcatcagattaaagctgttggaactttaaatt


acttcgagccttgaaagaatgtgattctgacaccctagtcaagcagcaagtcgctgtgaactttgcttctccaattataga


ttaacgctcttcctttcttgatttgtaaaatgttatgaaagactaaatggcaccagatgtaagaccccttccctttcttagttac


tgaccttttgttacgaattaacttctttcttttcttgcacccaactcagaccagatggtacaaaagaccccatgcctatcaa


attccataagacagtatgttaaatatacttttcccgaagggaacaggatataaccaaacaaattgctgcaactcataag


ccaatcttatgtggaaaatgtaatcatgctaaacctccttgtattttcccatttaagtgaagccctaatccctccacctcag


agcactgagcccattcctttagagtctgtgttttttgagtggtggtcgtcaaacttagagttcgaataaactctataacattg


gattttgaacctttcaattatttcaggtggcagaagaaacacacagaccccatctcttcctttggagttctgtactgctcattt


caaagggatttcatcctgccctctgtaaccacccaaggggttcctcctgctccctttctaaataaagaccatgcattcca


gtaaagaaagagtttaatagacatgaggccaaccacatggtagagagagtttgtactcaaatcatctcatcaaaggc


tcatagctaggggtttctcaaggcagtttggggggtcaggggggccaagtaacaggtactgattggctggggcaga


catgaactcatggagggtcaaagctgtcctcctgcaggctaaatcgcttcttaggggggtgtcacaggagtgaggttg


ggggttccaggtagagccatgggtatcagacctgcaaaaaacctgaaaagatatctcaaaagaccaatctacagtt


ggggaaattgtgtattttgctagagaggtctacaccttagcttctccctctcgcctcatagcttttcattaggtttacacaggc


agaagagttttgggggaggcctgctgttatttaaactataaattaaatgtctcccaaggttagctcagcccaaaagccc


aggaatcattaagggaaaggaaagatgaggggcgggttagctcagctcactcttacaatttttctaattgatagagttttt


gcaaaggcggtttcactttcagcgtggtcgttctcctaacctctcctgcttccccaggtcctcaccctaaaccagcaccc


atcctgcactcccagggtgaaccctcccttccctcctgctcacctgggcccaacccagtttccccccaggagctgcgg


aagggcccagtggtgccaccttggcctccaggcatcactctccaacttcttccctcctccctccacaggacccttgcctc


aaatgtcagtcccgctgctctctgctgcactcagcctcttcctggcaagtgtgcccctcccctcttgaagccatcaccag


cctctcgtgcccacccccatctaggccccccactcctctccagtcccggtgtcttggaagagcctctggatcacagggg


ctgggaaattcgacttgtcatccccaccaagcaaacacatgcttccatccagggcccctcagtgggtgaaggctgag


ccccagtaaaggatagatcatcactgtcagaggcgagtgaactagagcgactccatgttgaatagggactgggtaa


aatgaggctgaaccttctgcactgcattctcagggggttaggcattcttagtcacaggataagataggaggttggcagg


attagtatcacaagatacaagtcaccaagaccctgctgatcaaatgggatgcagtaaagaagccagccaaaaccc


accaaaaccaagatggcgacaaaaggaacctccagttgtcctcactgctcattatgcgctaattataatacattagctg


ctaaaagacactcccacctgtgccacaacagtttacaaatgccatggcaacgtccagaagttaccccatatggtctaa


aaaggagagggaccctcagttcagaaaaatcccctcccctttcttggaaaactcacaaacaatccacctgttgttgag


cctataatccagaaatcagtataagtatactcagctgagatccaagaaccctgtcttggggtttggactgggactcctttc


tggtaacaatactagggtatacggagcccccacatccccgggcaagtggtgcagaagccggagccatcagcagc


cgcccctggcaggctcctgtgtggactccttgctgcagggcatttggtttaccaggcgtgagagacccacacgtgaag


cgctagcccagccccatcaccgcagcacagcctcaccactgcagcccagaaaaaccactgcagcccagacaca


cgactgcagctgaggcccatcactgcagcccagacacaccactgcacccactgcagctcagcccaccactgcacc


ccagccccaccactgtagctcagcccgaccactgcagcccagcccaccattgcagcccagacacaccactgcag


cccagcccaccactgcagcccagccccaccattgcagcccaggccctgcagttgtccctactaggtctgagtggag


gaatcagagagaaaactctggaaagtgctagacacaggggaggccctcaaggaatggtggctgccactgggaac


tctgtttctgtggtgcttactggacagataggctcctttgcagatttttttattccaaaactaagctcccatcacccagtcctgt


cctccctgctggcagagcttgtccctggatccaagctgtctcctctgcaccaaatccggtggcctcaagtcccagctctg


ctttgcacaggggcttttcgaagccctgtctcccctgctgtgagatgggggtcacccagctgccccccagggggtggtg


ttgctgacggagcaggtgaagacaatgccaagagctgcgcgtggtctggctttcccaggctggatgtgcccctctgat


cctcagtctccccacatggacaatggctgatgccgaccacccacactaggtttgccccagagtcaaagtgagctcctt


gaggacagggaactgacccccacacagggccccccaggatctgcttgaggaggcaccaggcagttctcccagaa


ggcagggcgtgtcttcagagctctctgggttgagtgggcctaggccatcgctgcctctgcccaagtctttctagaggctt


cccaagtaccagagctgccctggagggccagagttaccagaaagtccaaagggagaaagtcagagagacacca


gcacacccaagaagtccagggccttggggacactaagggtgaactctgtgctgtgaagggcaggcctggcccgga


gagagggagcctgcagaggagacgggatctctcggtgctcttccgaagaagcccctggatctcccactgtgttctgct


ctcccgggcgggcaggtgtcaggatggcgcctggtacagctggttttgggcagaagccccgcccgtggcctgcacgt


tgggtgtctgcgtgttgggcccctgctctgcagcctcagcaggtactgtgacgggggatggccgggcagggacctac


ctggccagctctttgtctcagctgccttcacacggcagggtttgctttgcgcatcccatccctgtgaacatacaggtgcga


ttagtttgctggtttgtggcagggtggttttttcttcttcctaaaaatggatttagtagatttttagcttcacaggtttcatctgaac


caatgacttggaatttaagacaatggctacacgcttttgactgggtttgggatttgtgatccgaaacggttcatttgttatat


ggctagaatatgacatgacttttcatgtgttttaaaaaatataatgaggcggcggctgcgtccacgtcgaccaagactg


gagcgaagtttaaggaagggacagaatcgccggggagtgcccttgttgttgctgggggactcccagccttccctgtg


gcccgagaaggagaagcagtggctctctgaagcaggcatggagagtagaaaactggagttattagcataccctagt


acctcttacaactttcccttccatgttagcactttagtgctggcttctcagttttcttaacattgagacaataaatgtgtgttgtgt


cttgtagatggcataaagagtaaataagaagttttagagttgttctggaaaatgtcagaataaatctccacttgagttgtg


tattctgctagtccaagtggacagcaacttcctgctaccctcccttgcaaccttgcagacaagtctctcctctgaagaac


gaattagatttagctaattagaattaatccttgctttcatcgccatcatctgtaaaatactttgttgggtagaccactttatacc


tttgcaatacggtctccgtgggtaaaaaacaaatatctgtaatgcagacaggaggatgtgaaaagttctgttgcacgtg


ttttttttttttgttttttgttttttgttttttgtttttttgagacagtctcgctctgtcccccaggctggagtgcagtggcgcgatctcgg


ctcactgcaagcaagctccgcctcctgggttcacgccattctcctgcctcagcctccggagtagctgggactacggac


gcctgccaacatgcccggctaattttttgtatttttagtagagactgggtttcaccggttgcacgtatttttaattctgtggctgt


cacatgatagagaatggaattgaatggagatttcccttttgcttcttagggttggaaatattcccatgaaaatgttcaaattt


ggaatttgaaagccaccaaatgaatctttatgtataaatccttgtaaatgatagattccataggtgagacttttatgtatttc


aggtgggagcctactggcatatatttttaaatgttcatattacttagaatctccaataggaagtctttatttgaaatagttgaa


tccgtgttctagtattttcctttcagcaagatctgttaggtttttaccccttcaaaaataagttttatttcatctgcaaattgtggc


aatgttatagcgatcagaaactacgtaaggaatgttatataggcttgtcagttcccatttatcttaacaacaataaatatc


acatttcttcttttgaaaatgacacatatttaggccaggcgtggtggctcactcctgtaatcccagccctttgggaggccg


aggtgggcggatcacgaggtcaggagatccagacaatctggttaacatggtgaaaccccgtctctattataaataca


aaaaaattagcagggcatggtggcaggtgcctgtagtcccagttactcgggaggctgaggcaggagaatggcgtg


aacctgggaggtggagcttgcagtgagcggagatcgcgccactgcactccagcctgggcgacagagcaagactc


cgcctcataaaacaaaacaaaacatatttaaacacttagaaaataaagttaacacttactgaagtgccggtactaca


ctgtcctagtactaaaaggaagcaggttggaacatacatatggcctatcatttataacagaattaacttgttgaatgtctg


taaatgatttttttttgcaaaggaaaaagttgatcctggaaaagattgttgtgcatagttattagtcatttgtaacctcgcttaa


gtgtttctcagttgttcaacatagacacttttttctccttaccatgtatttttaaaaatagtctattacttgactttgaacgtaaag


ctttaatcataatttcccacgtatacatagttcatctgacggtaagctggatttgaaggtaggggtttcagcgttacttaagtt


ggtagctgagggtatcaggcatcagttcatgcaataagacaaaaaaatatatatcctttgcttgccaaggggtagagt


gatgtgcatttatctgttttctgttctgtaagtttagactttcaaaccattttgtaaaccaacccttgggaaatttgaaattacctt


ataacttaagactctgtggtctctggaatcaccctatctgtttcttttctgcgtaggtatttataacattgctgtttgactatagc


gtgcactctgaaatgttatcagtggaaatttgtttgagtttcattaatgctatttcactagttagacataattacttctaccagt


gtaaatgacactgatgcgcacagagcttccagatctttcagactcaactactaggtcaattagtttgcataataaaactt


ggcagtttctacaagcctattatgacaaaccaggagctaattctgtaatgaaaaactatccattctgaatgatagggac


gtaattatttgctgctgctgtcctttgtaaattttgaacatgacattatactctgtgcctactaaaggtatcctctggagttttttg


agaggagggaaactggaaaattaaattgtatttttgccagaagactcttacttgcatgtgtctcagggtcttcagtttttcta


gaagtttccatatccaaggttcagagttcatgtgaaatacttctttgggagaaaaaaatccttcattcctggtattcattgga


ttggaaatctgcagcaaaatgctgtttaaaattactatgtggtttttctatcttatccttagctctctggctattgaacttttttttctt


ctttgaagttagcttcaaatttgcttctatgctaaattacctgtaaatattctggataggaactatttgaaatagtatttgttaaa


agaaatgataaaatgaaaatgttcaaactacagagattttaaaatgccataacgatcttgcaagactaactttaaaata


tactataaatgattattatgattttggtggtaacgatcccccacacacaaccgctgtgaagaaatgacgccacattttccc


ctattgtaccaaaaagataaagatggtaaacattaatcaaggtattttgtattgtcaacgcgtgcatattctaaagagtta


aatgccaactcagcagcactggcttcctggctggtcaaccataggaaacctcgttcatttctcccagtgttgtgatgttca


tacttctacaatcttccctgtcatgactttaacttctacgtttcattaaccattcctgatgttagttctcagagcttcttcttttttttttt


tttttttttttgagatggagtctcactctgtcgcccaggctggagtgcagtggccccatctcggcacactgcaacctccgcc


tcccgggttcaagcgattctcctgcctcagcctcccaagtagctgggactacaggcgtgcgccaccacgcccagctat


tttttgtatttttagtagaggtgtggtttcaccctgttggacaggatggtctcaatctcttgacctcgtgatccgcccgcctcgg


cctcccaaagtgctgggattacacgtgtgagccactgccggttcttagagtttcttaaggcaaaaataaaattattcaatt


ctgaaaaaatataaataaataaataataaaaaatataacgaatctggaggactgcatggtgtcctatagcgtgacggt


taaaagcttgggctttggaaccaagtcggcctcgaaatcgcagctttgctgctgcctgggccactggaccttctgagcg


tcgttttcctcctgtgtcagctgagggtggtgacttgggctcccgctcaggggttgaaagagattctgcagcctctggagc


tgcccctcagcgcatcatcactgggtctgctgtcctcggtacctgcctgggtggaggtttctgtgacttctcaacggaagt


ggctgtctcaggcgaacgctgtaacagtgaccatcccttgaggtgcctgtttcttgctctgtttttatttaaagatctgtgga


aatgtagaatatgaaaaatgaactgcaatattcgggatgaaaatggactaggctaagtattcctgatttgaaaaaactt


ttatcttaggtttaggggatcgtgtgcgggattgttatataggtaaactccgtgtctcgggggtttggtgtacaggttatttagt


cacccaggtaataagcatagtacccgataggtatttatttttgatcctttccctactcccacgcttcactctcaagtaggcc


ccattatctgttgttcccttcttcgtgtccaagcactcctgattaatgagcctaaagaaatactgaaagaacaagcgaaa


cacatctgttctctttcctgctaccctctccgcatgtccccctctctctagaaaacaatggttttaatcttggctgcatgttaga


gtcacctagggtgcttttaaaattccctttgccaggcgccacacttgaacgttgtgattatttgcctggcccaggtgtcaat


attttttttaagagatgggggtctttctctgttgcccaggctagaatgcagtggcgtaatcatagcgcactgtagcgctgaa


ctcctgggttcaagagatcctccgacctcaacctcttgagtagctgggactactgacacgtgccaccatgcctggttaat


ttttaaaaagtgtttgtagagatggagtcttactatgttgcccaggctggtctcttaacttctaggctcaagggatctcctgc


ctcaggctcccaaagtgctgggaatataggcgtgagccactgcgcccggctggcatcagtatcttttgaagttccgatg


attctacagagcagccacagttgacaaccattgccctagaatcatcgtcttcactcacatttcatttttaaatgaccgtaat


tgaaaagacacatttggtcacgaggaataggaaatggacaactcacatctcatacaaatcacaagtacctcctggtg


gaagcagcaggcctctgggaaggttctagaaagaatgggtccctgtgtctgaagctgtgcttctgagcaactgcctgg


ctggggcgcccctgctcactgggtggtcactgaacttcacacactgctccccagctatgctttcctccttcatgcttccttct


tgcttgaccggactccaacaaggaagcctctttttcttcatgattaaaagagtaataaatggtcatgggagaaaaataa


agtcatatagaagcagacaggctgggtgtggtggctcatgcctgtaatctcagcactttgggaggccaagatgggtgg


atcactggagctcaggagtttgagaccagcctggggaacataacgaaaccccatctctacaaataatgcaaaaaa


aaaaaaattaggcatggtggcgctctcctgtagtcccaattactcgggagggtgaggtgggaggttcacttgagcctg


ggagttggaggctacaatgagctgtgatcaggccactgcattccagcctgggggacccagccaaaccatgtcccaa


aataaaataaaataaaataaaattaacataaacataaaaaaggaaggagatgagcagaaagtaaccacccacc


cccaaaagctcctctacctgggacagtaggttgggggagtgtggcaccctggagccaaaggtcccaggctcctgtcc


aactcactacaggccatctctgctctctaaccccaagcctgagtttccttgcctgtaaaatggagaggacagtagagcc


tgcccctgggcctgggaatgaggagtaaaggagtcggagcattactggcacgtggggggcagaagtgctgtgtcc


tgtttttacacacatgcgcagctgtaaaacggtttattttatttttcattttttgacagagtctcgctctgtcgcccagtgcagtg


gcgcgatctcggctcactgcaagctccgcctcccgggttcacgccattctcctgccttagcctcccgagtagctggaac


tacagacacccaccaccacgcggggctaattgtgtgtgtgtgtgtgtgtgtctgtgtgtctgtgtgtgtttagtagagacgg


gatttcaccgtgttagccaggatggtttcagtctcctgatctcatgatccacctgcctcggcctcccaaagtgctgggatta


caggcatgagccaccgcgcccggccaaaaggtttttaaaaacatgaaatgagtcattctgtgcacaccattttgttgtc


ccgactttgtgcccatagctgagtttgccagacgagggacccccagtcagcaccgtgggctcaagagagggcagcc


tcagaatctcaggtggccggactcagggctagccctggtgtcctggacagtgcccagacctgaaggccgcagggg


ccccctcccccacaccgcccagagtgaaagctccagaaacagctcagggcctgagtcgggaatccctgatcccta


aagtggagttacagaactttcagggctgtggggaggatgatgaaaggggctcggggcggcagccccgttgagtca


agggggagtgcccagctttgacattttatccacaggaggctcggaagggcaaagtcccatttcgcttccatctggctttc


ccgccgcttgctctgtgtgcctggttttgttttgtttttgttgttgttgttttgagggagtctcgccctgttgccaggctggagtgc


agtggtgctatctcggctcactgcaacctccacctcccgggttcaagcgcttctcctgcctcagcctccaatgtagctgg


gattacaggcgcccgccaccacacccagctacttttttgtgtttttagtagagacgaggtttcaccatgttggtcagggtg


atctcgaactcctgacctcaagtgatccgcccacctcagcctccccaaagtgctaggatccgtgtgcctttaactccttg


cctttaggagtcgccgtgctgtaagcagcactgccccggggtaagggcgggcgctggagactggcagtgctcacca


gcgcccccagctggtaagtatggcagtcatcgagcctggggggaacggaaagcaccactgggatttctccccgctg


agctgggcagttctctcctctgcccagggcgctgccgtgcccccagagggctttgggaaatgtgagttggctcttgtgg


ctgttcctatgcctggggaatttggaagagaccaatgtcattgggagggacagaaagtaccacgcactgaagaagtg


tgcccacttcccaaccccatgcccagggtgctcaggagaggaccactgggaagaccgcggatgaacattccagat


cccctcggggagagaaggcgctcctagccagctggtccttgagcgcacgcgcactccgtgttggagcccatttagaa


taaacattcaaaccctcagagcccactggagggctcagggcctttcaccttggctaagaaacgatccccacagcctt


gaagctgggtggaagctcaggtaggtggtcagggtccctgggtcagcctgtctgcccagctccaggcagctttcctga


tccttacaaactccgggtgcagctccacaccgccgggtcccgcccaccggctccgctccgaccacgcccactctacc


cctcaacacagccttgaccctgccccactggccctgccccaaccacgctcatcccactcctctgccccgcccttctcg


gccccgcctcccccgctcagctcagcccccactgggcctggacctccctggttcctgtctatcctcccctccaccctgcc


ctctctaaccgaatccccgccccatctagagccctaacctctagatcggccgggggttcatagatcataagccaggcc


ctgctggtgacctggaggccgcggacagccgcccttcaccctggtggggccaccctgccaggagctggcctggag


gcggggcccggggctgaatttggggtcgctgctcccactgcccacggttcttcccccaaccctgcccagctctggggg


cctttctctcatctctctcccttttgttgttgctgaaaaatgctgctgtttttctcgtggctaagttatatatgctcttaaaaagtaat


gcccggcatggaggctcacacctttaatcccaacgctttaggaggtggaggcgggaggatcgcttgagcccaggag


ttcaagaccagcctgggtaacaaagtgagaccccatctccataaacaaacacataaatagtaagaacttgataaag


gtctgagaaagaaattagagatcactgtgaatcctttccatgctgacattttggcacattttccccaggctgcgtgaggcc


caggctgaatgtgcacattggcatcttgttttctttattaacatcatttcagaagcattttctgagatgagtacaaactgaaa


acattcttctgaacacctgcatggagccctcttttcacacatgaccctgtgatgggccagctgggctttggataaaaaaa


agaaaaaaaggaaaaaatgattagaggtgatccagtagcgcgatgctcctcccatctgaggcaaaggaagaaga


aataatttaaaaatgttcccagagcaaaaacagccgttttctaaagaatggccccacaattggttcaagttgaagagc


ctccccgaggaataacagcttcgtcatccatctagttcaggcgttgtttctatagtaacctgtgacgtggtccccttcccaa


aacacaaaacaaaacaaaaaacagaaacaaaaacaatgaatccctgagaacagtgagaaagaaaaagctctt


gcttcctttggcctacggaagggtgtggctgttgagctgagccttggagttcctgggtttgccggtttccccctctggcagt


gctgggggctttgagccagcaacctcaaagaatgcccatgccagggaggccagcatgcacgcaggtattgatgttg


gctttccacgggtgcagtgagccgcataatggtttggggtgaggatttaaacgttttgagacaggaacggcttgcctctg


ttttaaagatcagctgctgccatttcaaagatacgttctgttactttgccaaggggcctcattccatgttgctgggaatcagc


agcagtgctgctctgatgtggctggtggccgtcccaagcccccatgcctccctgacctctcgctgcccagcacttcccg


ctccaggggtgctctcaccagccatgactccccctcccgatgcctacatcccccgcctgtctgctgtgccctgtccaca


gcgccctgtccacagcacagctggctcaacacatgtggcatccaagccgggcaccgtgctttgcctcacactgctga


gtccacctggccttacatgccttggctggagtccaggcgtggggcaggcttcggggatggtgtcaaggactgggcact


tcctgcctctctcctcaaggtccccagggctgcctcatcccaactgggtcccctgctgcagttctcagatgccctctgcat


gttccttctctgggctccgtggctcattccagagcaatcacagaccagagggacggcgccccattagatcaatcagac


aagccctggagtggggagtgcagtcaccatccatgagggtgcgggtgtgttggggggatgcggggaggggaatgg


atgccgcgtcagccaccaacagccctgacgggcccctgacttcctgcactaacattgtccctgtgctcacatctgccc


agtaccactggggccattctcacctacggctctaccaggtgcccaggaaatgcccccaaacgggcaggatgctcac


aaggcaaggccacctgccctgaaggacgcgggggccacagggtaatggaaagcaaagggatgggggaattatc


cccaaatcagtcacacgcctgatggcagccctgtcaaaatccctagggaggttttttgttttgttttgtttttggagggggat


gactttgtagagaaactgcattttttcttaaaaatgtaaaatgatctgcaatgcaaatataccatgttctgtttagggcaagt


agcctcttctgattttgagtctacgaataatcagaagcactgaaaattccaaagcaggagttaaagtcctcagcagcg


cttgggtgtgcggtggggtccgtgatggatgctgaaaccgtgggctccaaaggccagcacaggaaggaaggagag


ggcctcaaaggagggggcaggttggacctggccaaccccagcgtggcaccctcagcccttctcagctcccagtgcc


tcttaacattgctggcaggtgtgagcctggggtcgtgctagcctctgagtgagctcagccccttaccatttccagcctcac


cttcctctcccaagaaagtggcatgagggtaacagcaacctcatgggagtgcaaggctggcacaggctttagccgc


atctctgagccctttcaccaaaaaggactcactgaaccaaagagctgcacagaacattagcatccacccccacccc


agagagggcttcctagtgctcacctgggggagggtgtgtgtgcaggtgggaggacggagatcgtggcctccctccat


gtggctcgtggtatgaagaccccacctcagacatctcctgcctcccaggtggcagctgcctgagtggtgggtggtggc


tggaaccaggagacccagttctgcctggctgggcctcagtttcccaggaggtgacctggacagggtcacgctgcccc


atctctgtggcacctggcttgggactccctccctccccgctgctcttccttctccttactcctggggtcttccaggaaagtgc


cgatcccggagtaagatttggggggcagttcctttgctggcctcaagccccctgcgcactccttccccagcggcagag


cgagggcatccggtcctccaggagaagagctgggctaggagcttagtgtgttccccatgtcacggggcctcttccacc


ccctgactcatggaccccatgctggccacgctgcggaggcctgccaggcttccagatgccggcgactcccctcctgg


ggccagccttctggaagggggtgttgagcccatgaggagctccataagtggggaagcaagtggggacggccatgt


ccacagcatcaccttggcagctgatgggcgggaggacaggtccaggcggcctccaggcacgttcctgtttgtgtgac


attcaccgtgacatggtgcatgccgtggcacacagggtcctgtgttcagataagccagggcctcggaggaactcagg


cggaggaaagagccggaacacaaacacagcgcgacctttcccgggaagcggcccctccaggaatgcgccctgg


gcccccctgccaagccctgggcctccatggccaagtacccctcacttccaccatccgcgactcggttcactccagttg


agccccaagagcctctgctgagcccaacccggaagggcgagggagcctcggtggccagagagggccgaggcct


gttgggatgactgcactccttcaggacaggcaccctctgtgacggggacatgcagggaccactgcggctgccctggc


ccaagacgccctggaggccaggcgggcagggccccttcatccttcaggaggtgggactgggctgggttttgaagga


tggcagcgtccaggtagactgagggcacagacagggaaggccacacaggcctaaaagcacacagacttaggcg


ggcgggaaatgtgaggtcaaatcctggagccctcggagggggactcaggaaatcagatgctcagtagggggccg


agtttggagccttggccttggggcaggcgccgtgtccagcagaggggctgctgcccgccacggctggccttgcccttg


gcattggccccagggaagtgcagggcgcaggagagatgccacaaggcccttgggtgttcttcccgcagcttccttcg


gtcgcaggcggatgacgcatctttgtcactgcatccacgtaagatgctctgttagaaaaaagacaggaaggaaactg


ttccaactcgcgagtgagtcactggggaggtgggattggaagtgattttctcccctcttccagctttctaagatttataaca


cgtgtgcatcctttaaactcgtgtcaggaagcgacctcactagggcattgtgtctgagcaggaagaatctcaggactcc


ctgggtttactcatttgagcgccagcctccaggagacggggcaaagggctgctcgtcctccgtgaccaggcggcccc


tctgcctttgtcgcatgtgtggtcaacgtggtggctgaaaggctcacgtgttccttagcccgggggacctgagaaacctg


accgtcaccgtgtctctgtagcctagttggtggcccggcagtggacagaggccctgcaggccaccaaggaggacg


gaaccccagggacagaggaggagccgcagtgcagggggtccaggcccgcccacagccccaccctcgggaca


ccccaacccctgcaccctctgtgcttgccttggggcatttgcttttcaactccacagaaaatattgtgacagtaacgtggg


agcagcaagaacccaaataaaagaacgtgtcattcacgctgcctccctttgtcctgagctgcaggtaccagcacatct


gggcactttctccatggtccgctgtcctaatcctgactatttcagcacccagggagcaagccaagaacaaaaacacct


atcatccatgcccggcattagaaaatctcattgttggtgagaatgatggtttccagcttcatccgtgtccctacaaaggac


atgaactcatcgttttctatggctgcgtagtattccatggtgtatatgtgccacattttcttaatccagtctatcattgatggac


atttgggttggtttcaagtctttgctattgtgaatagtgctgcaataaacatacgtgagcatgtgtctctatgg.





SEQ ID NO: 2 (FUT1):


gaaagtccctgactggagttggcagccaagccaggccctggagtgggcacccagagggaagacaggttggctaa


tttcctggagcccctaagggtgcaagggtaggccttctgtgtctgagggaggagggctggggctctggactcctgggt


ctgagggaggaggggtggggggcctggactcctgggtctgagggaggagggtctgggcctgtactcctggatctga


gggaggaggggctggggaacttgggctcctgggtctgagggaggagggagctttggtctggactcctgggtctgag


ggagtaggggctagggatctggactcgtgggtgtgaggaaggaggggctggggtcctggactcctgggtctgagga


aggaggggcagggggcttggactcctgggtctgaggaaggaggggccgggagcctggactcctaagtctgaggg


aggagggtctgggggcctggactgctgggtgtgagcagaagggtctgggtgctgggagtcccgagcctggggaga


tgatggttaaacttctgggaatcaagtcaaactcctgagtctttgacattgatgtatcttgaatgggagggtcagtctgtgg


ggaaggattacccaggtgccgaggcaagagactgaaggcacaaactgtttcagtataataaagaaaatagttaga


ataagaatagttatcatacaaattagatatagagatgatcatggacagtatcaatcattagtgtaaacattattaatcatt


agctattacttttattctttgttgtataactaatataaccaggaaacaaccggtgggtatagggtcaggtactgaagggac


attgtgagaagtgacctagaaggcaagaggtgagccttctgtcacaccggcataagggcctcttgagggctccttggt


caagcgggaacgccagtgtctgggaaggcacccgttactcagcagaccacgaaagggaatctccttttcttggagg


agtcagggaacactctgctccaccagcttcttgtgggaggctgggtattatctaggcctgcccgcagtcatcctgctgtg


ctgtgcttcaatggtcacgctccttgtcctcttgcattttcctcccgtactcctggttcctctttgaagttcgtagtagatagcgg


tagaagaaatagtgaaagcctttttttttttttttttgaggcggagtctcgctctgtcccccaggctggagtgcagtggcgtg


atctcggctcactgcaatctccgcctcctgggttcacaccattctcctgcctcaccctcccaaatagctaggactacagg


cgccctccaccacgcgcccggataattttttgtatttttagtagagacagggtttcaccgtgttagccaggatggcctcca


cctcctgaccttgtgatccgcccgcctcagcctcccaaagtgctgggattacaggcgtgagccaccgcgcccgcccg


aaatagtgaaagtcttaaagtctttgatctttcttataagtgcagagaagaaaacgctgacatatgctgccttctctttctgc


ttcggctgcctaaaagggaagggccccctgtcccatgatcacgtgacttgcttgaccttatcagtcatttggacgactca


ccctccttatcctgcccccccttgtcttgtatacaataaatatcagcgcgcccagccattcggggccactaccggtctctg


cgtcttgatggtagtggtcccccgggcccagctgttttctctttatctctttgtcttgtgtctttatttcttacaatctctcctctcctc


acaggggaagaacacccacccgcaaagccccgtagggctggaccctacgttagcctgccctgctcggggttggcg


atgctggaggtgggccttggaccagagaaaatgctttaattaggtgacaagcgggcagaggcctttgtctctggcgcc


ggcagccacggcccccgctgacggcgtgggaaacagaccctgttccactccggtctccagccttggaatggttgcct


tcgtgcagtgcaggtctggaaagtagcagtttggcacgggaccctagaattccccaaaaggagtgactaggggctg


ggattctggaatttgagtgtggacggtgaggcggggggtgtgggagatcggagaccctggtgggcgcgggagcac


ctgcaggctggaggccctcgcgcgctccggcggcagcctggcaaacaggttctccatcccccaggaggacgcggc


agagggcggacgatcgctccactcgccgggaccaggtgcgggggccctgcccagccgctggggcgtggccagg


ctcgaagcacccaggtgtcgggggccgactctaagccctggcaccggaagagagagggcggcggattggacctc


ccggctccagcattgcaactgggcgctccgtctcctggtccacgcaatgatgctgcggctgctcagaagccaggtag


cctgccctgggtgaagccttcgcgcaggtcaatgacggggcggaggggcagggcgcggtcccctgcatccccgat


ctggggagcggtgggcccaggggccatcgccttagcccctggcgctggggctcggcgccaagtgacgggcgggg


ctccaccttccagccatccgcccggcccgggagggcggacgctgcgagactcccggccgcgccctctccttcctctc


ctccccaagccctcgctgccagtccggacaggctgcgcggaggggagggctgccgggccggatagccggacgc


ctggcgttccaggggcggccggatgtggcctgcctttgcggagggtgcgctccggccacgaaaagcggactgtgga


tctgccacctgcaagcagctcgggtaagtggggactgccccactcagttgttcctgggacccaggaacaactccttca


gaaccaggaggtgcacccccaacctcttctccaggtcttcctaaggccctaggaatctccgccacctccccagccatt


actcctccaggaaccaagatgctccttccgctcctgaccctccagcctctcttgttttacttgaactatcgtttcccatcacc


acctctgtggtggattttgcgcctcacagacaggtactcctgagaaacaggctggtggaagagtccagtatcagcgg


aacttacaggaggggagactcgagattccttcaggaaaggtgtaggaacctggaccactttcttttttttttttttttttttttttt


aagacagggtccctctctgtcgcgcaagctggagtgcagtcagcggtgctatcgcggctcattgtgagctccggggat


cctcccgccttagcatccggtgtagctgagaccacagacatgtgccaccatgccaagctaattttatttatttttttttggag


acggagtttcactcttgttgcccaggctggagtgtaatggcatgatctcagctcaccgcaactcccgccccccgggttc


aggcgattctcctgcctcagcctcccgagtggctgggattacaggcatgcgccaccatgcccggctaattttgtattttaa


gtagagacagggtttctccacgttggtcaggctggtctcgaactcccaacctcaggtgatccacccaccttggcctccc


aaagtgctggggttacaggtgtgagccaccgcgcctggcccatgccaagctaattttaaaatttttttgtaagagtgctct


gttgcccaggctgatcttgaactcctgggctcaagggatcctcccatctcagcctcccaatatgctgggattacaggtgt


gagccacagtgcccagccaaaccatggctatcttgaaaaccacttgtcttccagtccccatgccccgaaattccaag


gctctcatccctgaaacctaggactcaggctctccctacctcagccccaggagtctaaacctttaacttcctctttccctg


ggactaaggagtgctgcaccccaggcgcctcccttaccccacatccctcctcagcctcccctcctcagcctcagagc


atttgctaattcgcctttcctcccctgcagccatgtggctccggagccatcgtcagctctgcctggccttcctgctagtctgt


gtcctctctgtaatcttcttcctccatatccatcaagacagctttccacatggcctaggcctgtcgatcctgtgtccagaccg


ccgcctggtgacacccccagtggccatcttctgcctgccgggtactgcgatgggccccaacgcctcctcttcctgtccc


cagcaccctgcttccctctccggcacctggactgtctaccccaatggccggtttggtaatcagatgggacagtatgcca


cgctgctggctctggcccagctcaacggccgccgggcctttatcctgcctgccatgcatgccgccctggccccggtatt


ccgcatcaccctgcccgtgctggccccagaagtggacagccgcacgccgtggcgggagctgcagcttcacgactg


gatgtcggaggagtacgcggacttgagagatcctttcctgaagctctctggcttcccctgctcttggactttcttccaccat


ctccgggaacagatccgcagagagttcaccctgcacgaccaccttcgggaagaggcgcagagtgtgctgggtcag


ctccgcctgggccgcacaggggaccgcccgcgcacctttgtcggcgtccacgtgcgccgtggggactatctgcaggt


tatgcctcagcgctggaagggtgtggtgggcgacagcgcctacctccggcaggccatggactggttccgggcacgg


cacgaagcccccgttttcgtggtcaccagcaacggcatggagtggtgtaaagaaaacatcgacacctcccagggc


gatgtgacgtttgctggcgatggacaggaggctacaccgtggaaagactttgccctgctcacacagtgcaaccacac


cattatgaccattggcaccttcggcttctgggctgcctacctggctggcggagacactgtctacctggccaacttcaccc


tgccagactctgagttcctgaagatctttaagccggaggcggccttcctgcccgagtggggggcattaatgcagactt


gtctccactctggacattggctaagccttgagagccagggagactttctgaagtagcctgatctttctagagccagcagt


acgtggcttcagaggcctggcatcttctggagaagcttgtggtgttcctgaagcaaatgggtgcccgtatccagagtga


ttctagttgggagagttggagagaagggggacgtttctggaactgtctgaatattctagaactagcaaaacatcttttcct


gatggctggcaggcagttctagaagccacagtgcccacctgctcttcccagcccatatctacagtacttccagatggct


gcccccaggaatggggaactctccctctggtctactctagaagaggggttacttctcccctgggtcctccaaagactga


aggagcatatgattgctccagagcaagcattcaccaagtccccttctgtgtttctggagtgattctagagggagacttgtt


ctagagaggaccaggtttgatgcctgtgaagaaccctgcagggcccttatggacaggatggggttctggaaatccag


ataactaaggtgaagaatctttttagttttttttttttttttttggagacagggtctcgctctgttgcccaggctggagtgcagtg


gcgtgatcttggctcactgcaacttccgcctcctgtgttcaagcgattctcctgtctcagcctcctgagtagatgggactac


aggcacaggccattatgcctggctaatttttgtatttttagtagagacagggtttcaccatgttggccaggatggtctcgat


ctcctgaccttgtcatccacctgtcttggcctcccaaagtgctgggattactggcatgagccactgtgcccagcccggat


attttttttttaattatttatttatttatttatttattgagacggagtcttgctctgtagcccaggccagagtgcagtggcgcgatct


cagctcactgcaagctctgcctcccgggttcatgccattctgcctcagcctcctgagtagctgggactacaggcgcccg


ccaccacgcccggctaattttttttgtatttttagtagagacggggtttcatcgtgttaaccaggatggtctcgatctcctgac


ctcgtgatctgcccacctcggcctcccacagtgctgggattaccggcgtgagccaccatgcctggcccggataatttttt


ttaatttttgtagagacgaggtcttgtgatattgcccaggctgttcttcaactcctgggctcaagcagtcctcccaccttggc


ctcccagaatgctgggtttatagatgtgagccagcacaccgggccaagtgaagaatctaatgaatgtgcaacctaatt


gtagcatctaatgaatgttccaccattgctggaaaaattgagatggaaaacaaaccatctctagttggccagcgtcttg


ctctgttcacagtctctggaaaagctggggtagttggtgagcagagcgggactctgtccaacaagccccacagcccc


tcaaagacttttttttgtttgttttgagcagacaggctaaaatgtgaacgtggggtgagggatcactgccaaaatggtaca


gcttctggagcagaactttccagggatccagggacacttttttttaaagctcataaactgccaagagctccatatattgg


gtgtgagttcaggttgcctctcacaatgaaggaagttggtctttgtctgcaggtgggctgctgagggtctgggatctgttttc


tggaagtgtgcaggtataaacacaccctctgtgcttgtgacaaactggcaggtaccgtgctcattgctaaccactgtctg


tccctgaactcccagaaccactacatctggctttgggcaggtctgagataaaacgatctaaaggtaggcagaccctg


gacccagcctcagatccaggcaggagcacgaggtctggccaaggtggacggggttgtcgagatctcaggagccc


cttgctgttttttggagggtgaaagaagaaaccttaaacatagtcagctctgatcacatcccctgtctactcatccagacc


ccatgcctgtaggcttatcagggagttacagttacaattgttacagtactgttcccaactcagctgccacgggtgagaga


gcaggaggtatgaattaaaagtctacagcactaa.





SEQ ID NO: 3 (RHD):


aactccatagagaggccagcacaaccagccttgcagcctgagataaggcctttggcgggtgtctcccctatcgctcc


ctcaagccctcaagtaggtgttggagagaggggtgatgcctggtgctggtggaacccctgcacagagacggacac


aggatgagctctaagtacccgcggtctgtccggcgctgcctgcccctctgggccctaacactggaagcagctctcatt


ctcctcttctatttttttacccactatgacgcttccttagaggatcaaaaggggctcgtggcatcctatcaaggtgagagttc


attggaaaagtggtcacaggagcaaatagcaggggcaggggcgggggaggcctgtggttctccaggggcacag


atgttcctttctacaaaatcccaaggaaaaagattcccccatcttcttccgtagattgcaccgaaattcagccaacaatg


taagctttcctttagaagcagcctgggcatgccctcttctgtgaagcctgccttgatttttcagcacagtgagaggcatcct


ctttggtgttcctcaaattccctctaccaaatggtcttcataattctctgcttctctgcttccccttctctctcctcagtggcaagg


aatttttttatttttatagatttaggggatacaagtgcagctatcttatgcaagcaatttcatgttgttgggtttttggtttttgtttcct


ttttgtggcctctcgctcatttcttatttctttttgaggcagggtctcactctgttgcccaggctgaagtgcagtggcatgatcat


ggttcactgcagccttgacctcctagtctcaagcaatcttcccacctcagcctcccaagaagctgggaccacaggag


ggcaccaccatgcctggctaattttttttttttttttttttggtagagatgtgggtctccctgtgtttcccagactggtctcaaactc


ctggacacaagcgatcctccagcctcagtctcccaaagtgctggaattacaggcgtgaagcactgtgcccagctctct


tgctcatatctatactagttttcttttggaagcttcagcctgttgctaccccccacccccacccccaccgaccccagctttctt


ctcacttaggggctgggaagtctgcatgctgtctataaatccagaaccagaaggtatggctgaaggggagggtagg


atgatggttattttatattcagctaaaaatattcccagactgtgatgagacaactgtaaataagacagatgtccacaatg


gtgtgactttgcttttttaaaaatattgaaatgagtttcaggcatctcagtgggctgataggttgttgataatagacagggcc


tccttgaagaatgtccctgagacaaagttgaagcttgagcctggttgagtccttgcttgttcctaggttgatatgaacggct


agttaactggaagcaaagagaagtcatcctgggggccatggcagtgacaagtaggacttagggagggaagccctt


ataccatttaaggtgctggcccagagaggagccttcagtgacagacaaacaagagctggcacaattttaattcacttc


aatttactctaattcatttcaatccaatacaattcaatgcattccattcattcaaccatgtatgacatccaatgtgggatcca


gactcatgatgattagagctgatatttatgagcacttactatgtaccaggcactattctacatgctttacattgaaccctcac


aataacccaatgaggtgggtactattatgatcttcgtttttcatatgaggaaactaggcatatggatgttgagtaatttgcc


cacggtcgctcagctagcaatagcacagcgtatttaaatttagccaccctggatttagtttccttacacttaaccattatgc


atcatggccccattttacagtgggcttgagtctttgtcatataacccagtaggttagcagccactattccaaccctgtagat


tgactctagggtccatgttctttacccctgcaccgtgctactaacgtaggtacaaaatgtcctcagaaactcactttatac


ggaagctcagaggagggtccacaacccaggcaggggagacgatggtgtcaggggagggaggtgactgcccag


ccaggtcttgaaggctcagtaggaattacctgtgggacaaaggagggtcatccaagtgagggcacagtgggtgcca


tggcgtgcacacacaatagagcagactgagcctgggcttaacattgcattgccctggagcctaaaaggggaaaca


aagggccgggcgacgtggctcacgcctgtaatcccggcacattgggaggccaaggctggagaatcacctgaggtt


aggagttcgagaccagcctggccaacatggcaaaaccgcatctctactaaaattataaaaactggctgggtgtggtg


gcacacgtctataatccgagctacttgggaggccattacactccagcctgggcgccagagtgagacttcatctcaaa


aaaccaaacaacaaaaacaacaacaagaacaacaaaaaaacaaagaggagagcagggactgggtgtggtg


actcatgcctgtaatcccaaacactttgggagaccaaggcaggcagatcacctgaggtcaggagttcgagaccagc


ctggccaacatggtaaaaccctgtctctactaaaaatacaaaaattagccggatgtggtggcacgtgcctgtagtccc


agctgcttgggaagctgagggaggagaattgcttgaacccaggaggcagaggttgctgagctgagaacatgccact


gcactccaccctgggtgacagagtgggactctgtctgaaaaaaataatagtaataaataaaaataaagagggaag


cagcgggtggcagactcactgggctgcatacgaagtttggcttcagtctgaggtccgaatagtaaacagcagcgag


acaagtttgggtttgggtcatggaggaagccatgccagggctggtgttgggcacagggaaaggggcatggcttgag


acaccagaccagcgtggaggctgtagtgtagtattgacctgaggacttcaacattctgatggtgtacacacgattttttg


agcatgtaccatggttatatattacactttaagtattactttaagtattactacattaatatattttgtatgttacaataaatacat


acaaattaggaaaattgaaagagatcaaaatgaaatatataatattttcaaattactaatcataatggtgtcaatctcca


ggcagggtccattgctacagttgacgatagtggatgaaaattcactcctcagagtcttcttgataatttgaaattgtcttgat


tgacttgtcagatctgattagatcaacatgttttaaatctcgaatgtgactgacagcttgtacgaggagaagtttcactctg


ccttttcccttttgttcacttgactgccattatttctatgcttccaatctgtgtttttctgcacgagttggttaagccattacttcatttt


gtgaaagtttgttgagttaaacttaggtaacttaatctgtcaatccacttaattgaattcagtcctggtaaactataatagatt


attcaaacctgccaattctaaaaagacattttgagacaatcaggaaatctgaatatagcatgaatatcttacgatataca


aggattattgttaattttgttaggtatgataaaagcatggtgggttgtttttgtttttgttttttaagtctccatctgttagagaggc


acattgaaatggcatgatatctggggtttgcttttatgccagaaaaaagaaaaagtacagaaggattatagaaacaa


gattggtctcatgtgacaatcatcagagtttggagatgggcacgtagggtcatcgtgctgttctctctgttttcgtatatgcttt


aaaagttctgtaatagttaattaaaaaaaaaaaaaaacaccctggctgagcatttagggaggccaagtggggagga


tcgcttaaaccaaggagttcaagacgagcctaggaaacatagggagacccccccccatctctaaaaaaaaaaaa


aaaaaaaaaaactttaaaatttaacccagtgtggtggcacatgcctatagtcccagctactcagtaggctgaggtgag


aggcttgcttgagcctgggagcttgaggctgcagtgggacgggattgtaccacttcactccagcatgggcgacagag


caagaccctgtctcaaaaaaaataaaaatatttgaggtgaagcgaggctgtaataacaaatttaaaaatataaataa


aacataaaggctgggtgtagtggctcacgcctgtaatcccagcactttgggaggccaaagcaggcagatcacgag


gtctggagatggagaccatcctggctaacacgatgaaaccccatctctaccaaaaatacaaaaaaattagccgggt


gtggtggcgggtgcctgtagtcccagctacttgggaggctgaggcaggagaatggcgtgaacccaggaggcggag


ctttcagtgagctgagattacgccactgcactccagcctgggcaacagagcgagactccgtctaaaaaaaaatgaa


aataaaaataaatgaaacataaaaccctgccattagttgcaatatgaagaatatagagaaatgcatatcaaatccttc


tcattggaccaatattcccttagggcaccttccaaagctaggagactcaaggctgtatgacatcctgagcaagtgagg


ggtggcttctgggtgaatctgaatattaaatatttgcagaattgaaaacttcacaaagtacctttagagatagaatagcct


agatccatgtttctcaaagtgtggtccccagacctgctgcctcagcatctcctggaaatttagtagaaatgcagattctca


ggccctaggccagacctactgatcagaagctctgggcctggggcccagcagtctgtgttttcacaagccctcttggtga


ttcttctgtgcatgaaagttcgagaattcctggagctagactgattcaaatcttgcctctgtatcttagagaccttgggcag


attagtcaacctctttctgcctctgtttctacttctgtcagaggatgatagtacttgtttcattaagttgttgaaaggataaatga


attgacacacataaagagtattagcttttattatcaaaagctttttttttgagacagagttttgctcttattgcccaggggagt


gcagtggtgcgatcttggctcaccgcaacctccacctcccaggttcaagtaattctcctgcctcagcctcccgagtagct


gggattacaggcatgcgccaccacgcccggctaattttgtatttttagtagagatggggtttctccatgttggtgaggctg


gtctcgaactcccaacctcaggtgatgcacccgccttggcctcccaaagtgctgggattacaggcgtgagccaccgc


gcctggcccaaaagctttaatttcttaattttttaaataaaataaataaaactagaattgcttgttttcttccagctaccctggt


gattgtattgagcattttctggggtgtgtgttctttgctgtaatgactactggtctggatgacctgtgatgagaccagatggg


caggggcagtggaggagattctagagatatttaggagataagtcagctgtacttgatgaaaagagtggggagttaag


gctggctgcagatgtatgatttggcatagagaggtgccagttcctgagatgagagacagaaggggagggacaggtt


gtgaggatgaatgaacaatgatatgttcattctgggcttggagttaaggggcctatgatatgcttaggggaagcagag


agtatcaattacctattgctgcataacagccaccccaaacttagtggcttaaaatagtaaccttttaatttactcatgatcat


gattctgtggtgcaacaactgggctgggttcagctgggcagttcttctgttagtttcacccagggtcattcatgcatctgca


gtttggggtgggatggcctcagatgacctcattcacgtgtttggcagttggtgattcactgggggccattactgtaacaat


cgcctaccaggcagagcttccctaaggcttccaaactaggagactatcctgggtcctgtgctgtggataccactcagtc


ccccatccccaccccatattcctcaaaggcagagagaggggctactagaagacagaggagttttcccagtgacatg


taaacactccaaaccctggcaccttccacactgcagctttggtctgcccctttgggaaatctctgtttttcttcccaggctgc


tggaggggtgagagtcgccggtagagtagaggctgtgggcgaggaggtggcggcctcctgaggctgcagtggtctt


tccaggcagcagtgggagcacagggtggaggtcaaccctagagcctgggagagtgaagctgggtgtgacttcaga


gctgttggtgctgaagtttctgcaggccagaaggaggggcaagagtggggggggcgcagatccagaatcacgg


aggcagctgaccggaggaggcagctgcccaaggggatggactcagaaggccaaagtgctgttatccaaacgaa


ctctttgcaagtggtctctttgcaacaggcctgggggagagcagtcttgcctaaagtcacaccgctaatcagcggccg


gcacggggtaacagttactaacactcactacgtacccaatgctgggcgaagtgacttgcatgagccagcgagctca


atgctcatggcaatcctctgagcagctggcattgtttcatctcaattttacagctcaggaagctgggacacagaggaag


agccaggctctgaacactgacaacctgattgagagacccacactgttcatcaccgttacgctatatatgctgtatagaa


aggcaggatggcataatggttaaacctaggtaggtagggtttgaatcctcctgctaccatttactagctctgtgacttgga


ctagttatagcacctctctgtgcctccctttccccatctctaaaatggggataataaatcgtacctcctacctgaggctgttg


tgggctaagtctgtaaggcacgtagaacagtgcctggaacgtggggtactgtctatctgtgtgcctgctgttacaacaat


ggtgagtattgccttatctctcgctgctgaactaccaggttagacttctttctgcaagtcatgaggctttcataaacttttcctg


aaggctttccgtagaatgtacaattcccctctgggtccaggcatgggcgcccgggtagcacatccacttcttatcaccc


ctgaacaccttagagcccatcagcttatcaaaccagcagctgatgtgagtgcagagcagactgtgagaggtggagg


ctgataccagtgaggatgctccaagctgggacccagccctgaagcgggagcccagataatggatgggtggaaatg


ggcctggagcccaggagaagtgggaggatgagggggcagggggaggagaagcctgaaatcaaatgttatttcct


gaccagtttggggtgcatgagctctgtcaacagctcatggaaactgctgccctaatttcatcttgttggctgaggcacaat


tcctctctcagggacagtgtagagccttggggaggaaggccctgagcgcgtatacctggaatcagggaatcgggat


caggggcagcagctgtgcccaataaagcccccacccaggatcctctgacttcctcatctctttttttttttttttgagctgca


gtctcactctgtcatccaggctggagtacagtggtgcgatctcggctcactgcaacctcagccttctgggttcaagcgatt


ctcctgcctcagcctcctgagtagctgggattacaggcatgcgccaccatgccaggctaatttttgtatttttagtagaga


cggggtttcaccatgttggccaggctggtctcaaactcctgacttcaagtgatctgcccacctcagcctcccaaagtgct


aggattacagacataagccactgtgcctggccttttttttttttttttttttgtaaacagggtctccctctgtcacccaggctgct


ggagtgtagtggtgtgaccgcagctcactgcagccttaaccttctaggcacaagccatcctcctacctcaccctcctga


gtagctgggactacaggcactcgccaccacgcccaagtaattttgtattttttgtagagacaaggtcttgctatgttgccta


ggctggtcttgaactcctcagctcaagcaatcctccctccttggcctcccaaagtgctgggattgtgctgggattacaggt


gtgagccaccatacctggtctgacttcctaatctttagggccccaactctgcccttatccaggcaactctcctctccccat


cttccactaacttctttggaatattccagagctgtaaaagccttagagagtatcaagtccaactcctatgtgttacagaca


gggaaactgaggcctaaagagggtaatggacttgcctaagatcacttagtgaggtgagagaagaaagagctagag


acagcctagcctgtgcaaggacatagttccaggcattcagagctgggctctgctgccggcatgtttggggcctggtagt


tagttcactgctgaactaccaggttagattttctttctccaagttgtggagctttcataaacttttcctgaaggtcttccttacaa


tgtacaattctcctctgggcccggtcatgagcgcccctcacaggctctctctggtccccttctgtaaaatgagaggaaaa


tggaagaattgctctactcatggaatcttcaataagtctgggccctatgcatatagcattgctacaaaatggcagatgca


ctttaacaatcgtgtttaataaaaggttggatttgcatatctgaagtggggcatgcagtctccaactgaacacaagcctc


actgctcccgcatgtgcactgcaccttcatatacatatttcctgcttggctcctgagggaatttgagtaatcccaagagga


acccctgtagaaaatgtcccctggccacacacccccattcctaaggatgcaagcaggagatagaaacattccctgc


acctccctccttgttgtcagaagaagtgcaaagagttgaatccttcctaatgcccacttctcacccacgccccaaatccc


caggtcccatggaggtccttgggggcctcctatatcctggtggtgtcaggttgatttggaaatgtcagtgtcctcccttgtc


ctctctggcagaccctgggtatgtgtatgtttcaatggaagtgaatttaaatgtactttataaatcaaagactttttctgaga


ctttggagagttccagtaatgagagcttctcattgttatcaaggccagggctggagaccagtggcaggtgagttcctatt


gctgtgattgtcatgatgatgttgatgaacagtcactatttattgagcgttctccatgtgccagtcactgtactaaacattattt


cctttggatttcccagaaacctctcaggtgggtctaattacccttattcagctgataaggaaagtaagcaacttacaaga


ccacagggctatgaagtggaaacacataaattgatatttcattttatttatttatttattttgagacagagtctcactgtgtcg


cccaggctggagtgcagtggtgcggtctcagctcactgcaacctctgcctcccgggttcaagcgattctcctgcctgcct


cccgagtagctgggattacaggtgcccaccaccacatccagctaatttttttgtaattttagtagagacggggtttcacca


tgttggccaggctagtctcgaactgctgacttcatgatctgcccacctcatcctcctaaattggtatctttatatgtccaaaa


gagtcaactggtggcaatttagtgaggtttaatctaataggaaatgatagagctgggatcgaacagagccatgtgaac


tcaaaacctatgcttccccttccacctttttgaaaaacattgtctaggctgggcacgatggctcatgcctgtaatcccagc


actttgggagacggaggtgggtggattacatgaggtcaggagttcgagaccagcttggccaaaaattagccaggcg


tggtggcgcgcgcctgtggttcccactgaagcacaggaggctgaagcacaagaatcacttgaacccgggaggtgg


aggttgcagcgagccgagatcgcaccactgcactccaacctgggcaacagagagactctgtctcgaaaaaaaaa


aattgtctacatgctggttgcagaaaatttaaacactaaaactaaaaaagtaaaacacctcccaaacttagagacaat


attaatgacggaaaaaaaattcttcaagatctctctctctccagtcatttattcatgtgcgaaaacagttggtgattattgat


aaaatagcttttagagtttggagcaattatgtgcattacatataccatttgattctggcaacctaatgaaggagtatgatca


tttcccctatttaacagacaagaacaagaagagggagggcagatggtgtggtagtctaaggcacaggctccagcag


attatctaggtgtaaatcttggctgtaggccaggccctgtggctcatgtctgtaatcccatcactttgggaaaccgaggtg


ggcagatcacttgaggtcaggagttcgagaccagcttggccaacatagcgaaaccccttctctattaaaaatacaaa


aattagccgggcacggtggcaggcacctgtaatcccagctacttgggaggctgaggcaggagaatcacttgaaccc


aggaggcagaggttgcagtgagccaagatcttgccactgtactccagcctgggtgacgagtgaaactctatctcgat


attaaaaaaaaaaatcttagctctacccaccggggcaagttacgtaacgcctctgtgccttggttttcatatctgtaaaat


ggtgacagtaacagcacccacgtcaaagtgtggttgtgagaacgaaacaagatagtctatgtaaagtgattaaaac


agcgtaggcacatggtaaacgcttaggaaatgtaggctgttataaagctcagagatgttaagtaactagatcaagatc


acacagttagagggtgccagagtcctgatttgaacccaagtttgtctcgttctggagctcaagctgctaaccctttttcaa


aactggaattaaaccaaagtgctcaccctccgctttgctgggcccctccctgccctcaggtgcgtctcttccactcacct


gccacagcagcctctgctcagggtctgagaccgggaaaggtgagggctacccaggtggccctgatgttttctgccag


ccagctcaccaggtccctcgcagcaggcggcaaagggagggaggtttgctgtgaagattatgtggttcccaacaac


aagagcgctgggcctatctctgccctctcttttctgtgtgtcctgggacaagtcacttggcttctgtggcttcattttctcatgt


gcccagccagggggttggccctcatatgcaataacagcagcaatgacctttactgagtgtccatgtgcgtcaagcac


gtgtgctttacacttgttcttattattaggtttaataatagaataattgccacatttactgagcactcattatgggccaggccct


gccctaagtgcttaattagctttagctcctctaatccttatcttatccccacacggcatgttatgttatccccattattcagttga


gaacattgaggctcaaagaggcaaagtaacttgaccaaatacttgtaaacgatcttgcatgccccttccagctgccatt


tagtaagactctaatttcataccaccctaaatctcgtctgcttccccctcgtccttctcgccatctccccaccgagcagttg


gccaagatctgaccgtgatggcggccattggcttgggcttcctcacctcgagtttccggagacacagctggagcagtg


tggccttcaacctcttcatgctggcgcttggtgtgcagtgggcaatcctgctggacggcttcctgagccagttcccttctgg


gaaggtggtcatcacactgttcaggtattgggatggtggctggatcacttctgggtcatagagggaatggaccccgaa


aggacaggttccagaagatctgggatattgccccctctctgtctagcaccagtgctgtgcaatatttaggacatccttata


ctaaaagattattcattgtttaaaattcaaattaactgggcatcctgtattttactggacagccctactccgtgtatcacaag


gaatccaggcctacattcctcctgcatcctttctttcctgttattgtcgattatgattttgtaaagttacataatcaatataagttt


atggaaaacgtaagaaggaaacacgttagacagagagaaatagacatgccacacctagagagacattctattttttt


ttttttttttgagacggagtttcacttttgttgcccaggctggagtgcaatggcgctatctcggcacaccacaacctcagcct


tctgggttcaagcgattctcctgcctcagccgcctgagtagctgggattacaggcatgtgccaccgcgcctggctgatttt


gtatttttagtagagatagggtttctccgtgttggtcaggctagtctcaaactcctgacctcaggtgatccgcccgcctcgg


cctcccaaagtgctgggattacagacatgagccaccgcgtccagcctgagagacattctcttgaaaagaaaggactt


tcagccccctaatgctgctagacaataaatagccatgcctttattttcattaaattacctgtgctttgtttacatgcatttgtgtg


aaatgctaagaaccatcacaactaatgtatggtgccagaagtcagaatagttgttacctgggcaggaggtggatattg


attaggaaggaacacaaaataaccgcatggggtgcagaaaatgttctctatgttcacctgggtgatgattacacatca


agctatacacgttttaaaagggcattggcacttaataggaggaagtaggctaaattttttcctgaaacattgttttgttttgtt


caaacctctgaatccctgtgctgcccagatgatggtaaacgtcatcctaggcatcttagggacctctcaaggccattcc


agcctccccttctaagaccctgctaaacctctgggcactgctgttaaacatttctctatgagccaggaactgtgctgagc


actccacaaatattattttgtttaactcttccgggtagggatctaacctggtatacaggtaaggaagtggaagctcagag


agggcaaggcacttgcctagggccacacagctaagtggtggagatggctccaactttttattataaccttttccacatgc


tccagagtgctcagaacatgaaacacagtctagccagctcccgattggccctggagggaaaaaactttatatatttttct


tttttaaaaggtttagaggctgggcatggtggttcacacctgtaatcccagtacttttgggaaccgaggtgggcagatca


cttgagcccagaagtttaagaccagcctgactaacacagtgagatcctgtctctgcagaaaatagaaaaatcagcta


ggcgtggtggtgtgcacccacagtcccagctacttgggaggctgaggcaggaggatcacctgaacccagtgaggtt


gaggctgagtgagccatgatcgtgccacttcactccagcctggacaacagagtgagaccctgtctcaaaaaacagtt


ttaggggccgggcgcagtggttcatgcctgtaatcccagcactttgggaggccaaggcggggggatcatgaggtca


ggagatcgagaccatcctggctaactcggagaaaccctgtctctactaaaaatacaaaaaattagccgggcgtggt


ggtgggcgcctgtagtcccagccactcgggaggctgaggcaggagaatggcgtgaacccgggaggcggagtttg


cagtgaaccgagatggtgccactgcactccagcctgggtgacagagcgagactccgtctcaaaaaaaaaaaaca


aaaacagttttaggccaggcgcggtggttcatgcctgtaatcctagtactttaggaggcctagcaggtggattacctga


ggtcaggagtccgagaccaacctgagcaacatggtgaaatcctgtctctactaaaaacacaaaaattagctgggtgt


ggcggcaggcacctgtaatcccagctacttgggaggctgaggcaggcgaatcacttgaacccgggaggcggagg


ctatagtgagccgagatcgcaccattgcactgtagcctgggcgacagagtgaggctctgtctcaaaaacaaaacaa


aacaaaaacagtctatgagttaattcccaccagaattcaatacacacacgcacacatgcacgcatacacacactgt


gtccacctgggaagtgacaaagggcaccctgggggatttcaaatggtggtggccctggtttggtgttgctgccttagctt


aaggtcacaccagccttcagcctcctgccccacagtctagggctgctcccctcatctgatgtccacagggacctgtttgt


tcttgactcaatctagaaagacgagaagggagagaagtcactcgcagcctgagtgaactcccctgccccacccctg


actgcttggatccccctaggggtgacccctgctgaaactggctccttcctgaccggttcccgtcagggctgtgctgatgg


gtggtgcccaggcctgcccctggggacggggtactctcccttggcaacactccagcttgtgccacttgacttgggactg


atttggttctgttttgagtcccttcaggggaggggcctatcttattcaacgttgttgtttgttttcctcacatactgataacttagc


aaatggctattggagcaaaaatgaaaataaacggaactctgaagtgggatgttttaaaattttatttatttttttagagaca


gggtcttgctctgttgcccagtctggagtgcagtggtacaatcatagctcattgcagcctgtgcctcctgggctcaagtga


tcctcccacctcagcctcctgagttaaatttttttacaggcgcctgctaccatgccctgctaatttttgtatttttagtagacaa


ggggtttcaccaggtgggtcaggttggtctggaactcccgacctcaagtgatccacctgcctaggcctcccaaagtact


gggattacaggcgtgagccactgtgtccagcctaaaactgtttttgagacagggtctcactctgttgtccaggctggagt


gaagtggcatgttcatggctcactcagcctcaacctcactgggttcaggtgatcctcctgcctcagcctcccaagtagct


gggactgtgggtgcacaccaccacgcctagctgatttttctattttctgcagagacaggacctcactgtgttgctcaggct


ggtctcaaactcctgggctcaagtgatctgcccacctcggctctgaaaagtactggaattacagcctcctgagtagctg


agaccacaggcacacaccaccacacctagctttttttttttttgctttttgtagagatggagtctcactatgttgcccaggct


ggtctcaaactccaggccttaagcaatcctcccacctcagcctcccaaagtgcgaagattacaggtgtgagccacca


ttcctggccttaaaagtgtgatatttttaatgtattttgaaatctgcaggactctccctagaagataatagcaataaccaact


cctttattgtgcttgacgtatatcaactcactttgcccttaccgtggctccagaggcattgggtccaccttataaatggagg


caccaaggcacagagtgattaaataaattgcccaggatcacacagccagaaagtgtctgagtcaagattccagccc


aggcagcctagacctgagagcacgctcctaaccactgcacatcactgtcttagcacctcctcagcacaaactggccc


ttgaggaatgaaataccgccgccggcacacacgctcctgagttaagcctttgtcaatgaaatgaacacccacttaaa


aggaataacctgtccaggcacgatggaacattgagtaaccccttattctaaattcctggtccctgtaagactccttcccc


atgcccttgcccttttctgaccttcccctaaagtccttgaggcttaagcgggcatagtctgcagcaaacactggggaagc


tgagtccagacttcagagcacaggctttggatctaggccagctggatttgaacctcacatttgtgatcagctggcatgac


tgtttccaaaaagtccattttaatcctctacgtgaccctctgtaaaatgggatactgaatggtgagctagcacgattttaca


gagagtgaattttttttgtgtgtgtgtgaggcagtcttactctgttgcccaggctggagtgcagtggtgcagtctcggccca


ctgaaacctctgcctcccgggttcaagcgactgccatgcctcagcctcgagagtggctgggattacaagcatgcacc


accatgcccgggtaatttttgtatttttagttgagacagagtttcaccatgttggccaggccactcttgaacccctggcctc


aagtgatccacctgccttggcctcccaaagtgctgggagtacaggcatgagccactgcacccagccttatagggtta


aaatttaaaagaggtgatgctgttacaagcctgttttacaaaatgctcttataataaatcattatcatcactgttgctgtggtt


gtagcatcatcatcattaactcccagagggaggagggagtctcagagcaagctgctcaggggagactggatgtcca


tggattgtccagctcagtaccacttcctccaggaagtcctccctgataagtccagtcagcatcaccctctccttccaatga


accccactagccttgtgatatcacagatattcttagttgacaggctcatggtgtagcctgtctagatcataagtacattttttt


tttttttggatcataagtatcttcaagaccaaaataattttctactcctgagcatgctcattggtcaaaggaaggaaggaat


cataatagcgttaataaggctagcgtcttttcagaagttggttctttgtgccagtcttggtgctagacacaccgataggaa


gaatactccttcacatccccaggacaccaacatgggatacgtttgatcatcattcttaatttgcagaaggagaaatagg


ctcagtgagatgaaatagccactccagtggcaaggctgggactggaagccgggcttgtcctgattccaaatccagttt


ctttccactgccacggagacggagagaagggacagtggccccagatggggatggggtgactggatgtgggcagg


cctgcgggggaagagtgccctctgttgagcatccgaatgatggcagcagaaaagaagactgggcagaatcccagt


tatcagatcccctgagggaacagtcaccccgatcaccctcagtcagatgagtgtgtgtagatcaatgcctcatagatg


aaggcactgaggcacagagtggttaagtcatctgccagaccacatggctcagggtgcagaggccaccttaacggg


agaagagatggtcactccactctgcagcatcagcgcccaggtgggtagaaatcttgtcttctattcccacagaaagta


ggtgcccaacagtgtttgttgaaagaatgaatgaatgaatgaatgaatgaatgaatgagtgagaggcatccttccttct


cagtcgtcctggctctccctctctcccccagtattcggctggccaccatgagtgctttgtcggtgctgatctcagtggatgc


tgtcttggggaaggtcaacttggcgcagttggtggtgatggtgctggtggaggtgacagctttaggcaacctgaggatg


gtcatcagtaatatcttcaacgtgagtcatggtgctgggaggagggacctgggagaaaagggccaaaagctccattt


ggtggggtttccagggttttgaaaaataaagacaacctgtaatcccagctacttgggaggttgaggagggaagatca


cttgaggccaggagtttgagaccagcctgggcatcatagcaagatcctcatctctaaaaagtaattttttctaaattatcc


agttgtggtggcatgcacctgtagtctcagttactcaggaggctgaggtgtgagttggaaggattgtttgagcccagga


gttagggaccgagctgggcaacatagcaagacctcatctctaaataaataggtaggtggatagacagatagataga


tagacagacagacagacagacagacaggctgggtacagtggctcacacctgtaatcccagcactttgggaggcca


aggagggcagatcacctgaggtcaggagttcaagaccagcctggtcaacatgggggaacctcatctctactaaaa


atacaaaatttagctgggcatggtggcaggcgcctgtaatcccagctactcaggaggctgaggcaagagaatcgctt


gaacccgagaggtggaggttgcagtgaaccgagatcgcgccattgcactgcagcctgggggacaagagcaaga


cttcatctcaaatttaaaataaagaaaaaagaaaagaaaagattgatagatagatagatatccaaatgagtttacaa


aaatgtggtctgtgcaaatgtttaaacacaacaaaccaatgcctttaactactacagtataatcctgtaggattgtgctatt


catgatataattatggttatataaaagtaattaattctcagagcctcaccagcagtgggtccagcaagtttgtacagcca


gcatcttctttcagtcagtgcgtgtcagtaactgcatatgtcctctcattgggagagcctgtcgaaagtctaaatttgaagg


cagctgtgaaggtaaggccaatccaaatggctctcccagatcctctgctgtaaccctgaccctgagtgaggacatag


ccaaccttcccatctcataggtgagaaagctgatgcctggagaggggaagggactgcccaagatcacatagcaag


atagtggcagaacccaagcgagaacccacagttccagcctggcttagaagaaagtgcactggacttggagtcaaa


ggctggggtttgcatcccagctctgccataaatccctgtgtgactctgggcaatttaacctcttagagctttagtttcttcatc


tgtaatatgagggtagcagtactaccacatagggttttgagggagtaattgaattaatcacatgagatgatgcatgtttac


aaaaaaaagcatgaagcccctttactgtgcctcagtgtcccaaaggactttggattttactctgagaaatacagggag


aactagggagtgttgggcagaggagagccatgatctgacttatgttttaagatactctggcttctgggttcagaaaaga


ctgaaggggcaagagaggaagcaggtggagaccagagcggcagtgattgccatcatccagactcagactagga


caatagctgtgagagtgatgggaagtggttggatcctgactgtattttaatagcagaattgacaggatttgctgatagact


gcacgtgggggggagagggtcaagatgacttcaaggttctcatctggcacaactcagcggctgctggtgccatttac


tgagatggggaatgttggggtgggatagatctgggagggaaaacccagagttcagtgtcgaatgtggtagcgttagg


gttaaggttgggggagggggggtagagatgtgtatgaaacatcccagtggagacactgaatggagatgtacaagtc


tgaagcttagtggaaaggttagggctagggatataaatttgggagttgttacaatacagatggtgtttaaagccatgag


acccaaggagatcactcaggagtgaggataaagagagatgggaagaagtctgaggactgagtcctagaacaccc


tgcattttagaggggggacatgtgtaagagccagcaaaggagacagaattgtgcttggagaggcaggaggaagcc


caggagagcgtgaggtcctggaaggcaaggaaagagagggccccaggtgggctgaatgctgctgagaggtcaa


gtcggatgagggctgggaagtagccattggatttggccaggagaccttggcatgcatggttgtagaggaggatgaag


gcaacagcctggcttgactgattcaagagcaggagatgagaaagtggagacagcatgcaggggcagctctgcca


aggactttgctataaaggggaacagagaaatggaggagaagcaggagggcaataatccgatagagaggaaaa


atctgatgatacagaagagagatgaactgcaagagtcaagcctttgagttggaaagcaggagtgggattttgagcac


tgatacctttaggccgatgcagggacagttcatcttttttttttttttatacaacattttatttaaaaaaattattttcatagaatac


attttcacattagagattcccattgtgcggaaataacaatttattacttatagttttatatttgtggacagattgttttagaacaa


gtagaatacatttgagaattaaatctcagtttacaatggataatattttgatatgtctctggggaaacttgcccttaaatgga


acttctgtatcttcagaagcactccaagcgtttcttcctaggatttagaaatttataatatgagatagcagcatttcctaatttt


aaaatttccctagtatatgtaaccatcagtaggtggtatctactgactagagagggaagtttttgaaaattaaacactgtc


taattttctgcaaagtttttattcatgaattaagagtatttccctttgtccattattcccaaggcaaatatggaaatttgatcatg


tactaatcataataaagctggattctctttaagagattgagaaattaaaaggcaaaagctgatatatcatgtttagttatat


tgtgagtcttataagaagctgggaggcaaccccattaactcaccagaatacagaactcagtctcacaacttagatata


attcctctcaaaccttttcctcaaagattaaattctgaaaataatcttgtgattaagagaagaaggctgtccaccaatggg


cttatctgttatttcttccttattgtgagcttaatggcatgacaaagcagaggcaaagaggcatacatcaattcttcaaagt


aggaagtcaaaaaggtcagagcttccacagcatggcaacagctttgcagatgcccacatcgtgatagttgaaatag


caaagcccagcaaaggttaaagctgaaaatgccaaaagccctgccttggcagctttctgcgaggcatccccatgaa


cataatcagtaacaacttgtccaaggccccagtgaccatgaagagtgagggctgcagccagggaatagtccgtcgc


agagcaaggattcaaataagcagccggaagcagacccgggagcaaaacactgacaaccctctcgctagtccagt


ggagagatgcagccttggagccagaatggtggctcggtgacaagtgtatgtgctgcactccacaccattctgggata


ggtcggtcctgaagaaatgctgagatatgagcaggtctgaccactggagttcgcagcaacagagctcggcctccttg


ggcaccgcaaacggcactcagcctccagggaaccgccatctcgttcctgaggcggagagttcatcttaacgagaga


aatggcagggactgtgaataggccggcagatttggtggcgggtgccacaggttcagtctcctgcagggagaggaga


aaatgccttactaattccttgtattttctcagagaaacaagaggcaccgtcatcagcctcatgtgagggtgggaaggag


ggatggggtttgcggagagggaaagtgtggtatggtcatctgtgggagtggaagagagtgagagggctgcaggggt


gcagcgggactgcaggctggcaccagggtccctagggcttgtagttggtggaaagtgcatcagtgaccagggctgt


gtgcagctgctccaggcaggtgtggaagaagcagagttgaacttgcccagcctggagtgctgcccagagtgagccc


aaagcccaggggagaccagagatggggctgtttgcaaaggaggaagtataacagtagcccacaaaatctgagct


ggttaagaaaggagagagagtgaaaatggggagcccagcctggcagcctgggtacacatctcagctcaacccac


actagctgaatccatttgggccccttcgttgacctctctgtgcctcagtttccctatctatagaatggggataagaataagg


ctacttcctagggctgttgtgaggattgaacaagtgaccgaacacttgttcaattttgaacactgttctaaagcatttagga


cagtgcctggcatggggtaagtgttgcggcagtgctgttattttcatcatcaccattgttctcaggctgcgttgattggagct


gctgaagggaggcaatttaaggaagtgagccggacagataggaggtggtggtggttatcaggtgcgatgcttgaaa


ctgaggcttcggaggcaacagttactggtaatgacaaggtctaaggcttgacagtgggtggcagaagtgtaacgca


gggaaagagacgagcggtcaaggagccgagagggaaggagttgggtggactaagatcatttgtggaagaatgat


ggagagaaaggctgaagggcaggggctgacatcatcagtgaccaagaggcggccgggaggctgagaccacag


caagaaagggagagtgtgatggcatcttcttcaagggagctggggatgtttggggtggaaaaaagaacaatggtct


gggagggaatatgggaaatttttttttttttttttttttttttttttgagatggagtttcgctgttgtcatccaggctggattgcaatgtt


gcaatcttggctcactgcaacttctgccttccaggttcaagtgattctcctgtctcagcttcccgagtagctgagattacag


gcacacaccaccacgcctggcttacttttgtatttttagtagagacggagttttgccatgttggccaggctggtctcaaact


cctgacctcaggtgatccacccgccttggcctcccaaagtgctgggattagaggtgtgagccaccgcgcccagcctg


gaagtttgtatttattaatttttggttgtcttcatctgtgtatgtgactttaacccctaaatacttcagtgtacatttctttttttttttttct


ttgagacagagtcttgctccatcaatcacccaggctggagtgcggtggtgtgatctcggctcactgcaacctccgcctc


ctggattcaagcaattcttgtgcctcaccctcccgagtagctgggattaggggcatgccaccatgcccagttaatttttgt


atttttagtagagatggagtttcaccatattggccaggctggtcttgagctcctggcctcagttgatccacctgtctcagcct


cccaaattgctgagattacaggcgtgggccaccataaccggcctcagtgtatatttctgatgcagttgggttctgtatccc


cctccaatctcatctcgaattgtaatccccacgtgttgagggcatgacctcgtgggaggtgattggatcacaggggtggt


ttcccccatgctgttcttgtgacagtgagtgggttttcaggagagctgatggtttgaaagtgtggcacttcctctctctctttct


ctctctctctcacctgacaccacgtaagatgtgccttgcttccctttcaccttccaccatgattgtaagtttcctgaggcctcc


ccggccatgccaaactgtgagtcaattcagcctcttttgtttataaattacgcagtctcaggaagtatctttatagcagtgt


gaaaacagactaacacaatttcctaaaacaaggggacattctcttacataaccttttttcagttaacaaaaatgagaaa


ttgacattgatatattatgattaccttattctcatttcaccaattttctcaataatatcttttctagaaaaaaatatatattttttgtg


gtcgaggattacatcttgcatttagttctcatgtcttattaaattccatcaatctggagcagtttcttcatctttctttatctttcatg


accttgacatgttttgaagtttcgagccagttcttttgtagaatgtgggtttgtctgctgttcctcatgattagattgtgggtatg


catttttggtaggaattctccaagagccgtgtgtgcccttcttagtatatcatatcagaagacatgctatcaatttgccccatt


actgggtgtgttaactgtgatcattgggttaagatggtacctgccaggatcttccactgcaaagttactattttcccctttgta


attaataaacatcttgtgaggagataatttcctatagaaatcctgttgatcatccaactttcacccactgattttagtgttcatt


gattcttccctgaataaattagtactataataattgccaatggtggttttctaattccatctttccttcagtagttggcattcttct


gtaaggaaaagctttcgcttctctgttcatccactcatctatgtacttatttatatcaccatgggctcctggattccggtttaca


cacttccattttctgccttttctctctgcttaatataaggattaatgagaactccctgattcccaggaagaaaatgtcagcag


agctttcttaggcggaatgaagagaattcagtgtaagaaccataaaggtgtatctgtgtagtatggacagttttaaaaa


acaaacaaacacaaagaacctccaagggcaggaggtgctgccagactcaggagggcactagaactggctatga


gaagccactgagatcccaggtagtctgtgctctccatcttttggctcttattctctccgtacatctaacatctctgtacacca


gctttctctttagcgaaaaacgtgtcccctccacccacccatccacctccacttgttcctgcatttctatgtcccagatcctg


cagaaaacaactcttttctctcagttagtctcaattctgtagtccagggagagagaatctgatcagtcccctgggtcattttt


ccactctggtccaagcagctacagctggcatgggaaatagttcacacagtaaaaacatggctgtcaagaagagga


gtaaatttcagaggcagaacactccctgtgagcccgaacctcttcctgctttgttgcagtcttcataacgattgctttaaaa


gactgcattgatataacatcatctctcttctctgcatctttgacttgctagcttaactggtctagaggagggcttagcactgat


tttgagtattcattttcctcaaaacttcaattcagcctgggtttcttcagcaggagggcccgggggaaccagagccaggg


accagagtcatttcagtgcaccagctcaagaaatgaatattccaggccaagaatccccaagtgttcttcctgaactcct


tcctggtggagttcaaagagatgaaaaacacaagcccgcttttcagttcttatcaggaaactgcatagactttcctcttta


tgtatgactgagggctttttaccatcatttgttcccttcacaaatatttatttggtatttactatataccagggactcttgtggca


gtggaaaatacaactctcatggaacgtctgttccagaaggaaagactgccaataaacaataaaataggcaaaaga


tatagcatgttagagagtggtaagtaccacagataaaaatgaaatggagaaaagaaacacgaaaagttggggag


agaggataactgtttgagagggtggccaggggcagcttcatcttatcaagagggtgattttttgagtacagacctgaag


gtaacgagtgcacaagccatatgggtacctgagaacagcggcagaacaatggcagggtgctgggagggctgttta


ccagccacgctgtttagaattgtcagcacatggtgataaaaaaaaaaaaaaaaaaaaaaaaacaggctgggagc


agtggctcatgcctgtaatcccagcgctttgggaggccaaggcggatggatcacttgaggtcaggagttcgagacca


ggctggggaacatggtgaaaccccgtctctactaaaaatacaaaaattagccgggcacggtggtgggtgcctgtaat


cccagctacttgggaggctgaagcaggagaatcgcttgaacccaacgggtggaggttgcagtgagccaagatggc


accagtgcactctagcctggcgacagagtgagactccgtctcaaaaataaataaataaataaatacaaataaaaag


cagacagactttttagttggctttagaattcttagacaccctctacagacaaggcaccccgattgcttgcacccagggtg


gactactccctccaccctgcccttgttacaccctggctgggggtcagcatttcaggcagctgaatgacccaaagtggg


aacacgctagtgggtttgaggatgagcaagtggaggagggcaataggaggtgacgcccgagaggtcaggtgaga


gtggatcctgcagggtcgtggcaagaacctggaccttgactttgagtgacatgggagccgctggaggcttctgagca


gaggagtaacatgatctgacttgcattttattttatttatttatttgacgcagtgtcactctgtcgctgaagctggagtgcagt


ggcgacatctcagctcactatagcctccgcctcccaggttccagtgaatctcctgcatcagcctcccaggtagatagg


attacaagcaagcatcaccacgcctggctaatttttgtatttttagtagagacagggttttgccatgttggccaggctggta


tcgaactcctgacctcaggtgatccacccacctcagcctcccaaagtgctgggattacaggcaaaattagaatatatct


agaatttcctgaagaccttagtttggtattataagaagtctggttgcttcatgttgcaaaatttatatcactcatcactcccgc


agagttaaaattccgctgagaagtaggaatcagtgaggtgcgtgtccatgtgggtttttgccacacctaagtgaaccttg


gtcaaaagcatataagagctactgataggccgggtgtggtggctcatgcctgtaatctcagcactttgggagggaagg


atctcttgagcccaggagttcaagaccagcctgagcaacatagcaagattccatctttacacaaaatttaaaaattggc


caggcatggttgtacattcctgtaatcccagctactcaggaggctgaggtgggaggattgcttgagcctgggagttgga


gactacagtgagctgtggccacaccactgcactccagcttgagcaatggagcaagactctgtctcaaaaaaaaaaa


aaaaaggccaggcgcagtggctcatgcctgtaatcccagcactttgggaggccgaggcgggtggatcgcctgaggt


caggagtttgagaccagcctggcaaacacggtgaaaccccatctctactaaaaatacaaaattagcccagcgtagt


ggcgcatgcctgtaatcccagctactagggaagctgaggcaggagaatcgcgtgaacctgggaggcaaatgttcc


agtgagccgagatcgtgccattgcactccagcctgggcagagcctgctgggttgggctgggtaagctctgaacacca


gtctcatggcttcaagtcacacctcctaagtgaagctctgaactttctccaaggactatcagggcttgccccgggcaga


ggatgccgacactcactgctcttactgggttttattgcagacagactaccacatgaacatgatgcacatctacgtgttcg


cagcctattttgggctgtctgtggcctggtgcctgccaaagcctctacccgagggaacggaggataaagatcagaca


gcaacgatacccagtttgtctgccatgctgggtaaggacaaggtggggtgagtggtctcctacttgggctgagcagaa


tggctcagaaaaggctctggctgaaaaaatctccctcctttaccaagttcccctgggtgtctgaagcccttccatcatgat


tcatttctttgagtagtgtttgctaaattcatacctttgaattaagcacttcacagagcaggttcaggaggcctggggtatgc


agatttcaaccctcttggcctttgtttccttgtctgtaaaatgtggttagctggtatcagcttgagagctcggaggggagac


gtgacttccccatctaactctaagtgacaaggctgagactctccagccctaggattctcatccaaaacccctcgaggct


cagacctttggagcaggagtgtgattctggccaaccaccctctctggcccccaggcgccctcttcttgtggatgttctgg


ccaagtttcaactctgctctgctgagaagtccaatcgaaaggaagaatgccgtgttcaacacctactatgctgtagcag


tcagcgtggtgacagccatctcagggtcatccttggctcacccccaagggaagatcagcaaggtgagcagggcgct


gcccttgggcagcacttgggtctaacaggactagcacacatatttatgcccctccccaccccagggccagcgtgggtt


gggagagggcatgccgggtggtggagctgtgcctgcctctacagtggagctctaggtagaatgctgggtggtcacag


tgggcctgggactcaggagactgtccagtgatcaaaggctttctgggggtagtgattaaatccatccatgctaacatga


aacagacctcagtttgaaccccatttctgctagttgctaaagtcagtcaccatgagcgagagtcagcagcaacagact


agactagaattagccagcctctctcttccccccaacaaatttcaagaatggaaccatcagaatcagaagtagagaag


tatgtgacactagccatgtggctctggtcaagccacttcaacgttttgagtctcagtggcctcatctgtaaagtgggaatt


aagagatggtgcatgtaaagtgcttaacggggagtaaatggtaggcaaacattagctgctgctattagtaaagagag


acgatggtgtgtgtgagtcttgtgggcagagatgggtgagaggggagacaaaacaagttctcatgatgatggggga


aggggctccagctggtggtgtcggagggaagtctggacagaccagtggggggctcgggtgggaggcactgggg


gggctggagtggaaagaatgtggccacagatgacagcttcacagcagaattcagtgctaagaggaagtgagtggc


catgagttccatggtgacagaaagtctaagacacccagcaaggcaggagtgggtgtcaactcagggaagcccag


aggctaatcctaggtgagagctgagggtgtcagataagagcaaggcaaggctccggttctggagcagtgaaggac


atagcagagctatgacccaggaacaaggcccagcttattgaaactgggcccagtcacacagggggcacaggca


ccaagtagccaataataataataaaaacaataacaatgatttgtgtctactgggcatttattcatgttctatgccagacac


tgggctaagagctttatatgtggaaactcatttaatccttacaataaccttatgaagaaggtacatccaaaaccccattct


tctaggccaggtgcagtggctcacacctgtaatcccaatattttgggaggctgaggcaagaggattggttgaggccag


gagttcaagaccagcccaggcaacatagcaagaccctgtctctaaaaaaaaaacaaaaacccattcttcccgctg


cccagggacacaccactaatgagtgtgatgggtgcctaggatgctgagcacctggacttcccagctcattccctaaat


gctgcacaatcagggtaactgtgccctgagcctaagaggcagtagtgagctggcccatcatgtccactgatgaagga


cacgtagccccaacacaggggagaagtggtttcaggatcagcaaagcagggaggatgttacagggttgccttgttc


ccagcgtgctggtcacttgcagcaagatggtgttctctctctaccttgcttcctttacccacacgctatttctttgcagacttat


gtgcacagtgcggtgttggcaggaggcgtggctgtgggtacctcgtgtcacctgatcccttctccgtggcttgccatggt


gctgggtcttgtggctgggctgatctccgtcgggggagccaagtacctgccggtaagaaactagacaactaacctcct


ctgctttggctgaaggccagcaggacgctgggacctgatgggccactgtgcagtgcacagctgcattaggcaggtgt


cggcgcattctcttattggcttcaacgcctagtgagggatccatcctggctcggtggcgcatttgttaagatgctcgggag


caggtggcagaacccatttgagcttgcttgggcattggggagaatttgttatcaggctactggggtgtcacagaactca


aggacagggactggagtgttgtggggagccccgaagcccctgttttacttctttctttgcttttcctgaatatctgctttattctt


actctatagacatgcttcctcctctttcaccccacattgtggggtgtagtcttttgcttcaagaaagcagcctggtggatgg


aatctcttggccccaatcccaaattctctggagaaggggctctttggtttaacttggataatgttgtcttcagctgggggtg


ggcacatcgtgcatatgtggctgctgccggggaaccacgtggatgatgtgagaggagcagcacccagaagaggg


agtgctgggctgatggtccaggtcgtgtccacttctgattgtttaattcttcttctaagtggatggatctttctccaatactcag


caaatcctgatcgttccagaatacttcattatagccaattggttataatgtgcttctctaagagaaatatttagggacaaca


aatcttcatgggtttgaagacttgatggaggaaaaaggagtagattttcgaaggctggatttggatgaacaggggctat


tcagggagtgcattccaacctaaaattaggaaaaactggctgggcgcagtggctcacgcgctttgggaggccgagg


cgggcagatggcctgaggtcaggagttcaagaccagcctggccaacatggtgaaacccatctctactaaaagtaca


aaaattagccaggcatggtggcgggcacctgtcatcttagcgactcaggaggctgagacacgagaatcacttgaac


ctgggagacagagcttgcagtgagctgaaatcgtgccatggcactccagcctgggcgacagaacaagactctgtctt


aaaaaaaaaaaaagtggtttatatacagagtggaatattatttagccataaaaagaatgaaatcctgtcatttgcagc


aacatggatggaactggaggtaattaaaaaataaaattaaataaggaaaaacgtatcaatacttcgattaaccaaa


accagggcaaatctgattttcatctttgcaaggggaacaaatttcttttatctcctctggctttgaaaccctgaaatgaaag


gaggaagggcagaaaaaagaacacatagcaagttatcatcagtctcagcgcccatcgcattccctgagcttgtttcct


tgacttcatcactggcaggactattcaaaaatgattcgctcattcattcatatattcattcattcatcattccttcattcaacac


atacgttttaacactcatcttgcttttcaagctatagtttagtgagcgaaatggatacacacaatacagtgtgagaacagc


aagagggcacatctgagctagcctgggatgggtctggaaatgcttcctggagcagaggaaacggttgacagccaa


gtgttgacagagaagtagtattagccaggcagagacatggggaatgtattccaggcagaaggcacagtgtgtatga


aagcttattgttaagaagagtgtgtggcccaaccaggaaacagacattctaaaggcatagggtccacccaggagca


tggtggacccagatccctgaaagatgggaggtgctcaggcacacttcctgggctagttgaggagtctggatatttattta


tttatttatttatttatttatttatttattgagacagagtctcattctgtcacccaggctggagtgcagtggtgcaatctcagctca


ctgcaacctccacctcctgggttcaagtgattctcctacctcagcctcctgagtagctgggattacaggtgcccaccacc


atgcctggctaattttcgtgtgtgtatgtattttgttgttgttgttgttgttgttgttgttgttgttgagacggtgtctcgctcttttgccc


aggctggagtgcagtggcgccatctcagcttactgcaagctccgcctcccgggttcacaccattctcctgcctcagcct


cctgagtagctgggtctacaggcgcccaccaccacgcccagctaattttttgtgtttttagtagagacggggtttcaccat


gttggccctgctggtcttgaactcccgacttcaggtgatccacccatgtcggcctcccaaagtgctgggattacaggcat


gagccaccgtgcccaacctggatttttattctgaagactaatagggattctaaggaaggaaccagcctgattgaatttg


catatgtgtccacatctgctggctcacggctgtgtgggaggctgagtgatggggaggaaggattactgagtagggatc


tgaaggtgtggcctcatgctttctttctaaccagctgtgttgtctttgggatggtgcttaaatttgggctagaccagtgggtctt


ggtcaccccccaggggacatcttacaatgtctggaggcgttcttggttgacacagtggggtgagggctgctactggca


gctcgtggggagagaccagggatgctgcttaacatcctacagtacacagggcagcccccaccacaaggaattatc


agctgaaattgtgaacagtgtctacactagacccttgctactcatagtgtggtccgtagaccagcagcattggcatcac


ctgggaccttgttagaaatgctgttagaccccaccccacatccactaaagccagctcttcatttcaacaaactccccga


tgatgtgagtgcacattcaagtctgagaagggcttctttgaggtgagccttagtgcccatccccctttggtggccccggat


accaagggtgtgtgaaagggggggtagggaatatgggtctcacctgccaatctgcttataataacacttgtccacag


gggtgttgtaaccgagtgctggggattccccacagctccatcatgggctacaacttcagcttgctgggtctgcttggaga


gatcatctacattgtgctgctggtgcttgataccgtcggagccggcaatggcatgtgggtcactgggcttaccccccatc


cccttaacactcccctccaactcaggaagaaatgtgtgcagagtccttagctggggcgtgtgcactcggggccaggt


gctcagtaggcttcggtgaatatttgttggctgatttattcagaaattctgtccagcccctaccttggatggatttatcacctct


ccaggccacctcttctttccaaatagggccacctaggtatagaccaaagacacgaaatcttttgtgatcccacaaaca


cagagcaggtcaaataggcccaagccaattgagactgtggttcaggtcgtgatgcagagctttgctgtggacgtgctc


ccactgcgtactagctgggcatgtggcttaacctttctcagcctcagtcgccccattgtaaatggagataatgatactatc


tcccctcacaggactgttgggatgctactggatttaataagctaatgcagggacatgctaagcacaacccatccctga


ggcccagagagggggggccttggctgaggtctcactgcgaggtgggaatgtgggcctccagaccagaggtaggt


cctgtggcccctagacagtggacagcaatggtcagtttgacacaccagagccctagccattacttcctggatgttgtgt


gaatattttctggacatggcttatataaaatgaaaaagtgaattgggcacgatacagggatagatttttagagatgaact


ggtagcatgatgataatcatattcactgataacatttactactgttattgactgctttaaaagtgttgggcattgtgctagaa


accattatatgcattatctccttgaattctcacaaccgcctactgaggtattctcagactctaagaaatgagatttaagag


aagttatctgcccaaggtcactcggctggaacctggctgtaaaaatggctgaagcaggtgatgaggagctgatgcgtt


tggacgtgtctcagagaaatcatggaggcgctgcggttcctaccggttcttggatgccttctacagagacaaccatagc


cccaaattatagggatcacatatcagtgggtgagacatccttgcttgggatgaggaggggatgagctgtgtgaagca


aggcgcctctgtgatgggttccagtgatgtgtctgccactgtcttaataactgtgcaattctaagcagaacctttcctgtctc


tgggcctgagagttcccctctgaaagatgaggacttgacctagcaaggtcctactcacatgcctgtagagaacaggc


aggggaagttagaaaaaaaaaaaagccagtgaaggaagggagctcttcagcttgcacccatcatcacagtgcag


ggacccaggctcagtgttgccagatccaatgacttctcaagagctcaaaatctagagttttgcatgtgctctcccaagta


ctggcagaaaattcaagattgttagtaacactgtgtggctaaattctgcttgtgggctgcctagattcccaattctgtgattc


tgtggttctctggaagcattggttctccacagcacctgcatcacttggaaacttgttagaaatgcaagccctacctacgg


ccccaccccagacctacccagttagaaatctggggggggacctatcagtccatgtttgaacaagccccacaagtgtt


ctcttgcaagctcaagttttagaaccactgacctatagccaaaaaagaaaaagccaatcagtggttttctggtaaagg


attaacttaacaaactggctttccaagaaaataaagccttgattggtagcacttgcaatttctatggtacaaacgcttccc


gcatgactgagttcaagctgtcaaggagacatcactatacatggacttgggaagagatgagaacaatcagcccact


gagcctatgggaactggctccagcacatccctgcaagtcaactctcatcagggtgagtgagttgaggaccaagaag


cagttatcctcttgcctttgcaggacccaggcaaagggaagggcatagtgacagtgatgatctctcttccggaagtcttt


ggtttgctgagagtaaaaggcgtgggcttcaccagtggtgaagccagtcatgcagccttagtcctggtactgaaactct


ctaaatctcagttttctatctgtaaaatgggaaaataagacctatgtcacagggttgctgtgcagatttagcaacagaac


atagccccgttctttatgatgactgatgctgcatccgtatgaggacatctctatgtaatggaaagatggagagaggatta


agcgcaaagtcacaacacttaatgggaactgtggattagctacttggtggcattgggcaagtcagttgactttgcattaa


ttccacaaacaatatttcccaatttcctattcagatgagcatatgtgattgagtcagatgctgtgatcagaaccaggatgg


agcatttcccacaaactgtgggatttttaagtaatgggaaggcacactgaaatggcactgaatcatgcagttgcagata


ctctttttcaattctcagtcctttgattacgtcagggagaaaagaaagtccccacttggcctgagaatctctgcacccttct


agctcttgttaaccactcttttgaatagcagagaaaacctcagactgccatatctgggagagattttagcaacattttgtttt


cattgtatctctttttacagctacctcccatttcccttctatttcaagctagtaactcagttttttttaaattcaattatttaaatgta


aaaataagtctatttggagaaaaaaaattttaatagcatctctggaatgccagtatggctaaattcatgaatgttgtcctc


aaatgctgaaatctgggaagcatctggccaagctttgtggacaggcctgcctagtttgaatcccaagagccacccagt


ccaagccacaaaacattggaattcttggttcacttccctaacctgaacttgccctctgtgaaatagggacactaatagct


cactcacagggctgctgtgaggacatgtgttgagctgagggtctcgccaggggagaccctgtgcagggagactgtta


tcatggtgatggatttctgcttcattcatttctttttccagacagcatcatatagaatgagttgtggggggcagtcagcaggt


ttgggtttatcctctattctgccacttattacttaaaaaaaccccaaaaaacccaacttatatagtataagctatatccaga


aaagtgcaaatatcatacaagtaccatttgatgaatcttctgatatccccacataaccaacacccagaacctcttcttgt


ctcattccaggataaccactaacctgacttctaacagcatcagtcagttttgtctgtttttgtacattatatatgtgatggtttg


aatgtgtcccccaaatttcatgtgctggaaacttaatccttcaattcatatgttgatggtttttggaggaagggcctttggga


agtaattaggattagataaggtcatggggtgaggtatgatggcactggtgacttataagaagagaaagagaaatctg


agctggcatgctcttgccctctcactgtgtgatgacttctccatgtcatgatgcagcaagaaggccctcaccagatggtg


gcaccatgcttttggacttcccagcctctagaactgtgagctaaatcaatttattttctttataatcacccagtttgatattttgt


catagcaacagaatatggacaaagaaagaaaattaatgcaagaagtagagtttttactgtaacagattcctgaaaat


gtggaagtggctttggaactgggtgatgggaataggttggaagagttttgaggagcaggctagaaaaagcctgtattg


tcaagaatggagcattatgccaggcacggtgtctcaggcttataatcccagcactttgggaggccaaagcaggtgga


tcacctgaggtcaggagttcgagaccagcctagctaacatggtgaaacgctgtttctaccaaaaatacaaaaaatta


gctgggcgtggtggcgcacacctgtaatctcagctactcaggaggctgaagcaggagaatcacttgaacccaggag


gcagaggttgcagtgagctgagatcgtgctattgcactccagcttgggcaacaagagcaaaactccatctcaaaaa


aaaaaaaaaagaaagaaaaagaatggagcattaaagacagttctgcagttctggtgagggcttaaaggaagacc


ccagaactagggaaagtctggaacttcttaatggttactgaagtcgttgagatcagagtgctgatagaaatatggctgg


taaaggccattctgatgaggtctcagatagaactgaagaaccacgtgttggaaactggagcaaaggtcatcctttttat


aaagaagcaaagatcttagctgaactttttctgtgccagagtcatttatggaaggcagaaaatctgtaggtcagccatg


ttgtagggaatgaaagaacattttcagctgagaacactgagagtgtgacacaactaccgactgataagaaaactagt


acacataaattagccaggcgtggtggtgggcgcctgtattcccagctacctgggaggctgaggcaggagaatggca


tgaacccgggaggcagagcttgcagtgagccaagatcgcgccactgcactccagcctgggcgacagagcaaaa


ctccgtctcaaaaagaaaaaaaaaaggaagaaagaaaattagtacacatagaacaaagccagaggctgttcatc


aggacaagggagaaaaactccaaagccatttcagagatcttcaagactgcccctcccattactggcccagagctct


aagagggcagaatggtttggaatgaccagctgctgcccagggctgccttgggtctctgctccccacatttctggtgcag


cattcctcagccatcccagctgtggttcaggtggccacaggtgtgatgtggaaggtaaaagtcataaaccttggcagc


atacacatggcactaattttgcaggtgtgcagaatgcaaaagctgagggggcatgccttcttccacctacatttcaaag


ggtgctgtgaacagccaccccagagagcccctagtagagcagggtctagtggagctacaagggtggggccaccg


ccaagaccccagaatggtagagctatcatagtgcaatgccagcttgggagaactgcaggcatgagactccaacctg


tgcgaagtgcaacatgggcagaacccagcaaaaccacaggggcagagctccccgaagcttcgggggtccaaatt


ccatagtgtgtccaggaggtggcacacagagtaaaagatcattctgaaggtttaaggtttaatgttgttttctatgttgggtt


ttgtactttcctggaaccagttaccctttttcccttgcctctttttccttttagaatgggaatgtctgtcctatgcctgttccactgtt


gtattttggaagtcaataacttgttttgactttacaggcttacagccagagggaatctcccatagaatgaattgtaccttaa


gtctcacccacatctgatttagatgagaccatggactttggaattttgagttggtgctggaacaagttaagactttggggg


ttgtctaagtgtggtgtttcatgcctgtaatcccagtgatttgggaggttgaggtgggaggattgcttgagcccaggagct


caagaccagcctgggcaacatagtgagacctgtctctacaaaaaataaaaataaaaaaattagccaggtattgtgg


catatacctgtaattctagctactcaggaggctgaggtgagaggatcacttgagcccaggagtttgaggctgcagtga


gctatggtcgtgccactgcattccagccagggcaacagagtgagactctgtctctacaaataaaattaaataaactta


gctggatatggtggcacacatctgtagtcctagctactcaggaggctgagacaggaggattacttgagccaaggagtt


tgaggctgcagtgagctatgatcatgccactgcattccagcctggatgatagagcaaaatcccatctctaaaaaaaa


aaaaaaaaaaaaaaaaaaaaaaaaaaactttagtgctattggaatgaattttgcatgtaagaaggacatgcattttg


ggggctggggcaggatgctgtggtttgaatgcatccctcaaatttcatgtgttggaaacttaatctccaaattcatatgttg


atgaaattggaggtgaagcctttgggaggtaactaggattagataaagtcatcagggtggggcccctatgatgagact


ggtggcttacaagaggaagagagaactgagctgacatgctcttgccctcttgccatgtgataccctctgccatgtaatg


gcaggcacagcaagaaggtcctcaacagatgccagcagcatgttcttggacttcccagcctccagaaccatgagct


atatatacttattttacaaattacccattctgtggtattctgttatagcaatagaaaatgaactgagataatatacatggaat


catacagtaagtctgtgcttttgtatgcttcttttactcaacattgtagttgtgagattcatccaggttgttaagcattgctgtac


cctttttccactgggatatagtgttctgtcatgcttgggtcttaatttataaaggtgactgagtggcattttcttccagtattattg


gaaggaaagttttgttgttcacagttcccctgtaaacaagaggcagaacacgtcatgcagggccacacaaaactgta


tcatccagggaccaggcagcagaaagagagggggaactgggactatgcctttatgaaaaagagtggtgggagag


taactgggtgagggcatccactaatgggcaggaagtgaaaacacatatgttagaatttgtagctgaggggtttataata


tgagtttcctatgcctgagaaagctgacttgcaagaaaatgagataaacaactttggccattagtgtggccctgtcataa


atgaatgccagataggcaaatagagaatctaagaaaagatagttggaacaagtgttccattgtgtgaatgcagcaga


atttatttatccattattgaggaggatttgggtagtttccagtttggagctattatgaatattctagtattgctcctatgaacattc


tagcacttttatttttggagcacacgaatgcacttctgttgattatatgcctagaagtgaaattgttgaattatacagtattca


cacagtcagctttagtggctactgctaaacaattttctctagtagtttgcgccaatctaatcaccagtagtgtatagaagct


ccttttactccacattttgccaacacttggtgttttccttctttttgattagtcatttagcaatcaaacctattgtttacattttgatat


ctccaataactaactaaatggagcacttttaatatgctttttggacagttgaatatcttttcttgtgaaatgtctattcaagtta


gtttgcccattttctattgtggtgttctgtctttttcttattgatttgtaggaattccttacgtatcctggatatgaatcccactttgtg


cgttacctttttccttctttctttctttttgaaacagagtctccttctgtcacccaggctggaatgcagtggcgctatctcagccc


actacaacctctgcctcccagcttcaagcaattctcatacttcatcctcctgagtagcttagattacaggcgcatgccacc


atgcccagctaacttctgtatagacaaaataatttttggtagagacagggttttgccatgttggacaggctgatcttggact


cctggcctcaactttggcccaccttggcctcccaaagtgccaggattacaggtgtgagccaccatgcccagcccacct


tttactttcttaatggtgtcttttgaacaagagaggttcttaattttaatatagcccaatttatcattgttccctttatgtttagttcttt


tatgtcctttttaagaatttttgcagccagcgcggtggctcacacctgtaatcccagcactttgggaggctgaggctggcg


gatcacaaggtcaagagatcgagatcatcctggccaacatggtgaagccctgtgcctactaaaaatacaaaaaatt


agctgggcgttgtggctcttgcctgtagtctcagctactcgggaggctgagatcacgccactgcactccagcctggtga


cacagcaagactccatctcaaaaaaaaattttttttgcaaggtcatgcatatgtccccctgatttttttcctaaaaatcactt


attattagatcaatgaattgagtaattgactacatttttcagtcattcaacaaatatttccctgaggttttgataacctgaact


gtgtttggagctggggaggaagcaaactattgaagatatacaaagatggcaaagatgagggcctggagcttgccac


acggaaggggggatggctgcctgaatggttgggcaggtagttgttgacatctgcactccctacatgagcagcagggt


ggcaactctttttatctttttaatttatttttcttttctttctttctttttttttttttgagatggagtctcgctgtgttgcccaggctggagtg


cagtggcgtgatctcagctcactgcaaactccacctcccaggttcacgccgttctcctgcctcagcctcctgagtagctg


ggactacaggcgcctgccaccactcccggctaatgttttgtatttttagtagagaaggggtttcactgtgttagccaggat


ggtctccatctcctgacctcatgatctgcccgcctcggcctcccaaagtgtggggattacaggtgtgagccaccacacc


cggccttaatttatttttctagtctgcaggtaattctttttaattctctccactctcctatgatcttatgaggtagggactgtcatta


tttctcccactttataatgaacaatcagtaaagacagggaagataaccaaatgacatacaaggtggggtccacccca


tgaggctgcaggcttggagctttgctttgtcttaaaaatgagaacatgagctgcccacctgttgagacaagaaacagg


aaaggcttaaaaaactggcttgttatgtacaactatccgtggggctgcagtgaacgggctggcagtgcccaggtgca


ggctgaaccctgggacaatcacattcagcatccaagggcccccgtaatagcttaatgtttgaattgaacccctggggtt


gccttgaaggagagaggtcgtggaagtatgttcaaggggtagggatgggcaggggagatgggtctgaaagccaa


gctctaccccacccaccttgccccaagagaaatagaaccttcatctttaattgcctaacgagaaaactggggctggcc


agatgtggtggctcatgtctgtaatcccagcactttgggaggccgaggcgggcagatcacttgaggtcaggagttcga


gatcaccctggtcaacatggtgaaaccccgtctctattaataatacaaaaattatccaggtatggtggcgcatgcctgta


gtcccagctacttgaggcacaagaatcgcttgaacctgggggacagaggttgcagtgagccgaccactgcactcca


gtctggacgacagagtgagactccatctcacaaacaaaaacagaaaaaaaaaaaaaaaaaagagagagaga


gaaaactggaggctctgagaggttgagggacttgcccagggtcttgcagctagtaagtgacagagctgggacttga


gcttgggttttctgactcctggtctggttcattatccatgaggtgctgggaactaaaataagccacaatcttggaatctccg


tcgcctccctccctcccacatgtctgcgtggctttttgggaaaatgccaggggaatgtaccagccagggagaggaccc


ttgttttcctcatggcccttcctggcaatggcactactgacaccgacagtcctttttgtccctgatgacctctgctgcctgatg


cccaagtgaccacctctgctttgtcatttctaggattggcttccaggtcctcctcagcattggggaactcagcttggccatc


gtgatagctctcatgtctggtctcctgacaggtcagtgtgaggccacctttcttccaccattgccaggacacagcaccca


cgtccagagcgcaccctgccgtgtggctggatgtctatgtgccccatctccttccctgaggatcacataatttcagaattg


gaaaggttcttagaggtcacctgctgctaatgtggactgtgaggccagggcagggaagggacatccctgaggttata


agtagggtgagtggcaacgttgcagacttttgaacccagggctggtgatcacactcagttttgcacagaagcccgag


aaaatccttacacccaaaagcctaccttttatttctgaggacacccataatactattttattcaacagatatttattcaatatc


cactatgagccaggcactggggacacagcagtgagcaaaacaaattccctgaccccatggaattgaccttctagtg


ggggaaggtattagcaataaatagacaaataagtgtctactacgccagatgggaagaagtggctgtgaagacaga


gcaaactagagaaacatagagtcaatgtgggatggggtgttcttttaggggggggtcagggaaagcttatctgagta


gttagcttttaagcagagaccccaatgaagaggagggagatatgcgatgcatttagttaggggaagaacattccatg


aaaataggatagcaagtgcaaaggccctgagacagcagcatgctttgtgtgttgagggaacagtaaggagaccag


tgtggttggtgtgaatggagtgagaaggagcagcaggggttgagggcagaatggtagtgaggagcaggcccttata


aaagatgggaagccactggagatctttcaacaaaggggaaaagtatgtttctgttcttgcaataaaatagaacagca


aaaaatctaggggagttgctaattagccagttttacttatatgccaggtgaaaatatgtggctaggtgcagtggctcata


cctgtaattgcagcagtttgggagaccgaagtgggcagatcatctgagatcaggattcaagaccagcatggccaac


atggtgaaaccccatctctactaaaaattaaaaaataagccaggcgtggtgttggatcccagctacttgggaggctga


ggcagtagaattgcttgaacccgggaggcagaggttgcagtgagccgagactctgtctaaaaaaaaagaaaaaa


agaaaatacacattcaggccaggtgcagtggctcacgcctgtaatcccagcactttgggaggctgagacaggtaga


tcacttgaggtcaggagttcgagaccagcctgaccaacatggcaaaaccctgtctctaccagaaatacaaaaattag


ccaggcgtggtggcgtgtgcctgtagtcccagctactggggaggctgaagtaggggaatggcttgaccccaggagg


tggaggttatagtgagtcgaggttgcaccactgccctccagcctaggtgacagagtgagactgtctcaaaaaaaaaa


gaaagaaaatatacattccatccagaactgttcacctttattctacaagcaaacatcttttattggttagacacccatatat


gtgtccctaagcaggaggtgaatgccaaataagagacaaatggcgtaagacactatgagttgtgtgacgttgggcat


gtcactttactccctctgagccttggttagcttctctgtaaaatgaaaggattatggtaactaagctggcttccttccagcttt


aacaaactgtatggaggtactttttggagttacctgggtaatttttgagtgtgagattggctagaattgctttaatataccatg


tctggccttagctttttgcagagtctttgtgaagaagcagaggcggagtagcgttaattccgtaagttaacgttcagttcgt


ggcagctggcaatccaaccctgggaaaggctgccggatttagcaaaaatgcaaggtgtctgtttttaaatttgaaatga


attgggtatcctgcattttatttggcaaccctgtcctgggactcacactattcactgttatcactggtatgttcaaagtggtgct


gacttgccctctgtcttgcaaagtaccaggaggtcttttcttattcttcactggagtcaaaaaagagaatagaggaaaag


acaatcatattgttcctttaagagttaagaccaacaagttttcttctttacatgttgtttttgacatgagcaaactggtgattaa


aaacaacttgggtggctcatacttgtaatcccagcaccttgggaagctgaggtgggagaatagcttgaggccaggag


ttcaagccagggcaacatagtgagaccccatctctacaaaagatacaaaaattagccaggcgtggtggtacacctgt


agtcccagctgctctggaggctgagatgggaggatcagttgagcttgggaggcagaagttgcagtgagctgagatc


atgccactgcactccagcctggacaacagagcaagaccctgtctcaaaaaaggaaacaaaacaacttggacaat


ggaagggggaaaaagttcctcaagcagccaaaattgcaccaaatggactcccagaagacaagcatttaatttgtta


attgagccctctatgggcctgtctgtatttatttaagaaacaatcctatcaagcatagttattgggtttctcagcccaggtag


attagaaatagcagattagaggtgggctaggtttctagaggtaaagtacaccagcagaagttagaagtgaaagcaa


agagcctaacagaggaagagaaattcttttttttttctttttttagacgcagttttgctcttgttgcccaggctggagtgcaatg


gcgctatctcggctcactacaacctcagcctcctgggttcaagtgattctcctgcctcagcctcccgagtagctgggatt


acaggcatgcaccaccacacccggctaattttgtatttttagtagagacagggtttctccatgttggtcatgctggtctcga


actcctgacctcaggtgatccgcccaccttggcctcccaaagtgctgggattacagggataagccactgcgaccggc


cgacaaattcttaaaactggacacaagaacacaaaacgcttgggctgctgagagattagaacaacaaccctccac


agctacacaccttttccacgttatatggcacgttataagtgggtgttcctagtgatggttctgattttttttaaaaaaagtctaa


atatgtttaatgttgtctcagaagacaaaatatattttagacagatattcctcagtgatgagtaagcctcagctatctggaa


aattcatgcaggcgccagagatcgttactgagtaattcaagctaactgcgtcatgctggttgtaccctgcatgccaatat


cagctaaaagcagcaccacgaaagggaaatacgaatctcactaagcactcgcccattcttgttaacgacactggaa


ctgatcatccttaataatacacagataaatctatcaggagcatttccttgcttcctgtgaaaggaagcactcattccatgtg


tcctgtgaaattcatccaacttcaggaagctggaggaatacatatggccaagctatctgggcagagagtagacaggg


aatggaggttgggcacagtggctcacacctgtaatcgcagccatttagaaggcaaaggcgggcagatcacttgagc


tcaggtgttcaagaccagcctgggcaacatggctaagtcctgtctctgcaaaaaataccaaaaactgagctggatat


ggtagcacacacctgtggtcccagctacttgggaggctgaggtgggagggttgcttgaccccgggagtttgaggctg


caatgagctgtgattgtgccactgcactccagcctggataacagaatgagactctgtcccaaaaaaaaaaataaaa


tcaaagacacttaaaaagatggggaaaaggaaggacaggcacttaagcaagttataagctactttcctaactacac


aagtggaatcttaagctgaggttcccaggagttgactggagccagagaagacagacctataggagcacccaattgg


agtcaccctccatagtagcccatatgtcttacatggatcagctttcgtggggcccttttactccatctggggaagggcgtc


agatctgtggctctcatgtactgctcagtacactgccattcccagttctttttttcaaaaaaaaaaaaaaaatgtctacag


aatcggccaggtgtggtggctcatgcctgtaatactagcactttggaaggctgaggtgggtggatcacctgaggtcgg


gagttcgagaccagcctggccaacatggtgaaactccatctctactaaaaaaaaaaaaaaaaaaaaaattagctg


gatgtggtggcaggcgcctataatctcagctacttgggaggctgaggcaggataatcgcttgaacctgggaggcaga


ggctgcagtgagccgagatcacgccattgtactccagcctgggcgatagagtgagactctgtctcaaaataaataaa


ataaaataaaataaaataaaataaaataggctacagaattaagctggtccaggaatgacagggcttccatttatttgtc


tttcaattgtgggagaaaaaggatttctgttgagatactgtcgttttgacacacaatatttcgattaatcttgagattaaaaat


cctgtgctccaaatcttttaacattaaattatgcatttaaacaggtttgctcctaaatcttaaaatatggaaagcacctcatg


aggctaaatattttgatgaccaagttttctggaaggtaagatttttcacctattaacgtgatagattttgagtgcatgaactta


aaaacatacctgagtatatatgttgacttgctgtttatgagtaaaacaaaaacaaaaatggagtaaggagcattgcag


gaggaactagaggagaaacaaatccatgatatgcatgtgtgtgggggagggtggcggggaggtggtaaaggtca


ccatttccctgatacctcaaattcattcagagtcagggatgagacagctttcactggccacacttcccctccccctatctg


cagtcctcagcgtagccaaatagtctgacatggggtgacagaaccccacaatgcaaaagctggaagaaacctca


agccttggagtccaaccccttttttgacagatgctaagagtggagacatgacttatcaagatcttacaactggctgggc


acggtggctcacgcctgtgatcccagcactttgggaggctgaggtggggcgatcacctgaggccaggagttcgaga


ccagcctggccaacgtgtcgaaaccccatctctactaaaaatacaaaagttagctgggtgtggtggcacatgcctgta


atcccagttactcaggaggctgaggcaggagaatcacttgaacctgggaagcgaagtttgcagtgatctgagatcat


gccactgcactccagcctgggtgacagagcgagactttgcctcaaaaacaaaacaaaacaattgtacatatttaaag


tgttgtaaccaagtgagttacagagaaacaccacactttgagcctaattcaggagtcctttattagccggcgacctaga


gacgactagtgctcaaaattctctcggccccaaagaaggggctagattttcttttataccttggtttagaaaggggagcg


ggaattgagctgaagcaatcttacagaagtaaaacaggcaaaaaagttaaaaagacaaatggttacaggaaaac


aaacagttccaggtgcaggagctttaaagccatcacaaggtgacaggtgcgggggctctgggtgctatctgccgga


cacaaacgcaggggcactagagtactatcacccgggcaaattcctgggaactgcggacacagcttgccacagtac


cttatcagctaattgcactctttgatgtgctgggagtcagcttgcacaagttaagtccttgaggaaggggggggtaagg


agcccttaacgtcttgcaaatgaaggagccgaatggaatccctccggctttcttagctaagagagagtcaatcaagtt


aatacaagttagggtatcacaaaagtatataatttgatacattttaacgtatttatacactgaagagaccatcaccaccat


caagacaaggagcacacccatcacttccacacacttcctcctgctcctttgaaattcctccctccctacccacctggtcc


cacccaaaggcaaccactgaactactttctgtcactaaggtttgcattttctgtaatttttttgtttgagacagggtctcactc


cgccacccacaccgtaatgcagtggcaccatcatggctcactgtagcctcaacctccccaggctcaggagatcctcc


cccctcagcctcctgagtagctaggaccacaggtgtaggccaccatggcaggctaatttttgtatttttttgtagagatgg


ggtttcaccgtattacctaggctggtctcgaactcatgggttcaagcaatcctcctgccttggcctctcaaagtgctggga


ttataggcatgagccactgtgcccagccctctgtaatgttacacaaagggaatcatgcagcacgtactgcccttggtct


ggcttcttttgctcagcatgattattctgagaatcatccgtgttgttgcgtgtaactgacttcatcagcttctctctgcagctgtc


agctcttggcttctcccaacagccaatctctctttatcccctgcaagtgttcttgcctatttagcagaatcaaggtactctatc


gaaaagactcggaaaattggtttaatctattcattcattcctcaggtatttatcgaataactattctataccaagtactatgct


aatcaaccaaggacagcacaaacaggagaaatctccagctcagtcacttgagttgcaataaatatttgctggatagg


tcaggtgcagtggctcacacttgtaatcccagcactttggggattactgagacgggaggatctcttgagcccaggagg


ccaaggctgcagagaaccatgatcatgccactgcactccagcctgggtgacagagtgagatcctgtctctgaaaaa


aaatatttgctggataaattaaggaaatctgacgaaccccatcagtagccattgcagcaacaggtaaactagaacga


gtgtgaatttggaatgaggaaacccgatgttggccatcattctgtaatgtcatgtattatgtaatgtattatatattaatgtat


gtattatgtaggcaagttccttgacctctctcactggtaacataagagtagtaatctttgtgctacttcactgggttattttaaa


gatcaagtgaggtaataatgtctgtaacaacattctgtaaaatgcaaaccgccacatgaatgtgaaagtttattactag


ggatttagccaaccacaagggaatgtgtgagcataagagctatcatattgcaagcctacagtttctgattttgtgctaggt


gcttttccacattacctgattttatcctcacaacagccctgcataaaagtaagtatgtcgcccaggtgcggtggctcatgc


ctataatcccagcactttgggagcccgaggtgggcaaatcacttgagatcaggagtttgaaaccagcctggtcaacgt


ggtgcaaccctgtctctactaaaaatacaaaaaaaaattagacaggcgtggtggtggatgcctgtaatcccagctact


tgggaagctgaggcaggagaatggcttgagcccgggagatggagattgcagtgagatgagattgcgccactgcac


tccagcctgggtgacagagcaaggctatgtctcaaaagagaaaaaaaaagtaagtatctcagtcttgaagatgatg


aaatggaggcctagagagattaagtaacttgcccaaaatgacagaactaatgcatagaaaagaagaaatgtgatgt


cttttggctccaaagacaccccacatatgcgttggttacagttactagagaaaagttattccacccccaccccaccccc


agaaatcttctgacttgttttctcgcagttgagtaggaccatttattcggcagtgtaccattctcagcttgcagttgaaagcc


aaatatccattaaagaggcaaggatgcaaacttgctaagctgataaatccaggggtgattttttttttttttgcaaaccatc


caacaagacattttaaatactcattgaatttcatagaactgactgccaggattggaaagacattaaagccagctcagc


cactgcctcgctggttggccagaccacgcctggcacttctgggagggagcactcaccaccccccaagggcacccat


ctcatcctccgaaggtttatgaaaatgcactcatcatttgctaattcattccactacgtgtattacctaatttgtgacacgatg


tgaagtaccagagagataattctaaataaaatatagttatgggtctcaaggagccagatatgctaatctcctatcctcct


gcagtttacagtggtcctcaccagatacttatttacaaaaattcagtttattatttatttttttgagacagagtcttgctctatag


ctcaggctagagtgtaatggtgtgatctcggctcacttcaacctctgcctcccaggttcaagtgattctcctgcctcaacct


cccaagtagctgggactacaggcacctgccaccacggctaatttttggagttttagtagagacagggtttcaccacgtt


ggccaggctggcctcgaactcctgacctcaggtgatctgcccacatcagcctcccaaaatgttgggattacaggcgtg


agccaccatgcccggccaaaacttcagtttataacacaatctttcacgtgtcttctgctttcattaaaagaatagacagtt


cccttctttatttcagtttaataaaccatggattttatttcatgctttgcaaaacacaagggctcactgacatgcacttcttaaa


ctaattctggctggtcgcctgtaattccagcactttgggaggctgaggccgacagatcacttcaagtcaggagttcaag


accagcctggccaatatggtgaaaccacgtctctaccaaaaatataaaaaattagccaggtgtggtggtgcgtgact


ataatcccagctactcaggggcctgaggcagaaaaatcacttgaacccgggaggcggaggttacagtgagctgag


atcgcgccactgcactccagcctgggcgacagagtgagactctgtctcaaaaaataaataaatacaaataatgtaa


aatacgaaacaagcaatcctggcagtagctgctggaatgagaggagggagaggtcatagggaggtcggggaca


atggagcatggagttgtgttggatttggctaagcagcaggaagtgcaaggcattccaagcaagaggaggggggca


ggtggggagcatctgcaagaacagaagcagcatgagcaacctggctcggcagtgtgtgaaaaggctgaaaggtg


gctagagccacttcaatttcatccttcaggcaaatgggaaattcccaaaggtttgagtggggaagcaatgcctacaat


gaaagtttgagagtgaagcagagtgatcgaattaagcatgtaggccgagttctgaaataactgcaatgtgctgaagat


catccattggcttctgaatgagtatttgcagtttattttttaaaatgattttattgccaagaaagataaacactactgttttggta


caaaaacataacaaaatgtgttgagtccctcttgctgttttacgcgaagttttaaaaatctactcttgtcacagtggtatca


cccctacttctgatttcaaataaatgttctagagacacagtaagggcccaacaaacgcttgttcaacaacacaaggag


agccagcttttaaagtaggaaaacaggccgggcgccgtggctcacacctgtaatcccaacactttgggaggctgag


gtgggcagatcacttgaggtcaggagttcaagaacagcttggccaacatggtgaaaccctgtctctactaaaaacac


aaacattagccaggcgtggtggtgcacaccagtagtcccagctattcaggaggctgaggcaggaaaatggcttgaa


ctggggaggcagtggttgcagtgagccgagatcgtgccactgcactccagcctgggggacagagggagactccat


ctcaaaataaaacaaaacaaaaccaaatcatacaaaaacattagctgggtgtggtggtgcatacctgtaatcccag


ctacttgggaagctgaggcagaattacttgaacccctggggggaggttgcagtgagctgagatcttgccactacactc


cagcctgggcaacagagtgaggagactctgtctcaaaaaatatatatattaaaaaaaagaaaaaaaaaagtaaac


taggaaaacacatcagcagcctgccaacagactcccctagcctcggtgagggccagtgttctgggaggcagatctg


aattctagtcctagttcacccactggcaggctggtgcccttgggcaggtcgcttctctggggctcagtttcttcctctataaa


atgagatcaaatcccatgttctaagagtttgtgctctggagtcagacagatctgggttctaccactgccagctctgtgatct


tgtagcttcagtctcgtcatctgacatggagataacagtaactgtctcactgtgttgttagggtttaaaggagataatgtat


gtgaaatgttagcaaacaagtgttagctaccctgatttccggtttcagagttctgtggtcccagtttatgccacatgcagtg


acgttgtatggtaggctgtggtgtggcaccacttcagaactcagcgcatgcacagcttgcagaagagaaggccaga


ggagacctaagaaggctcttcgaacacttgaaagaccggcatgtaggccgggcgcagtgactcacgcctgtaatcc


cagcagttttggaggtcgaggcgggtggatcacctgagtttgggagtttgataccagcctgaccaacaaggtgaaac


cccgtctctactaaaaaatacaaacattagctgggcatggtggcgggtgcctgtaatcccagctactccggtggttgag


gcagaattgcttgaacccgggaggcagaggttgcagtgagctgagattgcatcactgcactccagcctgagacaag


agcgaaactccatctcaaacaaaacaaacaaccaaccaaacaaaaccaaaaaaaaaactggcatgtagaaga


aaaatactttttctctacacttctccaaagaatttaactaggcccaggggaggtgcagtataaatttctaacaatctcaac


tgtctgccaaatggaatgagctacttcatatggcagtagtgagtcctctgtctttggaggcattcaaataaaagccagat


ggccatttatcaacaatccatgtaaaacgttagatgaaataaaacctatatatccaagatctcttccaattcagattttatg


aaagaatttctaaggtctttgtaatgagacatttaggctgtttcaagagatcaagccaaaatcagtatgtgggttcatctg


caataaaaatgtttgttttgcttttacagtttcctcatttggctgttggattttaagcaaaagcatccaagaaaaacaaggcc


tgttcaaaaacaagacaacttcctctcactgttgcctgcatttgtacgtgagaaacgctcatgacagcaaagtctccaat


gttcgcgcaggcactggagtcagagaaaatggagttgaatcctttctctgccactctttgaggagaatctcaccatttatt


atgcactgtagaatacaacaataaaatacagccatgtaccacataacaacatcttggtaaacaacagactgcatata


tgatggtggtcatccagtaagctaaggttaatttattattattccttgtttttttttttttttttttttttttgagatgtagtcttactctgtca


cccaggctagagtgcaatggcaccatcttggctcactgcaacctctacctcctgggttcaagcaaatctcctgcctcag


cctccaaagtagctgggattacaggcacccaccacatctggctaattttttgtatttttagtaaagatggggtttcaccatg


ttggccaggctgatctcaaactcctgacctcaagtgatctgcccgcctcggcctcccaaagtgctggaaccacaggcc


tgagccactgtgcccagccttgtttgcttttttaacagataacagtgtgctcatagaaactgctttgacatgactgcaatcat


gtgcttcatagaaacttaattagattataccactagagtcttcagatttttatactttttttttttgaaacggagtctcactctgtc


accaggctggagtgcagtgccgcaatctcggctcactgcaacctccgcctcccaggttcaagcaattctcctgcctca


gcctcccgagtagctggaattacaagtgcgcactaccacacccagctaatttttgcatttttacttgacagggtttcacca


tgttggctaggatagtttcaccaggatctcttggcctcatgatcagcctgcctcggcctcccaaagtgctgggattacag


gtgtgagccaccgtgcccagcctatacttccctttttgaataccatttggtgttttgaagaattaacagctttgtgaacgtgg


cagtgcttgtgattcaggcttccattgagaccaaggggagaacctggttgcaggacaaacagacggacagcgtgtg


gcagtgtttaaatgctcttctgaaggctgatacgacagctctctgtgcactgattgcatatgcatcccaagattatattattg


ttttctactgctatgtgtcacactttgccaaacaggatgtggaaaatgaataagcggttttcttaggcacttcttaacagac


aattggtcaaaatgaactccattgcttaagaaacacataaacaccatttagtcactgaacatagctatatgtatggttgtt


actatgggaaatcttgttttgccaattttctttgaaaattctggcagaccaaggttctttttgtttacataatacttgaaaaata


aaaatgaacaagctaacaaacta.





Claims
  • 1. A method of genetically modifying one or more genes associated with blood type in a cell, the method comprising introducing into the cell a site-directed nuclease or a nucleotide sequence encoding a site-directed nuclease, wherein the one or more genes associated with blood type are selected from the group consisting of ABO, FUT1, and RHD.
  • 2. The method of claim 1, wherein the site-directed nuclease is selected from the group consisting of a zinc finger nuclease (ZFN), a transcription activator-like effector nuclease (TALEN), a meganuclease, a CRISPR-associated transposase, and a CRISPR/Cas nuclease.
  • 3. The method of claim 2, wherein the site-directed nuclease is a CRISPR/Cas nuclease selected from the group consisting of Cas3, Cas4, Cas5, Cas8a, Cas8b, Cas8c, Cas9, Cas10, Cas12, Cas12a (Cpf1), Cas12b (C2c1), Cas12c (C2c3), Cas12d (CasY), Cas12e (CasX), Cas12f (C2c10), Cas12g, Cas12h, Cas12i, Cas12k (C2c5), Cas13, Cas13a (C2c2), Cas13b, Cas13c, Cas13d, C2c4, C2c8, C2c9, Cmr5, Cse1, Cse2, Csf1, Csm2, Csn2, Csx10, Csx11, Csy1, Csy2, Csy3, and Mad7.
  • 4. The method of claim 3, wherein the method further comprises introducing to the cell a guide RNA (gRNA) targeting the ABO, FUT1, or RHD locus.
  • 5. The method of claim 4, wherein the gRNA comprises a CRISPR RNA (crRNA) and optionally a transactivating CRISPR RNA (tracrRNA).
  • 6. The method of claim 5, wherein the gRNA comprises a crRNA and a tracrRNA as two separate molecules.
  • 7. The method of claim 5, wherein the gRNA comprises a crRNA and a tracrRNA as a single guide RNA (sgRNA).
  • 8. The method of claim 7, wherein the sgRNA comprises a complementary region, a crRNA repeat region, a tetraloop, and a tracrRNA.
  • 9. The method of claim 8, wherein the crRNA repeat region comprises, consists of, or consists essentially of a nucleotide sequence set forth in SEQ ID NO:5, SEQ ID NO:9, SEQ ID NO:13, or SEQ ID NO:18.
  • 10. The method of claim 8 or 9, wherein the tetraloop comprises, consists of, or consists essentially of a nucleotide sequence set forth in SEQ ID NO:6 or SEQ ID NO:17.
  • 11. The method of any one of claims 8-10, wherein the tracrRNA comprises, consists of, or consists essentially of a nucleotide sequence set forth in SEQ ID NO:7, SEQ ID NO:11, SEQ ID NO:15, or SEQ ID NO:16.
  • 12. The method of any one of claims 5-11, wherein the crRNA comprises a complementary region specific to a region of the ABO locus.
  • 13. The method of claim 12, wherein the region of the ABO locus is a coding sequence (CDS), an exon, an intron, a sequence spanning a portion of an exon and a portion of an adjacent intron, or a regulatory region.
  • 14. The method of claim 12 or 13, wherein the complementary region comprises, consists of, or consists essentially of a nucleotide sequence complementary to a nucleotide sequence set forth in any of SEQ ID NOs: 20-203.
  • 15. The method of any one of claims 5-11, wherein the crRNA comprises a complementary region specific to a region of the FUT1 locus.
  • 16. The method of claim 15, wherein the region of the FUT1 locus is a CDS, an exon, an intron, a sequence spanning a portion of an exon and a portion of an adjacent intron, or a regulatory region.
  • 17. The method of claim 15 or 16, wherein the complementary region comprises, consists of, or consists essentially of a nucleotide sequence complementary to a nucleotide sequence set forth in any of SEQ ID NOs: 204-420.
  • 18. The method of any one of claims 5-11, wherein the crRNA comprises a complementary region specific to a region of the RHD locus.
  • 19. The method of claim 18, wherein the region of the RHD locus is a CDS, an exon, an intron, a sequence spanning a portion of an exon and a portion of an adjacent intron, or a regulatory region.
  • 20. The method of claim 18 or 19, wherein the complementary region comprises, consists of, or consists essentially of a nucleotide sequence complementary to a nucleotide sequence set forth in any of SEQ ID NOs: 421-580.
  • 21. The method of any of claims 1-20, wherein the genetic modification is through non-homologous end-joining (NHEJ).
  • 22. The method of any of claims 1-20, wherein the genetic modification is through homology-directed repair (HDR).
  • 23. A guide RNA (gRNA) for use in genetically modifying one or more genes associated with blood type in a cell, wherein the one or more genes associated with blood type is selected from the group consisting of ABO, FUT1, and RHD.
  • 24. The gRNA of claim 23, wherein the genetic modification is through use of a site-directed nuclease selected from the group consisting of a zinc finger nuclease (ZFN), a transcription activator-like effector nuclease (TALEN), a meganuclease, and a CRISPR-associated transposase, and a CRISPR/Cas nuclease.
  • 25. The gRNA of claim 24, wherein the site-directed nuclease is a CRISPR/Cas nuclease selected from the group consisting of Cas3, Cas4, Cas5, Cas8a, Cas8b, Cas8c, Cas9, Cas10, Cas12, Cas12a (Cpf1), Cas12b (C2c1), Cas12c (C2c3), Cas12d (CasY), Cas12e (CasX), Cas12f (C2c10), Cas12g, Cas12h, Cas12i, Cas12k (C2c5), Cas13, Cas13a (C2c2), Cas13b, Cas13c, Cas13d, C2c4, C2c8, C2c9, Cmr5, Cse1, Cse2, Csf1, Csm2, Csn2, Csx10, Csx11, Csy1, Csy2, Csy3, and Mad7.
  • 26. The gRNA of claim 25, wherein the gRNA comprises a crRNA and optionally a tracrRNA.
  • 27. The gRNA of claim 26, wherein the gRNA comprises a crRNA and a tracrRNA as two separate molecules.
  • 28. The gRNA of claim 26, wherein the gRNA comprises a crRNA and a tracrRNA as a single guide RNA (sgRNA).
  • 29. The gRNA of claim 28, wherein the sgRNA comprises a complementary region, a crRNA repeat region, a tetraloop, and a tracrRNA.
  • 30. The gRNA of claim 29, wherein the crRNA repeat region comprises, consists of, or consists essentially of a nucleotide sequence set forth in SEQ ID NO:5, SEQ ID NO:9, SEQ ID NO:13, or SEQ ID NO:18.
  • 31. The gRNA of claim 29 or 30, wherein the tetraloop comprises, consists of, or consists essentially of a nucleotide sequence set forth in SEQ ID NO:6 or SEQ ID NO:17.
  • 32. The gRNA of any one of claims 29-31, wherein the tracrRNA comprises, consists of, or consists essentially of a nucleotide sequence set forth in SEQ ID NO:7, SEQ ID NO:11, SEQ ID NO:15, or SEQ ID NO:16.
  • 33. The gRNA of any one of claims 26-32, wherein the crRNA comprises a complementary region specific to a region of the ABO locus.
  • 34. The gRNA of claim 33, wherein the region of the ABO locus is a coding sequence (CDS), an exon, an intron, a sequence spanning a portion of an exon and a portion of an adjacent intron, or a regulatory region.
  • 35. The gRNA of claim 33 or 34, wherein the complementary region comprises, consists of, or consists essentially of a nucleotide sequence complementary to a nucleotide sequence set forth in any of SEQ ID NOs: 20-203.
  • 36. The gRNA of any one of claims 26-32, wherein the crRNA comprises a complementary region specific to a region of the FUT1 locus.
  • 37. The gRNA of claim 36, wherein the region of the FUT1 locus is a CDS, an exon, an intron, a sequence spanning a portion of an exon and a portion of an adjacent intron, or a regulatory region.
  • 38. The gRNA of claim 36 or 37, wherein the complementary region comprises, consists of, or consists essentially of a nucleotide sequence complementary to a nucleotide sequence set forth in any of SEQ ID NOs: 204-420.
  • 39. The gRNA of any one of claims 26-32, wherein the crRNA comprises a complementary region specific to a region of the RHD locus.
  • 40. The gRNA of claim 39, wherein the region of the RHD locus is a CDS, an exon, an intron, a sequence spanning a portion of an exon and a portion of an adjacent intron, or a regulatory region.
  • 41. The gRNA of claim 39 or 40, wherein the complementary region comprises, consists of, or consists essentially of a nucleotide sequence complementary to a nucleotide sequence set forth in any of SEQ ID NOs: 421-580.
  • 42. A composition comprising the gRNA of any one of claims 23-41.
  • 43. The composition of claim 42, further comprising a site-directed nuclease or a nucleotide sequence encoding a site-directed nuclease protein.
  • 44. The composition of claim 43, wherein the site-directed nuclease is a CRISPR/Cas nuclease selected from the group consisting of Cas3, Cas4, Cas5, Cas8a, Cas8b, Cas8c, Cas9, Cas10, Cas12, Cas12a (Cpf1), Cas12b (C2c1), Cas12c (C2c3), Cas12d (CasY), Cas12e (CasX), Cas12f (C2c10), Cas12g, Cas12h, Cas12i, Cas12k (C2c5), Cas13, Cas13a (C2c2), Cas13b, Cas13c, Cas13d, C2c4, C2c8, C2c9, Cmr5, Cse1, Cse2, Csf1, Csm2, Csn2, Csx10, Csx11, Csy1, Csy2, Csy3, and Mad7.
  • 45. The composition from any one of claims 42-44, wherein the composition is formulated for delivery into a cell.
  • 46. A cell comprising the gRNA of any one of claims 23-41.
  • 47. The cell of claim 46, further comprising a site-directed nuclease or a nucleotide sequence encoding a site-directed nuclease protein.
  • 48. The composition of claim 47, wherein the site-directed nuclease is a CRISPR/Cas nuclease selected from the group consisting of Cas3, Cas4, Cas5, Cas8a, Cas8b, Cas8c, Cas9, Cas10, Cas12, Cas12a (Cpf1), Cas12b (C2c1), Cas12c (C2c3), Cas12d (CasY), Cas12e (CasX), Cas12f (C2c10), Cas12g, Cas12h, Cas12i, Cas12k (C2c5), Cas13, Cas13a (C2c2), Cas13b, Cas13c, Cas13d, C2c4, C2c8, C2c9, Cmr5, Cse1, Cse2, Csf1, Csm2, Csn2, Csx10, Csx11, Csy1, Csy2, Csy3, and Mad7.
  • 49. A method of identifying a new genomic locus for genetically modifying one or more genes associated with blood type in a cell, the method comprising (a) locating a genomic locus based on a known gRNA; and (b) scanning a region of about 500 to 4000 bp on either side of the genomic locus for a PAM sequence, wherein the one or more genes associated with blood type is selected from the group consisting of ABO, FUT1, and RHD.
  • 50. The method of claim 49, wherein the known gRNA targets the ABO, FUT1, or RHD locus.
  • 51. The method of claim 50, wherein the gRNA comprises a complementary region comprising, consisting of, or consisting essentially of a nucleotide sequence complementary to a nucleotide sequence set forth in any of SEQ ID NOs: 20-580.
  • 52. A cell having one or more genes associated with blood type genetically modified according to the method of any one of claims 1-22.
  • 53. The cell of claim 52, wherein the cell is an autologous cell.
  • 54. The cell of claim 52, wherein the cell is an allogeneic cell.
  • 55. The cell of any one of claims 52-54, wherein the cell is a pluripotent stem cell, an embryonic stem cell (ESC), or an induced pluripotent stem cell (iPSC).
  • 56. The cell of any one of claims 52-54, wherein the cell is differentiated from a pluripotent stem cell, an ESC, or an iPSC.
  • 57. The cell of any one of claims 52-54, wherein the cell is a primary cell.
  • 58. The cell of claim 56 or 57, wherein the cell is a mesenchymal stem cell or a hematopoietic stem cell.
  • 59. The cell of claim 56 or 57, wherein the cell is a blood cell.
  • 60. The cell of claim 59, wherein the blood cell is a red blood cell, a platelet cell, a mast cell, a basophil, an eosinophil, a neutrophil, a monocyte, a natural killer (NK) cell, a natural killer T (NKT) cell, a macrophage, a T cell, a B cell, or a plasma cell.
  • 61. The cell of claim 60, wherein the cell is a T cell, an NK cell, or an NKT cell.
  • 62. The cell of claim 56 or 57, wherein the cell is a cardiomyocyte.
  • 63. The cell of claim 56 or 57, wherein the cell is a retinal pigment epithelial cell (RPE).
  • 64. The cell of claim 56 or 57, wherein the cell is an endothelial cell.
  • 65. The cell of claim 56 or 57, wherein the cell is a β islet cell.
  • 66. The cell of claim 56 or 57, wherein the cell is a glial progenitor cell (GPC).
  • 67. The cell of any one of claims 52-66, wherein the cell is modified to have reduced expression of one or more MHC I molecules and/or one or more MHC II molecules, optionally, wherein the one or more MHC I molecules are selected from the group consisting of HLA-A, HLA-B, HLA-C, and optionally, wherein the one or more MHC II molecules are selected from the group consisting of HLA-DR, HLA-DQ, HLA-DP, HLA-DM, and HLA-DO.
  • 68. The cell of claim 67, wherein the modification is by modulation of the B2M, TAP1, CIITA, MIC-A, and/or MIC-B loci.
  • 69. The cell of claim 68, wherein the modulation of the B2M, TAP1, CIITA, MIC-A, and/or MIC-B loci comprises B2M, TAP1, CIITA, MIC-A, and/or MIC-B knockout.
  • 70. The cell of claim 68, wherein the modulation of the B2M, TAP1, CIITA, MIC-A, and/or MIC-B loci comprises knock-in of a transgene at the B2M, TAP1, CIITA, MIC-A, and/or MIC-B loci.
  • 71. The cell of claim 70, wherein the transgene encodes one or more tolerogenic factors selected from the group consisting of A20/TNFAIP3, CD16, CD16 Fc receptor, CD24, CD35, CD39, CD46, CD47, CD52, CD55, CD59, CD200, CCL22, CTLA4-Ig, C1 inhibitor, complement receptor (CR1), DUX4, FASL, H2-M3, IDO1, IL15-RF, HLA-C, HLA-E, HLA-E heavy chain, HLA-G, IL-10, IL-35, MANF, PD-1, PD-L1, SERPINB9, CCL21, and MFGE8.
  • 72. The cell of claim 71, wherein the one or more tolerogenic factors comprise CD47.
  • 73. The cell of claim 72, wherein the CD47 is human CD47.
  • 74. The cell of claim 73, wherein the human CD47 comprises an amino acid sequence that is at least 80% identical to the amino acid sequence set forth in any one of SEQ ID NOs: 583-588.
  • 75. The cell of claim 74, wherein the human CD47 comprises an amino acid sequence that is at least 80% identical to the amino acid sequence set forth in SEQ ID NO:584.
  • 76. The cell of claim 71, wherein the one or more tolerogenic factors comprise HLA-E.
  • 77. The cell of claim 71, wherein the one or more tolerogenic factors comprise CD24.
  • 78. The cell of claim 71, wherein the one or more tolerogenic factors comprise PD-L1.
  • 79. The cell of claim 71, wherein the one or more tolerogenic factors comprise CD24, CD47, and PD-L1.
  • 80. The cell of claim 71, wherein the one or more tolerogenic factors comprise CD46.
  • 81. The cell of claim 71, wherein the one or more tolerogenic factors comprise CD55.
  • 82. The cell of claim 71, wherein the one or more tolerogenic factors comprise CD59.
  • 83. The cell of claim 71, wherein the one or more tolerogenic factors comprise C1 inhibitor.
  • 84. The cell of claim 71, wherein the one or more tolerogenic factors comprise CD46, CD55, CD59, and C1 inhibitor.
  • 85. The cell of claim 71, wherein the one or more tolerogenic factors comprise HLA-E, CD24, CD47, PD-L1, CD46, CD55, CD59, and C1 inhibitor.
  • 86. A pharmaceutical composition comprising the cell of any one of claims 52-85.
  • 87. A method of treating a disease in a subject in need thereof, the method comprising administering the subject the cell of any one of claims 52-85, or the pharmaceutical composition of claim 86.
  • 88. The method of claim 87, wherein the disease is cancer.
  • 89. The method of claim 88, wherein the cancer is a hematologic malignancy.
  • 90. The method of claim 89, wherein the hematologic malignancy is selected from the group consisting of myeloid neoplasm, myelodysplastic syndromes (MDS), myeloproliferative/myelodysplastic syndromes, acute lymphoid leukemia (ALL), chronic lymphocytic leukemia (CLL), acute myeloid leukemia (AML), chronic myelogenous leukemia (CML), B cell acute lymphoid leukemia (B-ALL), T cell acute lymphoid leukemia (T-ALL), T cell lymphoma, and B cell lymphoma.
  • 91. The method of claim 87, wherein the disease is an autoimmune disease.
  • 92. The method of claim 91, wherein the autoimmune disease is selected from the group consisting of lupus, systemic lupus erythematosus, rheumatoid arthritis, psoriasis, psoriatic arthritis, multiple sclerosis, Crohn's disease, ulcerative colitis, Addison's disease, Graves' disease, Sjögren's syndrome, Hashimoto's thyroiditis, and celiac disease.
  • 93. The method of claim 87, wherein the disease is diabetes mellitus.
  • 94. The method of claim 93, wherein the diabetes is selected from the group consisting of Type I diabetes, Type II diabetes, prediabetes, and gestational diabetes.
  • 95. The method of claim 87, wherein the disease is a neurological disease.
  • 96. The method of claim 95, wherein the neurological disease is selected from the group consisting of catalepsy, epilepsy, encephalitis, meningitis, migraine, Huntington's, Alzheimer's, Parkinson's, Pelizaeus-Merzbacher disease, and multiple sclerosis.
  • 97. The method of claim 87, wherein the disease is a cardiac disease.
  • 98. The method of claim 97, wherein the cardiac disease is selected from the group consisting of pediatric cardiomyopathy, age-related cardiomyopathy, dilated cardiomyopathy, hypertrophic cardiomyopathy, restrictive cardiomyopathy, chronic ischemic cardiomyopathy, peripartum cardiomyopathy, inflammatory cardiomyopathy, idiopathic cardiomyopathy, other cardiomyopathy, myocardial ischemic reperfusion injury, ventricular dysfunction, heart failure, congestive heart failure, coronary artery disease, end-stage heart disease, atherosclerosis, ischemia, hypertension, restenosis, angina pectoris, rheumatic heart, arterial inflammation, cardiovascular disease, myocardial infarction, myocardial ischemia, congestive heart failure, myocardial infarction, cardiac ischemia, cardiac injury, myocardial ischemia, vascular disease, acquired heart disease, congenital heart disease, atherosclerosis, coronary artery disease, dysfunctional conduction systems, dysfunctional coronary arteries, pulmonary hypertension, cardiac arrhythmias, muscular dystrophy, muscle mass abnormality, muscle degeneration, myocarditis, infective myocarditis, drug- or toxin-induced muscle abnormalities, hypersensitivity myocarditis, cardiomegaly, mitral insufficiency, and autoimmune endocarditis.
CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Patent Application No. 63/232,142, filed on Aug. 11, 2021, the contents of which are incorporated by reference in their entirety.

PCT Information
Filing Document Filing Date Country Kind
PCT/US2022/074846 8/11/2022 WO
Provisional Applications (1)
Number Date Country
63232142 Aug 2021 US