This application claims the benefit of Chinese Patent Application App. No. 202010444099.X, filed on May 22, 2020. The entire content of the foregoing application is incorporated herein by reference.
This disclosure relates to genetically modified animal expressing human or chimeric (e.g., humanized) CD94 and/or NKG2A, and methods of use thereof.
The immune system has developed multiple mechanisms to prevent deleterious activation of immune cells. One such mechanism is the intricate balance between positive and negative co-stimulatory signals delivered to immune cells. Targeting the stimulatory or inhibitory pathways for the immune system is considered to be a potential approach for the treatment of various diseases, e.g., cancers and autoimmune diseases.
The traditional drug research and development for these stimulatory or inhibitory receptors typically use in vitro screening approaches. However, these screening approaches cannot provide the body environment (such as tumor microenvironment, stromal cells, extracellular matrix components and immune cell interaction, etc.), resulting in a higher rate of failure in drug development. In addition, in view of the differences between humans and animals, the test results obtained from the use of conventional experimental animals for in vivo pharmacological test may not reflect the real disease state and the interaction at the targeting sites, resulting in that the results in many clinical trials are significantly different from the animal experimental results. Therefore, the development of humanized animal models that are suitable for human antibody screening and evaluation will significantly improve the efficiency of new drug development and reduce the cost for drug research and development.
Because the amino acid sequences of human CD94 and NKG2A are significantly different from the corresponding proteins in rodents (e.g., the sequence identity between human and mouse CD94 protein sequences is only about 55% (See
This disclosure is related to an animal model with human CD94 or chimeric CD94. The animal model can express human CD94 or chimeric CD94 (e.g., humanized CD94) protein in its body. It can be used in the studies on the function of CD94 gene, and can be used in the screening and evaluation of anti-human CD94 antibodies. This disclosure is also related to an animal model with human NKG2A or chimeric NKG2A. The animal model can express human NKG2A or chimeric NKG2A (e.g., humanized NKG2A) protein in its body. It can be used in the studies on the function of NKG2A gene, and can be used in the screening and evaluation of anti-human NKG2A antibodies. In some embodiments, the disclosure is related to NKG2A/CD94 double-gene humanized mice.
In addition, the animal models prepared by the methods described herein can be used in drug screening, pharmacodynamics studies, treatments for immune-related diseases, and cancer therapy for human CD94 and/or NKG2A target sites; they can also be used to facilitate the development and design of new drugs, and save time and cost. In summary, this disclosure provides a powerful tool for studying the function of CD94 and/or NKG2A protein and a platform for screening cancer drugs.
In one aspect, the disclosure is related to a genetically-modified, non-human animal whose genome comprises at least one chromosome comprising a sequence encoding a human or chimeric CD94.
In some embodiments, the sequence encoding the human or chimeric CD94 is operably linked to an endogenous regulatory element at the endogenous CD94 gene locus in the at least one chromosome.
In some embodiments, the sequence encoding a human or chimeric CD94 comprises a sequence encoding an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 99%, or 100% identical to human CD94 (NP_001337991.1 (SEQ ID NO: 2)).
In some embodiments, the sequence encoding a human or chimeric CD94 comprises a sequence encoding an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 8.
In some embodiments, the sequence encoding a human or chimeric CD94 comprises a sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 99%, or 100% identical to amino acids of 37-179 of SEQ ID NO: 2.
In some embodiments, the animal is a mammal, e.g., a monkey, a rodent, or a mouse. In some embodiments, the animal is a mouse.
In some embodiments, the animal does not express endogenous CD94 or expresses a decreased level of endogenous CD94.
In some embodiments, the animal has one or more cells expressing human or chimeric CD94.
In some embodiments, the animal has one or more cells expressing human or chimeric CD94, and a human NKG2A can interact with the expressed human or chimeric CD94, forming a heterodimer that can recognize MHC class I molecules. In some embodiments, the animal has one or more cells expressing human or chimeric CD94, and an endogenous NKG2A can interact with the expressed human or chimeric CD94, forming a heterodimer that can recognized MHC class I molecules.
In one aspect, the disclosure is related to a genetically-modified, non-human animal, in some embodiments, the genome of the animal comprises a replacement of a sequence encoding a region of endogenous CD94 with a sequence encoding a corresponding region of human CD94 at an endogenous CD94 gene locus.
In some embodiments, the sequence encoding the corresponding region of human CD94 is operably linked to an endogenous regulatory element at the endogenous CD94 locus, and one or more cells of the animal expresses a chimeric CD94.
In some embodiments, the animal does not express endogenous CD94.
In some embodiments, the replaced sequence encodes all or a portion of the extracellular region of endogenous CD94.
In some embodiments, the animal has one or more cells expressing a chimeric CD94 having a cytoplasmic region, a transmembrane region, and an extracellular region, in some embodiments, the extracellular region comprises a sequence that is at least 50%, 60%, 70%, 80%, 90%, 95%, or 99% identical to the extracellular region of human CD94.
In some embodiments, the extracellular region of the chimeric CD94 has a sequence that has at least 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, or 140 contiguous amino acids that are identical to a contiguous sequence present in the extracellular region of human CD94.
In some embodiments, the sequence encoding a region of endogenous CD94 comprises exon 1, exon 2, exon 3, exon 4, exon 5, and/or exon 6, or a part thereof, of the endogenous CD94 gene.
In some embodiments, the animal is a mouse, and the sequence encoding a region of endogenous CD94 starts within exon 3 and ends within exon 6 of the endogenous mouse CD94 gene.
In some embodiments, the animal is heterozygous with respect to the replacement at the endogenous CD94 gene locus. In some embodiments, the animal is homozygous with respect to the replacement at the endogenous CD94 gene locus.
In one aspect, the disclosure is related to a method for making a genetically-modified, non-human animal, comprising: replacing in at least one cell of the animal, at an endogenous CD94 gene locus, a sequence encoding a region of an endogenous CD94 with a sequence encoding a corresponding region of human CD94.
In some embodiments, the sequence encoding the corresponding region of human CD94 comprises exon 2, exon 3, exon 4, exon 5, exon 6, and/or exon 7, or a part thereof, of a human CD94 gene.
In some embodiments, the sequence encoding the corresponding region of human CD94 starts within exon 4 and ends within exon 7 of a human CD94 gene.
In some embodiments, the sequence encoding the corresponding region of human CD94 encodes amino acids 37-179 of SEQ ID NO: 2.
In some embodiments, the region of an endogenous CD94 is located within the extracellular region.
In some embodiments, the sequence encoding a region of endogenous CD94 comprises exon 1, exon 2, exon 3, exon 4, exon 5, and/or exon 6, or a part thereof, of the endogenous CD94 gene.
In some embodiments, the animal is a mouse, and the sequence encoding a region of an endogenous CD94 starts within exon 3 and ends within exon 6 of the endogenous mouse CD94 gene.
In one aspect, the disclosure is related to a non-human animal comprising at least one cell comprising a nucleotide sequence encoding a chimeric CD94 polypeptide, in some embodiments, the chimeric CD94 polypeptide comprises at least 50 contiguous amino acid residues that are identical to the corresponding contiguous amino acid sequence of a human CD94, in some embodiments, the animal expresses the chimeric CD94.
In some embodiments, the chimeric CD94 polypeptide has at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 110, at least 120, at least 130, or at least 140 contiguous amino acid residues that are identical to the corresponding contiguous amino acid sequence of a human CD94 extracellular region.
In some embodiments, the chimeric CD94 polypeptide comprises a sequence that is at least 90%, 95%, or 99% identical to amino acids 37-179 of SEQ ID NO: 2.
In some embodiments, the nucleotide sequence is operably linked to an endogenous CD94 regulatory element of the animal.
In some embodiments, the chimeric CD94 polypeptide comprises an endogenous CD94 cytoplasmic region and/or an endogenous CD94 transmembrane region.
In some embodiments, the nucleotide sequence is integrated to an endogenous CD94 gene locus of the animal.
In some embodiments, the chimeric CD94 has at least one mouse CD94 activity and/or at least one human CD94 activity.
In one aspect, the disclosure is related to a method of making a genetically-modified non-human animal cell that expresses a chimeric CD94, the method comprising: replacing at an endogenous CD94 gene locus, a nucleotide sequence encoding a region of endogenous CD94 with a nucleotide sequence encoding a corresponding region of human CD94, thereby generating a genetically-modified non-human animal cell that includes a nucleotide sequence that encodes the chimeric CD94, in some embodiments, the non-human animal cell expresses the chimeric CD94.
In some embodiments, the animal is a mammal, e.g., a monkey, a rodent, or a mouse.
In some embodiments, the chimeric CD94 comprises: a cytoplasmic region and/or a transmembrane region of endogenous CD94; and an extracellular region of human CD94.
In some embodiments, the nucleotide sequence encoding the chimeric CD94 is operably linked to an endogenous CD94 regulatory region, e.g., promoter.
In some embodiments, the animal further comprises a sequence encoding an additional human or chimeric protein. In some embodiments, the additional human or chimeric protein is Killer Cell Lectin Like Receptor C1 (NKG2A), programmed cell death protein 1 (PD-1), B7 Homolog 3 (B7-H3), V-set domain-containing T-cell activation inhibitor 1 (B7-H4), Interleukin-2 (IL-2), Interleukin-23 subunit alpha (IL23A), C-C Motif Chemokine Receptor 2 (CCR2), cytotoxic T-lymphocyte-associated protein 4 (CTLA-4), Lymphocyte Activating 3 (LAG-3), B And T Lymphocyte Associated (BTLA), Programmed Cell Death 1 Ligand 1 (PD-L1), CD27, CD28, CD47, CD137, CD154, T-Cell Immunoreceptor With Ig And ITIM Domains (TIGIT), T-cell Immunoglobulin and Mucin-Domain Containing-3 (TIM-3), Glucocorticoid-Induced TNFR-Related Protein (GITR), Signal regulatory protein α (SIRPα) or TNF Receptor Superfamily Member 4 (OX40).
In some embodiments, the additional human or chimeric protein is NKG2A, and the animal expresses the human or chimeric NKG2A.
In some embodiments, the animal further comprises a sequence encoding human or chimeric PD-1, and the animal expresses the human or chimeric PD-1.
In one aspect, the disclosure is related to a genetically-modified, non-human animal whose genome comprises at least one chromosome comprising a sequence encoding a human or chimeric Killer cell lectin-like receptor subfamily C, member 1 (NKG2A).
In some embodiments, the sequence encoding the human or chimeric NKG2A is operably linked to an endogenous regulatory element at the endogenous NKG2A gene locus in the at least one chromosome.
In some embodiments, the sequence encoding a human or chimeric NKG2A comprises a sequence encoding an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 99%, or 100% identical to human NKG2A (NP_998823.1 (SEQ ID NO: 29)).
In some embodiments, the sequence encoding a human or chimeric NKG2A comprises a sequence encoding an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 35.
In some embodiments, the sequence encoding a human or chimeric NKG2A comprises a sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 99%, or 100% identical to amino acids of 94-233 of SEQ ID NO: 29.
In some embodiments, the animal is a mammal, e.g., a monkey, a rodent, or a mouse. In some embodiments, the animal is a mouse.
In some embodiments, the animal does not express endogenous NKG2A or expresses a decreased level of endogenous NKG2A.
In some embodiments, the animal has one or more cells expressing human or chimeric NKG2A.
In some embodiments, the animal has one or more cells expressing human or chimeric NKG2A, and a human CD94 can interact with the expressed human or chimeric NKG2A, forming a heterodimer that can recognize MHC class I molecules. In some embodiments, the animal has one or more cells expressing human or chimeric NKG2A, and an endogenous CD94 can interact with the expressed human or chimeric NKG2A, forming a heterodimer that can recognized MHC class I molecules.
In one aspect, the disclosure is related to a genetically-modified, non-human animal, in some embodiments, the genome of the animal comprises a replacement of a sequence encoding a region of endogenous NKG2A with a sequence encoding a corresponding region of human NKG2A at an endogenous NKG2A gene locus.
In some embodiments, the sequence encoding the corresponding region of human NKG2A is operably linked to an endogenous regulatory element at the endogenous NKG2A locus, and one or more cells of the animal expresses a chimeric NKG2A.
In some embodiments, the animal does not express endogenous NKG2A.
In some embodiments, the replaced sequence encodes all or a portion of the extracellular region of endogenous NKG2A.
In some embodiments, the animal has one or more cells expressing a chimeric NKG2A having a cytoplasmic region, a transmembrane region, and an extracellular region, in some embodiments, the extracellular region comprises a sequence that is at least 50%, 60%, 70%, 80%, 90%, 95%, or 99% identical to the extracellular region of human NKG2A.
In some embodiments, the extracellular region of the chimeric NKG2A has a sequence that has at least 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, or 140 contiguous amino acids that are identical to a contiguous sequence present in the extracellular region of human NKG2A.
In some embodiments, the sequence encoding a region of endogenous NKG2A comprises exon 1, exon 2, exon 3, exon 4, exon 5, and/or exon 6, or a part thereof, of the endogenous NKG2A gene.
In some embodiments, the animal is a mouse, and the sequence encoding a region of endogenous NKG2A starts within exon 2 and ends within exon 6 of the endogenous mouse NKG2A gene.
In some embodiments, the animal is heterozygous with respect to the replacement at the endogenous NKG2A gene locus. In some embodiments, the animal is homozygous with respect to the replacement at the endogenous NKG2A gene locus.
In one aspect, the disclosure is related to a method for making a genetically-modified, non-human animal, comprising: replacing in at least one cell of the animal, at an endogenous NKG2A gene locus, a sequence encoding a region of an endogenous NKG2A with a sequence encoding a corresponding region of human NKG2A.
In some embodiments, the sequence encoding the corresponding region of human NKG2A comprises exon 3, exon 4, exon 5, exon 6, exon 7, and/or exon 8, or a part thereof, of a human NKG2A gene.
In some embodiments, the sequence encoding the corresponding region of human NKG2A starts within exon 4 and ends within exon 8 of a human NKG2A gene.
In some embodiments, the sequence encoding the corresponding region of human NKG2A encodes amino acids 94-233 of SEQ ID NO: 29.
In some embodiments, the region of an endogenous NKG2A is located within the extracellular region.
In some embodiments, the sequence encoding a region of endogenous NKG2A comprises exon 1, exon 2, exon 3, exon 4, exon 5, and/or exon 6, or a part thereof, of the endogenous NKG2A gene.
In some embodiments, the animal is a mouse, and the sequence encoding a region of an endogenous NKG2A starts within exon 2 and ends within exon 6 of the endogenous mouse NKG2A gene.
In one aspect, the disclosure is related to a non-human animal comprising at least one cell comprising a nucleotide sequence encoding a chimeric NKG2A polypeptide, in some embodiments, the chimeric NKG2A polypeptide comprises at least 50 contiguous amino acid residues that are identical to the corresponding contiguous amino acid sequence of a human NKG2A, in some embodiments, the animal expresses the chimeric NKG2A.
In some embodiments, the chimeric NKG2A polypeptide has at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 110, at least 120, at least 130, or at least 140 contiguous amino acid residues that are identical to the corresponding contiguous amino acid sequence of a human NKG2A extracellular region.
In some embodiments, the chimeric NKG2A polypeptide comprises a sequence that is at least 90%, 95%, or 99% identical to amino acids 94-233 of SEQ ID NO: 29.
In some embodiments, the nucleotide sequence is operably linked to an endogenous NKG2A regulatory element of the animal.
In some embodiments, the chimeric NKG2A polypeptide comprises an endogenous NKG2A cytoplasmic region and/or an endogenous NKG2A transmembrane region.
In some embodiments, the nucleotide sequence is integrated to an endogenous NKG2A gene locus of the animal.
In some embodiments, the chimeric NKG2A has at least one mouse NKG2A activity and/or at least one human NKG2A activity.
In one aspect, the disclosure is related to a method of making a genetically-modified non-human animal cell that expresses a chimeric NKG2A, the method comprising: replacing at an endogenous NKG2A gene locus, a nucleotide sequence encoding a region of endogenous NKG2A with a nucleotide sequence encoding a corresponding region of human NKG2A, thereby generating a genetically-modified non-human animal cell that includes a nucleotide sequence that encodes the chimeric NKG2A, in some embodiments, the non-human animal cell expresses the chimeric NKG2A.
In some embodiments, the animal is a mammal, e.g., a monkey, a rodent, or a mouse.
In some embodiments, the chimeric NKG2A comprises: a cytoplasmic region and/or a transmembrane region of endogenous NKG2A; and an extracellular region of human NKG2A.
In some embodiments, the nucleotide sequence encoding the chimeric NKG2A is operably linked to an endogenous NKG2A regulatory region, e.g., promoter.
In some embodiments, the animal further comprises a sequence encoding an additional human or chimeric protein. In some embodiments, the additional human or chimeric protein is Cluster of Differentiation 94 (CD94), programmed cell death protein 1 (PD-1), B7 Homolog 3 (B7-H3), V-set domain-containing T-cell activation inhibitor 1 (B7-H4), Interleukin-2 (IL-2), Interleukin-23 subunit alpha (IL23A), C-C Motif Chemokine Receptor 2 (CCR2), cytotoxic T-lymphocyte-associated protein 4 (CTLA-4), Lymphocyte Activating 3 (LAG-3), B And T Lymphocyte Associated (BTLA), Programmed Cell Death 1 Ligand 1 (PD-L1), CD27, CD28, CD47, CD137, CD154, T-Cell Immunoreceptor With Ig And ITIM Domains (TIGIT), T-cell Immunoglobulin and Mucin-Domain Containing-3 (TIM-3), Glucocorticoid-Induced TNFR-Related Protein (GITR), Signal regulatory protein α (SIRPα) or TNF Receptor Superfamily Member 4 (OX40).
In some embodiments, the additional human or chimeric protein is CD94, and the animal expresses the human or chimeric CD94.
In some embodiments, the animal further comprises a sequence encoding human or chimeric PD-1, and the animal expresses the human or chimeric PD-1.
In one aspect, the disclosure is related to a method of determining effectiveness of a therapeutic agent recognizing a NKG2A/CD94 receptor for treating cancer, comprising: administering the therapeutic agent to the animal as described herein, in some embodiments, the animal has a cancer; and determining the inhibitory effects of the therapeutic agent to the cancer.
In some embodiments, the therapeutic agent is an anti-CD94 antibody or anti-NKG2A antibody.
In some embodiments, the cancer comprises one or more cells that express MHC class I molecules (e.g., human HLA-E or mouse Qa-1).
In some embodiments, the cancer comprises one or more cancer cells that are injected into the animal.
In some embodiments, determining the inhibitory effects of the therapeutic agent to the cancer involves measuring the tumor volume in the animal.
In some embodiments, the cancer is gynecologic cancer, ovarian cancer, uterine cancer, vaginal cancer, cervical cancer, vulvar cancer, head and neck cancer, non-small cell lung cancer (NSCLC), hematological cancer, solid tumor, breast cancer, chronic lymphocytic leukemia, squamous cell carcinoma of the oral cavity, colorectal cancer, liver cancer, glioblastoma, Hodgkin lymphoma, esophagus cancer, gastric cancer, pancreas cancer, renal cancer, lung cancer, or melanoma.
In one aspect, the disclosure is related to a method of determining effectiveness of an therapeutic agent recognizing a NKG2A/CD94 receptor and an additional therapeutic agent for the treating cancer, comprising administering the therapeutic agent and the additional therapeutic agent to the animal as described herein, in some embodiments, the animal has a cancer; and determining the inhibitory effects on the cancer.
In some embodiments, the animal further comprises a sequence encoding a human or chimeric programmed cell death protein 1 (PD-1).
In some embodiments, the animal further comprises a sequence encoding a human or chimeric programmed death-ligand 1 (PD-L1).
In some embodiments, the additional therapeutic agent is an anti-PD-1 antibody, an anti-PD-L1 antibody, or an anti-EGFR (epidermal growth factor receptor) antibody..
In some embodiments, the cancer comprises one or more cancer cells that express PD-L1, or PD-L2.
In some embodiments, the cancer is caused by injection of one or more cancer cells into the animal.
In some embodiments, determining the inhibitory effects of the treatment involves measuring the tumor volume in the animal.
In some embodiments, the animal has gynecologic cancer, ovarian cancer, uterine cancer, vaginal cancer, cervical cancer, vulvar cancer, head and neck cancer, non-small cell lung cancer (NSCLC), hematological cancer, solid tumor, breast cancer, chronic lymphocytic leukemia, squamous cell carcinoma of the oral cavity, colorectal cancer, liver cancer, glioblastoma, Hodgkin lymphoma, esophagus cancer, gastric cancer, pancreas cancer, renal cancer, lung cancer, or melanoma.
In one aspect, the disclosure is related to a method of determining effectiveness of a therapeutic agent recognizing a NKG2A/CD94 receptor for treating an immune disorder, comprising: administering the therapeutic agent to the animal as described herein, in some embodiments, the animal has the immune disorder; and determining the inhibitory effects of the therapeutic agent to the immune disorder.
In some embodiments, the therapeutic agent is an anti-CD94 antibody or anti-NKG2A antibody.
In some embodiments, the cancer comprises one or more cells that express MHC class I molecules (e.g., human HLA-E or mouse Qa-1).
In some embodiments, the immune disorder is rheumatoid arthritis.
In one aspect, the disclosure is related to a protein comprising an amino acid sequence, in some embodiments, the amino acid sequence is one of the following:
In one aspect, the disclosure is related to a nucleic acid comprising a nucleotide sequence, in some embodiments, the nucleotide sequence is one of the following:
In one aspect, the disclosure is related to a cell comprising the protein and/or the nucleic acid as described herein. In one aspect, the disclosure is related to an animal comprising the protein and/or the nucleic acid as described herein.
In another aspect, the disclosure also provides a genetically-modified, non-human animal whose genome comprise a disruption in the animal’s endogenous NKG2A gene, wherein the disruption of the endogenous NKG2A gene comprises deletion of exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, and/or exon 7, or part thereof of the endogenous NKG2A gene.
In some embodiments, the disruption of the endogenous NKG2A gene comprises deletion of one or more exons or part of exons selected from the group consisting of exon 2, exon 3, exon 4, exon 5, and/or exon 6 of the endogenous NKG2A gene.
In some embodiments, the disruption of the endogenous NKG2A gene comprises deletion of a portion of exon 2, exons 3-5, and a portion of exon 6 of the endogenous NKG2A gene. In some embodiments, the disruption of the endogenous NKG2A gene also comprises deletion of intron 2, intron 3, intron 4, and/or intron 5 of the endogenous NKG2A gene. In some embodiments, the disruption of the endogenous NKG2A gene comprises a nucleotide sequence within endogenous NKG2A gene exon 2 encoding the last amino acid. In some embodiments, the disruption of the endogenous NKG2A gene comprises a nucleotide sequence encoding the extracellular region of endogenous NKG2A.
In some embodiments, the disruption of the endogenous NKG2A gene further comprises deletion of one or more introns or part of introns selected from the group consisting of intron 2, intron 3, intron 4, and intron 5 of the endogenous NKG2A gene.
In some embodiments, the deletion can comprise deleting at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 10, 220, 230, 240, 250, 260, 270, 280, 290, 300, 350, 400, 450, 500, 550, 600, 1000, 1500, 2000, 2500, 3000, 3400, or more nucleotides.
In some embodiments, the disruption of the endogenous gene comprises the deletion of at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 10, 220, 230, 240, 250, 260, 270, 280, 290, 300, 350, 400, 450, 500, 550, 600, or more nucleotides of exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, and/or exon 7 (e.g., deletion of a nucleotide sequence starting from the first few nucleotide (e.g., 1 ,2 ,3, 4, or 5) of the extracellular region-encoding sequence to the last nucleotide of the extracellular region-encoding sequence).
In one aspect, the disclosure is related to a cell comprising the protein as described herein and/or the nucleic acid as described herein. In one aspect, the disclosure is related to an animal comprising the protein as described herein and/or the nucleic acid as described herein.
The disclosure also relates to non-human mammal generated through the methods as described herein. In some embodiments, the genome thereof contains human gene(s). In some embodiments, the non-human mammal is a rodent. In some embodiments, the non-human mammal is a mouse. In some embodiments, the non-human mammal expresses a protein encoded by a humanized CD94 or NKG2A gene.
The disclosure also relates to an offspring of the non-human mammal.
In another aspect, the disclosure relates to a tumor bearing non-human mammal model, characterized in that the non-human mammal model is obtained through the methods as described herein. In some embodiments, the non-human mammal is a rodent. In some embodiments, the non-human mammal is a mouse.
The disclosure also relates to a cell (e.g., stem cell or embryonic stem cell) or cell line, or a primary cell culture thereof derived from the non-human mammal or an offspring thereof, or the tumor bearing non-human mammal.
The disclosure further relates to the tissue, organ or a culture thereof derived from the non-human mammal or an offspring thereof, or the tumor bearing non-human mammal.
In another aspect, the disclosure relates to a tumor tissue derived from the non-human mammal or an offspring thereof when it bears a tumor, or the tumor bearing non-human mammal.
The disclosure further relates to a CD94 or NKG2A genomic DNA sequence of a humanized mouse, a DNA sequence obtained by a reverse transcription of the mRNA obtained by transcription thereof is consistent with or complementary to the DNA sequence; a construct expressing the amino acid sequence thereof; a cell comprising the construct thereof; a tissue comprising the cell thereof.
The disclosure further relates to the use of the non-human mammal or an offspring thereof, or the tumor bearing non-human mammal, the animal model generated through the method as described herein in the development of a product related to an immunization processes of human cells, the manufacture of a human antibody, or the model system for a research in pharmacology, immunology, microbiology and medicine.
The disclosure also relates to the use of the non-human mammal or an offspring thereof, or the tumor bearing non-human mammal, the animal model generated through the method as described herein in the production and utilization of an animal experimental disease model of an immunization processes involving human cells, the study on a pathogen, or the development of a new diagnostic strategy and /or a therapeutic strategy.
The disclosure further relates to the use of the non-human mammal or an offspring thereof, or the tumor bearing non-human mammal, the animal model generated through the methods as described herein, in the screening, verifying, evaluating or studying the CD94 or NKG2A gene function, human CD94 or NKG2A antibodies, the drugs or efficacies for human CD94 or NKG2A targeting sites, and the drugs for immune-related diseases and antitumor drugs.
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Methods and materials are described herein for use in the present invention; other, suitable methods and materials known in the art can also be used. The materials, methods, and examples are illustrative only and not intended to be limiting. All publications, patent applications, patents, sequences, database entries, and other references mentioned herein are incorporated by reference in their entirety. In case of conflict, the present specification, including definitions, will control.
Other features and advantages of the invention will be apparent from the following detailed description and figures, and from the claims.
This disclosure relates to transgenic non-human animal with human or chimeric (e.g., humanized) CD94, and methods of use thereof.
Natural killer (NK) cells play an important role in the innate immune response to infections and malignancies by directly killing pathogen-infected or transformed cells and producing cytokines and chemokines that help shape the immune response. NK cell activation is controlled by a number of activating and inhibitory receptors. Activating NK receptors recognize a variety of stress-induced molecules such as the human MICA and MICB molecules, human ULBP1-6 proteins, mouse Rae-1, H60, and MULTI family members, all of which are recognized by the activating NKG2D receptor. Some activating NK receptors directly recognize pathogen-encoded ligands. For example, the m157 glycoprotein expressed by MCMV is recognized by the activating Ly49H receptor and NKp46 interacts with influenza hemagglutinin. NK cells are prevented from attacking normal self cells by inhibitory receptors that are reactive to MHC class I.
CD94/NKG2 is a family of C-type lectin receptors which are expressed predominantly on the surface of NK cells and a subset of CD8+ T-lymphocyte. These receptors stimulate or inhibit cytotoxic activity of NK cells, therefore they are divided into activating and inhibitory receptors according to their function. CD94/NKG2 recognize non-classical MHC glycoproteins class I (HLA-E in human and Qa-1 molecules in mouse). CD94/NKG2 family includes seven members: NKG2A, B, C, D, E, F and H. Genes encoding these receptors are clustered in the natural killer complex (NKC) on human chromosome 12 and mouse chromosome 6 together with Clr (C-lectin related) genes.
NKG2 receptors are transmembrane proteins type II which dimerize with CD94 molecule. CD94 contains a short cytoplasmic domain and it is responsible for signal transduction. Therefore NKG2 receptors form disulfide bonded heterodimers.
NKG2A and NKG2B receptors transmit inhibitory signal. They contain two Immunoreceptor tyrosine-based inhibitory motives (ITIM) in their cytoplasmic tail that are defined by the sequence (I/L/V/S)xYxx(L/V), where “x” means any amino acid at a given position. If ITIM-bearing receptors engage their ligand, probably Src family kinase phosphorylates tyrosine residue, and this allows recruitment of the tyrosine phosphatase SHP-1, SHP-2 or SHIP. It leads to de-phosphorylation of tyrosine kinase’s substrates, which are involved in the activating cascades. As a result, NK cell activation is suppressed. By contrast, NKG2C (encoded by the KLRC2 gene), NKG2E and NKG2H are activating receptors.
NKG2A can interact with CD94 to form a heterodimer inhibitory receptor of the C-type lectin family, recognizing a non-classical MHC-I molecule, HLA-E, as ligand. CD94-NKG2A and its HLA-E ligand are non-polymorphic. HLA-E*0101 and HLA-E*0103 represent the only two alleles exhibited by HLA-E in worldwide populations. Almost 50% of the NK cells in the peripheral blood express CD94/NKG2A, primarily those that do not express inhibitory KIRs (killer cell immunoglobulin-like receptors). The co-expression of CD94/NKG2A with other inhibitory receptors of different specificity also exists. In addition, γδ and CD 8+ T cells also express CD94/NKG2A. Ligation of NKG2A and CD94 to HLA-E expressed on normal cells suppresses signaling activation, thereby avoiding the destruction of normal bystander cells.
Non-classical MHC glycoproteins class I are structurally similar to classical MHC class I molecules, but they present mainly peptides derived from the signal peptides of MHC class I. Therefore NK cells can indirectly monitor the expression of classical MHC class I molecules through the interaction of CD94/NKG2 with HLA-E (or Qa-1 in mouse) and HLA-E (or Qa-1 in mouse) themselves as well. During cytomegalovirus infection, virus peptides are presented on HLA-E and NK cells that express the CD94/NKG2C receptor can specifically recognize these virus peptides, which results in activation, expansion, and differentiation of adaptive NK cells. HLA-E molecules are expressed at low levels in most tissues and primarily present peptides derived from the leader sequences of classical class Ia HLA molecules. However, HLA-E is commonly expressed at high levels on the surface of a variety of different cancers.
Tumor cells (hematological as well as solid tumors), in order to avoid killing by NK cells, have shown upregulation of HLA-E expression. In various cancers, poor prognosis has been associated with HLA-E upregulation, including colorectal, ovarian, gynecologic cancers, liver, glioblastoma, Hodgkin lymphoma, chronic lymphocytic leukemia, esophagus, gastric, pancreas, colon, kidney, head and neck, lung and melanoma. Blocking of the CD94/NKG2A receptor with an antibody could be used as a therapeutic strategy. Hence, an antibody against CD94/NKG2A (IPH2201-Monalizumab), developed by Innate Pharma, has been employed in various trials. Thus, CD94 or NKG2A antibodies can be potentially useful as cancer therapies.
Experimental animal models are an indispensable research tool for studying the effects of these antibodies (e.g., CD94 or NKG2A antibodies). Common experimental animals include mice, rats, guinea pigs, hamsters, rabbits, dogs, monkeys, pigs, fish and so on. However, there are many differences between human and animal genes and protein sequences, and many human proteins cannot bind to the animal’s homologous proteins to produce biological activity, leading to that the results of many clinical trials do not match the results obtained from animal experiments. A large number of clinical studies are in urgent need of better animal models. With the continuous development and maturation of genetic engineering technologies, the use of human cells or genes to replace or substitute an animal’s endogenous similar cells or genes to establish a biological system or disease model closer to human, and establish the humanized experimental animal models (humanized animal model) has provided an important tool for new clinical approaches or means. In this context, the genetically engineered animal model, that is, the use of genetic manipulation techniques, the use of human normal or mutant genes to replace animal homologous genes, can be used to establish the genetically modified animal models that are closer to human gene systems. The humanized animal models have various important applications. For example, due to the presence of human or humanized genes, the animals can express or express in part of the proteins with human functions, so as to greatly reduce the differences in clinical trials between humans and animals, and provide the possibility of drug screening at animal levels.
Unless otherwise specified, the practice of the methods described herein can take advantage of the techniques of cell biology, cell culture, molecular biology, transgenic biology, microbiology, recombinant DNA and immunology. These techniques are explained in detail in the following literature, for examples: Molecular Cloning A Laboratory Manual, 2nd Ed., ed. By Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989); DNA Cloning, Volumes I and II (D. N. Glovered., 1985); Oligonucleotide Synthesis (M. J. Gaited., 1984); Mullisetal U. S. Pat. No.4, 683, 195; Nucleic Acid Hybridization (B. D. Hames& S. J. Higginseds. 1984); Transcription And Translation (B. D. Hames& S. J. Higginseds. 1984); Culture Of Animal Cell (R. I. Freshney, Alan R. Liss, Inc., 1987); Immobilized Cells And Enzymes (IRL Press, 1986); B. Perbal, A Practical Guide To Molecular Cloning (1984), the series, Methods In ENZYMOLOGY (J. Abelson and M. Simon, eds.-in-chief, Academic Press, Inc., New York), specifically, Vols.154 and 155 (Wuetal. eds.) and Vol. 185, “Gene Expression Technology” (D. Goeddel, ed.); Gene Transfer Vectors For Mammalian Cells (J. H. Miller and M. P. Caloseds., 1987, Cold Spring Harbor Laboratory); Immunochemical Methods In Cell And Molecular Biology (Mayer and Walker, eds., Academic Press, London, 1987); Hand book Of Experimental Immunology, Volumes V (D. M. Weir and C. C. Blackwell, eds., 1986); and Manipulating the Mouse Embryo, (Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N. Y., 1986); each of which is incorporated herein by reference in its entirety.
CD94 (Cluster of Differentiation 94), also known as killer cell lectin-like receptor subfamily D, member 1 (KLRD1), is a type II transmembrane protein that is involved in cell signaling and is expressed on the surface of natural killer cells in the innate immune system. CD94 exists primarily in a heterodimeric form with NKG2A, C, and E on the cell surface. Depending on the associated NKG2 subunit, CD94/NKG2 can function as either an activating (NKG2C, and E) or an inhibitory (NKG2A) receptor. The inhibitory form has intracellular immunoreceptor tyrosine-based inhibitory motifs (ITIM), while the activating forms contain a positive transmembrane charged residue which facilitates interaction and signaling through the ITAM-containing DAP 12 molecule. Human Leukocyte Antigen-E (HLA-E) was identified as the ligand for CD94/NKG2.
The CD94 transmembrane-anchored glycoprotein forms disulfide-bonded heterodimers with the NKG2A subunit to form an inhibitory receptor or with the NKG2C or NKG2E subunits to assemble a receptor complex with activating DAP12 signaling proteins. CD94 receptors expressed on human and mouse NK cells and T cells have been proposed to be important in NK cell tolerance to self, play an important role in NK cell development, and contribute to NK cell-mediated immunity to certain infections including human cytomegalovirus.
In mice, the inhibitory members of the Ly49 C-type lectin-like receptor family of NK cell receptors are the primary MHC class I receptors. Humans do not express Ly49 receptors; instead, human NK cells express the structurally unrelated inhibitory killer cell immunoglobulin-like receptors (KIRs) that bind to MHC class I. Additionally, humans and mice both express members of the NKG2 family receptors that form obligate disulfide-bonded heterodimers with CD94. When expressed at physiological levels, the human NKG2 proteins cannot be stably expressed on the cell surface without CD94. CD94-NKG2 receptors bind non-classical MHC class Ib molecules, HLA-E in humans and Qa-1 in mice. HLA-E and Qa-1 both present conserved peptides derived from the leader segments of classical MHC class I molecules. Both human and murine NKG2 families consist of three members that share a high degree of similarity in their extracellular domains, NKG2A, NKG2C, and NKG2E. NKG2D is an unrelated receptor that does not pair with CD94 and has low sequence homology with NKG2A, NKG2C, and NKG2E.
In adult mice, CD94 is expressed on splenic NK cells and NKT cells and a small proportion of γδ T cells and CD8+ T cells, primarily CD44hi memory cells. In mouse NK cells, NKG2A is the predominant NKG2 family member with NKG2A mRNA being more prevalent than NKG2C and NKG2E transcripts. NKG2A is the only NKG2 family member expressed in mouse T cells. The cytoplasmic domain of NKG2A contains one canonical immunoreceptor tyrosine-based inhibitory motif (ITIM) in mice and two ITIMs in human. Accordingly, triggering of CD94-NKG2A suppresses NK cell functions. NKG2C and NKG2E lack known intracellular signaling domains, but instead contain a charged residue within their transmembrane domains that facilitate binding to the DAP12 signaling adapter molecule, which contains an immunoreceptor tyrosine based activating motif (ITAM). Triggering of CD94-NKG2C leads to phosphorylation of the DAP12 ITAM and signaling via Syk and ZAP-70, resulting in NK cell activation. As with other paired inhibitory and activating ligands, the affinity for HLA-E is 10-fold higher for the inhibitory human CD94-NKG2A than the activating CD94-NKG2C receptor.
A detailed description of CD94 and its function can be found, e.g., in Orr, Mark T., et al., “Development and function of CD94-deficient natural killer cells.” PloS One 5.12 (2010): e15184; Joyce, M. G. et al., “The structural basis of ligand recognition by natural killer cell receptors.” Journal of Biomedicine and Biotechnology 2011 (2011); each of which is incorporated by reference in its entirety.
In human genomes, CD94 gene (Gene ID: 80381) locus has seven exons, exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, and exon 7 (
The human CD94 gene (Gene ID: 3824) is located in Chromosome 12 of the human genome, which is located from 10238383 to 10329608 of NC_000012.12. The 5′-UTR is from 10304446 to 10304547, and from 10307978 to 10308077, , Exon 1 is from 10304446 to 10304547, the first intron is from 10304548 to 10307977, Exon 2 is from 10307978 to 10308084, the second intron is from 10308085 to 10309387, Exon 3 is from 10309388 to 10309480, the third intron is from 10309481 to 10309625, Exon 4 is from 10309626 to 10309688, the fourth intron is from 10309689 to 10311463, Exon 5 is from 10311464 to 10311615, the fifth intron is from 10311616 to 10313409, Exon 6 is from 10313410 to 10313513, the sixth intron is from 10313514 to 10314672, Exon 7 is from 10314673 to 10329600, and the 3′-UTR is from 10314794 to 10329600. All relevant information for human CD94 locus can be found in the NCBI website with Gene ID: 3824, which is incorporated by reference herein in its entirety.
According to the UniProt Database (UniProt ID: Q13241), the cytoplasmic region of human CD94 corresponds to amino acids 1-10 of SEQ ID NO: 2; the transmembrane region of human CD94 corresponds to amino acids 11-31 of SEQ ID NO: 2; and the extracellular region of human CD94 corresponds to amino acids 32-179 of SEQ ID NO: 2. Specifically, there is a C-type lectin domain within the extracellular region of human CD94, which corresponds to amino acids 68-175 of SEQ ID NO: 2.
In mice, CD94 gene locus has six exons, exon 1, exon 2, exon 3, exon 4, exon 5, and exon 6 (
The mouse CD94 gene (Gene ID: 16643) is located in Chromosome 6 of the mouse genome, which is located from 129588092 to 129598775 of NC_000072.6 (GRCm38.p6 (GCF_000001635.26)). The 5′-UTR is from 129,591,782 to 129,591,830, exon 1 is from 129,591,782 to 129,591,837, the first intron is from 129,591,838 to 129,593,489, exon 2 is from 129,593,490 to 129,593,582, the second intron is from 129,593,583 to 129,593,727, exon 3 is from 129,593,728 to 129,593,790, the third intron is from 129,593,791 to 129,595,409, exon 4 is from 129,595,410 to 129,595,561, the fourth intron is from 129,595,562 to 129,596,861, exon 5 is from 129,596,862 to 129,596,965, the fifth intron is from 129,596,966 to 129,598,348, exon 6 is from 129,598,349 to 129,598,775, and the 3′-UTR is from 129598470 to 129,598,775, based on transcript NM_010654.4. All relevant information for mouse CD94 locus can be found in the NCBI website with Gene ID:16643, which is incorporated by reference herein in its entirety.
According to the UniProt Database (UniProt ID: 054707), the cytoplasmic region of mouse CD94 corresponds to amino acids 1-10 of SEQ ID NO: 1; the transmembrane region of mouse CD94 corresponds to amino acids 11-31 of SEQ ID NO: 1; and the extracellular region of mouse CD94 corresponds to amino acids 32-179 of SEQ ID NO: 1. Specifically, there is a C-type lectin domain within the extracellular region of mouse CD94, which corresponds to amino acids 68-175 of SEQ ID NO: 1.
CD94 genes, proteins, and locus of the other species are also known in the art. For example, the gene ID for CD94 in Rattus norvegicus (rat) is 25110, the gene ID for CD94 in Macaca mulatta (Rhesus monkey) is 574145, the gene ID for CD94 in Canis lupus familiaris (dog) is 611360, and the gene ID for CD94 in Equus caballus (horse) is 100062540. The relevant information for these genes (e.g., intron sequences, exon sequences, amino acid residues of these proteins) can be found, e.g., in NCBI database, which is incorporated by reference herein in its entirety.
The present disclosure provides human or chimeric (e.g., humanized) CD94 nucleotide sequence and/or amino acid sequences. In some embodiments, the entire sequence of mouse exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, cytoplasmic region, transmembrane region, extracellular region, and/or C-type lectin domain, are replaced by the corresponding human sequence. In some embodiments, a “region” or “portion” of mouse exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, cytoplasmic region, transmembrane region, extracellular region, and/or C-type lectin domain are replaced by the corresponding human sequence. The term “region” or “portion” can refer to at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 250, 300, 350, 400, 500, 600, 700, 800 nucleotides, or at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, or 180 amino acid residues. In some embodiments, the “region” or “portion” can be at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 99% identical to exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, cytoplasmic region, transmembrane region, extracellular region, and/or C-type lectin domain of mouse CD94 gene or protein; or exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, exon 7, cytoplasmic region, transmembrane region, extracellular region, and/or C-type lectin domain of human CD94 gene or protein. In some embodiments, a region, a portion, or the entire sequence of mouse CD94 exon 1, exon 2, exon 3, exon 4, exon 5, and/or exon 6 (e.g., a portion of exon 3, exons 4-5, and a portion of exon 6) are replaced by human CD94 exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, and/or exon 7 (e.g., a portion of exon 4, exons 5-6, and a portion of exon 7) sequence.
In some embodiments, the genetically-modified non-human animal described herein comprises a sequence encoding a humanized CD94 protein. In some embodiments, the humanized CD94 protein comprises an endogenous cytoplasmic region. In some embodiments, the humanized CD94 protein comprises an endogenous transmembrane region. In some embodiments, the humanized CD94 protein comprises a humanized extracellular region. In some embodiments, the humanized CD94 protein comprises a humanized C-type lectin domain.
In some embodiments, the genetically-modified non-human animal described herein comprises a humanized CD94 gene. In some embodiments, the humanized CD94 gene comprises 6 exons. In some embodiments, the humanized CD94 gene comprises humanized exon 1, humanized exon 2, humanized exon 3, humanized exon 4, humanized exon 5, and/or humanized exon 6.
In some embodiments, the present disclosure also provides a chimeric (e.g., humanized) CD94 nucleotide sequence and/or amino acid sequences, wherein in some embodiments, at least 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% of the sequence are identical to or derived from mouse CD94 mRNA sequence (e.g., NM_010654.4), mouse CD94 amino acid sequence (e.g., NP_034784.1; SEQ ID NO: 1), or a portion thereof (e.g., exons 1-2, a portion of exon 3, and a portion of exon 6 of mouse CD94 gene); and in some embodiments, at least 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% of the sequence are identical to or derived from human CD94 mRNA sequence (e.g., NM_001351062.1), human CD94 amino acid sequence (e.g., NP_001337991.1; SEQ ID NO: 2), or a portion thereof (e.g., a portion of exon 4, exons 5-6, and a portion of exon 7 of human CD94 gene).
In some embodiments, the sequence encoding amino acids 37-179 of mouse CD94 (SEQ ID NO: 1) is replaced. In some embodiments, the sequence is replaced by a sequence encoding a corresponding region of human CD94 (e.g., amino acids 37-179 of human CD94 (SEQ ID NO: 2)).
In some embodiments, the sequence encoding amino acids 32-179 of mouse CD94 (SEQ ID NO: 1) is replaced. In some embodiments, the sequence is replaced by a sequence encoding a corresponding region of human CD94 (e.g., amino acids 32-179 of human CD94 (SEQ ID NO: 2)).
In some embodiments, the sequence encoding amino acids 68-175 of mouse CD94 (SEQ ID NO: 1) is replaced. In some embodiments, the sequence is replaced by a sequence encoding a corresponding region of human CD94 (e.g., amino acids 68-175 of human CD94 (SEQ ID NO: 2)).
In some embodiments, the sequence encoding the entirety or a portion of the extracellular region of mouse CD94 (SEQ ID NO: 1) is replaced or inactivated. In some embodiments, the sequence is replaced by a sequence encoding the entirety or a portion of the corresponding region of human CD94 (SEQ ID NO: 2). In some embodiments, the corresponding region of human CD94 comprises the entirety or a portion of the extracellular region of human CD94. In some embodiments, the sequence encoding amino acids 37-179 of mouse CD94 (SEQ ID NO: 1) is replaced. In some embodiments, the sequence is replaced by a sequence encoding a corresponding region of human CD94 (e.g., amino acids 37-179 of human CD94 (SEQ ID NO: 2)). In some embodiments, the sequence encoding amino acids 32-179 of mouse CD94 (SEQ ID NO: 1) is replaced. In some embodiments, the sequence is replaced by a sequence encoding a corresponding region of human CD94 (e.g., amino acids 32-179 of human CD94 (SEQ ID NO: 2)). In some embodiments, the sequence encoding the corresponding region of human CD94 does not include at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or at least 10 amino acids at the N-terminus and/or C-terminus of the extracellular region of human CD94. In some embodiments, the sequence encoding the corresponding region of human CD94 comprises a nucleotide sequence within human CD94 exon 4 encoding at least 10-21, 10-15, or 16-19 contiguous amino acids. In some embodiments, the sequence encoding the corresponding region of human CD94 comprises a nucleotide sequence within human CD94 exon 4 encoding at least 20 or 21 contiguous amino acids. In some embodiments, the sequence encoding the corresponding region of human CD94 comprises a nucleotide sequence at the 3′ end of human CD94 exon 3 encoding at least 1, at least 2, at least 3, at least 4, or at least 5 amino acids.
In some embodiments, the human portion of the humanized CD94 protein may also include amino acid residues outside of 37-179 of SEQ ID NO: 2, which are identical or similar to the corresponding residues of endogenous CD94 protein of the non-human animal. Such residues can be derived from the extracellular region, transmembrane region, and/or cytoplasmic region, e.g., amino acids 34-179, or amino acids 33-179 of SEQ ID NO: 2, within the extracellular region of human CD94 protein. An alignment of the amino acid resides between human and mouse CD94 is shown in
In some embodiments, the sequence encoding the extracellular C-type lectin domain of mouse CD94 (SEQ ID NO: 1) is replaced or inactivated. In some embodiments, the sequence is replaced by a sequence encoding the extracellular C-type lectin domain of human CD94 (SEQ ID NO: 2). In some embodiments, the sequence encoding amino acids 68-175 of mouse CD94 (SEQ ID NO: 1) is replaced. In some embodiments, the sequence is replaced by a sequence encoding amino acids 68-175 of human CD94 (SEQ ID NO: 2).
In some embodiments, the nucleic acids as described herein are operably linked to a promotor or regulatory element, e.g., an endogenous mouse CD94 promotor, an inducible promoter, an enhancer, and/or mouse or human regulatory elements.
In some embodiments, the nucleic acid sequence has at least a portion (e.g., at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100 nucleotides, e.g., contiguous or non-contiguous nucleotides) that are different from a portion of or the entire mouse CD94 nucleotide sequence (e.g., a portion of exon 3, exons 4-5, and a portion of exon 6 of NM_010654.4).
In some embodiments, the nucleic acid sequence has at least a portion (e.g., at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100 nucleotides, e.g., contiguous or non-contiguous nucleotides) that is the same as a portion of or the entire mouse CD94 nucleotide sequence (e.g., exons 1-2, a portion of exon 3, exons 4-5, and a portion of exon 6 of NM_010654.4).
In some embodiments, the nucleic acid sequence has at least a portion (e.g., at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100 nucleotides, e.g., contiguous or non-contiguous nucleotides) that is different from a portion of or the entire human CD94 nucleotide sequence (e.g., exons 1-3, a portion of exon 4, and a portion of exon 7 of NM_001351062.1).
In some embodiments, the nucleic acid sequence has at least a portion (e.g., at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100 nucleotides, e.g., contiguous or non-contiguous nucleotides) that is the same as a portion of or the entire human CD94 nucleotide sequence (e.g., a portion of exon 4, exons 5-6, and a portion of exon 7 of NM_001351062.1).
In some embodiments, the amino acid sequence has at least a portion (e.g., at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100 amino acid residues, e.g., contiguous or non-contiguous amino acid residues) that is different from a portion of or the entire mouse CD94 amino acid sequence (e.g., an amino acid sequence encoded by a portion of exon 3, exons 4-5, and a portion of exon 6 of NM_010654.4; or amino acids 37-179 of NP_034784.1 (SEQ ID NO: 1)).
In some embodiments, the amino acid sequence has at least a portion (e.g., at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100 amino acid residues, e.g., contiguous or non-contiguous amino acid residues) that is the same as a portion of or the entire mouse CD94 amino acid sequence (e.g., an amino acid sequence encoded by exons 1-2, a portion of exon 3, exons 4-5, and a portion of exon 6 of NM_010654.4; or amino acids 1-36 of NP_034784.1 (SEQ ID NO: 1)).
In some embodiments, the amino acid sequence has at least a portion (e.g., at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100 amino acid residues, e.g., contiguous or non-contiguous amino acid residues) that is different from a portion of or the entire human CD94 amino acid sequence (e.g., an amino acid sequence encoded by exons 1-3, a portion of exon 4, and a portion of exon 7 of NM_001351062.1; or amino acids 1-36 of NP_001337991.1 (SEQ ID NO: 2)).
In some embodiments, the amino acid sequence has at least a portion (e.g., at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100 amino acid residues, e.g., contiguous or non-contiguous amino acid residues) that is the same as a portion of or the entire human CD94 amino acid sequence (e.g., an amino acid sequence encoded by a portion of exon 4, exons 5-6, and a portion of exon 7 of NM_001351062.1; or amino acids 37-179 of NP_001337991.1 (SEQ ID NO: 2)).
NKG2A (also known as CD159A), also known as killer cell lectin-like receptor subfamily C, member 1 (KLRC1), is a type II transmembrane protein belonging to the CD94/NKG2 family of C-type lectin-like receptors that inhibits innate immune system activation.
The cytoplasmic tail of human NKG2A receptor contains two immunoreceptor tyrosine-based inhibition motifs (ITIM) capable of recruiting both SHP-1 and SHP-2 phosphatases, but not the inositol phosphatase SHIP. Both ITIMs are required to mediate the maximal inhibitory signal, but the membrane-distal ITIM is of primary importance rather than the membrane-proximal ITIM. The partner CD94 has a short cytoplasmic region, thus lacks ITIMs and has no role in downstream signaling.
The CD94/NKG2A heterodimeric receptor is one of the most prominent NK inhibitory receptors. It binds to a nonclassical minimally polymorphic HLA class I molecule (HLA-E), which presents peptides derived from leader peptide sequences of other HLA class I molecules, such as HLA-G. Upon ligation by peptide-loaded HLA-E, NKG2A transduces inhibitory signaling through 2 inhibitory immune-receptor tyrosine-based inhibition motifs, thus suppressing NK cytokine secretion and cytotoxicity.
CD94/NKG2A and CD94/NKG2C were identified in the mid-1990s as cell surface glycoproteins that form disulfide-bonded heterodimers with CD94 and bind the non-classical MHC class Ib molecule HLA-E. NKG2C engagement in CMV-seropositive individuals imparts NK cell activation (with activated cells termed adaptive NK cells), while engagement of CD94/NKG2A transduces an inhibitory signal, consistent with the presence of two I/VxYxxL immunoreceptor tyrosine-based inhibitory motifs (ITIMs) within the cytoplasmic domain of NKG2A. While NKG2A expression and NKG2C expression are usually mutually exclusive, the frequency of NKG2A+ NK cells is considerably higher than that of NKG2C+ NK cells, especially in CMV-seronegative donors. Therefore, alongside killer immunoglobulin-like receptors (KIR), NKG2A represents a dominant inhibitory receptor on NK cells. The NKG2A phospho-ITIMs interact directly with the SH2 domains of the tyrosine phosphatases SHP-1 and SHP-2. One of the major targets of SHP-1-mediated dephosphorylation in NK cells is the guanine exchange factor and adaptor protein Vav1. Dephosphorylation of Vav1 prevents Rac1-dependent rearrangement of the actin cytoskeleton and amplification of activating signals. Engagement of CD94/NKG2A by HLA-E within inhibitory signaling clusters can also lead to the phosphorylation of the signaling adaptor protein Crk and disruption of actin-dependent signaling upstream of Vav1. NKG2A is uniformly high on immunoregulatory CD56bright NK cells, whereas cytotoxic CD56dim NK cells exhibit more heterogeneous expression, with a general decrease associated with terminal differentiation.
A detailed description of NKG2A and its function can be found, e.g., in Khan, M., et al., “NK cell-based immune checkpoint inhibition.” Frontiers in Immunology 11 (2020); Kamiya, T. et al., “Blocking expression of inhibitory receptor NKG2A overcomes tumor resistance to NK cells.” The Journal of Clinical Investigation 129.5 (2019): 2094-2106; Borst, L., et al., “The NKG2A-HLA-E axis as a novel checkpoint in the tumor microenvironment.” Clinical Cancer Research 26.21 (2020): 5549-5556; Cichocki, F. et al., “Setting traps for NKG2A gives NK cell immunotherapy a fighting chance.” The Journal of Clinical Investigation 129.5 (2019): 1839-1841; each of which is incorporated by reference in its entirety.
In human genomes, NKG2A gene (Gene ID: 3821) locus has eight exons, exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, exon 7, and exon 8 (
The human NKG2A gene (Gene ID: 3821) is located in Chromosome 12 of the human genome, which is located from 10441673 to 10454685 of NC_000012.12.. The 5′-UTR is from 10454332 to 10454616, and from 10453198 to 10453269, and from 10451157 to 10451187, Exon 1 is from 10454616 to 10454332, the first intron is from 10454331 to 10453270, Exon 2 is from 10453269 to 10453198, the second intron is from 10453197 to 10451188, Exon 3 is from 10451187 to 10450970, the third intron is from 10450969 to 10450580, Exon 4 is from 10450579 to 10450484, the fourth intron is from 10450483 to 10449968, Exon 5 is from 10449967 to 10449914, the fifth intron is from 10449913 to 10449389, Exon 6 is from 10449388 to 10449237, the sixth intron is from 10449236 to 10447633, Exon 7 is from 10447632 to 10447532, the seventh intron is from 10447531 to 10446663, Exon 8 is from 10446662 to 10446041 and the 3′-UTR is from 10446041 to 10446550.All relevant information for human NKG2A locus can be found in the NCBI website with Gene ID: 3821, which is incorporated by reference herein in its entirety.
According to the UniProt Database (UniProt ID: P26715), the cytoplasmic region of human NKG2A corresponds to amino acids 1-70 of SEQ ID NO: 29; the transmembrane region of human NKG2A corresponds to amino acids 71-93 of SEQ ID NO: 29; and the extracellular region of human NKG2A corresponds to amino acids 94-233 of SEQ ID NO: 29. Specifically, there is a C-type lectin domain within the extracellular region of human NKG2A, which corresponds to amino acids 118-231 of SEQ ID NO: 29. In addition, there are two Immunoreceptor tyrosine-based inhibition motifs (ITIMs) within the cytoplasmic region, which correspond to amino acids 6-11 and 38-43 of SEQ ID NO: 29, respectively.
In mice, NKG2A gene locus has seven exons, exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, and exon 7 (
The mouse NKG2A gene (Gene ID: 16641) is located in Chromosome 6 of the mouse genome, which is located from 129666015 to 129682852 of NC_000072.6 (GRCm38.p6 (GCF_000001635.26)). The 5′-UTR is from 129,678,973 to 129,678,911, exon 1 is from 129,678,973 to 129,678,724, the first intron is from 129,678,723 to 129,678,398, exon 2 is from 129,678,397 to 129,678,296, the second intron is from 129,678,295 to 129,677,816, exon 3 is from 129,677,815 to 129,677,765, the third intron is from 129,677,764 to 129,677,323, exon 4 is from 129,677,322 to 129,677,147, the fourth intron is from 129,677,146 to 129,675,401, exon 5 is from 129,675,400 to 129,675,300, the fifth intron is from 129,675,299 to 129,674,977, exon 6 is from X129,674,976 to 129,674,834, the sixth intron is from 129,674,833 to 129,667,176, exon 7 is from 129,667,175 to 129,666,015, the 3′-UTR is from 129,674,858 to 129,674,834 and 129,667,175 to 129,666,015, based on transcript NM_001136068.2. All relevant information for mouse NKG2A locus can be found in the NCBI website with Gene ID: 16641, which is incorporated by reference herein in its entirety.
According to the UniProt Database (UniProt ID: Q9Z202), the cytoplasmic region of mouse NKG2A corresponds to amino acids 1-71 of SEQ ID NO: 28; the transmembrane region of mouse NKG2A corresponds to amino acids 72-94 of SEQ ID NO: 28; and the extracellular region of mouse NKG2A corresponds to amino acids 95-244 of SEQ ID NO: 28. Specifically, there is a C-type lectin domain within the extracellular region of mouse NKG2A, which corresponds to amino acids 135-239 of SEQ ID NO: 28.
NKG2A genes, proteins, and locus of the other species are also known in the art. For example, the gene ID for NKG2A in Rattus norvegicus (rat) is 29683, the gene ID for NKG2A in Macaca mulatta (Rhesus monkey) is 574146, the gene ID for NKG2A in Bos taurus (cattle) is 100139049, and the gene ID for NKG2A in Pan troglodytes (chimpanzee) is 450131. The relevant information for these genes (e.g., intron sequences, exon sequences, amino acid residues of these proteins) can be found, e.g., in NCBI database, which is incorporated by reference herein in its entirety.
The present disclosure provides human or chimeric (e.g., humanized) NKG2A nucleotide sequence and/or amino acid sequences. In some embodiments, the entire sequence of mouse exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, exon 7, cytoplasmic region, transmembrane region, extracellular region, and/or C-type lectin domain, are replaced by the corresponding human sequence. In some embodiments, a “region” or “portion” of mouse exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, exon 7, cytoplasmic region, transmembrane region, extracellular region, and/or C-type lectin domain are replaced by the corresponding human sequence. The term “region” or “portion” can refer to at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 250, 300, 350, 400, 500, 600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 1600, 1700, 1800, 1900 nucleotides, or at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200, 210, 220, 230, or 240 amino acid residues. In some embodiments, the “region” or “portion” can be at least 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, or 99% identical to exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, exon 7, cytoplasmic region, transmembrane region, extracellular region, and/or C-type lectin domain of mouse NKG2A gene or protein; or exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, exon 7, exon 8, cytoplasmic region, transmembrane region, extracellular region, and/or C-type lectin domain of human NKG2A gene or protein. In some embodiments, a region, a portion, or the entire sequence of mouse NKG2A exon 1, exon 2, exon 3, exon 4, exon 5, exon, 6, and/or exon 7 (e.g., a portion of exon 2, exons 3-5, and a portion of exon 6) are replaced by human NKG2A exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, exon 7, and/or exon 8 (e.g., a portion of exon 4, exons 5-7, and a portion of exon 8) sequence.
In some embodiments, the genetically-modified non-human animal described herein comprises a sequence encoding a humanized NKG2A protein. In some embodiments, the humanized NKG2A protein comprises an endogenous cytoplasmic region. In some embodiments, the humanized NKG2A protein comprises an endogenous transmembrane region. In some embodiments, the humanized NKG2A protein comprises a humanized extracellular region. In some embodiments, the humanized NKG2A protein comprises a humanized C-type lectin domain.
In some embodiments, the genetically-modified non-human animal described herein comprises a humanized NKG2A gene. In some embodiments, the humanized NKG2A gene comprises 7 exons. In some embodiments, the humanized NKG2A gene comprises humanized exon 1, humanized exon 2, humanized exon 3, humanized exon 4, humanized exon 5, humanized exon 6, and/or humanized exon 7.
In some embodiments, the present disclosure also provides a chimeric (e.g., humanized) NKG2A nucleotide sequence and/or amino acid sequences, wherein in some embodiments, at least 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% of the sequence are identical to or derived from mouse NKG2A mRNA sequence (e.g., NM_001136068.2), mouse NKG2A amino acid sequence (e.g., NP_001129540.1; SEQ ID NO: 28), or a portion thereof (e.g., exon 1, a portion of exon 2, a portion of exon 6, and exon 7 of mouse NKG2A gene); and in some embodiments, at least 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% of the sequence are identical to or derived from human NKG2A mRNA sequence (e.g., NM_213658.2), human NKG2A amino acid sequence (e.g., NP_998823.1; SEQ ID NO: 29), or a portion thereof (e.g., a portion of exon 4, exons 5-7, and a portion of exon 8 of human NKG2A gene).
In some embodiments, the sequence encoding amino acids 96-244 of mouse NKG2A (SEQ ID NO: 28) is replaced. In some embodiments, the sequence is replaced by a sequence encoding a corresponding region of human NKG2A (e.g., amino acids 94-233 of human NKG2A (SEQ ID NO: 29)).
In some embodiments, the sequence encoding amino acids 95-244 of mouse NKG2A (SEQ ID NO: 28) is replaced. In some embodiments, the sequence is replaced by a sequence encoding a corresponding region of human NKG2A (e.g., amino acids 94-233 of human NKG2A (SEQ ID NO: 29)).
In some embodiments, the sequence encoding amino acids 135-239 of mouse NKG2A (SEQ ID NO: 28) is replaced. In some embodiments, the sequence is replaced by a sequence encoding a corresponding region of human NKG2A (e.g., amino acids 118-231 of human NKG2A (SEQ ID NO: 29)).
In some embodiments, the sequence encoding the entirety or a portion of the extracellular region of mouse NKG2A (SEQ ID NO: 28) is replaced or inactivated. In some embodiments, the sequence is replaced by a sequence encoding the entirety or a portion of the corresponding region of human NKG2A (SEQ ID NO: 29). In some embodiments, the corresponding region of human NKG2A comprises the entirety or a portion of the extracellular region of human NKG2A. In some embodiments, the sequence encoding amino acids 96-244 of mouse NKG2A (SEQ ID NO: 28) is replaced. In some embodiments, the sequence is replaced by a sequence encoding a corresponding region of human NKG2A (e.g., amino acids 94-233 of human NKG2A (SEQ ID NO: 29)). In some embodiments, the sequence encoding amino acids 95-244 of mouse NKG2A (SEQ ID NO: 28) is replaced. In some embodiments, the sequence is replaced by a sequence encoding a corresponding region of human NKG2A (e.g., amino acids 94-233 of human NKG2A (SEQ ID NO: 29)). In some embodiments, the sequence encoding the corresponding region of human NKG2A does not include at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or at least 10 amino acids at the N-terminus and/or C-terminus of the extracellular region of human NKG2A. In some embodiments, the sequence encoding the corresponding region of human NKG2A comprises a nucleotide sequence at the 3′ end of human NKG2A exon 4 encoding at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, or at least 10 amino acids. In some embodiments, the sequence encoding the corresponding region of human NKG2A comprises a nucleotide sequence at the 3′ end of human NKG2A exon 4 encoding the last amino acid residue.
In some embodiments, the human portion of the humanized NKG2A protein may also include amino acid residues outside of 94-233 of SEQ ID NO: 29, which are identical or similar to the corresponding residues of endogenous NKG2A protein of the non-human animal. Such residues can be derived from the extracellular region, transmembrane region, and/or cytoplasmic region. An alignment of the amino acid resides between human and mouse NKG2A is shown in
In some embodiments, the sequence encoding the extracellular C-type lectin domain of mouse NKG2A (SEQ ID NO: 28) is replaced or inactivated. In some embodiments, the sequence is replaced by a sequence encoding the extracellular C-type lectin domain of human NKG2A (SEQ ID NO: 29). In some embodiments, the sequence encoding amino acids 135-239 of mouse NKG2A (SEQ ID NO: 28) is replaced. In some embodiments, the sequence is replaced by a sequence encoding amino acids 118-231 of human NKG2A (SEQ ID NO: 29).
In some embodiments, the nucleic acids as described herein are operably linked to a promotor or regulatory element, e.g., an endogenous mouse NKG2A promotor, an inducible promoter, an enhancer, and/or mouse or human regulatory elements.
In some embodiments, the nucleic acid sequence has at least a portion (e.g., at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100 nucleotides, e.g., contiguous or non-contiguous nucleotides) that are different from a portion of or the entire mouse NKG2A nucleotide sequence (e.g., a portion of exon 2, exons 3-5, and a portion of exon 6 of NM_001136068.2).
In some embodiments, the nucleic acid sequence has at least a portion (e.g., at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100 nucleotides, e.g., contiguous or non-contiguous nucleotides) that is the same as a portion of or the entire mouse NKG2A nucleotide sequence (e.g., exon 1, a portion of exon 2, a portion of exon 6, and exon 7 of NM_001136068.2).
In some embodiments, the nucleic acid sequence has at least a portion (e.g., at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100 nucleotides, e.g., contiguous or non-contiguous nucleotides) that is different from a portion of or the entire human NKG2A nucleotide sequence (e.g., exons 1-3, a portion of exon 4, and a portion of exon 8 of NM_213658.2).
In some embodiments, the nucleic acid sequence has at least a portion (e.g., at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100 nucleotides, e.g., contiguous or non-contiguous nucleotides) that is the same as a portion of or the entire human NKG2A nucleotide sequence (e.g., a portion of exon 4, exons 5-7, and a portion of exon 8 of NM_213658.2).
In some embodiments, the amino acid sequence has at least a portion (e.g., at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100 amino acid residues, e.g., contiguous or non-contiguous amino acid residues) that is different from a portion of or the entire mouse NKG2A amino acid sequence (e.g., an amino acid sequence encoded by a portion of exon 2, exons 3-5, and a portion of exon 6 of NM_001136068.2; or amino acids 96-244 of NP_001129540.1 (SEQ ID NO: 28)).
In some embodiments, the amino acid sequence has at least a portion (e.g., at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100 amino acid residues, e.g., contiguous or non-contiguous amino acid residues) that is the same as a portion of or the entire mouse NKG2A amino acid sequence (e.g., an amino acid sequence encoded by exon 1, a portion of exon 2, a portion of exon 6, and exon 7 of NM_001136068.2; or amino acids 1-95 of NP_001129540.1 (SEQ ID NO: 28)).
In some embodiments, the amino acid sequence has at least a portion (e.g., at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100 amino acid residues, e.g., contiguous or non-contiguous amino acid residues) that is different from a portion of or the entire human NKG2A amino acid sequence (e.g., an amino acid sequence encoded by exons 1-3, a portion of exon 4, and a portion of exon 8 of NM_213658.2; or amino acids 1-93 of NP_998823.1 (SEQ ID NO: 29)).
In some embodiments, the amino acid sequence has at least a portion (e.g., at least 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 20, 30, 40, 50, 60, 70, 80, 90, or 100 amino acid residues, e.g., contiguous or non-contiguous amino acid residues) that is the same as a portion of or the entire human NKG2A amino acid sequence (e.g., an amino acid sequence encoded by a portion of exon 4, exons 5-7, and a portion of exon 8 of NM_213658.2; or amino acids 94-233 of NP_998823.1 (SEQ ID NO: 29)).
The present disclosure also provides a human or humanized CD94 amino acid sequence, or a human or humanized NKG2A amino acid sequence, wherein the amino acid sequence is selected from the group consisting of:
The present disclosure also relates to a CD94 nucleic acid (e.g., DNA or RNA) sequence, or a NKG2A nucleic acid (e.g., DNA or RNA) sequence, wherein the nucleic acid sequence can be selected from the group consisting of:
The present disclosure also relates to a CD94 protein sequence, wherein the amino acid sequence of the CD94 protein can be selected from the group consisting of:
The present disclosure also relates to a NKG2A protein sequence, wherein the amino acid sequence of the NKG2A protein can be selected from the group consisting of:
The present disclosure also relates to a humanized CD94 gene or a humanized CD94 mRNA sequence, wherein the humanized CD94 mRNA sequence or the mRNA sequence transcribed from the humanized CD94 gene can be selected from the group consisting of:
The present disclosure also relates to a humanized CD94 gene, wherein the humanized CD94 gene comprises a sequence that can be selected from the group consisting of:
The present disclosure also relates to a humanized NKG2A gene or a humanized NKG2A mRNA sequence, wherein the humanized NKG2A mRNA sequence or the mRNA sequence transcribed from the humanized NKG2A gene can be selected from the group consisting of:
The present disclosure also relates to a humanized NKG2A gene, wherein the humanized CD94 gene comprises a sequence that can be selected from the group consisting of:
The present disclosure further relates to a humanized CD94 gene comprising a sequence that is at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% identical to SEQ ID NO: 6.
The present disclosure further relates to a humanized NKG2A gene comprising a sequence that is at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% identical to SEQ ID NO: 33.
The present disclosure further relates to a CD94 genomic DNA sequence of a humanized mouse. The DNA sequence (e.g., cDNA) is obtained by a reverse transcription of the mRNA obtained by transcription thereof and is consistent with or complementary to the DNA sequence homologous to the sequence shown in SEQ ID NO: 7.
The present disclosure further relates to a NKG2A genomic DNA sequence of a humanized mouse. The DNA sequence (e.g., cDNA) is obtained by a reverse transcription of the mRNA obtained by transcription thereof and is consistent with or complementary to the DNA sequence homologous to the sequence shown in SEQ ID NO: 34.
The disclosure also provides an amino acid sequence that has a homology of at least 90% with, or at least 90% identical to the sequence shown in SEQ ID NO: 1, 2, 8, 28, 29, or 35, and has protein activity. In some embodiments, the homology with the sequence shown in SEQ ID NO: 1, 2, 8, 28, 29, or 35 is at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99%. In some embodiments, the foregoing homology is at least about 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 80%, or 85%.
In some embodiments, the percentage identity with the sequence shown in 1, 2, 8, 28, 29, or 35 is at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99%. In some embodiments, the foregoing percentage identity is at least about 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 80%, or 85%.
The disclosure also provides a nucleotide sequence that has a homology of at least 90%, or at least 90% identical to the sequence shown in SEQ ID NO: 5, 7, 32, or 34, and encodes a polypeptide that has protein activity. In some embodiments, the homology with the sequence shown in SEQ ID NO: 5, 7, 32, or 34 is at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99%. In some embodiments, the foregoing homology is at least about 50%, 55%, 60%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 80%, or 85%.
In some embodiments, the percentage identity with the sequence shown in SEQ ID NO: 5, 7, 32, or 34 is at least about 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or at least 99%. In some embodiments, the foregoing percentage identity is at least about 50%, 55%, 60%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 80%, or 85%.
The disclosure also provides a nucleic acid sequence that is at least 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% identical to any nucleotide sequence as described herein, and an amino acid sequence that is at least 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 15%, 20%, 25%, 30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% identical to any amino acid sequence as described herein. In some embodiments, the disclosure relates to nucleotide sequences encoding any peptides that are described herein, or any amino acid sequences that are encoded by any nucleotide sequences as described herein. In some embodiments, the nucleic acid sequence is less than 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 150, 200, 250, 300, 350, 400, 500, or 600 nucleotides. In some embodiments, the amino acid sequence is less than 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, 140, 150, 160, 170, 180, 190, or 200 amino acid residues.
In some embodiments, the amino acid sequence (i) comprises an amino acid sequence; or (ii) consists of an amino acid sequence, wherein the amino acid sequence is any one of the sequences as described herein.
In some embodiments, the nucleic acid sequence (i) comprises a nucleic acid sequence; or (ii) consists of a nucleic acid sequence, wherein the nucleic acid sequence is any one of the sequences as described herein.
To determine the percent identity of two amino acid sequences, or of two nucleic acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid or nucleic acid sequence for optimal alignment and non-homologous sequences can be disregarded for comparison purposes). The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position. The percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, which need to be introduced for optimal alignment of the two sequences. For illustration purposes, the comparison of sequences and determination of percent identity between two sequences can be accomplished using a Blossum 62 scoring matrix with a gap penalty of 12, a gap extend penalty of 4, and a frameshift gap penalty of 5.
The percentage of residues conserved with similar physicochemical properties (percent homology), e.g. leucine and isoleucine, can also be used to measure sequence similarity. Families of amino acid residues having similar physicochemical properties have been defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). The homology percentage, in many cases, is higher than the identity percentage.
Cells, tissues, and animals (e.g., mouse) are also provided that comprise the nucleotide sequences as described herein, as well as cells, tissues, and animals (e.g., mouse) that express human or chimeric (e.g., humanized) CD94 and/or NKG2A from an endogenous non-human CD94 or NKG2A locus.
As used herein, the term “genetically-modified non-human animal” refers to a non-human animal having exogenous DNA in at least one chromosome of the animal’s genome. In some embodiments, at least one or more cells, e.g., at least 1%, 2%, 3%, 4%, 5%, 10%, 20%, 30%, 40%, 50% of cells of the genetically-modified non-human animal have the exogenous DNA in its genome. The cell having exogenous DNA can be various kinds of cells, e.g., an endogenous cell, a somatic cell, an immune cell, a T cell, a B cell, an antigen presenting cell, a macrophage, a dendritic cell, a germ cell, a blastocyst, or an endogenous tumor cell. In some embodiments, genetically-modified non-human animals are provided that comprise a modified endogenous CD94 and/or NKG2A locus that comprises an exogenous sequence (e.g., a human sequence), e.g., a replacement of one or more non-human sequences with one or more human sequences. The animals are generally able to pass the modification to progeny, i.e., through germline transmission.
As used herein, the term “chimeric gene” or “chimeric nucleic acid” refers to a gene or a nucleic acid, wherein two or more portions of the gene or the nucleic acid are from different species, or at least one of the sequences of the gene or the nucleic acid does not correspond to the wildtype nucleic acid in the animal. In some embodiments, the chimeric gene or chimeric nucleic acid has at least one portion of the sequence that is derived from two or more different sources, e.g., sequences encoding different proteins or sequences encoding the same (or homologous) protein of two or more different species. In some embodiments, the chimeric gene or the chimeric nucleic acid is a humanized gene or humanized nucleic acid.
As used herein, the term “chimeric protein” or “chimeric polypeptide” refers to a protein or a polypeptide, wherein two or more portions of the protein or the polypeptide are from different species, or at least one of the sequences of the protein or the polypeptide does not correspond to wild-type amino acid sequence in the animal. In some embodiments, the chimeric protein or the chimeric polypeptide has at least one portion of the sequence that is derived from two or more different sources, e.g., same (or homologous) proteins of different species. In some embodiments, the chimeric protein or the chimeric polypeptide is a humanized protein or a humanized polypeptide.
As used herein, the term “humanized protein” or “humanized polypeptide” refers to a protein or a polypeptide, wherein at least a portion of the protein or the polypeptide is from the human protein or human polypeptide. In some embodiments, the humanized protein or polypeptide is a human protein or polypeptide.
As used herein, the term “humanized nucleic acid” refers to a nucleic acid, wherein at least a portion of the nucleic acid is from the human. In some embodiments, the entire nucleic acid of the humanized nucleic acid is from human. In some embodiments, the humanized nucleic acid is a humanized exon. A humanized exon can be e.g., a human exon or a chimeric exon.
In some embodiments, the chimeric gene or the chimeric nucleic acid is a humanized CD94 gene or a humanized CD94 nucleic acid. In some embodiments, at least one or more portions of the gene or the nucleic acid is from the human CD94 gene, at least one or more portions of the gene or the nucleic acid is from a non-human CD94 gene. In some embodiments, the gene or the nucleic acid comprises a sequence that encodes a CD94 protein. The encoded CD94 protein is functional or has at least one activity of the human CD94 protein or the non-human CD94 protein, e.g., binding with human or non-human NKG2 molecules (e.g., NKG2A or NKG2C); pairing with NKG2 molecules to interact with human or non-human MHC class I molecules (e.g., HLA-E in humans or Qa-1 in mice); activating or inhibiting NK cell and/or T cell activities; upregulating or downregulating the immune response.
In some embodiments, the chimeric gene or the chimeric nucleic acid is a humanized NKG2A gene or a humanized NKG2A nucleic acid. In some embodiments, at least one or more portions of the gene or the nucleic acid is from the human NKG2A gene, at least one or more portions of the gene or the nucleic acid is from a non-human NKG2A gene. In some embodiments, the gene or the nucleic acid comprises a sequence that encodes a NKG2A protein. The encoded NKG2A protein is functional or has at least one activity of the human NKG2A protein or the non-human NKG2A protein, e.g., binding with human or non-human CD94; pairing with CD94 to interact with human or non-human MHC class I molecules (e.g., HLA-E in humans or Qa-1 in mice); inhibiting NK cell or T cell activities; upregulating or downregulating the immune response.
In some embodiments, the MHC class I molecule described herein is a non-classic MHC class I molecule. In some embodiments, the MHC class I molecule described herein is a classic MHC class I molecule.
In some embodiments, the chimeric protein or the chimeric polypeptide is a humanized CD94 protein or a humanized CD94 polypeptide. In some embodiments, at least one or more portions of the amino acid sequence of the protein or the polypeptide is from a human CD94 protein, and at least one or more portions of the amino acid sequence of the protein or the polypeptide is from a non-human CD94 protein. The humanized CD94 protein or the humanized CD94 polypeptide is functional or has at least one activity of the human CD94 protein or the non-human CD94 protein.
In some embodiments, the chimeric protein or the chimeric polypeptide is a humanized NKG2A protein or a humanized NKG2A polypeptide. In some embodiments, at least one or more portions of the amino acid sequence of the protein or the polypeptide is from a human NKG2A protein, and at least one or more portions of the amino acid sequence of the protein or the polypeptide is from a non-human NKG2A protein. The humanized NKG2A protein or the humanized NKG2A polypeptide is functional or has at least one activity of the human NKG2A protein or the non-human NKG2A protein.
The genetically modified non-human animal can be various animals, e.g., a mouse, rat, rabbit, pig, bovine (e.g., cow, bull, buffalo), deer, sheep, goat, chicken, cat, dog, ferret, primate (e.g., marmoset, rhesus monkey). For the non-human animals where suitable genetically modifiable embryonic stem (ES) cells are not readily available, other methods are employed to make a non-human animal comprising the genetic modification. Such methods include, e.g., modifying a non-ES cell genome (e.g., a fibroblast or an induced pluripotent cell) and employing nuclear transfer to transfer the modified genome to a suitable cell, e.g., an oocyte, and gestating the modified cell (e.g., the modified oocyte) in a non-human animal under suitable conditions to form an embryo. These methods are known in the art, and are described, e.g., in A. Nagy, et al., “Manipulating the Mouse Embryo: A Laboratory Manual (Third Edition),” Cold Spring Harbor Laboratory Press, 2003, which is incorporated by reference herein in its entirety.
In one aspect, the animal is a mammal, e.g., of the superfamily Dipodoidea or Muroidea. In some embodiments, the genetically modified animal is a rodent. The rodent can be selected from a mouse, a rat, and a hamster. In some embodiments, the genetically modified animal is from a family selected from Calomyscidae (e.g., mouse-like hamsters), Cricetidae (e.g., hamster, New World rats and mice, voles), Muridae (true mice and rats, gerbils, spiny mice, crested rats), Nesomyidae (climbing mice, rock mice, with-tailed rats, Malagasy rats and mice), Platacanthomyidae (e.g., spiny dormice), and Spalacidae (e.g., mole rates, bamboo rats, and zokors). In some embodiments, the genetically modified rodent is selected from a true mouse or rat (family Muridae), a gerbil, a spiny mouse, and a crested rat. In some embodiments, the non-human animal is a mouse.
In some embodiments, the animal is a mouse of a C57BL strain selected from C57BL/A, C57BL/An, C57BL/GrFa, C57BL/KaLwN, C57BL/6, C57BL/6J, C57BL/6ByJ, C57BL/6NJ, C57BL/10, C57BL/10ScSn, C57BL/10Cr, and C57BL/Ola. In some embodiments, the mouse is a 129 strain selected from the group consisting of a strain that is 129P1, 129P2, 129P3, 129X1, 129S1 (e.g., 129S1/SV, 129S1/SvIm), 129S2, 129S4, 129S5, 129S9/SvEvH, 129S6 (129/SvEvTac), 129S7, 129S8, 129T1, 129T2. These mice are described, e.g., in Festing et al., Revised nomenclature for strain 129 mice, Mammalian Genome 10: 836 (1999); Auerbach et al., Establishment and Chimera Analysis of 129/SvEv- and C57BL/6-Derived Mouse Embryonic Stem Cell Lines (2000), both of which are incorporated herein by reference in the entirety. In some embodiments, the genetically modified mouse is a mix of the 129 strain and the C57BL/6 strain. In some embodiments, the mouse is a mix of the 129 strains, or a mix of the BL/6 strains. In some embodiments, the mouse is a BALB strain, e.g., BALB/c strain. In some embodiments, the mouse is a mix of a BALB strain and another strain. In some embodiments, the mouse is from a hybrid line (e.g., 50% BALB/c-50% 12954/Sv; or 50% C57BL/6-50% 129).
In some embodiments, the animal is a rodent. In some embodiments, the rodent is selected from BALB/c, A, A/He, A/J, A/WySN, AKR, AKR/A, AKR/J, AKR/N, TA1, TA2, RF, SWR, C3H, C57BR, SJL, C57L, DBA/ 2. KM, NIH, ICR, CFW, FACA, C57BL/A, C57BL/An, C57BL/GrFa, C57BL/KaLwN, C57BL/6, C57BL/6J, C57BL/6ByJ, C57BL/6NJ, C57BL/10, C57BL/ 10ScSn, C57BL/10Cr and C57BL/Ola C57BL, C58, CBA/Br, CBA/Ca, CBA/J, CBA/st, CBA/H strains of mice and NOD, NOD/SCID, NOD-Prkdcscid IL-2rgnull Background mice.
In some embodiments, the animal is a rat. The rat can be selected from a Wistar rat, an LEA strain, a Sprague Dawley strain, a Fischer strain, F344, F6, and Dark Agouti. In some embodiments, the rat strain is a mix of two or more strains selected from the group consisting of Wistar, LEA, Sprague Dawley, Fischer, F344, F6, and Dark Agouti.
The animal can have one or more other genetic modifications, and/or other modifications, that are suitable for the particular purpose for which the humanized CD94 and/or NKG2A animal is made. For example, suitable mice for maintaining a xenograft (e.g., a human cancer or tumor), can have one or more modifications that compromise, inactivate, or destroy the immune system of the non-human animal in whole or in part. Compromise, inactivation, or destruction of the immune system of the non-human animal can include, for example, destruction of hematopoietic cells and/or immune cells by chemical means (e.g., administering a toxin), physical means (e.g., irradiating the animal), and/or genetic modification (e.g., knocking out one or more genes). Nonlimiting examples of such mice include, e.g., NOD-Prkdcscid fL-2rynull NOD mice, NOD-Rag 1-/--IL2rg-/- (NRG) mice, Rag 2-/--IL2rg-/- (RG) mice, SCID mice, NOD/SCID mice, IL2Rγ knockout mice, NOD/SCID/γcnull mice (Ito, M. et al., NOD/SCID/ γcnull mouse: an excellent recipient mouse model for engraftment of human cells, Blood 100(9): 3175-3182, 2002), nude mice, and Rag1 and/or Rag2 knockout mice. These mice can optionally be irradiated, or otherwise treated to destroy one or more immune cell type. Thus, in various embodiments, a genetically modified mouse is provided that can include a humanization of at least a portion of an endogenous non-human CD94 and/or NKG2A locus, and further comprises a modification that compromises, inactivates, or destroys the immune system (or one or more cell types of the immune system) of the non-human animal in whole or in part. In some embodiments, modification is, e.g., selected from the group consisting of a modification that results in NOD-Prkdcscid IL-2rγnull NOD mice, NOD-Rag 1-/--IL2rg-/- (NRG) mice, Rag 2-/--IL2rg-/- (RG) mice, NOD mice, SCID mice, NOD/SCID mice, IL-2Ry knockout mice, NOD/SCID/yc null mice, nude mice, Rag1 and/or Rag2 knockout mice, and a combination thereof. These genetically modified animals are described, e.g., in US20150106961, which is incorporated herein by reference in its entirety.
In some embodiments, the mouse can include a replacement of all or part of mature CD94 coding sequence with human mature CD94 coding sequence. In some embodiments, the mouse can include a replacement of all or part of mature NKG2A coding sequence with human mature NKG2A coding sequence.
In some embodiments, the genetically modified non-human animals comprises a modification of an endogenous non-human CD94 locus. In some embodiments, the modification can comprise a human nucleic acid sequence encoding at least a portion of a mature CD94 protein (e.g., at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, or 99% identical to the mature CD94 protein sequence). Although genetically modified cells are also provided that can comprise the modifications described herein (e.g., ES cells, somatic cells), in many embodiments, the genetically modified non-human animals comprise the modification of the endogenous CD94 locus in the germline of the animal.
In some embodiments, the genetically modified mice express a human CD94 and/or a chimeric (e.g., humanized) CD94 from endogenous mouse loci, wherein the endogenous mouse CD94 gene has been replaced with a human CD94 gene and/or a nucleotide sequence that encodes a region of human CD94 sequence or an amino acid sequence that is at least 10%, 20%, 30%, 40%, 50%, 60%, 70&, 80%, 90%, 95%, 96%, 97%, 98%, or 99% identical to the human CD94 sequence. In various embodiments, an endogenous non-human CD94 locus is modified in whole or in part to comprise human nucleic acid sequence encoding at least one protein-coding sequence of a mature CD94 protein.
In some embodiments, the genetically modified mice express the human CD94 and/or chimeric CD94 (e.g., humanized CD94) from endogenous loci that are under control of mouse promoters and/or mouse regulatory elements. The replacement(s) at the endogenous mouse loci provide non-human animals that express human CD94 or chimeric CD94 (e.g., humanized CD94) in appropriate cell types and in a manner that does not result in the potential pathologies observed in some other transgenic mice known in the art. The human CD94 or the chimeric CD94 (e.g., humanized CD94) expressed in animal can maintain one or more functions of the wild-type mouse or human CD94 in the animal. For example, the expressed CD94 can interact with a human or non-human NKG2A, forming a heterodimer that can recognize MHC class I molecules. Furthermore, in some embodiments, the animal does not express endogenous CD94. As used herein, the term “endogenous CD94” refers to CD94 protein that is expressed from an endogenous CD94 nucleotide sequence of the non-human animal (e.g., mouse) before any genetic modification.
The genome of the animal can comprise a sequence encoding an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 99%, or 100% identical to human CD94 (NP_001337991.1) (SEQ ID NO: 2). In some embodiments, the genome comprises a sequence encoding an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 8.
The genome of the genetically modified animal can comprise a replacement at an endogenous CD94 gene locus of a sequence encoding a region of endogenous CD94 with a sequence encoding a corresponding region of human CD94. In some embodiments, the sequence that is replaced is any sequence within the endogenous CD94 gene locus, e.g., exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, 5′-UTR, 3′-UTR, the first intron, the second intron, and the third intron, the fourth intron, the fifth intron, etc. In some embodiments, the sequence that is replaced is within the regulatory region of the endogenous CD94 gene. In some embodiments, the sequence that is replaced starts within exon 3 and ends within exon 6 of an endogenous mouse CD94 gene locus.
The genetically modified animal can have one or more cells expressing a human or chimeric CD94 (e.g., humanized CD94) having a cytoplasmic region, a transmembrane region, and/or an extracellular region, wherein the extracellular region comprises a sequence that is at least 50%, 60%, 70%, 80%, 90%, 95%, 99% identical to the extracellular region of human CD94. In some embodiments, the extracellular region of the humanized CD94 has a sequence that has at least 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, or 140 amino acids (e.g., contiguously or non-contiguously) that are identical to human CD94.
Because human CD94 and non-human CD94 (e.g., mouse CD94) sequences, in many cases, are different, antibodies that bind to human CD94 will not necessarily have the same binding affinity with non-human CD94 or have the same effects to non-human CD94. Therefore, the genetically modified animal having a human or a humanized extracellular region can be used to better evaluate the effects of anti-human CD94 antibodies in an animal model. In some embodiments, the genome of the genetically modified animal comprises a sequence encoding an amino acid sequence that corresponds to part or the entire sequence of exons 4-7 of human CD94, part or the entire sequence of extracellular region of human CD94, or part or the entire sequence of amino acids 37-179, 34-179, 33-179, 32-179, or 68-175 of SEQ ID NO: 2.
In some embodiments, the non-human animal can have, at an endogenous CD94 gene locus, a nucleotide sequence encoding a chimeric human/non-human CD94 polypeptide, wherein a human portion of the chimeric human/non-human CD94 polypeptide comprises the entirety or a portion of human CD94 extracellular domain, and wherein the animal expresses a functional CD94 on a surface of a cell (e.g., NK cell or CD8+ T-lymphocyte) of the animal.
In some embodiments, the non-human portion of the chimeric human/non-human CD94 polypeptide comprises cytoplasmic and/or transmembrane regions of an endogenous non-human CD94 polypeptide. There may be several advantages that are associated with the cytoplasmic and/or transmembrane regions of an endogenous non-human CD94 polypeptide. For example, once MHC class I molecules (e.g., human HLA-E) binds to a NKG2A/CD94 receptor, or an anti-CD94 antibody binds to CD94, they can properly transmit extracellular signals into the cells and initiate the downstream pathway. A human or humanized transmembrane and/or cytoplasmic regions may not function properly in non-human animal cells. In some embodiments, a few (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10) extracellular amino acids that are close to the transmembrane region of CD94 are also derived from endogenous sequence.
In some embodiments, the genetically modified non-human animals comprises a modification of an endogenous non-human NKG2A locus. In some embodiments, the modification can comprise a human nucleic acid sequence encoding at least a portion of a mature NKG2A protein (e.g., at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, or 99% identical to the mature NKG2A protein sequence). Although genetically modified cells are also provided that can comprise the modifications described herein (e.g., ES cells, somatic cells), in many embodiments, the genetically modified non-human animals comprise the modification of the endogenous NKG2A locus in the germline of the animal.
In some embodiments, the genetically modified mice express a human NKG2A and/or a chimeric (e.g., humanized) NKG2A from endogenous mouse loci, wherein the endogenous mouse NKG2A gene has been replaced with a human NKG2A gene and/or a nucleotide sequence that encodes a region of human NKG2A sequence or an amino acid sequence that is at least 10%, 20%, 30%, 40%, 50%, 60%, 70&, 80%, 90%, 95%, 96%, 97%, 98%, or 99% identical to the human NKG2A sequence. In various embodiments, an endogenous non-human NKG2A locus is modified in whole or in part to comprise human nucleic acid sequence encoding at least one protein-coding sequence of a mature NKG2A protein.
In some embodiments, the genetically modified mice express the human NKG2A and/or chimeric NKG2A (e.g., humanized NKG2A) from endogenous loci that are under control of mouse promoters and/or mouse regulatory elements. The replacement(s) at the endogenous mouse loci provide non-human animals that express human NKG2A or chimeric NKG2A (e.g., humanized NKG2A) in appropriate cell types and in a manner that does not result in the potential pathologies observed in some other transgenic mice known in the art. The human NKG2A or the chimeric NKG2A (e.g., humanized NKG2A) expressed in animal can maintain one or more functions of the wild-type mouse or human NKG2A in the animal. For example, the expressed NKG2A can interact with a human or non-human CD94, forming a heterodimer that can recognize MHC class I molecules. Furthermore, in some embodiments, the animal does not express endogenous NKG2A. As used herein, the term “endogenous NKG2A” refers to NKG2A protein that is expressed from an endogenous NKG2A nucleotide sequence of the non-human animal (e.g., mouse) before any genetic modification.
The genome of the animal can comprise a sequence encoding an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 99%, or 100% identical to human NKG2A (NP_998823.1) (SEQ ID NO: 29). In some embodiments, the genome comprises a sequence encoding an amino acid sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 35.
The genome of the genetically modified animal can comprise a replacement at an endogenous NKG2A gene locus of a sequence encoding a region of endogenous NKG2A with a sequence encoding a corresponding region of human NKG2A. In some embodiments, the sequence that is replaced is any sequence within the endogenous NKG2A gene locus, e.g., exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, exon 7, 5′-UTR, 3′-UTR, the first intron, the second intron, and the third intron, the fourth intron, the fifth intron, the sixth intron, etc. In some embodiments, the sequence that is replaced is within the regulatory region of the endogenous NKG2A gene. In some embodiments, the sequence that is replaced starts within exon 2 and ends within exon 6 of an endogenous mouse NKG2A gene locus.
The genetically modified animal can have one or more cells expressing a human or chimeric NKG2A (e.g., humanized NKG2A) having a cytoplasmic region, a transmembrane region, and/or an extracellular region, wherein the extracellular region comprises a sequence that is at least 50%, 60%, 70%, 80%, 90%, 95%, 99% identical to the extracellular region of human NKG2A. In some embodiments, the extracellular region of the humanized NKG2A has a sequence that has at least 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120, 130, or 140 amino acids (e.g., contiguously or non-contiguously) that are identical to human NKG2A.
Because human NKG2A and non-human NKG2A (e.g., mouse NKG2A) sequences, in many cases, are different, antibodies that bind to human NKG2A will not necessarily have the same binding affinity with non-human NKG2A or have the same effects to non-human NKG2A. Therefore, the genetically modified animal having a human or a humanized extracellular region can be used to better evaluate the effects of anti-human NKG2A antibodies in an animal model. In some embodiments, the genome of the genetically modified animal comprises a sequence encoding an amino acid sequence that corresponds to part or the entire sequence of exons 4-8 of human NKG2A, part or the entire sequence of extracellular region of human NKG2A, or part or the entire sequence of amino acids 94-233 or 118-231 of SEQ ID NO: 29.
In some embodiments, the non-human animal can have, at an endogenous NKG2A gene locus, a nucleotide sequence encoding a chimeric human/non-human NKG2A polypeptide, wherein a human portion of the chimeric human/non-human NKG2A polypeptide comprises a portion of human NKG2A extracellular domain, and wherein the animal expresses a functional NKG2A on a surface of a cell (e.g., NK cell or CD8+ T-lymphocyte) of the animal.
In some embodiments, the non-human portion of the chimeric human/non-human NKG2A polypeptide comprises cytoplasmic and/or regions of an endogenous non-human NKG2A polypeptide. There may be several advantages that are associated with the cytoplasmic and/or transmembrane regions of an endogenous non-human NKG2A polypeptide. For example, once MHC class I molecules (e.g., human HLA-E) binds to a NKG2A/CD94 receptor, or an anti-NKG2A antibody binds to NKG2A, they can properly transmit extracellular signals into the cells and initiate the downstream pathway. A human or humanized transmembrane and/or cytoplasmic regions may not function properly in non-human animal cells. In some embodiments, a few (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10) extracellular amino acids that are close to the transmembrane region of NKG2A are also derived from endogenous sequence.
In some embodiments, the genetically modified animal does not express endogenous CD94. In some embodiments, the genetically modified animal expresses a decreased level of endogenous CD94 as compared to a wild-type animal. In some embodiments, the genetically modified animal does not express endogenous NKG2A. In some embodiments, the genetically modified animal expresses a decreased level of endogenous NKG2A as compared to a wild-type animal.
Furthermore, the genetically modified animal can be heterozygous with respect to the replacement at the endogenous CD94 locus, or homozygous with respect to the replacement at the endogenous CD94 locus. Furthermore, the genetically modified animal can be heterozygous with respect to the replacement at the endogenous NKG2A locus, or homozygous with respect to the replacement at the endogenous NKG2A locus.
In some embodiments, the humanized CD94 locus lacks a human CD94 5′-UTR. In some embodiment, the humanized CD94 locus comprises a rodent (e.g., mouse) 5′-UTR. In some embodiments, the humanization comprises a human 3′- UTR. In appropriate cases, it may be reasonable to presume that the mouse and human CD94 genes appear to be similarly regulated based on the similarity of their 5′-flanking sequence. As shown in the present disclosure, humanized CD94 mice that comprise a replacement at an endogenous mouse CD94 locus, which retain mouse regulatory elements but comprise a humanization of CD94 encoding sequence, do not exhibit pathologies. Both genetically modified mice that are heterozygous or homozygous for humanized CD94 are grossly normal.
In some embodiments, the humanized NKG2A locus lacks a human NKG2A 5′-UTR. In some embodiment, the humanized NKG2A locus comprises a rodent (e.g., mouse) 5′-UTR. In some embodiments, the humanization comprises a human 3′- UTR. In appropriate cases, it may be reasonable to presume that the mouse and human NKG2A genes appear to be similarly regulated based on the similarity of their 5′-flanking sequence. As shown in the present disclosure, humanized NKG2A mice that comprise a replacement at an endogenous mouse NKG2A locus, which retain mouse regulatory elements but comprise a humanization of NKG2A encoding sequence, do not exhibit pathologies. Both genetically modified mice that are heterozygous or homozygous for humanized NKG2A are grossly normal.
The present disclosure further relates to a non-human mammal generated through the method mentioned above. In some embodiments, the genome thereof contains human gene(s).
In some embodiments, the non-human mammal is a rodent, and preferably, the non-human mammal is a mouse.
In some embodiments, the non-human mammal expresses a protein encoded by a humanized CD94 gene. In some embodiments, the non-human mammal expresses a protein encoded by a humanized NKG2A gene.
In addition, the present disclosure also relates to a tumor bearing non-human mammal model, characterized in that the non-human mammal model is obtained through the methods as described herein. In some embodiments, the non-human mammal is a rodent (e.g., a mouse).
The present disclosure further relates to a cell or cell line, or a primary cell culture thereof derived from the non-human mammal or an offspring thereof, or the tumor bearing non-human mammal; the tissue, organ or a culture thereof derived from the non-human mammal or an offspring thereof, or the tumor bearing non-human mammal; and the tumor tissue derived from the non-human mammal or an offspring thereof when it bears a tumor, or the tumor bearing non-human mammal.
The present disclosure also provides non-human mammals produced by any of the methods described herein. In some embodiments, a non-human mammal is provided; and the genetically modified animal contains the DNA encoding human or humanized CD94 in the genome of the animal. The present disclosure also provides non-human mammals produced by any of the methods described herein. In some embodiments, a non-human mammal is provided; and the genetically modified animal contains the DNA encoding human or humanized NKG2A in the genome of the animal.
In some embodiments, the non-human mammal comprises the genetic construct as described herein (e.g., gene construct as shown in
In some embodiments, the expression of human or humanized CD94 in a genetically modified animal is controllable, as by the addition of a specific inducer or repressor substance. In some embodiments, the expression of human or humanized NKG2A in a genetically modified animal is controllable, as by the addition of a specific inducer or repressor substance. In some embodiments, the specific inducer is selected from Tet-Off System/Tet-On System, or Tamoxifen System.
Non-human mammals can be any non-human animal known in the art and which can be used in the methods as described herein. Preferred non-human mammals are mammals, (e.g., rodents). In some embodiments, the non-human mammal is a mouse.
Genetic, molecular and behavioral analyses for the non-human mammals described above can performed. The present disclosure also relates to the progeny produced by the non-human mammal provided by the present disclosure mated with the same or other genotypes.
The present disclosure also provides a cell line or primary cell culture derived from the non-human mammal or a progeny thereof. A model based on cell culture can be prepared, for example, by the following methods. Cell cultures can be obtained by way of isolation from a non-human mammal, alternatively cell can be obtained from the cell culture established using the same constructs and the standard cell transfection techniques. The integration of genetic constructs containing DNA sequences encoding human CD94 and/or NKG2A protein can be detected by a variety of methods.
There are many analytical methods that can be used to detect exogenous DNA, including methods at the level of nucleic acid (including the mRNA quantification approaches using reverse transcriptase polymerase chain reaction (RT-PCR) or Southern blotting, and in situ hybridization) and methods at the protein level (including histochemistry, immunoblot analysis and in vitro binding studies). In addition, the expression level of the gene of interest can be quantified by ELISA techniques well known to those skilled in the art. Many standard analysis methods can be used to complete quantitative measurements. For example, transcription levels can be measured using RT-PCR and hybridization methods including RNase protection, Southern blot analysis, RNA dot analysis (RNAdot) analysis. Immunohistochemical staining, flow cytometry, Western blot analysis can also be used to assess the presence of human or humanized CD94 and/or NKG2A protein.
The present disclosure relates to a targeting vector, comprising: a) a DNA fragment homologous to the 5′ end of a region to be altered (5′ arm), which is selected from the CD94 gene genomic DNAs in the length of 100 to 10,000 nucleotides; b) a desired/donor DNA sequence encoding a donor region; and c) a second DNA fragment homologous to the 3′ end of the region to be altered (3′ arm), which is selected from the CD94 gene genomic DNAs in the length of 100 to 10,000 nucleotides.
In some embodiments, a) the DNA fragment homologous to the 5′ end of a conversion region to be altered (5′ arm) is selected from the nucleotide sequences that have at least 90% homology to the NCBI accession number NC_000072.6) the DNA fragment homologous to the 3′ end of the region to be altered (3′ arm) is selected from the nucleotide sequences that have at least 90% homology to the NCBI accession number NC_000072.6.
In some embodiments, a) the DNA fragment homologous to the 5′ end of a region to be altered (5′ arm) is selected from the nucleotides from the position 129590135 to the position 129593735 of the NCBI accession number NC_000072.6; c) the DNA fragment homologous to the 3′ end of the region to be altered (3′ arm) is selected from the nucleotides from the position 129599166 to the position 129603653 of the NCBI accession number NC_000072.6.
In some embodiments, a) the DNA fragment homologous to the 5′ end of a region to be altered (5′ arm) is selected from the nucleotides from the position 129682351 to the position 129678300 of the NCBI accession number NC_000072.6; c) the DNA fragment homologous to the 3′ end of the region to be altered (3′ arm) is selected from the nucleotides from the position 129674508 to the position 129669878 of the NCBI accession number NC_000072.6.
In some embodiments, the length of the selected genomic nucleotide sequence in the targeting vector can be about 1 kB, about 1.5 kb, about 2 kb, about 2.5 kb, about 3 kb, about 3.5 kb, about 4 kb, about 4.5 kb, about 5 kb, about 5.5 kb, or about 6 kb.
In some embodiments, the region to be altered is exon 1, exon 2, exon 3, exon 4, exon 5, and/or exon 6 of an endogenous CD94 gene (e.g., a sequence starting within exon 3 and ending within exon 6 of mouse CD94 gene).
In some embodiments, the region to be altered is exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, and/or exon 7 of an endogenous NKG2A gene (e.g., a sequence starting within exon 3 and ending within exon 6 of mouse NKG2A gene).
The targeting vector can further include a selected gene marker.
In some embodiments, the sequence of the 5′ arm is shown in SEQ ID NO: 3 or at least 90% identical to SEQ ID NO: 3; and the sequence of the 3′ arm is shown in SEQ ID NO: 4 or at least 90% identical to SEQ ID NO: 4. In some embodiments, the sequence of the selected genomic nucleotide sequence is shown in SEQ ID NO: 5.
In some embodiments, the sequence of the 5′ arm is shown in SEQ ID NO: 30 or at least 90% identical to SEQ ID NO: 30; and the sequence of the 3′ arm is shown in SEQ ID NO: 31 or at least 90% identical to SEQ ID NO: 31. In some embodiments, the sequence of the selected genomic nucleotide sequence is shown in SEQ ID NO: 32.
In some embodiments, the sequence is derived from human (e.g., 10309634-10314790 of NC_000012.12). For example, the target region in the targeting vector is a part or entirety of the nucleotide sequence of a human CD94, preferably a sequence starting within exon 4 and ending within exon 7 of the human CD94. In some embodiments, the nucleotide sequence of the humanized CD94 encodes the entire or the part of human CD94 protein with the NCBI accession number NP_001337991.1 (SEQ ID NO: 2).
In some embodiments, the sequence is derived from human (e.g., 10450487-10446554 of NC_000012.12). For example, the target region in the targeting vector is a part or entirety of the nucleotide sequence of a human NKG2A, preferably a sequence starting within exon 4 and ending within exon 8 of the human NKG2A. In some embodiments, the nucleotide sequence of the humanized CD94 encodes the entire or the part of human NKG2A protein with the NCBI accession number NP_998823.1 (SEQ ID NO: 29).
The disclosure also relates to a cell comprising the targeting vectors as described above.
In addition, the present disclosure further relates to a non-human mammalian cell, having any one of the foregoing targeting vectors, and one or more in vitro transcripts of the construct as described herein. In some embodiments, the cell includes Cas9 mRNA or an in vitro transcript thereof.
In some embodiments, the genes in the cell are heterozygous. In some embodiments, the genes in the cell are homozygous.
In some embodiments, the non-human mammalian cell is a mouse cell. In some embodiments, the cell is a fertilized egg cell.
The disclosure also provides vectors for constructing a humanized animal model or a knock-out model. In some embodiments, the vectors comprise sgRNA sequence, wherein the sgRNA sequence target CD94 or NKG2A gene, and the sgRNA is unique on the target sequence of the gene to be altered. In some embodiments, the sgRNA meets the sequence arrangement rule of 5′—NNN (20) -NGG3′ or 5′—CCN—N (20)-3′.
In some embodiments, the targeting site of the sgRNA in the mouse CD94 gene is located on the exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, intron 1, intron 2, intron 3, intron 4, intron 5, upstream of exon 1, or downstream of exon 6 of the mouse CD94 gene. In some embodiments, the 5′ targeting site is located on exon 1 of the mouse CD94 gene. In some embodiments, the 5′ targeting site is located on exon 3 of the mouse CD94 gene. In some embodiments, the 3′ targeting site is located on exon 6 of the mouse CD94 gene.
In some embodiments, the targeting site of the sgRNA in the mouse NKG2A gene is located on the exon 1, exon 2, exon 3, exon 4, exon 5, exon 6, exon 7, intron 1, intron 2, intron 3, intron 4, intron 5, intron 6, upstream of exon 1, or downstream of exon 7 of the mouse NKG2A gene. In some embodiments, the 5′ targeting site is located on exon 1 of the mouse NKG2A gene. In some embodiments, the 5′ targeting site is located on exon 2 of the mouse NKG2A gene. In some embodiments, the 3′ targeting site is located on exon 6 of the mouse NKG2A gene. In some embodiments, the 3′ targeting site is located on exon 7 of the mouse NKG2A gene.
In some embodiments, the 5′ targeting site sequences of the sgRNA are shown as SEQ ID NOs: 44-51, and the sgRNA recognizes the 5′ targeting site. In some embodiments, the 3′ targeting sequences for the sgRNA are shown as SEQ ID NOs: 52-58 and the sgRNA recognizes the 3′ targeting site. In some embodiments, the 5′ targeting sequence is SEQ ID NO: 44 and the 3′ targeting sequence is SEQ ID NO: 58. Thus, the disclosure provides sgRNA sequences for constructing a genetic modified animal model.
In some embodiments, the disclosure provides DNA sequences encoding the sgRNAs. In some embodiments, the disclosure relates to a plasmid construct (e.g., pT7-sgRNA) including the sgRNA sequence, and/or a cell including the construct.
Genetically modified animals can be made by several techniques that are known in the art, including, e.g., nonhomologous end-joining (NHEJ), homologous recombination (HR), zinc finger nucleases (ZFNs), transcription activator-like effector-based nucleases (TALEN), and the clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system. In some embodiments, homologous recombination is used. In some embodiments, CRISPR-Cas9 genome editing is used to generate genetically modified animals. Many of these genome editing techniques are known in the art, and is described, e.g., in Yin et al., “Delivery technologies for genome editing,” Nature Reviews Drug Discovery 16.6 (2017): 387-399, which is incorporated by reference in its entirety. Many other methods are also provided and can be used in genome editing, e.g., micro-injecting a genetically modified nucleus into an enucleated oocyte, and fusing an enucleated oocyte with another genetically modified cell.
Thus, in some embodiments, the disclosure provides replacing in at least one cell of the animal, at an endogenous CD94 gene locus, a sequence encoding a region of an endogenous CD94 with a sequence encoding a corresponding region of human or chimeric CD94. Thus, in some embodiments, the disclosure provides replacing in at least one cell of the animal, at an endogenous NKG2A gene locus, a sequence encoding a region of an endogenous NKG2A with a sequence encoding a corresponding region of human or chimeric NKG2A. In some embodiments, the replacement occurs in a germ cell, a somatic cell, a blastocyst, or a fibroblast, etc. The nucleus of a somatic cell or the fibroblast can be inserted into an enucleated oocyte.
Thus, in some embodiments, the methods for making a genetically modified, humanized animal, can include the step of replacing at an endogenous CD94 locus (or site), a sequence encoding a region of endogenous CD94 with a sequence encoding a corresponding region of human CD94. The replaced sequence can include a region (e.g., a part or the entire region) of exon 3, exon 4, exon 5, and/or exon 6 of an endogenous CD94 gene. In some embodiments, the region is located within the extracellular region of CD94. In some embodiments, the sequence encoding a corresponding region of human CD94 includes a region of exon 4, exons 5-6, and a region of exon 7 of a human CD94 gene (e.g., a sequence encoding amino acids 37-179 of SEQ ID NO: 2). In some embodiments, the replaced sequence includes a region of exon 3, exons 4-5, and a region of exon 6 of mouse CD94 gene (e.g., a sequence encoding amino acids 37-179 of SEQ ID NO: 1).
In some embodiments, the region is located within the extracellular region of CD94. In some embodiments, the sequence encoding a region of endogenous CD94 includes a portion of exon 3 and a portion of exon 4 of mouse CD94.
In some embodiments, the methods of modifying a CD94 locus of a mouse to express a chimeric human/mouse CD94 peptide can include the steps of replacing at the endogenous mouse CD94 locus a nucleotide sequence encoding a mouse CD94 with a nucleotide sequence encoding a human CD94, thereby generating a sequence encoding a chimeric human/mouse CD94.
In some embodiments, provided herein is a genetically-modified non-human animal whose genome comprises a sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 1, 2, or 8.
In some embodiments, the methods described herein include insertion or replacement at a non-human animal CD94 gene locus, using a nucleotide sequence of humanized CD94 gene. In some embodiments, the insertion or replacement site is after the endogenous regulatory element of the CD94 gene. In some embodiments, the insertion includes disrupting the coding frame of the endogenous CD94 gene of the non-human animal or disrupting the coding frame of the endogenous CD94 gene after the insertion site first, followed by inserting the nucleotide sequence. Alternatively, the insertion step can cause frameshift mutations to the endogenous CD94 gene and also insert human sequences at the same time. In some embodiments, auxiliary sequences (e.g., stop codons or sequences containing termination functions, etc.) or other sequences (e.g., flipping sequences, or knockout sequences) can be added to the nucleotide sequence to make the endogenous CD94 protein of the non-human animal (encoded by a sequence after the insertion site) not express normally.
Thus, in some embodiments, the methods for making a genetically modified, humanized animal, can include the step of replacing at an endogenous NKG2A locus (or site), a sequence encoding a region of endogenous NKG2A with a sequence encoding a corresponding region of human NKG2A. The replaced sequence can include a region (e.g., a part or the entire region) of exon 2, exon 3, exon 4, exon 5, and/or exon 6 of an endogenous NKG2A gene. In some embodiments, the region is located within the extracellular region of NKG2A. In some embodiments, the sequence encoding a corresponding region of human NKG2A includes a region of exon 4, exons 5-7, and a region of exon 8 of a human NKG2A gene (e.g., a sequence encoding amino acids 94-233 of SEQ ID NO: 29). In some embodiments, the replaced sequence includes a region of exon 2, exons 3-5, and a region of exon 6 of mouse NKG2A gene (e.g., a sequence encoding amino acids 96-244 of SEQ ID NO: 28).
In some embodiments, the methods of modifying a NKG2A locus of a mouse to express a chimeric human/mouse NKG2A peptide can include the steps of replacing at the endogenous mouse NKG2A locus a nucleotide sequence encoding a mouse NKG2A with a nucleotide sequence encoding a human NKG2A, thereby generating a sequence encoding a chimeric human/mouse NKG2A.
In some embodiments, provided herein is a genetically-modified non-human animal whose genome comprises a sequence that is at least 70%, 75%, 80%, 85%, 90%, 95%, 99%, or 100% identical to SEQ ID NO: 28, 29, or 35.
In some embodiments, the methods described herein include insertion or replacement at a non-human animal NKG2A gene locus, using a nucleotide sequence of humanized NKG2A gene. In some embodiments, the insertion or replacement site is after the endogenous regulatory element of the NKG2A gene. In some embodiments, the insertion includes disrupting the coding frame of the endogenous NKG2A gene of the non-human animal or disrupting the coding frame of the endogenous NKG2A gene after the insertion site first, followed by inserting the nucleotide sequence. Alternatively, the insertion step can cause frameshift mutations to the endogenous NKG2A gene and also insert human sequences at the same time. In some embodiments, auxiliary sequences (e.g., stop codons or sequences containing termination functions, etc.) or other sequences (e.g., flipping sequences, or knockout sequences) can be added to the nucleotide sequence to make the endogenous NKG2A protein of the non-human animal (encoded by a sequence after the insertion site) not express normally.
In some embodiments, the nucleotide sequences as described herein do not overlap with each other (e.g., the 5′ homologous arm, the “KI fragment”, and/or the 3′ homologous arm do not overlap). In some embodiments, the amino acid sequences as described herein do not overlap with each other.
The present disclosure further provides a method for establishing a CD94 and/or NKG2A gene humanized animal model, involving the following steps:
In some embodiments, the non-human mammal in the foregoing method is a mouse (e.g., a C57BL/6 mouse).
In some embodiments, the non-human mammal in step (c) is a female with pseudo pregnancy (or false pregnancy).
In some embodiments, the embryonic stem cells for the methods described above are C57BL/6 embryonic stem cells. Other embryonic stem cells that can also be used in the methods as described herein include, but are not limited to, FVB/N embryonic stem cells, BALB/c embryonic stem cells, DBA/1 embryonic stem cells and DBA/2 embryonic stem cells.
Embryonic stem cells can come from any non-human animal, e.g., any non-human animal as described herein. In some embodiments, the embryonic stem cells are derived from rodents. The genetic construct can be introduced into an embryonic stem cell by microinjection of DNA. For example, by way of culturing an embryonic stem cell after microinjection, a cultured embryonic stem cell can be transferred to a false pregnant non-human animal, which then gives birth of a non-human mammal, so as to generate the non-human mammal mentioned in the methods described above.
Replacement of non-human genes in a non-human animal with homologous or orthologous human genes or human sequences, at the endogenous non-human locus and under control of endogenous promoters and/or regulatory elements, can result in a non-human animal with qualities and characteristics that may be substantially different from a typical knockout-plus-transgene animal. In the typical knockout-plus-transgene animal, an endogenous locus is removed or damaged and a fully human transgene is inserted into the animal’s genome and presumably integrates at random into the genome. Typically, the location of the integrated transgene is unknown; expression of the human protein is measured by transcription of the human gene and/or protein assay and/or functional assay. Inclusion in the human transgene of upstream and/or downstream human sequences are apparently presumed to be sufficient to provide suitable support for expression and/or regulation of the transgene.
In some cases, the transgene with human regulatory elements expresses in a manner that is unphysiological or otherwise unsatisfactory, and can be actually detrimental to the animal. The disclosure demonstrates that a replacement with human sequence at an endogenous locus under control of endogenous regulatory elements provides a physiologically appropriate expression pattern and level that results in a useful humanized animal whose physiology with respect to the replaced gene are meaningful and appropriate in the context of the humanized animal’s physiology.
Genetically modified animals that express human or humanized CD94 and/or NKG2A protein, e.g., in a physiologically appropriate manner, provide a variety of uses that include, but are not limited to, developing therapeutics for human diseases and disorders, and assessing the toxicity and/or the efficacy of these human therapeutics in the animal models.
In various aspects, genetically modified animals are provided that express human or humanized CD94, which are useful for testing agents that can decrease or block the interaction between CD94 and NKG2 molecules (e.g., NKG2A) or the interaction between CD94 and anti-human CD94 antibodies, testing whether an agent can increase or decrease the immune response, and/or determining whether an agent is an CD94 agonist or antagonist. In various aspects, genetically modified animals are provided that express human or humanized NKG2A, which are useful for testing agents that can decrease or block the interaction between NKG2A and CD94, or the interaction between NKG2A and anti-human NKG2A antibodies, testing whether an agent can increase or decrease the immune response, and/or determining whether an agent is an NKG2A agonist or antagonist. The genetically modified animals can be, e.g., an animal model of a human disease, e.g., the disease is induced genetically (a knock-in or knockout). In various embodiments, the genetically modified non-human animals further comprise an impaired immune system, e.g., a non-human animal genetically modified to sustain or maintain a human xenograft, e.g., a human solid tumor or a blood cell tumor (e.g., a lymphocyte tumor, e.g., a B or T cell tumor).
In some embodiments, the genetically modified animals can be used for determining effectiveness of an anti-CD94 antibody for the treatment of cancer. The methods involve administering the anti-CD94 antibody (e.g., anti-human CD94 antibody) to the animal as described herein, wherein the animal has a tumor; and determining the inhibitory effects of the anti-CD94 antibody to the tumor. In some embodiments, the genetically modified animals can be used for determining effectiveness of an anti-NKG2A antibody for the treatment of cancer. The methods involve administering the anti-NKG2A antibody (e.g., anti-human NKG2A antibody) to the animal as described herein, wherein the animal has a tumor; and determining the inhibitory effects of the anti-NKG2A antibody to the tumor. The inhibitory effects that can be determined include, e.g., a decrease of tumor size or tumor volume, a decrease of tumor growth, a reduction of the increase rate of tumor volume in a subject (e.g., as compared to the rate of increase in tumor volume in the same subject prior to treatment or in another subject without such treatment), a decrease in the risk of developing a metastasis or the risk of developing one or more additional metastasis, an increase of survival rate, and an increase of life expectancy, etc. The tumor volume in a subject can be determined by various methods, e.g., as determined by direct measurement, MRI or CT.
In some embodiments, the tumor comprises one or more cancer cells (e.g., human or mouse cancer cells) that are injected into the animal. In some embodiments, the anti-CD94 antibody prevents MHC class I molecules from binding to a CD94/NKG2 receptor. In some embodiments, the anti-CD94 antibody does not prevent MHC class I molecules from binding to a CD94/NKG2 receptor.
In some embodiments, the tumor comprises one or more cancer cells (e.g., human or mouse cancer cells) that are injected into the animal. In some embodiments, the anti-NKG2A antibody prevents MHC class I molecules from binding to a CD94/NKG2 receptor. In some embodiments, the anti-NKG2A antibody does not prevent MHC class I molecules from binding to a CD94/NKG2 receptor. Exemplary anti-NKG2A antibodies include monalizumab (See PCT Publication No.: WO2016041945A1).
In some embodiments, the genetically modified animals can be used for determining whether an anti-CD94 antibody is a CD94 agonist or antagonist. In some embodiments, the methods as described herein are also designed to determine the effects of the agent (e.g., anti-CD94 antibodies) on CD94. In some embodiments, the genetically modified animals can be used for determining whether an anti-NKG2A antibody is a NKG2A agonist or antagonist. In some embodiments, the methods as described herein are also designed to determine the effects of the agent (e.g., anti-NKG2A antibodies) on NKG2A. The effects can include, whether the agent can stimulate immune cells or inhibit immune cells (e.g., NK cells or CD8+ T lymphocytes), whether the agent can increase or decrease the production of cytokines, whether the agent can activate or deactivate immune cells (e.g., NK cells or CD8+ T lymphocytes), whether the agent can upregulate the immune response or downregulate immune response, and/or whether the agent can induce complement mediated cytotoxicity (CMC) or antibody dependent cellular cytoxicity (ADCC). In some embodiments, the genetically modified animals can be used for determining the effective dosage of a therapeutic agent for treating a disease in the subject, e.g., cancer, or autoimmune diseases.
The inhibitory effects on tumors can also be determined by methods known in the art, e.g., measuring the tumor volume in the animal, and/or determining tumor (volume) inhibition rate (TGITV). The tumor growth inhibition rate can be calculated using the formula TGITV (%) = (1 -TVt/TVc) x 100, where TVt and TVc are the mean tumor volume (or weight) of treated and control groups.
In some embodiments, the anti-CD94 antibody or anti-NKG2A antibody is designed for treating various cancers. As used herein, the term “cancer” refers to cells having the capacity for autonomous growth, i.e., an abnormal state or condition characterized by rapidly proliferating cell growth. The term is meant to include all types of cancerous growths or oncogenic processes, metastatic tissues or malignantly transformed cells, tissues, or organs, irrespective of histopathologic type or stage of invasiveness. The term “tumor” as used herein refers to cancerous cells, e.g., a mass of cancerous cells. Cancers that can be treated or diagnosed using the methods described herein include malignancies of the various organ systems, such as affecting lung, breast, thyroid, lymphoid, gastrointestinal, and genito-urinary tract, as well as adenocarcinomas which include malignancies such as most colon cancers, renal-cell carcinoma, prostate cancer and/or testicular tumors, non-small cell carcinoma of the lung, cancer of the small intestine and cancer of the esophagus. In some embodiments, the agents described herein are designed for treating or diagnosing a carcinoma in a subject. The term “carcinoma” is art recognized and refers to malignancies of epithelial or endocrine tissues including respiratory system carcinomas, gastrointestinal system carcinomas, genitourinary system carcinomas, testicular carcinomas, breast carcinomas, prostatic carcinomas, endocrine system carcinomas, and melanomas. In some embodiments, the cancer is renal carcinoma or melanoma. Exemplary carcinomas include those forming from tissue of the cervix, lung, prostate, breast, head and neck, colon and ovary. The term also includes carcinosarcomas, e.g., which include malignant tumors composed of carcinomatous and sarcomatous tissues. An “adenocarcinoma” refers to a carcinoma derived from glandular tissue or in which the tumor cells form recognizable glandular structures. The term “sarcoma” is art recognized and refers to malignant tumors of mesenchymal derivation.
In some embodiments, the cancer types as described herein include, but not limited to, lymphoma, non-small cell lung cancer (NSCLC), leukemia, ovarian cancer, nasopharyngeal cancer, breast cancer, endometrial cancer, colon cancer, rectal cancer, stomach cancer, bladder cancer, lung cancer, bronchial cancer, bone cancer, prostate cancer, pancreatic cancer, liver and bile duct cancer, esophageal cancer, kidney cancer, thyroid cancer, head and neck cancer, testicular cancer, glioblastoma, astrocytoma, melanoma, myelodysplastic syndrome, and sarcoma. In some embodiments, the leukemia is selected from acute lymphocytic (lymphoblastic) leukemia, acute myeloid leukemia, myeloid leukemia, chronic lymphocytic leukemia, multiple myeloma, plasma cell leukemia, and chronic myelogenous leukemia. In some embodiments, the lymphoma is selected from Hodgkin’s lymphoma and non-Hodgkin’s lymphoma, including B-cell lymphoma, diffuse large B-cell lymphoma, follicular lymphoma, mantle cell lymphoma, marginal zone B-cell lymphoma, T cell lymphoma, and Waldenstrom macroglobulinemia. In some embodiments, the sarcoma is selected from osteosarcoma, Ewing sarcoma, leiomyosarcoma, synovial sarcoma, soft tissue sarcoma, angiosarcoma, liposarcoma, fibrosarcoma, rhabdomyosarcoma, and chondrosarcoma. In some embodiments, the cancer types as described herein also include gynecologic cancer, ovarian cancer, uterine cancer, vaginal cancer, cervical cancer, vulvar cancer, head and neck cancer (head and neck squamous cell carcinoma (HNSCC)), non-small cell lung cancer (NSCLC), hematological cancer, solid tumor, breast cancer, chronic lymphocytic leukemia, squamous cell carcinoma of the oral cavity, colorectal cancer, liver cancer, glioblastoma, Hodgkin lymphoma, esophagus cancer, gastric cancer, pancreas cancer, renal cancer, lung cancer, or melanoma. In some embodiments, the cancer type is head and neck squamous cell carcinoma (HNSCC).
In some embodiments, the antibody is designed for treating various immune disorder or immune-related diseases (e.g., rheumatoid arthritis, psoriasis, allergic rhinitis, sinusitis, asthma, atopic dermatitis, chronic obstructive pulmonary disease (COPD), chronic bronchitis, emphysema, eczema, osteoarthritis, rheumatoid arthritis, systemic lupus erythematosus, polymyalgia rheumatica, autoimmune hemolytic anemia, systemic vasculitis, pernicious anemia, inflammatory bowel disease, ulcerative colitis, Crohn’s disease, or multiple sclerosis). Thus, the methods as described herein can be used to determine the effectiveness of an anti-CD94 or anti-NKG2A antibody in inhibiting immune response.
In some embodiments, the immune disorder or immune-related diseases described here include allergy, asthma, myocarditis, nephritis, hepatitis, systemic lupus erythematosus, rheumatoid arthritis, scleroderma, hyperthyroidism, primary thrombocytopenic purpura, autoimmune hemolytic anemia, ulcerative colitis, self-immune liver disease, diabetes, pain, or neurological disorders.
In some embodiments, the antibody is designed for reducing inflammation (e.g., inflammatory bowel disease, chronic inflammation, asthmatic inflammation, periodontitis, or wound healing). Thus, the methods as described herein can be used to determine the effectiveness of an antibody for reducing inflammation. In some embodiments, the inflammation described herein includes degenerative inflammation, exudative inflammation, serous inflammation, fibrinitis, suppurative inflammation, hemorrhagic inflammation, necrotitis, catarrhal inflammation, proliferative inflammation, specific inflammation, tuberculosis, syphilis, leprosy, or lymphogranuloma.
In some embodiments, the antibody is designed for treating viral infection, e.g., HIV-induced infection, EBV-induced infection, or HPV-induced infection.
The present disclosure also provides methods of determining toxicity of an antibody (e.g., anti-CD94 antibody or anti-NKG2A antibody). The methods involve administering the antibody to the animal as described herein. The animal is then evaluated for its weight change, red blood cell count, hematocrit, and/or hemoglobin. In some embodiments, the antibody can decrease the red blood cells (RBC), hematocrit, or hemoglobin by more than 20%, 30%, 40%, or 50%. In some embodiments, the animals can have a weight that is at least 5%, 10%, 20%, 30%, or 40% smaller than the weight of the control group (e.g., average weight of the animals that are not treated with the antibody).
The present disclosure also relates to the use of the animal model generated through the methods as described herein in the development of a product related to an immunization processes of human cells, the manufacturing of a human antibody, or the model system for a research in pharmacology, immunology, microbiology and medicine.
In some embodiments, the disclosure provides the use of the animal model generated through the methods as described herein in the production and utilization of an animal experimental disease model of an immunization processes involving human cells, the study on a pathogen, or the development of a new diagnostic strategy and/or a therapeutic strategy.
The disclosure also relates to the use of the animal model generated through the methods as described herein in the screening, verifying, evaluating or studying the CD94 gene function, human CD94 antibodies, drugs for human CD94 targeting sites, the drugs or efficacies for human CD94 targeting sites, the drugs for immune-related diseases and antitumor drugs. The disclosure also relates to the use of the animal model generated through the methods as described herein in the screening, verifying, evaluating or studying the NKG2A gene function, human NKG2A antibodies, drugs for human NKG2A targeting sites, the drugs or efficacies for human NKG2A targeting sites, the drugs for immune-related diseases and antitumor drugs.
In some embodiments, the disclosure provides a method to verify in vivo efficacy of TCR-T, CAR-T, and/or other immunotherapies (e.g., T-cell adoptive transfer therapies). For example, the methods include transplanting human tumor cells into the animal described herein, and applying human CAR-T to the animal with human tumor cells. Effectiveness of the CAR-T therapy can be determined and evaluated. In some embodiments, the animal is selected from the CD94 gene and/or NKG2A gene humanized non-human animals prepared by the methods described herein, the CD94 gene and/or NKG2A gene humanized non-human animals described herein, the double- or multi-humanized non-human animal generated by the methods described herein (or progeny thereof), a non-human animal expressing the human or humanized CD94 protein and/or NKG2A protein, or the tumor-bearing or inflammatory animal models described herein. In some embodiments, the TCR-T, CAR-T, and/or other immunotherapies can treat the CD94-associated and/or NKG2A-associated diseases described herein. In some embodiments, the TCR-T, CAR-T, and/or other immunotherapies provides an evaluation method for treating the CD94-associated and/or NKG2A-associated diseases described herein.
The present disclosure further relates to methods for generating genetically modified animal model with two or more human or chimeric genes. The animal can comprise a human or chimeric CD94 and/or NKG2A gene and a sequence encoding an additional human or chimeric protein.
In some embodiments, the additional human or chimeric protein can be programmed cell death protein 1 (PD-1), B7 Homolog 3 (B7-H3), V-set domain-containing T-cell activation inhibitor 1 (B7-H4), Interleukin-2 (IL-2), Interleukin-23 subunit alpha (IL23A), C-C Motif Chemokine Receptor 2 (CCR2), cytotoxic T-lymphocyte-associated protein 4 (CTLA-4), Lymphocyte Activating 3 (LAG-3), B And T Lymphocyte Associated (BTLA), Programmed Cell Death 1 Ligand 1 (PD-L1), CD27, CD28, CD47, CD137, CD154, T-Cell Immunoreceptor With Ig And ITIM Domains (TIGIT), T-cell Immunoglobulin and Mucin-Domain Containing-3 (TIM-3), Glucocorticoid-Induced TNFR-Related Protein (GITR), Signal regulatory protein α (SIRPα) or TNF Receptor Superfamily Member 4 (TNFRSF4 or OX40).
The methods of generating genetically modified animal model with two or more human or chimeric genes (e.g., humanized genes) can include the following steps:
In some embodiments, in step (b) of the method, the genetically modified animal can be mated with a genetically modified non-human animal with human or chimeric PD-1, B7-H3, B7-H4, IL-2, IL23A, CCR2, CTLA-4, LAG-3, BTLA, PD-L1, CD27, CD28, CD47, CD137, CD154, TIGIT, TIM-3, GITR, SIRPα, or OX40. Some of these genetically modified non-human animal are described, e.g., in PCT/CN2017/090320, PCT/CN2017/099577, PCT/CN2017/099575, PCT/CN2017/099576, PCT/CN2017/099574, PCT/CN2017/106024, PCT/CN2017/110494, PCT/CN2017/110435, PCT/CN2017/120388, PCT/CN2018/081628, PCT/CN2018/081629; each of which is incorporated herein by reference in its entirety.
In some embodiments, the CD94 gene and/or NKG2A gene humanization is directly performed on a genetically modified animal having a human or chimeric PD-1, B7-H3, B7-H4, IL-2, IL23A, CCR2, CTLA-4, BTLA, PD-L1, CD27, CD28, CD47, CD137, CD154, TIGIT, TIM-3, GITR, SIRPα, or OX40 gene.
As these proteins may involve different mechanisms, a combination therapy that targets two or more of these proteins thereof may be a more effective treatment. In fact, many related clinical trials are in progress and have shown a good effect. The genetically modified animal model with two or more human or humanized genes can be used for determining effectiveness of a combination therapy that targets two or more of these proteins, e.g., an anti-CD94 and/or anti-NKG2A antibody and an additional therapeutic agent (e.g., an anti-PD-1 antibody) for the treatment of cancer. The methods include administering the anti-CD94 and/or anti-NKG2A antibody and the additional therapeutic agent (e.g., an anti-PD-1 antibody) to the animal, wherein the animal has a tumor; and determining the inhibitory effects of the combined treatment to the tumor. In some embodiments, the additional therapeutic agent is an antibody that specifically binds to PD-1, B7-H3, B7-H4, IL-2, IL23A, CCR2, CTLA-4, BTLA, PD-L1, CD27, CD28, CD47, CD137, CD154, TIGIT, TIM-3, GITR, SIRPa, or OX40. In some embodiments, the additional therapeutic agent is an anti-CTLA4 antibody (e.g., ipilimumab), an anti-PD-1 antibody (e.g., nivolumab), an anti-PD-L1 antibody, or an anti-EGFR (epidermal growth factor receptor) antibody (e.g., cetuximab).
In some embodiments, the animal further comprises a sequence encoding a human or humanized PD-1, a sequence encoding a human or humanized PD-L1, or a sequence encoding a human or humanized CTLA-4. In some embodiments, the additional therapeutic agent is an anti-PD-1 antibody (e.g., nivolumab, pembrolizumab), an anti-PD-L1 antibody, or an anti-CTLA-4 antibody. In some embodiments, the tumor comprises one or more tumor cells that express CD80, CD86, PD-L1, and/or PD-L2.
In some embodiments, the combination treatment is designed for treating various cancer as described herein, e.g., melanoma, non-small cell lung carcinoma (NSCLC), small cell lung cancer (SCLC), bladder cancer, prostate cancer (e.g., metastatic hormone-refractory prostate cancer), advanced breast cancer, advanced ovarian cancer, and/or advanced refractory solid tumor. In some embodiments, the combination treatment is designed for treating metastatic solid tumors, NSCLC, melanoma, B-cell non-Hodgkin lymphoma, colorectal cancer, and multiple myeloma. In some embodiments, the combination treatment is designed for treating melanoma, carcinomas (e.g., pancreatic carcinoma), mesothelioma, hematological malignancies (e.g., Non-Hodgkin’s lymphoma, lymphoma, chronic lymphocytic leukemia), or solid tumors (e.g., advanced solid tumors). In some embodiments, the cancer types as described herein also include gynecologic cancer, ovarian cancer, uterine cancer, vaginal cancer, cervical cancer, vulvar cancer, head and neck cancer (head and neck squamous cell carcinoma (HNSCC)), non-small cell lung cancer (NSCLC), hematological cancer, solid tumor, breast cancer, chronic lymphocytic leukemia, squamous cell carcinoma of the oral cavity, colorectal cancer, liver cancer, glioblastoma, Hodgkin lymphoma, esophagus cancer, gastric cancer, pancreas cancer, renal cancer, lung cancer, or melanoma. In some embodiments, the cancer type is head and neck squamous cell carcinoma (HNSCC).
In some embodiments, the methods described herein can be used to evaluate the combination treatment with some other methods. The methods of treating a cancer that can be used alone or in combination with methods described herein, include, e.g., treating the subject with chemotherapy, e.g., campothecin, doxorubicin, cisplatin, carboplatin, procarbazine, mechlorethamine, cyclophosphamide, adriamycin, ifosfamide, melphalan, chlorambucil, bisulfan, nitrosurea, dactinomycin, daunorubicin, bleomycin, plicomycin, mitomycin, etoposide, verampil, podophyllotoxin, tamoxifen, taxol, transplatinum, 5-flurouracil, vincristin, vinblastin, and/or methotrexate. Alternatively or in addition, the methods can include performing surgery on the subject to remove at least a portion of the cancer, e.g., to remove a portion of or all of a tumor(s), from the patient.
The invention is further described in the following examples, which do not limit the scope of the invention described in the claims.
The following materials were used in the following examples.
BglII, StuI, DraIII, and EcoRV restriction enzymes were purchased from NEB. Catalog numbers are R0144M, R0187M, R3510L, and R3195M, respectively.
C57BL/6 mice and Flp transgenic mice were purchased from the China Food and Drugs Research Institute National Rodent Experimental Animal Center.
Brilliant Violet 510™ anti-mouse CD45 antibody (mCD4-BV605) was purchased from BioLegend with catalog number 103138.
PE-Cy™7 Mouse Anti-Mouse NK-1.1 (mNK1.1-PE/Cy7) antibody was purchased from BD Biosciences with catalog number 552878 (under brand BD Pharmingen™).
PE anti-mouse CD159a (NKG2AB6) Antibody (mNKG2A-PE) was purchased from BioLegend with catalog number 142803.
Alexa Fluor® 647-conjugated AffiniPure F(ab′)2 Fragment Goat Anti-Human IgG, Fcy fragment Specific (minimal cross-reaction to Bovine, Mouse, and Rabbit Serum Proteins) was purchased from Jackson Immuno Research, with catalog number 109-606-170.
PE Mouse IgG2b, κ Isotype Ctrl Antibody was purchased from BioLegend with catalog number 400311.
APC anti-mouse CD94 Antibody (mCD94-APC) was purchased from BioLegend with catalog number 105511.
PE anti-human CD94 Antibody (hCD94-PE) was purchased from BioLegend with catalog number 305504.
APC Rat IgG2a, κ Isotype Ctrl Antibody was purchased from BioLegend with catalog number 400512.
APC Mouse IgG2a, κ Isotype Ctrl Antibody was purchased from BioLegend with catalog number 400220.
Alexa Fluor® 647 Mouse IgG1, κ Isotype Ctrl (FC) was purchased from BioLegend with catalog number 400130.
Zombie NIR™ Fixable Viability Kit was purchased from BioLegend with catalog number 423106.
Human IgG4, κ Isotype Control Antibody was purchased from CrownBio with catalog number C0004-3.
FITC Rat Anti-Mouse CD3 antibody was purchased from BioLegend with catalog number 115506.
In this example, a non-human animal (e.g., a mouse) was modified to include a nucleotide sequence encoding human CD94 protein, and the obtained genetically-modified non-human animal can express a human or humanized CD94 protein in vivo. The mouse CD94 gene (NCBI Gene ID: 16643, Primary source: MGI: 1196275, UniProt ID: O54707) is located at 129588092 to 129598775 of chromosome 6 (NC_000072.6), and the human CD94 gene (NCBI Gene ID: 3824, Primary source: HGNC: 6378, UniProt ID: Q13241) is located at 10238383 to 10329608 of chromosome 12 (NC_000012.12). The mouse CD94 transcript is NM_010654.4, and the corresponding protein sequence NP_034784.1 is set forth in SEQ ID NO: 1. The human CD94 transcript is NM_001351062.1, and the corresponding protein sequence NP_001337991.1 is set forth in SEQ ID NO: 2. Mouse and human CD94 gene loci are shown in
Genetically modified non-human animals can be generated by several gene editing techniques that are known in the art, including but not limited to, zinc finger nucleases (ZFN), transcription activator-like effector-based nucleases (TALEN), homing endonucleases (megakable base ribozyme), the clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system, or other molecular biology techniques. In this example, a nucleotide sequence encoding human CD94 protein was introduced into the endogenous mouse CD94 locus, such that the mouse can express a human or humanized CD94 protein. The method can include insertion of a sequence including a human CD94 gene sequence into the mouse endogenous CD94 locus. For example, the inserted sequence can include a human CD94 DNA or cDNA sequence. It is also possible to include an auxiliary sequence (e.g., a stop codon or a sequence having termination functions, etc.) or to use other methods (e.g., flipping or knocking out) to inactivate the mouse endogenous CD94 gene. The strategy of in situ replacement can also be used, e.g., a direct replacement with human CD94 gene sequence (DNA or cDNA sequence) on the mouse endogenous CD94 gene locus. Here, an in situ replacement strategy of CD94 DNA sequence was used to illustrate how to humanize the mouse CD94 gene.
Mouse cells can be modified by various gene-editing techniques, for example, replacement of specific mouse CD94 gene sequences with human CD94 gene sequences at the endogenous mouse CD94 locus. For example, under control of a mouse CD94 regulatory element, a 4731 bp sequence spanning from exon 3 (including a part of exon 3) to exon 6 (including a part of exon 6) of the mouse CD94 gene can be replaced with a corresponding 5157 bp sequence spanning from exon 4 (including a part of exon 4) to exon 7 (including a part of exon 7) of the human CD94 gene, to obtain a humanized mouse CD94 gene locus as shown in
As shown in the schematic diagram of the targeting strategy in
The upstream of the KI fragment containing the human CD94 gene sequence is directly connected to the 5′ homologous arm, and the connection between the downstream of the human CD94 gene sequence and the mouse CD94 gene locus was designed as: 5′-GAAGATAAAAATCGTTATATCTGTAAGCAACAGCTCATTTAAATGTTTCTTAAGGCA AAGGGTATAGACAAGGAAGGTCC -3′ (SEQ ID NO: 6), wherein the last “T” in sequence “TCATT” is the last nucleotide of the human sequence, and the first “T” of the sequence “TAAAT” is the first nucleotide of the mouse sequence. The mRNA sequence of the engineered mouse CD94 after humanization and its encoded protein sequence are shown in SEQ ID NO: 7 and SEQ ID NO: 8, respectively.
The CD94 gene targeting vector also included an antibiotic resistance gene for positive clone screening (neomycin phosphotransferase gene, or Neo), and two Frt recombination sites flanking the antibiotic resistance gene, that formed a Neo cassette. The connection between the upstream of the Neo cassette and the mouse CD94 gene locus was designed as: 5′-AAGTATGGTAACATATCATCTGCGGATGAAGCTTGATATCGAATTCCGAAGTTCCTA TTCTCTAGAAAGTATAGGAACTT -3′ (SEQ ID NO: 9), wherein the last “G” of the sequence “GATG” is the last nucleotide of the mouse sequence, and the first “A” of the sequence “AAGC” is the first nucleotide of the Neo cassette. The connection between the downstream of the Neo cassette with the mouse CD94 sequence was designed as 5′-TATTCTCTAGAAAGTATAGGAACTTCATCAGTCAGGTACATAATGGTGGATCCAGGC CTGATGTGGTTTGATTGGTTCTGTTCCT -3′ (SEQ ID NO: 10), wherein the “T” of the sequence “GCCT” is the last nucleotide of the Neo cassette, and the first “G” of the sequence “GATGT” is the first nucleotide of the mouse sequence. In addition, a coding gene with a negative selectable marker (a gene encoding diphtheria toxin A subunit (DTA)) was also inserted downstream of the 3′ homologous arm of the targeting vector.
The targeting vector used for replacement of a mouse CD94 gene sequence with the corresponding human CD94 gene sequence was constructed, e.g., by restriction enzyme digestion and ligation, or synthesized directly. Mouse and human CD94 DNA were obtained from bacterial artificial chromosome (BAC) clones RP23-208D19 and RP11-282C10, respectively. The constructed targeting vector sequence was preliminarily confirmed by restriction enzyme digestion, and then verified by sequencing. The correct targeting vector was electroporated and transfected into embryonic stem cells of C57BL/6 mice. The positive selectable marker gene was used to screen the cells, and the integration of exogenous genes was confirmed by PCR and Southern Blot. Specifically, positive clones identified by PCR were further confirmed by Southern Blot to screen out correct positive clone cells used for blastocyst injection.
Either the primers CD94-F1 and CD94-R1, or primers CD94-F2 and CD94-R2, were used for PCR amplification, and 9 clones were identified as positive clones with numbers 1-A09, 2-B09, 2-C02, 2-D02, 2-H10, 3-B12, 3-D05, 4-C03, and 4-G06. The positive clones identified by PCR were then verified by Southern Blot. Specifically, genomic DNA of the positive clone cells was digested with BglII, StuI, or DraIII, respectively, and then hybridized with 3 corresponding probes. As shown in
The following probes were used in Southern Blot assays:
The positive clones that had been screened (black mice) were introduced into isolated blastocysts (white mice), and the resulted chimeric blastocysts were transferred to a culture medium for short-term culture and then transplanted to the fallopian tubes of the recipient mother (white mice) to produce the F0 chimeric mice (black and white). The F2 generation homozygous mice were obtained by backcrossing the F0 generation chimeric mice with wild-type mice to obtain the F1 generation mice, and then breeding the F1 generation heterozygous mice with each other. The positive mice were also bred with the Flp transgenic mice to remove the positive selectable marker gene, and then the humanized CD94 homozygous mice expressing humanized CD94 protein were obtained by breeding with each other. The genotype of the progeny mice can be identified by PCR using primers shown in the table below. The identification results of exemplary F1 generation mice (Neo cassette not removed) are shown in
The expression of humanized CD94 protein in CD94 gene humanized heterozygous mice was confirm by flow cytometry. Specifically, one wild-type C57BL/6 mouse (6 weeks old) and one CD94 gene humanized heterozygous mouse (approximately 6-12 weeks old) were selected and the mouse spleens were isolated. Dead spleen cells were stained with vitality dyes (Zombie NIR™, BioLegend) and eliminated by flow cytometry. The live spleen cells were then stained with one of the following combinations of fluorescent dye-labeled antibodies: (1) mNK1.1-PE/Cy7, FITC Rat Anti-Mouse CD3 antibody, and mCD94-APC (
In this example, a non-human animal (e.g., a mouse) was modified to include a nucleotide sequence encoding human NKG2A protein, and the obtained genetically-modified non-human animal can express a human or humanized NKG2A protein in vivo. The mouse NKG2A gene (NCBI Gene ID: 16641, Primary source: MGI: 1336161, UniProt ID: Q9Z202) is located at 129666015 to 129682852 of chromosome 6 (NC_000072.6), and the human NKG2A gene (NCBI Gene ID: 3821, Primary source: HGNC: 6374, UniProt ID: P26715) is located at 10441673 to 10454685 of chromosome 12 (NC_000012.12). The mouse NKG2A transcript sequence is NM_001136068.2, and the corresponding protein sequence NP_001129540.1 is set forth in SEQ ID NO: 28. The human NKG2A transcript is NM_213658.2, and the corresponding protein sequence NP_998823.1 is set forth in SEQ ID NO: 29. Mouse and human NKG2A gene loci are shown in
Genetically modified non-human animals can be generated by several gene editing techniques that are known in the art, including but not limited to, zinc finger nucleases (ZFN), transcription activator-like effector-based nucleases (TALEN), homing endonucleases (megakable base ribozyme), the clustered regularly interspaced short palindromic repeats (CRISPR)-Cas system, or other molecular biology techniques. In this example, a nucleotide sequence encoding human NKG2A protein was introduced into the endogenous mouse NKG2A locus, such that the mouse can express a human or humanized NKG2A protein. The method can include insertion of a sequence including a human NKG2A gene sequence into the mouse endogenous NKG2A locus. For example, the inserted nucleotide sequence can include a human NKG2A DNA or cDNA sequence. It is also possible to include an auxiliary sequence (e.g., a stop codon or a sequence having termination functions, etc.) or to use other methods (e.g., flipping or knocking out) to make the mouse endogenous NKG2A gene unable to express normally. The strategy of in situ replacement can also be used, e.g., a direct replacement with human NKG2A gene sequence (DNA or cDNA sequence of human NKG2A) on the mouse endogenous NKG2A locus. Here, a in situ replacement strategy of NKG2A DNA sequence was used to illustrate how to humanize the mouse NKG2A gene.
Mouse cells can be modified by various gene-editing techniques, for example, replacement of specific mouse NKG2A gene sequences with human NKG2A gene sequences at the endogenous mouse NKG2A locus. For example, under control of a mouse NKG2A regulatory element, a 3438 bp sequence spanning from exon 2 (including a part of exon 2) to exon 6 (including a part of exon 6) of the mouse NKG2A gene can be replaced with a corresponding 3934 bp sequence spanning from exon 4 (including a part of exon 4) to exon 8 (including a part of exon 8) of the human NKG2A gene, to obtain a humanized mouse NKG2A locus as shown in
As shown in the schematic diagram of the targeting strategy in
The upstream of the KI fragment containing the human NKG2A gene sequence is directly connected to the 5′ homologous arm, and the connection between the downstream of the human NKG2A gene sequence and the mouse NKG2A gene locus was designed as: 5′-TCAATAATATATCATTGTAAGCATAAGCTTTGAAACACCTGCACTGG-3′ (SEQ ID NO: 33), wherein the last “T” in sequence “GCTT” is the last nucleotide of the human sequence, and the “T” of the sequence “TGAA” is the first nucleotide of the mouse sequence. The mRNA sequence of the engineered mouse NKG2A after humanization and its encoded protein sequence are shown in SEQ ID NO: 34 and SEQ ID NO: 35, respectively.
The targeting vector also included an antibiotic resistance gene for positive clone screening (neomycin phosphotransferase gene, or Neo), and two Frt recombination sites flanking the antibiotic resistance gene, that formed a Neo cassette. The connection between the upstream of the Neo cassette and the mouse NKG2A gene locus was designed as: 5′-ATTGCCAGTTGTATATTGCAACTTCAGCTTCTGTAGTACATTTGGGTCGAATTCCGAA GTTCCTATTCTCTAGAAAGTAT -3′ (SEQ ID NO: 36), wherein the “C” of the sequence “GGTC” is the last nucleotide of the mouse sequence, and the “G” of the sequence “GAAT” is the first nucleotide of the Neo cassette. The connection between the downstream of the Neo cassette with the mouse NKG2A sequence was designed as 5′-AGGAACTTCATCAGTCAGGTACATAATTAGGTGGATCCACCCACTTTTAGTCAATAA GTAATATTATATA -3′ (SEQ ID NO: 37), wherein the last “C” of the sequence “ATCC” is the last nucleotide of the Neo cassette, and the “A” of the sequence “ACCC” is the first nucleotide of the mouse sequence. In addition, a coding gene with a negative selectable marker (a gene encoding diphtheria toxin A subunit (DTA)) was also inserted downstream of the 3′ homologous arm of the targeting vector.
The targeting vector used for replacement of a mouse NKG2A gene sequence with the corresponding human NKG2A gene sequence was constructed, e.g., by restriction enzyme digestion and ligation, or synthesized directly. Mouse and human NKG2A DNA were obtained from bacterial artificial chromosome (BAC) clones RP23-164F1 and RP11-653F19, respectively. The constructed targeting vector sequence was preliminarily confirmed by restriction enzyme digestion, and then verified by sequencing. The correct targeting vector was electroporated and transfected into embryonic stem cells of C57BL/6 mice. The positive selectable marker gene was used to screen the cells, and the integration of exogenous genes was confirmed by PCR and Southern Blot. Specifically, positive clones identified by PCR were further confirmed by Southern Blot to screen out correct positive clone cells used for blastocyst injection.
The positive clones that had been screened (black mice) were introduced into isolated blastocysts (white mice), and the resulted chimeric blastocysts were transferred to a culture medium for short-term culture and then transplanted to the fallopian tubes of the recipient mother (white mice) to produce the F0 chimeric mice (black and white). The F2 generation homozygous mice were obtained by backcrossing the F0 generation chimeric mice with wild-type mice to obtain the F1 generation mice, and then breeding the F1 generation heterozygous mice with each other. The positive mice were also bred with the Flp transgenic mice to remove the positive selectable marker gene, and then the humanized NKG2A homozygous mice expressing humanized NKG2A protein were obtained by breeding with each other. The genotype of the progeny mice can be identified by PCR using primers shown in the table below.
CRISPR/Cas gene editing technology was used to obtain the NKG2A gene humanized mice. The target sequences are important for the targeting specificity of sgRNAs and the efficiency of Cas9-induced cleavage. Specific sgRNA sequences were designed and synthesized that recognize the 5′ end targeting site (sgRNA1-sgRNA8) and 3′ end targeting site (sgRNA9-sgRNA15). The 5′ end targeting sites are located on exon 2, and the 3′ end targeting sites are located on exon 6 of the mouse NKG2A gene. The targeting site sequence of each sgRNA on the NKG2A gene locus is as follows:
The UCA kit was used to detect the activities of sgRNAs. As shown in
The expression of humanized NKG2A protein in the positive mice was confirmed, e.g., using the Fluorescence-Activated Cell Sorting (FACS) method, and the detection was performed as follows. One wild-type C57BL/6 mouse and one NKG2A gene humanized heterozygous mouse were selected. The mouse spleen cells were stained with Brilliant Violet 510™ anti-mouse CD45 (an anti-mouse CD45 antibody), PE/Cy™ 7 Mouse anti-mouse NK1.1 (an anti-mouse NK cell surface antigen antibody), and either one of: (1) mNKG2A-PE (using PE Mouse IgG2b, κ Isotype Ctrl Antibody as an isotype control (ISO)) (
The CD94 gene humanized mice generated in Example 1 and the NKG2A gene humanized mice generated in Example 2 can also be used to generate double- or multi-gene humanized mouse model. For example, in Example 2, the embryonic stem (ES) cells used for blastocyst microinjection can be selected from the CD94 gene humanized positive clone cells in Example 1, thereby generating NKG2A and CD94 double-gene humanized mice. In addition, it is also possible to breed the homozygous or heterozygous NKG2A and/or CD94 gene humanized mice obtained by the methods described herein with other genetically modified homozygous or heterozygous mice, and the offspring can be screened. According to Mendel’s law, it is possible to generate double-gene or multi-gene modified heterozygous mice comprising humanized NKG2A and/or CD94 genes and other genetic modifications. Then the heterozygous mice can be bred with each other to obtain homozygous double-gene or multi-gene humanized mice.
For example, NKG2A/CD94 double-gene humanized mice (B-hCD94/hNKG2A) can be generated using the above method. Because mouse NKG2A and CD94 genes are both located on chromosome 6, after obtaining the CD94 gene humanized positive ES cells, the method as described in Example 2 was performed to conduct a second round of targeting. Afterwards, the positive offspring can be screened, thereby generating NKG2A/CD94 double-gene humanized mice.
RT-PCR can be used to detect the expression of humanized NKG2A mRNA and humanized CD94 mRNA in NKG2A/CD94 double-gene humanized mice. Specifically, three wild-type C57BL/6 mice (7 weeks old) and three NKG2A/CD94 double-gene humanized homozygous mice (7 weeks old) were selected and mouse spleen was isolated after euthanasia. Total RNA from the spleen was extracted and then reverse transcribed into cDNA using a reverse transcription kit, followed by PCR amplification. The primer sequence are shown in the table below.
As shown in
Further, flow cytometry was used to detect the in vivo expression of NKG2A and CD94 proteins in NKG2A/CD94 double-gene humanized mice. Specifically, one wild-type C57BL/6 mouse (9 weeks old) and one NKG2A/CD94 double-gene humanized homozygous mouse (9 weeks old) were selected and mouse spleen was isolated after euthanasia.
The spleen cells were stained with anti-mouse CD45 antibody Brilliant Violet 510™ anti-mouse CD45 and mNK1.1-PE/Cy7, together with (1) mCD94-APC; (2) hCD94-PE; (3) mNKG2A-PE; and/or (4) monalizumab in combination with Alexa Fluor® 647-conjugated AffiniPure F(ab′)2 Fragment Goat Anti-Human IgG, Fcy fragment Specific. APC Mouse IgG2a, κ Isotype Ctrl Antibody, PE Mouse IgG2b, κ Isotype Ctrl Antibody, or APC Rat IgG2a, κ Isotype Ctrl Antibody were used as isotype controls. As shown in
Further, immuno-phenotyping of leukocytes and T cells in the spleen of wild-type C57BL/6 mice and NKG2A/CD94 double-gene humanized homozygous mice was performed by flow cytometry. The immuno-phenotyping detection results of leukocytes and T cells in the spleen are shown in
The above results showed that NKG2A/CD94 double-gene humanized mice expressing humanized NKG2A protein and humanized CD94 protein can be successfully constructed.
In addition, the CD94 and/or NKG2A gene humanized mice as described herein can also be used to generate triple- or multi-gene humanized mice. For example, NKG2A/CD94/PD-1 triple-gene humanized mice can be generated. Because mouse PD-1 gene is located on chromosome 1, NKG2A/CD94 double-gene humanized mice can be selected for breeding with PD-1 gene humanized mice, and the positive offspring can be screened, thereby generating NKG2A/CD94/PD-1 triple-gene humanized mice.
The NKG2A and/or CD94 gene humanized mice as described herein can be used to establish mouse tumor models for testing the efficacy of drugs targeting human NKG2A and/or CD94. Specifically, the NKG2A/CD94 double-gene humanized homozygous mice (8 weeks old) as prepared in Example 3 were subcutaneously injected with 5 × 105 mouse colon cancer cell B-CAG-hHLA-E MC38 (i.e., MC38 cells expressing human HLA). When the tumor volume grew to about 100 mm3, the mice were randomly placed into a control group (G1) and a treatment group (G2) based on tumor size (5 mice per group). The treatment group mice were administered with the anti-human NKG2A antibody monalizumab via intraperitoneal injection (i.p.), whereas the control group mice were injected with an equal volume of phosphate-buffered saline (PBS). The mice were administered on the grouping day, and the frequency of administration was twice a week (6 times of administrations in total). The tumor volume was measured twice a week and the body weight of the mice was weighed as well. Euthanasia was performed when the tumor volume of the mouse reached 3000 mm3. The table below shows grouping and administration details. The mouse body weight, body weight change, and tumor volume measurement results during the experimental period are shown in
The table below lists the major data and analysis results of each experiment, including: the tumor volume at the time of grouping (Day 0), 11 days post grouping (Day 11), and 18 days post grouping (Day 18); number of survived mice and tumor-free mice at the end of the experiment; tumor growth inhibition value (TGITV%); and the statistical difference (P value) of body weight and tumor volume between the treatment group and the control group.
Overall, the animals in each group were healthy, and the body weights of all the treatment group mice (G2) and control group mice (G1) increased, and were not significantly different (P > 0.05) from each other during the experimental period (
It is to be understood that while the invention has been described in conjunction with the detailed description thereof, the foregoing description is intended to illustrate and not limit the scope of the invention, which is defined by the scope of the appended claims. Other aspects, advantages, and modifications are within the scope of the following claims.
Number | Date | Country | Kind |
---|---|---|---|
202010444099.X | May 2020 | CN | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2021/095255 | 5/21/2021 | WO |