Immunotherapy has revolutionized the treatment of a wide variety of diseases, including cancer and autoimmune disorders. While therapeutic antibodies have been raised successfully against large numbers of antigens, there still are certain types of targets that have proven challenging for antibody generation by standard methods in rodents. These include targets with epitopes that are inaccessible to or rarely bound by typical CDRs (e.g., targets with clefts or minimal surface access), such as multipass membrane proteins (e.g., G protein coupled receptors (GPCRs) and ion channels), as well as enzyme active sites and allosteric epitopes. Such targets also can be difficult to stabilize as immunogen preparations, adding to the challenge of their successful use as antigens.
Despite these hurdles, antibodies to intractable antigens such as transmembrane receptors have been described, including against the M2 ion channel of Influenza A (Wei et al. (2011) PLOS One 6:e28309), the formyl peptide receptor 1 (FPR1) GPCR (Douthwaite et al. (2015) MABS 7:152-66), the voltage gated potassium channel Kv1.3 (Wang et al. (2016) Proc. Natl. Acad. Sci. USA 113:11501-11506) and the 5-hydroxytryptamine 2B (5HT2B) GPCR (Ishchenko et al. (2017) Proc. Natl. Acad. Sci. USA 114:8223-8228). These antibodies were observed to have or were modified to have unusually long CDR H3 regions.
Some of these antibodies were raised in cows or camels, which are known to have alternative immunoglobulin locus scaffold structures that can give rise to CDR H3 regions that are longer than typical in rodents or humans (reviewed in de los Rios et al. (2015) Curr. Opin. Struct. Biol. 33:27-41; see also De Genst et al. (2006) Proc. Natl. Acad. Sci. USA 103:4586-4591; Wang et al. (2013) Cell 153:1379-1393; Sok et al. (2017) Nature 548:108-111). For example, the bovine Ig locus has limited combinatorial diversity potential because it has only 12 VH regions, but it can generate unusually long CDR H3 regions that can reach lengths of over 60 amino acids long, whereas human CDR H3 regions typically are only 8-16 amino acids in length. The camelid Ig locus can give rise to heavy chain-only antibodies having a dedicated variable domain (VHH) with long CDR H3 regions.
Approaches to take advantage of long CDR H3 regions for antibody generation have been described, including transgenic chickens with long CDR H3 regions (US Patent Publication No. 20210230253) and libraries of genetic packages with long CDR H3 regions (US Patent Publication No. 20200399785).
While some advances have been made, additional approaches and compositions are needed for designing, preparing and using heavy chain long CDR3 transgenes, particularly for use in raising antibodies to intractable antigens.
The disclosure provides human immunoglobulin heavy chain transgene constructs that encode a long CDR H3 region. The extended CDR H3 region results from the inclusion in the transgene of D-D fusion segments, which comprise two D regions joined together. The long CDR H3 heavy chain transgenes of the disclosure can be introduced into an animal host for raising antibodies, in particular to intractable antigens such as multipass transmembrane receptors (e.g., GPCRs and ion channels) that may be more readily bound by long CDR H3-containing antibodies.
Accordingly, in one aspect, the disclosure pertains to a transgene construct encoding an immunoglobulin heavy chain variable region comprising:
In an embodiment, the plurality of human D-D fusion segments comprises at least one naturally-occurring human D-D fusion pair. In an embodiment, the plurality of human D-D fusion segments comprises at least one synthetic human D-D fusion pair. In an embodiment, the plurality of human D-D fusion segments comprises at least one synthetic human D-D fusion pair and at least one naturally-occurring human D-D fusion pair.
In an embodiment, the transgene construct encodes at least five, at least ten, at least fifteen, at least twenty or at least twenty-five VH regions. In an embodiment, the transgene construct encodes human VH regions in 5′ to 3′ orientation as follows: 3-73, 3-72, 2-70D, 1-69, 4-61, 5-51, 3-49, 3-43, 4-39, 4-32, 2-26, 1-24, 3-23, 3-15, 3-9, 1-8, 2-5, 7-4-1, 1-2 and 6-1.
In an embodiment, the transgene construct encodes a synthetic 1-1/2-8 D-D fusion or a synthetic 2-8/1-1 D-D fusion. In an embodiment, the transgene construct encodes a synthetic 1-1/2-8 D-D fusion and a synthetic 2-8/1-1 D-D fusion.
In an embodiment, the transgene construct encodes at least one naturally-occurring D-D fusion pair selected from the group consisting of 2-2/3-3; 5-12/4-17; 5-5/3-22; 6-6/3-10; 6-6/6-19; 5-12/5-5; 6-13/6-19; 2-2/6-13; 5-12/6-19; 6-13/2-21; 5-5/3-10; 2-15/3-22; 5-12/2-15; 2-15/5-24; 6/19-1/26; 2-15/5-5; 5-12/3-22; 2-15/4-17; 5-5/6-13; 6-19/3-22; 2-15/2-21 and 3-3/3-10.
In an embodiment, the transgene construct encodes naturally-occurring D-D fusion pairs 2-2/3-3; 5-12/4-17; 5-5/3-22; 6-6/3-10; 6-6/6-19; 5-12/5-5; 6-13/6-19; 2-2/6-13; 5-12/6-19; 6-13/2-21; 5-5/3-10; 2-15/3-22; 5-12/2-15; 2-15/5-24; 6/19-1/26; 2-15/5-5; 5-12/3-22; 2-15/4-17; 5-5/6-13; 6-19/3-22; 2-15/2-21 and 3-3/3-10.
In an embodiment, the transgene construct further encodes at least one natural D segment.
In an embodiment, the transgene construct encodes D segments and D-D fusion pairs in 5′ to 3′ orientation as follows: 1-1/2-8; 2-2/3-3; 3-3; 5-12/4-17; 5-5/3-22; 6-6/3-10; 6-6/6-19; 2-8; 5-12/5-5; 6-13/6-19; 2-2/6-13; 5-12/6-19; 6-13/2-21; 5-5/3-10; 2-15/3-22; 3-16; 5-12/2-15; 2-15/5-24; 6/19-1/26; 2-15/5-5; 5-12/3-22; 2-15/4-17; 5-5/6-13; 6-19/3-22; 2-15/2-21; 3-3/3-10 and 2-8/1-1.
In an embodiment, the transgene construct encodes J1-J6 segments.
In an embodiment, the transgene construct further encodes a constant region (e.g., a mouse or human Ig constant region) downstream (3′) of the plurality of J segments.
In an embodiment, the transgene construct further comprises lox sites to facilitate cre/lox mediated RMCE (Recombinase Mediated Cassette Exchange). In an embodiment, the transgene construct further comprises Guide Recombination Sequences (GRS) to facilitate CRISPR/CAS-mediated recombination.
In an embodiment, the transgene construct comprises the sequence shown in SEQ ID NO: 1.
In an embodiment, the transgene construct is carried on a bacterial artificial chromosome (BAC).
In another aspect, the disclosure pertains to a transgenic animal comprising a transgene construct of the disclosure. In an embodiment, the transgenic animal is a mouse. In an embodiment, the transgenic mouse further comprises a transgene construct encoding an immunoglobulin light chain such that the mouse expresses antibodies comprising the light chain paired with a heavy chain comprising a long CDR3 region.
In another aspect, the disclosure pertains to a method of generating antibodies to an antigen of interest, the method comprising administering the antigen of interest to the transgenic animal (e.g., mouse) of the disclosure, such that antibodies that bind to the antigen of interest are generated. In an embodiment, the method further comprises isolating an antibody of interest from the animal and determining the heavy chain CDR3 sequence thereof.
The long CDR3 heavy chain transgene constructs of the disclosure encode a combination of longer than average VH regions and D-D fusions, as illustrated schematically in
Unless otherwise defined, all terms of art, notations and other scientific terminology used herein are intended to have the meanings commonly understood by those of skill in the art to which this disclosure pertains. In some cases, terms with commonly understood meanings are defined herein for clarity and/or for ready reference, and the inclusion of such definitions herein should not necessarily be construed to represent a difference over what is generally understood in the art. The techniques and procedures described or referenced herein are generally well understood and commonly employed using conventional methodologies by those skilled in the art, such as, for example, the widely utilized molecular cloning methodologies described in Sambrook el al., Molecular Cloning: A Laboratory Manual 4th ed. (2012) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. As appropriate, procedures involving the use of commercially available kits and reagents are generally carried out in accordance with manufacturer defined protocols and/or parameters unless otherwise noted.
The design and construction of a long CDR3 human heavy chain transgene is described in detail in Example 1. As illustrated in
Non-limiting examples of human VH regions between 98 and 101 amino acids in length include regions: 6-1, 2-5, 2-26, 2-70, 2-70D, 3-15, 3-49, 3-72, 3-73, 3-9, 3-23, 3-43, 4-30-2, 4-30-4, 4-31, 4-32, 4-39, 4-61, 5-51, 7-4-1, 1-2, 1-3, 1-8, 1-18, 1-24, 1-38-4, 1-45, 1-46, 1-58 and 1-69. In an embodiment, the transgene construct encodes at least five, at least ten, at least fifteen, at least twenty or at least twenty-five VH regions, e.g., selected from the aforementioned list.
In an embodiment, the transgene construct encodes human VH regions in 5′ to 3′ orientation as follows: 3-73, 3-72, 2-70D, 1-69, 4-61, 5-51, 3-49, 3-43, 4-39, 4-32, 2-26, 1-24, 3-23, 3-15, 3-9, 1-8, 2-5, 7-4-1, 1-2 and 6-1.
In an embodiment, the plurality of human D-D fusion segments comprises at least one naturally-occurring human D-D fusion pair. In an embodiment, the plurality of human D-D fusion segments comprises at least one synthetic human D-D fusion pair. In an embodiment, the plurality of human D-D fusion segments comprises at least one synthetic human D-D fusion pair and at least one naturally-occurring human D-D fusion pair. Non-limiting examples of naturally-occurring and synthetic human D-D fusion pairs are shown in
In an embodiment, the transgene construct encodes a synthetic 1-1/2-8 D-D fusion or a synthetic 2-8/1-1 D-D fusion. In an embodiment, the transgene construct encodes a synthetic 1-1/2-8 D-D fusion and a synthetic 2-8/1-1 D-D fusion.
In an embodiment, the transgene construct encodes at least one, at least two, at least three, at least four, or at least five, at least six, at least seven, at least eight, at least nine, at least ten or more naturally-occurring D-D fusion pairs selected from the group consisting of 2-2/3-3; 5-12/4-17; 5-5/3-22; 6-6/3-10; 6-6/6-19; 5-12/5-5; 6-13/6-19; 2-2/6-13; 5-12/6-19; 6-13/2-21; 5-5/3-10; 2-15/3-22; 5-12/2-15; 2-15/5-24; 6/19-1/26; 2-15/5-5; 5-12/3-22; 2-15/4-17; 5-5/6-13; 6-19/3-22; 2-15/2-21 and 3-3/3-10.
In an embodiment, the transgene construct encodes naturally-occurring D-D fusion pairs 2-2/3-3; 5-12/4-17; 5-5/3-22; 6-6/3-10; 6-6/6-19; 5-12/5-5; 6-13/6-19; 2-2/6-13; 5-12/6-19; 6-13/2-21; 5-5/3-10; 2-15/3-22; 5-12/2-15; 2-15/5-24; 6/19-1/26; 2-15/5-5; 5-12/3-22; 2-15/4-17; 5-5/6-13; 6-19/3-22; 2-15/2-21 and 3-3/3-10.
In an embodiment, the transgene construct further encodes at least one human natural D segment (i.e., a D segment that is not a D-D fusion). In embodiments, the transgene construct further encodes at least two, at least three, at least four, at least five, at least six, at least seven, at least eight, at least nine, at least ten or more natural D segments. In an embodiment, the natural D segment(s) is longer than average, e.g., encodes a translated sequence of ≥9 a.a., ≥10 a.a., ≥11 a.a., or is 9-12 amino acids in length. Non-limiting examples of human natural D segments that are 9-12 amino acids in length include segments 2-2, 2-8, 2-15, 2-21, 3-3, 3-9, 3-10, 3-16 and 3-22. In an embodiment, the transgene construct comprises natural human D segments 3-3, 2-8 and 3-16.
In an embodiment, the transgene construct encodes D segments and D-D fusion pairs in 5′ to 3′ orientation as follows: 1-1/2-8; 2-2/3-3; 3-3; 5-12/4-17; 5-5/3-22; 6-6/3-10; 6-6/6-19; 2-8; 5-12/5-5; 6-13/6-19; 2-2/6-13; 5-12/6-19; 6-13/2-21; 5-5/3-10; 2-15/3-22; 3-16; 5-12/2-15; 2-15/5-24; 6/19-1/26; 2-15/5-5; 5-12/3-22; 2-15/4-17; 5-5/6-13; 6-19/3-22; 2-15/2-21; 3-3/3-10 and 2-8/1-1.
In an embodiment, the transgene construct encodes human J1-J6 segments.
In an embodiment, the transgene construct further encodes a constant region (e.g., a mouse or human Ig constant region) downstream (3′) of the plurality of J segments.
In an embodiment, the transgene construct further comprises lox sites to facilitate cre/lox mediated RMCE (Recombinase Mediated Cassette Exchange). In an embodiment, the transgene construct further comprises Guide Recombination Sequences (GRS) to facilitate CRISPR/CAS-mediated recombination.
The nucleotide sequence of the transgene construct can be further optimized for intended purposes. For example, the construct can be altered for codon optimization (e.g., to increase expression of the encoded regions). Additionally or alternatively, the construct can be altered to avoid excessive somatic hypermutation (SHM), for example by analyzing hypermutable regions in CDR1, CDR2 and or CDR3 of the heavy chain variable regions and eliminating sequence(s) that could enhance SHM. Approaches for codon optimization and SHM reduction are well established in the art.
The transgene construct can further comprise sequences that allow for targeted insertion of the transgene into a specific locus, e.g., an endogenous mouse heavy chain locus. Knock-in technology for replacing an endogenous locus with a targeted transgene is well established in the art. In a preferred embodiment, the transgene construct comprises recombination sequences (Guide Recombination Sequences, or GRS) allowing for the transgene to be knocked-in to the endogenous mouse heavy chain locus.
In an embodiment, the long CDR3 heavy chain construct comprises the nucleotide sequence shown in SEQ ID NO: 1.
The transgene constructs of the disclosure can be prepared using standard recombinant DNA techniques. Cloning vectors containing polylinkers are useful as starting vectors for insertion of DNA fragments of interest. Suitable cloning vectors are well established in the art. Moreover, plasmids or other vectors (e.g., YACs) carrying human unrearranged light chain immunoglobulin sequences have been described in the art (see e.g., U.S. Pat. Nos. 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,789,650; 5,877,397; 5,661,016; 5,814,318; 5,874,299; and 5,770,429; all to Lonberg and Kay; and U.S. Pat. Nos. 5,939,598; 6,075,181; 6,114,598; 6,150,584 and 6,162,963, all to Kucherlapati et al.) and can be used as a source of heavy chain V. D and J region sequences. Alternatively, desired sequences can be synthesized by standard methods. The appropriate DNA fragments are then operatively linked through ligation into a cloning vector, followed by characterization of the vector (e.g., by restriction fragment analysis or sequencing or the like) to ensure proper arrangement of the fragments.
In an embodiment, the transgene construct is carried on a bacterial artificial chromosome (BAC). BAC technology for carrying Ig transgenes is well-established in the art.
A non-limiting example of a long CDR3 heavy chain vector of the disclosure is illustrated schematically in
To prepare the transgene construct for microinjection or other technique for transgenesis, the transgene construct can be isolated from the vector in which it is carried by cleavage with appropriate restriction enzymes to release the transgene construct fragment. The fragment can be isolated using standard techniques, such as by pulse field gel electrophoresis on an agarose gel, followed by isolation of the fragment from the agarose gel, such as by [beta]-agarase digestion or by electroelution. For example, the agarose gel slice containing the transgene construct fragment can be excised from the gel and the agarose can be digested with [beta]-agarase (e.g., from Takara), using standard methodology. Alternatively, preparation of the transgene for knock-in purposes may be carried out by standard BAC or plasmid purification techniques, isolating the closed circular form for direct transfection or introduction into recipient mouse cells or embryos.
Another aspect of the disclosure pertains to a transgenic non-human host animal that comprises a transgene construct of the disclosure (i.e., the transgene construct is integrated into the genome of the host animal), such that the animal expresses an immune repertoire that comprises antibodies using heavy chains comprising long CDR3 regions. The transgenic non-human host animals of the disclosure are prepared using standard methods known in the art for introducing exogenous nucleic acid into the genome of a non-human animal. In a preferred embodiment, the transgene construct is inserted into the genome of the host animal using knock-in technology to replace all or a part of an endogenous heavy chain locus (e.g., an endogenous mouse heavy chain locus) with the transgene (e.g., a human heavy chain transgene). Alternatively, the transgene construct can be introduced into the genome of the host animal by, for example, pronuclear microinjection for random genomic insertion, or transfection into mouse embryonic stem (mES) cells.
For knock-in approaches, typically loxP flanking sites are included in the construct such that these sites facilitate recombination between host loxP flanking sites and loxP flanking sites in the transgene donor upon expression of Cre recombinase. Recombination is performed in embryonic stem cells (e.g., mouse embryonic stem cells) and then embryonic stem cells with the modification of interest are implanted into a viable blastocyst, which then grows into a mature chimeric animal (e.g., mouse) with some cells having the original blastocyst cell genetic information and other cells having the modifications introduced to the embryonic stem cells. Subsequent offspring of the chimeric animal will then have the gene knock-in. Knock-in technology is summarized in, for example, Manis (2007) New Engl. J. Med. 357:2426-2429.
As an alternative knock-in approach, the construct can include flanking Guide Recombination Sequences (GRS) to facilitate CRISPR/CAS-mediated recombination. These are 500-1500 bp sequences which flank the transgene insert, and have specific homology to endogenous mouse sequences that adjoin specific CRISPR/CAS cleavage sites in the mouse genome. Appending the same CRISPR/CAS cleavage sites to the ends of the GRS flanking sequences allows for CRISPR/CAS mediated digestion to simultaneously cleave the endogenous mouse genome as well as the circular BAC transgene donor. In this manner, the cleaved ends of the mouse CRISPR/CAS sites are available for homologous recombination mediated repair via the similarly cleaved and linearized transgene donor insert, resulting in a site-specific knock-in.
In a preferred embodiment, the transgene construct is inserted into the genome of a mouse using knock-in technology to replace all or part of the endogenous heavy chain locus. In a preferred embodiment, the transgene construct is a human heavy chain construct that is inserted into the endogenous mouse heavy chain locus by homologous recombination, thereby deleting at least some portion of the mouse VH, DH, and JH and CH sequences. In another embodiment, a heavy chain transgene that lacks a constant region is inserted into the endogenous heavy chain locus such that at least some portion of the VH. DH and JH sequences is deleted but the CH sequences are left intact and operatively linked to the functional heavy chain variable region of the transgene, thereby producing chimeric antibodies in the mice (which can be reverse engineered to be fully human).
Another method for preparing a transgenic non-human animal, in particular a transgenic mouse, is that of pronuclear microinjection. This technology is well established in the art (see e.g., Wagner. T. E. et al. (1981) Proc. Natl. Acad. Sci. USA 78:6376-6380; U.S. Pat. No. 4,873,191 by Wagner and Hoppe). In general, the method involves introducing exogenous genetic material into the pronucleus of a mammalian zygote (e.g., mouse zygote) by microinjection to obtain a genetically transformed zygote and then transplanting the genetically transformed zygote into a pseudopregnant female animal. The embryo is then allowed to develop to term and the genome of the resultant offspring is analyzed for the presence of the transgenic material.
Southern blot analysis. PCR, or other such technique for analyzing genomic DNA is used to detect the presence of a unique nucleic acid fragment that would not be present in the non-transgenic animal but would be present in the transgenic animal. Selective breeding of transgenic offspring allows for homozygosity of the transgene to be achieved.
If a long CDR3 heavy chain transgene is inserted into the genome randomly, it is preferable to also disable the endogenous heavy chain locus to achieve a limited repertoire of heavy chain options in the animal (e.g., mouse). The endogenous heavy chain locus can be engineered to render it non-operational by standard knock-out technology, for example by deleting all or a portion of the endogenous heavy chain V, D, J and C regions such that they are non-functional.
Although the preferred embodiment of the disclosure comprises transgenic mice, the invention encompasses other non-human host animals, including but not limited to rats, rabbits, pigs, goats, sheep, cows, and chickens. Techniques for creating transgenic animals of each of these species have been described in the art. For example, preparation of transgenic rats is described in Tesson, L. et al. (2005) Transgenic Res. 14:531-546, including by techniques such as DNA microinjection, lentiviral vector mediated DNA transfer into early embryos and sperm-mediated transgenesis. Methods of transgenesis in rats are also described in Mullin, L. J. et al. (2002) Methods Mol. Biol. 180:255-270. Preparation of transgenic rabbits is described in, for example, Fan, J. et al. (1999) Pathol. Int. 49:583-594; Fan, J. and Watanabe, T. (2000) J. Atheroscler. Thromb. 7:26-32; Bosze, Z. et al (2003) Transgenic Res. 12:541-553. Preparation of transgenic pigs is described in, for example. Zhou. CY. et al. (2002) Xenotransplantation 9:183-190; Vodicka, P. et al. (2005) Ann. N. Y. Acad. Sci. 1049:161-171.
Alternative transgenesis techniques to pronuclear microinjection in pigs include adenovirus mediated introduction of DNA into pig sperm (see e.g., Farre, L. et al. (1999) Mol. Reprod. Dev. 53:149-158) and linker-based sperm-mediated gene transfer (Chang, K. et al. (2002) BMC Biotechnol. 2:5). Preparation of transgenic goats is described in, for example. Ebert, K. M. et al. (1991) Biotechnology (NY) 9:835-838; Baldassarre. H. et al. (2004) Reprod. Fertil. Dev. 16:465-470. Somatic cell nuclear transfer in goats is described in, for example. Behboodi. E. et al. (2004) Transgenic Res. 11:215-224. Preparation of transgenic sheep is described in, for example. Ward, K. A. and Brown. B. W. (1998) Reprod. Fertil. Dev. 10:659-665. Preparation of transgenic cows is described in, for example, Donovan, D. M. et al. (2005) Transgenic Res. 14:563-567. Gene transfection of donor cells for nuclear transfer of bovine embryos is described in, for example, Lee S. L. et al. (2005) Mol. Reprod. Dev. 72:191-200. Preparation of transgenic domestic farm animals is also reviewed in Niemann, H. et al. (2005) Rev. Sci. Tech. 24:285-298. Preparation of transgenic chickens is described in, for example, Pain, B. et al. (1999) Cells Tissues Organs 165:212-219; Lillico, S. G. et al. (2005) Drug Discov. Today 10:191-196; and Ishii, Y. et al. (2004) Dev. Dyn. 229:630-642.
An animal of the disclosure (e.g., mouse) carrying a long CDR3 heavy chain transgene construct can be cross-bred with an animal (e.g., mouse) that carries an immunoglobulin light chain transgene to thereby produce an animal (e.g., mouse) that expresses antibodies comprising the light chain paired with a heavy chain comprising long CDR3 regions. Immunoglobulin light chain transgenic animals (e.g., mice) are well-established in the art.
The transgenic animals of the disclosure are useful for generating antibodies to a wide variety of antigens of interest. For animals carrying only the long CDR3 heavy chain transgene and an endogenous light chain locus, the animal will produce chimeric light chain/heavy chain antibodies that, if desired, can be reverse engineered to pair the long CDR3 heavy chain with a light chain of the same species. Alternatively, for animals (e.g., mice) carrying both a long CDR3 heavy chain Ig transgene (e.g., human) and a light chain Ig transgene (e.g., human), fully heterologous antibodies (e.g., fully human antibodies) can be prepared in the host transgenic animal. For animals carrying chimeric Ig transgene loci (e.g., human variable regions appended to mouse constant regions, of the heavy and/or light chains), combinations of the humanized loci may functionally pair with other humanized loci, or alternatively with their wild-type mouse counterparts.
Accordingly, in another aspect, the disclosure pertains to a method of generating antibodies to an antigen of interest, the method comprising administering the antigen of interest to a transgenic animal of the disclosure. In an embodiment, the animal is a transgenic mouse and the antigen is administered to the mouse such that antibodies that bind to the antigen of interest are generated in the mouse. In an embodiment, the animal is a transgenic mouse carrying both a human Ig long CDR3 heavy chain transgene and a human Ig light chain transgene and the antigen is administered to the mouse such that human or human-mouse chimeric antibodies that bind to the antigen of interest are generated in the mouse. In an embodiment, the antigen is a GPCR or an ion channel protein. In an embodiment, the method can further comprise isolating an antibody of interest from the host animal (e.g., mouse) and determining the heavy chain CDR3 sequence of the antibody.
Transgenic animals can be immunized with an antigen(s) of interest by standard methodologies known in the art and antibodies generated in the animals can be isolated and characterized also be standard established methods. Polyclonal antibodies can be directly isolated form the host animal and monoclonal antibodies can be prepared by standard methods, such as hybridoma technology. Procedures for making monoclonal antibodies using hybridomas are well established in the art (see, e.g., U.S. Pat. No. 4,977,081, PCT Publication WO 97/16537, and European Patent No. 491057B1, the disclosures of which are incorporated herein by reference). Alternatively, in vitro production of monoclonal antibodies from cloned cDNA molecules is also established in the art (see e.g., Andris-Widhopf et al. (2000) J. Immunol. Methods 242:159; and Burton (1995) Immunotechnology 1:87, the disclosures of which are incorporated herein by reference). B cell clones from the immunized transgenic animals can be isolated and cDNAs encoding the antibodies can be isolated and cloned by standard molecular biology techniques into expression vectors. Further recombinant engineering of the cloned Ig cDNAs is also possible and well established in the art.
As used herein, the term “D-D fusion segment” is intended to refer to the direct joining of two different immunoglobulin heavy chain D region nucleic acid sequences. For example, a D-D fusion segment of a 5-12 D segment and a 4-17 D segment is referred to herein as a 5-12+4-17 D-D fusion segment (or simply “D-D fusion”). A “naturally-occurring D-D fusion pair” refers to a D-D fusion whose two constituent D segments have been observed in nature arising from a V(D-D)J recombination event. Such naturally-occurring D-D fusion pairs have been described in the art, such as in Larimore et al. (2012) J. Immunol. 189:3221-3230; Briney et al. (2012) Immunol. 137:56-64; Yu and Guan (2014) Front. Immunol. 5:250; Safonova and Pevzner (2019) Front. Immunol. 10:987; and Safonova and Pevzner (2020) Genome Res. 30:1547-1558. A “synthetic D-D fusion pair” refers to a D-D fusion that has not been observed in nature arising from a V(D-D)J recombination event. Regardless of whether a D-D fusion pair is “naturally-occurring” (i.e., has been observed in nature) or “synthetic” (i.e., has not been observed in nature), the D-D fusion segment(s) used in the transgenes of the disclosure can be engineered ex vivo by genetically linking the sequences of the two D segments by standard methods.
As used herein, “long CDR3”, “long H CDR3”, “long CDR H3” or “long HCDR3” refers to a heavy chain CDR3 region that is longer than the typical or average length of human heavy chain CDR3 regions, e.g., typically longer than 8-16 amino acids in length. A transgene of the disclosure, after VDJ recombination, generates heavy chain variable regions having CDR3s of various lengths and is considered to be a transgene encoding long CDR3s when a significant portion, e.g., at least 25%, at least 30%, at least 35%, at least 40%, at least 45% or at least 50% or more of the generated HCDR3s are longer than the typical or average length of HCDR3s.
As used herein, the term “operatively linked” is intended to describe the configuration of a nucleic acid sequence that is placed into a functional relationship with another nucleic acid sequence. For example, a promoter or enhancer is operatively linked to a coding sequence if it affects the transcription of the sequence. With respect to the joining of two protein coding regions, operatively linked means that the nucleic acid sequences being linked are contiguous and in reading frame. For splice donor/acceptor and RSS sequences, operatively linked means that the sequences are capable of effecting their functional purposes.
As used herein, the term “transgene” refers to a gene that is introduced as an exogenous source to a site within a host genome (e.g., mouse heavy chain Ig locus).
As used herein, the term “transgene construct” refers to a nucleic acid preparation suitable for introduction into the genome of a host animal.
As used herein, the term “transgenic mouse” refers to a mouse comprising cells having a transgene, as defined herein. The transgene may be present in all or some cells of the mouse.
As used herein, the term “unrearranged” with respect to an immunoglobulin V segment refers to an immunoglobulin V segment that is in its germline configuration wherein the V segment is not recombined so as to be immediately adjacent to a D or J segment.
The present invention is further illustrated by the following examples, which should not be construed as further limiting. The contents of figures and all references, patents and published patent applications cited throughout this application are expressly incorporated herein by reference.
In this example, the preparation of a human immunoglobulin heavy chain transgene construct is described, wherein the CDR3 domain of the construct is on average larger (longer) than that found in normal naïve human B cell heavy chain CDR3s. The rationale for this approach is that certain antigens can have functionally important epitopes that are inaccessible or rare to find amongst normal-length heavy chain CDR3 domain antibodies. The likelihood of an antibody interacting with such epitopes is increased by the use of a set of CDR3 domains that are longer (larger) than average.
Long CDR3s have been reported in the art to have potential for difficult membrane targets. In particular, GPCRs, ion channels and other membrane targets with clefts or limited surface access may benefit from long CDR3 mAbs to better access functional sites (see e.g., Douthwaite et al. (2015) MAbs 7:152-166; Corti et al. (2013) Annu. Rev. Immunol. 31:705-742; Wei et al. (2011) PLOS One 6:e28309; Wang et al. (2016) Proc. Natl. Acad. Sci. USA 113:11501-11506; Ishchenko et al. (2017) Proc. Natl. Acad. Sci. USA 114:8223-8228). Camelid and cow antibodies with long CDR3 domains have been exploited for this purpose (see e.g., Wang et al. (2013) Cell 153:1379-1393; de los Rios et al. (2015) Curr. Opin. Struct. Biol. 33:27-41; Sok et al. (2017) Nature 548:108-111).
Using mechanisms for CDR3 lengthening, transgenes were designed that skew expression to longer domains. First, the longest VH and D segments were identified and transgenes were designed to selectively use some of the longer VH segments and longer D segments. Additionally, D-D fusions using some of the shorter D segments were included so as to retain some of the naturally occurring D amino acid sequences while also keeping to the longer-length CDR3s.
The choices for VH segments were based on length and uniqueness. VH segments of 98-101 amino acids in length were selected for possible inclusion in the constructs.
Natural long D segments to be used in wild-type configuration were also selected for possible inclusion in the constructs, examples of which include: 2-02, 2-08, 2-15, 2-21, 3-03, 3-09, 3-10, 3-16 and 3-22.
Regarding D-D fusions, the natural formation of D-D fusions during recombination can be explained by the presence of so-called “cryptic nonamers”, which are nonamer sequences (normally part of the RSS motifs enabling VDJ recombination) that are out of their normal context but still able to support recombination (Safonova and Pevzner (2020) Genome Res. 30:1547-1558). Some D segments have a higher probability of the non-canonical nonamers in spacing that would allow for recombination, examples of which include: 2-02, 2-15, 3-03, 3-09, 3-10, 3-16, 3-22, 6-06, 6-19 and 6-25. Also, empirically it seems that the appearance of D-D fusions generally follows their germline order (5′ D fusing to a 3′ D), such that 3′ D segments less commonly fuse to 5′ segments.
The sequences of exemplary natural D segments, synthetic D-D fusions and naturally-occurring D-D fusions are shown in
The strategy for placement of the various D and D-D fusions into the construct was (in order of priority): (i) place a D-D fusion in a position where one of the D-D pair normally resides within the D domain, if possible; (ii) if its own locus is not possible, try to match expression of the naturally-occurring D segment there to the expression level of one of the D segments present in the D-D fusion; and (iii) find remaining slots where possible. The D segments and D-D fusions (and their associated nucleotide and amino acid sequences) were placed genomically into the transgene construct in the endogenous D locations shown in
A schematic diagram of a representative long CDR3 construct is shown in
This construct comprises the following native VH segments in 5′ to 3′ order: 3-73, 3-72, 2-70D, 1-69, 4-61, 5-51, 3-49, 3-43, 4-39, 4-32, 2-26, 1-24, 3-23, 3-15, 3-9, 1-8, 2-5, 7-4-1, 1-2 and 6-1. The VH segments are each 98-101 amino acids in length and use native RSS and octamer sites.
This construct also comprises three native DH sequences (3-3, 2-8 and 3-16), two novel synthetic D-D fusions (1-1/2-8 and 2-8/1-1) and 22 naturally-occurring D-D fusion pairs (2-2/3-3; 5-12/4-17; 5-5/3-22; 6-6/3-10; 6-6/6-19; 5-12/5-5; 6-13/6-19; 2-2/6-13; 5-12/6-19; 6-13/2-21; 5-5/3-10; 2-15/3-22; 5-12/2-15; 2-15/5-24; 6/19-1/26; 2-15/5-5; 5-12/3-22; 2-15/4-17; 5-5/6-13; 6-19/3-22; 2-15/2-21 and 3-3/3-10). The long D regions are 9-20 amino acids in length, they retain natural positioning as much as possible and the RSS sites are left intact. The 5′ to 3′ order of the D segments and D-D fusions in the construct is as follows: 1-1/2-8; 2-2/3-3; 3-3; 5-12/4-17; 5-5/3-22; 6-6/3-10; 6-6/6-19; 2-8; 5-12/5-5; 6-13/6-19; 2-2/6-13; 5-12/6-19; 6-13/2-21; 5-5/3-10; 2-15/3-22; 3-16; 5-12/2-15; 2-15/5-24; 6/19-1/26; 2-15/5-5; 5-12/3-22; 2-15/4-17; 5-5/6-13; 6-19/3-22; 2-15/2-21; 3-3/3-10 and 2-8/1-1.
The D-D fusion segments inserted into the construct used the DNA sequences shown in
LoxP flanks the insert at the 5′ end. There is also an ADAM/IGHD GRS1 site at the 5′ end and a J-Mu GRS site at the 3′ end to facilitate donor transgene and mouse host genome recombination. The success of GRS donor delivery for knockins has been validated to a level where this is the preferred mechanism for site-specific donor transgene delivery. The PGK-Puro cassette is flanked by FRT sites, which allows the cassette to be removed after GRS donor gene delivery.
The segment from the end of the “Long DH” as shown in
The long CDR3 bacterial artificial chromosome (BAC) transgene donor, as illustrated in
GCGGCCGCACCATCATGATATCCCACAAGTATGGAAGATACATAAGTAGATGGCCAGACTA
TACAGGAAAGAAGCAAAACTTGGGCTAACAGAAGAGCAAACACAGAGCTCAGACAGAACTA
CCTGGCAATGCGACTGGGCACACTGAAAGCACTGGGCATCAGCACTGAGCCCCAAATATGC
ACTCAGGATCCTCTGCATAATAATGTGACATAACAGGAAGGTTAGAACAGGCCAAAAGAGG
AAACAGAACAAATGCCCCCAACCAAAGAAGTATAAACAAATTGGGAAGAGTAAAGAAGGAT
TGTAAGGATTGAGTACCACACAGAACATGCTCTTAATGGCCTCAATGCTGAAGCTAGGAAG
AACTAAGTTAAAAGAAACATGTTCAACGGGATTCCCTGTCACTGGACTTCACAACAAGCAA
AATTCAATCTTTCTGTTAAGGAGATGAGAAGAGAATATCTGAACCTTGTGTTGACAGTGCC
CCACCCCGACTGTCAGGCTGTGGGAAATGCCAGAGCAATCACTAGGAACACACAAGGATGA
GGGAGACGAGGGTTAGGACACAATTCTCGAGAAGTTCCTATTCTCTAGAAAGTATAGGAAC
TTCCTGATCACTCGATCTCGTGCTACGATAACTTCGTATAGCATACATTATACGAAGTTAT
CGCGCCGCACACAAAAACCAACACACAGATCATGAAAATAAAGCTCTTTTATTGGTACCGA
ATTCGCCAGGGAGCTCTCAGACGTCGCTTGGTCGGTCTTTATTCGAACCCCAGAGTCCCGC
TCAGGCACCGGGCTTGCGGGTCATGCACCAGGTGCGCGGTCCTTCGGGCACCTCGACGTCG
GCGGTGACGGTGAAGCCGAGCCGCTCGTAGAAGGGGAGGTTGCGGGGCGCGGAGGTCTCCA
GGAAGGCGGGCACCCCGGCGCGCTCGGCCGCCTCCACTCCGGGGAGCACGACGGCGCTGCC
CAGACCCTTGCCCTGGTGGTCGGGCGAGACGCCGACGGTGGCCAGGAACCACGCGGGCTCC
TTGGGCCGGTGCGGCGCCAGGAGGCCTTCCATCTGTTGCTGCGCGGCCAGCCGGGAACCGC
TCAACTCGGCCATGCGCGGGCCGATCTCGGCGAACACCGCCCCCGCTTCGACGCTCTCCGG
CGTGGTCCAGACCGCCACCGCGGCGCCGTCGTCCGCGACCCACACCTTGCCGATGTCGAGC
CCGACGCGCGTGAGGAAGAGTTCTTGCAGCTCGGTGACCCGCTCGATGTGGCGGTCCGGGT
CGACGGTGTGGCGCGTGGCGGGGTAGTCGGCGAACGCGGCGGCGAGGGTGCGTACGGCCCG
GGGGACGTCGTCGCGGGTGGCGAGGCGCACCGTGGGCTTGTACTCGGTCATGGTTTAGTTC
CTCACCTTGTCGTATTATACTATGCCGATATACTATGCCGATGATTAATTGTCAACACGTG
CTGCTGCAGGTCGAAAGGCCCGGAGATGAGGAAGAGGAGAACAGCGCGGCAGACGTGCGCT
TTTGAAGCGTGCAGAATGCCGGGCCTCCGGAGGACCTTCGGGCGCCCGCCCCGCCCCTGAG
CCCGCCCCTGAGCCCGCCCCCGGACCCACCCCTTCCCAGCCTCTGAGCCCAGAAAGCGAAG
GAGCAAAGCTGCTATTGGCCGCTGCCCCAAAGGCCTACCCGCTTCCATTGCTCAGCGGTGC
TGTCCATCTGCACGAGACTAGTGAGACGTGCTACTTCCATTTGTCACGTCCTGCACGACGC
GAGCTGCGGGGGGGGGGGGAACTTCCTGACTAGGGGAGGAGTGGAAGGTGGCGCGAAGGGG
CCACCAAAGAACGGAGCCGGTTGGCGCCTACCGGTGGATGTGGAATGTGTGCGAGCCAGAG
GCCACTTGTGTAGCGCCAAGTGCCCAGCGGGGCTGCTAAAGCGCATGCTCCAGACTGCCTT
GGGAAAAGCGCCTCCCCTACCCGGTAGGAAGTTCCTATTCTCTAGAAAGTATAGGAACTTC
This application claims the benefit under 35 U.S.C. § 119(c) of U.S. Provisional Application No. 63/439,797, filed on Jan. 18, 2023, which is hereby incorporated herein by reference in its entirety for all purposes. The instant application contains a Sequence Listing which has been submitted electronically in .XML file format and is hereby incorporated by reference in its entirety. Said .XML copy, created on created on Jan. 12, 2024, is named ZL8017-WO-PCT_SL.xml and is 128,639 bytes in size.
Number | Date | Country | |
---|---|---|---|
63439797 | Jan 2023 | US |