The instant application contains a Sequence Listing which has been submitted electronically in XML file format and is hereby incorporated by reference in its entirety. Said XML copy, created on Mar. 15, 2024, is named 701039-092250USD1_SL.xml and is 91,877 bytes in size.
The invention relates to engineered antibodies and methods of discovery and/or optimizing antibodies.
The mammalian adaptive immune response relies upon antibodies. A healthy animal will produce a very large number of different antibodies, each of which can selectively bind to a different molecule, which is called an antigen. The binding of the antibody to an antigen triggers an immune response which allows the body to destroy the antigen. If the antigen is a molecule on a pathogen, this permits the body to counter the infection by attacking the pathogen.
Antibodies are comprised of two identical Ig heavy chain (IgH) polypeptides and two identical light chain (IgL) polypeptides. Portions of the IgH and IgL chains called the variable region form the antigen-binding site. The sequence of the antigen-binding site determines what antigen(s) the antibody can bind to and how tight that binding is. In order to have a robust immune response, it is important for an animal to have both a wide-variety of antigen-binding sites represented in the antibody population so that the body can recognize any given antigen, and a mechanism for affinity maturation of primary antibodies to improve the ability to recognize any given antigen.
The IgH variable regions are assembled in the genome of B cells from gene segments referred to as VH, D, and JH. Counting only the functional gene segments, there are 39 VH, 25 D and 6 JH segments in the human IgH locus. Prior to an antibody being expressed, the IgH gene will be subjected to a process called V(D)J recombination, in which 1 VH, 1 D, and 1 JH segment are combined in a highly diverse fashion in order to create a nucleic acid sequence that encodes a mature antibody. The different combinations of VH, D, and JH, and in particular the way the edges of the VH, D, and JH segments are joined to each other contribute to the extensive diversity of antibodies present in an individual. The light chain present in the B cell will be undergoing a similar set of processes, and further diversity is generated by the pairing of unique light and heavy chains along with the diversity in the junction by which they are put together. Specifically, the Ig light chain present in the B cell is generated by a similar V(D)J recombination process at either the Igκ or Igλ light chain locus. In these loci joining of VL and JL segments similarly results in generation of diverse IgL chains by joining different VL and JL segments and by generating diversity in the junctions in which they are put together Further antibody diversity in the context of the pairing of unique Ig light and Ig heavy chains further diversifies antibody repertories. In general, most B cells express a single IgH and IgL pair out of the huge number that can be assembled in the total population of developing B cells. In this regard, the size of the potentially expressed antibody repertoire in an organism is limited by the total number of B cells it can generate at steady state.
If an antibody encounters a foreign antigen to which it can bind, the B cell which makes that particular antibody will be activated. This will cause the B cell to replicate and those resulting B cells can be subject to additional genomic alterations that can lead to further diversification/affinity maturation (e.g. via somatic hypermutation (SHM) or germinal center reaction (GC)) of their antibodies. The efficacy of an antibody depends upon its specificity and affinity toward a relevant antigen. As described above, both V(D)J recombination and SHM make important contributions in this respect but at different points in the evolution of the antibody. V(D)J recombination creates an enormous pool of antigen-binding sites, individually expressed on particular B cells in a steady-state B cell population, so that any potential antigen might find a reasonable match; once a matched B cell has been found, somatic hypermutation and the GC response fine-tune the antigen-binding site to perfect the antibody-antigen interaction.
By studying natural immune responses or peripheral B cell repertoires, it is possible to identify which V, D, and/or J segments are most frequently expressed and by extension have a strong chance of being involved in generating an immune response to particular antigens. However, the ability of current methods of antibody production to apply the power of V(D)J recombination, SHM, and GC processes (Lonberg, Nature Biotechnology 23, 1117 (2005)) to optimization of existing antibodies in mice is limited, e.g., by the very small overall repertoire of potentially expressed antibodies in mice versus humans due to the much smaller size of the mouse immune system, and correspondingly much smaller number of B cells and potential precursors for targeting a particular immune response.
The invention relates to, in significant part, a novel method to generate novel antibodies (e.g., therapeutic and/or human antibodies), using a novel engineered immune system, e.g., in mice, as well as a novel system/method to optimize existing therapeutic antibodies or newly discovered candidate antibodies. In some embodiments, the system and/or method relates to an engineered mouse immune system. The engineered immune system is modified to allow easy insertion of one or more non-native components into the Ig locus of a model cell of a model animal. The engineered immune system is modified to drive production of V(D)J recombinations with any desired component, such as a desired VH and/or VL segment. These segments can be taken, for example, from a known antibody (e.g., human antibody) that is in need of improvement, such as improved affinity, specificity, or breadth. In some embodiments of any of the aspects, the segments are frequently used segments in human antibody repertoire. In some embodiments of any of the aspects, the segments are human V segments with mouse D and J segments. In some embodiments of any of the aspects, mouse D and J segments are appropriate for most humanized antibodies for two reasons: 1. Ds are diverse and in the full antibody the V(D)J junctional region is usually extremely diversified by V(D)J joining mechanisms, sometimes leaving the Ds nearly unrecognizable in the final antibody; 2. JH segments are highly homologous in mouse and human; 3. SHM can mature the entire V(D)J segment including the antigen contact CDR1, CDR2 and CDR3 V(D)J junction in mature B cells during germinal center (GC) reaction. Some embodiments involve expressing precursor IgH and IgH V exons specifically in peripheral or GC B cells to allow them to escape potential tolerance control (e.g., central tolerance control), so that they can be optimized specifically by SHM in peripheral germinal center (GC) B cell responses. The system can be carried out in a model animal, such as a mouse. Moreover, the engineered immune system can be used for optimizing antigens, e.g. for testing sequential immunization strategies for optimization of bnAbs.
The invention is based, at least in part, on the discovery that which IgH locus V segment is most strongly subject to V(D)J recombination can be controlled by providing non-native CBE sequences in an engineered Ig locus, for example by providing a CBE to the most proximal IgH VH5-1 which is barely rearranged (and which lacks an endogenous CBE) thereby rendering it the most highly rearranging VH. Thus, if the engineered VH 5-1 were replaced with a human VH (and a downstream engineered CBE was included in the engineered locus), the human VH will rearrange far more frequently than it would in the absence of the CBE.
Increases in the recombination of such a VH segment can also be obtained by rendering the downstream IGCR1 non-functional. Engineering both of these modifications together tremendously increases the utilization of the targeted VH making it the most dominantly used VH. The reasons for this is that in the absence of IGCR1, RAG (the V(D)J recombination initiating enzyme) bound to a DJHrecombination center can more readily find its next upstream target VH by a linear scanning mechanism during which their interaction with the RAG RC is promoted by an associated downstream CBE to promote their accessibility for rearrangement.
In the IgL (κ) locus, rendering the Cer/Sis sequence non-functional also enhance utilization of proximal Vκ segments due to a scanning mechanism similar to what occurs in the absence of IGCR1 function in the IgH locus. As demonstrated herein, a Cas9-gRNA based approach was used to delete the Sis/Cer elements of the Igk locus in a mouse v-Abl pre-B cell line that can be induced in vitro to undergo Igκ V(D)J recombination. After control and Sis/Cer deleted v-Abl pre-B cells were induced to undergo V(D)J recombination of their endogenous Igκ locus, HTGTS-based high throughput V(D)J recombination assay was used to analyze the frequency with which different endogenous Vκ segments rearranged to a Jk4 bait sequence. It was demonstrated that deletion of Cer/Sis elements substantially increased the rearrangement frequency of the proximal Vκ3-1, Vκ3-2 and Vκ3-3 segment (
Accordingly, as described herein, a target V segment (e.g., human Vκ3-20 or Vk1-33 segment), when positioned in place of the proximal mouse Vκ segments in the context of Sis/Cer deletion, will also be preferentially utilized during V(D)J recombination. Due to junctional diversification, the B cell population in this model expresses diverse repertoires of Vκ3-20 and/or Vk1-33 light chains; and, as described above, such diversity can be made even more human-like by incorporation of constitutive TdT expression in the ES cell-based model (which increased CDR 3 diversity. In a v-Abl model cell line system. deletion of Cer alone provided a similar phenotype regarding proximal Vκ rearrangement as deleting Cer/Sis indicating that deletion of Cer is sufficient to induce preferential rearrangement of proximal Vκ segments.
Furthermore, based on IgH results described elsewhere herein, adding a CBE to a proximal IgL Vκ will lead to its additionally enhanced utilization, particularly in the absence of Cer/Sis to allow unabated RAG scanning from the Jκ recombination center. Combined with the ability to replace the V segments themselves, e.g., proximal mouse VH and VL segments, with IGH and/or IgL V segments of particular interest, such modifications, when combined, will permit creation of immunoglobin repertories which comprise the VH and VL segment(s) of interest combined to diverse CDR3s at a much higher frequency than would occur naturally, a frequency that would much more approximate the frequency of these nacent antibodies (BCR) in the much larger human BCR repertoires. This discovery permits the engineering of antibodies comprising a desired VH and VL segment(s) while still allowing the antibody to participate in V(D)J recombination, somatic hypermutation, and the germinal center reaction—important processes that contribute to antibody diversity (e.g., CDR 3 diversity) and functionality. Notably, the complexity of the CDR3, which is arguably the greatest site for antigen contact diversity, would be far higher than in other existing humanized mouse models, which permits, upon immunization, the selection of a broader set of specific antibody precursors than exist in prior mouse models and the selected precursor antibody will then be further optimized by SHM of all three CDRs (including CDR3) upon further immunization and selection during the GC reaction. Thus, these methods and compositions described herein can permit the discovery of novel therapeutic antibodies and/or also can be used, to further improve specificity and/or affinity of an existing antibody.
In one aspect of any of the embodiments, described herein is a cell comprising at least one of:
In some embodiments of any of the aspects, the target VH or VL segment is a non-native, exogenous, or engineered segment. In some embodiments of any of the aspects, the cell is a mouse cell and the target VH or VL segment is a human segment. In some embodiments of any of the aspects, the cell further comprises a non-native DH, JH, and/or IL segment. In some embodiments of any of the aspects, the non-native DH, JH, or IL segment is a human segment. In some embodiments of any of the aspects, the human segment is from a known antibody in need of improvement of affinity or specificity. In some embodiments of any of the aspects, the human segments are highly-utilized human segments.
In some embodiments of any of the aspects, the cell is a stem cell embryonic stem cell. In some embodiments of any of the aspects, the cell is a murine cell, optionally a murine stem cell or murine embryonic stem cell.
In some embodiments of any of the aspects, the cell is heterozygous for the engineered IgH and/or IgL locus and the other IgH and/or IgL locus has been engineered to be inactive, wherein the cell will express an IgH and/or IgL chain only from the engineered IgH and/or IgL locus. In some embodiments of any of the aspects, the cell further comprises an engineered non-functional IGCR1 sequence in the IgH within the nucleic acid sequence separating the 3′ end of the 3′-most VH segment of the IgH locus and the 5′ end of a DH segment of the IgH locus. In some embodiments of any of the aspects, the non-functional IGCR1 sequence comprises mutated CBE sequences; the CBE sequences of the IGCR1 sequence have been deleted; or the IGCR1 sequence has been deleted from the IgH locus.
In some embodiments of any of the aspects, the cell further comprises at least one of the following:
In some embodiments of any of the aspects, the cell further comprises a mutation capable of activating, inactivating or modifying genes lead to increased GC antibody maturation responses. In some embodiments of any of the aspects, the cell further comprises a cassette targeting sequence in the target segment, which permits the replacement of the target segment. In some embodiments of any of the aspects, the cassette targeting sequence is selected from the group consisting of: an I-SceI meganuclease site; a Cas9/CRISPR target sequence; a Talen target sequence or a recombinase-mediated cassette exchange system. In some embodiments of any of the aspects, the cell further comprises an exogenous nucleic acid sequence encoding TdT. In some embodiments of any of the aspects, a promoter is operably linked to the sequence encoding TdT.
In one aspect of any of the embodiments, described herein is a genetically engineered mammal comprising a cell as described herein. In one aspect of any of the embodiments, described herein is a genetically engineered mammal consisting essentially of cells as described herein. In one aspect of any of the embodiments, described herein is a genetically engineered mammal consisting of cells as described herein. In one aspect of any of the embodiments, described herein is a chimeric genetically engineered mammal comprising two populations of cells,
In one aspect of any of the embodiments, described herein is a genetically engineered mammal comprising a population of cells comprising at least one of:
In some embodiments of any of the aspects, the target VH segment and/or the target VLsegment are human V segments. In some embodiments of any of the aspects, the IgH locus is further engineered to comprise one target D segment and/or one target JH segment. In some embodiments of any of the aspects, the IgL locus is further engineered to comprise one target JL segment. In some embodiments of any of the aspects, the D segment, JH segment, and/or JL segment are human segments. In some embodiments of any of the aspects, the human segments are from a known antibody in need of improvement of affinity or specificity. In some embodiments of any of the aspects, the human segments are highly-utilized human segments. In some embodiments of any of the aspects, the cell is heterozygous for the engineered IgH and/or IgL locus and the other IgH and/or IgL locus has been engineered to be inactive, wherein the cell will express an IgH and/or IgL chain only from the engineered IgH and/or IgL locus. In some embodiments of any of the aspects, the CBE element is located 5′ of at least one V segment in the locus. In some embodiments of any of the aspects, the CBE element is in the same orientation as the target segment. In some embodiments of any of the aspects, the CBE element is in the inverted orientation with respect to the target segment. In some embodiments of any of the aspects, the CBE element is located 3′ of the VH recombination signal sequence of the target V segment. In some embodiments of any of the aspects, the cell or mammal further comprises a mutation capable of activating, inactivating or modifying genes lead to increased GC antibody maturation responses. In some embodiments of any of the aspects, the cell or mammal further comprises an exogenous nucleic acid sequence encoding TdT. In some embodiments of any of the aspects, a promoter is operably linked to the sequence encoding TdT.
In some embodiments of any of the aspects, the mammal is a mouse or the cell is a mouse cell.
In one aspect of any of the embodiments, described herein is a set of at least two mammals, wherein each mammal is a mammal as described herein, the first mammal comprising a first target VH segment and/or a first target VL segment and each further mammal comprising a further target VH segment and/or a further target VL segment. In some embodiments of any of the aspects, each mammal comprises a human target VH segment and a human target VL segment.
In one aspect of any of the embodiments, described herein is a method of making an antibody, the method comprising the steps of: injecting a mouse blastocyst with a cell as described herein, wherein the cell is a mouse embryonic stem cell; implanting the mouse blastocyst into a female mouse under conditions suitable to allow maturation of the blastocyst into a genetically engineered mouse; and isolating
In one aspect of any of the embodiments, described herein is a method of making an antibody, the method comprising the steps of: isolating an antibody comprising the one or more target segments from a mammal or set of mammals described herein, or isolating a cell expressing an antibody comprising the one or more target segments from the mammal or set of mammals described herein. In some embodiments of any of the aspects, the method further comprises a step of immunizing the genetically engineered mammal or set of mammals with a desired target antigen before the isolating step.
In one aspect of any of the embodiments, described herein is a method of making an antibody which is specific for a desired antigen, the method comprising the steps of:
In one aspect of any of the embodiments, described herein is an antibody produced by any one of the methods described herein.
In some embodiments of any of the aspects, the antibody is an optimized antibody. In some embodiments of any of the aspects, the antibody is a humanized antibody.
In one aspect of any of the embodiments, described herein is a method of identifying a candidate antigen as an antigen that activates a B cell population comprising a VH or VL segment of interest, the method comprising: immunizing a mammal as described herein, engineered such that a majority of the mammal's peripheral B cells express the VH or VL segment of interest, with the antigen; measuring B cell activation in the mammal; and identifying the candidate antigen as an activator of a B cell population comprising the VH or VL segment of interest if the B cell activation in the mammal is increased relative to a reference level. In some embodiments of any of the aspects, an increase in B cell activation is an increase in the somatic hypermutation status of the Ig variable region; an increase in the affinity of mature antibodies for the antigen; or an increase in the specificity of mature antibodies for the antigen.
Provided herein are methods and compositions that permit a user to direct V(D)J recombination to utilize specific V segments of an Ig locus. Such methods can be utilized with wild-type V segments to generate an antibody repertoire that more frequently uses a particular V segment(s) and/or combined with additional modifications of the Ig locus in order to direct antibody repertoire development to use a non-native V segment. Three different types of Ig locus modifications are described herein, and each type can be utilized independently or in any combination with the other modification types. Additionally, the technology described herein can be combined with the IgH locus modifications described in US Patent Publication 2016/0374320; which is incorporated by reference herein in its entirety.
In one aspect of any of the embodiments, described herein is a cell comprising at least one of: a) an engineered IgH locus comprising a CBE element within the nucleic acid sequence separating the 3′ end of a target VH segment and the 5′ end of the first VH segment which is 3′ of the target VH segment; and/or b) an engineered IgL locus comprising at least one of: i) a non-functional Cer/Sis sequence within the nucleic acid sequence separating the 3′ end of the 3′-most VL segment and the 5′ end of a JL segment; and ii) a CBE element within the nucleic acid sequence separating the 3′ end of a target VL segment and the 5′ end of the first VL segment which is 3′ of the target VL segment. In some embodiments of any of the aspects, the CBE element can be located downstream of the RSS which flanks the 3′ end of the target VH segment. In one aspect of any of the embodiments, described herein is a cell comprising at least one of: a) an engineered IgH locus comprising a CBE element within the nucleic acid sequence separating the 5′ end of a target VH segment and the 3′ end of the first VH segment which is proximal to the target VHsegment; and/or b) an engineered IgL locus comprising at least one of: i) a non-functional Cer/Sis sequence within the nucleic acid sequence separating the 3′ end of the 3′-most VL segment and the 5′ end of a JL segment; and ii) a CBE element within the nucleic acid sequence separating the 3′ end of a target VL segment and the 5′ end of the first VL segment which is 3′ of the target VL segment.
In one aspect of any of the embodiments, described herein is a cell comprising an engineered IgH locus comprising a CBE element within the nucleic acid sequence separating the 3′ end of a target VH segment and the 5′ end of the first VH segment which is 3′ of the target VH segment. In some embodiments of any of the aspects, the CBE element can be located downstream of the RSS which flanks the 3′ end of the target VH segment. In one aspect of any of the embodiments, described herein is a cell comprising an engineered IgH locus comprising a CBE element within the nucleic acid sequence separating the 5′ end of a target VH segment and the 3′ end of the first VH segment which is proximal to the target VH segment.
In one aspect of any of the embodiments, described herein is a cell an engineered IgL locus comprising a non-functional Cer/Sis sequence within the nucleic acid sequence separating the 3′ end of the 3′-most VL segment and the 5′ end of a JL segment. In one aspect of any of the embodiments, described herein is a cell comprising an engineered IgL locus comprising a CBE element within the nucleic acid sequence separating the 3′ end of a target VL segment and the 5′ end of the first VL segment which is 3′ of the target VL segment. In one aspect of any of the embodiments, described herein is a cell an engineered IgL locus comprising: i) a non-functional Cer/Sis sequence within the nucleic acid sequence separating the 3′ end of the 3′-most VL segment and the 5′ end of a JL segment; and ii) a CBE element within the nucleic acid sequence separating the 3′ end of a target VL segment and the 5′ end of the first VL segment which is 3′ of the target VL segment.
In one aspect of any of the embodiments, described herein is a cell comprising: a) an engineered IgH locus comprising a CBE element within the nucleic acid sequence separating the 3′ end of a target VH segment and the 5′ end of the first VH segment which is 3′ of the target VH segment; and b) an engineered IgL locus comprising: i) a non-functional Cer/Sis sequence within the nucleic acid sequence separating the 3′ end of the 3′-most VL segment and the 5′ end of a JL segment; and ii) a CBE element within the nucleic acid sequence separating the 3′ end of a target VL segment and the 5′ end of the first VL segment which is 3′ of the target VL segment.
In one aspect of any of the embodiments, described herein is a cell comprising a) an engineered IgH locus comprising a CBE element within the nucleic acid sequence separating the 3′ end of a target VH segment and the 5′ end of the first VH segment which is 3′ of the target VH segment; and b) an engineered IgL locus comprising at least one of: i) a non-functional Cer/Sis sequence within the nucleic acid sequence separating the 3′ end of the 3′-most VL segment and the 5′ end of a JL segment; and ii) a CBE element within the nucleic acid sequence separating the 3′ end of a target VL segment and the 5′ end of the first VL segment which is 3′ of the target VL segment. In one aspect of any of the embodiments, described herein is a cell comprising a) an engineered IgH locus comprising a CBE element within the nucleic acid sequence separating the 3′ end of a target VH segment and the 5′ end of the first VH segment which is 3′ of the target VH segment; and b) an engineered IgL locus comprising i) a non-functional Cer/Sis sequence within the nucleic acid sequence separating the 3′ end of the 3′-most VL segment and the 5′ end of a JL segment; and ii) a CBE element within the nucleic acid sequence separating the 3′ end of a target VL segment and the 5′ end of the first VL segment which is 3′ of the target VL segment. In some embodiments of any of the aspects, the CBE element can be located downstream of the RSS which flanks the 3′ end of the target VH segment. In one aspect of any of the embodiments, described herein is a cell comprising a) an engineered IgH locus comprising a CBE element within the nucleic acid sequence separating the 5′ end of a target VH segment and the 3′ end of the first VH segment which is proximal to the target VH segment; and b) an engineered IgL locus comprising at least one of: i) a non-functional Cer/Sis sequence within the nucleic acid sequence separating the 3′ end of the 3′-most VL segment and the 5′ end of a JL segment; and ii) a CBE element within the nucleic acid sequence separating the 3′ end of a target VL segment and the 5′ end of the first VL segment which is 3′ of the target VL segment. In one aspect of any of the embodiments, described herein is a cell comprising a) an engineered IgH locus comprising a CBE element within the nucleic acid sequence separating the 5′ end of a target VH segment and the 3′ end of the first VH segment which is proximal to the target VH segment; and b) an engineered IgL locus comprising i) a non-functional Cer/Sis sequence within the nucleic acid sequence separating the 3′ end of the 3′-most VL segment and the 5′ end of a JL segment; and ii) a CBE element within the nucleic acid sequence separating the 3′ end of a target VL segment and the 5′ end of the first VL segment which is 3′ of the target VL segment.
In some embodiments of any of the aspects, the CBE element can be located downstream of the RSS which flanks the 3′ end of the target VL segment.
As used herein, the term “Ig locus” refers to a locus which either encodes, or can be recombined to encode, a polypeptide chain of an immunoglobin molecule (e.g. a BCR or antibody). The Ig locus can be an IgH locus (encoding the heavy chain of the immunoglobin molecule) or an IgL locus (encoding the light chain of the immunoglobin molecule). An IgL locus can be either an Igκ or an Igλ locus. Prior to VDJ recombination, an IgH locus comprises, from 5′ to 3′, one or more VH segments, one or more DH segments, and one or more JH segments and multiple interspersed sequences, e.g. sequences that regulate and/or control the processes of VDJ recombination and expression. Prior to VDJ recombination, an IgL locus comprises, from 5′ to 3′, one or more VL segments and one or more JL segments and multiple interspersed sequences, e.g. sequences that regulate and/or control the processes of VJ recombination and expression.
As used herein, the term “V segment” refers to the variable segment of an Ig locus. As used herein, the term “D segment” refers to a diversity region segment of an Ig locus. As used herein, the term “J segment” refers to a joining region segment of an Ig locus. The segments can be further specified as being of the heavy or light chain, e.g., VH segment or VL segment respectively. One of skill in the art can readily identify such segments within an Ig locus or immunoglobin molecule. By way of non-limiting example, the structure of immunoglobins is discussed in Janeway et al. (eds.)(2001) Immunobiology. Fifth edition, Garland Sciences; Kabat, E. A., et al. (1991) Sequences of Proteins of Immunological Interest, Fifth Edition, U.S. Department of Health and Human Services, NIH Publication No. 91-3242, and Chothia, C. et al. (1987) J. Mol. Biol. 196:901-917; which are incorporated by reference herein in their entireties.
During B cell development, an IgH DH segment is first recombined with a JH segment, physically joining them together to form a “DJH rearrangement”. A next step in B cell development recombines a VH segment with the DJH rearrangement to form a “VHDJH rearrangement.” That is, a “VHDJH rearrangement” or “DJH rearrangement” is a polynucleotide in which the named segments are recombined and intervening sequences found in the germline have been removed. Similarly, in a IgL locus, a VL segment is recombined with a JL segment, forming a VLJL rearrangement. Such rearrangements can be native constructs found in B cells or constructs created in vitro and optionally introduced into a cell.
A segment of an Ig gene, e.g., a V segment can be, e.g. a germline V segment, an affinity maturation intermediate, or a mature V segment. In some embodiments of any of the aspects, a germline segment can be a segment as found in the genome of a germline cell, e.g. prior to any V(D)J recombination event. In some embodiments of any of the aspects, a maturation intermediate can be a segment after at least one V(D)J recombination event but prior to the completion of the GC reaction and/or SHM. In some embodiments of any of the aspects, a mature segment can be a segment as found in a mature B-cell. A segment, as comprised by a maturation intermediate or a mature segment, is present in the cell as a VDJ rearrangement, having been recombined with a at least one other segment.
Certain segments, e.g., V segments are referred to herein as “target segments.” The target segment is the segment of its type (e.g., VH, VL, D, JH, or JL) which it is desired that the Ig locus will utilize in V(D)J recombination. It is not to be implied that the target V segment will be utilized in 100% of V(D)J recombination events, but that it will be utilized at a much higher rate than it would in the absence of the engineered modifications described herein. It also may be used at a higher rate that others of the same type (e.g. VHs or other VLs). The target segment (e.g., target VH or VL segment) can be a native, wild-type, non-native, exogenous, or engineered segment. In some embodiments of any of the aspects, the target segment (e.g., target VH or VL segment) can be from a different species than the cell, e.g., the cell can be a mouse cell and the target VH or VL segment can be a human segment.
As used herein, the term “native” refers to the sequence found in a particular location in the genome of a non-engineered cell and/or animal. As used herein, the term “non-native” refers to a sequence which varies from the sequence found in a particular location in the genome of a non-engineered cell and/or animal. A non-native sequence can be, e.g. a sequence from a different species or a sequence from the same species which has been moved to a non-native position in the genome. Thus, while a sequence may be “native” to a particular gene in the genome of an un-engineered cell, if it has been moved within the gene in an engineered cell, it is no longer considered native. In some embodiments of any of the aspects, a non-native sequence differs from the native sequence by, at least 5%, e.g. at least 5%, at least 10%, at least 20%, at least 30%, at least 40%, at least 50% or more.
In some embodiments of any of the aspects, the Ig locus is a mouse locus and the target V segment of the Ig locus has been engineered to comprise any V segment other than the original mouse V segment. In some embodiments of any of the aspects, the non-native V segment is a human V segment. In some embodiments of any of the aspects, the non-native V segment is a V segment from a known antibody in need of improvement of affinity, specificity, or breadth for which improvements in any or all of these properties is desired. In some embodiments of any of the aspects, the non-native V segment is a human V segment from a known antibody in need of improvement of affinity or specificity, or breath for which improvement of any or all of these or other properties is desired. In some embodiments of any of the aspects, the non-native V segment is a V segment from a known antibody. In some embodiments of any of the aspects, the non-native V segment is a human V segment from a known antibody. In some embodiments of any of the aspects, the V segment may be a commonly utilized human VH or VL segment.
While the methods and compositions described herein are suitable for use with any V segment, certain specific V segments, which may be from the germline or from a previously affinity matured antibody and thus harbor SHMs, are particularly contemplated for use in the compositions and methods described herein due to their known antigen specificities. Other V segments may be selected due to the high frequency with which the contribute to unselected antibody repertoires such as, but not limited to IGHV1-2*02, IGHV1-69, VH3-30, and VH4-59. The sequences of these VH segments are known in the art, for example, IGHV1-2*02 is described by Genbank Accession No: FN550184.1 (SEQ ID NO: 1) and SEQ ID NO: 13 of International Patent Publication WO 2010/054007; and IGVH1-46 is described by Genbank Accession No: AJ347091.1 (SEQ ID NO: 2). In some embodiments of any of the aspects described herein, the VL segment can be selected from the group consisting of: the frequently utilized Vκs and Vλs including but not limited to Vκ1-5, Vκ3-20, Vκ4-1, Vλ1-51, Vλ3-1, Vλ2-14.
In some embodiments of any of the aspects, the V segments can be the V segments of 2G12 bnAb or VRC42 bnAb. The V segments of 2G12 bnAb are: VH3-21, Vk1-5 and the V segments of VRC42 bnAb are: VH1-69, Vk3-20.
As used herein, “Cer/Sis sequence” refers collectively to Cer and/or Sis elements of Ig loci. Cer (contracting element for recombination) and Sis (silencer in the intervening sequence) elements are known elements of Ig genes. As used herein, “contracting element for recombination” or “Cer” refers to a region located in the IgL locus 3′ of the 3′-most native VL segment and the 5′ end of the 5′-most native JL segment and which controls VJ recombination. Cer is approximately 650 bp in length. Cer can bind CTCF and is DNaseI hypersensitive. As used herein, “silencer in the intervening sequence” or “Sis” refers to a region located in the IgL locus 3′ of the 3′-most native VL segment and the 5′ end of the 5′-most native JL segment and which controls VJ recombination. Sis is approximately 1,500 bp in length. Sis can bind CTCF and Ikaros and is also DNaseI hypersensitive. The structure of Cer and Sis are explained in more detail, e.g., in Xiang et al. J Immunol 190:1819-1826(2013); Liu et al. J Biol Chem 277:32640-32649 (2002); and Liue et al. Immunity. 24:405-415(2006); each of which is incorporated by reference herein.
Exemplary Cer and Sis sequences are provided in Xiang et al. J. Immunol. 190, 1819-1826 (2013) and Xiang et al. J. Immunol. 186, 5356-5366 (2011) which are incorporated by reference herein in their entireties. In some embodiments of any of the aspects, a Cer/Sis sequence can be a sequence having at least 80% sequence identity to to the ˜6.7 kb Cer/Sis sequence of SEQ ID NO: 13, e.g., 80% sequence identity, 85% sequence identity, 90% sequence identity, 95% sequence identity, 98% sequence identity, or greater sequence identity to SEQ ID NO:13. In some embodiments of any of the aspects, a Cer/Sis sequence can be a sequence having at least 95% identity to SEQ ID NO: 13 and the same activity e.g., CTCF-binding activity.
In some embodiments of any of the aspects, a Cer/Sis sequence can be a sequence having at least 80% sequence identity to bp 860-7288 of SEQ ID NO: 13, e.g., 80% sequence identity, 85% sequence identity, 90% sequence identity, 95% sequence identity, 98% sequence identity, or greater sequence identity to bp 860-7288 of SEQ ID NO: 13. In some embodiments of any of the aspects, a Cer/Sis sequence can be a sequence having at least 95% identity to bp 860-7288 of SEQ ID NO: 13 and the same activity e.g., CTCF-binding activity.
In some embodiments of any of the aspects, a Cer sequence can be a sequence having at least 80% sequence identity to bp 860-1529 of SEQ ID NO: 13, e.g., 80% sequence identity, 85% sequence identity, 90% sequence identity, 95% sequence identity, 98% sequence identity, or greater sequence identity to bp 860-1529 of SEQ ID NO:13. In some embodiments of any of the aspects, a Cer/Sis sequence can be a sequence having at least 95% identity to bp 860-1529 of SEQ ID NO: 13 and the same activity e.g., CTCF-binding activity.
In some embodiments of any of the aspects, a Sis sequence can be a sequence having at least 80% sequence identity to bp 3562-7288 of SEQ ID NO: 13, e.g., 80% sequence identity, 85% sequence identity, 90% sequence identity, 95% sequence identity, 98% sequence identity, or greater sequence identity to bp 3562-7288 of SEQ ID NO:13. In some embodiments of any of the aspects, a Cer/Sis sequence can be a sequence having at least 95% identity to bp 3562-7288 of SEQ ID NO: 13 and the same activity e.g., CTCF-binding activity.
Cer and Sis each comprise two CBEs. An exemplary murine wild-type sequence depicting Cer, Sis, and CBE elements is provided as Example 4 herein. Example 4 further demonstrates an exemplary embodiment of a deletion strategy using CRISPR/Cas9 technology to simultaneously delete both Cer and Sis elements (a total of ˜6.7 kb deletion). This deletion accordingly renders both the Cer and Sis non-functional, as detailed in Example 3. It is further contemplated herein that Cer and Sis block RAG scanning from the Jκ RC into the proximal Vκ domain.
Rendering the Cer/Sis sequence in the Igκ locus non-functional causes the 3′-most VLsegments to be subject to V(D)J recombination at an increased rate. In some embodiments of any of the aspects, the engineered IgL, or Igκ locus comprises a non-functional Cer/Sis sequence. A non-functional Cer/Sis sequence can be a Cer/Sis sequence which has 50% or less of the wild-type activity, e.g., 50% or less ability to attenuate VJ rearrangements with the 3′-most VL segments. Methods of measuring the rate of VJ rearrangements comprising any given segment are known in the art, e.g., by HTGTS using Jκ bait primers (see, e.g. Lin et al. PNAS 113 (28) 7846-7851(2016); which is incorporated by reference herein in its entirety).
In some embodiments of any of the aspects, a non-functional Cer or Sis sequence is one in which at least one CBE sequence has been deleted. In some embodiments of any of the aspects, a non-functional Cer or Sis sequence is one in which both CBE sequences have been deleted. In some embodiments of any of the aspects, a non-functional Cer/Sis sequence is one in which all four CBE sequences have been deleted. In some embodiments of any of the aspects, a non-functional Cer/Sis sequence is one in which the Cer/Sis sequence has been deleted. In some embodiments of any of the aspects, a non-functional Cer/Sis sequence is one in which the Cer and/or Sis sequence has been deleted. In some embodiments of any of the aspects, a non-functional Cer/Sis sequence is one in which the Cer/Sis sequence has been deleted, e.g. the sequence corresponding to SEQ ID NO:13, bp 860-7288 of SEQ ID NO: 13, bp 860-1592 of SEQ ID NO: 13 and/or bp 3562-7288 of SEQ ID NO: 13 has been deleted.
In some embodiments of any of the aspects, a non-functional Cer/Sis sequence is one in which one or more CBE sequences have been deleted, e.g., a contiguous sequence comprising all four CBE sequences has been deleted, or any portion of the Cer/Sis comprising at least one CBE sequence has been deleted. In some embodiments of any of the aspects, a non-functional Cer/Sis sequence is one in which one or more CBE sequences have been mutated.
As used herein, “CTCF-binding element” or “CBE” refers to a nucleotide sequence bound by CTCF. A number of CBE's are known to exist in Ig loci, and further detail of CBE structure is provided, e.g., in Guo et al. Nature 2011 477-424-431; which is incorporated by reference herein in its entirety. In some embodiments of any of the aspects, a CBE can comprise or consist of any of SEQ ID Nos: 3-12.
Further exemplary CBE sequences are described in Xiang et al. J. Immunol. 190, 1819-1826 (2013), which is incorporated by reference herein in its entirety, in which each of the two CBE sequences within both Cer and Sis elements (which are referred to therein as HS1-2 and HS3-6, respectively) are highlighted in
CBEs can be rendered non-functional by, e.g., mutating the CBE or deleting the CBE. Mutating the sequence of a CBE sequence, such that CTCF binding is reduced by at least 25% (e.g. reduced by 25% or more, 50% or more, or 75% or more) can render the CBE non-functional. Binding of CTCF to a given mutated CBE can be readily measured, e.g., EMSA or ChIP)-Non-limiting examples of such mutations are described, e.g., in Guo et al. Nature 2011 477-424-431 and Jain et al., Cell (2018); which is incorporated by references herein in their entireties.
In some embodiments of any of the aspects, the CBE element is located 5′ of at least one V segment in the locus, e.g., the target V segment is not the 3′ most V segment. The CBE element is contemplated to be arranged in either orientation with respect to the target segment, e.g., it can be in the same orientation or inverted with respect to the target segment.
In some embodiments of any of the aspects, the CBE element can be contiguous with the target V segment. In some embodiments of any of the aspects, the CBE element can be 3′ of the target V segment's recombination signal sequence. In some embodiments of any of the aspects, the CBE element can be 1 bp or more 3′ of the target V segment's recombination signal sequence, e.g., 1 bp, 3 bp, 5 bp, 10 bp, 15 bp, or further 3′ of the target V segment's recombination signal sequence. In some embodiments of any of the aspects, the CBE element can be 15 bp or more 3′ of the target V segment's recombination signal sequence. In some embodiments of any of the aspects, the CBE element can be about 15 bp 3′ of the target V segment's recombination signal sequence.
In some embodiments of any of the aspects, the cell can further comprise an engineered non-functional IGCR1 sequence in the IgH within the nucleic acid sequence separating the 3′ end of the 3′-most VH segment of the IgH locus and the 5′ end of a DH segment of the IgH locus. Rendering the IGCR1 sequence of an IgH locus non-functional causes the 3′-most VH segment to be recombined into a VDJ segment at an even higher rate. In some embodiments, when the IGCR1 sequence is non-function, the VH segment which will recombine into a VDJ segment most frequently is the most 3′VH segment with an associated CBE just downstream of it (e.g., downstream of its RSS). Such a CBE can be naturally occurring engineered as described herein. In some embodiments of any of the aspects, the engineered IgH gene comprises a non-functional IGCR1 sequence. As used herein, “intergenic control region 1” or “IGCR1” refers to a region located in the IgH locus the 3′ end of the 3′-most native VH segment and the 5′ end of the 5′-most native DH segment and controls VDJ recombination. The IGCR1 is approximately 4.1 kb in length The IGCR1 comprises two CTCF-binding elements (CBEs) that are required for IGCR1 function. The structure of IGCR1 and the CBEs is explained in more detail, e.g., in Guo et al. Nature 2011 477-424-431; which is incorporated by reference herein in its entirety. A non-functional IGCR1 sequence can be an IGCR1 sequence which has 50% or less of the wild-type activity, e.g., 50% or less ability to form V(D)J rearrangements with VH segments other than the 3′-most VHsegment. Methods of measuring the rate of VDJ rearrangements comprising any given segment are known in the art, e.g., by HTGTS using JH bait primers (see, e.g., Lin et al. PNAS 113 (28) 7846-7851(2016); which is incorporated by reference herein in its entirety.)
In some embodiments of any of the aspects, a non-functional IGCR1 sequence is one in which at least one CBE sequence has been deleted. In some embodiments of any of the aspects, a non-functional IGCR1 sequence is one in which both CBE sequences have been deleted. In some embodiments of any of the aspects, a non-functional IGCR1 sequence is one in which the IGCR1 sequence has been deleted, e.g. the 4.1 kb comprising IGCR1 has been deleted. In some embodiments of any of the aspects, a non-functional IGCR1 sequence is one in which one or more CBE sequences have been deleted, e.g., the 2.6 kb sequence comprising both CBE sequences has been deleted, or any portion of that 2.6 kb sequence comprising at least one CBE sequence has been deleted. In some embodiments of any of the aspects, a non-functional IGCR1 sequence is one in which one or more CBE sequences have been mutated. Mutating the sequence of a CBE sequence, such that CTCF binding is reduced by at least 25% (e.g. reduced by 25% or more, 50% or more, or 75% or more) can render the IGCR1 non-functional. Binding of CTCF to a given mutated CBE can be readily measured, e.g., EMSA or ChIP. Non-limiting examples of such mutations are described, e.g., in Guo et al. Nature 2011 477-424-431; and Jain et al., Cell (2018) which is incorporated by reference herein in its entirety.
If a particular VH segment, JH segment, D segment, assembled DJH segment, assembled VHDJH segment, heavy chain sequence, VL segment, JL segment, assembled VLJL segment, and/or light chain sequence is desired to be present in the mature antibody or antibodies produced by a cell and/or animal described herein, the IgH and/or IgL locus can be further engineered to comprise such a sequence of interest. In some embodiments of any of the aspects, the locus can be engineered to comprise the sequence of interest such that it is one possible segment of its type that can be recombined to form a mature antibody sequence (e.g. a human JH segment can be introduced into a murine IgH locus while retaining at least one native mouse JH segment). In some embodiments of any of the aspects, the locus can be engineered to comprise the sequence of interest such that it will be the segment of its type that will be present in all mature antibody sequences (e.g., a human JH segment or human DJH intermediate can be introduced into a murine IgH locus in which all native murine JH segments are deleted or disabled).
In some embodiments of any of the aspects, the JH locus can be replaced by a human D and JHcassette or a cassette with an assembled human DJH. In some embodiments of any of the aspects, one or more DH, one or more JH segments, and/or a DJH fusion comprise a cassette targeting sequence. In some embodiments of any of the aspects, the IgH locus comprises one or more non-native DH segments. In some embodiments of any of the aspects, the IgH locus comprises one DH segment. In some embodiments of any of the aspects, the IgH locus comprises one or more non-native JH segments. In some embodiments of any of the aspects, the IgH locus comprises one JH segment. In some embodiments of any of the aspects, the IgH locus comprises murine IgH locus sequence. In some embodiments of any of the aspects, the IgH locus comprises human IgH locus sequence. In some embodiments of any of the aspects, the locus comprises humanized IgH locus sequence.
In some embodiments of any of the aspects, the IgL locus comprises one or more non-native JL segments. In some embodiments of any of the aspects, the IgL locus comprises one JL segment. In some embodiments of any of the aspects, the IgL locus comprises murine IgL locus sequence. In some embodiments of any of the aspects, the IgL locus comprises human IgL locus sequence. In some embodiments of any of the aspects, the locus comprises humanized IgL locus sequence.
In some embodiments of any of the aspects, the IgL locus can be engineered to comprise human sequence, to be a humanized IgL locus, or to be a human IgL locus. In some embodiments of any of the aspects, the IgH locus can be engineered to comprise human sequence, to be a humanized IgH locus, or to be a human IgH locus.
In some embodiments of any of the aspects, a cell described herein can comprise an IgL locus with one VL segment. In some embodiments of any of the aspects, a cell described herein can comprise an IgL locus with one JL segment. In some embodiments of any of the aspects, a cell described herein can comprise a human rearranged VLJL at the IgL locus. In some embodiments of any of the aspects, the IgL gene encodes IGκV1.
In some embodiments of any of the aspects, a cell described herein can comprise an IgH locus with one VH segment. In some embodiments of any of the aspects, a cell described herein can comprise an IgH locus with one D segment. In some embodiments of any of the aspects, a cell described herein can comprise an IgH locus with one JH segment. In some embodiments of any of the aspects, a cell described herein can comprise a human rearranged VHDJH at the IgH locus.
The methods and compositions described herein can relate to the production of antibodies in a manner that capitalizes on the variation produced by, e.g., the GC response and SHM. In some embodiments of any of the aspects, a cell described herein can further comprise a mutation capable of activating, inactivating or modifying genes that in a lymphocyte-intrinsic fashion lead to increased GC antibody maturation responses. Such mutations are known in the art and can include, by way of non-limiting example PTEN−/− (see, e.g., Rolf et al. Journal of Immunology 2010 185:4042-4052; which is incorporated by reference herein in its entirety)/
In some embodiments of any of the aspects, the Ig locus and/or target segment can further comprise a cassette targeting sequence, e.g., to permit insertion and/or replacement of sequences in the Ig locus and/or target segment. As used herein, the term “cassette targeting sequence” refers to a sequence that permits a sequence of interest (e.g. a sequence comprising a V segment of interest), to be inserted into the genome at the location of the cassette targeting sequence via the action of at least one enzyme that targets the cassette targeting sequence. Non-limiting examples of cassette targeting sequences are an I-SceI meganuclease site; a Cas9/CRISPR target sequence; a Talen target sequence; a zinc finger nuclease (ZFN) and a recombinase-mediated cassette exchange system. Such cassette targeting systems are known in the art, see, e.g. Clark and Whitelaw Nature Reviews Genetics 2003 4:825-833; which is incorporated by reference herein in its entirety. In some embodiments of any of the aspects, the cassette targeting sequence permits the replacement of the 3′-most VH segment.
I-SceI, Zinc finger nucleases (ZFNs), the Cas9/CRISPR system, and transcription-activator like effector nucleases (TALENs) are nucleases. Nucleases are found commonly in microbial species and have the unique property of having very long recognition sequences (>14 bp) thus making them naturally very specific for cutting at a desired location. This can be exploited to make site-specific double-stranded breaks in, e.g. a genome. These nucleases can cut and create specific double-stranded breaks at a desired location(s) in the genome, which are then repaired by cellular endogenous processes such as, homologous recombination (HR), homology directed repair (HDR) and non-homologous end-joining (NHEJ). NHEJ directly joins the DNA ends in a double-stranded break, while HDR utilizes a homologous sequence as a template for regenerating the missing DNA sequence at the break point. Thus, by introducing, e.g., a ZFN, CRISPR, and/or TALENs specific for the cassette targeting sequence into a cell, at least one double strand-break can be generated in the genome, resulting in a template sequence, e.g. a sequence comprising a segment of interest, being used to repair the break, thereby introducing the template sequence into the genome and the desired location (see, e.g. Gaj et al. Trends in Biotechnology 2013 31:397-405; Carlson et al. PNAS 2012 109:17382-7; and Wang et al. Cell 2013 153:910-8; each of which is incorporated by reference herein in its entirety).
Mutagenesis and high throughput screening methods have been used to create nuclease and/or meganuclease variants that recognize unique sequences. For example, various nucleases have been fused to create hybrid enzymes that recognize a new sequence. Alternatively, DNA interacting amino acids of the nuclease can be altered to design sequence specific nucleases (see e.g., U.S. Pat. No. 8,021,867). Nucleases can be designed using the methods described in e.g., Certo, M T et al. Nature Methods (2012) 9:073-975; U.S. Pat. Nos. 8,304,222; 8,021,867; 8,119,381; 8,124,369; 8,129,134; 8,133,697; 8,143,015; 8,143,016; 8,148,098; or 8,163,514, the contents of each are incorporated herein by reference in their entirety. Alternatively, nucleases with site specific cutting characteristics can be obtained using commercially available technologies e.g., Precision BioSciences' Directed Nuclease Editor™ genome editing technology.
ZFNs and TALENs restriction endonuclease technology utilizes a non-specific DNA cutting enzyme which is linked to a specific DNA sequence recognizing peptide(s) such as zinc fingers and transcription activator-like effectors (TALEs). Typically, an endonuclease whose DNA recognition site and cleaving site are separate from each other is selected and its cleaving portion is separated and then linked to a sequence recognizing peptide, thereby yielding an endonuclease with very high specificity for a desired sequence. An exemplary restriction enzyme with such properties is FokI. Additionally, FokI has the advantage of requiring dimerization to have nuclease activity and this means the specificity increases dramatically as each nuclease partner recognizes a unique DNA sequence. To enhance this effect, FokI nucleases have been engineered that can only function as heterodimers and have increased catalytic activity. The heterodimer functioning nucleases avoid the possibility of unwanted homodimer activity and thus increase specificity of the double-stranded break.
In some embodiments of any of the aspects, the Cas9/CRISPR system can be used to introduce sequences at a cassette targeting sequence as described herein. Clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) systems are useful for, e.g. RNA-programmable genome editing (see e.g., Marraffini and Sontheimer. Nature Reviews Genetics 2010 11:181-190; Sorek et al. Nature Reviews Microbiology 2008 6:181-6; Karginov and Hannon. Mol Cell 2010 1:7-19; Hale et al. Mol Cell 2010:45:292-302; Jinek et al. Science 2012 337:815-820; Bikard and Marraffini Curr Opin Immunol 2012 24:15-20; Bikard et al. Cell Host & Microbe 2012 12:177-186; all of which are incorporated by reference herein in their entireties). A CRISPR guide RNA is used that can target a Cas enzyme to the desired location in the genome, where it generates a double strand break. This technique is known in the art and described, e.g. at Mali et al. Science 2013 339:823-6; which is incorporated by reference herein in its entirety and kits for the design and use of CRISPR-mediated genome editing are commercially available, e.g. the PRECISION X CAS9 SMART NUCLEASE™ System (Cat No. CAS900A-1) from System Biosciences, Mountain View, CA.
In some embodiments of any of the aspects, a CRISPR, TALENs, or ZFN molecule (e.g. a peptide and/or peptide/nucleic acid complex) can be introduced into a cell, e.g. a cultured ES cell, such that the presence of the CRISPR, TALENs, or ZFN molecule is transient and will not be detectable in the progeny of, or an animal derived from, that cell. In some embodiments of any of the aspects, a nucleic acid encoding a CRISPR, TALENs, or ZFN molecule (e.g. a peptide and/or multiple nucleic acids encoding the parts of a peptide/nucleic acid complex) can be introduced into a cell, e.g. a cultured ES cell, such that the nucleic acid is present in the cell transiently and the nucleic acid encoding the CRISPR, TALENs, or ZFN molecule as well as the CRISPR, TALENs, or ZFN molecule itself will not be detectable in the progeny of, or an animal derived from, that cell. In some embodiments of any of the aspects, a nucleic acid encoding a CRISPR, TALENs, or ZFN molecule (e.g. a peptide and/or multiple nucleic acids encoding the parts of a peptide/nucleic acid complex) can be introduced into a cell, e.g. a cultured ES cell, such that the nucleic acid is maintained in the cell (e.g. incorporated into the genome) and the nucleic acid encoding the CRISPR, TALENs, or ZFN molecule and/or the CRISPR, TALENs, or ZFN molecule will be detectable in the progeny of, or an animal derived from, that cell.
Recombinase-mediated cassette exchange systems (RMCEs) utilize recombinases (e.g. Flp) and the sequences recognized by the recombinases (e.g., FRT target sites) to swap sequences from the genome, flagged by the FRT target sites with sequences in a cassette that are likewise flanked by the FRT target sites. RMCEs are known in the art, e.g., Cesari et al. Genesis 2004 38:87-92 and Roebroek et al. Mol Cell Biol 2006 26:605-616; each of which is incorporated by reference herein in its entirety.
It can be difficult to isolate and/or produce antibodies comprising a particular segment (e.g., V segment) because that segment is selected against, e.g. if that segment is particularly likely to recognize a self-antigen, B-cells with the segment are more likely to be selected against. Such segments are termed “maturation-incompatible” herein. This term does not imply that B-cells expressing a BCR and/or antibody comprising such a segment are invariably subject to clonal deletion and/or anergy. Provided herein are methods and compositions for avoiding clonal deletion and/or anergy during B-cell development and causing B-cells to express a maturation-incompatible segment at a desired timepoint in development, e.g. after clonal deletion and/or anergy is likely to occur. These methods and compositions involve inserting a passenger V(D)J exon into a Ig locus in such a manner that while present in the locus, it will be neither expressed nor removed by normal Ig V(D)J recombination. A B cell comprising the passenger V(D)J exon will express a second, maturation-compatible, V(D)J exon (e.g. one generated by Ig V(D)J recombination) and at a desired time, the sequence of the locus can be manipulated to cause the passenger V(D)J exon to be expressed instead of the maturation-compatible exon. As used herein, a “passenger” exon is an exon that is present in the germline and mature B-cell genome but is not expressed until the genome is subjected to an induced recombination event, e.g. a Cre-mediated recombination event.
In a first approach, the maturation-incompatible segment (e.g. as part of a passenger V(D)J exon) is inserted into the Ig locus in a 3′ to 5′ conformation relative to the Ig locus and is located 5′ of the maturation-compatible V(D)J exon (or the sequences that will be recombined to make the maturation-compatible V(D)J exon). Expression of the passenger V(D)J exon is induced by the use of a pair of inverted recombinase sites, which cause the passenger V(D)J exon to be “flipped” so that it is in the 5′ to 3′ orientation with respect to the rest of the Ig locus. In a second approach, the maturation-incompatible segment, (e.g. as part of a passenger V(D)J exon) is inserted 5′ to 3′ with respect to the Ig locus and V(D)J recombination occurs downstream of the passenger exon to generate a maturation-compatible V(D)J exon. The maturation-compatible V(D)J exon can then be excised by inducing recombination (e.g., Cre-mediated recombination) at a pair of recombinase sites when desired, causing the cell to express the passenger exon. As an illustrative example, a known functional driver V(D)J exon can be used to permit B cell development with a passenger exon just upstream and not expressed due to transcriptional terminators or other blocks. The driver and transcription blocks are flanked by loxP elements and deleted by CD21 cre expression in periphery to allow passenger expression. This approach has been used successfully to express several HIV bnAB V(D)J intermediates that otherwise could not be expressed in periphery.
Recombination sites and systems for inducing recombination at these sites are known in the art, e.g. the cre-Lox system or the Flp recombinase. The loxP-Cre system utilizes the expression of the PI phage Cre recombinase to catalyze the excision or inversion of DNA located between flanking lox sites. By using gene-targeting techniques to produce binary transgene animals with modified endogenous genes that can be acted on by Cre or Flp recombinases expressed under the control of tissue-specific promoters, site-specific recombination may be employed to excise or invert sequences in a spatially or time-controlled manner. See, e.g., U.S. Pat. Nos. 6,080,576, 5,434,066, and 4,959,317; and Joyner, A. L., et al. Laboratory Protocols for Conditional Gene Targeting, Oxford University Press, New York (1997); Orban et al. (1992) PNAS 89:6861-6865; Aguzzi A, Brandner S, Isenmann S, Steinbach JP, Sure U. Glia. 1995 November; 15(3):348-64. Review; each of which is incorporated by reference herein in its entirety.
In some embodiments of any of the aspects, the cell further comprises a gene encoding a recombinase that will induce recombination at the recombinase site. In some embodiments of any of the aspects, the recombinase site is a LoxP site. In some embodiments of any of the aspects, the cell further comprises a gene encoding cre recombinase. A gene encoding a recombinase can be under the control of, e.g. an inducible promoter or a cell-specific promoter. Inducible promoters, temporally-specific, and tissue-specific promoters for the control of a recombinase are known in the art. In some embodiments of any of the aspects, the gene encoding a recombinase is under the control of a promoter which is not active in immature B cells and is active in peripheral B cells, e.g. the CD21 promoter, CD84 promoter. In some embodiments of any of the aspects, the gene encoding the recombinase is not active in all mature B cells but is preferentially expressed in germinal center B cells. Exemplary promoters for germinal center specific, or at least biased, expression include, but are not limited to, the Iγ1 or AID promoters.
In some embodiments of any of the aspects, the cell is heterozygous for the engineered Ig locus (or loci) as described herein and the other Ig locus or (loci) has been engineered to be inactive, wherein the cell will express an Ig chain only from the engineered Ig locus as described herein. The inactive Ig locus can be, by way of non-limiting example, deleted, partially deleted, and/or mutated (e.g. to inactivate sequences necessary for V(D)J recombination can be mutated and/or deleted (e.g. deleting the JH portion of the locus).
To further address whether Human VκJκ repertoires might show increased junctional diversity versus those of mouse VkJk repertoires, HTGTS-Rep-seq analysis was performed on DNA from WT mouse IgM+ splenic B cells and human peripheral blood mononuclear cells (PBMCs) using a mouse or human JK1 bait as a primer. To obviate the possibility of influences of cellular selection, presented are results of out of frame (non-productive) VκJκ junctions. This analysis demonstrated a markedly greater incorporation of P and/or N junctional elements into the human VκJκ junctions versus the mouse VκJκ junctions (
In some embodiments of any of the aspects, the TdT sequence can be operably linked to a promoter, e.g., a promoter active in B cells. In some embodiments of any of the aspects, the promoter is a strong promoter, a constitutively active promoter, and/or a synthertic promoter. Exemplary but non-hinting promoters are the “CAG” promoter—a combined sequences of cytomegalovirus (CMV) early enhancer element (“C”), the promoter, the first exon and the first intron of chicken beta-actin gene (“A”), and the splice acceptor of the rabbit beta-globin gene (“G”)), the Eμ-N-myc promoter (Bentolila et al., J I 158(2):715-723 (1997)), or other promoters that enforce TDT expression in developing pro and pre B lymphocytes. In some embodiments of any of the aspects, the TdT-encoding sequence can be present in a vector and/or stably integrated into the genome of the cell (e.g., at the Rosa26 locus) that is stably integrated into the constitutively expressed mouse Rosa26 locus.
In some embodiments of any of the aspects, a nucleic acid encoding a polypeptide as described herein (e.g. a TdT polypeptide) is comprised by a vector. In some of the aspects described herein, a nucleic acid sequence encoding a given polypeptide as described herein, or any module thereof, is operably linked to a vector. The term “vector”, as used herein, refers to a nucleic acid construct designed for delivery to a host cell or for transfer between different host cells. As used herein, a vector can be viral or non-viral. The term “vector” encompasses any genetic element that is capable of replication when associated with the proper control elements and that can transfer gene sequences to cells. A vector can include, but is not limited to, a cloning vector, an expression vector, a plasmid, phage, transposon, cosmid, chromosome, virus, virion, etc.
As used herein, the term “expression vector” refers to a vector that directs expression of an RNA or polypeptide from sequences linked to transcriptional regulatory sequences on the vector. The sequences expressed will often, but not necessarily, be heterologous to the cell. An expression vector may comprise additional elements, for example, the expression vector may have two replication systems, thus allowing it to be maintained in two organisms, for example in human cells for expression and in a prokaryotic host for cloning and amplification. The term “expression” refers to the cellular processes involved in producing RNA and proteins and as appropriate, secreting proteins, including where applicable, but not limited to, for example, transcription, transcript processing, translation and protein folding, modification and processing. “Expression products” include RNA transcribed from a gene, and polypeptides obtained by translation of mRNA transcribed from a gene. The term “gene” means the nucleic acid sequence which is transcribed (DNA) to RNA in vitro or in vivo when operably linked to appropriate regulatory sequences. The gene may or may not include regions preceding and following the coding region, e.g. 5′ untranslated (5′UTR) or “leader” sequences and 3′ UTR or “trailer” sequences, as well as intervening sequences (introns) between individual coding segments (exons).
As used herein, the term “viral vector” refers to a nucleic acid vector construct that includes at least one element of viral origin and has the capacity to be packaged into a viral vector particle. The viral vector can contain the nucleic acid encoding a polypeptide as described herein in place of non-essential viral genes. The vector and/or particle may be utilized for the purpose of transferring any nucleic acids into cells either in vitro or in vivo. Numerous forms of viral vectors are known in the art.
By “recombinant vector” is meant a vector that includes a heterologous nucleic acid sequence, or “transgene” that is capable of expression in vivo. It should be understood that the vectors described herein can, In some embodiments of any of the aspects, be combined with other suitable compositions and therapies. In some embodiments of any of the aspects, the vector is episomal. The use of a suitable episomal vector provides a means of maintaining the nucleotide of interest in the subject in high copy number extra chromosomal DNA thereby eliminating potential effects of chromosomal integration.
In some embodiments of any of the aspects, described herein is a cell comprising: a) an engineered IgH locus comprising at least one of:
A cell as described herein can be, by way of non-limiting example, a stem cell, an embryonic stem cell, a B cell, a mature B cell, an immature B cell, and/or a hybridoma cell. A cell as described herein can be, by way of non-limiting example, a mammalian cell, a human cell, and/or a mouse cell. In some embodiments of any of the aspects, a cell as described herein can be a mouse embryonic stem cell.
In one aspect, described herein is genetically engineered mammal comprising an engineered cell as described herein. In some embodiments of any of the aspects, the mammal can be a mouse. In some embodiments of any of the aspects, the methods described herein, e.g. methods of producing antibodies and/or testing antigens require only that the B-cells of the genetically engineered mammal are engineered as described herein. Accordingly, in some embodiments of any of the aspects, the genetically engineered mammal can be a chimera, e.g. it can comprise two genetically distinct populations of cells. The use of chimeras can expedite the process of obtaining a genetically engineered mammal to be used in the methods described herein. In one aspect, described herein is a chimeric genetically engineered mammal, e.g. a mouse, comprising two populations of cells, a first population comprising cells which are V(D)J recombination-defective; and a second population comprising engineered cells as described herein. V(D)J recombination-defective cells mice are known in the art, e.g. RAG2−/− cells.
In some embodiments of any of the aspects, the mammal, e.g., the genetically engineered mammal described herein, is a mouse.
In one aspect of any of the embodiments, provided herein is a set of at least two mammals, wherein each mammal is a mammal comprising an engineered Ig locus(loci) as described herein, the first mammal comprising a first target VH segment and/or a first target VL segment and each further mammal comprising a further target VH segment and/or a further target VL segment. In some embodiments of any of the aspects, each mammal comprises a human target VH segment and a human target VL segment.
For example, a mammal with an engineered IgH locus can be bred with a mammal with an engineered IgL locus to make a system in which the derived mammal would have both IgH and Igk rearranging loci. Such animals can be used for immunization to discover and or optimize novel humanized antibodies. Sets of such mammals can be provided for each of the frequently utilized human VHs and VL so that multiple combinations are available within the set. In some embodiments, the mice can have IGCR1 deleted for IgH with human VH replacing VH 81X (with its own CBE) or more proximal VH5-1 (with added CBE) and Igκ locus with Cer/Sis deleted and proximal Vκ replaced with human Vκ or Vλ (e.g., with replace Vλ23 RSS replaced with Vκ 12RSS to preserve pairing with Jκ 23RSS). Such mammals can also be produced by engineering all mutations in a single ES cell and reconstituting B cells (and T cells) in RAG-deficient chimeras for immunization via a RAG blastocyste complementation approach (e.g. see Tian et al., 201i6 which is incorporated by reference herein in its entirety).
The cells and mammals described herein permit the optimization, improvement, or modification of known antibodies. By engineering the cell and/or mammal to express antibodies (which are subject to V(D)J recombination, the GC reaction, and/or SHM), comprising segment(s) known to recognize a particular antigen (e.g. segment(s) from a known antibody that recognizes the particular antigen), a large number of precursor antibodies can be generated which are related to and/or derived from segments of the known antibody. These antibodies can be screened and/or selected, in vitro and/or in vivo for optimized characteristics relative to the known antibody. Optimization can be an increase in, e.g. affinity, breadth, and/or specificity or other desired characteristics.
In one aspect, described herein is method of making an optimized antibody from a known antibody, the method comprising the steps of: injecting a mouse blastocyst with a cell as described herein, wherein the cell is a mouse embryonic stem cell, and wherein the target segment comprises the VH or VLsegment of a known antibody; implanting the mouse blastocyst into a female mouse under conditions suitable to allow maturation of the blastocyst into a genetically engineered mouse; and isolating 1) an optimized antibody comprising the non-native V segment; or 2) a cell producing an optimized antibody comprising the non-native V segment from the genetically engineered mouse. In some embodiments of any of the aspects, the blastocyst cells are V(D)J recombination-defective cells, e.g. RAG2−/− cells. In some embodiments of any of the aspects, the IgH and/or IgL loci of the blastocyst cells have been rendered non-functional, as described elsewhere herein. In some embodiments of any of the aspects, the blastocyst cells are not capable of forming mature B cells, and optionally are not capable of forming mature T-cells. In some embodiments of any of the aspects, the blastocyst cells are not capable of forming mature lymphocytes.
In some embodiments of any of the aspects, the method can further comprise a step of immunizing the genetically engineered mouse with a desired target antigen before the isolating step. In some embodiments of any of the aspects, the method can further comprise a step of producing a monoclonal antibody from at least one cell of the genetically engineered mouse.
Once the cell as described herein is produced through the methods described herein, an animal can be produced from this cell through either stem cell technology or cloning technology. For example, if the cell into which the nucleic acid was transfected was a stem cell for the organism (e.g. an embryonic stem cell), then this cell, after transfection and culturing, can be used to produce an organism which will contain the engineered aspects in germline cells, which can then in turn be used to produce another animal that possesses the engineered aspects in all of its cells. In other methods for production of an animal containing the engineered aspects, cloning technologies can be used. These technologies generally take the nucleus of the engineered cell and either through fusion or replacement fuse the engineered nucleus with an oocyte which can then be manipulated to produce an animal. The advantage of procedures that use cloning instead of ES technology is that cells other than ES cells can be transfected. For example, a fibroblast cell, which is very easy to culture can be used as the cell which is engineered, and then cells derived from this cell can be used to clone a whole animal.
Production of the engineered animals described herein can, in some embodiments, also utilize RAG2-deficient blastocyst complementation (RDBC) technology, which is known in the art and described, e.g., in Chen et al. PNAS 90:4528-4532 (1993); Tian et al., Cell 166:1471-1484(2016); which are incorporated by reference herein in their entireties.
The engineered animals described herein can also be produced by zygote micro-injection/electroporation. Such methods are known in the art and described at, e.g., Wang et al. Cell. 2013; 153(4):910-8; Yang et al. Cell. 2013; 154(6):1370-9; Yasue et al. Scientific reports. 2014; 4:5705; Hashimoto et al. Developmental biology. 2016; 418(1):1-9; and Wang et al. BioTechniques. 2015; 59(4):201-2, 4, 6-8; each of which is incorporated by reference herein in its entirety.
Generally, cells (e.g. ES cells) used to produce the engineered animals will be of the same species as the animal to be generated. Thus, for example, mouse embryonic stem cells will usually be used for generation of engineered mice. Methods of isolating, culturing, and manipulating various cells types are known in the art. By way of non-limiting example, embryonic stem cells are generated and maintained using methods well known to the skilled artisan such as those described by Doetschman et al. (1985) J. Embryol. Exp. Mol. Biol. 87:27-45). The cells are cultured and prepared for genetic engineering using methods well known to the skilled artisan, such as those set forth by Robertson in: Teratocarcinomas and Embryonic Stem Cells: A Practical Approach, E. J. Robertson, ed. IRI. Press, Washington, D.C. [1987]); by Bradley et al. (1986) Current Topics in Devel. Biol. 20:357-371); and by Hogan et al. (Manipulating the Mouse Embryo: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1986)).
In some embodiments of any of the aspects, after cells comprising the engineered aspects have been generated, and optionally, selected, the cells can be inserted into an embryo or blastocyst, e.g. to generate a chimera. Insertion may be accomplished in a variety of ways known to the skilled artisan, however the typical method is by microinjection. For microinjection, about 10-30 cells are collected into a micropipet and injected into embryos that are at the proper stage of development to permit integration of the engineered ES cell into the developing embryo or blastocyst. For instance, the ES cells can be microinjected into blastocysts. The suitable stage of development for the embryo used for insertion of ES cells is very species dependent, however for mice it is about 3.5 days. The embryos are obtained by perfusing the uterus of pregnant females. Suitable methods for accomplishing this are known to the skilled artisan.
Methods of isolating antibodies and/or antibody-producing cells are known in the art, and can include, by way of non-limiting example, producing a monoclonal antibody via, e.g., the production of hybridomas or phage display. See, e.g., Little et al. Immunology Today 2000 21:364-370; Pasqualini et al. PNAS 2004 101:257-259; Reichert et al. Nature Reviews Drug Discovery 2007 6:349-356; and Wang et al. Antibody Technology Journal 2011 1:1-4; each of which is incorporated by reference herein in its entirety.
In one aspect, described herein is an optimized antibody produced by the method described above herein.
Certain vaccine development strategies rely upon identifying one or more intermediate antigens, such that immunization with the one or more intermediate antigens will trigger B cell activation and diversification of antibodies, resulting in the production of an antibody that will recognize the final target antigen (e.g. an HIV antigen). Accordingly, described herein are methods and compositions that permit the in vivo evaluation of such intermediate antigens. In some embodiments of any of the aspects, structural information about antibodies that will recognize the final target antigen is known, e.g. what VH or VL segment is comprised by antibodies to HIV antigens in those rare subjects with a natural antibody defense against HIV. Using the methods and compositions described herein, the ability of an intermediate antigen to activate B cells comprising antibodies with such a VH or VL segment can be assessed, permitting the development of multiple antigen immunization therapies.
In one aspect, described herein is a method of identifying a candidate antigen as an antigen that activates a B cell population comprising a V segment of interest, the method comprising: immunizing an engineered mammal as described herein, engineered such that a majority of the mammal's peripheral B cells express the V segment(s) of interest, with the antigen; measuring B cell activation in the mammal; and identifying the candidate antigen as an activator of a B cell population comprising the V segment(s) of interest if the B cell activation in the mammal is increased relative to a reference level. B cell activation can be, e.g. an increase in the somatic hypermutation status of the Ig variable region, an increase in the affinity of mature antibodies for the antigen, and/or an increase in the specificity of mature antibodies for the antigen. As used herein, the term “activator,” as used in reference to activation of B cells refers to an antigen that increases B cell activation, e.g. increases B cell proliferation, SHM, and/or the GC reaction.
For convenience, the meaning of some terms and phrases used in the specification, examples, and appended claims, are provided below. Unless stated otherwise, or implicit from context, the following terms and phrases include the meanings provided below. The definitions are provided to aid in describing particular embodiments, and are not intended to limit the claimed invention, because the scope of the invention is limited only by the claims. Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. If there is an apparent discrepancy between the usage of a term in the art and its definition provided herein, the definition provided within the specification shall prevail.
For convenience, certain terms employed herein, in the specification, examples and appended claims are collected here.
The terms “decrease”, “reduced”, “reduction”, or “inhibit” are all used herein to mean a decrease by a statistically significant amount. In some embodiments of any of the aspects, “reduce,” “reduction” or “decrease” or “inhibit” typically means a decrease by at least 10% as compared to a reference level (e.g. the absence of a given treatment or agent) and can include, for example, a decrease by at least about 10%, at least about 20%, at least about 25%, at least about 30%, at least about 35%, at least about 40%, at least about 45%, at least about 50%, at least about 55%, at least about 60%, at least about 65%, at least about 70%, at least about 75%, at least about 80%, at least about 85%, at least about 90%, at least about 95%, at least about 98%, at least about 99%, or more. As used herein, “reduction” or “inhibition” does not encompass a complete inhibition or reduction as compared to a reference level. “Complete inhibition” is a 100% inhibition as compared to a reference level. A decrease can be preferably down to a level accepted as within the range of normal for an individual without a given disorder.
The terms “increased”, “increase”, “enhance”, or “activate” are all used herein to mean an increase by a statically significant amount. In some embodiments of any of the aspects, the terms “increased”, “increase”, “enhance”, or “activate” can mean an increase of at least 10% as compared to a reference level, for example an increase of at least about 20%, or at least about 30%, or at least about 40%, or at least about 50%, or at least about 60%, or at least about 70%, or at least about 80%, or at least about 90% or up to and including a 100% increase or any increase between 10-100% as compared to a reference level, or at least about a 2-fold, or at least about a 3-fold, or at least about a 4-fold, or at least about a 5-fold or at least about a 10-fold increase, or any increase between 2-fold and 10-fold or greater as compared to a reference level. In the context of a marker or symptom, a “increase” is a statistically significant increase in such level.
As used herein, a “highly-utilized” segment is a segment which is found, on average, in at least 3% of a naturally-generated antibody repertoire of a wild-type animal. In some embodiments of any of the aspects, the antibody repertoire can be an unselected repertoire. Highly-utilized segments are known in the art for a human of species. For example, non-limiting examples of highly-utilized segments can include IGHV1-2*02, IGHV1-69, VH3-30, VH4-59, Vκ1-5, Vκ3-20, Vκ4-1, Vλ1-51, Vλ3-1, and Vλ2-14.
As used herein, the terms “protein” and “polypeptide” are used interchangeably herein to designate a series of amino acid residues, connected to each other by peptide bonds between the alpha-amino and carboxy groups of adjacent residues. The terms “protein”, and “polypeptide” refer to a polymer of amino acids, including modified amino acids (e.g., phosphorylated, glycated, glycosylated, etc.) and amino acid analogs, regardless of its size or function. “Protein” and “polypeptide” are often used in reference to relatively large polypeptides, whereas the term “peptide” is often used in reference to small polypeptides, but usage of these terms in the art overlaps. The terms “protein” and “polypeptide” are used interchangeably herein when referring to a gene product and fragments thereof Thus, exemplary polypeptides or proteins include gene products, naturally occurring proteins, homologs, orthologs, paralogs, fragments and other equivalents, variants, fragments, and analogs of the foregoing.
In the various embodiments described herein, it is further contemplated that variants (naturally occurring or otherwise), alleles, homologs, conservatively modified variants, and/or conservative substitution variants of any of the particular polypeptides described are encompassed. As to amino acid sequences, one of skill will recognize that individual substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein sequence which alters a single amino acid or a small percentage of amino acids in the encoded sequence is a “conservatively modified variant” where the alteration results in the substitution of an amino acid with a chemically similar amino acid and retains the desired activity of the polypeptide. Such conservatively modified variants are in addition to and do not exclude polymorphic variants, interspecies homologs, and alleles consistent with the disclosure.
A given amino acid can be replaced by a residue having similar physiochemical characteristics, e.g., substituting one aliphatic residue for another (such as Ile, Val, Leu, or Ala for one another), or substitution of one polar residue for another (such as between Lys and Arg; Glu and Asp; or Gln and Asn). Other such conservative substitutions, e.g., substitutions of entire regions having similar hydrophobicity characteristics, are well known. Polypeptides comprising conservative amino acid substitutions can be tested in any one of the assays described herein to confirm that a desired activity, e.g. activity and specificity of a native or reference polypeptide is retained.
Amino acids can be grouped according to similarities in the properties of their side chains (in A. L. Lehninger, in Biochemistry, second ed., pp. 73-75, Worth Publishers, New York (1975)): (1) non-polar: Ala (A), Val (V), Leu (L), Ile (I), Pro (P), Phe (F), Trp (W), Met (M); (2) uncharged polar: Gly (G), Ser (S), Thr (T), Cys (C), Tyr (Y), Asn (N), Gln (Q); (3) acidic: Asp (D), Glu (E); (4) basic: Lys (K), Arg (R), His (H). Alternatively, naturally occurring residues can be divided into groups based on common side-chain properties: (1) hydrophobic: Norleucine, Met, Ala, Val, Leu, Ile; (2) neutral hydrophilic: Cys, Ser, Thr, Asn, Gln; (3) acidic: Asp, Glu; (4) basic: His, Lys, Arg; (5) residues that influence chain orientation: Gly, Pro; (6) aromatic: Trp, Tyr, Phe. Non-conservative substitutions will entail exchanging a member of one of these classes for another class. Particular conservative substitutions include, for example; Ala into Gly or into Ser; Arg into Lys; Asn into Gln or into His; Asp into Glu; Cys into Ser; Gln into Asn; Glu into Asp; Gly into Ala or into Pro; His into Asn or into Gln; Ile into Leu or into Val; Leu into Ile or into Val; Lys into Arg, into Gln or into Glu; Met into Leu, into Tyr or into Ile; Phe into Met, into Leu or into Tyr; Ser into Thr; Thr into Ser; Trp into Tyr; Tyr into Trp; and/or Phe into Val, into Ile or into Leu.
In some embodiments of any of the aspects, the polypeptide described herein (or a nucleic acid encoding such a polypeptide) can be a functional fragment of one of the amino acid sequences described herein. As used herein, a “functional fragment” is a fragment or segment of a peptide which retains at least 50% of the wildtype reference polypeptide's activity according to the assays described below herein. A functional fragment can comprise conservative substitutions of the sequences disclosed herein.
In some embodiments of any of the aspects, the polypeptide described herein can be a variant of a sequence described herein. In some embodiments of any of the aspects, the variant is a conservatively modified variant. Conservative substitution variants can be obtained by mutations of native nucleotide sequences, for example. A “variant,” as referred to herein, is a polypeptide substantially homologous to a native or reference polypeptide, but which has an amino acid sequence different from that of the native or reference polypeptide because of one or a plurality of deletions, insertions or substitutions. Variant polypeptide-encoding DNA sequences encompass sequences that comprise one or more additions, deletions, or substitutions of nucleotides when compared to a native or reference DNA sequence, but that encode a variant protein or fragment thereof that retains activity. A wide variety of PCR-based site-specific mutagenesis approaches are known in the art and can be applied by the ordinarily skilled artisan.
A variant amino acid or DNA sequence can be at least 90%, at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or more, identical to a native or reference sequence. The degree of homology (percent identity) between a native and a mutant sequence can be determined, for example, by comparing the two sequences using freely available computer programs commonly employed for this purpose on the world wide web (e.g. BLASTp or BLASTn with default settings).
Alterations of the native amino acid sequence can be accomplished by any of a number of techniques known to one of skill in the art. Mutations can be introduced, for example, at particular loci by synthesizing oligonucleotides containing a mutant sequence, flanked by restriction sites enabling ligation to fragments of the native sequence. Following ligation, the resulting reconstructed sequence encodes an analog having the desired amino acid insertion, substitution, or deletion. Alternatively, oligonucleotide-directed site-specific mutagenesis procedures can be employed to provide an altered nucleotide sequence having particular codons altered according to the substitution, deletion, or insertion required. Techniques for making such alterations are very well established and include, for example, those disclosed by Walder et al. (Gene 42:133, 1986); Bauer et al. (Gene 37:73, 1985); Craik (BioTechniques, January 1985, 12-19); Smith et al. (Genetic Engineering: Principles and Methods, Plenum Press, 1981); and U.S. Pat. Nos. 4,518,584 and 4,737,462, which are herein incorporated by reference in their entireties. Any cysteine residue not involved in maintaining the proper conformation of the polypeptide also can be substituted, generally with serine, to improve the oxidative stability of the molecule and prevent aberrant crosslinking. Conversely, cysteine bond(s) can be added to the polypeptide to improve its stability or facilitate oligomerization.
As used herein, the term “nucleic acid” or “nucleic acid sequence” refers to any molecule, preferably a polymeric molecule, incorporating units of ribonucleic acid, deoxyribonucleic acid or an analog thereof The nucleic acid can be either single-stranded or double-stranded. A single-stranded nucleic acid can be one nucleic acid strand of a denatured double-stranded DNA. Alternatively, it can be a single-stranded nucleic acid not derived from any double-stranded DNA. In one aspect, the nucleic acid can be DNA. In another aspect, the nucleic acid can be RNA. Suitable DNA can include, e.g., genomic DNA or cDNA. Suitable RNA can include, e.g., mRNA.
As used herein an “antibody” refers to IgG, IgM, IgA, IgD or IgE molecules or antigen-specific antibody fragments thereof (including, but not limited to, a Fab, F(ab′)2, Fv, disulphide linked Fv, scFv, single domain antibody, closed conformation multispecific antibody, disulphide-linked scfv, diabody), whether derived from any species that naturally produces an antibody, or created by recombinant DNA technology; whether isolated from serum, B-cells, hybridomas, transfectomas, yeast or bacteria.
In another example, an antibody includes two heavy (H) chain variable regions and two light (L) chain variable regions. It should be noted that a VH region (e.g. a portion of an immunoglobin polypeptide is not the same as a VH segment, which is described elsewhere herein). The VH and VL regions can be further subdivided into regions of hypervariability, termed “complementarity determining regions” (“CDR”), interspersed with regions that are more conserved, termed “framework regions” (“FR”). The extent of the framework region and CDRs has been precisely defined (see, Kabat, E. A., et al. (1991) Sequences of Proteins of Immunological Interest, Fifth Edition, U.S. Department of Health and Human Services, NIH Publication No. 91-3242, and Chothia, C. et al. (1987) J. Mol. Biol. 196:901-917; which are incorporated by reference herein in their entireties). Each VH and VL is typically composed of three CDRs and four FRs, arranged from amino-terminus to carboxy-terminus in the following order: FRI, CDR1, FR2, CDR2, FR3, CDR3, FR4.
The term “monospecific antibody” refers to an antibody that displays a single binding specificity and affinity for a particular target, e.g., epitope. This term includes a “monoclonal antibody” or “monoclonal antibody composition,” which as used herein refer to a preparation of antibodies or fragments thereof of single molecular composition, irrespective of how the antibody was generated.
As described herein, an “antigen” is a molecule that is bound by a binding site on an antibody. Typically, antigens are bound by antibody ligands and are capable of raising an antibody response in vivo. An antigen can be a polypeptide, protein, nucleic acid or other molecule or portion thereof The term “antigenic determinant” refers to an epitope on the antigen recognized by an antigen-binding molecule, and more particularly, by the antigen-binding site of said molecule.
As used herein, the term “affinity” refers to the strength of an interaction, e.g. the binding of an antibody for an antigen and can be expressed quantitatively as a dissociation constant (κD). Avidity is the measure of the strength of binding between an antigen-binding molecule (such as an antibody reagent described herein) and the pertinent antigen. Avidity is related to both the affinity between an antigenic determinant and its antigen binding site on the antigen-binding molecule, and the number of pertinent binding sites present on the antigen-binding molecule. Typically, antigen-binding proteins (such as an antibody reagent described herein) will bind to their cognate or specific antigen with a dissociation constant (KD of 10−5 to 10−12 moles/liter or less, and preferably 10−7 to 10−12 moles/liter or less and more preferably 10−8 to 10−12 moles/liter (i.e. with an association constant (KA) of 10−5 to 10−12 liter/moles or more, and preferably 107 to 1012 liter/moles or more and more preferably 108 to 1012 liter/moles). Any KDvalue greater than 10−4 mol/liter (or any KA value lower than 104 M−1) is generally considered to indicate non-specific binding. The KD for biological interactions which are considered meaningful (e.g. specific) are typically in the range of 10−10 M (0.1 nM) to 10−5 M (10000 nM). The stronger an interaction is, the lower is its KD. Preferably, a binding site on an antibody reagent described herein will bind to the desired antigen with an affinity less than 500 nM, preferably less than 200 nM, more preferably less than 10 nM, such as less than 500 pM. Specific binding of an antibody reagent to an antigen or antigenic determinant can be determined in any suitable manner known per se, including, for example, Scatchard analysis and/or competitive binding assays, such as radioimmunoassays (RIA), enzyme immunoassays (EIA) and sandwich competition assays, and the different variants thereof known per se in the art; as well as other techniques as mentioned herein.
As used herein, the term “specific binding” or “specificity” refers to a chemical interaction between two molecules, compounds, cells and/or particles wherein the first entity binds to the second, target entity with greater specificity and affinity than it binds to a third entity which is a non-target. In some embodiments of any of the aspects, specific binding can refer to an affinity of the first entity for the second target entity which is at least 10 times, at least 50 times, at least 100 times, at least 500 times, at least 1000 times or greater than the affinity for the third nontarget entity. Accordingly, as used herein, “selectively binds” or “specifically binds” refers to the ability of an agent (e.g. an antibody reagent) described herein to bind to a target, such a peptide comprising, e.g. the amino acid sequence of a given antigen, with a KD 10−5 M (10000 nM) or less, e.g., 10−6 M or less, 10−7 M or less, 10−8 M or less, 10−9 M or less, 10−10 M or less, 10−11 M or less, or 10−12 M or less. For example, if an agent described herein binds to a first peptide comprising the antigen with a KD of 10−5 M or lower, but not to another randomly selected peptide, then the agent is said to specifically bind the first peptide. Specific binding can be influenced by, for example, the affinity and avidity of the agent and the concentration of the agent. The person of ordinary skill in the art can determine appropriate conditions under which an agent selectively bind the targets using any suitable methods, such as titration of an agent in a suitable cell and/or a peptide binding assay.
As used herein, the term “chimeric”, as used in the context of an antibody, or sequence encoding an antibody refers to immunoglobin molecules characterized by two or more segments or portions derived from different animal species. For example, the variable region of the chimeric antibody is derived from a non-human mammalian antibody, such as murine monoclonal antibody, and the immunoglobin constant region is derived from a human immunoglobin molecule. The variable segments of chimeric antibodies are typically linked to at least a portion of an immunoglobulin constant region (Fc), typically that of a human immunoglobulin. Human constant region DNA sequences can be isolated in accordance with well-known procedures from a variety of human cells, such as immortalized B-cells (WO 87/02671; which is incorporated by reference herein in its entirety). The antibody can contain both light chain and heavy chain constant regions. The heavy chain constant region can include CH1, hinge, CH2, CH3, and, sometimes, CH4 regions. For therapeutic purposes, the CH2 domain can be deleted or omitted. Techniques developed for the production of “chimeric antibodies” are known in the art (see Morrison et al., Proc. Natl. Acad. Sci. 81:851-855 (1984); Neuberger et al., Nature 312:604-608 (1984); Takeda et al., Nature 314:452-454 (1985); which are incorporated by reference herein in their entireties), e.g., by splicing genes from a mouse, or other species, antibody molecule of appropriate antigen specificity together with genes from a human antibody molecule of appropriate biological activity.
As used herein, the term “humanized” refers to an antibody (or fragment thereof, e.g. a light or heavy chain) wherein the CDRs are not human in origin, but the sequence of the remaining sequence of the Ig protein (e.g. the framework regions and constant regions) is human in origin. One of skill in the art is aware of how to humanize a given antibody, see, e.g., U.S. Pat. Nos. 5,585,089; 6,835,823; 6,824,989.
As used herein, the term “engineered” refers to the aspect of having been manipulated by the hand of man. For example, a locus is considered to be “engineered” when two or more sequences, that are not linked together in that order in nature in that locus, are manipulated by the hand of man to be directly linked to one another in the engineered locus. For example, in some embodiments of the present invention, an engineered locus comprises various Ig sequences with a non-native V segment, all of which are found in nature, but are not found in the same locus or are not found in that order in the locus in nature. As is common practice and is understood by those in the art, progeny and copies of an engineered polynucleotide (and/or cells or animals comprising such polynucleotides) are typically still referred to as “engineered” even though the actual manipulation was performed on a prior entity.
As used herein, the term “recombination-defective” refers to a cell (or animal) in which recombination, particularly V(D)J recombination at the IgH and IgL loci cannot occur. Typically, a V(D)J recombination-defective cell is a cell comprising a mutation in a gene encoding a protein that is necessary for V(D)J recombination to occur. Mutations that will cause a cell and/or animal to be V(D)J recombination-defective are known in the art, e.g., RAG2-i-cells are V(D)J recombination defective and mice with such mutations are commercially available (see, e.g., stock number 008449, Jackson Laboratories, Bar Harbor, ME). A further non-limiting example of a V(D)J recombination-defective mutant is RAG1−/−. In some embodiments of any of the aspects, cells can be rendered V(D)J recombination-defective at only one locus, e.g. the IgH locus by, e.g. deleting the germline JH Segments.
As used herein, the term “cassette” refers to a nucleic acid molecule, or a fragment thereof, that can be introduced to a host cell and incorporated into the host cell's genome (e.g. using a cassette-targeting sequence as described elsewhere herein). A cassette can comprise a gene (e.g. an IgH gene), or a fragment thereof, e.g. a VH segment. A cassette can be an isolated nucleotide fragment, e.g. a dsDNA or can be comprised by a vector, e.g. a plasmid, cosmid, and/or viral vector.
As used herein, the term “B cell” refers to lymphocytes that play a role in the humoral immune response and is a component of the adaptive immune system. In this application the expressions “B cell”, “B-cell” and “B lymphocyte” refer to the same cell.
Immature B cells are produced in the bone marrow of most mammals. After reaching the IgM+ immature stage in the bone marrow, these immature B cells migrate to lymphoid organs, where they are referred to as transitional B cells, some of which subsequently differentiating into mature B lymphocytes. B-cell development occurs through several stages, each stage characterized by a change in the genome content at the antibody loci.
Each B cell has a unique receptor protein (referred to as the B-cell receptor (BCR)) on its surface that is able to bind to a unique antigen. The BCR is a membrane-bound immunoglobulin, and it is this molecule that allows to distinguish B cells from other types of lymphocytes, as well as playing a central role in B-cell activation in vivo. Once a B cell encounters its cognate antigen and receives an additional signal from a T helper cell, it can further differentiate into one of two types of B cells (plasma B cells and memory B cells). The B cell may either become one of these cell types directly or it may undergo an intermediate differentiation step, the germinal center reaction, during which the B cell hypermutates the variable region of its immunoglobulin gene (“somatic hypermutation”) and possibly undergoes class switching.
Plasma B cells (also known as plasma cells) are large B cells that have been exposed to an antigen and are producing and secreting large amounts of antibodies. These are short-lived cells and usually undergo apoptosis when the agent that induced the immune response is eliminated. Memory B cells are formed from activated B cells that are specific to an antigen encountered during a primary immune response. These cells are able to live for a long time, and can respond quickly following a second exposure to the same antigen.
As used herein, the term “GC reaction” refers to a process that occurs in the germinal center, during which B cells undergo SHM, memory generation, and/or class/isotype switch. The germinal center (GC) reaction is the basis of T-dependent humoral immunity against foreign pathogens and the ultimate expression of the adaptive immune response. GCs represent a unique collaboration between proliferating antigen-specific B cells, T follicular helper cells, and the specialized follicular dendritic cells that constitutively occupy the central follicular zones of secondary lymphoid organs.
As used herein, the term “somatic hypermutation” or “SHM,” refers to the mutation of a polynucleotide sequence at an Ig locus initiated by, or associated with the action of AID (activation-induced cytidine deaminase) on that polynucleotide sequence. SHM occurs during B cell proliferation and occurs at a mutation rate that is at least 105-106 fold greater than the normal rate of mutation in the genome.
As used herein, the term “stem cell” refers to a cell in an undifferentiated or partially differentiated state that has the property of self-renewal and has the developmental potential to naturally differentiate into a more differentiated cell type, without a specific implied meaning regarding developmental potential (i.e., totipotent, pluripotent, multipotent, etc.). By self-renewal is meant that a stem cell is capable of proliferation and giving rise to more such stem cells, while maintaining its developmental potential. Accordingly, the term “stem cell” refers to any subset of cells that have the developmental potential, under particular circumstances, to differentiate to a more specialized or differentiated phenotype, and which retain the capacity, under certain circumstances, to proliferate without substantially differentiating. The term “somatic stem cell” is used herein to refer to any stem cell derived from non-embryonic tissue, including fetal, juvenile, and adult tissue. Natural somatic stem cells have been isolated from a wide variety of adult tissues including blood, bone marrow, brain, olfactory epithelium, skin, pancreas, skeletal muscle, and cardiac muscle. Exemplary naturally occurring somatic stem cells include, but are not limited to, mesenchymal stem cells and hematopoietic stem cells. In some embodiments of any of the aspects, the stem or progenitor cells can be embryonic stem cells. As used herein, “embryonic stem cells” refers to stem cells derived from tissue formed after fertilization but before the end of gestation, including pre-embryonic tissue (such as, for example, a blastocyst), embryonic tissue, or fetal tissue taken any time during gestation, typically but not necessarily before approximately 10-12 weeks gestation. Most frequently, embryonic stem cells are totipotent cells derived from the early embryo or blastocyst. Embryonic stem cells can be obtained directly from suitable tissue, including, but not limited to human tissue, or from established embryonic cell lines. In one embodiment, embryonic stem cells are obtained as described by Thomson et al. (U.S. Pat. Nos. 5,843,780 and 6,200,806; Science 282:1145, 1998; Curr. Top. Dev. Biol. 38:133 ff, 1998; Proc. Natl. Acad. Sci. U.S.A. 92:7844, 1995 which are incorporated by reference herein in their entirety).
Exemplary stem cells include embryonic stem cells, adult stem cells, pluripotent stem cells, bone marrow stem cells, hematopoietic stem cells, and the like. Descriptions of stem cells, including method for isolating and culturing them, may be found in, among other places, Embryonic Stem Cells, Methods and Protocols, Turksen, ed., Humana Press, 2002; Weisman et al., Annu. Rev. Cell. Dev. Biol. 17:387 403; Pittinger et al., Science, 284:143 47, 1999; Animal Cell Culture, Masters, ed., Oxford University Press, 2000; Jackson et al., PNAS 96(25):14482 86, 1999; Zuk et al., Tissue Engineering, 7:211 228, 2001 (“Zuk et al.”); Atala et al., particularly Chapters 33 41; and U.S. Pat. Nos. 5,559,022, 5,672,346 and 5,827,735. Descriptions of stromal cells, including methods for isolating them, may be found in, among other places, Prockop, Science, 276:71 74, 1997; Theise et al., Hepatology, 31:235 40, 2000; Current Protocols in Cell Biology, Bonifacino et al., eds., John Wiley & Sons, 2000 (including updates through March, 2002); and U.S. Pat. No. 4,963,489.
As used herein, the term “corresponding to” refers to an amino acid or nucleotide at the enumerated position in a first polypeptide or nucleic acid, or an amino acid or nucleotide that is equivalent to an enumerated amino acid or nucleotide in a second polypeptide or nucleic acid. Equivalent enumerated amino acids or nucleotides can be determined by alignment of candidate sequences using degree of homology programs known in the art, e.g., BLAST.
The term “statistically significant” or “significantly” refers to statistical significance and generally means a two standard deviation (2SD) or greater difference.
Other than in the operating examples, or where otherwise indicated, all numbers expressing quantities of ingredients or reaction conditions used herein should be understood as modified in all instances by the term “about.” The term “about” when used in connection with percentages can mean±1%.
As used herein, the term “comprising” means that other elements can also be present in addition to the defined elements presented. The use of “comprising” indicates inclusion rather than limitation.
The term “consisting of” refers to compositions, methods, and respective components thereof as described herein, which are exclusive of any element not recited in that description of the embodiment.
As used herein the term “consisting essentially of” refers to those elements required for a given embodiment. The term permits the presence of additional elements that do not materially affect the basic and novel or functional characteristic(s) of that embodiment of the invention.
As used herein, the term “specific binding” refers to a chemical interaction between two molecules, compounds, cells and/or particles wherein the first entity binds to the second, target entity with greater specificity and affinity than it binds to a third entity which is a non-target. In some embodiments of any of the aspects, specific binding can refer to an affinity of the first entity for the second target entity which is at least 10 times, at least 50 times, at least 100 times, at least 500 times, at least 1000 times or greater than the affinity for the third nontarget entity. A reagent specific for a given target is one that exhibits specific binding for that target under the conditions of the assay being utilized.
The singular terms “a,” “an,” and “the” include plural referents unless context clearly indicates otherwise. Similarly, the word “or” is intended to include “and” unless the context clearly indicates otherwise. Although methods and materials similar or equivalent to those described herein can be used in the practice or testing of this disclosure, suitable methods and materials are described below. The abbreviation, “e.g.” is derived from the Latin exempli gratia, and is used herein to indicate a non-limiting example. Thus, the abbreviation “e.g.” is synonymous with the term “for example.”
Groupings of alternative elements or embodiments of the invention disclosed herein are not to be construed as limitations. Each group member can be referred to and claimed individually or in any combination with other members of the group or other elements found herein. One or more members of a group can be included in, or deleted from, a group for reasons of convenience and/or patentability. When any such inclusion or deletion occurs, the specification is herein deemed to contain the group as modified thus fulfilling the written description of all Markush groups used in the appended claims.
Unless otherwise defined herein, scientific and technical terms used in connection with the present application shall have the meanings that are commonly understood by those of ordinary skill in the art to which this disclosure belongs. It should be understood that this invention is not limited to the particular methodology, protocols, and reagents, etc., described herein and as such can vary. The terminology used herein is for the purpose of describing particular embodiments only, and is not intended to limit the scope of the present invention, which is defined solely by the claims. Definitions of common terms in immunology and molecular biology can be found in The Merck Manual of Diagnosis and Therapy, 19th Edition, published by Merck Sharp & Dohme Corp., 2011 (ISBN 978-0-911910-19-3); Robert S. Porter et al. (eds.), The Encyclopedia of Molecular Cell Biology and Molecular Medicine, published by Blackwell Science Ltd., 1999-2012 (ISBN 9783527600908); and Robert A. Meyers (ed.), Molecular Biology and Biotechnology: a Comprehensive Desk Reference, published by VCH Publishers, Inc., 1995 (ISBN 1-56081-569-8); Immunology by Werner Luttmann, published by Elsevier, 2006; Janeway's Immunobiology, Kenneth Murphy, Allan Mowat, Casey Weaver (eds.), Taylor & Francis Limited, 2014 (ISBN 0815345305, 9780815345305); Lewin's Genes XI, published by Jones & Bartlett Publishers, 2014 (ISBN-1449659055); Michael Richard Green and Joseph Sambrook, Molecular Cloning: A Laboratory Manual, 4th ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., USA (2012) (ISBN 1936113414); Davis et al., Basic Methods in Molecular Biology, Elsevier Science Publishing, Inc., New York, USA (2012) (ISBN 044460149X); Laboratory Methods in Enzymology: DNA, Jon Lorsch (ed.) Elsevier, 2013 (ISBN 0124199542); Current Protocols in Molecular Biology (CPMB), Frederick M. Ausubel (ed.), John Wiley and Sons, 2014 (ISBN 047150338X, 9780471503385), Current Protocols in Protein Science (CPPS), John E. Coligan (ed.), John Wiley and Sons, Inc., 2005; and Current Protocols in Immunology (CPI) (John E. Coligan, ADA M Kruisbeek, David H Margulies, Ethan M Shevach, Warren Strobe, (eds.) John Wiley and Sons, Inc., 2003 (ISBN 0471142735, 9780471142737), the contents of which are all incorporated by reference herein in their entireties.
In some embodiments of any of the aspects, the disclosure described herein does not concern a process for cloning human beings, processes for modifying the germ line genetic identity of human beings, uses of human embryos for industrial or commercial purposes or processes for modifying the genetic identity of animals which are likely to cause them suffering without any substantial medical benefit to man or animal, and also animals resulting from such processes.
Other terms are defined herein within the description of the various aspects of the invention.
All patents and other publications; including literature references, issued patents, published patent applications, and co-pending patent applications; cited throughout this application are expressly incorporated herein by reference for the purpose of describing and disclosing, for example, the methodologies described in such publications that might be used in connection with the technology described herein. These publications are provided solely for their disclosure prior to the filing date of the present application. Nothing in this regard should be construed as an admission that the inventors are not entitled to antedate such disclosure by virtue of prior invention or for any other reason. All statements as to the date or representation as to the contents of these documents is based on the information available to the applicants and does not constitute any admission as to the correctness of the dates or contents of these documents.
The description of embodiments of the disclosure is not intended to be exhaustive or to limit the disclosure to the precise form disclosed. While specific embodiments of, and examples for, the disclosure are described herein for illustrative purposes, various equivalent modifications are possible within the scope of the disclosure, as those skilled in the relevant art will recognize. For example, while method steps or functions are presented in a given order, alternative embodiments may perform functions in a different order, or functions may be performed substantially concurrently. The teachings of the disclosure provided herein can be applied to other procedures or methods as appropriate. The various embodiments described herein can be combined to provide further embodiments. Aspects of the disclosure can be modified, if necessary, to employ the compositions, functions and concepts of the above references and application to provide yet further embodiments of the disclosure. Moreover, due to biological functional equivalency considerations, some changes can be made in protein structure without affecting the biological or chemical action in kind or amount. These and other changes can be made to the disclosure in light of the detailed description. All such modifications are intended to be included within the scope of the appended claims.
Specific elements of any of the foregoing embodiments can be combined or substituted for elements in other embodiments. Furthermore, while advantages associated with certain embodiments of the disclosure have been described in the context of these embodiments, other embodiments may also exhibit such advantages, and not all embodiments need necessarily exhibit such advantages to fall within the scope of the disclosure.
The technology described herein is further illustrated by the following examples which in no way should be construed as being further limiting.
Some embodiments of the technology described herein can be defined according to any of the following numbered paragraphs:
Some embodiments of the technology described herein can be defined according to any of the following numbered paragraphs:
We have previously described a mouse model in which the most proximal mouse VH is replaced with a desired human VH in the context of deletion of the IGCR1 regulatory element. In such models, the inserted human VH is rearranged very frequently with either mouse D and JH or with inserted human DJH to generate a vast repertoire B cells most of which express the inserted human VH in association with unique antigen binding CDR3 that assembled via diversification processes that occur during assembly of VHDJH junctions ((Tian et al., 2016)).
We have taken this approach to take apart the VH, D, and JH segments of an existing anti-PD1 antibody described by BMS (Korman et al., U.S. Pat. No. 8,008,449 B2); similar antibodies have been widely used for cancer immunotherapy. These anti-PD1 IgH chains gene segments are employed form one of these antibodies in our rearrangement model to generate mice that express a vast array of IgH precursors from that antibody, with novel CDR3 antigen contact region (due to V(D)J recombination junctional diversification). We also made such antibodies using the PD1 IgH V segment described above for rearrangement to mouse Ds and JHs or the DJH of the original anti-PD1 antibody. All of these mice also expressed a fixed IgL chain from the original anti-PD1 antibody. Upon immunization with PD1, antibodies obtained from this mouse model had many novel humanized PD1 antibodies relative to their precursor and two therapeutically employed anti-PD1 antibodies. These novel antibodies have similar affinity for PD-1 as the high affinity antibody from which they were derived but altered overall binding characteristics and/or epitopes and significant sequence differences in CDR3 and other parts of the variable region sequences.
The largest impact on the diversity of BCR repertoires derives from CDR3, especially of the Ig heavy chain. There are a huge number of potential CDR3 sequences that can be generated in humans and mice, numbers greatly exceed the number of lymphocytes in mice or humans. Thus, the total diversity of the antibody repertoire is largely limited by the number of B cells. Humans have orders of magnitude more B cells than mice. For this reason, mice can only express a tiny fraction of the human CDR3 repertoires for a given antibody precursor in naive B cells. Thus, the success of the models described above relative to that of existing humanized antibody mouse models is based in large part on making mice that can express a larger, more human-like CDR3 repertoire for a one given set of human antibody IgH and IgL chains versus making antibodies from 100 s of IgH and IgL V(D)J combinations. Based on our new Ig repertoire sequencing method it has been found that humans tend to predominately use a subset of their IgH and IgL chains in their naïve repertoires. Therefore, described herein are certain mice, each of which rearranges a given highly utilized human VH segment and human VL segment. The mice described herein can be used for immunization of a desired target antigen to discover new humanized antibodies which can then be further optimized by the optimization methods outlined herein and in US Patent Publication 2016/0374320; which is incorporated by reference herein in its entirety.
To complement the Ig heavy chain diversification described above, also described herein are mice that will dominantly rearrange a specific IgL chain V segment based on findings that deletion of an element named Cer/Sis leads to increased proximal Vκ light chain utilization, similar to the effects of IGCR1 deletion in IgH. However, this effect is not as predominant as in IgH, likely because the IgH proximal VH segments have an additional element, termed a CBE, that enforces their over-utilization in the absence of IGCR1 (Jain et al., Cell in press; see also appended Igκ rearrangment data; and
TdT ectopic expression can also be introduced in into these mice as repertoire sequencing observations confirm the earlier speculation (Alt and Baltimore, 1982) that mouse IgL repertoire diversity is much less in mice that humans due to lack of TdT expression in mouse pro-B cells undergoing IgL rearrangement. These modifications will yield a mouse model that can express a much more human-like diverse repertoire of a selected human IgL VJ exons.
These IgH and IgL rearranging mice can be bred to make rearranging models that will each express large, more human-like repertoires of a given pair of rearranging IgH and IgL chains than conventional humanized mice with complete Ig loci that are now used for humanized antibody discovery. Immunization of this set of mice can permit the discovery of superior, novel humanized therapeutic antibodies, which can be further improved, if necessary, by our current antibody optimization mouse model.
Specifically contemplated herein is the immunization of a prototype of this new discovery model with PD-1. Several additional boosts can be performed, prior to isolation and characterization of high affinity humanized anti-PD1 antibodies.
Further contemplated herein is a second model, based on the original anti-PD-1 model described above. The model comprises mice engineered by replacing their VHs and JHS, e.g., via the now standard Cas-9gRNA Zygote injection/electroporation methodology. See, e.g., Wang et al. Cell. 2013; 153(4):910-8; Yang et al. Cell. 2013; 154(6):1370-9; Yasue et al. Scientific reports. 2014; 4:5705; Hashimoto et al. Developmental biology. 2016; 418(1):1-9; and Wang et al. BioTechniques. 2015; 59(4):201-2, 4, 6-8.
The invention in brief is the discovery that insertion of a CTCF-binding element (CBE) adjacent to an antibody variable region gene segment can greatly increase its rearrangement frequency. This invention permits the generation of mouse models focused on rearrangement of particular IgH and IgL to make a more human like repertoires of antibody precursors from which to selectively generate high affinity humanized antibodies.
Antigen-binding variable region exons of antibody molecules are assembled from germline V, D and J gene segments by a V(D)J recombination process. This process is initiated by the RAG endonuclease within a chromosomal V(D)J recombination center (RC) by cleaving between paired gene segments and flanking recombination signal sequences (RSSs). The mouse heavy chain locus (Igh) harbors a high density of sites that bind a ubiquitously expressed architectural protein called CTCF that facilitates chromosomal looping and plays an important role in organizing the genome into topologically associated domains that regulate various physiological processes. In the Igh locus, the vast majority of these CTCF-binding elements (CBEs) are spread across the VH domain. CBE organization is particularly striking in the DH-proximal part of the VH domain where CBEs lie immediately downstream of the RSS of functional VH segments.
It was found that mutation of the CBE that lies next to the most DH-proximal functional VH segment, VH81X, which is also the most highly rearranging VH segment in mouse progenitor cells, results in 50-100 fold reduction in VH81X utilization while rearrangement of a few immediately upstream VHs is increased. Similarly, mutation of the CBE flanking the next upstream VH segment resulted in a 100-fold reduction in the utilization of its associated VH segment while rearrangement of a few immediately upstream VHs increased.
Although VH81X is the most highly utilized VH segment in progenitor B cells, the most DH-proximal VH segment is an infrequently utilized pseudogene called VH5-1 that is flanked by a non-functional vestigial CBE. Restoration of this CBE converted VH5-1 into the most highly utilized VH while rearrangement of VH81X and other frequently rearranging upstream VHs was significantly reduced. Thus, the presence of a CBE tremendously enhances the recombination potential of the associated VH by making it accessible to RAG that linearly scans chromatin for its substrates. This scanning process initiates from the downstream RC and is likely mediated by loop extrusion during which VH-associated CBEs stabilize interactions of DH-proximal VHs first encountered by the RC, thereby promoting their dominant rearrangement.
A similar RAG scanning process operates in the mouse Igκ locus that encodes the antibody Iκ light chain (see, e.g.,
This approach can be tapped to generate diverse antibody repertoires using any V segment of choice simply by replacing the most proximal VH and/or Vκ segment with a corresponding human V segment of interest and retaining or inserting a CBE next to it. Also contemplated herein is the combination of this model with existing IgH models to make a fully VH and Vκ rearranging model that can be used to optimize the affinity of existing humanized therapeutic antibodies and to also discover new ones based on, e.g., V(D)J junctional regions making the major contribution to antigen binding with SHM followed by selection maturing the complete V(D)J exon (CDRs 1, 2 and 3)
RAG endonuclease initiates antibody heavy chain variable region exon assembly from V, D, and J segments within a chromosomal V(D)J recombination center (RC) by cleaving between paired gene segments and flanking recombination signal sequences (RSSs). The IGCR1 control region promotes DJH intermediate formation by isolating Ds, JHs, and RC from upstream VHs in a chromatin loop anchored by CTCF-binding elements (“CBEs”). How VHs access the DJHRC for VH to DJH rearrangement was previously unknown. It is described herein that CBEs immediately downstream of frequently rearranged VH-RSSs increase recombination potential of their associated VH far beyond that provided by RSSs alone. This CBE activity becomes particularly striking upon IGCR1 inactivation, which allows RAG, likely via loop extrusion, to linearly scan chromatin far upstream. VH-associated CBEs stabilize interactions of D-proximal VHs first encountered by the DJHRC during linear RAG scanning and, thereby, promote dominant rearrangement of these VHs by an unanticipated chromatin accessibility-enhancing CBE function.
Exons encoding immunoglobulin (Ig) or T cell receptor variable regions are assembled from V, D, and J gene segments during B and T lymphocyte development. V(D)J recombination is initiated by RAG1/RAG2 endonuclease (RAG), which introduces DNA double-stranded breaks (DSBs) between a pair of V, D, and J coding segments and flanking recombination signal sequences (RSSs) (Teng and Schatz, 2015). RSSs consist of a conserved heptamer, closely related to the canonical 5′-CACAGTG-3′ sequence, and a less-conserved nonamer separated by 12 (12RSS) or 23 (23RSS) base pair (bp) spacers. Physiological RAG cleavage requires RSSs and is restricted to paired coding segments flanked, respectively, by 12RSSs and 23RSSs (Teng and Schatz, 2015). RAG binds paired RSSs as a Y-shaped heterodimer (Kim et al., 2015; Ru et al., 2015), with cleavage occurring adjacent to heptamer CACs. Cleaved coding and RSS ends reside in a RAG post-cleavage synaptic complex prior to fusion of RSS ends and coding ends, respectively, by non-homologous DSB end-joining (Alt et al., 2013).
The mouse Ig heavy chain locus (Igh) spans 2.7 megabases (Mb), with more than 100 VHs flanked by 23RSSs embedded in the 2.4 Mb distal portion; 13 Ds flanked on each side by a 12RSS located in a region starting 100 kb downstream of the D-proximal VH (VH5-2; commonly termed “VH81X”), and 4 JHs flanked by 23RSSs lying just downstream of the Ds (Alt et al., 2013;
V(D)J recombination is regulated to maintain specificity and diversity of antigen receptor repertoires by modulating chromatin accessibility of particular Ig or TCR loci, or regions of these loci, for V(D)J recombination (Yancopoulos et al., 1986; Alt et al., 2013). Accessibility regulation was proposed based on robust transcription of distal VHs before rearrangement (Yancopoulos and Alt, 1985) and correlated with various epigenetic modifications (Alt et al., 2013). In this regard, germline transcription and active chromatin modifications in the nRC recruit RAG1 and RAG2 to form the active RC (Teng and Schatz, 2015). Genome organization alterations also positively impact VH “accessibility” by bringing distal VHs into closer physical proximity to the DJHRC via Igh locus contraction (Bossen et al., 2012). Conversely, the intergenic control region 1 (IGCR1) in the VH to D interval plays a negative, insulating role with respect to proximal VH accessibility (Guo et al., 2011). IGCR1 function relies on two CTCF looping factor binding elements (“CBEs”) that contribute to sequestering Ds, JHs and RC within a chromatin domain that excludes proximal VHs; thereby, mediating ordered D to JH recombination and preventing proximal VH over-utilization (Guo et al., 2011; Lin et al; 2015; Hu et al., 2015).
Eukaryotic genomes are organized into Mb or sub-Mb topologically associated domains (TADs) (Dixon et al., 2012; Nora et al., 2012) that often include contact loops anchored by pairs of convergent CBEs bound by CTCF in association with cohesin (Phillips-Cremins et al., 2013; Rao et al., 2014). In this regard, CTCF binds CBEs in an orientation-dependent fashion. Ability to recognize widely separated convergent CBEs may involve cohesin, or other factors, that progressively extrude a growing chromatin loop that is fixed into a domain upon reaching convergent CTCF-bound loop anchors (Sanbom et al., 2015; Nichols and Corces, 2015; Fudenberg et al., 2016; Dekker and Mimy, 2016). In mammalian cells, CBEs, TADs and/or loop domains have been implicated in regulation of various physiological processes (Dekker and Mimy, 2016; Merkenschlager and Nora, 2016; Hnisz et al., 2016), with convergent CBE-based loop organization implicated as critical for such regulation in some cases (Sanbom et al., 2015; Guo et al., 2015; de Wit et al., 2015; Ruiz-Velasco et al., 2017).
RAG can explore directionally from an initiating physiological or ectopically introduced RC for Mb distances within convergent CBE-based contact chromatin loop domains genome-wide (Hu et al., 2015). During such exploration, RAG uses RSSs in convergent orientation, including cryptic RSSs as simple as a CAC, for cleavage and joining to a canonical RSS in the RC (Hu et al., 2015; Zhao et a., 2016). This long-range directional RAG activity is impeded upon encounter of cohesin-bound convergent CBE pairs and potentially by other blockages that create chromatin sub-domains within loops (Hu et al., 2015; Zhao et al., 2016). The directionality and linearity of RAG activity across these domains implicated one-dimensional RAG tracking (Hu et al., 2015). Directional RAG tracking also occurs upstream of the DJHRC to IGCR1 (Hu et al., 2015). IGCR1 deletion extends this recombination tracking domain directionally upstream from the DJHRC to the proximal VHs, coupled with dramatically increased proximal VH to DJH joining, most dominantly VH81X (Hu et al., 2015). However, the nature of the tracked substrate and factors that drive RAG tracking remained speculative.
The mouse Igh harbors a high density of CBEs (Degner et al., 2011). Ten clustered CBEs (“3′CBEs”) lie at the downstream Igh boundary in convergent orientation to more than 100 CBEs embedded across the VH domain (Proudhon et al., 2015). VH CBEs are spread throughout the VH domain and, particularly for more proximal VHs, often found immediately downstream of VH RSSs (Choi et al., 2013; Bolland et al., 2016). Notably, VH CBEs and 3′CBEs are in convergent orientation with each other and with, respectively, the upstream and downstream IGCR1 CBEs (Guo et al., 2011). The striking number and organization of the CBEs across the VH portion of Igh has led to speculation of potential positive or negative VH CBE roles in Igh V(D)J recombination (Bossen et al., 2012; Guo et al., 2011; Benner et al., 2015; Degner et al., 2011; Lin et al., 2015). Our current studies reveal the function of proximal VH CBEs and provide new insights into the RAG tracking mechanism.
The VH81X-CBE Greatly Augments VH81X Utilization in Primary Pro-B Cells
To examine potential functions of the CBE immediately downstream of VH81X, 129SV ES cells were generated in which the 18-bp VH81X-CBE sequence was replaced with a scrambled sequence that does not bind CTCF (
VH81X is the most highly utilized VH in WT 129SV mouse pro-B cells being used in about 10% of total VDJH junctions, with VH2-2, which lies approximately 10 kb immediately upstream, being the second most highly utilized at 6% of junctions (
VH81X-CBE Greatly Augments VH81X to DJH Rearrangement in a v-Abl Pro-B Cell Line
To establish a cell culture model to facilitate further analyses of VH81X-CBE function in V(D)J recombination, it was first tested whether this element is required for VH81X rearrangement in v-Abl transformed, Ep-Bcl2-expressing pro-B cells viably arrested in the G1 cell-cycle phase by treatment with STI-571 to induce RAG expression and V(D)J recombination (Bredemeyer et al., 2006). For this purpose, a v-Abl pro-B line was derived that harbors an inert non-productive rearrangement of a distal VHJ558 that deletes all proximal VHs and Ds on one allele and a DHFL16.1 to JH4 rearrangement that actively undergoes VH to DJH recombination on the other allele (
Three separate HTGTS-Rep-Seq libraries were analyzed from both parent and VH81X-CBEdel DHFL16.1JH4 v-Abl pro-B lines. These analyses revealed that VH81X is utilized in approximately 45% of VDJH rearrangements in the parent line, but in only about 0.5% of VDJH rearrangements in the VH81X-CBEdel line, representing a 100-fold decrease (
Sequencing VH81X-CBE scrambled and deletion mutations in genomic DNA confirmed that both left the VH81X-RSS intact. Yet, the effect of VH81X-CBE mutations is nearly as profound and specific as expected for mutation of an RSS. To confirm that basic VH81X-RSS functions were intact subsequent to CBE deletion, a Cas9/gRNA approach was used to delete the approximately 101 kb sequence downstream of the VH81X-RSS in both DHFL16.1JH4 and VH81X-CBEdel DHFL16.1JH4 v-Abl cells, thereby positioning VH81X and its canonical RSS approximately 700 bp upstream of the DJHRC in both lines (
The VH81X-CBE Mediates Robust VH81X Rearrangement When Inverted
Several studies indicated that CBE orientation is critical for its function as a loop domain anchor (Rao et al., 2014; Sanborn et al., 2015), as well as for mediating enhancer-promoter interactions (Guo et al., 2015; de Wit et al., 2015) and regulating alternative splicing (Ruiz-Velasco et al., 2017). Convergent VH-CBE orientation with respect to IGCR1-CBE1 and the 3′CBEs suggested that such organization may be important for V(D)J recombination regulation (Guo et al., 2011; Lin et al., 2015; Benner et al., 2015; Aiden and Casellas, 2015; Proudhon et al., 2015). To test this notion, a Cas9/gRNA approach was used to invert a 40-bp sequence encompassing VH81X-CBE in the DHFL16.1JH4 v-Abl line to generate “VH81X-CBEinv” lines (
VH81X-CBE Promotes Interaction with the DJHnRC
To examine VH81X-CBE interactions with other Igh regions, an HTGTS-based methodology that provides high-resolution and reproducible interaction profiles of a bait locale of interest with unknown (prey) interacting sequences across Igh was developed (
To identify interaction partners of VH81X, 3C-HTGTS was performed on RAG2-deficient derivatives of control, VH81X-CBEdel, and VH81X-CBEinv DHFL16.1JH4 v-Abl lines using VH81X as bait (
To test the function of an additional VH-associated CBE, “VH2-2-CBEscr” DHFL16.1JH4 v-Abl lines weew generated in which the CBE just downstream of VH2-2 was replaced with a scrambled sequence that does not bind CTCF (
CBE-Dependent VH81X Dominance without IGCR1 Implicates RAG Chromatin Tracking
IGCR1 deletion results in tremendous over-utilization of proximal VHs, most dramatically VH81X, in association with RAG linear exploration of sequences upstream of IGCR1 via some form of tracking (Hu et al., 2015). To test whether the VH81X-CBE contributes to the immense over-utilization of VH81X in the context of IGCR1 deletion and RAG tracking, IGCR1-deleted (“IGCR1del”) DHFL16.1JH4 v-Abl cells were generated with or without the VH81X-CBEdel mutation (
To identify VH81X-CBE interaction partners in the context of IGCR1-deficiency, 3C-HTGTS was performed using VH81X bait on RAG2-deficient DHFL16.1JH4 v-Abl cells that also harbored either IGCR1del or IGCR1del VH81X-CBEdel mutations (
We also used iEμ within the DJHnRC as bait to examine interactions with other Igh sequences in this same set of RAG2-deficient control, IGCR1del, and IGCR1del VH81X-CBEdel DHFL16.1JH4 v-Abl lines. In all three genotypes, iEμ interacted with the 3′CBEs and with a region between Cγ1 and Cγ2b (Medvedovic et al., 2013). In the RAG2-deficient DHFL16.1JH4 control line, iEμ has barely detectable interaction with proximal VHs (
Restoration of a Vestigial CBE Converts VH5-1 into the Most Highly Rearranging VH
Mutation of the VH81X or VH2-2 CBEs remarkably reduce ability of these VHs to be utilized for V(D)J recombination, despite retention of their normal RSSs. In this regard, the most D-proximal VH5-1 has a canonical RSS (
Described herein is the major role of VH-associated CBEs in V(D)J recombination. Thus, V(D)J recombination potential of VH81X is dramatically enhanced in both primary pro-B cells in mice and in v-Abl pro-B lines by its associated CBE. Likewise, V(D)J recombination potential of the upstream VH2-2 is similarly enhanced by its associated CBE. Decades ago, we hypothesized one dimensional “recombinase scanning” as a possible mechanism for preferential proximal VH utilization, but noted that there must be an additional determinant based on low level VH5-1 pseudo-VH utilization despite its most proximal location downstream of VH81X and consensus RSS (Yancopoulos et al., 1984). Described herein is this additional determinant as a CBE by converting the “vestigial” CBE downstream of VH5-1 into a functional CBE and, thereby, rendering it the most frequently rearranged VH. However, the VH81X-CBE was not required for robust VH81X rearrangement when it was placed linearly adjacent to the DJHRC, indicating VH-CBE function is distinct from that of RSSs. To further assess the mechanism by which proximal VH-CBEs enhance V(D)J recombination potential, a highly sensitive 3C-HTGTS chromatin interaction method was developed. Effects of various tested loss and gain of function CBE mutations on V(D)J recombination potential of the 3 proximal VHs were mirrored by effects on their interactions with the DJHnRC. This relationship was most striking in the context of IGCR1 deletion, which leads to both dramatically increased VH81X utilization and dramatically increased VH81X interaction with the DJHnRC, with both increases being dependent on the VH81X-CBE. Thus, proximal VH-CBEs increase V(D)J recombination potential by increasing the frequency with which their associated VHs interact with the DJHRC.
RAG tracking in the absence of IGCR1 proceeds upstream to the most proximal VHs, resulting in their increased rearrangement to DJH intermediates (Hu et al., 2015). This dominant increase in VH81X rearrangement during tracking in the absence of IGCR1 is VH81X-CBE-dependent and associated with CBE-mediated DJHRC interactions. The imprint of linear tracking on proximal VH utilization in the absence of IGCR1 goes beyond VH81X. Thus, in v-Abl pro-B lines, where tracking effects are more pronounced in the absence of locus contraction, the three VHs just upstream of VH81X also show markedly increased utilization with relative utilization decreasing with upstream distance. Likewise, while VH81X utilization plummets in VH81X-CBEdel v-Abl cells lacking IGCR1, utilization of the upstream VH2-2 becomes dominant and that of the three upstream VHs again increases with levels inversely related to upstream distance. Also consistent with linear tracking, utilization of the most downstream CBE-less VH5-1 with a restored CBE increases substantially in the absence of IGCR1 becoming dominant even over VH81X. Relative VH utilization patterns during RAG upstream tracking in the absence of IGCR1 correlate well with proximal VH interactions with the DJHnRC. Together, these findings indicate that RAG scans chromatin, rather than DNA per se, allowing this process to be better described as linear RAG chromatin scanning; and they further indicate that proximal VH-CBEs promote over-utilization of associated VHs via a chromatin accessibility-enhancing function. The mechanism of this accessibility function likely involves CBE-mediated prolonged interaction of the VH with the DJHRC. It is described herein that long-range interactions critical to RAG chromatin scanning do not require a functional RAG complex. Thus, RAG bound to the DJHRC may harness a more general cellular mechanism operating within the Igh locus, such as cohesin-mediated chromatin loop extrusion, to scan distal sequences.
RAG Chromatin Scanning Shares Features with Chromatin Loop Extrusion
Inserting RSS pairs to generate ectopic “RCs” in various random genomic sites revealed orientation-specific linear RAG chromatin scanning within chromosome loop domains bounded by convergent CBE anchors, suggesting cohesin involvement (Hu et al., 2015). Features of RAG scanning overlap with those of cohesin-mediated loop extrusion (Dixon et al., 2016; Dekker and Mirny, 2016). Cohesin rings extrude chromatin loops that become progressively larger, bringing distal chromosomal regions into physical proximity in a linear fashion and having the potential to increase contact frequencies between loop anchors and sequences across extrusion domains (Fudenberg et al., 2016; Rao et al., 2017; Sanborn et al., 2015; Schwarzer et al., 2017). In this regard, CBEs bound by CTCF act as strong loop anchors and impede extrusion (Nichols and Corces, 2015; Fudenberg et al., 2016; Nora et al., 2017). Overlaps between loop extrusion and RAG scanning suggest that scanning may be driven by chromatin extrusion past a RAG-containing “RC anchor” (
After Igh locus contraction brings distal VHs into closer proximity of the DJHRC, they become directly associated with the RC via subsequent diffusion-related mechanisms (Lucas et al., 2014). Notably, however, utilization of the very most proximal VHs does not require locus contraction (Fuxa et al., 2004). In this regard, primary locus-contracted pro-B cells utilize VH81X and VH2-2 more frequently than more distal VHs. Likewise, in VH81X-CBE mutant primary pro-B cells utilization of the immediately upstream VH2-2 increases dramatically with utilization of the next two upstream VHs increasing to levels higher than those of more distal VHs. In v-Abl pro-B cells, which lack Igh contraction but have intact IGCR1, over-utilization of VH81X and the four immediately upstream VHs have a distance-dependent utilization pattern reminiscent of that when IGCR1 is inactivated. Likewise, deletion of the VH2-2-CBE increases relative utilization of upstream VHs, again with the same distance-related pattern, but has no effect on downstream VH81X utilization. Finally, ectopic introduction of an immediately downstream CBE renders proximal VH5-1 the most highly utilized VH, while, correspondingly, greatly dampening utilization of upstream VHs. Together, these findings indicate that the relatively high recombination potential of very most proximal functional VHs, even in normal, locus contracted pro-B cells, results from low level RAG chromatin scanning from the DJHRC into the proximal VH domain in the presence of IGCR1 CBEs. Beyond these proximal VHs, RAG linear scanning upstream from the DJHRC appears to have little, if any, impact, even in the absence of IGCR1; likely because dominant utilization of proximal VHs first encountered obviates most RAG scanning upstream.
Nearly all functional mouse VHs have CBEs directly adjacent or within several kb (
Mice. A 2.2-kb 5′ homology arm encompassing the VH81X gene segment sequence and containing an 18-bp scrambled mutation of VH81X-CBE that abrogates CTCF binding (
Cell lines, v-Abl kinase transformed pro-B cell lines were derived by retroviral infection of bone marrow cells from 4-6 weeks old mice with the pMSCV-v-Abl retrovirus, as previously described (Bredemeyer et al., 2006). Transfected cells were cultured in RPMI medium containing 15% (v/v) FBS for two months to recover stably transformed v-Abl pro-B cell lines. The “DHFL16.1JH4” line was generated by transiently inducing RAG expression in v-Abl pro-B cell lines derived from Eμ-Bcl2 transgenic mice by arresting them in G1 for 4 days by treatment with 3 pM STI-571 (Hu et al., 2015). Single cell clones were screened for VHDJH and DJH rearrangements first by PCR using degenerate VH and D primers together with a JH4 primer (Guo et al., 2011) and subsequently confirmed by Southern blotting to isolate the parental DHFL16.1JH4 line (See
All mutant lines analyzed in this study (except those shown in
Bone marrow pro-B cell purification. Single cell suspensions were derived from bone marrows of 4-6 weeks old mice and incubated in Red Blood Cell Lysing Buffer (Sigma-Aldrich, #R7757) to deplete the erythrocytes. Remaining cells were stained with anti-B220-APC (eBioscience, #1817-0452-83), anti-CD43-PE (BD Pharmingen, #553271), and anti-IgM-FITC (eBioscience, #11-5790-81) antibodies for 30 minutes at 40 C. Excess antibodies were washed off and B220+CD43highIgM-pro-B cells were isolated (Guo et al., 2011) by FACS sorting using a BD FACSARIA™ III cell sorter.
HTGTS-Rep-Seq to determine VH utilization frequencies. HTGTS-Rep-Seq was performed and data were analyzed with all duplicate junctions included in the analyses as previously described (Hu et al., 2016). Briefly, 2 μg of genomic DNA from sorted mouse primary pro-B cells or 50 μg of genomic DNA isolated from v-Abl lines following 4 days of G1 arrest by treatment with 3 pM STI-571, was sonicated for 25 seconds ON and 60 seconds OFF for two cycles on a Diagenode Bioruptor™ sonicator at low setting. Sonicated DNA was linearly amplified with a biotinylated JH4 coding end primer that anneals downstream of the JH4 segment. The biotin-labeled single-stranded DNA products were enriched with streptavidin C1 beads (Thermo Fisher Scientific, #65001), and 3′ ends were ligated with the bridge adaptor containing a 6-nucleotide overhang. The adaptor-ligated products were amplified by a nested JH4 coding end primer and an adaptor-complementary primer. The products were then prepared for sequencing on Illumina MiSeq™ platform after tagging with the P5-I5 and P7-I7 sequences (Hu et al., 2016). Junctions were aligned to AJ851868/mm9 hybrid genome by combining all of the annotated 129SV Igh sequences (AJ851868) and the distal VH sequences from the C57BL/6 background (mm9) starting from VH8-2 as described in Lin et al., 2016. The sequence of the JH4 coding end primer used for making HTGTS-Rep-Seq libraries is listed in Table 4. In primary pro-B cells, our assay recovers D-to-JH4 as well as VH-to-DJH4 junctions; whereas in the DHFL16.1JH4 rearranged v-Abl pro-B lines, we recover VH to DHFL16.1JH4 rearrangements using the JH4 baiting primer. In the DHFL16.1JH4 lines, this primer also amplifies across JH4 on the pre-rearranged VHDJH3 rearranged non-productive allele (
As our experiments are done in G1-arrested cells, all de novo rearrangements should represent unique events. However, rearrangements at low but variable levels can occur in cycling v-Abl lines and can be well above background in some sub-clones (e.g. Alt et al., 1981) Therefore, after each HTGTS experiment, data were analyzed for high levels of recurrent Igh V(D)J junctional sequences suggestive of a pre-rearranged V(D)J rearrangement that likely occurred in cycling cells during culture. Then, if necessary, experiments were repeated on additional sub-clones that lacked evidence of obvious pre-rearrangements.
For statistical analyses, each HTGTS library plotted for comparison in a figure panel was normalized for by random selection of the number of junctions recovered from the smallest library in the comparison set. While normalization was done for statistical comparison, we note that relative VH utilization patterns were essentially same in normalized and un-normalized libraries. The numbers of junctions used for normalization of IGCR1del or 101-kb intergenicdel experiments was much higher than those shown for panels comparing WT and other mutant backgrounds due to the greatly increased levels of VH to DJH junctions recovered upon IGCR1-deletion or 101-kb intergenic deletion as described in main text and shown in
For v-Abl lines, the same WT data is shown in
The non-productive fraction of VHDJH reads obtained from C57BL/6 pro-B cells shown in
3C libraries were generated as previously described (Splinter et al., 2012; Stadhouders et al., 2013). Briefly, 10 million cells were cross-linked with 2% (v/v) formaldehyde for 10′ at room temperature, followed by quenching with glycine at a final concentration of 125 mM. Cells were lysed in 50 mM Tris-HCl, pH 7.5, containing 150 mM NaCl, 5 mM EDTA, 0.5% NP-40, 1% TritonX-100 and protease inhibitors (Roche, #11836153001). Nuclei were digested with 700 units of NlaIII (NEB, #R0125) or MseI (NEB, #R0525) restriction enzyme at 370C overnight, followed by ligation under dilute conditions at 160C overnight. Crosslinks were reversed and samples were treated with Proteinase K (Roche, #03115852001) and RNase A (Invitrogen, #8003089) prior to DNA precipitation. The 3C libraries were sonicated for 25 seconds ON and 60 seconds OFF for two cycles on a Diagenode Bioruptor™ Sonicator at low setting. LAM-HTGTS libraries were then prepared and analyzed as described in “HTGTS-Rep-Seq to determine VH utilization frequencies” section (see also Hu et al., 2016) and data was aligned to AJ851868/mm9 hybrid genome as described in Lin et al., 2016 with an additional modification in which Chr12 coordinates from 114671120 to 114734564 in the AJ851868/mm9 hybrid genome were replaced with CCCCT to incorporate the DHFL16.1 to JH4 rearrangement for aligning data obtained from the DHFL16.1JH4 rearranged v-Abl pro-B lines. When using the iEμ bait, we also detected interactions with distal regions beyond VH1-2P in the DHFL16.1JH4 rearranged v-Abl pro-B lines due to close linear juxtaposition of this region to iEμ owing to the VHDJH rearrangement of VH1-2P on the non-productive allele. These interactions were not detected in the unrearranged v-Abl pro-B lines or primary pro-B cells as evident from data deposited in GEO database. The primers used for making 3C-HTGTS libraries are listed in Table 4. Data were plotted for comparison after normalizing junction from each experimental 3C-HTGTS library by random selection to the total number of genome-wide junctions recovered from the smallest library in the set of libraries being compared. However, chromosomal interaction patterns were very similar in normalized and un-normalized libraries.
Electrophoretic Mobility Shift Assay (EMSA). EMSA was performed with oligos (shown in
ChIP-seq, CTCF and Rad21 ChIP-seq data were extracted from Choi et al., 2013 (GEO: GSE47766). Pax5 and YY1 ChIP-seq data was extracted from Revilla-I-Domingo et al., 2012 (GEO: GSE38046) and Medvedovic et al., 2013 (GEO: GSE43008), respectively. The ChIP-seq data were re-analyzed by aligning to mm9 and ChIP-seq peaks were called using MACS with default parameters (Zhang et al., 2008).
An unpaired, two-tailed Student's t-test was used to determine the statistical significance of differences between samples, ns indicates p>0.05, * p≤0.05, ** p≤0.01 and *** p≤0.001.
The Gene Expression Omnibus (GEO) accession number for the datasets reported in this paper is GEO: GSE113023. Specifically, the accession numbers for pro-B-HTGTS-Rep-Seq, DHFL16.1JH4-HTGTS-Rep-Seq and 3C-HTGTS datasets reported in this paper are GEO: GSE112781, GEO: GSE112822 and GEO: GSE113022, respectively GSExxxxx.
18 ± 1 4
87 ± 10
79 ± 28
aRefers to the total number of VDJH junctions to which each replicate library was normalized to, n ≥ 3 (see Figures for details).
These averages were derived from three or more independent libraries generated from at least two independently derived mutant clones (except for VH81X-CBEdel lines, see STAR Methods for details), which gave essentially indistinguishable patterns of VH utilization.
aAligned reads include all DHFL16.1JH4 reads as well as VH to DHFL16.1JH4 junctions
aThese libraries were pooled together and then 3,500 VDJH junctions were randomly extracted from the pool and treated as one library to calculate the average data
SEQ ID NO: 13. Sequences and deletion strategy for Mouse Cer/Sis element (-6.7 kb region on mouse chr6): CRISPR/Cas9-sgRNA1 (GCTCCTGAAGAGCTTAAGTT (SEQ ID NO: 49)) and CRISPR/Cas9-sgRNA2 (GAGGAATCTATGTCCTGGAT (SEQ ID NO: 50)) are depicted in bold font, with the PAM sites in italics. The ˜650 bp Cer (HS1-2) (bp 860-1529 of SEQ ID NO: 13) and 3.7 kb Sis (HS3-6) (bp 3562-7288 of SEQ ID NO: 13) elements are underlined with single and double lines, respectively. Shown in double parenthesis in the following sequence are, in order, i) the CBE1 of Cer element (reverse orientation, pointing to Vk segment), i) the CBE2 of Cer element (reverse orientation, pointing to Vk segment), iii) the CBE1 of Sis element (Sense-strand orientation, pointing to Jk segment), and iv) the CBE2 of Sis element (Sense-strand orientation, pointing to Jk segment)
AGAGCTTAAGTT
TGGAGGAGTGTGTCCT
TATGGGGGTGGAGACCTGGCCAAATTGAGATCTCTGCTTTTGTTTGCAGGACAGTTCTGTGA
CCCATGACTGGGCCTCTGTAGACTTGCCGCTTATACAA((CACTGCCATCTGCTGATAC)AGC
ATTAGCACCCTGACTTGCTCTGGTGATAAACTGGAGGCACTGTGAGATCATTTCCTTGTCACT
GTTTCCTGTGCCACACCCATTCATATGTACTAGAAATAGTCTGAGAAGAAAAAGACGTTCAG
ATAGGAAGGGAGCATGTAATGTACCTATATATCTACATAGATACTTACTCAAGGGGAGGGA
GGGTTTGGTGTGTGTGTGTGTGTATCTCCCGTGCACACACACACACACGAAAAAGTTGGAGA
GGAAAGATTTTTTTTTTAAACAACAGTCTGATCCCATTATAGAGGTGGAGACCTGACAAGAT
TCAGATCACTGGCTTTGTTTGCAGGCCAGCTCAGTGACCCATGATCGGGCCTCCGTAGGCTC
ACTGCTTATACAG((CACTGCCATCTGCTGACAC))AGCTTCTCTGTTGACACAGCTTCTGCCCC
CTGCCATGCTCAGATAATGAGCTGTTCATTGGCTCTGTGAGATCGATTCCTTTTCATTGCTTT
CATTTTTGATATCTAAACAATGTTTCTACAATTCAGAGACACAAACAAATTGTATAAATAAC
CCTTGTATGGTGATGACATAAACGAACCATTGATGACTGAGCACTCAATCATCCCTTATTTTC
AGCGCTTTGACCCGTTATAAGTCTCTGCATAAAACACACACACACACACACACAAGGGAGG
AGGGTCTGAGCAACACTAATCTATAGGTTTTCAACCTAGCAATATTTACAAGGCCATTCAAA
AAACAAACCATGTCCATTTAGTGAAACAACAGCAGTAAGTTTCCCACTAGGGCCTTTAACCA
CACCCCCATAAGCCTTTAACCAGGTTTACAGTAGCAGGAATGAATACTGTGGAGTGGACCTC
AAACCCAATCAGAAAATGGTTGGTTACCCCCATTCTAGCCATACCACTATTGCTCCAGTGGG
CATATCTTACCTGCCGTGGGGTCGATCCAATAGGGCACAGGGTCTATAGCTAAATGCAACTG
CTGACTAGCTCCTTCACCCAGAAGACTGTGCACCATCGTCTGGCACTGTGAAATCCACCCAG
CTGAAAGGAAGCCTCCAATCGGCTCCATCTGGGCTTTTCCATGGTCTACAACCAGGAGCATG
GTGTATCCAGCAATAGGGTCTTAGCATCTAGAAATTAGCTATGGTAATTGCCTATATTGTTTG
AAGGACCTTAAGGACCTCAATGACCAACATATAGCATGGAATCTCACCCCTGGCACCAGGA
TTTTTATTTAATAACCTATGGCTTCCGGGAGCAGCTTTATCCTCCCATGCAGGGTACCTCACA
CCAACTCCTTTTTATTAATTGTACGTTAAATCACTTGCAAAGTAGTATTCTTCCTTACGGCTTT
TTCATGCACCCTCACGCAGCTTTGAATGGCCATCTCTCCCCCCTCTCCTCTTTCCCATTTTTCT
GCACTACATTCCTACTTCCTACAAAATTTGTCCTAAACAGTTTTTCCTTTCTCTCCACCATTGT
CCCTTCCAAGATTCCTAGATTTTGGTTACCTTAAATGCCAACAAGGTACAATTTTCAGAGGCT
TTACAGTAACAGAAAAAAAAGAGGTTACAAGGTACTTTTCAAATTTATTGATGGGCACAGG
AGTGCAGGTTAAAGCAAAGTGGGGAACCTCTGCTACAGACCTCGGATGCTATCTGACGGTCC
CAGTGTTTGCCGTGAGGATGCTGCTCGGCCAACAACTCAAGTCAGGATGAGTTGGGATCTGT
TCTTGTATTCCAAAGGATTTACCTAACAGTCACAAAGATGATAGGTCACAGACGGCAGTAAA
TGGCCTCAAGTAGCAGTTAA((TGGCCACTTGAGGGAGCTA))AAGATAACTTGTCTCTGGGCC
TGCACAGATTCCACCCCTCCACAGTCACTGAAGTTCTTTATTATCATTATTGTTGTTGCTGCT
GTTGTTGTTGTTGTTTTATATCCATAAATGTTGCCGCCCCCGCCCCCAGCCTCCCTTTGCAGA
TTTCTTCCCCAACACCCCCTTAGCTTCTAAGAGGGTACCCCCACCCCAGTCATCCCGCTTCCC
TCAGGCATCAAGTCTTTACAGGATTAGCTCATGCTCTCCCACTGAAGCCAAACAAATATGTC
TGCTACATATGTGCGGGGGAGGACGGGGGGGGGGGGGGGGACTCGGACCAGCCCATATATG
CTCTTTGGTTGCTGGATTTCTCTCTGGGAGCTCTCAGGGGTCTGGGTTAGATGACACTCTTCA
TCTTCCTATAGAGTTGCCATCCCTTCCTTTCCTTCAGTCCTTCCCCTAAGTCCTTCCCCTAACT
CTCCCATAGGGGTCCCCGACTTCAGTTCAATGGTTGGCAGTAAATATCTGTCTCAGTCAAGG
GCTGGTAGAGCCTCTCAGAGGACAGTCATCCAGGCTCCTGTCTGCAAGCACAACGTAGCATC
AATAATGGCGTCAGGGTTCTAATCGGTGATTCAGCCTTTGTAAAGTGGTCAACGTAAGGTGC
AGGTTCTTGGGGAGGGACTTGAAGGGGACACGAGGACTTTAATTCACATGGATAAAATAGA
AGACTGCCTCTATGAGAAAGGTGAGTCTGTGGACTAAATGGATTCTTTCCCGCAGAGAGAAA
TAGAGGAAGAATTTCAGATGCTCATTTTAAAGATAAAAGAATACTTGAAAAGAAGGGGGGG
TGGGAGGAAAGTATGACAGAGAAATCAGCTAAATGCTGCCCCCAGCTTACACTTCCTTAGA
AGGGAAAGGGAAGGGAAAGCTACTCCTGAAAGAAAAGCTAACCGAAGCAGAGCAGTCCCA
CCCTCAAGACAGGCACAGAGCTAGCTCTCACATGCTAAAGTACAGATGCAGAAACCTCTTGC
ATTGGGATCAGCCTTGGATAAAAATAAGTCGGTGAAAGACAGACTGCAAAGCTCAATG((TG
GCCAGCAGAGGCCCCTA))GTCAGCAACAAGGAAAACTCTCACGCTAACCAGACAAACAATA
CAGACTCAGCAAAAACATAAACGGAAGGATGTGCCCACAAGTTCACCTGACCCTCTTCCTCC
GTGAGTGTGCTTTTCTGAAGAGGCAGCTCCAACACTGCCTCACATCTTCCTCTCTATTGTTTT
CTTTGTGTATTCCCCCACAATACTCGCTTAGCAGGATTTTTACTGTATGTATTTGGGGTGGAT
ATATGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGTGCATGTGTGTGTG
TGTGTGTGTGTGTGTGTATTGTTATCTCTTTTCATACAATCATTTAATTTTGTTCGTGCGTTTT
TTCAGTTTAGAGCAGGTTTTTTTCCTTAGTTTCTTTCTTTTTTCCTTGTTTACTCCTGTGTCCCT
TACACATACACACACGCACACACACACACACACACACGTATTCATACTTCTAATTGTTTTAT
ACTTTTCTTAAGTTTTACCTTTTTCTCTTCTAGTTTTTGGTTGTCAACGCTTTCATTATTGTTAA
GTCTTTTTTTTCCCTACTTTTCCTTTTTCCCAAGTCTAGAAAAAGAAACAGACAGTGAAATAA
AAAAGGAATTGAGAACTCTAAACAGACTTCACAAGAGAAAATCCTCTTCACTCCATTTTATA
ATCAGAAAATTAAAAAAAAAAAATGTTAAGCAAAGAACAAATGTTAGGAACCGTAGGGGG
ACACCAGCTCATGCACGGGACACAAATTCCAGAGCACACAAATCCTCCCCTCTGCGGTCCTA
AAAGCCAGGAAAGTACGAAATGATGCCCTTCAATTCGGAAAGTAAATACCATCTAACCACG
CTTTAAATTGATAGCAAAGCTACTCGTGTAACAAGCAATCTATAAGTGAGTTCGTGACTGCC
AAGACTACACTACAAGATAGTTTTTAAAATTCTTTACAGAAAAGGAAGAATGACACCGGCC
ATCACAGAGGCACAGGAAAGACTGGGTTTCATGA
TGGGAG
AAATAACGTGTGAAATGT
TGTACTAAAAAACACAATCTTCAGATTTAATGCTATCACCATCATGGTTCTAGTGACATTCTT
TACAGAACTAGAAAGATAATACTAAGCTTGTATGGAAACACAAAAGACCATGAGTAGGCTA
AGCAGGACAAACCCTACATCACATCACATCACATCACATCACTGAAACTCACATTATCCCAC
ATAAGTGTAATGACAAAGAGAGTAAGGTGCATCCACAAAAGCAGACAGAGAACAATGAAC
TGGAAGAGGGCCCATAACCTGCACTTCTGTGGA...Jks
Current vaccine strategies to elicit the most effective broadly neutralizing antibodies (bnAbs) against HIV-1 are based on sequential immunizations with separate immunogens that target B cells expressing precursor and intermediate forms of the bnAb. Mice expressing human bnAb precursors have been used to assess the preclinical efficacy of candidate immunogens. Commonly used mouse models generated via conventional germline human IgH and IgL variable region exon knock-in technologies have well-known limitations related to the production of a monoclonal set of primary B cells. To avoid this issue, a recent study utilized mice engineered to contain fully human immunoglobulin (Ig) variable region loci that can generate complex primary B cell receptor (BCR) repertoires by V(D)J recombination. However, due to the relatively small size of the mouse B cell compartment, the BCR repertoire of such mice is far smaller than that of humans and, correspondingly the chance of generating B cells expressing an appropriate bnAb precursor is far lower than in humans. To circumvent the shortcomings of these mouse vaccine models, we have described a new type of mouse vaccine model for the potent VRCO1 class of HIV-1 bnAbs, based on a strategy that allows the precursor human Immunoglobulin heavy (IgH) chain variable region exon for this bnAb to be developmentally assembled via V(D)J recombination and to dominate the IgH repertoire of the mice. In this VRC01-rearranging model, most individual B cells express one of a multitude of different variations of the potential VRCO1 precursor IgH chain, providing much more human-like precursor VRC01 repertoire. Indeed, sequential immunization induced affinity maturation of VRC01-type HIV-1 neutralizing antibodies in the VRC01-rearranging mouse model, although it did not achieve fully mature VRC01-class bnAbs (Tian et al., Cell, 2016).
Described herein are even more physiologically relevant mouse models for, e.g., testing candidate HIV-1 vaccine strategies and for disovering/optimizing humanized antibodies. A strategy related to that of the VRCO1 IgH chain rearranging model is utilized to engineer a mouse model that generates highly diverse IgL chain repertoires of potential VRC0101 precursors. When combined with the VRCO1 IgH rearranging model, the IgL rearranging model generates extremely diverse primary BCR repertoire of VRCO1 precursors in mouse for testing immunization strategies to elicit VRC01-class bnAb. A model is provided herein in which expression of bnAb affinity maturation intermediate is targeted specifically to mouse germinal center B cells. This approach, which expresses bnAb intermediates at a physiologically relevant stage, while avoiding potential central or peripheral tolerance checkpoints, is especially important for testing boost immunogens in sequential vaccination strategies.
Because these models can be produced via the rapid RAG-2 deficient blastocyst complementation (RDBC) technology, cohorts of mice can be produced more quickly than with conventional germline breeding.
Mouse models expressing bnAbs or their precursors are commonly used as assay systems to test and optimize immunogens at the preclinical stage (1). To generate such mouse models, one approach has been to integrate pre-rearranged V(D)J exons encoding the IgH or IgL variable regions of presumptive unmutated common ancestor (UCA) of bnAbs into the endogenous mouse JH or Jκ loci. Models made by this conventional “knock-in” approach has several limitations, including:
As an alternative to the knock-in mouse model approach, recent studies employed mice with fully human Ig variable region loci, such as Kymab mice (13), that can generate more complex primary antibody repertoires. However, as mice have far fewer B cells than humans, the actual antibody repertoire in such humanized mice is far smaller than the typical human counterpart. For this reason, the chance of finding a specific bnAb precursor in such Ig-humanized mice is substantially lower than in humans. Thus, when candidate immunogens are tested in these mice, it is difficult to interpret negative outcomes, which could be ascribed either to ineffectual immunogens or to the lack of B cells expressing the relevant antibody at the time of immunization.
To address the limitations of the mouse HIV-1 vaccine models discussed above, described herein is a new type of mouse vaccine model for the potent VRCO1 class of HIV-1 bnAbs, based on a strategy that allows the precursor human Immunoglobulin heavy (IgH) chain variable region exon for this bnAb to be developmentally assembled by V(D)J recombination and to dominate the IgH repertoire of the mice (6). In this VRC01-rearranging model, most individual B cells express one of a multitude of different variations of the potential VRCO1 precursor IgH chain, providing much more human-like precursor VRCO1 repertoire than other types of mouse models described above. Indeed, sequential immunization induced affinity maturation of VRC01-type HIV-1 neutralizing antibodies in the VRC01-rearranging mouse model, although it did not achieve fully mature VRC01-class bnAbs (6). Described herein is the development of an even more physiologically relevant mouse model for testing candidate HIV-1 vaccine strategies.
One model is based on the general strategy, which was employed for the VRCO1 IgH chain rearranging model, to engineer a mouse model that generates highly diverse IgL repertoires of VRCO1 precursor antibodies. When combined with the VRCO1 IgH rearranging model, the IgL rearranging model will generate extremely diverse primary human BCR repertoires of VRCO1 precursors in mice for testing immunization strategies to elicit VRC01-class bnAb.
The second model involves the targeting of human bnAb affinity maturation intermediates specifically to mouse germinal centers B cells. This approach, which will express bnAb intermediates at a physiologically relevant stage, while avoiding potential central or peripheral tolerance checkpoints, will be important for testing boost immunogens in sequential vaccination strategies.
Finally, these models can be generated with RAG-2 deficient blastocyst complementation technology (14), which obviates the lengthy and costly process of germline breeding and which permits supply of mouse models in a timely manner.
Current vaccine strategies to elicit broadly neutralizing antibodies (bnAbs) against HIV-1 are based on sequential immunizations with separate immunogens that target B cells expressing precursor and intermediate forms of bnAbs, respectively (1-5). Described herein are novel and effective mouse models to test and optimize such sequential immunization protocols for eliciting the potent VRCO-1 class of HIV-1 bnAbs (6-9).
Each immunoglobulin heavy (IgH) or light (IgL) chain variable region contains three complementarity determining regions (CDRs) that are particularly important for antigen contact (10). CDR1 and CDR2 are encoded in each germline variable region segment (V) and are unique to each of the multiple germline IgH and IgL V segments. CDR3 is assembled at the junction of IgH V, D, and J segments or IgL V and J segments, in association with non-templated de novo junctional diversification mechanisms such as N region additions by TdT (11, 12). For this reason, CDR3 represents by far the most diverse portion of antibodies.
VRC01-class bnAbs target the CD4-binding site of HIV-1 Envelop (Env) protein and use exclusively the human IgH VH1-2 segment (6-9). In this regard, the germline VH1-2 encodes sequences that allow it to mimic CD4 interaction with gp120. In this unusual mode of antigen interaction, the VH1-2 accounts for nearly 60% of the interface of VRCO1 bnAbs with gp120 (7). In contrast, interactions of most other types of HIV-1 bnAbs with Env epitopes rely heavily on a unique, and in many cases exceptionally long, IgH chain CDR3s (CDR H3) (13).
In this regard, while VH1-2-based Ig heavy chains are quite common in human antibodies (14, 15), only a small number of individuals may harbor antibodies with the unusual de novo generated CDR H3 found in these other types of HIV-1 bnAbs. Thus, elicitation of VRCO1 class antibodies may be more probable in human populations than elicitation of other types of bnAbs. However, VRCO1 antibodies also require Ig K light chains with an unusually short 5-amino acid CDR L3 (6-9). Moreover, three VK segments (VK3-20, Vk3-11 and Vk1-33) are primarily involved in coding for VRCO1 Ig light chains, apparently because the short CDR L1 of these Vk segments can more easily accommodate glycans that shield the CD4 binding site. CDR H3, although not strictly conserved, also influences the function of VRCO1 antibodies (16).
The various restrictions outlined above reduce the pool of potential VRC01-like precursors to just a small subset of total human antibodies that employ VH1-2. Indeed, the frequency of human B cells expressing VRC01-like precursor antibodies was estimated to be about 1 in 2.4 million (17). Adding to the difficulty in their elicitation via immunization strategies, mature VRC01-class bnAbs exhibit a massive level (up to 40% of nucleotides) of somatic hypermutations, some of which are required for neutralization breadth and potency (6-9, 18). To elicit VRC01-class bnAbs via sequential immunization, priming immunogens have been designed to selectively activate the rare B cells expressing potential VRC01-like precursor antibodies (3, 4, 19). Following priming, a series of boost immunogens has been designed to gradually mature the precursor antibodies, through intermediate stages, and onward toward the high mutated mature VRC01-class bnAbs (20, 21). To facilitate the testing of such complex immunization strategies, we recently developed a new type of mouse vaccine model for VRC01-class bnAbs, based on a strategy that allows the precursor human IgH variable region exon for this bnAb to be developmentally assembled by V(D)J recombination and to dominate the mouse IgH repertoire (21). In this “VRCO1-IgH rearranging” model, most individual B cells, due to de novo CDR H3 diversification, express one of a multitude of different variations of the potential VRCO1 precursor IgH chain, providing much more human-like precursor VRCO1 repertoire than conventional transgenic mice that express a pre-rearranged germline-reverted VRCO1 IgH chain. Indeed, sequential immunization of this VRCO1 IgH rearranging mouse model induced affinity maturation of VRC01-type HIV-1 neutralizing antibodies, although it did not achieve fully mature VRC01-class bnAbs (21).
Described herein are two aims that are focused on developing two types of even more physiologically relevant mouse models for testing candidate HIV-1 vaccine strategies to elicit VRC01-class bnAbs. In a third aim, described is the use of the RAG-2 deficient blastocyst complementation (RDBC) approach (22) to rapidly generate cohorts of the existing VRCO1 model and new models.
Aim 1. Generation of VCR01 mouse models with diverse bnAb IgH and IgL precursor repertoires.
Our design for the prior VRC01 vaccine mouse mdoel was based primarily on the finding that rearrangement of the most D-proximal mouse VH gene segment (CH81x) is under the control of a regulatory element referred to herein to as intergenic control region 1(IGCR1)(23). When IGCR1 is inactivated, VH81X is used in most VH to DJH joining events, despite integrity of the remaining IgH locus (23, 24). Thus, when human VH1-2 was substituted for mouse VH81X and deleted IGCR1 on the same IgH allele in mice (
In immunization experiments with the VH1-2 rearranging model, affinity maturation of the VH1-2-based Ig heavy chain, which accounts for the bulk of antigen contact, was focused on. For this reason, described herein is a model that expressed a pre-rearranged version of the germline-reverted VRCO1 Ig Vκ 3-20 light chain (
The mouse Igκ repertoire shows relatively limited junctional diversity (e.g. N regions) compared to that of IgH, potentially due to lack of TdT expression in mouse pre-B cells in which Igk rearrangement occurs (26,27). In this regard, it has also been shown that certain dendritic T cells subsets that develop in the absence of TdT form repetitive (“canonical”) V(D)J junctions mediated by local micro-homologies (28). In contrast, data provided herein (and elsewhere) indicate that the human Igk repertoire exhibits evidence of substantial junctional diversification in CDR3, confirming prior observations made with a more limited data set (29).
It is contemplated herein that TdT expression in human pre-B cells may be responsible for increased CDR3 junctional diversification of the human Ig light chain repertoire than that of the mouse counterpart. Consistent with this hypothesis, constitutive expression of TdT throughout B cell development in a transgenic mouse led to evident N-nucleotide addition in CDR3 of mouse Ig light chains (30). To further humanize the Ig light chain repertoire in the VRCO1 mouse model, a TdT transgene driven by CD19 promoter can be introduced to the VRCO1 Igk rearranging mouse model. HTGTS-rep-seq assay can assess Igk CDR3 junctions in the Igx-rearranging model with or without enforced TdT expression and the levels and types of junctional diversifications compared to those found in human Igx repertoires. If enforced TdT expression does indeed generate a more human-like diverse Igk repertoire, this component will be built in as a feature of humanized Igk rearranging VRCO1 model to permit the mouse model to generate a Igk repertoire more representative of that of human B cells.
Prior VRCO1 models either rearrange mouse IgL chains or have a knock-in pre-rearranged human germline-reverted (gl) VRCO1 light chain. The fixed gl-VRCO1 light chain facilitates the initial testing of immunization strategies, but does not represent a physiological setting. On the other hand, the model without the gl-VRCO1 light chain relies on mouse Ig light chains, in association with the human VH1-2 heavy chain, to reconstitute VRC01-like antibodies. Although mouse Ig light chain rearrangements can also generate the signature 5-amino acid CDR L3s, other aspects of the human Ig light chain may also be important for the function of VRC01-class antibodies. Presumably for this reason, most VRC01-class bnAbs use human Vk3-20 and Vk1-33, which lack close mouse homologues.
These concerns are addressed herein by expressing diverse repertoires of both VH1-2 heavy chain and Vk3-20 and Vk1-33 Ig light chains. Thus, the new model represents a substantial improvement over prior models. As discussed above, this new model should also be superior to Kymice or similar Ig-humanized mice, because it is expected to contain a much higher frequencies of B cells expressing relevant IgH and IgL chains and, thus, a more “human-like” primary repertoire for the VRCO1 lineage. Also incorporated is enforced TdT expression to generate a more human-like diverse IgL chain CDR3 primary repertoire
In the design of this new IgH and IgL rearranging VRC01 mouse model, endogenous mouse D and J segments are employed. In this regard, the mouse JHs are very homologous to human JHs. For the VRCO1 lineage of antibodies, the human JH2 provides a key tryptophan residue to CDR H3 (16, 31). Mouse JH1 is homologous to human JH2 and contains the analogous tryptophan residue. Indeed, when the VRCO1 mouse model with mouse JHs was immunized with immunogens designed to elicit VRC01-like antibodies, all the HIV-1 neutralizing antibodies utilized mouse JH1 and contained the signature tryptophan residue in CDR H3 (21). This result indicates that a model with mouse JHs heads us down the right path.
To further increase the frequency of VRC01-like precursors in the model, there is provided herein a mouse line that incorporates both human JH2 and VH1-2. (Alt lab, unpublished results).
It is hard to ascertain the identities of the germline D segments in VRC01-class antibodies, because the CDR H3 region is subject to both junctional diversification and extensive somatic hypermutation. Other than the tryptophan residue mentioned above, no other conserved features are discernable in the CDR H3 region of VRCO1 family members. It is possible that precursor antibodies with a variety of CDR H3s may potentially evolve into VRC01-like antibodies. In conjunction with junctional diversification, mouse Ds are expected to contribute similar levels of diversity to CDR H3 region as human Ds, and should create a large repertoire of VRC01-like precursor that would serve as relevant targets for immunogens. There is no strong conservation of JK usage among VRCO1 lineage antibodies. In addition, mouse Jks are almost identical to human Jks. However, human Jks could be easily added to the model if desired.
Aim 2. Mouse Models that Express bnAb Intermediates Directly in Germinal Center B Cells
B cells expressing certain bnAbs or their precursors tend to be deleted during B cell maturation in mice (32). To overcome this hurdle, we developed a conditional expression approach that confines bnAb expression to mature B cells, thereby circumventing the hurdle of tolerance control in bone marrow. In this approach, B cell maturation is driven by innocuous Ig heavy and Ig light chain variable region exons, which are termed “driver Ig genes” (
It is contemplated herein that this conditional expression technology can be adapted to express both the Ig heavy and Ig light chains of affinity maturation intermediates of bnAbs by employing a conditional expression cassette in which cre expression is driven under the control of a germinal center-specific promoter (
Thus, immunization with HA antigen should induce germinal center reactions, during which deletion of the driver V gene will lead to the expression of the V(D)J exon encoding VRCO1 intermediate target antibodies. The survival and maturation of the nascent germinal center B cells that express bnAb intermediates will depend on antigens that can interact with their BCR. Thus, in some embodiments, the boost immunogen can be administered together with the HA antigen so that it will be available to stimulate affinity maturation in germinal center B cells that have switched on the expression of a given bnAb intermediate target antibody. Besides HA, antibodies against NP (B1-8) can be used as the driver, and in this case, immunization with NP will induce GC reaction and the expression of target antibody in GC B cells.
An alternative possibility will be to use the germline VRCO1 antibody as the driver V gene, in which case the mice will be immunized with a mixture of prime and boost immunogens. The prime immunogen will initiate germinal reactions and activate the expression of intermediate antibody. Then, the stage would be set for testing the boost immunogen. After this round of boost immunization, the memory B cells from the germinal center reaction will serve as targets for further boost immunizations.
The germinal center-specific expression model permits evaluation of boost immunogens in several respects. For example, it can be tested whether the immunogen can effectively promote somatic hypermutation of bnAb intermediates, recruit T follicular helper cells (Tfh) to the germinal center reaction, and favor memory B cell development over terminal differentiation to plasma cells. If bnAb maturation is accompanied by the acquisition of poly-reactivity or auto-reactivity, the model would also provide an opportunity to study the fate of such affinity maturation intermediates in germinal centers. The evolution of UCA to mature bnAb will involve many intermediates. For initial studies, the most potent VRC01-like neutralizing antibody isolated from our previous immunization experiments (21) can be used as the intermediate antibody in the system. Further intermediate antibodies of interest can be incorporated as desired.
There have been several examples where tolerance control mechanisms hinder the expression of bnAbs or their precursors, as shown in mouse models for 2F5, 4E10 (MPER bnAb) (39-41) or 3BNC60 (CD4 binding site bnAb) (42) and our own unpublished data on mouse models for DH270 (V3glycan bnAb) (35) and VRC26 (V1V2 bnAb) (34). Given these precedents, it is possible that expressing affinity maturation VRC01bnAB-lineage intermediates, or other desired antibodies we would seek to optimize via the conventional transgenic knock-in approach may run into similar roadblocks. Moreover, central and peripheral tolerance control mechanisms normally target precursor and naïve B cells, respectively. Since affinity maturation intermediates arise from GC reactions, they would not be subject to these checkpoints under physiological conditions. Thus, expressing intermediates with the conventional knockin strategy essentially imposes non-physiological restrictions on these antibodies. The present GC-specific expression strategy is specifically designed to address this issue. With the conventional knock-in approach, intermediate antibodies are expressed in naïve B cells. In contrast, in a normal immunization setting, memory B cells expressing intermediate antibodies are the physiological targets of boost immunization. Since naïve B cells and memory B cells can differ in their immune response, constitutive expression models of intermediate antibodies may not provide accurate assessments of boost immunogens. By expressing intermediate antibodies in germinal center B cells, a subset of which can differentiate into memory B cells, the most relevant setting is recreated for boost immunization.
To ensure an efficient supply of the existing and new mouse models for these immunization experiments, the Rag2-deficient complement (RDBC) system can be used to generate the mouse models in the context of chimeric mice (22). In this approach, the genetic modifications are introduced into ES cells which is injected into Rag2-deficient blastocysts to generate chimeric mice. Because Rag2 is essential for V(D)J recombination, all the B and T cells in the RDBC chimeras are derive from the injected ES cells. As already shown, such chimeric mice can be used directly for immunization experiments (21). The RDBC approach obviates the need for lengthy and costly breeding involved in conventional germline transmission; the advantages of this approach is especially obvious in the context of eliminating years of breeding to generate mouse models involving multiple genetic modifications, such as those proposed herein. As with the initial VRC01 model, the chimeras will also be bred for germline transmission Summary and Discussion.
Described herein are two types of mouse models that facilitate the development of sequential immunization approaches for the generation of HIV-1 vaccines. The rearrangement model described in Aim 1 can be used to test both priming and boosting steps of the immunization protocol, whereas the germinal center-specific model in Aim 2 would specifically aid in studying boost immunizations, including testing strategies to circumvent potential roadblocks that may be incurred.
Relative to Kymab mice, our proposed mouse Aim 1 mouse model, which expresses VH1-2 and Vk3-20, Vk1-33 through rearrangements, is designed to have higher frequencies of VRC01-like precursors. We can use appropriate probes, for example eOD-GT8, to assess the frequency of VRC01-like precursors (17). If the mouse contains readily detectable precursors, but does not respond to test immunogens, the result would suggest that the immunogen is not acting as effective activator of target B cells. The advantage of these proposed methods, especially the scheme to express intermediate antibodies specifically at the germinal center stage, will be important for testing boost immunogens. In a conventional mouse model, negative results in boost immunization could have at least two potential interpretations. One possibility is that the previous immunization, for instance the priming step, failed to elicit the relevant intermediate antibody targeted by the boost immunogen. Alternatively, the B cells expressing intermediate antibody may have been generated, but did not respond to the boost immunogen. These two potential possibilities would point to different directions for the next steps.
The Aim 2 model can eliminate these potentially confounding ambiguities by producing a population of germinal center B cells expressing a defined affinity maturation intermediate. In this model, lack of response in boost immunizations can be firmly ascribed to ineffectual boost immunization. If a novel priming immunogen eventually works more effectively in Kymab mice or similar mouse models than eOD-GT8, the paucity of VRC01-like precursors in these mice likely may still pose a formidable challenge in the boosting step, as discussed above.
These models, with higher frequencies of relevant vaccine targets and/or more appropriate expression patterns, offer a more tractable system for immunization studies. If priming immunization with eOD-GT8 does elicit VRC01-like antibodies in humans during clinical trials, the next major challenge is to devise boost immunization strategies to mature the intermediate antibodies further toward bnAbs. Like the development of priming immunogens, such as eOD-GT8, the optimization of boost immunogens would also require iterative experimentation in animal models, and the proposed mouse models would be well suited for this purpose. The proposed strategies, either the rearrangement model or GC-specific expression model, permit mouse models expressing intermediate VRC01-like antibodies identified in clinical trials, and these mouse models can be used to test boost immunogens for the next steps.
We have employed a Cas9-gRNA based approach to dete the Sis/Cer elements of the Igk locus in a mouse v-Abl pre-B cell line that we can induce in vitro to undergo Igx V(D)J recombination. After control and Sis/Cer deleted v-Abl pre-B cells were induced to undergo V(D)J recombination of their endogenous Igκ locus, HTGTS-based high throughput V(D)J recombination assay (3, 4) was used to analyze the frequency with which different endogenous VK segments rearranged to a Jk4 bait sequence, This study clearly demonstrates that deletion of Sis/Cer element substantially increased the rearrangement frequency of the proximal Vx3-1, Vx3-2 and Vx3-3 segment (
To further address whether Human VKJK repertoires might show increased junctional diversity versus those of mouse VkJk repertoires, HTGTS-Rep-seq analysis (4) was performed on DNA from WT mouse IgM+ splenic B cells and human peripheral blood mononuclear cells (PBMCs) using a mouse or human JK1 bait as a primer. To obviate the possibility of influences of cellular selection, presented are results of outof-frame (non-productive) WJK junctions. This preliminary analysis, while limited to just one human sample, shows a markedly greater incorporation of P and/or N junctional elements into the human VKJK junctions versus the mouse VKJK junctions (
The conditional expression strategy has been employed to generate a VRC26UCA mouse model that activates expression of the VRC26UCA in peripheral B cells (
In addition to the VRCO1 and VRC26 mouse models described above, we have also generated, or are in the process of generating, multiple mouse models for other types of bnAbs against HIV-1 and influenza virus. For example, we have generated mouse models expressing two types of UCAs for DH270, which targets the V3 glycan epitope of HIV-1 Env (5). We also have generated a mouse model expressing the Ig heavy chain of DH511UCA, which recognizes the Membrane External Proximal Region (MPER) (6), and we are completing the model by incorporating the DH511UCA light chain. In addition, we are building a mouse model for CH235UCA, which targets the CD4-binding site in a similar manner as VRC01, but utilizes VH1-46 gene segment instead of VH1-2 (7). We are producing a mouse model for the 56.a.09 bnAb that targets the stem region of influenza HA antigen (8).
This application is a divisional under 35 U.S.C. of § 121 of co-pending U.S. Ser. No. 16/973,125 filed Dec. 8, 2020, which is a 35 U.S.C. § 371 National Phase Entry Application of International Application No. PCT/US2019/036321 filed Jun. 10, 2019, which designates the U.S. and claims benefit under 35 U.S.C. § 119(e) of U.S. Provisional Application No. 62/684,367 filed Jun. 13, 2018, the contents of which are incorporated herein by reference in their entireties.
This invention was made with government support under Grant Number AI020047 and AI100645 awarded by the National Institutes of Health. The Government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
62684367 | Jun 2018 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 16973125 | Dec 2020 | US |
Child | 18607961 | US |