The contents of the text file submitted electronically herewith are incorporated herein by reference in their entirety: A computer readable format copy of the Sequence Listing (filename: RBYC_024_01US_SeqList_ST25.txt, date recorded: Aug. 2, 2021, file size ˜393 kilobytes).
The CD25 protein is the alpha chain of the interleukin-2 (IL-2) receptor and is a transmembrane protein present on regulatory T cells, and activated T cells. In a normal state, regulatory T cells constitutively express CD25 and act to suppress the expansion of effector T cells. Regulatory T cells maintain the healthy state and inhibit effector T cells from reacting against self antigens or over-reacting to foreign antigens. In a normal, protective immune response, effector T cells multiply after contact with foreign antigen and overcome inhibition by regulatory T cells. In case of proliferative diseases, however, cancer cells may disable the healthy immune response by increasing the amount of regulatory T cells and thereby limiting the generation of effector T cells against them. Thus, there is interest in therapeutics for to alter the proliferation of CD25-expressing regulatory T cells, for example to dampen the immune system for use in cancer therapies. These therapeutics may include CD25-targeting antibodies.
CD25-targeting antibodies can be produced by immunization of animals using CD25 immunogens, however, current methods of developing CD25 immunogens often lead to unpredictable, undesirable characteristics, such as antibody promiscuity or low cross-reactivity across species.
Thus, what is needed in the art are new engineered polypeptides having structural and/or dynamic similarity to CD25 or portions thereof, for example engineered polypeptides designed to mimic epitopes outside the IL-2 binding site.
In one aspect, the disclosure provides an engineered polypeptide, wherein the engineered polypeptide shares at least 46% structural and/or dynamic identity to a CD25 reference target, wherein the CD25 reference target is a portion of a CD25 selected from CD25 residues 55-63, 13-20:127-132, 5-17, 5-11:156-163, 77-89, 147-157, 11-14, or 44-56.
In embodiments, the engineered polypeptide shares at least 60% structural and/or dynamic identity to the CD25 reference target. In embodiments, the engineered polypeptide shares at least 80% structural and/or dynamic identity to the CD25 reference target. In embodiments, the engineered polypeptide shares at least 80% sequence identity to an amino-acid sequence selected from SEQ ID NOS: 1-16. In embodiments, the engineered polypeptide shares at least 46% structural and/or dynamic identity to a CD25 reference target, wherein the CD25 reference target is a portion of CD25 selected from CD25 residues 55-63, 13-20:127-132, 5-17, 5-11:156-163, 77-89, 147-157, 11-14, or 44-56. In embodiments, the engineered polypeptide shares at least 80% structural and/or dynamic identity to the CD25 reference target. In embodiments, the structural and/or dynamic identity to the CD25 reference target is determined using the structure of CD25 deposited at PDB ID NO: 2ERJ, chain A. In embodiments, the engineered polypeptide comprises an N-terminal modification or a C-terminal modification, optionally an N-terminal Biotin-PEG2- or a C-terminal -GSGSGK-Biotin (SEQ ID NO: 846).
In embodiments, between 10% to 98% of the amino acids of the engineered polypeptide meet one or more CD25 reference target-derived constraints. In embodiments, the amino acids that meet the one or more CD25 reference target-derived constraints have less than 8.0 Abackbone root-mean-square deviation (RSMD) structural homology with the CD25 reference target. In embodiments, the amino acids that meet the one or more CD25 reference target-derived constraints have a van der Waals surface area overlap with the reference of between 30 Å2 to 3000 Å2. In embodiments, the CD25 reference target-derived constraints are independently selected from the group consisting of: atomic distances; atomic fluctuations; atomic energies; chemical descriptors; solvent exposures; amino acid sequence similarity; bioinformatic descriptors; non-covalent bonding propensity; phi angles; psi angles; van der Waals radii; secondary structure propensity; amino acid adjacency; and amino acid contact. In embodiments, the engineered polypeptide shares 46%-96% RMSIP or more structural similarity to the reference target across the amino acids of the polypeptide that meet the one or more reference target-derived constraints.
In another aspect, the disclosure provides a CD25-specific antibody comprising an antigen-binding domain that specifically binds a CD25 epitope selected from CD25 residues 55-63, 13-20:127-132, 5-17, 5-11:156-163, 77-89, 147-157, 11-14, or 44-56. In embodiments, the antibody competes for binding of CD25 with an epitope-specific reference binding agent, wherein the epitope-specific binding agent is IL-2, daclizumab, basioliximab, and/or 7G7B6. In embodiments, the antibody does not compete with an off-target reference binding agent, wherein the off0target binding agent is IL-2, daclizumab, basioliximab, and/or 7G7B6. In embodiments, the antibody has a koff of less than 10−2/s, less than 10−3/s, or less than 10−4/s, wherein the koff is measured using biolayer interferometry with soluble human CD25. In embodiments, the antibody has a koff of between 10−2/s 10−5/s, wherein the koff is measured using biolayer interferometry with soluble human CD25. In embodiments, the antibody has a KD less than 100 nM, less than 25 nM, or less than 5 nM, wherein the KD is measured using biolayer interferometry with soluble human CD25. In embodiments, the antibody has a KD between 100 nM and 1 nM, wherein the KD is measured using biolayer interferometry with soluble human CD25.
In embodiments, the antibody specifically binds cells expressing CD25. In embodiments, the antibody binds cells expressing CD25 with a mean fluorescence intensity (MFI) of at least 104 or at least 105. In embodiments, the antibody binds cells expressing CD25 with a mean fluorescence intensity (MFI) of between 104 and 106. In embodiments, the antibody does not bind CD25(−) cells. In embodiments, the antibody binds CD25(−) cells with a mean fluorescence intensity (MFI) of less than 10′. In embodiments, the antibody comprises the six CDRs of any one of Combinations 1-126 of Table 7D.
In embodiments, the antibody comprising six complementarity determining regions (CDRs) for any one of YU390-B12, YU397-F01, YU397-D01, YU398-A11, YU404-H01, YU400-B07, YU400-D09, YU401-B01, YU401-G07, YU404-C02, YU403-G07, YU403-G05, YU391-B12, YU400-A03, YU400-D02, YU392-A09, YU392-B11, YU392-B12, YU392-E05, YU392-E06, YU392-G08, YU389-A03, YU392-G09, YU392-G12, YU392-H02, YU392-H04, YU402-F01, YU389-B111, YU394-D08, or YU390-A11, as provided in Table 3A and Table 3B.
In embodiments, the antibody comprises a heavy chain variable region and a light chain variable region that each share at least 90%, 95%, 99%, or 100% sequence identity with the heavy chain variable region and the light chain variable region of YU390-B12, YU397-F01, YU397-D01, YU398-A11, YU404-H01, YU400-B07, YU400-D09, YU401-B01, YU401-G07, YU404-C02, YU403-G07, YU403-G05, YU391-B12, YU400-A03, YU400-D02, YU392-A09, YU392-B11, YU392-B12, YU392-E05, YU392-E06, YU392-G08, YU389-A03, YU392-G09, YU392-G12, YU392-H02, YU392-H04, YU402-F01, YU389-B11, YU394-D08, or YU390-All, as provided in Table 5. In embodiments, the antibody is a full-length immunoglobulin G monoclonal antibody. In embodiments, the antibody comprises single chain variable fragment (scFv) that share at least 90%, 95%, 99%, or 100% sequence identity with the scFv sequence of YU390-B12, YU397-F01, YU397-D01, YU398-A11, YU404-H01, YU400-B07, YU400-D09, YU401-B01, YU401-G07, YU404-C02, YU403-G07, YU403-G05, YU391-B12, YU400-A03, YU400-D02, YU392-A09, YU392-B11, YU392-B12, YU392-E05, YU392-E06, YU392-G08, YU389-A03, YU392-G09, YU392-G12, YU392-H02, YU392-H04, YU402-F01, YU389-B11, YU394-D08, or YU390-A11, as provided in Table 5.
In embodiments, the antibody is a human antibody. In embodiments, the antibody is a humanized antibody. In embodiments, the antibody is a chimeric antibody. In embodiments, the antibody comprises a mouse variable domain and a human constant domain. In embodiments, the antibody also binds cynomologous monkey CD25.
In another aspect, the disclosure provides a pharmaceutical composition comprising any antibody of disclosure and optionally a pharmaceutically acceptable excipient. In another aspect, the disclosure provides a method of treating a subject in need of treatment comprising administering to the subject a therapeutically effective amount of any antibody or pharmaceutical composition of the disclosure. In embodiments, the subject suffers from a cancer. In embodiments, the subject suffers from an autoimmune disease or disorder. In another aspect, the disclosure provides a method of depleting the number of regulatory T cells in a subject comprising administering to the subject a therapeutically effective amount of any antibody or pharmaceutical composition of the disclosure. In embodiments, the subject suffers from a cancer. In embodiments, the subject suffers from an autoimmune disease or disorder.
In another aspect, the disclosure provides a kit comprising the antibodies of any antibody or pharmaceutical composition of the disclosure.
In some aspects, provided herein is an engineered immunogen having at least 60% sequence similarity to a sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, and SEQ ID NO: 11. In some embodiments, the engineered immunogen has at least 80% similarity to the sequence. In other embodiments, the engineered immunogen has at least 90% similarity to the sequence. In certain embodiments, the engineered immunogen shares at least one characteristic with CD25. In still further embodiments, the engineered immunogen binds to an antibody of CD25. In some embodiments, the engineered immunogen has higher binding affinity to an antibody of CD25 at pH below 7.0, compared to binding affinity at pH between about 7.3 and about 7.5. In some embodiments, the engineered immunogen has higher binding affinity to an antibody of CD25 at pH between about 6.4 and about 6.6, compared to binding affinity at pH between about 7.3 and about 7.5.
In yet other embodiments, provided herein is a method of producing an antibody, comprising immunizing an animal with an engineered immunogen having at least 60% sequence similarity to a sequence selected from the group consisting of SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, and SEQ ID NO: 11; and producing an antibody. In some embodiments of the method, the antibody is an antibody to CD25. In certain embodiments, the antibody exhibits higher binding affinity for CD25 at pH below 7.0, compared to binding affinity at pH between about 7.3 and about 7.5. In still further embodiments, the antibody exhibits higher binding affinity for CD25 at pH between about 6.4 and about 6.6, compared to binding affinity at pH between about 7.3 and about 7.5. In some embodiments, the antibody does not block binding of CD25 to IL-2. In other embodiments, the antibody does block binding of CD25 to IL-2. The method of any one of claims 8 to 11, wherein the antibody does not block binding of CD25 to IL-2. In some embodiments, the antibody prevents heterotrimerization of IL-2R-alpha, IL-2R-beta, and IL-2R-gamma. In certain embodiments, the antibody is capable of binding to both the cis orientation and the trans orientation of CD25.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee. The present application can be understood by reference to the following description taking in conjunction with the accompanying figures.
Provided herein are engineered polypeptides that share structural and/or dynamic identity with a portion of reference CD25 target. Epitopes of interest include but are not limited to the eight epitopes shown in
In some embodiments, the engineered polypeptide provided herein shares at least 4600 structural and/or dynamic identity to a CD25 reference target, wherein the CD25 reference target is a portion of CD25 selected from those listed in the table below. As generally provided herein, the % structural/dynamic identity is the root mean square inner product (RMI5P) identity (as provided herein above)×10000. In some embodiments, the structural identity refers to sequence identity.
In some embodiments, the engineered polypeptide provided herein 8000 sequence identity to an amino-acid sequence selected from:
MVY
CQPDCTAKCMHGCDRDTMKECCDRLK
DD
CPEVPHATFKGPGQKWEGPGGGDCSK
DD
CIEVPGPAECAERACRAQEERQRQPQCI
In some embodiments, the polypeptide shares at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% structural and/or dynamic identity to the CD25 reference target. In some embodiments, polypeptide shares at least 60%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, or at least 95% sequence identity to the CD25 reference target.
In some embodiments, the engineered polypeptide is designed to mimic a selected CD25 epitope. For example, in some embodiments, the polypeptide comprises a meso-scale engineered molecule, e.g. a meso-scale engineered polypeptide. Provided herein are methods of selecting meso-scale engineered polypeptides, and compositions comprising and methods of using said engineered polypeptides. For example, provided herein are methods of using engineered polypeptides in in vitro selection of antibodies.
The engineered polypeptides of the present disclosure are between 1 kDa and 10 kDa, referred to herein as “meso-scale”. Engineered polypeptides of this size may, in some embodiments, have certain advantages, such as protein-like functionality, a large theoretical space from which to select candidates, cell permeability, and/or structural and dynamical variability. The terms meso-scale peptides and meso-scale polypeptides are used interchangeably herein, and the term meso-scale molecules (MEM) is intended to cover these.
The methods provided herein comprise identifying a plurality of spatially-associated topological constraints, some of which may be derived from a CD25 reference target, constructing a combination of said constraints, comparing candidate peptides with said combination, and selecting a candidate that has constraints which overlap with the combination. By using spatially-associated topological constraints, different aspects of an engineered polypeptide can be included in the combination depending on the intended use, or desired function, or another desired characteristic. Further, not all constraints must, in some embodiments, be derived from a CD25 reference target. Through such methods, in some embodiments the selected engineered polypeptides are not simply variations of a CD25 reference target (such as might be obtained through peptide mutagenesis or progressive modification of a single reference), but rather may have a different overall structure than the reference peptide, while still retaining desired functional characteristics and/or key substructures.
Further provided herein are methods of using said engineered polypeptides, which include methods of programmable in vitro selection using one or more engineered polypeptides. Such selection may be used, for example, in the identification of antibodies.
These methods and engineered polypeptides are described in greater detail below.
In some aspects, provided herein are methods of selecting an engineered polypeptide, comprising:
identifying one or more topological characteristics of a CD25 reference target;
designing spatially-associated constraints for each topological characteristic to produce a combination of CD25 reference target-derived constraints;
comparing spatially-associated topological characteristics of candidate peptides with the combination derived from the CD25 reference target; and
selecting a candidate peptide with spatially-associated topological characteristics that overlap with the combination of constraints derived from the CD25 reference target.
In some embodiments, one or more additional spatially-associated topological constraints that are not derived from the CD25 reference target are included in the combination.
a. Spatially-Associated Topological Constraints
The engineered polypeptides described herein are selected based on how closely they match a combination of spatially-associated topological constraints. This combination may also be described using the mathematical concept of a “tensor”. In such a combination (or tensor), each constraint is independently described in three dimensional space (e.g., spatially-associated), and the combination of these constraints in three dimensional space provides, for example, a representational “map” of different desired characteristics and their desired level (if applicable) relative to location. This map is not, in some embodiments, based on a linear or otherwise pre-determined amino acid backbone, and therefore can allow for flexibility in the structures that could fulfill the desired combination, as described. For example, in some embodiments, the “map” includes a spatial area wherein the prescribed constraint limitations could be adequately met by two adjacent amino acids—in some embodiments, these amino acids could be directly bonded (e.g., two contiguous amino acids) while in other embodiments, the amino acids are not directly bonded to each other but could be brought together in space by the folding of the peptide (e.g., are not contiguous amino acids). The separate constraints themselves are also not necessarily based on structure, but could include, for example, chemical descriptors and/or functional descriptors. In some embodiments, constraints include structural descriptors, such as a desired secondary structure or amino acid residue. In certain embodiments, each constraint is independently selected.
For example,
In some embodiments, the combination of constraints comprises at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, between 3 to 12, between 3 to 10, between 3 to 8, between 3 to 6, or 3, or 4, or 5, or 6 independently selected spatially-associated topological constraints. One or more of the constraints is derived from a CD25 reference target. In some embodiments, each of the constraints is derived from the CD25 reference target. In other embodiments, at least one constraint is derived from the CD25 reference target, and the remaining constraints are not derived from the reference target. For example, in some embodiments, between 1 and 9 constraints, between 1 and 7 constraints, between 1 and 5 constraints, or between 1 and 3 constraints are derived from the CD25 reference target, and between 1 and 9 constraints, between 1 and 7 constraints, between 1 and 5 constraints, or between 1 and 3 constraints are not derived from the CD25 reference target.
Once the combination of constraints has been constructed, a series of candidate peptides is compared to said combination to identify one or more new engineered polypeptides which meet the desired criteria. In some embodiments, at least 5, at least 10, at least 15, at least 20, at least 25, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 125, at least 150, at least 175, at least 200, or at least 250 or more candidate peptides are compared to the combination to identify one or more new engineered polypeptides which meet the desired criteria. In some embodiments, more than 250 candidate peptides, more than 300 candidate peptides, more than 400 candidate peptides, more than 500 candidate peptides, more than 600 candidate peptides, or more than 750 candidate peptides are compared, for example. In some embodiments, topological characteristic simulations are used to evaluate the topological characteristic overlap, if any, of a candidate peptide compared to the combination of constraints. In some embodiments, one or more candidate peptides are also compared to the CD25 reference target, and overlap, if any, of candidate peptide topological characteristics with CD25 reference target topological characteristics is evaluated. In some embodiments, the engineered polypeptide is identified from a computational sample of more than 5, more than 10, more than 20, more than 30, more than 40, more than 50, more than 60, more than 70, more than 80, more than 90, or more than 100 distinct peptide and topological characteristic simulations and an engineered polypeptide is selected, wherein the selected engineered polypeptide has the highest topological characteristic overlap compared the CD25 reference target, out of the total sampled population.
The spatially-associated topological constraints used to construct the desired combination (e.g., the desired tensor) may each be independently selected from a wide group of possible characteristics. These may include, for example, constraints describing structural, dynamical, chemical, or functional characteristics, or any combinations thereof.
Structural constraints may include, for example, atomic distance, amino acid sequence similarity, solvent exposure, phi angle, psi angle, secondary structure, or amino acid contact, or any combinations thereof.
Dynamical constraints may include, for example, atomic fluctuation, atomic energy, van der Waals radii, amino acid adjacency, or non-covalent bonding propensity. Atomic energy may include, for example, pairwise attractive energy between two atoms, pairwise repulsive energy between two atoms, atom-level solvation energy, pairwise charged attraction energy between two atoms, pairwise hydrogen bonding attraction energy between two atoms, or non-covalent bonding energy, or any combinations thereof.
Chemical characteristics may include, for example, chemical descriptors. Such chemical descriptors may include, for example, hydrophobicity, polarity, atomic volume, atomic radius, net charge, log P, HPLC retention, van der Waals radii, charge patterns, or H-bonding patterns, or any combinations thereof.
Functional characteristics may include, for example, bioinformatic descriptors, biological responses, or biological functions. Bioinformatic descriptors may include, for example, BLOSUM similarity, pKa, zScale, Cruciani Properties, Kidera Factors, VHSE-scale, ProtFP, MS-WHIM scores, T-scale, ST-scale, Transmembrane tendency, protein buried area, helix propensity, sheet propensity, coil propensity, turn propensity, immunogenic propensity, antibody epitope occurrence, and/or protein interface occurrence, or any combinations thereof.
In some embodiments, designing the constraints incorporates information about per-residue energy, per-residue interaction, per-residue fluctuation, per-residue atomic distance, per-residue chemical descriptor, per-residue solvent exposure, per-residue amino acid sequence similarity, per-residue bioinformatic descriptor, per-residue non-covalent bonding propensity, per-residue phi/psi angles, per-residue van der Waals radii, per-residue secondary structure propensity, per-residue amino acid adjacency, or per-residue amino acid contact. In some embodiments, these characteristics are used for a subset of the total residues in the CD25 reference target, or a subset of the total residues of the total combination of constraints, or a combination thereof. In some embodiments, one or more different characteristics are used for one or more different residues. That is, in some embodiments, one or more characteristics are used for a subset of residues, and at least one different characteristic is used for a different subset of residues. In some embodiments, one or more of said characteristics used to design one or more constraints is determined by computer simulation. Suitable computer simulation methods may include, for example, molecular dynamics simulations, Monte Carlo simulations, coarse-grained simulations, Gaussian network models, machine learning, or any combinations thereof.
In some embodiments multiple constraints are selected from one category. For example, in some embodiments, the combination comprises two or more constraints that are independently a type of biological response. In some embodiments, two or more constraints are independently a type of secondary structure. In certain embodiments, two or more constraints are independently a type of chemical descriptor. In other embodiments, the combination comprises no overlapping categories of constraints.
In some embodiments, one or more constraints is independently associated with a biological response or biological function. In some embodiments, said constraint is a spatially defined atom(s)-level constraint, or spatially defined shape/area/volume-level constraint (such as a characteristic shape/area/volume that can be satisfied by several different atomic compositions), or a spatially defined dynamic-level constraint (such as a characteristic dynamic or set of dynamics that can be satisfied by several different atomic compositions).
In some embodiments, one or more constraints is derived from a protein structure or peptide structure associated with a biological function or biological response. For example, in some embodiments, one or more constraints is derived from an extracellular domain, such as a G protein-coupled receptor (GPCR) extracellular domain, or an ion channel extracellular domain. In some embodiments, one or more constraints is derived from a protein-protein interface junction. In some embodiments, one or more constraints is derived from a protein-peptide interface junction, such as MHC-peptide or GPCR-peptide interfaces. In certain embodiments, the atoms or amino acids constrained to such a protein or peptide structure are atoms or amino acids associated with a biological function or biological response. In some embodiments, the atoms or amino acids in the engineered polypeptide constrained to such a protein or peptide structure are atoms or amino acids derived from a CD25 reference target. In some embodiments, one or more constraints is derived from a polymorphic region of a CD25 reference target (e.g., a region subject to allelic variation between individuals).
In some embodiments, the one or more atoms associated with a biological function or biological response are selected from the group consisting of carbon, oxygen, nitrogen, hydrogen, sulfur, phosphorus, sodium, potassium, zinc, manganese, magnesium, copper, iron, molybdenum, and nickel. In certain embodiments, the atoms are selected from the group consisting of oxygen, nitrogen, sulfur, and hydrogen.
In some embodiments, wherein one of the constraints is one or more amino acids associated with a biological function or biological response, and/or the engineered polypeptide comprises one or more amino acids associated with a biological function or biological response, the one or more amino acids are independently selected from the group consisting of the 20 proteinogenic naturally occurring amino acids, non-proteinogenic naturally occurring amino acids, and non-natural amino acids. In some embodiments, the non-natural amino acids are chemically synthesized. In certain embodiments, the one or more amino acids are selected from the 20 proteinogenic naturally occurring amino acids. In other embodiments, the one or more amino acids are selected from the non-proteinogenic naturally occurring amino acids. In still further embodiments, the one or more amino acids are selected from non-natural amino acids. In still further embodiments, the one or more amino acids are selected from a combination of 20 proteinogenic naturally occurring amino acids, non-proteinogenic naturally occurring amino acids, and non-natural amino acids.
While the combination of constraints used to select an engineered polypeptide as described herein comprises at least one constraint derived from a CD25 reference target, in some embodiments one or more constraints of the combination are not derived from a CD25 reference target. Thus, in certain embodiments, the selected engineered polypeptide comprises one or more characteristics that are not shared with the CD25 reference target.
In some embodiments, one or more constraints derived from the CD25 reference target and used in the combination describes the inverse of the characteristic as observed in the CD25 reference target. Thus, for example, a CD25 reference target may have a certain pattern of positive charge, a constraint related to charge is derived from said CD25 reference target, and the derived constraint describes a similar pattern but of neutral charge, or of negative charge. Thus, in some embodiments one or more inverse constraints are derived from the CD25 reference target and included in the combination. Such inverse constraints may be useful, for example, in selecting engineered polypeptides as control molecules for certain assays or panning methods, or as negative selection molecules in the programmable in vitro selection methods described herein.
In some embodiments, the combination of spatially-defined topological constraints comprises one or more non-reference derived topological constraints. In some embodiments, the one or more non-reference derived topological constraints enforces or stabilizes one or more secondary structural elements, enforces atomic fluctuations, alters peptide total hydrophobicity, alters peptide solubility, alters peptide total charge, enables detection in a labeled or label-free assay, enables detection in an in vitro assay, enables detection in an in vivo assay, enables capture from a complex mixture, enables enzymatic processing, enables cell membrane permeability, enables binding to a secondary target, or alters immunogenicity. In certain embodiments, the one or more non-reference derived topological constraints constrains one or more atoms or amino acids in the combination of constraints (or subsequently selected peptide) that were derived from the CD25 reference target. For example, in some embodiments, the combination of constraints includes a secondary structure that was derived from the CD25 reference target, and the combination of constraints also comprises a constraint that stabilizes the secondary structural element (e.g., through additional hydrogen bonding, or hydrophobic interactions, or side chain stacking, or a salt bridge, or a disulfide bond), wherein the stabilizing constraint is not present in the CD25 reference target. In another example, in some embodiments the combination of constraints (or subsequently selected peptide) comprises one or more atoms or amino acids that was derived from the CD25 reference target, and the combination of constraints also includes a constraint that enforces atomic fluctuations in at least a portion of the atoms or amino acids derived from the target reference, wherein the constraint is not present in the target reference. In some embodiments, one or more non-reference derived constraints is an inverse constraint. For example, in some embodiments, two combinations of constraints are constructed to select engineered polypeptides with inverse characteristics. In some such embodiments, a first combination of constraints will comprise one or more constraints derived from the CD25 reference target, and one or more constraints not derived from the CD25 reference target; and a second combination of constraints will comprise the same one or more constraints derived from the CD25 reference target, and the inverse of one or more of non-CD25 reference target constraints of the first combination.
b. CD25 Reference Target
Any suitable CD25 reference target may be used to derive one or more spatially-associated topological constraints for use in the methods provided herein. In some embodiments, the CD25 reference target is a full-length native protein. In other embodiments, the CD25 reference target is a portion of a full-length native protein. In still further embodiments, the CD25 reference target is a non-native protein, or portion thereof.
In some embodiments, a CD25 reference target is selected from:
For example, in some embodiments, the CD25 reference target is a portion of CD25, such as an epitope or a predicted epitope. In some embodiments, the methods provided herein may be used to select one or more engineered polypeptides that are immunogens, and which may be used to raise one or more antibodies that specifically bind to the protein from which the target reference is derived. In still further embodiments, the methods provided herein may be used to select one or more engineered polypeptides which in turn may be used to select one or more binding partners of a protein of interest, such as an antibody, a Fab-displaying phage, or an scFv-displaying phage.
c. Comparison of Constraints
In some embodiments, the one or more constraints (e.g., reference-derived or non-reference derived) are determined by molecular simulation (e.g. molecular dynamics), or laboratory measurement (e.g. NMR), or a combination thereof. Once the constraints have been derived and combined, engineered polypeptide candidates are, in some embodiments, generated using a computational protein design (e.g., Rosetta). In some embodiments, other methods of sampling peptide space are used. Dynamics simulations may then be carried out on the candidate engineered polypeptides to obtain the parameters of constraints that have been selected. A covariance matrix of atomic fluctuations is generated for the CD25 reference target, covariance matrices are generated for the residues in each of the candidate engineered polypeptides, and these covariance matrices are compared to determine overlap. Principal component analysis is performed to compute the eigenvectors and eigenvalues for each covariance matrix—one covariance matrix for the CD25 reference target and one covariance for each of the candidate engineered polypeptides—and those eigenvectors with the largest eigenvalues are retained.
The eigenvectors describe the most, second-most, third-most, N-most dominant motion observed in a set of simulated molecular structures. Without wishing to be bound by any theory, if a candidate engineered polypeptide moves like the CD25 reference target, its eigenvectors will be similar to the eigenvectors of the CD25 reference target. The similarity of eigenvectors corresponds to their components (a 3D vector centered on each CA atom) being aligned, pointing in the same direction.
In some embodiments, this similarity between candidate engineered polypeptide and CD25 reference target eigenvectors is computed using the inner product of two eigenvectors. The inner product value is 0 if two eigenvectors are 90 degrees to each other or 1 if the two eigenvectors point precisely in the same direction. Without wishing to be bound by theory, since the ordering of eigenvectors is based on their eigenvalues, and eigenvalues may not necessarily be the same between two different molecules due to the stochastic nature by which molecular dynamics (MD) simulations sample the underlying energy landscape of those different molecules, the inner product between multiple, differentially ranked eigenvectors is, in some embodiments, needed (e.g. eigenvector 1 of the engineered polypeptide by eigenvector 2, 3, 4, etc. of the CD25 reference target). In addition, molecular motions are complex and may involve more than one (or more than a few) dominant/principal modes of motion. Thus, in some embodiments, the inner product between all pairs of eigenvectors in a candidate engineered polypeptide and the CD25 reference target are computed. This results in a matrix of inner products the dimensions of which are determined by the number of eigenvectors analyzed. For example, for 10 eigenvectors, the matrix of inner products is 10 by 10. This matrix of inner products can be distilled into a single value by computing the root mean-square value of the 100 (if 10 by 10) inner products. This is the root mean square inner product (RMSIP). From this comparison, one or more candidate engineered polypeptides that have similarity with the defined combination of constraints are selected.
d. Additional Steps
In some embodiments, selection of one or more engineered polypeptides comprises one or more additional steps. For example, in some embodiments an engineered polypeptide candidate is selected based on similarity to the defined combination of spatially-associated topological constraints, as described herein, and then undergoes one or more analyses to determine one or more additional characteristics, and one or more structural adjustments to impart or enforce said desired characteristics. For example, in some embodiments, the selected candidate is analyzed, such as through molecule dynamics simulations, to determine overall stability of the molecule and/or propensity for a particular folded structure. In some embodiments, one or more modifications are made to the engineered polypeptide to impart or reinforce a desired level of stability, or a desired propensity for a desired folded structure. Such modifications may include, for example, the installation of one or more cross-links (such as a disulfide bond), salt bridges, hydrogen bonding interactions, or hydrophobic interactions, or any combinations thereof.
The methods provided herein may further comprise assaying one or more selected engineered polypeptides for one or more desired characteristics, such as desired binding interactions or activity. Any suitable assay may be used, as appropriate to measure the desired characteristic.
In other aspects, provided herein are engineered polypeptides, such as engineered polypeptides selected through the methods described herein. In some embodiments, the engineered polypeptide has a molecular mass between 1 kDa and 10 kDa, and comprises up to 50 amino acids. In certain embodiments, the engineered polypeptide has a molecular mass between 2 kDa and 10 kDa, between 2 kDa and 10 kDa, between 3 kDa and 10 kDa, between 4 kDa and 10 kDa, between 5 kDa and 10 kDa, between 6 kDa and 10 kDa, between 7 kDa and 10 kDa, between 8 kDa and 10 kDa, between 9 kDa and 10 kDa, between 1 kDa and 9 kDa, between 1 kDa and 8 kDa, between 1 kDa and 7 kDa, between 1 kDa and 6 kDa, between 1 kDa and 5 kDa, between 1 kDa and 4 kDa, between 1 kDa and 3 kDa, or between 1 kDa and 2 kDa. In certain embodiments, the engineered polypeptide comprises up to 45 amino acids, up to 40 amino acids, up to 35 amino acids, up to 30 amino acids, up to 25 amino acids, up to 20 amino acids, at least 5 amino acids, at least 10 amino acids, at least 15 amino acids, at least 20 amino acids, at least 25 amino acids, at least 30 amino acids, at least 35 amino acids, or at least 40 amino acids.
In certain embodiments, the engineered polypeptide comprises a combination of spatially-associated topological constraints, wherein one or more of the constraints is a CD25 reference target-derived constraint. Any constraints described herein may be used in the combination, in some embodiments. In still further embodiments, between 10% to 98% of the amino acids of the engineered polypeptide meet the one or more CD25 reference target-derived constraints (e.g., if the engineered polypeptide comprises 50 amino acids, between 5 to 49 amino acids meet the one or more CD25 reference target-derived constraints). In some embodiments, between 20% to 98%, between 30% to 98%, between 40% to 98%, between 50% to 98%, between 60% to 98%, between 70% to 98%, between 80% to 98%, between 90% to 98%, between 10% to 90%, between 10% to 80%, between 10% to 70%, between 10% to 60%, between 10% to 50%, between 10% to 40%, between 10% to 30%, or between 10% to 20% of the amino acids of the engineered polypeptide meet the one or more CD25 reference target-derived constraints. In still further embodiments, the one or more amino acids that meet the one or more CD25 reference target-derived constraints have less than 8.0 Å, less than 7.5 Å, less than 7.0 Å, less than 6.5 Å, less than 6.0 Å, less than 5.5 Å, or less than 5.0 Å backbone root-mean-square deviation (RSMD) structural homology with the CD25 reference target. In some embodiments, the engineered polypeptide has a molecular mass of between 1 kDa and 10 kDa; comprises up to 50 amino acids; a combination of spatially-associated topological constraints, wherein one or more of the constraints is a CD25 reference target-derived constraint; between 10% to 98% of the amino acids of the engineered polypeptide meet the one or more CD25 reference target-derived constraints; and the amino acids that meet the one or more CD25 reference target-derived constraints have less than 8.0 Å backbone root-mean-square deviation (RSMD) structural homology with the CD25 reference target.
In some embodiments, the amino acids of the engineered polypeptide that meet the one or more CD25 reference target-derived constraints have between 10% and 90% sequence homology, between 20% and 90% sequence homology, between 30% and 90% sequence homology, between 40% and 90% sequence homology, between 50% and 90% sequence homology, between 60% and 90% sequence homology, between 70% and 90% sequence homology, or between 80% and 90% sequence homology with the CD25 reference target. In some embodiments, the amino acids that meet the one or more CD25 reference target-derived constraints have a van der Waals surface area overlap with the reference of between 30 Å2 to 3000 Å2, or between 100 Å2 to 3000 Å2, or between 250 Å2 to 3000 Å2, or between 500 Å2 to 3000 Å2, or between 750 Å2 to 3000 Å2, or between 1000 Å2 to 3000 Å2, or between 1250 Å2 to 3000 Å2, or between 1500 Å2 to 3000 Å2, or between 1750 Å2 to 3000 Å2, or between 2000 Å2 to 3000 Å2, or between 2250 Å2 to 3000 Å2, or between 2500 Å2 to 3000 Å2, or between 2750 Å2 to 3000 Å2.
The combination of constraints that the engineered polypeptide meets may comprise two or more, three or more, four or more, five or more, six or more, or seven or more CD25 reference target-derived constraints. The combination may comprise one or more constraints not derived from the CD25 reference target, as described elsewhere in the present disclosure. These reference-derived constraints, and non-reference derived constraints if present, may independently be any of the constraints described herein, such as any of the structural, dynamical, chemical, or functional characteristics described herein, or any combinations thereof.
In some embodiments, the engineered polypeptide comprises at least one structural difference when compared to the CD25 reference target. Such structural differences may include, for example, a difference in the sequence, number of amino acid residues, total number of atoms, total hydrophilicity, total hydrophobicity, total positive charge, total negative charge, one or more secondary structures, shape factor, Zernike descriptors, van der Waals surface, structure graph nodes and edges, volumetric surface, electrostatic potential surface, hydrophobic potential surface, local diameter, local surface features, skeleton model, charge density, hydrophilic density, surface to volume ratio, amphiphilicity density, or surface roughness, or any combinations thereof. In some embodiments, the difference in one or more characteristics (such as one or more characteristics described herein) is at least 10%, at least 20%, at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 100%, or greater than 100% when compared to the characteristic in the CD25 reference target, as applicable to the type of characteristic. For example, in some embodiments the difference is the total number of atoms, and the engineered polypeptide has at least 10%, at least 20%, or at least 30% more atoms than the CD25 reference target, or at least 10%, at least 20%, or at least 30% fewer atoms than the CD25 reference target. In some embodiments, the difference is in total positive charge, and the total positive charge of the engineered polypeptide is at least 10%, at least 20%, at least 30%, at least 40%, or at least 50% larger (e.g., more positive) than the CD25 reference target, while in other embodiments the total positive charge of the engineered polypeptide is at least 10%, at least 20%, at least 30%, at least 40%, or at least 50% smaller (e.g., less positive) than the CD25 reference target.
In some embodiments, the combination of spatially-defined topological constraints includes one or more secondary structural elements not present in the CD25 reference target. Thus, in some embodiments, the engineered polypeptide comprises one or more secondary structural elements that are not present in the CD25 reference target. In some embodiments, the combination and/or engineered polypeptide comprises one secondary structural element, two secondary structural elements, three secondary structural elements, four secondary structural elements, or more than four secondary structural elements not found in the CD25 reference target. In some embodiments, each secondary structural element is independently selected form the group consisting of helices, sheets, loops, turns, and coils. In some embodiments, each secondary structural element not present in the CD25 reference target is independently an α-helix, β-bridge, β-strand, 310 helix, π-helix, turn, loop, or coil.
In certain embodiments, the CD25 reference target comprises one or more atoms associated with a biological response or a biological function (such as one described herein); the engineered polypeptide comprises one or more atoms associated with a biological response or a biological function (such as one described herein); and the atomic fluctuations of said atoms in the engineered polypeptide overlap with the atomic fluctuations of said atoms in the CD25 reference target. Thus, for example, in some embodiments the atoms themselves are different atoms, but their atomic fluctuations overlap. In other embodiments, the atoms are the same atoms, and their atomic fluctuations overlap. In still further embodiments, the atoms are independently the same or different. In some embodiments, the overlap is a root mean square inner product (RMSIP) greater than 0.25. In some embodiments, the overlap is a RMSIP greater than 0.3, greater than 0.35, greater than 0.4, greater than 0.45, greater than 0.5, greater than 0.55, greater than 0.6, greater than 0.65, greater than 0.7, greater than 0.75, greater than 0.8, greater than 0.85, greater than 0.9, or greater than 0.95. In certain embodiments, the RMSIP is calculated by:
where n is the eigenvector of the engineered polypeptide topological constraints, and v is the eigenvector of the CD25 reference target topological constraints.
In some embodiments, the engineered polypeptide comprises atoms or amino acids (or combination thereof) associated with a biological response or biological function, and at least a portion of said atoms or amino acids or combination is derived from a CD25 reference target, and certain constraints of the set of atoms or amino acids in the engineered polypeptide and the set in the CD25 reference target can be described by a matrix. In some embodiments, the matrix is an L×L matrix. In other embodiments, the matrix is an S×S×M matrix. In still further embodiments, the matrix is an L×2 phi/psi angle matrix
For example in some embodiments, the atomic fluctuations of the atoms or amino acids in the engineered polypeptide that are associated with a biological response or biological function are described by an L×L matrix; a portion of said atoms or amino acids are derived from the CD25 reference target; and the atomic fluctuations in the CD25 reference target of said portion are described by an L×L matrix. In some embodiments, the adjacency of each set (related to amino acid location) is described by corresponding L×L matrices. In certain embodiments, the mean percentage error (MPE) across all matrix elements (i, j) of the engineered polypeptide L×L atomic fluctuation or adjacency matrix is less than or equal to 75% relative to the corresponding (i, j) elements in the CD25 reference target atomic fluctuation or adjacency matrix, for the fraction of the engineered polypeptide derived from the CD25 reference target. In some embodiments, the MPE is less than 70%, less than 65%, less than 60%, less than 55%, less than 50%, less than 45%, or less than 40% relative to the corresponding elements in the CD25 reference target matrix, for the fraction of the engineered polypeptide derived from the CD25 reference target. In some embodiments, wherein the matrices represent atomic fluctuations, L is the number of amino acid positions and the (i, j) value in the atomic fluctuation matrix element is the sum of intra-molecular atomic fluctuations for the ith and jth amino acid respectively if the (i, j) atomic distance is less than or equal to 7 Å, or zero if the (i, j) atomic distance is greater than 7 Å or if (i, j) is on the diagonal. Alternatively, in some embodiments the atomic distance can serve as a weighting factor for the atomic fluctuation matrix element (i, j) instead of a 0 or 1 multiplier. In certain embodiments, the ith and jth atomic fluctuations and distances can be determined by molecular simulation (e.g. molecular dynamics) and/or laboratory measurement (e.g. NMR). In other embodiments, wherein the matrices represent adjacency, L is the number of amino acid positions and the value in adjacency matrix element (i, j) is the intra-molecular atomic distance between the ith and jth amino acid respectively if the atomic distance is less than or equal to 7 Å, or zero if the atomic distance is greater than 7 Å or if (i, j) is on the diagonal. Alternatively, in some embodiments the atomic distance can serve as a weighting factor for the adjacency matrix element (i, j) instead of a 0 or 1 multiplier. In certain embodiments, the ith and jth atomic distances could be determined by molecular simulation (e.g. molecular dynamics) and/or laboratory measurement (e.g. NMR).
In certain embodiments, the atoms or amino acids associated with a response or function in the engineered polypeptide have a topological constraint chemical descriptor vector and a mean percentage error (MPE) less than 75% relative to the reference described by the same chemical descriptor, for the fraction of the engineered polypeptide derived from the CD25 reference target, wherein each ith element in the chemical descriptor vector corresponds to an amino acid position index. In some embodiments, the MPE is less than 70%, less than 65%, less than 60%, less than 55%, less than 50%, less than 45%, or less than 40% relative to the reference described by the same chemical descriptor, for the fraction of the engineered polypeptide derived from the CD25 reference target.
In still further embodiments, the matrix is an L×2 phi/psi angel matrix, and the atoms or amino acids associated with a response or function in the engineered polypeptide have an MPE less than 75% with respect to the reference phi/psi angles matrix in the fraction of the engineered polypeptide derived from the reference target, wherein L is the number of amino acid positions and phi, psi values are in dimensions (L,1) and (L,2) respectively. In some embodiments, the MPE is less than 70%, less than 65%, less than 60%, less than 55%, less than 50%, less than 45%, or less than 40% with respect to the reference phi/psi angles matrix in the fraction of the engineered polypeptide derived from the reference target. In some embodiments, the phi/psi values are determined by molecular simulation (e.g. molecular dynamics), knowledge-based structure prediction, or laboratory measurement (e.g. NMR).
In some embodiments, the matrix is an S×S×M secondary structural element interaction matrix, and the atoms or amino acids associated with a response or function in the engineered polypeptide have less than 75% mean percentage error (MPE) relative to the reference secondary structural element relationship matrix, in the fraction of the engineered polypeptide derived from the reference target, where S is the number of secondary structural elements and M is the number of interaction descriptors. In some embodiments, the MPE is less than 70%, less than 65%, less than 60%, less than 55%, less than 50%, less than 45%, or less than 40% relative to the reference secondary structural element relationship matrix, in the fraction of the engineered polypeptide derived from the reference target. Interaction descriptors may include, for example, hydrogen bonding, hydrophobic packing, van der Waals interaction, ionic interaction, covalent bridge, chirality, orientation, or distance, or any combinations thereof. In the secondary structural element interaction matrix index, (i, j, m)=mth interaction descriptor value between the i‘ and j’ secondary structural elements.
Mean Percentage Error (MPE) for different matrices as described herein may be calculated by:
where n is the topological constraint vector or matrix position index for the engineered polypeptide (engn) and the corresponding reference (refn), summed up to vector or matrix position n.
In some embodiments, the engineered polypeptide has an MPE of less than 75% compared to the CD25 reference target. In certain embodiments, the engineered polypeptide has an MPE of less than 70%, less than 65%, less than 60%, less than 55%, less than 50%, less than 45%, or less than 40% compared to the CD25 reference target. In some embodiments, the MPE is determined by Total Topological Constraint Distance (TCD), topological clustering coefficient (TCC), Euclidean distance, power distance, Soergel distance, Canberra distance, Sorensen distance, Jaccard distance, Mahalanobis distance, Hamming distance, Quantitative Estimate of Likeness (QEL), or Chain Topology Parameter (CTP).
e. Secondary Structural Element
In some embodiments, at least a portion of the engineered polypeptide is topologically constrained to one or more secondary structural elements. In some embodiments, the atoms or amino acids associated with a biological response or biological function in the engineered polypeptide are topologically constrained to one or more secondary structural elements. In some embodiments, the secondary structural element is independently a sheet, helix, turn, loop, or coil. In some embodiments, the secondary structural element is independently an α-helix, β-bridge, β-strand, 310 helix, π-helix, turn, loop, or coil. In certain embodiments, one or more of the secondary structural elements to which at least a portion of the engineered polypeptide is topologically constrained is present in the CD25 reference target. In some embodiments, at least a portion of the engineered polypeptide is topologically constrained to a combination of secondary structural elements, wherein each element is independently selected from the group consisting of sheet, helix, turn, loop, and coil. In still further embodiments, each element is independently selected from the group consisting of an α-helix, β-bridge, β-strand, 310 helix, π-helix, turn, loop, and coil.
In some embodiments, the secondary structural element is a parallel or anti-parallel sheet. In some embodiments, a sheet secondary structure comprises greater than or equal to 2 residues. In some embodiments, a sheet secondary structure comprises less than or equal to 50 residues. In still further embodiments, a sheet secondary structure comprises between 2 and 50 residues. Sheets can be parallel or anti-parallel. In some embodiments, a parallel sheet secondary structure may be described as having two strands i, j in a parallel (N-termini of i and j strands opposing orientation), and a pattern of hydrogen bonding of residues i:j. In some embodiments, an anti-parallel sheet secondary structure may also be described as having two strands i, j in an anti-parallel (N-termini of i and j strands same orientation), and a pattern of hydrogen bonding of residues i:j−1, i:j+1. In certain embodiments, the orientation and hydrogen bonding of strands can be determined by knowledge-based or molecular dynamics simulation and/or laboratory measurement.
In some embodiments, the secondary structural element is a helix. Helices may be right or left handed. In some embodiments, the helix has a residue per turn (residues/turn) value of between 2.5 and 6.0, and a pitch between 3.0 Å and 9.0 Å. In some embodiments, the residues/turn and pitch are determined by knowledge-based or molecular dynamics simulation and/or laboratory measurement.
In some embodiments, the secondary structural element is a turn. In some embodiments, a turn comprises between 2 to 7 residues, and 1 or more inter-residue hydrogen bonds. In some embodiments, the turn comprises 2, 3, or 4 inter-residue hydrogen bonds. In certain embodiments, the turn is determined by knowledge-based or molecular dynamics simulation and/or laboratory measurement.
In still further embodiments, the secondary structural element is a coil. In certain embodiments, the coil comprises between 2 to 20 residues and zero predicted inter-residue hydrogen bonds. In some embodiments, these coil parameters are determined by knowledge-based or molecular dynamics simulation and/or laboratory measurement.
In still further embodiments, the engineered polypeptide comprises one or more atoms or amino acids derived from the CD25 reference target, wherein said atoms or amino acids have a secondary structure. In some embodiments, these atoms or amino acids are associated with a biological response or biological function. In some embodiments, the secondary structure motif vector of the atoms or amino acids in the engineered polypeptide has a cosine similarity greater than 0.25 relative to the CD25 reference target secondary structure motif vector for the fraction of the engineered polypeptide derived from the CD25 reference target, wherein the length of the vector is the number of secondary structure motifs and the value at the ith vector position defines the identity of the secondary structure motif (e.g. helix, sheet) derived from a lookup table. In some embodiments, each motif comprises two or more amino acids. In certain embodiments, motifs include, for example, α-helix, β-bridge, β-strand, 310 helix, π-helix, turn, and loop. In some embodiments, the cosine similarity is greater than 0.3, greater than 0.35, greater than 0.4, greater than 0.45, or greater than 0.5 relative to the CD25 reference target secondary structure motif vector for the fraction of the engineered polypeptide derived from the CD25 reference target. Cosine similarity may be calculated by:
wherein A is the peptide vector of secondary structure motif identifiers, B is the reference vector of secondary structure motif identifiers, n is the length of the secondary structure motif vector, and i is the ith secondary structure motif.
In some embodiments, one or more atoms or amino acids of the engineered polypeptide which are derived from the CD25 reference target can be compared to the corresponding CD25 reference target atoms or amino acids using a total topological constraint distance (TCD). In some embodiments, the total TCD of said engineered polypeptide atoms or amino acids derived from the CD25 reference target is +/−75% relative to the TCD distance of the corresponding atoms in the CD25 reference target, wherein two intra-molecule topological constraints are interacting if their pairwise distance is less than or equal to 7 Å. In some embodiments, the atoms or amino acids in the engineered polypeptide being compared are associated with a biological function or biological response. The ith, jth pairwise distance of two atoms or amino acids can, in some embodiments, be determined by molecular simulation (e.g. molecular dynamics) and/or laboratory measurement (e.g. NMR). An exemplary equation for calculating total topological constraint distance (TCD) is:
where i, j are the intra-molecular position indices for amino acids (i, j), Sij is the difference between constraints S(i) and S(j), A(i,j)=1 if amino acids (i, j) are within the 7 Å interaction threshold, and L is the number of amino acid positions in the peptide or the corresponding CD25 reference target. Alternatively, in some embodiments, A(i,j) can serve as a weighting factor for the Sij difference instead of a 0 or 1 multiplier.
In some embodiments, one or more atoms or amino acids of the engineered polypeptide which are derived from the CD25 reference target can be compared to the corresponding CD25 reference target atoms or amino acids using a chain topology parameter (CTP). In some embodiments, the CTP of said engineered polypeptide atoms or amino acids is +/−50% relative to the CTP of the corresponding atoms or amino acids in the CD25 reference target, wherein intra-chain topological interaction is a pairwise distance less than or equal to 7 Å. In some embodiments, the atoms or amino acids in the engineered polypeptide being compared are associated with a biological function or biological response. In some embodiments, ith, jth pairwise distance can be determined by molecular simulation (e.g. molecular dynamics) and/or laboratory measurement (e.g. NMR). An exemplary equation for evaluating CTP is:
where i, j are the position indices for amino acids (i, j), Sij is the difference between topological constraints S(i) and S(j), A(i,j)=1 if amino acids (i, j) are within the 7 Å chain topological interaction threshold, L is the number of amino acid positions in the peptide or the corresponding CD25 reference target, and N is the total number of intra-chain contacts that meet the 7 Å topological interaction threshold in the engineered polypeptide or CD25 reference target. Alternatively, in some embodiments A(i,j) can serve as a weighting factor for the Sij difference instead of a 0 or 1 multiplier.
In some embodiments, one or more atoms or amino acids of the engineered polypeptide which are derived from the CD25 reference target can be compared to the corresponding CD25 reference target atoms or amino acids using a quantitative estimate of likeness (QEL). In some embodiments, the QEL of said engineered polypeptide atoms or amino acids is +/−50% relative to the QEL of the corresponding atoms or amino acids in the CD25 reference target. In some embodiments, the atoms or amino acids in the engineered polypeptide being compared are associated with a biological function or biological response. An exemplary equation for determining QEL is:
wherein di is a topological constraint for the ith amino acid or atom position, or a composition function (e.g. linear regression function) that combines multiple topological constraints for the ith amino acid or atom position, and n is the number of amino acid or atom positions in the peptide or the CD25 reference target.
In some embodiments, one or more atoms or amino acids of the engineered polypeptide which are derived from the CD25 reference target can be compared to the corresponding CD25 reference target atoms or amino acids using a topological clustering coefficient (TCC) vector and a mean percentage error (MPE). In some embodiments, the TCC vector and MPE is less than 75% relative to the TCC of the corresponding atoms or amino acids in the CD25 reference target, wherein each element (i) of the vector is a topological clustering coefficient for the ith amino acid position, intra-molecule clusters are defined by an interacting edge distance less than or equal to 7 Å, and two edges: i−j, j−1 from the ith amino acid position. In some embodiments, the atoms or amino acids in the engineered polypeptide being compared are associated with a biological function or biological response. In some embodiments, the ith, jth and 1th edge distance can be determined by molecular simulation (e.g. molecular dynamics) and/or laboratory measurement (e.g. NMR). An exemplary equation for evaluating the topological clustering coefficient for the ith position is:
wherein Δ(i,j)=1, Δ(i,1)=1, Δ(j,1)=1 if intra-molecular amino acid positions: (i, j), (i, 1), (j, 1) are within the 7 Å interacting edge threshold respectively, Sip is the combination (e.g. sum) of topological constraints for the ith, jth and 1th amino acid, L is the number of amino acid positions in the peptide vector or corresponding CD25 reference target vector, No is the number of intra-molecular interacting amino acid positions for the ith amino acid, meeting the 7 Å edge threshold and two edges: i−j, j−1 from the ith amino acid. Alternatively, in some embodiments, Δ(i,j), Δ(i,1) and Δ(j,1) can serve as weighting factors for the clustering coefficient vector element (i) instead of a 0 or 1 multiplier.
In still further embodiments, one or more atoms or amino acids of the engineered polypeptide which are derived from the CD25 reference target can be compared to the corresponding CD25 reference target atoms or amino acids using an L×M topological constraint matrix and mean percentage error (MPE) of: Euclidean distance, power distance, Soergel distance, Canberra distance, Sorensen distance, Jaccard distance, Mahalanobis distance, or Hamming distance across all M-dimensions. The L×M matrix element (1, m) contains the mth constraint value for the lth amino acid position, wherein L is the number of amino acid positions and M is the number of distinct topological constraints. In some embodiments, the MPE of the engineered polypeptide L×M matrix is less than 75% relative to the matrix of the corresponding CD25 reference target atoms or amino acids. In some embodiments, the MPE is less than 70%, less than 65%, less than 60%, less than 55%, less than 50%, or less than 45%. In some embodiments, the atoms or amino acids in the engineered polypeptide being compared are associated with a biological function or biological response.
In other aspects, further provided herein are methods of using the engineered polypeptides described herein in selecting binding partners using a series of programmed selection steps, wherein at least one selection step includes evaluating the interactions of a pool of potential binding partners with an engineered polypeptide.
In some embodiments, provided herein are methods of steering the selection of a binding molecule using two or more selection molecules. In some embodiments, the methods include subjecting a pool of candidate binding molecules to at least one round of selection, wherein each round comprises at least one negative selection step wherein at least a portion of the pool is screened against a negative selection molecule, and at least one positive selection step wherein at least a portion of the pool is screened against a positive selection molecule. In some embodiments the method comprises at least two rounds, at least three rounds, at least four rounds, at least five rounds, at least six rounds, at least seven rounds, at least eight rounds, at least nine rounds, at least ten rounds, or more, wherein each round independently comprises at least one negative selection step and at least one positive selection step. In some embodiments, each round independently comprises more than one negative selection step, or more than one positive selection step, or a combination thereof.
In some embodiments wherein the method comprises more than one round, each negative and positive selection molecule is independently chosen. In other embodiments, the same negative selection molecule, or the same positive selection molecule, or a combination thereof, may be used in more than one round. For example, in
Such methods of selection use positive (+) and negative (−) steps to steer the library of candidate binding molecules towards and away from certain desired characteristics, such as binding specificity or binding affinity. By using multiple steps with both positive and negative selection molecules, the pool of candidates can be directed in a stepwise manner to select for characteristics that are desirable and against characteristics that are undesirable. Further, in some embodiments the order of each step within each round, and the order of the rounds relative to each other can direct the selection in different directions. Thus, for example, in some embodiments a method comprising one round with (+) selection followed by (−) selection will result in a different final pool of candidates than if (−) selection is first, followed by (+) selection. Extrapolating this out to methods comprising multiple rounds, the order of selection steps may result in a different final pool of selected candidates even if the same positive and negative selection molecules are used overall.
In some embodiments a selection molecule is used that has in inverse characteristic of another selection molecule. This may be useful, for example, to ensure that the candidate binding partners identified using the positive selection molecule (or excluded because of a negative selection molecule) were identified (or excluded) because of a desired trait (or undesired trait), not because of a separate, unrelated binding interaction. To remove binding partners that are binding through unrelated interactions, an inverse selection molecule can be used that has similar or the same structure and characteristics as the selection molecule, except for the residues/structures conveying the desired trait (or undesired trait). For example, if interaction with a particular charge pattern in a positive selection molecule is desired, an inverse negative selection molecule may be used that has replaced the residues providing that charge pattern with uncharged residues, and/or residues of the opposite charge. Thus, for certain selection molecules, multiple different corresponding inverse selection molecules may be possible.
In the selection methods provided herein, at least one of the selection molecules is an engineered polypeptide as described herein. In some embodiments, more than one engineered polypeptide is used. In some embodiments, each engineered polypeptide is independently a positive or negative selection molecule. In certain embodiments, each selection molecule used in the one or more rounds of selection is independently an engineered polypeptide. In other embodiments, at least one molecule that is not an engineered polypeptide is used as a selection molecule. Such selection molecules that are not engineered polypeptides may comprise, for example, a naturally-occurring polypeptide, or a portion thereof. In other embodiments, one or more selection molecules that are not engineered polypeptides may comprise, for example, a non-naturally occurring polypeptide or portion thereof. For example, in some embodiments one or more selection molecules (e.g., positive selection molecule or negative selection molecule) is an immunogen, an antibody, cell-surface receptor, or a transmembrane protein, or a signaling protein, or a multiprotein complex, or a peptide-protein complex, or any portions thereof, or any combinations thereof. In some embodiments, one or more selection molecules is CD25 or a portion of any of CD25.
The positive and negative characteristics being selected for or against in each step may be selected from a variety of traits, and may be tailored depending on the desired features of the final one or more binding molecules obtained. Such desired features may depend, for example, on the intended use of the one or more binding molecules. For example, in some embodiments the methods provided herein are used to screen antibody candidates for one or more positive characteristics such as high specificity, and against one or more negative characteristics such as cross-reactivity. It should be understood that what is considered a positive characteristic in one context might be a negative characteristic in another context, and vice versa. Thus, a positive selection molecule in one series of selection rounds may, in some embodiments, be a negative selection molecule in a different series of selection rounds, or in selecting a different type of binding molecule, or in selecting the same type of binding molecule but for a different purpose.
In some embodiments, each selection characteristic is independently selected from the group consisting of amino acid sequence, polypeptide secondary structure, molecular dynamics, chemical features, biological function, immunogenicity, CD25 reference target(s) multi-specificity, cross-species CD25 reference target reactivity, selectivity of desired CD25 reference target(s) over undesired reference target(s), selectivity of reference target(s) within a sequence and/or structurally homologous family, selectivity of reference target(s) with similar protein function, selectivity of distinct desired reference target(s) from a larger family of undesired targets with high sequence and/or structurally homology, selectivity for distinct reference target alleles or mutations, selectivity for distinct reference target residue level chemical modifications, selectivity for cell type, selectivity for tissue type, selectivity for tissue environment, tolerance to reference target(s) structural diversity, tolerance to reference target(s) sequence diversity, and tolerance to reference target(s) dynamics diversity. In some embodiments, each selection characteristic is a different type of selection characteristic. In other embodiments, two or more selection characteristics are different characteristics but of the same type. For example, in some embodiments, two or more selection characteristics are polypeptide secondary structure, wherein one is a positive selection for a desired polypeptide secondary structure and one is a negative selection for an undesired polypeptide secondary structure. In some embodiments, two or more selection characteristics are selectivity for cell type, wherein a positive selection characteristic is selectivity for a specific desired cell type, and a negative selection characteristic is selectivity for a specific undesired cell type. In some embodiments, two or more, three or more, four or more, five or more, or six or more selection characteristics are of the same type.
In some embodiments, the selection characteristic is binding to an engineered polypeptide of the disclosure. For example, the engineered polypeptides shown in
In yet another aspect, provided herein is a composition comprising two or more selection steering polypeptides, wherein each polypeptide is independently a positive selection molecule comprising one or more positive steering characteristics, or a negative selection molecule comprising one or more negative steering characteristics. Such characteristics may, in some embodiments, be selected from the group consisting of amino acid sequence, polypeptide secondary structure, molecular dynamics, chemical features, biological function, immunogenicity, reference target(s) multi-specificity, cross-species reference target reactivity, selectivity of desired reference target(s) over undesired reference target(s), selectivity of reference target(s) within a sequence and/or structurally homologous family, selectivity of reference target(s) with similar protein function, selectivity of distinct desired reference target(s) from a larger family of undesired targets with high sequence and/or structurally homology, selectivity for distinct reference target alleles or mutations, selectivity for distinct reference target residue level chemical modifications, selectivity for cell type, selectivity for tissue type, selectivity for tissue environment, tolerance to reference target(s) structural diversity, tolerance to reference target(s) sequence diversity, and tolerance to reference target(s) dynamics diversity.
Thus, in further aspects, provided herein is a method of screening a library of binding molecules with a selection steering composition as described herein, wherein each round of selection comprises: a negative selection step of screening at least a portion of the pool against a negative selection molecule; and a positive selection step of screening at least a portion of the pool for a positive selection molecule; wherein the order of selection steps within each round, and the order of rounds, result in the selection of a different subset of the pool than an alternative order.
In some embodiments, the binding partners being evaluated using the composition of selection steering polypeptides as described herein, or the methods of screening as described herein, are a phage library, for example a Fab-containing phage library; or a cell library, for example a B-cell library or a T-cell library.
In some embodiments of the methods of screening provided herein, the methods comprise two or more, three or more, four or more, five or more, six or more, or seven or more rounds of selection. In some embodiments, wherein there is more than one round, each round comprises a different set of selection molecules. In other embodiments, wherein there is more than one round, at least two rounds comprise the same negative selection molecule, the same positive selection molecule, or both.
In some embodiments of the screening methods, the method comprises analyzing the subset of the pool prior to proceeding to the next round of selection. In certain embodiments, each subset pool analysis is independently selected from the group consisting of peptide/protein biosensor binding, peptide/protein ELISA, peptide library binding, cell extract binding, cell surface binding, cell activity assay, cell proliferation assay, cell death assay, enzyme activity assay, gene expression profile, protein modification assay, Western blot, and immunohistochemistry. In some embodiments, gene expression profile comprises full sequence repertoire analysis of the subset pool, such as next-generation sequencing. In some embodiments, statistical and/or informatic scoring, or machine learning training is used to evaluate one or more subsets of the pool in one or more selection rounds.
In some embodiments, the identity and/or order of positive and/or negative selection molecules for a subsequent round is determined by analyzing a subset pool from one selection round. In some embodiments, statistical and/or informatic scoring, or machine learning training, is used to evaluate one or more subsets of the pool in one or more selection rounds to determine the identity and/or order of the positive and/or negative selection molecules for a subsequent round (such as the next round, or a round further along in the program).
In still further embodiments, the methods of selection include modifying a subset pool obtained from a selection round before proceeding to the next selection round. Such modifications may include, for example, genetic mutation of the subset pool, genetic depletion of the subset pool (e.g., selecting a subset of the subset pool to move forward in selection), genetic enrichment of the subset pool (e.g., increasing the size of the pool), chemical modification of at least a portion of the subset pool, or enzymatic modification of at least a portion of the subset pool, or any combinations thereof. In some embodiments, statistical and/or informatic scoring, or machine learning training is used to evaluate a subset pool and determine the one or more modifications to make prior to moving the modified subset pool forward in selection. In certain embodiments, such statistical and/or informatic scoring, or machine learning training, is also used to determine the identity and/or order of positive and/or negative selection molecules for a subsequent round of selection.
Any suitable assay may be used to evaluate the binding of a pool of binding partners with the selection molecules in each step. In some embodiments, binding is directly evaluated, for example by directly detecting a label on the binding partner. Such labels may include, for example, fluorescent labels, such as a fluorophore or a fluorescent protein. In other embodiments, binding is indirectly evaluated, for example using a sandwich assay. In a sandwich assay, a binding partner binds to the selection molecule, and then a secondary labeled reagent is added to label the bound binding partner. This secondary labeled reagent is then detected. Examples of sandwich assay components include His-tagged-binding partner detected with an anti-His-tag antibody or His-tag-specific fluorescent probe; a biotin-labeled binding partner detected with labeled streptavidin or labeled avidin; or an unlabeled binding partner detected with an anti-binding-partner antibody.
In some embodiments, the binding partners being selected in each step are identified based on the binding signal, or dose-response, using any number of available detection methods. These detection methods may include, for example, imaging, fluorescence-activated cell sorting (FACS), mass spectrometry, or biosensors. In some embodiments, a hit threshold is defined (for example the median signal), and any with signal above that signal is flagged as a putative hit motif.
IV. Use of Engineered polypeptides to Produce Antibodies
The engineered polypeptides provided herein, and identified by the methods provided herein, may be used, for example, to produce one or more antibodies. In some embodiments, the antibody is a monoclonal or polyclonal antibody. Thus, in some embodiments, provided herein is an antibody produced by immunizing an animal with an immunogen, wherein the immunogen is an engineered polypeptide as provided herein. In some embodiments, the animal is a human, a rabbit, a mouse, a hamster, a monkey, etc. In certain embodiments, the monkey is a cynomolgus monkey, a macaque monkey, or a rhesus macaque monkey. Immunizing the animal with an engineered polypeptide can comprise, for example, administering at least one dose of a composition comprising the peptide and optionally an adjuvant to the animal. In some embodiments, generating the antibody from an animal comprises isolating a B cell which expresses the antibody. Some embodiments further comprise fusing the B cell with a myeloma cell to create a hybridoma which expresses the antibody. In some embodiments, the antibody generated using the engineered polypeptide can cross react with a human and a monkey, for example a cynomolgus monkey.
a. Characteristics of the Engineered Polypeptide
The engineered polypeptides provided herein have one or more characteristics in common with CD25. In some embodiments, they exhibit at least one characteristic of the surface of CD25, for example the functional interface surface that binds with a binding partner of CD25. In some embodiments, the binding partner is an antibody that binds specifically to CD25. In some embodiments, the engineered polypeptide exhibits at least one characteristic of a portion of the surface of CD25 that is not known to interact to an antibody to CD25.
In some embodiments of certain types of characteristics, the engineered polypeptide presents a mimic of a functional interface of CD25 (such as a binding surface), but the characteristic shared by the engineered polypeptide may be best described as being shared with CD25 as a whole. For example, one characteristic that is shared may be binding between a binding partner of CD25 and CD25, wherein the binding occurs with a functional binding interface of CD25, but the structure and orientation of the functional binding interface is supported by the rest of the CD25 protein.
Such shared characteristics may include, for example, structural metrics, or functional metrics, or combinations thereof. The at least one shared characteristic may include, for example, one or more structural similarities, similarity of conformational entropy, one or more chemical descriptor similarities, one or more functional binding similarities, or one or more phenotypic similarities, or any combinations thereof. In certain embodiments, the engineered polypeptide shares one or more of these characteristics with at least a portion of the surface of CD25, such as a functional interface, for example a binding surface.
In some embodiments, the engineered polypeptide has structural similarity to CD25 (or a portion of the surface of CD25, such as a binding surface), and this structural similarity is evaluated by backbone root-mean-square deviation (RMSD) or side-chain RMSD. RMSD evaluates the average distance between atoms, and can be applied to three-dimensional structures to compare how similar two separate structures are in three-dimensional space. In some embodiments, the RMSD of the backbone, or amino acid side chains, or both, between the engineered polypeptide and CD25 (or a functional interface of CD25) is lower than the RMSD between CD25 (or a functional interface of CD25) and a different molecule. In some embodiments, it is a portion of CD25 (or a portion of a functional interface of CD25) that is compared with the engineered polypeptide. The RMSD may be evaluated, for example, using the experimentally measured structure or the simulated structure of the engineered polypeptide; and the experimentally measured structure or the simulated structure of CD25 (or a functional interface thereof). In some embodiments, a engineered polypeptide is considered structurally similar to CD25 if the backbone of the engineered polypeptide has an average RMSD less than or equal to 6.0 Å relative to the backbone of an x-ray structure of CD25.
In some embodiments, the engineered polypeptide has similar conformational entropy to CD25 (or a portion of the surface of CD25, such as a binding surface), and this conformational entropy is evaluated, for example, using the experimentally measured structure or the simulated structure of the engineered polypeptide, and the experimentally measured structure or the molecular dynamics simulated motion of CD25 (or portion thereof). In such simulations, in some embodiments the experimentally measured structure or the molecular dynamics simulated motion of CD25 (or portion thereof, such as a portion of the binding surface) is used. In certain embodiments, the conformational entropy of the engineered polypeptide is considered similar to that of CD25 (or portion thereof) if an engineered polypeptide molecular dynamics ensemble run under standard physiological conditions has all states with all non-hydrogen atomic position RMSDs ≤6.0 Å relative to a known x-ray crystal structure of CD25 (or portion thereof).
In still other embodiments, the engineered polypeptide has one or more chemical descriptors similar to CD25 (or a portion thereof, such as the binding surface). In other embodiments, the engineered polypeptide has one or more chemical descriptors complementary to a binding partner of CD25 (e.g., an antibody to CD25). Such chemical descriptors (which may be similar or complementary) may include, for example, hydrophobicity patterns, H-bonding patterns, atomic volume/radii, charge patterns, or atomic occupancy patterns, or any combinations thereof. These chemical descriptors may, in some embodiments, be evaluated using the experimentally measured structure or the simulated structure of the engineered polypeptide, and the experimentally measured structure or the simulated structure of CD25 (or a portion thereof, such as the binding surface).
In still other embodiments, the engineered polypeptide has similar functional binding as CD25. For example, in some embodiments the engineered polypeptide has binding to a CD25 binding partner, or fragment thereof. In some embodiments, the binding partner is a fragment of the native binding partner, or is a modified native binding partner. Such modifications may include, for example, a fusion protein comprising at least a fragment of the native binding partner; labeling with a chromophore; labeling with a fluorophore; labeling with biotin; or labeling with a His-tag. In some embodiments, the engineered polypeptide has binding with a binding partner of CD25 that is within about two orders of magnitude, or within about one order of magnitude, of the binding of CD25 with a binding partner. In some embodiments, the similarity of binding is evaluated by comparing the binding constant (Kd), or the inhibitory constant (Ki), or the binding on-rate, or the binding off-rate, or the binding affinity of the binding pairs, or the Gibbs free energy of binding (AG). In some embodiments, the binding partner is an antibody to CD25.
In some embodiments, the binding constant (Kd) of the engineered polypeptide with a CD25 binding partner is within 1000-fold, within 800-fold, within 600-fold, within 400-fold, within 200-fold, within 100-fold, within 90-fold, within 80-fold, within 70-fold, within 60-fold, within 50-fold, within 40-fold, within 30-fold, within 20-fold, within 10-fold, within 8-fold, within 6-fold, within 4-fold, within 2-fold, within 1.5-fold, within 1.2-fold, or about the same as the Kd of CD25 with the binding partner. In other embodiments, the inhibitory constant (Ki) of the engineered polypeptide with a CD25 binding partner is within 1000-fold, within 800-fold, within 600-fold, within 400-fold, within 200-fold, within 100-fold, within 90-fold, within 80-fold, within 70-fold, within 60-fold, within 50-fold, within 40-fold, within 30-fold, within 20-fold, within 10-fold, within 8-fold, within 6-fold, within 4-fold, within 2-fold, within 1.5-fold, within 1.2-fold, or about the same as the Ki of CD25 and the binding partner. In still further embodiments, the binding on-rate of the engineered polypeptide with a CD25 binding partner is similar to the binding on-rate of CD25 and the binding partner. In some embodiments, the binding on-rate of the engineered polypeptide with a CD25 binding partner is within 1000-fold, within 800-fold, within 600-fold, within 400-fold, within 200-fold, within 100-fold, within 90-fold, within 80-fold, within 70-fold, within 60-fold, within 50-fold, within 40-fold, within 30-fold, within 20-fold, within 10-fold, within 8-fold, within 6-fold, within 4-fold, within 2-fold, within 1.5-fold, within 1.2-fold, or about the same as the on-rate of CD25 and the binding partner. In other embodiments, the binding off-rate of the engineered polypeptide with a CD25 binding partner is similar to the binding off-rate of CD25 and the binding partner. In some embodiments, the binding off-rate of the engineered polypeptide with a CD25 binding partner is within 1000-fold, within 800-fold, within 600-fold, within 400-fold, within 200-fold, within 100-fold, within 90-fold, within 80-fold, within 70-fold, within 60-fold, within 50-fold, within 40-fold, within 30-fold, within 20-fold, within 10-fold, within 8-fold, within 6-fold, within 4-fold, within 2-fold, within 1.5-fold, within 1.2-fold, or about the same as the off-rate of CD25 and the binding partner. In still further embodiments, the binding affinity of the engineered polypeptide with a CD25 binding partner is similar to the binding affinity of CD25 and the binding partner. In some embodiments, the binding affinity of the engineered polypeptide with a CD25 binding partner is within 1000-fold, within 800-fold, within 600-fold, within 400-fold, within 200-fold, within 100-fold, within 90-fold, within 80-fold, within 70-fold, within 60-fold, within 50-fold, within 40-fold, within 30-fold, within 20-fold, within 10-fold, within 8-fold, within 6-fold, within 4-fold, within 2-fold, within 1.5-fold, within 1.2-fold, or about the same as the binding affinity of CD25 and the binding partner. In some embodiments, the Gibbs free energy of binding of the engineered polypeptide with a CD25 binding partner is within 1000-fold, within 800-fold, within 600-fold, within 400-fold, within 200-fold, within 100-fold, within 90-fold, within 80-fold, within 70-fold, within 60-fold, within 50-fold, within 40-fold, within 30-fold, within 20-fold, within 10-fold, within 8-fold, within 6-fold, within 4-fold, within 2-fold, within 1.5-fold, within 1.2-fold, or about the same as the Gibbs free energy of binding of CD25 and the binding partner. In some embodiments, the CD25 binding partner is an antibody of CD25.
In yet other embodiments, the engineered polypeptide shares sequence similarity with CD25, or a portion thereof (such as a binding surface of CD25). The similarity may be compared to the continuous amino acid sequence of CD25 (or portion thereof), or to a discontinuous sequence of CD25 (or portion thereof). For example, in certain embodiments, a binding surface of CD25 is formed by discontinuous amino acid sequences, and the engineered polypeptide has sequence similarity with at least a portion of the discontinuous sequences that form the surface. In other embodiments, the engineered polypeptide has sequence similarity with at least a portion of a continuous amino acid sequence that forms a binding surface of CD25. In some embodiments, the binding surface of CD25 comprises an epitope that binds to an antibody to CD25.
In some embodiments, the engineered polypeptide has a sequence that is at least 40% identical, at least 45% identical, at least 50% identical, at least 55% identical, at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 85% identical, or at least 90% identical, to a portion of the continuous sequence of CD25, for example a continuous sequence that forms a binding surface of CD25. In certain embodiments, the engineered polypeptide has a sequence that is at least 40% identical, at least 45% identical, at least 50% identical, at least 55% identical, at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 85% identical, or at least 90% identical, to a portion of the discontinuous sequence of CD25, for example the discontinuous sequence that forms a binding surface of CD25. In certain embodiments, the engineered polypeptide has a sequence that is at least 40% identical, at least 45% identical, at least 50% identical, at least 55% identical, at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 85% identical, or at least 90% identical, to a contiguous portion of a binding surface of CD25. In still further embodiments, the engineered polypeptide has a sequence that is at least 40% identical, at least 45% identical, at least 50% identical, at least 55% identical, at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 85% identical, or at least 90% identical, to two or more discontiguous portions of a binding surface of CD25. In some embodiments, the engineered polypeptide has a sequence at least partly identical (as described herein) with a binding surface of CD25, wherein the binding surface comprises an epitope that binds to one or more antibodies to CD25.
In certain embodiments, sequence similarity of the engineered polypeptide and CD25 (or portion thereof) is evaluated using the peptide portion(s) of the engineered polypeptide, not including a linker, if present. In certain embodiments, one or more linking moieties are considered as well, for example if the engineered polypeptide comprises one or more linkers that comprise an amino acid.
b. Engineered Polypeptide
In some embodiments, the engineered polypeptide comprises more than one peptide, for example at least two peptides, or at least three peptides, or greater. In some embodiments, the engineered polypeptide comprises between 1 and 10 peptides, between 1 and 8 peptides, between 1 and 6 peptides, between 1 and 4 peptides, between 2 and 10 peptides, between 2 and 8 peptides, between 2 and 6 peptides, or between 2 and 4 peptides.
In some embodiments, the engineered polypeptide comprises between 2 to 100 amino acids, for example, between 2 to 80 amino acids, between 2 to 70 amino acids, between 2 to 60 amino acids, between 2 to 50 amino acids, between 2 to 40 amino acids, between 2 to 30 amino acids, between 2 to 25 amino acids, between 2 to 20 amino acids, between 2 to 15 amino acids, between 5 to 30 amino acids, between 5 to 25 amino acids, between 5 to 20 amino acids, between 5 to 15 amino acids, or between 9 and 15 amino acids.
In certain embodiments, the engineered polypeptide comprises greater than one peptide, for example at least two peptides, or at least three peptides, or at least four peptides, or greater, and each peptide independently comprises between 1 to 100 amino acids, or between 2 to 100 amino acids, for example, between 2 to 80 amino acids, between 2 to 70 amino acids, between 2 to 60 amino acids, between 2 to 50 amino acids, between 2 to 40 amino acids, between 2 to 30 amino acids, between 2 to 25 amino acids, between 2 to 20 amino acids, between 2 to 15 amino acids, between 5 to 30 amino acids, between 5 to 25 amino acids, between 5 to 20 amino acids, between 5 to 15 amino acids, or between 9 and 15 amino acids.
In some embodiments, the engineered polypeptide comprises only naturally occurring amino acids. In other embodiments, the engineered polypeptide comprises non-natural amino acids, for example a combination of naturally occurring and non-natural amino acids.
In some embodiments, wherein the engineered polypeptide comprises two or greater peptides, each peptide independently exhibits at least one characteristic of CD25, or a portion thereof (such as a binding surface). In some embodiments, each peptide independently exhibits 1 to 10, 1 to 9, 1 to 8, 1 to 7, 1, to 6, 1 to 5, 1 to 4, 1 to 3, or 1, or 2 characteristics of CD25, or a portion thereof. In some embodiments, the characteristics are shared with a portion of CD25 that interacts with an antibody of CD25.
In some embodiments, the engineered polypeptide has at least one characteristic that is complementary to a binding partner of CD25, for example an antibody of CD25.
In some embodiments, a peptide of the engineered polypeptide shares one or more structural similarities with CD25, or a portion thereof. The structural similarity may be, in some embodiments, evaluated by backbone RMSD or side-chain RMSD. For example, in certain embodiments, the RMSD of the backbone, or amino acid side chains, or both, between a peptide of the engineered polypeptide and CD25 (or a portion thereof) is lower than the RMSD between CD25 (or portion thereof) and a different molecule (such as a different peptide). In some embodiments, a portion of CD25 is compared with the peptide, for example a portion of the surface of CD25, such as a surface that interacts with an antibody to CD25. RMSD of structural similarity may be evaluated, for example, using the experimentally measured structure or the simulated structure of the peptide and the experimentally measured structure or the simulated structure of CD25 or portion thereof. In some embodiments, a peptide of the engineered polypeptide is considered structurally similar to CD25 (or portion thereof) if the backbone of the peptide has an average RMSD less than or equal to 6.0 Å relative to the backbone of a known x-ray structure of CD25, or the portion thereof.
In some embodiments, the engineered polypeptide has similar conformational entropy to CD25 or a portion thereof. In some embodiments, the experimentally measured structure or the molecular dynamics simulated motion of the peptide is used to compare the conformation entropy with the experimentally measured structure or the simulated structure of CD25, or a portion thereof. The conformational entropy is considered similar, in some embodiments, if a peptide molecular dynamics ensemble run under standard physiological conditions has all states with all non-hydrogen atomic portions RMSDs ≤6.0 Å relative to a known x-ray crystal structure of CD25, or portion thereof. In some embodiments, a portion of CD25 is compared with the peptide, for example a surface portion of CD25 that interacts with an antibody of CD25.
In further embodiments, the similarity between a peptide of the engineered polypeptide and CD25 (or portion thereof) may be one or more chemical descriptors. In some embodiments, the peptide has one or more chemical descriptors in common with CD25 (or a portion thereof), or one or more chemical descriptors that is complementary to a binding partner of CD25 (for example, an antibody to CD25). Chemical descriptors may include, for example, hydrophobicity patterns, H-bonding patterns, atomic volume/radii, charge patterns, or atomic occupancy patterns, or any combinations thereof. In some embodiments, a peptide of the engineered polypeptide has one or more hydrophobicity patterns, H-bonding patterns, atomic volume/radii, charge patterns, or atomic occupancy patterns, or any combinations thereof, similar those in CD25 or a portion thereof, or which is complementary to a binding partner of CD25 (such as an antibody to CD25). In some embodiments, the similarity is having the same chemical descriptor in common, such as one or more of the same hydrophobicity patterns, H-bonding patterns, atomic volume/radii, charge patterns, or atomic occupancy patterns. Complementary chemical descriptors includes, for example, a peptide with a positive charge pattern that complements the negative charge pattern of a binding partner of CD25, such as an antibody to CD25. These chemical descriptors may, in some embodiments, be evaluated using an experimentally measured structure or a simulated structure of the peptide, and an experimentally measured structure or a simulated structure of CD25, or the CD25 binding partner (e.g., for complementary evaluation).
For example, in some embodiments, the engineered polypeptide binds binding partner of CD25 that is similar to the binding of CD25 with the binding partner (for example, IL-2). In some embodiments, the binding partner is the native binding partner, a fragment of a native binding partner, or a modified native binding partner or fragment thereof, or an antibody that binds specifically to CD25. In some embodiments, the binding partner binds under certain circumstances but not others. In some embodiments, the binding partner binds under pathological conditions, or binds under non-pathological conditions. The binding partner may be, for example, constitutively expressed, or the product of a facultative gene, or comprise a protein or a fragment thereof. In certain embodiments, the binding partner is a fragment of a native binding partner, or is a modified native binding partner. Modifications may include, in some embodiments, a fusion protein comprising at least a fragment of the native binding partner; labeling with a chromophore; labeling with a fluorophore; labeling with biotin; or labeling with a His-tag.
In some embodiments, the engineered polypeptide has binding with a binding partner of CD25 that is within about two orders of magnitude, or within about one order of magnitude, of the binding of CD25 with the binding partner. In some embodiments, the similarity of binding is evaluated by comparing the binding constant (Kd), or the inhibitory constant (Ki), or the binding on-rate, or the binding off-rate, or the binding affinity of the binding pairs, or the Gibbs free energy of binding (AG). In some embodiments, the binding partner is an antibody to CD25.
In some embodiments, the binding constant (Kd) of the engineered polypeptide with a CD25 binding partner is within 1000-fold, within 800-fold, within 600-fold, within 400-fold, within 200-fold, within 100-fold, within 90-fold, within 80-fold, within 70-fold, within 60-fold, within 50-fold, within 40-fold, within 30-fold, within 20-fold, within 10-fold, within 8-fold, within 6-fold, within 4-fold, within 2-fold, within 1.5-fold, within 1.2-fold, or about the same as the Kd of CD25 with the binding partner. In other embodiments, the inhibitory constant (Ki) of the engineered polypeptide with a CD25 binding partner is within 1000-fold, within 800-fold, within 600-fold, within 400-fold, within 200-fold, within 100-fold, within 90-fold, within 80-fold, within 70-fold, within 60-fold, within 50-fold, within 40-fold, within 30-fold, within 20-fold, within 10-fold, within 8-fold, within 6-fold, within 4-fold, within 2-fold, within 1.5-fold, within 1.2-fold, or about the same as the Ki of CD25 and the binding partner. In still further embodiments, the binding on-rate of the engineered polypeptide with a CD25 binding partner is similar to the binding on-rate of CD25 and the binding partner. In some embodiments, the binding on-rate of the engineered polypeptide with a CD25 binding partner is within 1000-fold, within 800-fold, within 600-fold, within 400-fold, within 200-fold, within 100-fold, within 90-fold, within 80-fold, within 70-fold, within 60-fold, within 50-fold, within 40-fold, within 30-fold, within 20-fold, within 10-fold, within 8-fold, within 6-fold, within 4-fold, within 2-fold, within 1.5-fold, within 1.2-fold, or about the same as the on-rate of CD25 and the binding partner. In other embodiments, the binding off-rate of the engineered polypeptide with a CD25 binding partner is similar to the binding off-rate of CD25 and the binding partner. In some embodiments, the binding off-rate of the engineered polypeptide with a CD25 binding partner is within 1000-fold, within 800-fold, within 600-fold, within 400-fold, within 200-fold, within 100-fold, within 90-fold, within 80-fold, within 70-fold, within 60-fold, within 50-fold, within 40-fold, within 30-fold, within 20-fold, within 10-fold, within 8-fold, within 6-fold, within 4-fold, within 2-fold, within 1.5-fold, within 1.2-fold, or about the same as the off-rate of CD25 and the binding partner. In still further embodiments, the binding affinity of the engineered polypeptide with a CD25 binding partner is similar to the binding affinity of CD25 and the binding partner. In some embodiments, the binding affinity of the engineered polypeptide with a CD25 binding partner is within 1000-fold, within 800-fold, within 600-fold, within 400-fold, within 200-fold, within 100-fold, within 90-fold, within 80-fold, within 70-fold, within 60-fold, within 50-fold, within 40-fold, within 30-fold, within 20-fold, within 10-fold, within 8-fold, within 6-fold, within 4-fold, within 2-fold, within 1.5-fold, within 1.2-fold, or about the same as the binding affinity of CD25 and the binding partner. In some embodiments, the Gibbs free energy of binding of the engineered polypeptide with a CD25 binding partner is within 1000-fold, within 800-fold, within 600-fold, within 400-fold, within 200-fold, within 100-fold, within 90-fold, within 80-fold, within 70-fold, within 60-fold, within 50-fold, within 40-fold, within 30-fold, within 20-fold, within 10-fold, within 8-fold, within 6-fold, within 4-fold, within 2-fold, within 1.5-fold, within 1.2-fold, or about the same as the Gibbs free energy of binding of CD25 and the binding partner. In some embodiments, the CD25 binding partner is an antibody of CD25.
In some embodiments, the engineered polypeptide has sequence similarity with CD25, or a portion thereof. In some embodiments, the engineered polypeptide has sequence similarity with a portion of the surface of CD25 that binds to an antibody of CD25. In certain embodiments, the sequence similarity is compared to the continuous amino acid sequence of CD25. In other embodiments, the sequence similarity is compared to a discontinuous sequence of CD25. For example, in certain embodiments, a binding surface of folded CD25 is formed by discontinuous amino acid sequences, and the engineered polypeptide has sequence similarity with at least a portion of the discontinuous sequences that form the surface. In some embodiments, the engineered polypeptide has sequence similarity with at least a portion of a continuous amino acid sequence that forms a binding surface of CD25. In some embodiments, the engineered polypeptide has a sequence that is at least 50% identical, at least 55% identical, at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 85% identical, at least 90% identical, at least 95% identical, or at least 99% identical to at least a portion of a continuous sequence of CD25, such as a continuous sequence that forms a binding surface. In certain embodiments, the engineered polypeptide has a sequence that is at least 40% identical, at least 45% identical, at least 50% identical, at least 55% identical, at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 85% identical, or at least 90% identical, to at least a portion of the discontinuous sequence of CD25, for example the discontinuous sequence that forms a binding surface. In certain embodiments, the engineered polypeptide has a sequence that is at least 40% identical, at least 45% identical, at least 50% identical, at least 55% identical, at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 85% identical, or at least 90% identical, to a contiguous portion of CD25. In still further embodiments, the engineered polypeptide has a sequence that is at least 40% identical, at least 45% identical, at least 50% identical, at least 55% identical, at least 60% identical, at least 65% identical, at least 70% identical, at least 75% identical, at least 80% identical, at least 85% identical, or at least 90% identical, to two or more discontiguous portions of CD25. In some embodiments, for engineered polypeptides that comprise at least two peptides, two or more peptides of the engineered immunogen independently share sequence similarity with CD25, such as with a binding surface of CD25. In some embodiments, the portion of CD25 that shares sequence similarity with the engineered polypeptide is a surface that binds to an antibody to CD25.
c. Linking Moiety
The engineered polypeptides provided herein optionally comprise a linking moiety. When present, the linking moiety may be, for example, independently a cross-link or a linker.
In some embodiments, the engineered polypeptide comprises N number of peptides, and N−1 number of linking moieties; or N number of peptides, and N−1 number of linking moieties; or N number of peptides, and N number of linking moieties; or N number of peptides, and N+1 number of linking moieties; or N number of peptides, and N+2 number of linking moieties; or N number of peptides, and N−2 number of linking moieties, wherein N is 3 or larger.
In some embodiments, the engineered polypeptide comprises at least one linking moiety, at least two linking moieties, at least three linking moieties, at least four linking moieties, at least five linking moieties, at least six linking moieties, between one to six linking moieties, between one to five linking moieties, between one to four linking moieties, between one to three linking moieties, one linking moiety, or two linking moieties. In some embodiments, each linking moiety is independently a cross-link or a linker. In certain embodiments, each linking moiety is a cross-link. In other embodiments, each linking moiety is a linker. In still further embodiments, at least one linking moiety is a cross-link, and the remaining linking moieties are independently cross-links or a linkers. In other embodiments, at least one linking moiety is a linker, and the remaining linking moieties are independently cross-links or a linkers.
A cross-link includes, for example, a covalent bond between the side chain of one amino acid and a moiety of another amino acid. The amino acids may be independently natural or non-natural amino acids. In some embodiments, cross-links include a covalent bond between the side chains of two amino acids, or between the side chain of one amino acid and the amine or carboxyl group of another amino acid. A cross-link may form within one peptide or between two separate peptides. In some embodiments, the engineered polypeptides provided herein comprise mixture of both intra-peptide and inter-peptide cross-links. In some embodiments, the cross-link is a disulfide bond between two thiol groups of amino acid side chains, such as a disulfide bond between two cysteines. In some embodiments, the cross-link is an amide bond between an amine group and a carboxylic acid group of two amino acids, wherein at least one of the amine and the carboxylic acid group is located on a side chain of an amino acid (e.g., the amide bond is not a backbone amide bond). In some embodiments, the cross-link is an amide bond formed between diaminopimelic acid and aspartic acid. In some embodiments, an amide cross-link is a lactam. In some embodiments, the cross-link is an oxime. In some embodiments, the cross-link is a hydrazone. In some embodiments, a cross-link comprises a covalent bond between a side chain of an amino acid and a moiety of another amino acid, wherein one or both of the side chain and the moiety are modified to form the covalent bond. Such modifications may include, for example, oxidation, reduction, reaction with a catalyst to form an intermediate, or other modifications known to one of skill in the art.
A linker includes, for example, a molecule that is covalently bonded to at least two sites of a peptide, or between at least two peptides. A linker may bond to two sites within one peptide or between two separate peptides, or a combination of both. For example, a linker that comprises at more than two peptide-attachment sites may form both intra-peptide and inter-peptide bonds. In engineered polypeptides comprising at least two peptides and at least one linker, the peptides and linker may be connected in a variety of different configurations. For example, an engineered polypeptide may have peptide-linker-peptide-etc. pattern, ending with a peptide. In some embodiments, an engineered polypeptide comprises a linker that forms a branching point, for example a linker that is independently attached to three peptides. In some embodiments, an engineered polypeptide comprises a linker with three peptide-attachment sites, wherein the linker is only attached to two peptides.
In some embodiments, a linker comprises one or more amino acids. Amino acids that form part of a linker may, in some embodiments, be identified separately from the the engineered polypeptide. In certain embodiments, the linker is a region that separates and presents peptides of the engineered polypeptide in a structural, chemical, and/or dynamical manner that reflects the structure and/or function of a functional interface of the interface protein. In still further embodiments, the linker does not have a function on its own when not connected to the peptides of engineered polypeptide, for example does not exhibit binding to a binding partner of CD25. In some embodiments, each linker independently comprises at least one, at least two, at least three, at least four, at least five, at least six, or more amino acids. In some embodiments, each linker independently comprises one amino acid, two amino acids, three amino acids, four amino acids, five amino acids, or six amino acids. Amino acids that form part of a linker may be, in some embodiments, naturally occurring amino acids or non-naturally occurring amino acids. Each linker may, in some embodiments, independently comprise one or more alpha-amino acids, one or more beta-amino acids, or one or more gamma-amino acids, or any combinations thereof. In certain embodiments, a linker independently comprises a cyclic beta residue. Cyclic beta residues may include, for example, APC or ACPC. In still further embodiments, a linker may comprise one or more glycine residues, one or more serine residues, or one or more proline residues. In some embodiments, a linker has an amino acid sequence selected from the group consisting of AP, GP, GSG (SEQ ID NO: 32), (GGGGS)n (SEQ ID NO: 33), (GSG)n (SEQ ID NO: 34), GGGSGGGGS (SEQ ID NO: 35), GGGGSGGGS (SEQ ID NO: 36), (PGSG)n (SEQ ID NO: 37), and PGSGSG (SEQ ID NO: 38), wherein n is an integer between 1 and 10. In some embodiments, the engineered polypeptide comprises at least one linker, wherein each linker does not comprise amino acids, or wherein each linker does not comprise natural amino acids, or wherein each linker comprises at least one non-natural amino acid.
In some embodiments, a linker comprises a polymer. In some embodiments, the polymer is polyethylene glycol (PEG). A linker comprising PEG may comprise, for example, at least 3 PEG monomer units, at least 4 PEG monomer units, at least 5 PEG monomer units, at least 6 PEG monomer units, at least 7 PEG monomer units, at least 8 PEG monomer units, at least 9 PEG monomer units, at least 10 PEG monomer units, at least 11 PEG monomer units, at least 12 PEG monomer units, or greater than 12 PEG monomer units. In some embodiments of a linker comprising PEG, the PEG comprises between 3 to 12 monomer units, between 3 to 6 monomer units, between 6 to 12 monomer units, or between 4 to 8 monomer units. In some embodiments, the engineered polypeptide comprises at least one linker comprising PEG3 (comprising 3 monomer units), PEG6, or PEG12. In some embodiments, at least one linker is independently PEG3, PEG6, or PEG12. In further embodiments, the linker comprises a multi-arm PEG. For example, in certain embodiments, at least one linker independently comprises a 4-arm PEG, or an 8-arm PEG. In certain embodiments, each arm independently comprises between 3 to 12 monomer units, or between 3 to 6 monomer units, or between 6 to 12 monomer units, or between 4 to 8 monomer units. In certain embodiments, each arm of the multi-arm PEG comprises the same number of monomer units, for example a 4- or 8-arm PEG wherein each arm comprises 3 monomer units, 6 monomer units, or 12 monomer units.
In other embodiments, a linker comprises a dendrimer. Dendrimers include, for example, molecules with a tree-like branching architecture, comprising a symmetric core from which molecular moieties radially extend, with branch points forming new layers in the molecule. Each new branch point introduces a new, larger layer, and these radial extensions often terminate in functional groups at the exterior terminal surface of the dendrimer. Thus, increasing the number of branch points in turn amplifies the possible number of terminal functional groups at the surface.
In some embodiments, at least one linker comprises a small molecule that is not an amino acid or polymer. In some embodiments, at least one linker comprises a benzodiazepine. In some embodiments, the linker comprises a moiety that is the product of a sulfhydryl-maleimide reaction, which may be a pyrrolidine dione moiety (for example a pyrrolidine-2,5-dione moiety). In some embodiments, the linker comprises an amidine moiety. In some embodiments, the linker comprises a thioether moiety.
In some embodiments, at least one linker comprises trans-pyrrolidine-3,4-dicarboxamide.
In some embodiments, wherein the engineered polypeptide comprises at least two linkers (e.g., in embodiments wherein the engineered polypeptide comprises at least two linking moieties wherein each linking moiety is independently a linker or a cross-link, or wherein each linking moiety is independently a linker), each linker is independently any of the linkers described herein. For example, in some embodiments, each linker is independently a linker comprising one or more amino acids, a linker comprising a polymer, a linker comprising a dendrimer, or a linker comprising a small molecule that is not an amino acid or polymer.
The one or more linking moieties of the engineered polypeptide may impart a particular structural or functional characteristic of interest, or a combination thereof. For example, in some embodiments a linking moiety is present in the engineered polypeptide to impart a structural characteristic, or a functional characteristic, or a combination thereof. Such structural characteristics may include, for example, increased structural flexibility, decreased structural flexibility, a directional feature, increased length, or decreased length. Directional features that may be of interest may include, for example, a structural turn, or maintaining a linear structure. Functional characteristics may include, for example, enhanced solubility, one or more protonation sites, one or more proteolytic sites, one or more enzymatic modification sites, one or more oxidation sites, a label, or a capture handle. In some embodiments, a linker comprises one or more functional characteristics, or one or more structural characteristics, or a combinations thereof.
In some embodiments, one or more linkers independently introduce a structural “turn” into the engineered polypeptide. Examples of such linker include Gly-Pro, Ala-Pro, and trans-pyrrolidine-3,4-dicarboxamide. In some embodiments, one or more linkers present in the engineered polypeptide increases structural flexibility of the engineered polypeptide, compared to the linker not being present, or the selection of a different linker. For example, a linker that is longer and/or less sterically hindered than another linker may, in some embodiments, result in the molecule having greater structural flexibility than if the linker were not present, or if another linker were used instead. In other embodiments, one or more linking moieties independently decreases structural flexibility in the engineered polypeptide, such as including a linker that is shorter and/or more sterically hindered than another linker, or a cross-link at a location or of a type that reduces flexibility of one or more peptides. The presence of a cross-link at a particular location between certain peptides, or between certain amino acid side chains, may result in the molecule having less structural flexibility than if the cross-link was at a different location or between different side chains (e.g., a disulfide or an amide cross-link), or if the cross-link were not present.
d. Additional Components
In some embodiments, the engineered polypeptides provided herein comprise one or more additional components. For example, in some embodiments, the engineered polypeptide comprises one or more moieties that attach the engineered polypeptide to a solid surface, such as a bead or flat surface. In some embodiments, the attachment moieties comprise a polymer (such as PEG), or biotin, or a combination thereof. In some embodiments, attaching the engineered polypeptide to a solid surface may, for example, enable assessment of one or more characteristics of the engineered polypeptide, such as assessment of binding with a binding partner of CD25 (for example, an antibody to CD25).
e. Sequence Similarity
In some embodiments, the engineered polypeptide provided herein has one of the sequences listed in Table 1:
In some embodiments, the engineered polypeptide has at least 6000 sequence similarity with any one of SEQ ID NOS: 1-21. In some embodiments, the engineered polypeptide has at least 7000 sequence similarity with any one of SEQ ID NOS: 1-21. In some embodiments, the engineered polypeptide has at least 8000 sequence similarity with any one of SEQ ID NOS: 1-21. In some embodiments, the engineered polypeptide has at least 90% sequence similarity with any one of SEQ TD NOS: 1-21. In some embodiments, the engineered polypeptide has at least 9500 sequence similarity with any one of SEQ ID NOS: 1-21. In some embodiments, the engineered polypeptide comprises any one of SEQ TD NOS: 1-21. In certain embodiments, the engineered polypeptide has any one of SEQ ID NOS: 1-21.
In some embodiments, the engineered polypeptide comprises any one of SEQ ID NOS: 1-21; and is modified at the N terminus, or the C terminus, or both. For example, in some embodiments the C terminus or the N terminus is covalently bonded to another molecule. In still further embodiments, the engineered polypeptide comprises any one of SEQ ID NOS: 1-21; and one or more amino acids at the N terminus or the C terminus, or both.
In some embodiments, the N-terminal molecule is a biotin-PEG2:
In some embodiments, the C-terminal molecule is a linker followed by biotin (e.g. a -GSGSGK-Biotin (SEQ ID NO: 846)). Other linkers suitable for attaching biotin to the C-terminus of the engineered polypeptide include GSG (SEQ ID NO: 32), GSS (SEQ ID NO: 39), GGS (SEQ ID NO: 40), GGSGGS (SEQ ID NO: 41), GSSGSS (SEQ ID NO: 42), GSGK (SEQ ID NO: 43), GSSK (SEQ ID NO: 44), GGSK (SEQ ID NO: 45), GGSGGSK (SEQ ID NO: 46), GSSGSSK (SEQ ID NO: 47), and the like.
V. Methods of Selecting an Engineered polypeptide
Further provided herein are methods of selecting an engineered polypeptide as described herein. Such methods may include, for example, using an iterative optimization of engineered polypeptide structural characteristics.
In some embodiments, one or more sections of CD25 are identified as the target interface. In some embodiments, at least a portion of the identified section(s) binds to an antibody of CD25. Thus, for example, in some embodiments a portion of CD25 that is an epitope for one or more antibodies is identified as the target interface. In other embodiments, a section of CD25 is identified as the target interface that does not bind to an antibody, or for which it is unknown if antibody binding occurs. In certain embodiments, the crystal structure for at least a portion of CD25 is unknown, and the initial selection of a target interface includes molecular dynamics simulations of CD25 and CD25 binding. In some embodiments, one or more initial input sequences are obtained from the identified section or sections, wherein each sequence is independently continuous or discontinuous. In developing an engineered polypeptide candidate, at least some of the interface residues of each sequence are retained, and one or more linking moieties are incorporated into the sequence to provide desired structural and dynamic characteristics. In some embodiments, one or more non-interface residues are added to the sequence, or one or more residues in the input sequence are replaced with one or more non-interface residues, to achieve desired structural and dynamic characteristics relative to the cognate target structure and dynamics. In some embodiments, these non-interface residues are not from the target interface of CD25, or do not share one or more characteristics with the target interface of CD25, or share fewer characteristics and/or share characteristics less strongly with the target interface of CD25 than the retained interface residues. These intermediate, non-interface residues may, in some embodiments, form part or all of an amino acid linker.
Next, in some embodiments the initial design (or multiple designs) is produced and the molecular dynamics simulated to determine flexibility and overall stability of the design. If this initial design does not meet RMSD requirements, it may undergo iterative optimization of one or more linking moieties (such as one or more cross-links, or intermediate linker residues) using computational mutagenesis, in some embodiments. During this optimization, in some embodiments the interface residues are fixed while one or more of the linking moieties is changed, or removed, or added. The iterative optimization may be repeated until the engineered polypeptide RMSD interface residue positions relative to the target interface and structural order metric meet certain requirements (for example, ≤6.0 Å and ≥0.25, respectively, wherein structural order is on a 0-1 normalized scale, where 1=perfect structural stability).
In some embodiments, the intermediate structural stability residue regions can range from 1-50 amino acids in length. In certain embodiments, these intermediate structural stability residue regions are linkers, for example amino acid linkers. In some embodiments, the relatively small size of an engineered polypeptide produced by certain embodiments of the methods provided herein (compared, for example, to approaches that graft an interface onto a large structurally stabilizing scaffold) may enable chemical synthesis of the molecule, in contrast to a larger molecule that may require an in vitro expression system. Furthermore, in some embodiments the methods provided herein enable the incorporation of non-natural amino acids into intermediate positions or the interface positions, which may allow for fine control of interface engineering with novel moieties and properties such as post-translational modifications, solubility, cell-permeability, enzyme reactivity, pH sensitivity, oxidation sensitivity, etc. In still further embodiments, an engineered polypeptide may be selected with a higher likelihood of species cross-reactivity or disease-related mutation reactivity in selected antibodies when the engineered polypeptide is used as an immunogen or epitope-bait.
In some embodiments, the optimized molecule is the engineered polypeptide provided herein. In other embodiments, the optimized molecule is a candidate engineered polypeptide that may undergo further evaluation, further adjustment, or be used to generate a peptide library or a candidate engineered polypeptide library, or any combinations thereof. In certain embodiments, the method further includes using the engineered polypeptide candidate to generate a peptide library, or an engineered polypeptide candidate library, and then contacting the library with a binding partner of CD25 (such as an antibody to CD25). The peptide library may include, for example, peptides which are smaller than and share at least some sequence similarity with the engineered polypeptide candidate, and in which certain residues are optionally replaced with other residues. An engineered polypeptide candidate library may include, for example, variations of the engineered polypeptide candidate.
In some embodiments, the peptides of the peptide library comprise between 2 to 15 amino acids, between 5 to 15 amino acids, between 10 to 15 amino acids, between 2 to 10 amino acids, or between 5 to 10 amino acids. In some embodiments, the total number of amino acids in each peptide of the library includes both the interface amino acids and structural amino acids, which may include, for example, linker amino acids. The engineered polypeptide candidate library may be prepared by, for example, varying one or more amino acids or linking moieties in the candidates to make new library members. The engineered polypeptide candidates in the engineered polypeptide candidate library, in some embodiments, independently comprise between 5 to 40 amino acids, between 10 to 35 amino acids, between 15 to 35 amino acids, or between 20 to 30 amino acids. In some embodiments, the total number of amino acids in each engineered polypeptide candidate of the candidate library can, in some embodiments, include both the interface amino acids and structural amino acids, which may include, for example, linker amino acids. The peptide library and the engineered polypeptide candidate library can, in some embodiments, independently comprise between 5,000 and 100,000 members, between 5,000 and 80,000 members, between 5,000 and 60,000 members, between 5,000 and 40,000 members, between 5,000 and 30,000 members, between 10,000 and 25,000 members, between 15,000 and 20,000 members, or about 17,000 members (e.g., distinct peptides or distinct engineered polypeptide candidates). In some embodiments, multiple separate libraries are produced and evaluated. In certain embodiments, the library members do not comprise certain cross-links. For example, in some embodiments, a library is evaluated wherein the library members do not have disulfide cross-links.
In some embodiments, to produce candidates for a candidate library, one or more linking moieties is added or removed, or location changed, in the design of the original engineered polypeptide candidate. For example, in some embodiments, a disulfide cross-link is removed, or is added, or the location of which is moved. In other embodiments, a lactam cross-link is removed, or is added, or the location of which is moved. In some embodiments, one or more amino acid residues is replaced. The binding of a CD25 binding partner to the peptide library, or engineered polypeptide candidate library, or both (if present), can provide additional information that may be used to further refine the design of the engineered polypeptide, or to select an engineered polypeptide. Additional information from screening these libraries may, for example, be used to make changes to the engineered polypeptide, for example to increase binding affinity with a binding partner of CD25. The engineered polypeptide candidate library can, in some embodiments, provide additional information regarding the effect of certain linker moieties on binding interactions (including presence or location of such moieties), such as cross-links including disulfide bonds and lactams. The peptide or engineered polypeptide candidate libraries, or both, may in some embodiments be used to identify common motifs (e.g., amino acid patterns or linking moieties, or combinations thereof) that may increase binding affinity or binding specificity for a binding partner of CD25, or provide other desired characteristics. Evaluating the binding of the cognate binding partner with the members of the peptide or the engineered polypeptide candidate libraries, or both, can provide additional structural and functional information, which may be used to further refine the engineered polypeptide design or to select an engineered polypeptide candidate.
a. Selection by Binding under Variable pH
In some embodiments, an engineered polypeptide is selected based, at least in part, on structural flexibility at physiological pH compared to structural flexibility at a lower pH. For example, CD25 may be overexpressed on tumor cells, and therefore binding of an antibody to CD25 with greater affinity in a tumor microenvironment may be desired in some embodiments. Therefore, in some embodiments, it may be desirable to select an engineered polypeptide that is more rigid at lower pH, or in which one or more amino acids have a particular orientation at lower pH, or has greater binding affinity or binding selectivity at lower pH, compared to the same engineered polypeptide at physiological pH. In many cancerous tumors, the growth rate of cancerous cells can outpace the oxygen supply available in portions of the tumor, resulting in a hypoxic microenvironment within the tumor. The level of oxygen in tissues can affect the pH of the tissue environment, and hypoxic levels can lead to decreased pH (including, for example, by the buildup of acidic metabolites from anaerobic glycolysis). Thus, in some embodiments, selecting an engineered polypeptide that has greater binding at low pH (e.g., has desirable structural characteristics that lead to binding interactions), but has reduced binding at physiological pH (e.g., has decreased, fewer, or no desirable structural characteristics that lead to binding interactions), can, in some embodiments, result in an engineered polypeptide that can produce an antibody with greater binding to the desired target in a tumor, compared to binding not in a tumor. Physiological pH is typically between about 7.35 and about 7.45, for example about 7.4. The pH of a tumor microenvironment may be, for example, less than about 7.45, less than about 7.45, between about 7.45 and about 6.0, between about 7.0 and about 6.0, between about 6.8 and about 6.2, between about 6.7 and about 6.3, about 6.4, about 6.5, about 6.6, about 6.7, about 6.8, about 6.9 or about 7.0. In some embodiments, an engineered polypeptide can be evaluated at different pHs using computational methods, for example molecular dynamics simulations. In other embodiments, an engineered polypeptide is selected based on differential pH characteristics using an in vitro method. Suitable in vitro methods may include, for example, phage panning at different pHs. For example, an antibody phage display library can be used to pan one or more engineered polypeptides at physiological pH, and phage that bind at that pH can be discarded. Then, a second round of panning can be carried out at a lower pH, and phage that bind to the one or more engineered polypeptides at the lower pH can be selected. In some embodiments, engineered polypeptides which bind to no phage at a lower pH, or which bind to phage with similar affinity at both low and physiological pH, may be less desirable for use in generating an antibody that targets tumor cells.
b. Inverse Peptide Evaluation
In still further embodiments, selecting an engineered polypeptide may include comparing the binding of the engineered polypeptide to binding of an inverse engineered polypeptide. An inverse engineered polypeptide may be based on the engineered polypeptide, but replacing one or more of the interface-interacting amino acid residues (e.g., based on the surface of CD25) with an amino acid that exhibits an inverse characteristic. For example, an amino acid with a large, sterically bulky, hydrophobic side chain may be replaced with an amino acid that has a smaller side chain, or hydrophilic side chain, or a side chain that is both smaller and hydrophilic. In some embodiments, an amino acid with a hydrogen bond-donating side chain may be replaced with an amino acid that has a hydrogen bond-accepting side chain, or with a an amino acid that has a side chain that does not hydrogen bond. Binding characteristics that may be compared using the engineered polypeptide and the inverse engineered polypeptide may include, in some embodiments, specificity and/or affinity. Comparing the binding characteristics of a engineered polypeptide with the binding characteristics of an inverse engineered polypeptide may, in some embodiments, help select engineered polypeptides in which the interface-interacting amino acids drive the binding interactions, rather than characteristics of a linking moiety such as a linker. Engineered polypeptides in which binding is driven by a linking moiety such as a linker may be less desirable in some embodiments as they may exhibit off-target binding, or other undesirable binding characteristics.
In further embodiments, the method further comprises modifying the selected engineered polypeptides.
c. Binding Evaluation
As described herein, in some embodiments, the method of selecting an engineered polypeptide provided herein comprises evaluating the binding of an engineered polypeptide candidate to a protein or fragment thereof, for example a binding partner of CD25 (such as an antibody to CD25). For example, in some embodiments, an engineered polypeptide candidate library or peptide library is screened for binding to a binding partner of CD25.
Binding of a protein or fragment thereof (e.g., a binding partner of CD25) with one or more peptides or engineered polypeptide candidates (such as a member of a library) may be evaluated in various ways. In some embodiments, binding is directly evaluated, for example by directly detecting a label on the protein or fragment thereof. Such labels may include, for example, fluorescent labels, such as a fluorophore or a fluorescent protein. In other embodiments, binding is indirectly evaluated, for example using a sandwich assay. In a sandwich assay, a peptide or engineered polypeptide candidate (such as a member of a library) binds to a binding partner, and then a secondary labeled reagent is added to label the bound binding partner. This secondary labeled reagent is then detected. Examples of sandwich assay components include His-tagged-binding partner detected with an anti-His-tag antibody or His-tag-specific fluorescent probe; a biotin-labeled binding partner detected with labeled streptavidin or labeled avidin; or an unlabeled binding partner detected with an anti-binding-partner antibody.
In some embodiments, peptides or engineered polypeptide candidates of interest are identified based on the binding signal, or dose-response, using any number of available detection methods. These detection methods may include, for example, imaging, fluorescence-activated cell sorting (FACS), mass spectrometry, or biosensors. In some embodiments, a hit threshold is defined (for example the median signal), and any with signal above that signal is flagged as a putative hit motif.
For the development of the combinatorial library, peptides identified from the peptide library based on binding with the protein or fragment thereof may, in some embodiments, be further clustered into distinct groups using sequence or structural information, or a combinations thereof. This grouping may be done, for example, using generally available sequence alignment, chemical descriptors, structural prediction, and entropy prediction informatics tools (e.g. MUSCLE, CLUSTALW, PSIPRED, AMBER, Hydropathy Calculator, and Isoelectric Point Calculator) and clustering algorithms (e.g., K-Means, Gibbs, and Hierarchical). Clusters of motifs (e.g., structural or functional motifs) present in peptide hits can be identified from this analysis. Individual peptide motif hits can also be identified. Using these motif clusters and individual motifs, in some embodiments, design rules can be formulated that define one or more of sequence, structure, and chemical characteristics of the motifs that appear to drive the protein interactions at the target interface. In some embodiments, the structure of the target interface is not necessary for identification of these interface motif design rules. Rather, the design rules can, in some embodiments, be derived from analysis of peptides identified from screening the peptide library.
In some embodiments, the binding assay has a sensitivity dynamic range of about 105. Thus, in some embodiments, an engineered polypeptide candidate is identified as of interest if it has a binding event with a CD25 binding partner that is within a 105 signal bracket of the native CD25:binding partner signal. The type of signal may be different depending on what type of assay is being used, or how it is being evaluated. For example, in some embodiments, the signal is response units in a sensorgram, fluorescence signal in an image-based readout, or enzymatic readout in an enzyme-based assay. The signal for binding events may be measured relative to CD25:binding partner signal.
In some embodiments, the engineered polypeptide candidate is modified prior to evaluating binding. For example, in some embodiments, biotin, PEG, or another attachment moiety, or combination thereof, is bonded to the C terminus or the N terminus of the peptide to enable it to be used with a binding evaluation system. For example, in some embodiments biotin-PEG12- is covalently attached to the N-terminus of the engineered polypeptide. In other embodiments, the engineered polypeptide candidate is modified at the C terminus with -GSGSGK-PEG4-biotin (SEQ ID NO: 48). In certain embodiments, such a biotin-modified engineered polypeptide candidate is then bound to a streptavidin bead through the biotin moiety, and the bead-supported immunogen is evaluated for binding to a binding partner of CD25.
The engineered polypeptides provided herein, and identified by the methods provided herein, may be used, for example, to produce one or more antibodies that bind specifically to CD25. In some embodiments, the antibody is a monoclonal or polyclonal antibody.
The term “antibody,” as used herein, refers to a protein, or polypeptide sequences derived from an immunoglobulin molecule, which specifically binds to an antigen. Antibodies can be intact immunoglobulins of polyclonal or monoclonal origin, or fragments thereof and can be derived from natural or from recombinant sources.
The terms “antibody fragment” or “antibody binding domain” refer to at least one portion of an antibody, or recombinant variants thereof, that contains the antigen binding domain, i.e., an antigenic determining variable region of an intact antibody, that is sufficient to confer recognition and specific binding of the antibody fragment to a target, such as an antigen and its defined epitope. Examples of antibody fragments include, but are not limited to, Fab, Fab′, F(ab′)2, and Fv fragments, single-chain (sc)Fv (“scFv”) antibody fragments, linear antibodies, single domain antibodies (abbreviated “sdAb”) (either VL or VH), camelid VHH domains, and multi-specific antibodies formed from antibody fragments.
The term “scFv” refers to a fusion protein comprising at least one antibody fragment comprising a variable region of a light chain and at least one antibody fragment comprising a variable region of a heavy chain, wherein the light and heavy chain variable regions are contiguously linked via a short flexible polypeptide linker, and capable of being expressed as a single polypeptide chain, and wherein the scFv retains the specificity of the intact antibody from which it is derived.
“Heavy chain variable region” or “VH” (or, in the case of single domain antibodies, e.g., nanobodies, “VHH”) with regard to an antibody refers to the fragment of the heavy chain that contains three CDRs interposed between flanking stretches known as framework regions, these framework regions are generally more highly conserved than the CDRs and form a scaffold to support the CDRs.
Unless specified, as used herein a scFv may have the VL and VH variable regions in either order, e.g., with respect to the N-terminal and C-terminal ends of the polypeptide, the scFv may comprise VL-linker-VH or may comprise VH-linker-VL.
The term “antibody light chain,” refers to the smaller of the two types of polypeptide chains present in antibody molecules in their naturally occurring conformations. Kappa (“K”).
Thus, in some embodiments, provided herein is an antibody produced by immunizing an animal with an immunogen, wherein the immunogen is an engineered polypeptide as provided herein. In some embodiments, the animal is a human, a rabbit, a mouse, a hamster, a monkey, etc. In certain embodiments, the monkey is a cynomolgus monkey, a macaque monkey, or a rhesus macaque monkey. Immunizing the animal with an engineered polypeptide can comprise, for example, administering at least one dose of a composition comprising the immunogen and optionally an adjuvant to the animal. In some embodiments, generating the antibody from an animal comprises isolating a B cell which expresses the antibody. Some embodiments further comprise fusing the B cell with a myeloma cell to create a hybridoma which expresses the antibody. In some embodiments, the antibody generated using the engineered polypeptide can cross react with a human and a monkey, for example a cynomolgus monkey.
In certain embodiments, the method of generating an antibody further comprises determining one or more epitopes for the antibody. In some embodiments, the method comprises screening the antibody for binding to two or more epitopes, for example by contacting an epitope library with the antibody, and evaluating binding of the antibody to epitopes of the library. In certain embodiments, an antibody that binds to two or more epitopes is discarded. In some embodiments, the engineered polypeptide mimics one epitope of CD25. In other embodiments, the engineered polypeptide mimics two or more epitopes of CD25. In certain embodiments, screening an antibody for binding to two or more epitopes, wherein the engineered polypeptide mimics two or more epitopes of the CD25, comprises contacting an epitope library with the antibody, and evaluating binding of the antibody to epitopes of the library, and discarding one or more antibodies that binds to two or more epitopes, wherein the epitopes are not those mimicked by the engineered polypeptide.
In some embodiments, the antibody produced using an engineered polypeptide as provided herein binds specifically to CD25. In certain embodiments, the antibody does not block binding of IL-2 with CD25 when the antibody is bound to CD25.
In some embodiments, the antibody is a non IL-2-blocking antibody (a non IL-2 blocker)—that is, the binding of the antibody to CD25 does not disrupt or prevent binding of the IL-2 ligand to CD25 (the IL-2 alpha chain), and does not affect IL-2 mediated signal transduction, e.g. signaling through the IL-2/JAK3/STAT-5 signaling pathway. In some embodiments, the antibody does not disrupt the binding of IL-2 ligand to CD25 (IL-2 alpha chain), and binds to a different epitope than where the 7G7B6 antibody binds. In some embodiments, the antibody does not disrupt the binding of the IL-2 ligand to CD25 (IL-2 alpha chain), but does disrupt the trimerization of the beta, gamma, and alpha (CD25) chains of the IL-2 receptor.
In some embodiments, the antibody is an IL-2 blocking antibody, e.g., the antibody disrupts or prevents binding of the IL-2 ligand to the alpha, beta, and/or gamma chains of the receptor, and decreases or inhibits IL-2 mediated signal transduction. In certain embodiments, the antibody disrupts or prevents binding of the IL-2 ligand to CD25. In some embodiments, the antibody disrupts or prevents the binding of the IL-2 ligand to CD25, and binds to a different epitope than to which either daclizumab or baciliximab bind.
In some embodiments, the CD25 antibody is a partially blocking antibody, and partially, but not completely, disrupts binding of the IL-2 ligand to the alpha, beta, and/or gamma chains of the IL-2 receptor (CD25), and/or partially, but not completely decreases IL-2 mediated signal transduction.
In some embodiments, the antibody disrupts or prevents heterotrimerization of the alpha, beta, and gamma IL-2 chains. In some embodiments, the antibody does not block binding of the IL-2 ligand with CD25, but does disrupt or prevent heterotrimerization of the alpha, beta, and gamma IL-2R chains. In certain embodiments, the antibody selectively binds to Treg cells. In other embodiments, the antibody selectively binds to Teff cells.
In still further embodiments, whether an antibody produced using an engineered polypeptide as provided herein blocks binding of CD25 with IL-2 is evaluated. In some embodiments, an antibody that does not block CD25 binding with IL-2 is selected. In other embodiments, an antibody that does block binding of CD25 with IL-2 is selected. Such blocking or non-blocking may be evaluated, for example, by coupling CD25 to a biosensor tip, and evaluating binding by the antibody in the presence and absence IL-2. In some embodiments, the an antibody is expressed with a 6×His tag that can be used with Ni-NTA in flow cytometry to evaluate binding of the antibody, and blocking or non-blocking of IL-2 binding to CD25. In certain embodiments, the binding of the antibody is evaluated at physiological pH (e.g., between about pH 7.3 and about pH 7.5, or about pH 7.4), and also at the pH of a tumor microenvironment (e.g., between about pH 6.4 and about pH 6.6, or about pH 6.5). In certain embodiments, the blocking/non-blocking activity is compared to the binding of an IL-2 blocker antibody (for example, daclizumab or bacliliximab). In certain embodiments, the blocking/non-blocking activity is compared to the binding of an IL-2 non-blocker antibody (for example, antibody 7G7B6). In certain embodiments, the blocking/non-blocking activity is compared to both an IL-2 blocking antibody and an IL-2 non-blocking antibody.
In some embodiments, the antibody is an agonist antibody to CD25. In other embodiments, the antibody is an antagonist antibody to CD25.
In some embodiments, the antibody binds to CD25 in the trans orientation. In other embodiments the antibody binds to CD25 in the cis orientation. In still further embodiments, the antibody is capable of binding to CD25 in either the cis or the trans configuration.
The antibody clone of origin can be identified by the ID shown, e.g. the Clone ID in Table 2. For example, the antibody may comprise the heavy chain complementary determining regions of antibody clone “YU389-A01” as presented in row 1 of Table 2.
In some embodiments, the antibody has a CDR-H1, CDR-H2, CDR-H3, CDR-L1, CDR-L2, and CDR-L3, each independently selected from those disclosed in Table 2.
In some embodiments, the CDR-H1 is selected from: GGTFSSYA (SEQ ID NO: 286), GGSISSGGYY (SEQ ID NO: 283), GFTFSSYG (SEQ ID NO: 275), GYTFTSYY (SEQ ID NO: 313), GYTFTSYG (SEQ ID NO: 312), GYTFTDYY (SEQ ID NO: 306), GGSISSGGYS (SEQ ID NO: 282), GGSISSSNW (SEQ ID NO: 284), GYSFTSYW (SEQ ID NO: 299), GFTFSNYG (SEQ ID NO: 271), GFTFSSSA (SEQ ID NO: 272), GFTFSSYW (SEQ ID NO: 277), GFIFSRHA (SEQ ID NO: 264), GYTFNNYG (SEQ ID NO: 302), GFTFSSYA (SEQ ID NO: 273), GYTFTTYA (SEQ ID NO: 314), GFTFNNAW (SEQ ID NO: 267), GFTFSSYE (SEQ ID NO: 274), GYSFTTYW (SEQ ID NO: 300), GYSFNTYW (SEQ ID NO: 297), GFTFRRYW (SEQ ID NO: 268), GYSFSTYW (SEQ ID NO: 298), GFAFSSYG (SEQ ID NO: 262), GYKFANYW (SEQ ID NO: 296), GYTFKNFG (SEQ ID NO: 301), GFTFSSYS (SEQ ID NO: 276), GDSISSSSYY (SEQ ID NO: 261), and GGSISRSNW (SEQ ID NO: 281);
In some embodiments, the CDR-H2 is selected from: IIPIFGTA (SEQ ID NO: 326), IIPIFGTA (SEQ ID NO: 326), IYYSGST (SEQ ID NO: 384), ISYDGSNK (SEQ ID NO: 362), INPSGGST (SEQ ID NO: 339), ISAYNGNT (SEQ ID NO: 346), IMPIFDTA (SEQ ID NO: 332), VDPEDGET (SEQ ID NO: 693), IYHSGST (SEQ ID NO: 365), IYPGDSDT (SEQ ID NO: 377), ISHDGHVK (SEQ ID NO: 349), IKQDGSEK (SEQ ID NO: 328), ISVYNGDI (SEQ ID NO: 359), INTNTGDP (SEQ ID NO: 344), IKSKTDGGTT (SEQ ID NO: 330), ISSSGSTI (SEQ ID NO: 351), ISSRGSTI (SEQ ID NO: 350), IYPSDSDT (SEQ ID NO: 383), ISGRKGNT (SEQ ID NO: 347), ISSSSSYI (SEQ ID NO: 352), INHSGST (SEQ ID NO: 333), IYHTGST (SEQ ID NO: 366), and ISYDGNNK (SEQ ID NO: 360);
In some embodiments, the CDR-H3 is selected from: AREMYYYYGMDV (SEQ ID NO: 103), AREMYYYYGMDV (SEQ ID NO: 103), ARGNLWSGYYF (SEQ ID NO: 111), AKELLEGAFDI (SEQ ID NO: 64), ARDRVTMVRGALAY (SEQ ID NO: 97), ARERSYYGMDV (SEQ ID NO: 105), ASWSERIGYQYGLDV (SEQ ID NO: 145), ARDILGLDY (SEQ ID NO: 81), ATEDTAMGGIDY (SEQ ID NO: 146), ATEGRYGMDV (SEQ ID NO: 147), AVEGGRAPGTYYYDSSGLAY (SEQ ID NO: 153), ARAGYYYGMDV (SEQ ID NO: 71), ARDLGTMVRGVIEPYYFDY (SEQ ID NO: 85), ARGVRGTGFDP (SEQ ID NO: 118), ARDRNGYFQH (SEQ ID NO: 94), AKDLLGELSFFDY (SEQ ID NO: 61), ARLENNWDYGGWFDP (SEQ ID NO: 127), ARDRSYYGMDV (SEQ ID NO: 96), ARDKGYYGMDV (SEQ ID NO: 83), AKEISPRSSVGWPLDY (SEQ ID NO: 63), ARDFWSGYNELGGMDV (SEQ ID NO: 76), ARTWFGEFFDY (SEQ ID NO: 134), ARVIGGWFDP (SEQ ID NO: 136), ARGRLAYGDTEGFDY (SEQ ID NO: 112), ARDILRGESSILDH (SEQ ID NO: 82), ARDRYYYGMDV (SEQ ID NO: 98), ARDLLGSGYDIIDY (SEQ ID NO: 86), ARVWGKNGDFDY (SEQ ID NO: 142), ARDRFHYGMDV (SEQ ID NO: 90), ARDRGDY (SEQ ID NO: 92), TTEGVELLSFGGAPFDY (SEQ ID NO: 683), ARRRGGGFDY (SEQ ID NO: 132), AREKGSWFDP (SEQ ID NO: 102), ARDRGDRVGGLVFDY (SEQ ID NO: 91), ARQVAGGLDY (SEQ ID NO: 131), ARDRGYYGMDV (SEQ ID NO: 93), FRFGEGFDY (SEQ ID NO: 259), ARDGGYYFDD (SEQ ID NO: 79), ARDFRMDV (SEQ ID NO: 75), ARDAYAYGLDV (SEQ ID NO: 73), ARDLMNYGMDV (SEQ ID NO: 87), AREYDYGDYVFDY (SEQ ID NO: 107), ARLENNWNYGGWFDP (SEQ ID NO: 128), ARDYYYYGMDV (SEQ ID NO: 101), ARDIGYYYGMDV (SEQ ID NO: 80), ARVGDGYSLDY (SEQ ID NO: 135), AKAITSIEPY (SEQ ID NO: 60), AKGQGDGMDV (SEQ ID NO: 66), ARLGWGMDV (SEQ ID NO: 129), ARVWGDTTLGYGMDV (SEQ ID NO: 141), AIPWDAELGNYGMDV (SEQ ID NO: 59), ARGRWSGLGDY (SEQ ID NO: 113), ARARGGRYFDY (SEQ ID NO: 72), ARDQLAARRGYYYGMDV (SEQ ID NO: 89), AKGDVNYGMDV (SEQ ID NO: 65), ARDFYYGSGSYPNGYYYGMDV (SEQ ID NO: 77), ARDFNPFSITIFEMDV (SEQ ID NO: 74), ANLAMGQYFDY (SEQ ID NO: 70), ARDLGEAKSSSPHEPDY (SEQ ID NO: 84), ARDQEMYYFDY (SEQ ID NO: 88), ARGKGSYAFDI (SEQ ID NO: 110), and AKGYSSSPGDY (SEQ ID NO: 67);
In some embodiments, the CDR-L1 is selected from: QSISSY (SEQ ID NO: 567), QSISSY (SEQ ID NO: 567), SSNIGNNF (SEQ ID NO: 650), QSISNY (SEQ ID NO: 563), NIETKS (SEQ ID NO: 455), KLGDKY (SEQ ID NO: 404), QSVSNY (SEQ ID NO: 574), QTISQW (SEQ ID NO: 582), SSNIGSNY (SEQ ID NO: 655), NFNIGNNL (SEQ ID NO: 453), RNIWSY (SEQ ID NO: 634), QSISSW (SEQ ID NO: 566), QSVSSR (SEQ ID NO: 576), QTISGL (SEQ ID NO: 581), DIESEM (SEQ ID NO: 163), NIGSKS (SEQ ID NO: 456), QSIGNY (SEQ ID NO: 558), QGISSW (SEQ ID NO: 489), QSVSSTY (SEQ ID NO: 577), QDISNY (SEQ ID NO: 485), NIESES (SEQ ID NO: 454), SSDVGAYNY (SEQ ID NO: 647), QDINNY (SEQ ID NO: 482), QGISNS (SEQ ID NO: 488), SSNIGNNY (SEQ ID NO: 651), EGIRTS (SEQ ID NO: 218), QGTSSW (SEQ ID NO: 491), SSDVGGYNY (SEQ ID NO: 649), QSVSNNY (SEQ ID NO: 573), QGINSY (SEQ ID NO: 487), QAVRID (SEQ ID NO: 472), QSISRY (SEQ ID NO: 564), QSIGYW (SEQ ID NO: 559), SSNVGSNY (SEQ ID NO: 656), QSIKNY (SEQ ID NO: 561), QDIKRR (SEQ ID NO: 480), SGSIASSY (SEQ ID NO: 640), NSNVGNNY (SEQ ID NO: 465), SLRSYY (SEQ ID NO: 643), KLGERF (SEQ ID NO: 405), SGSVSTSYY (SEQ ID NO: 641), SSNIGRNY (SEQ ID NO: 652), EDIRMY (SEQ ID NO: 215), QGISTY (SEQ ID NO: 490), SSNVGSRT (SEQ ID NO: 657), NIGTKS (SEQ ID NO: 459), NIGSKT (SEQ ID NO: 457), QSINSY (SEQ ID NO: 562), SSNIGSNT (SEQ ID NO: 654), QSIITY (SEQ ID NO: 560), QSLLHSDGKTY (SEQ ID NO: 570), and GGNIARNY (SEQ ID NO: 279).
In some embodiments, the CDR-L2 is selected from: AAS (SEQ ID NO: 49), AAS (SEQ ID NO: 49), DST (SEQ ID NO: 191), DDD (SEQ ID NO: 158), KDN (SEQ ID NO: 386), GAS (SEQ ID NO: 260), KAS (SEQ ID NO: 385), RNN (SEQ ID NO: 635), SNN (SEQ ID NO: 644), AND (SEQ ID NO: 69), DAF (SEQ ID NO: 156), DDS (SEQ ID NO: 159), AAT (SEQ ID NO: 50), AVS (SEQ ID NO: 154), DAS (SEQ ID NO: 157), GVS (SEQ ID NO: 295), DNN (SEQ ID NO: 190), DVS (SEQ ID NO: 201), RAS (SEQ ID NO: 619), GTS (SEQ ID NO: 291), EDN (SEQ ID NO: 216), DND (SEQ ID NO: 188), GKN (SEQ ID NO: 287), QYI (SEQ ID NO: 616), NTD (SEQ ID NO: 467), RNH (SEQ ID NO: 633), EGS (SEQ ID NO: 219), DGR (SEQ ID NO: 162), TAS (SEQ ID NO: 667), DDT (SEQ ID NO: 160), EVS (SEQ ID NO: 237), and EDD (SEQ ID NO: 214).
In some embodiments, the CDR-L3 is selected from: QQSYSTPPT (SEQ ID NO: 534), QQSYSTPPT (SEQ ID NO: 534), GSWDTNLSGYV (SEQ ID NO: 289), QVWDSSSGHREV (SEQ ID NO: 613), QAWDSSTYV (SEQ ID NO: 473), QQYNHWPPL (SEQ ID NO: 544), QQYSGDSMYT (SEQ ID NO: 553), AAWDDSLSGVV (SEQ ID NO: 56), AAWDDSLNGVV (SEQ ID NO: 53), ATWDDSLSGVV (SEQ ID NO: 151), QQSHSTPIT (SEQ ID NO: 526), QQYNSYSRT (SEQ ID NO: 551), QQYTNWPQT (SEQ ID NO: 554), LQYDRYSGA (SEQ ID NO: 426), QVWHTTNDHVL (SEQ ID NO: 615), QVWDSSSDHWV (SEQ ID NO: 611), QQSKQIPYT (SEQ ID NO: 528), QQSYSLPLT (SEQ ID NO: 531), QQFDISGGLI (SEQ ID NO: 518), QQYDNLPLT (SEQ ID NO: 542), QVWDSSSDHTVA (SEQ ID NO: 609), SSYTTTDTFV (SEQ ID NO: 665), QQYDNLPYT (SEQ ID NO: 543), QQYYSTPPH (SEQ ID NO: 556), QQSYSTPLT (SEQ ID NO: 532), QVWDSSSDHVV (SEQ ID NO: 610), GTWDSSLSAYV (SEQ ID NO: 294), QQTHTWPWT (SEQ ID NO: 538), QQANSFPLT (SEQ ID NO: 514), QQSYSTPYT (SEQ ID NO: 536), SSYTSSSTYV (SEQ ID NO: 664), QRYGSSPR (SEQ ID NO: 557), QQVHSFPFT (SEQ ID NO: 541), LQHNTFPYT (SEQ ID NO: 423), QQSHSTPLT (SEQ ID NO: 527), QQYNSYPFT (SEQ ID NO: 548), QQYNSSPLMYT (SEQ ID NO: 547), QQTYSTPLT (SEQ ID NO: 540), QQANTFPQT (SEQ ID NO: 516), QSYDGSSVV (SEQ ID NO: 579), GSWEARESVFV (SEQ ID NO: 290), QQTYNDPPT (SEQ ID NO: 539), NSRDSSGNHVV (SEQ ID NO: 466), QTWDGSIVV (SEQ ID NO: 583), VLYMGSGIWV (SEQ ID NO: 694), ATWDDALSGWV (SEQ ID NO: 149), SSYTSSSTLVV (SEQ ID NO: 660), QQSYSTPWT (SEQ ID NO: 535), SSYTSSSTWV (SEQ ID NO: 663), LQDYNYPPA (SEQ ID NO: 422), QQYYDDPQ (SEQ ID NO: 555), QQLNGYPTT (SEQ ID NO: 525), AAWDDSLIGHV (SEQ ID NO: 51), QVWDTSGDLHWA (SEQ ID NO: 614), QQSYTTPLT (SEQ ID NO: 537), QVWDSSSDLLWV (SEQ ID NO: 612), GTWDSSLSALV (SEQ ID NO: 292), AAWDDSLNGPV (SEQ ID NO: 52), MQTKQLPLT (SEQ ID NO: 443), QQANSFPPT (SEQ ID NO: 515), QSYDGNNHMV (SEQ ID NO: 578), and SSYTSSSTLWV (SEQ ID NO: 661).
In some embodiments, the antibody has a CDR-H1, CDR-H2, CDR-H3, CDR-L1, CDR-L2, and CDR-L3, each independently selected from those disclosed in Table 3 Å and Table 3B. It is possible to combine the CDRs from different antibodies in any combination to generate new antibodies. Gene synthesis and high-throughput screening technologies enable the skilled person to test all combinations of six CDRs without undue experimentation.
In some embodiments, the antibody has the six CDRs of any one of the combinations provided in Table 4.
In some embodiments, the antibody is an scFv selected from Table 5, or any antibody having an antigen-binding domain derived from the scFv's in Table 5. In embodiments, the full length heavy chain and light chain variable regions are extracted from the scFv sequences in Table 5 and used to generate soluble Fab fragments, monoclonal antibodies, bispecific antibodies, or any other type of antibody known in the art. Where an scFv in Table 5 is a VH:VL scFv, it is possible to reverse the order of the heavy and light chains to generate a VL:VH scFv. Where an scFv in Table 5 is a VL: VH scFv, it is possible to reverse the order of the heavy and light chains to generate a VH: VL scFv.
In some embodiments, the antibody has a CDR-H1, CDR-H2, CDR-H3, CDR-L1, CDR-L2, and CDR-L3, each independently selected from those disclosed in Table 14 Å and Table 14B. In some embodiments, the antibody has a CDR-H1, CDR-H2, CDR-H3, CDR-L1, CDR-L2, and CDR-L3, each independently selected from any one clone listed in Table 14 Å and Table 14B. In some embodiments, the antibody has a CDR-H1, CDR-H2, CDR-H3, CDR-L1, CDR-L2, and CDR-L3, each independently selected from those disclosed, in groups, in Table 15 Å and Table 15B. The disclosure provides antibodies having CDRs from individual clones or from matching any one CDR with any other five CDRs. The antibodies identified in Table 14 Å and Table 14B are derived from mouse phage-display library. Known methods may be used to convert these CDRs into humanized or chimeric antibodies.
In some embodiments, the CD25 antibodies provided herein are useful for therapeutics, e.g. for use in proliferative diseases or disorders such as cancer or for use in autoimmune diseases.
Accordingly provided herein are methods of treating a cancer comprising administering to a subject in need thereof a therapeutically effective amount of a therapeutic CD25 antibody. In some embodiments, the cancer is a primary cancer. In some embodiments, the cancer is a metastatic cancer. In some embodiments, the cancer involves a solid tumor; in other embodiments, the cancer involves a liquid tumor, e.g. a blood based cancer. In exemplary embodiments, the CD25 antibody is a non-IL-2 blocking antibody.
Accordingly provided herein are methods of treating an autoimmune-related disease or disorder comprising administering to a subject in need thereof a therapeutically effective amount of a therapeutic CD25 antibody. In exemplary embodiments, the CD25 antibody is an non IL-2 blocking antibody.
As used herein, a subject refers to any animal classified as a mammal, including humans, domestic and farm animals, and zoo, sport, or pet animals, such as dogs, horses, rabbits, cattle, pigs, hamsters, gerbils, mice, ferrets, rats, cats, and the like. Subjects may be male or female.
The administration of any of the therapeutic CD25 antibodies provided herein may be administered in combination with other known drugs/treatments (e.g. small molecule drugs, or biologics. The administration may be sequential or concurrent.
In vivo administration of the therapeutic CD25 antibodies described herein may be carried out intravenously, intratumorally, intracranially, intralesionally (e.g. intralesional injection, direct contact diffusion), intracavitary (intraperitoneal, intralpleural, intrauterine, intrarectal), intraperitoneally, intramuscularly, subcutaneously, topically, orally, transdermally, by implantation, by inhalation, intrathecally, intraventricularly, or intranasally. In an exemplary embodiment, the route of administration is by intravenous injection.
A therapeutically effective amount of the therapeutic antibody generally will be administered. The appropriate dosage of the therapeutic antibody may be determined based on the severity of the disease, the clinical condition of the subject, the subject's clinical history and response to the treatment, and the discretion of the attending physician.
The CD25 antibodies provided herein may be used for diagnostic and detection purposes. Depending on the application, the CD25 antibody may be detected and quantified in vivo or in vitro.
The CD25 antibodies provided herein are amendable for use in a variety of immunoassays. These immunoassays include, but are not limited to enzyme-linked immunosorbent assay (ELISA), Western blot, radioimmunoassay (RIA), flow cytometry, a radioimmunoassay, an immunofluorescence assay, spectrophotometry, radiography, electrophoresis, high performance liquid chromatography (HPLC), or thin layer chromatography (TLC).
The CD25 antibodies provided herein may be comprise a detectable label, for example detectable by spectroscopic, photochemical, biochemical, immunochemical, fluorescent, electrical, optical or chemical methods. Useful labels in the present invention include, but are not limited to fluorescent dyes, radiolabels, enzymes, colorimetric lables, avidin or biotin.
In some embodiments, the CD25 antibody is radiolabeled with an isotope, useful for imaging by nuclear medicine equipment (SPECT, PET, or scintigraphy).
The disclosure provides compositions comprising therapeutic CD25 antibodies, In some embodiments the composition is sterile. The pharmaceutical compositions generally comprise an effective amount of the therapeutic antibody in a pharmaceutically acceptable excipient.
The disclosure also provides for kits comprising any of the CD25 antibodies described herein, e.g. for either therapeutic or diagnostic uses. In some embodiments, the kits further contain a component selected from any of secondary antibodies, reagents for immunohistochemistry analysis, pharmaceutically acceptable excipient and instruction manual and any combination thereof. In some embodiments, the kit comprises any one or more of the therapeutic compositions described herein, with one or more pharmaceutically acceptable excipients.
The present application also provides articles of manufacture comprising any one of the therapeutic or diagnostic compositions or kits described herein. Examples of an article of manufacture include vials (e.g. sealed vials).
The description provided herein sets forth numerous exemplary configurations, methods, parameters, and the like. It should be recognized, however, that such description is not intended as a limitation on the scope of the present disclosure, but is instead provided as a description of exemplary embodiments.
The following Examples are merely illustrative and are not meant to limit any aspects of the present disclosure in any way.
A crystal structure of CD25 was obtained. A number of the crystal structures available for CD25 are missing a mobile loop section of the protein. Molecular dynamics simulations were performed to gain a greater understanding of this mobile loop, and binding interactions of CD25 with IL-2.
Different sections of CD25 were selected as inputs for developing an engineered immunogen. Some of these areas are shown in
Quantitative metrics for ranking the different designs using molecular dynamics (MD) simulation included similarity to CD25, evaluated through RMSD; and structural flexibility of the candidates.
Structural similarity was calculated using root mean square deviation (RMSD) between the atomic coordinates of each peptide conformation in the MD ensemble and the reference structure after RMS alignment to the reference structure. The RMSD was computed using the computationally designed engineered immunogen candidate structure as the reference structure or by using the experimentally characterized (e.g., X-ray crystal structure) structure as the reference. In these simulations, the functional interface residues of the candidate (in some simulations) and all residues including the structural residues of the candidate (in other simulations) were compared to the reference.
The ensemble of conformations sampled by MD were clustered into groups (clusters) structurally similar to each other based on RMSD. Disorder was evaluated as the fraction of the conformations in the MD ensemble that could not be grouped into a cluster of similar conformations due to structural dissimilarity (e.g., high RMSD) to all other conformations in the ensemble. Thus, an engineered immunogen candidate with more disorder than an alternative candidate was more flexible. Order was evaluated as the fraction of the conformations in the MD ensemble that were grouped into a cluster of similar conformations (low RMSD). An engineered immunogen candidate with more order than an alternate candidate was less flexible when a higher fraction of its ensemble of conformations fell into fewer clusters than the alternate candidate.
The clusters populated by an engineered immunogen candidate were compared with a reference structure using RMSD. If the RMSD of a cluster was below a threshold value of 4 Angstroms, the cluster was considered ordered (e.g., low flexibility) and similar to the reference (structural similarity). An engineered immunogen candidate with a high fraction of its ensemble meeting this criterion of low flexibility and high structural similarity is predicted to be more active than an alternate candidate with a low fraction of its ensemble meeting this criterion of low flexibility and high structural similarity.
This quantitative analysis was combined with qualitative analysis of the MD trajectories regarding biophysical, biological and physical-chemical interactions, and used to select given immunogen candidates to evaluate in vitro. Table 6 below lists eleven engineered immunogens prepared as described above.
The binding of the engineered immunogens prepared in Example 1 are evaluated using an antibody to CD25. The engineered immunogens are modified on the C-terminus with a -GSGSGK-biotin group (SEQ ID NO: 846), and then bound separately to a streptavidin-coated biosensor tip. Buffer containing the CD25 antibody is flowed over the tip during an association phase of 300 seconds, and then the flowed solution is switched to buffer without the CD25 antibody and the dissociation from the biosensor tip will be measured. A control is also run where the tip does not have any engineered immunogen or protein initially bound, to evaluate any background binding of the CD25 antibody to the tip. A second control is performed where full length CD25 is biotinylated and bound to the biosensor tip, to demonstrate the binding level of the CD25 antibody to full length CD25. The data obtained from these biosensor experiments is used to qualitatively rank binding of the engineered immunogens.
Engineered immunogens provided herein are evaluated using phage panning techniques.
Mouse HuCD25 immunized phage libraries are transformed by electroporation in TG1 and phage propagated with the addition of CM13 using standard Phage Display protocols. TG1 cultures secreting phage are PEG precipitated with PEG/NaCl after incubation on ice for one hour. Exemplary libraries that may be used include 7807, 7808, 7809, and 7810.
Tumor microenvironment (TME) pH subtractive selections: Phage panning is carried out physiological pH and TME pH. To deplete antibodies that bind with high affinity to full-length CD25 at physiological pH, subtractive panning is first carried out by counter-selection of 3×10{circumflex over ( )}11 pfu phage (1000-fold representation of a 3×10{circumflex over ( )}8) at pH 7.4 by absorption for 1 hour on ELISA plates coated with 10 ug/ml full-length CD25 (400 nM) in PBST pH 7.4. Resulting phage supernatant is collected and pH is adjusted to pH 6.5 with PBST. Subsequent phage panning selections are carried out at pH 6.5.
Panning selections are pre-cleared with 25 microliters streptavidin Dynabeads with no antigen after a one hour incubation. Phage are then added to new pre-blocked Eppendorf LoBind tube. Biotinylated engineered immunogens (such as those described in Example 1) are added at 100 nM concentration (in some cases, additional 500 mMNaCl was added to reduce non-specific binding of immunogen to phage) for 40 min to one hour. Samples are then incubated with 25 microliters streptavidin beads or streptavidin coated plates at RT for one hour. Samples are pelleted and washed using magnet/magnetic beads or with plates, washed 7-10 times with PBST. Tubes are changed twice to remove residual phage.
To elute phage, 50-800 μL glycine pH 2.2 is added to the beads and plates, respectively, and incubated for no more than ten minutes, then neutralized with high pH Tris 9.0. Eluted phage is added to 1-5 ml TG1 freshly grown (OD600˜0.5), and incubated for 20-30 minutes.
Fractional log dilution series are plated, and the remainder transferred to 25 ml 2×YT. 1 ml glycerol stock is saved for a subsequent panning round, and helper phage/IPTG added at OD600˜0.5.
The selection against the engineered immunogen at pH 6.5 with counter-selection at pH 7.4 is carried out once more. The periplasmic extracts are subsequently evaluated using phage ELISA and octet screening.
To ensure that fab phage also bind full-length CD25 in addition to the engineered immunogen, a final selection with full-length CD25 can optionally be carried out, with full-length CD25 in place of the engineered immunogen (2 rounds selection against engineered immunogen, then 1 round selection against full length CD25).
To carry out selection with full-length CD25, after preclearing the panning selections with 25 microliters streptavidin Dynabeads, and adding phage to new pre-blocked Eppendorf LoBind tubes, biotinylated full-length CD25 is added at 100 nm concentration for one hour. The samples are then incubated with 25 microliters streptavidin beads at RT for one hour. Pelleting, washing, and elution steps are followed as described above.
ELISA Extract Preparation: Phage ELISA and periplasmic extract preparation for Fab Octet screening are conducted.
The CD25 antigen is diluted, added to the ELISA plate wells, and incubated. Following the incubation, wells are washed twice with PBS, then blocked by adding BSA followed by incubation for 2 hours at 25° C. Phage are diluted two-fold in 1×PBST 1.0% BSA, pH 6.5, 50 microliters are added per and incubated for 5 minutes at room temperature. The blocking solution is shaken out of the wells, and 50 μL of the dilute phage preparation is added to each well, and incubated for 1 hour at room temperature. The ELISA plate wells are washed 3-5 times with 200 microliters PBST pH 6.5. HRP-conjugated anti-M13 antibodies are diluted (Abcam, ab50370) 1:5000 with 1×PBST 1.0% BSA pH 6.5. 50 microliters of diluted secondary antibody conjugate is added to each well, and incubated for 1 hour at room temperature. ELISA plate wells are washed 3-5 times with 200 microliters PBST pH 6.5. The ECL Lumo substrate is prepared (e.g. Supersignal ELISA Pico Chemiluminescent Substrate) as described, into a 1:1 mixture. 50 microliters substrate solution is added to each well, incubated at room temperature for 5 to 60 minutes before reading.
Colonies are inoculated in 0.03-4 ml 2×YT 0.2% Glucose with 0.1 ml overnight culture (1 ml cultures in 96-well plate or 4 ml cultures in 14-ml falcon tubes). They are incubated at 250-700 rpm at 37° C. until the OD600˜0.5-1.0. Cultures are induced with 50-400 μL 0.025-0.1M IPTG. In some cases, the temperature is reduced to 30° C. with shaking at 250 rpm. They are then incubated overnight. 1-4 ml cultures are harvested by pelleting 3400 rcf for 10-15 minutes. The supernatant is discarded. Cultures are resuspended with 50-75 μL PPB buffer (30 mM Tris-HCl, pH 8.0, 1 mM EDTA, 20% Sucrose) with 1× Halt Protease Inhibitor and incubated on a rocking platform for 15 minutes at room temperature or 4° C. for 10 min. Then, cultures are resuspended with 150-225 μL of cold ddH20 with 1× Halt Protease Inhibitor and incubated on a rocking platform for one hour at room temperature or 4° C. for 1-2 hours. The lysate suspension was spun at 15000 rcf for 10-15 min at 4° C. Supernatant is collected and diluted.
Fab Expression and Purification Protocol: Cell cultures are inoculated, grown up overnight, and then induced with 50 μL of 25 mM-1M IPTG. The temperature was reduced to 30° C. and rpm to 150. Incubation was done overnight. 50 ml cultures or plates were harvested by pelleting 3400 rcf for 15 minutes. The supernatant was discarded. Cell pellets from 50 mL cultures were placed in a −80° C. freezer for 1 hour, while cultures grown in plates had 75 μL of PPB added with 1× Halt protease inhibitor, EDTA-free (Thermo Fisher Scientific) and vortexed. Plates are shaken at 4° C. for 10 minutes at 1000 rpm. The volume of 225 uL of cold water with 1× Halt protease inhibitor, EDTA-free (Thermo Fisher Scientific) is added to each well. Samples were mixed and shaken at 4° C. for 1-2 hours at max speed i.e. 1000 rpm. Plates are spun at 3500 rpm for 10 mins at 4° C. The supernatant (PPE) is transferred to fresh plates and stored at −20° C. Cell pellets from the 50 mL cultures are removed from the freezer and 5 ml PBS, 10 mM Imidazole is added with 2.5 mg/ml lysozyme and 1× Halt protease inhibitor, EDTA-free (Thermo Fisher Scientific). After pellets are thawed at room temperature for 30 minutes and lysates were centrifuged for 15 minutes at 3400 rcf The supernatant is removed and pellet discarded. 500 μL Ni-NTA resin was added (pre-washed and pelleted) or a Ni-NTA spin column was used for Fab purification. Incubate with cleared lysate for 30 min-1 hr. This was spun at 1500 rcf. These were washed 5 times with 1 ml PBS, 10 mM Imidazole. Buffer was discarded after each spin. 1 ml PBS, 200 mM Imidazole were added and mixed, incubated for 30 minutes and spun at 1500 rcf for 15 minutes. The eluted protein was stored at 4° C. or 20° C. after determining protein concentration. Zeba columns were used for desalting/buffer exchange.
OctetBiosensor Screening: For Octet Koff rate screening in raw supernatants, 50 μL of lysate is used in 384-well Pall ForteBio Octet plates. Data is collected on an Octet RED 384 (MD ForteBio). Briefly, Human CD25 is coupled to AR2G tips (1 ug/ml). For data collection, baseline is assessed in PBST 1% BSA buffer for 60 seconds. Tips are then moved to 50 μL lysate and association measured for 300 seconds. Finally, tips are moved to PBST 1% BSA buffer. Tips are then regenerated with 200 mM Tris-Glycine, pH 2.5 and neutralized with PBST, 1% BSA. For data analysis, double referencing (no CD25 on tip as well as blank reference well) is performed on Octet HT 11.0 software for reference subtraction.
Antibodies are produced by immunizing mice with the engineered immunogens described herein. These antibodies are evaluated for cross-reactivity, cross-blocking, affinity, and off-rate estimation.
Protocol for cross-reactivity determination by Biosensor (Octet Red 384, Pall Forte Bio): This protocol is used to determine the ability of individual test clones (anti-human CD25 mouse monoclonals) to bind the target (antigen) from human, cynomologous monkey, and mouse species. The target proteins are either covalently coupled via primary amines to dextran coated sensor tips or by affinity capturing the 6×-His tagged target proteins on anti-6×-His monoclonal antibody coated sensor tip. The monoclonal supernatants, in solution, are made to bind to the antigen on the biosensor tip. The net binding signal is the binding signal with the subtraction of corresponding signal with blank media or buffer binding to blank or antigen coated tips. A signal >3× background binding is considered as real binding event.
Protocol for cross-blocking by Biosensor: This method is to determine if the individual test clones (anti-human CD25 mouse monoclonals) are able to cross-block control antibodies. Cross-blocking may indicate that the test clones recognize an epitope that overlaps with the corresponding epitopes of the control antibodies. Additionally, this might imply that the test antibodies could have similar functional properties as the control antibodies. For this protocol, the control antibodies are covalently coupled via primary amines to dextran coated sensor tips. The target antigens, in solution, are made to bind to the control antibodies. Following this step, the test antibody, in solution, is made to bind to the antigen in a sandwich format. If the test antibody can bind to the antigen, it indicates that it does not cross-block the control antibody, while a non-binding may be interpreted as an ability to cross-block the control antibody.
Protocol for affinity determination by Biosensor: This method is used to determine the affinities of the individual test clones with antigens, when the concentration of the test antibodies is known. A capture molecule, such as Protein G or anti-mouse IgG-monoclonal or anti-human IgG-monoclonal is coated on the biosensor tip. Test clones are captured on the capture molecule coated surface. To these test clones, antigens in solution are made to associate and dissociate for time periods ranging from 60 to 600 seconds for association phase and 60 to 1800 seconds for dissociation phase. The result data (or ‘sensograms’) are then fit using either a 1:1 Langmuir model or 2:1 heterogeneous model. The former assumes that the interacting pairs are homogenous if a 2:1 model for fitting the data results in a better fit, it indicates that the clones require further sub-cloning due to inherent heterogeneity. The data curve fits provide the dissociation constant as a ratio of the on and off-rate constants
Protocol for off-rate estimation by Biosensor: This method is used to estimate the dissociation rate constant of test clones when the concentration of antibodies is not known or if the test clones require further subcloning. A capture molecule, such as Protein G or anti-mouse IgG-monoclonal or anti-human IgG-monoclonal is coated on the biosensor tip. Test clones are captured on the capture molecule coated surface. To these test clones, antigens in solution are made to associate and dissociate for time periods ranging from 60 to 600 seconds for association phase and 60 to 1800 seconds for dissociation phase. The result data (or ‘sensograms’) are then fit using either a 1:1 Langmuir model or 2:1 heterogeneous model. The former assumes that the interacting pairs are homogenous if a 2:1 model for fitting the data results in a better fit, it indicates that the clones require further sub-cloning due to inherent heterogeneity. The data is fit only for the off-rate constant and not the on-rate (or association) rate constant. This provides the estimates of off-rate constant, which can be used to rank-order the test clones.
The sequence and three dimensional (3D) structure of CD25 was retrieved from the protein databank (PDB) (PDB ID NO: 2ERJ, chain Δ):
As shown in
Atomic distance and amino acid descriptor topology were determined. The atomic distance and amino acid descriptor topology of the reference target were obtained using dynamic simulations, and a covariance matrix of atomic fluctuations was generated for the epitope in the reference target. Next, different engineered polypeptide candidates were generated using computational protein design (e.g., Rosetta), dynamics simulations performed on the candidates, and the atomic distance and amino acid descriptor topologies determined. A covariance matrix of atomic fluctuations was generated for the reference target epitope, and for the residues in the candidates corresponding to the residues in the epitope of the reference target.
Principal component analysis was performed to compute the eigenvectors and eigenvalues for each covariance matrix—one covariance matrix for each reference target and one covariance for each of the candidates—and only those eigenvectors with the largest eigenvalues are retained. Eigenvectors describe the most, second-most, third-most, N-most dominant motion observed in a set of simulated molecular structures. If a candidate moves like the reference epitope, its eigenvectors will be similar to the eigenvectors of the reference target (epitope). The similarity of eigenvectors corresponds to their components (a 3D vector centered on each CA atom) being aligned—pointing in the same direction. This similarity between candidates and reference target eigenvectors was computed using the inner product of two eigenvectors. The inner product value was 0 if two eigenvectors are 90 degrees to each other or 1 if the two eigenvectors point precisely in the same direction.
Since the ordering of eigenvectors is based on their eigenvalues, and eigenvalues may not necessarily be the same between two different molecules due to the stochastic nature by which molecular dynamics simulations sample the underlying energy landscape of those different molecules, the inner product between multiple, differentially ranked eigenvectors was needed (e.g., eigenvector 1 of the candidate by eigenvector 2, 3, 4, etc. of the reference target). In addition, without wishing to be bound by any theory, molecular motions are complex and may involve more than one (or more than a few) dominant/principal modes of motion.
To solve these two challenges, the inner product between all pairs of eigenvectors in the candidates and the reference target were computed. This resulted in a matrix of inner products the dimensions of which were determined by the number of eigenvectors analyzed—for 10 eigenvectors, the matrix of inner products is 10 by 10. This matrix of inner products was distilled into a single value by computing the root mean-square value of the inner products. This is the root mean square inner product (RMSIP).
Principal component analysis (PCA) reduces the 3 L×3 L dimensional coordinate covariance matrices (L being number of atoms) into sets of eigenvectors, Φ (reference target) and Ψ (MEM), and eigenvalues, Λ. The set Φ contains N eigenvectors pi for the reference target and the set Ψ contains N eigenvectors ψj for the MEM, where eigenvectors are ordered in their respective sets by their associated eigenvalues. The eigenvector with the largest eigenvalue accounts for the largest fraction of total coordinate covariation. The inner product of each φi and ψj eigenvector is computed to compare the similarity of motion between the reference target and the MEM. The root mean square of all inner product combinations of φi and ψj eigenvectors renders the total similarity of motion of the engineered polypeptide candidate (MEM) to the reference target (RMSIP).
As shown in
MVY
CQPDCTAKCMHGCDRDTMKECCDRLK (SEQ ID NO: 2)
DD
CPEVPHATFKGPGQKWEGPGGGDCSK (SEQ ID NO: 3)
DD
CIEVPGPAECAERACRAQEERQRQPQCI (SEQ ID NO: 4)
Thirty-two different panning strategies (S1-S32) were devised, each comprising three rounds, of positive selection (Table 10). Each program used at least one engineered polypeptide as a selection molecule. A conventional selection was also included using conventional methods (CD25 as the positive target). Bovine serum albumin (BSA) was used a negative target selecting against non-specific binding.
The panning protocol began with a human naïve scFv library, and panning was performed in solution, with the selection molecules bound to biotin (but still in solution). For each round, the starting pool was combined with the negative selection molecule (BSA) first in solution, and then a streptavidin-coated substrate (e.g., magnetic beads) was applied to the mixture to bind the negative selection molecules. Thus, any phage in the pool that was bound to the negative selection molecule was also bound to the streptavidin-coated support. The remaining solution was removed, and this flow through was then taken on to the positive selection step. The flow through was combined with positive selection molecule (antigen 1), allowed to bind, and then a streptavidin-coated solid substrate applied to the mixture. In this step, the bound phage were retained while the remaining unbound phage were removed. Then the bound phage were eluted. E. coli were transfected with the eluted phage using a 30 minute cultivation, the transfected cells were split for next-generation sequencing and DNA isolation for analysis, and then the phage amplified for use in the subsequent panning round. For each panning program, in each round negative selection was performed first, and positive selection second.
384 clones for each strategy were selected for ELISA response analysis to full-length CD25 after three rounds of panning (
Of the hits 1475 were selected for further characterization because they met one of two criteria in ELISA: 1)>10:1 signal to noise (s/n) in full-length CD25 ELISA; or 2)>3:1 s/n in MEM ELISA and >5:1 s/n in CD25 ELISA.
The affinity of the different scFv antibodies were evaluated on a ForteBio® Octet RED384™ biolayer interferometry instrument, using a single-cycle kinetics assay design. His-tagged scFv's were immobilized on anti-his biosensor (Fortebio® HIS1K). Full-length CD25 analyte was washed over the sensor tip and the binding of the molecules in the analyte to the scFv's recorded. Each assay was run in duplicate. Controls were also run, using just a buffer (to control for sensor drift) and a separate control of purified polyclonal IgG isotype antibodies from human serum (to control for non-specific IgG binding).
As shown in
The CD25 specificity the different scFv antibodies were evaluated on flow cytometer using cells that express CD25 [CD25(+)] or do not express CD25 [CD25(−)]. As shown in
Next generation sequencing was performed on each round of panned phage. As shown in
Individual scFv's were sequenced using Sanger sequencing methods. Complete protein sequences for each scFv are provided in Table 5. Immunoglobulin gene usage and complementarity determining regions are provided in Table 12 and Table 2, respectively.
Analysis of the CDRs and germline usages suggest that the 1475 scFv's sequenced represent at least 126 distinct clones. The set includes 40 different VH+JH frameworks choices, and 35 VL+JL framework choices. Unique CDR sequences include:
Sequence analysis applied to scFv's directed to individual target epitopes identifies common CDR usage patterns within each set of antibody:
For CD25 epitope 1 (55-63), CDRs used include:
For CD25 epitope 2 (13-20:127-132), CDRs used include:
For CD25 epitope 3 (5-17), CDRs used include:
For CD25 epitope 4 (5-11:156-163), CDRs used include:
For CD25 epitope 5 (77-89), CDRs used include:
For CD25 epitope 6 (147-157), CDRs used include:
For CD25 epitope 7 (11-14), CDRs used include:
For CD25 epitope 8 (44-56), CDRs used include:
126 anti-CD25 clones were subjected to epitope resolution with a four-target competitive binding assay, as depicted in
Cross-competition assays were performed in the classical sandwich format, involves immobilizing the first antibody onto the biosensor, followed by incubation with the antigen, and then the second sandwiching antibody. His-tagged scFv were expressed and purified in situ on the biosensor using His-tag capture from supernatant. Biosensor His-tag capture was normalized across scFv clones by monitoring the tip loading response to a consistent level across all scFv measurements. The scFv's were each individually captured to an anti-His biosensor (Fortebio HIS1K). A baseline measurement was taken in running buffer. Then CD25 was captured to the antibodies. Finally each of various competitive analytes were added, including IL-2, 7G7B6, Basiliximab, or Daclizumab. The competitive analyte can bind the captured CD25 only if competitive analyte's binding epitope does not overlap that of immobilized scFv.
As shown in
As shown in
As shown in
As shown in
As shown in
As shown in
Alanine mutations were designed to confirm or reject that MEM-steered clones bin the intended epitopes (
As intended, binding scFv hits from in vitro selection with the 147-157 epitope-targeted engineered polypeptides are consistent with specificity for the intended portion of CD25.
Each of 117 scFv's from the screening campaign were tested against four alanine mutations pairs (
Thirty antibodies were selected for additional testing as full-length immunoglobulins. The heavy and light chain sequences were cloned into human immunoglobulin G (IgG1) format and expressed and purified. Binding to CD25 was assessed by Octet® as shown in Table 13.
A phage-display library was generated from the immunoglobulin genes of a mouse immunized with full-length CD25. This library, biased towards CD25-binding antibodies, was panned against the indicated engineered polypeptides, yielding the complementarity determining region sequences indicated in Table 14 Å and Table 14B.
Sequence analysis suggests that these antibodies derive from clonal lineages that may be grouped as indicated in Table 15 Å and Table 15B.
This application is a continuation of International Application No. PCT/US2019/061567, filed Nov. 14, 2019, which claims the priority benefit of U.S. Provisional Application No. 62/902,334, filed Sep. 18, 2019; and U.S. Provisional Application No. 62/767,431, filed Nov. 14, 2018, the entire contents of which are hereby incorporated by reference in their entirety for all purposes.
Number | Date | Country | |
---|---|---|---|
62902334 | Sep 2019 | US | |
62767431 | Nov 2018 | US |
Number | Date | Country | |
---|---|---|---|
Parent | PCT/US2019/061567 | Nov 2019 | US |
Child | 17320118 | US |