HIV-1 engages target cells through interactions between the viral spike glycoprotein Env and host receptors. HIV-1 Env is formed by a homotrimeric complex of gp160 subunits that are cleaved by host proteases during maturation into extracellular gp120 and membrane-tethered gp41, which remain non-covalently associated in a ‘closed’ conformation1. During infection, the gp120 subunit binds the primary host receptor CD4, inducing a change to an ‘open’ conformation of Env that exposes binding sites for a secondary co-receptor2-4. This co-receptor is one of either two chemokine receptors, CCR5 or CXCR4, and once bound, further conformational changes release fusogenic regions of gp41 that mediate membrane fusion and viral entry into the host cell.
Env is the only viral protein on the outside of a HIV-1 virion accessible to the humoral immune system, and therefore has been extensively studied for vaccine development °. Mature Env has a complex structural organization. At the apex most distal from the membrane is a trimerization domain that mediates contacts between gp120 subunits, and is formed by variable regions V1, V2, and V36-11. In the central region of the Env spike are the gp120 inner and outer domains, which face in towards the trimer axis or out towards bulk solvent, respectively. Below, the N- and C-termini of gp120 are encircled and ‘grasped’ by the extracellular region of gp41, with the gp41 heptad repeat HR1 forming trimer contacts that resemble a helical-bundle ‘spine’ at the center of the complex. Next is a trimeric association of gp41 transmembrane helices12, followed by gp41 cytoplasmic tails. Infection is initiated by CD4 binding to Env in a closed conformation, where there are close interactions between apical tips of the trimerization domains6. CD4 binding stabilizes large conformational changes that break apical contacts and promote opening of the structure6,13. The closed and open conformations have distinct antigenic profiles, with many broadly neutralizing antibodies (bNAbs) binding preferentially to closed Env14,15, while the open state presents strain-specific, poorly neutralizing epitopes in the V3 region16,17.
Env sequence diversity, exposure of non-neutralizing or strain-specific immunodominant epitopes, and epitope shielding by extensive glycosylation, all act to limit potency and breadth of the host response. Design and purification of Env immunogens that correctly fold into native-like, pre-fusion closed trimers is also challenging, due to intrinsic conformational flexibility. The most notable engineered form of Env for vaccine purposes contains the so-called SOSIP mutations (an introduced disulfide between residues 501 and 605 to prevent gp120-gp41 dissociation18, and an I559P mutation that destabilizes the post-fusion trimeric conformation19; all residue numbers throughout are based on the HXB2 reference strain), which permit the expression and purification of soluble extracellular Env as native trimers20,21. The best studied SOSIP construct is in the clade A strain BG505 sequence, but SOSIP mutations have also been introduced into Env from other strains in different clades. SOSIP constructs have been engineered with additional mutations and/or disulfide bonds to further stabilize the closed conformation recognized by most bNAbs22-24, reduce exposure of V3 region epitopes17,22-24, be expressed as single chain non-proteolyzed native trimers25,27, and to bind germline B cells with higher affinity to stimulate lineages that mature into potent bNAb secretors24,28. More reductionist approaches have trimmed down the extracellular regions of Env to produce stable fragments of gp12029,30 or, at the most extreme, stabilize isolated epitopes for focusing an immune response31-33. Understanding how Env sequence dictates conformation can therefore assist the design of sophisticated Env immunogens for optimum immunogenicity. Compared to these previous efforts, full or close-to-full length Env that contains the transmembrane domain has not been as extensively engineered to stabilize closed trimeric conformations. This is despite full-length Env being well suited for some vaccine formats, such as a DNA vaccine, virus-like particles (VLPs), or purified protein embedded in membrane nanodiscs.
Deep mutational scanning couples directed evolution of diverse sequence populations with next generation sequencing to track the phenotypic fitness of thousands of mutations simultaneously34. The method has been used for vaccine design to screen for mutations within SOSIP constructs that enhance direct interactions with an antibody and its germline precursors, reduce exposure of V3 epitopes, and improve thermostability24,35 22. Tissue culture propagation of viruses expressing Env variants has also been followed by next generation sequencing, and the mutational tolerance observed in these experiments closely aligns with observed diversity of natural Env sequences36. However, this experimental system has proven insightful for characterizing Env mutations that allow HIV-1 to escape neutralization from a broadly-neutralizing antibody (bNAb)37.
An embodiment provides an HIV-1 Env protein or fragment thereof comprising one or more of the amino acid mutations listed in Table 1, wherein the amino acids are numbered by HXB2 numbering.
Another embodiment provides an HIV-1 Env protein or fragment thereof comprising one or more of the sets of amino acid mutations listed in Table 2, wherein the amino acids are numbered by HXB2 numbering.
Even another embodiment provides an HIV-1 Env protein or fragment thereof comprising one or more of the amino acid mutations listed in Table 1 or Table 2, wherein the protein or fragment thereof has at least one mutation shown in Tables 1 or 2 and otherwise has about 95% or more sequence identity to an HIV-1 Env protein (such as a wild-type HIV-1 Env protein).
Yet another embodiment provides a trimeric complex or portion thereof comprising HIV-1 Env proteins or fragments comprising one or more of the mutations listed in Table and Table 2 in a trimeric conformation.
Still another embodiment provides an immunogen comprising an HIV-1 Env protein or fragment thereof comprising one or more of the amino acid mutations listed in Table 1 or Table 2 or an HIV-1 Env trimeric complex or portions thereof comprising one or more of the amino acid mutations listed in Table 1 or Table 2.
Even another embodiment provides a method of screening a compound for binding to one or more proteins thereof, wherein the one or more proteins comprise an HIV-1 Env protein or fragment thereof comprising one or more of the amino acid mutations listed in Table 1 or Table 2, a trimeric complex or portions thereof comprising one or more of the amino acid mutations listed in Table 1 or Table 2, or combinations thereof. The method comprises providing the one or more proteins, fragments, complexes or portions thereof; contacting the one or more proteins, fragments, complexes or portions thereof with the compound; and determining the ability of the compound to bind to the one or more proteins, fragments, complexes or portions. The one or more proteins, fragments, trimeric complexes, or portions thereof can comprise 2, 5, 10, 15, or more proteins, fragments, trimeric complexes, or portions thereof. The compound can inhibit an HIV-mediated activity. The compound can be provided in a library.
An embodiment provides a library comprising two or more (e.g., 2, 5, 10, 20, 30, 50, 100 or more) HIV-1 Env proteins or fragments thereof comprising one or more of the amino acid mutations listed in Table 1 or Table 2 or an HIV-1 Env trimeric complex or portions thereof comprising one or more of the amino acid mutations listed in Table 1 or Table 2.
Another embodiment provides a nucleic acid molecule encoding an HIV-1 Env protein or fragment thereof comprising one or more of the amino acid mutations listed in Table 1 or Table 2 or a trimeric complex or portions thereof comprising one or more of the amino acid mutations listed in Table 1 or Table 2.
Still another embodiment provides a vector comprising a nucleic acid molecule described herein.
Yet another embodiment provides a host cell comprising a vector described herein.
Even another embodiment provides a method of producing an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex, or portion thereof comprising culturing a host cell described herein in a culture medium to produce the protein, fragment, complex, or portion thereof. The host cell can be a mammalian cell having the ability to glycosylate proteins.
An embodiment provides a composition comprising one or more HIV-1 Env proteins or fragments thereof described herein, one or more HIV-1 Env trimeric complexes or portions thereof described herein, and a pharmaceutically acceptable carrier. The composition can comprise one or more HIV-1 Env proteins or fragments thereof or one or more HIV-1 Env trimeric complexes or portions thereof. The composition can also comprise one or more HIV-1 Env proteins or fragments thereof and one or more HIV-1 Env trimeric complexes or portions thereof. The composition can further comprise an adjuvant.
Even another embodiment provides method for eliciting an immune response against an HIV-1 infected cell in a subject comprising administering to the subject an amount of an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex or portion thereof described herein, effective to elicit an immune response in the subject.
Yet another embodiment provides a method for preventing a subject from becoming infected with HIV-1 comprising administering to the subject a prophylactically effective amount of an amount of an HIV-1 Env protein, fragment thereof. HIV-1 Env trimeric complex or portion thereof described herein, such that the subject is prevented from becoming infected with HIV-1.
Still another embodiment provides a method for reducing the likelihood of a subject becoming infected with HIV-1 comprising administering to the subject an amount of an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex or portion thereof described herein, effective to reduce the likelihood of the subject becoming infected with HIV-1. The subject may have been exposed to HIV-1.
Another embodiment provides a method for delaying the onset of, or slowing the rate of progression of, an HIV-1-related disease or symptom in an HIV-1-infected subject comprising administering to the subject an amount of an amount of an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex, or portion thereof described herein, effective to delay the onset of, or slow the rate of progression of the HIV-1-related disease or symptom in the subject.
An embodiment provides a method of isolating antibodies that specifically bind to an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex, or portion thereof described herein. The method comprises administering an effective amount of an HIV-1 Env trimeric complex or portion thereof, an HIV-1 Env protein or fragment thereof, an HIV-1 Env nucleic acid molecule, a vector, a host cell or a pharmaceutical composition described herein to a subject to generate antibodies that specifically bind to an HIV-1 Env protein or HIV-1 Env trimeric complex; and isolating the antibodies.
Another embodiment provides a method of identifying antibodies that specifically bind to an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex, or portion thereof as described herein. The method comprises administering an effective amount of an immunogen selected from: an HIV-1 Env trimeric complex or portion thereof, an HIV-1 Env protein or fragment thereof, an HIV-1 Env nucleic acid molecule, a vector, a host cell or a pharmaceutical composition described herein to B cells in an in vitro cell culture system to generate antibodies that specifically bind to the HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex or portion thereof. Antibodies specific for the administered complex or composition are then isolated.
Yet another embodiment provides a method of making an isolated hybridoma that produces a broadly neutralizing antibody that specifically binds to an HIV-1 Env trimeric complex, portion thereof, an HIV-1 Env protein, or fragment thereof. The method comprises immunizing a mammal with an effective amount of: an HIV-1 Env trimeric complex or portion thereof, an HIV-1 Env protein or fragment thereof, an HIV-1 Env nucleic acid molecule, a vector, a host cell or a pharmaceutical composition described herein. Splenocytes are isolated from the immunized mammal and the isolated splenocytes are fused with an immortalized cell line to form hybridomas. Individual hybridomas are screened for production of an antibody that specifically binds with the HIV-1 Env protein or fragment thereof or HIV-1 Env trimeric complex or portion thereof to isolate the hybridoma.
Still another embodiment provides a method of producing a stable HIV-1 Env trimer in a closed conformation. The method comprises making one or more of the amino acid mutations or sets of mutations described herein in an HIV-1 Env protein and expressing the protein such that a stable HIV-1 Env trimer in a closed conformation is produced.
HIV-1 infection is initiated by viral Env engaging the host receptor CD4, triggering Env to transition from a ‘closed’ to ‘open’ conformation during the early events leading to virus-cell membrane fusion. Using deep mutational scanning, sequence-activity landscapes were defined for Env interacting with CD4, antibody VRC01 that binds the CD4 site but fails to induce the open state, and antibody PG16 that recognizes closed Env. Compared to CD4 or VRC01 binding, the Env trimer interface was under selection for PG16 recognition, and mutations that enhance presentation of the PG16 quatemary epitope frequently reduce positive charge at the Env apex, suggesting this region is primed for opening by electrostatic repulsion. Positive stabilization of the closed state is insufficient to reduce CD4 binding, membrane fusion, or V3 epitope exposure, which instead requires explicit destabilization of the open conformation. Mutations described herein stabilize a closed Env conformation, are broadly applicable to different HIV-1 strains, and can assist in the engineering of Env-based immunogens that better present epitopes recognized by broadly neutralizing antibodies.
Env sequence preferences are determined independent of infection and virus propagation for interactions with three protein ligands that act as conformational probes: CD4, which induces the open Env conformation and binds monomeric gp120 with highest affinity; bNAb VRC01, which binds tightly to both monomeric gp120 and mature Env without inducing the open conformation; and bNAb PG16, which exclusively binds closed trimeric Env1,38-44. While soluble SOSIP proteins have been extensively engineered, less focus has been applied to conformational stabilization of full-length Env that may be better suited for virus-like particle or DNA vaccines. Mutations are provided herein that stabilize the PG16 quatemary epitope in full-length Env sequences from representative strains in clades A, B and C, while simultaneously enhancing presentation of epitopes recognized by PGT121 and PGT128 bNAbs, and also revealing an electrostatic repulsion mechanism for inducing the closed-to-open transition. It would be ideal if there were a suite of mutations for applying broadly to any HIV-1 strain, which stabilize Env in a closed trimer for improved bNAb elicitation. A HIV-1 vaccine could then be rapidly modified and updated to contain stabilized Env sequences from local prevailing strains. The HIV-1 Env mutants described herein address a pressing need in that they can be rapidly used as a focused mutational screen to conformationally stabilize Env from diverse HIV-1 strains.
Alignments of protein sequences observed in nature are very effective at revealing conserved residues in primary structure for correct folding and function. However, a natural sequence like HIV-1 Env is shaped by multiple activities; Env must fold, traffic to the cell surface, be incorporated into a budding virion, bind target receptors, mediate membrane fusion, and escape antibody neutralization. Here, in vitro selection is used to focus on the specific activities of folding to a trimeric closed conformation and binding to the CD4 receptor. Sequence-activity landscapes provide insight into Env mutational tolerance for acquiring closed and CD4-bound conformations, information which is not at all obvious from a multiple sequence alignment. This information provides mechanistic insights, and is leveraged for engineering full-length Env with properties that may prove especially useful for vaccines that incorporate membrane-anchored Env (e.g. virus-like particles or DNA vaccines), and can also be applied to soluble extracellular constructs like SOSIP.
Increased presentation of the PG16-recognized conformation can enhance fusion to target membranes, consistent with the closed state being relevant to the molecular events of infection and an excellent target for neutralization by vaccines. Env-QES.i01 and QES.i02 variants still bind CD4 and expose V3 region epitopes, either because of dynamic or induced conformational fluctuations, or persistent conformational heterogeneity that includes trimeric, monomeric, closed, open and/or misfolded forms. V3 region epitopes are hidden only by inclusion of an additional core mutation (V255M) to destabilize the CD4-bound open state, and some of the mutations identified herein from the mutational scan of DU422 gp140 (e.g. Y484W) may similarly act to destabilize the CD4-bound open state. Likewise, trimer-stabilized SOSIP gp140 constructs also have persistent binding to V3 antibodies17,20, and induce undesirable V3 non-neutralizing responses in immunized animals16,17. This has necessitated negative selection strategies to explicitly reduce V3 loop exposure17,24. The findings described herein emphasize that positive stabilization alone appears insufficient to prevent Env opening, and the open state must be explicitly destabilized.
HIV-1 is completely dependent upon the Env protein to enter cells. HIV-1 Env is formed by a homotrimeric complex of gp160 subunits that are cleaved by host proteases during maturation into extracellular gp120 and membrane-tethered gp41, which remain non-covalently associated in a ‘closed’ conformation. Production of Env trimeric complexes that mimic the native spike, however, is challenging in part because the recombinant trimers either are unstable or aggregate. Therefore, mutants of Env proteins that can form stable trimeric complexes in the closed conformation are desirable. Membrane distal and membrane proximal aspects of the HIV-1 trimer in the closed conformation include several distinct structural elements that are absent from the corresponding regions of the HIV-1 Env trimer in its CD4-bound open conformation. The following mutations can be used to generate mutant, stable Env trimeric complexes in the closed conformation across many clades of HIV-1. For example, the amino acid substitutions disclosed herein can be used to alter the Env protein in any clades or subtypes of Group M, O, N, or P HIV-1 strains.
HIV-1 Env proteins and nucleic acid sequences encoding Env proteins and methods for the manipulation and insertion of such nucleic acid sequences into vectors, are well known (see, e.g., HIV Sequence Compendium, Division of AIDS. National Institute of Allergy and Infectious Diseases (2013); HIV Sequence Database (hiv-web.lanl.gov/content/hiv-db/mainpage.html); Sambrook et al. (Molecular Cloning: A Laboratory Manual, 4th ed, Cold Spring Harbor, N.Y., 2012); Ausubel at al. (In Current Protocols in Molecular Biology, John Wiley & Sons, New York, through supplement 104, 2013). HIV-1 Env protein sequences are known and are available in the HIV Sequence Database (hiv-web.lanl.gov/content/hiv-db/mainpage.html).
Amino acid mutations or substitutions are described herein. An amino acid substitution is the replacement of one amino acid in a polypeptide with a different amino acid or with no amino acid (i.e., a deletion).
An embodiment provides an HIV-1 Env protein or fragment thereof comprising one or more of the amino acid mutations (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or more mutations) in Table A. A fragment can be about 10, 20, 30, 40, 50, 75, 100, 200, 300, 400, 500, 600, or more amino acids as long as it contains an amino acid mutation listed in Tables A-H and can induce an immunogenic response in a subject (e.g., a mammal such as a human). The amino acids are numbered by HXB2 numbering.
An embodiment provides an HIV-1 Env protein or fragment thereof comprising one or more of the combination of amino acid mutations (e.g., 1, 2, 3, 4, 5, or more sets of mutations) in Table B. The amino acids are numbered by HXB2 numbering.
An embodiment provides an HIV-1 EnvBaL protein or fragment thereof comprising one or more of the amino acid mutations (e.g., 1, 2, 3, 4, 5, or more mutations or 1, 2, 3, 4, 5, or more sets of mutations) in Table C. The amino acids are numbered by HXB2 numbering.
An embodiment provides an HIV-1 EnvDU422 protein or fragment thereof comprising one or more of the amino acid mutations (e.g., 1, 2, 3, 4, 5, or more mutations or 1, 2, 3, 4, 5, or more sets of mutations) in Table D. The amino acids are numbered by HXB2 numbering.
An embodiment provides an HIV-1 EnvQ769.d22 protein or fragment thereof comprising one or more of the amino acid mutations (e.g., 1, 2, 3, 4, 5, or more mutations or 1, 2, 3, 4, 5, or more sets of mutations) in Table E. The amino acids are numbered by HXB2 numbering.
An embodiment provides an HIV-1 Env Q842.d12 protein or fragment thereof comprising one or more of the amino acid mutations (e.g., 1, 2, 3, 4, 5, or more mutations or 1, 2, 3, 4, 5, or more sets of mutations) in Table F. The amino acids are numbered by HXB2 numbering.
An embodiment provides an HIV-1 Env 25711 protein or fragment thereof comprising one or more of the amino acid mutations (e.g., 1, 2, 3, 4, 5, or more mutations or 1, 2, 3, 4, 5, or more sets of mutations) in Table G. The amino acids are numbered by HXB2 numbering.
An embodiment provides an HIV-1 BG505 SOSIP.664 protein or fragment thereof comprising one or more of the amino acid mutations (e.g., 1, 2, 3, 4, 5, or more mutations or 1, 2, 3, 4, 5, or more sets of mutations) in Table H. The amino acids are numbered by HXB2 numbering.
An HIV-1 Env mutant protein described herein comprises one or more of the amino acid substitutions described herein (e.g., those shown in Tables A, B, C, D, E, F, G, or H). HIV-1 Env mutant proteins described herein can comprise the disclosed amino acid mutation or set of amino acid mutations and otherwise have 75, 80, 90, 95, 99% or more sequence identity to any HIV-1 Env protein (e.g., an HIV-1 EnvBaL protein, an HIV-1 EnvDU422 protein, an HIV-1 EnvQ769.d22 protein, an HIV-1 Env Q842.d12 protein, an HIV-1 Env 25711 protein or other HIV-1 Env proteins). In some examples, an amino acid in a polypeptide is substituted with an amino acid from a homologous polypeptide, for example, an amino acid in a recombinant Clade A HIV-1 Env polypeptide can be substituted with the corresponding amino acid from a Clade B HIV-1 Env polypeptide.
All amino acid numbering of HIV-1 Env used herein refers to HXB2 number. HXB2 numbering is described in detail in, for example, Korber et al. Numbering Positions in HIV Relative to HXB2CG, in Korber et al., eds., Human Retroviruses and AIDS 1998, pp. III-102-III-111, Los Alamos National Laboratory, Los Alamos, N. Mex., report LA-UR 99-1704, which is incorporated by reference in its entirety.
Korber presents a clearly numbered set of proteins, and the full length genome, for HIV HXB2, GenBank accession number K03455. HIV HXB2 is also known as: HXBc2, for HXB clone 2; HXB2R, and HXB2CG in GenBank, for HXB2 complete genome. The HXB2 Env sequence is shown in SEQ ID NO:1 below.
Therefore, the HIV-1 Env amino acid mutation “T49D” means that the T at position 49 of the Env sequence is changed to D. In some cases the first amino acid of the mutation abbreviation may not match precisely to the HXB2 numbering. HXB2 has a V at position 200, but one mutation described herein is A200E. That means the HIV strain studied has an A at the HXB2 200 position and that the A is mutated to E. One of skill in the art is well-versed in the HXB2 numbering system.
A protein is a polymer of amino acids covalently linked by amide bonds. A protein can be post-translationally modified. A purified protein is a protein preparation that is substantially free of cellular material, other types of proteins, chemical precursors, chemicals used in synthesis of the protein, or combinations thereof. A protein preparation that is substantially free of cellular material, culture medium, chemical precursors, chemicals used in synthesis of the protein, etc., has less than about 30%, 20%, 10%, 5%, 1% or more of other proteins, culture medium, chemical precursors, and/or other chemicals used in synthesis. Therefore, a purified protein is about 70%, 80%, 90%, 95%, 99% or more pure. A purified protein does not include unpurified or semi-purified cell extracts or mixtures of protein that are less than 70% pure.
The term “proteins” can refer to one or more of one type of protein (a set of proteins). “Proteins” can also refer to mixtures of two or more different types of proteins (a mixture of proteins). The terms “proteins” or “protein” can each also mean “one or more proteins.” As used in the specification, “proteins” refers to both full-length proteins and fragments of proteins. The HIV-1 Env proteins comprising one or more mutations described herein are non-naturally occurring.
HIV-1 Env proteins can be fragments of the proteins described herein. For example an HIV-Env can comprise a fragment of an HIV-1 Env protein, having one or more of the mutations described herein. A fragment can be about 20, 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 500, 600, 700, 800 or more amino acids in length. A fragment can be about 800, 700, 600, 500, 450, 425, 400, 375, 350, 325, 300, 275, 250, 225, 200, 175, 150, 125, 100, 75, 50, 20, or less amino acids in length. In an embodiment, an HIV-1 Env protein has about 1, 5, 10, 20, 30, 40, 50 or more amino acids truncated from the C-terminus. Such truncations can advantageously remove motifs for internalization that reduce surface expression.
An HIV-1 Env protein or fragment thereof can be linked to an epitope or affinity tag such as polyhistidine, DYKDDD (SEQ ID NO:2) tag, c-myc tag, Strep tag, TAP tag, and HA tag.
A mutated protein comprises at least one deleted, inserted, and/or substituted amino acid, which can be accomplished via mutagenesis of polynucleotides encoding these amino acids. Mutagenesis includes well-known methods in the art, and includes, for example, site-directed mutagenesis by means of PCR or via oligonucleotide-mediated mutagenesis as described in Sambrook et al., Molecular Cloning-A Laboratory Manual, 2nd ed., Vol. 1-3 (1989).
A protein can include multiple polypeptide chains. For example, mature HIV-1 Env comprises gp120 and gp41 polypeptide chains. A single contiguous polypeptide chain of amino acid residues can include multiple polypeptides. For example, a single chain HIV-1 Env can comprise a gp120 polypeptide linked to a gp41 polypeptide by a peptide linker.
Proteins and polynucleotides have about 75, 80, 85, 90, 95, 96, 97, 98, 99% or more sequence identity to proteins and polynucleotides described herein (e.g., proteins have one or more of the mutations show in Table A and Table B) can be used herein. Proteins and polynucleotides that have about 75, 80, 85, 90, 95, 96, 97, 98, 99% or more sequence identity to polypeptides and polynucleotides described herein while retaining one or more of the mutations show in Table A and Table B can also be used herein.
Sequence identity is the similarity between amino acid sequences and is expressed in terms of similarity between sequences. Sequence identity can be measured in terms of percentage identity (or similarity or homology); the higher the percentage, the more similar the two sequences are. Homologs, orthologs, or variants of a protein or nucleic acid molecule will have a relatively high degree of sequence identity when aligned using standard methods.
Methods of alignment of sequences for comparison are well known in the art. Programs and alignment algorithms are described in, for example, Smith & Waterman, Adv. Appl. Math. 2:482, 1981; Needleman & Wunsch, J. Mol. Biol. 48:443, 1970; Pearson & Lipman, Proc. Natl. Acad. Sci. USA 85:2444, 1988; Higgins & Sharp, Gene, 73:237-44, 1988; Higgins & Sharp, CABIOS 5:151-3, 1989; Corpet et al., Nuc. Acids Res. 16:10881-90, 1988; Huang et al. Computer Appls. in the Biosciences 8, 155-65, 1992; and Pearson et al., Meth. Mol. Bio. 24:307-31, 1994. Altschul et al., J. Mol. Biol. 215:403-10, 1990, provides a detailed discussion of sequence alignment methods and homology calculations.
Two sequences can be aligned and the number of matches can be determined by counting the number of positions where an identical nucleotide or amino acid residue is present in both sequences. The percent sequence identity is calculated by dividing the number of matches either by the length of the sequence set forth in the identified sequence, or by an articulated length (such as 100 consecutive nucleotides or amino acid residues from a sequence set forth in an identified sequence), followed by multiplying the resulting value by 100. For example, a peptide sequence that has 1776 matches when aligned with a test sequence having 2000 is 88.8 percent identical to the test sequence (1776/2000×100=88.8). The percent sequence identity value is rounded to the nearest tenth. For example, 88.11, 88.12, 88.13, and 88.14 are rounded down to 88.1, while 88.15, 88.16, 88.17, 88.18, and 88.19 are rounded up to 88.2. The length value will always be an integer.
Homologs and variants of a protein and nucleic acid molecule are typically characterized by possession of at least about 75%, for example at least about 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% sequence identity counted over the full length alignment with the amino acid or nucleic acid sequence of interest. Proteins with even greater similarity to the reference sequences will show increasing percentage identities when assessed by this method, such as at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% sequence identity. When less than the entire sequence is being compared for sequence identity, homologs and variants will typically possess at least 80% sequence identity over short windows of 10-20 amino acids, and may possess sequence identities of at least 85% or at least 90% or 95% depending on their similarity to the reference sequence. Methods for determining sequence identity over such short windows are available at the NCBI website.
For sequence comparison of nucleic acid sequences, typically one sequence acts as a reference sequence, to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Default program parameters are used. Methods of alignment of sequences for comparison are well known in the art. Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl. Math. 2:482, 1981, by the homology alignment algorithm of Needleman & Wunsch, J. Mol. Biol. 48:443, 1970, by the search for similarity method of Pearson & Lipman, Proc. Natl. Acad. Sci. USA 85:2444, 1988, by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by manual alignment and visual inspection (see, e.g., Sambrook et al. (Molecular Cloning: A Laboratory Manual, 4th ed, Cold Spring Harbor, N.Y., 2012) and Ausubel et al. (In Current Protocols in Molecular Biology, John Wiley & Sons, New York, through supplement 104, 2013). One example of a useful algorithm is PILEUP. PILEUP uses a simplification of the progressive alignment method of Feng & Doolittle, J. Mol. Evol. 35:351-360, 1987. The method used is similar to the method described by Higgins & Sharp, CABIOS 5:151-153. 1989. Using PILEUP, a reference sequence is compared to other test sequences to determine the percent sequence identity relationship using the following parameters: default gap weight (3.00), default gap length weight (0.10), and weighted end gaps. PILEUP can be obtained from the GCG sequence analysis software package, e.g., version 7.0 (Devereaux et al., Nucl. Acids Res. 12:387-395, 1984).
Another example of algorithms that are suitable for determining percent sequence identity and sequence similarity are the BLAST and the BLAST 2.0 algorithm, which are described in Altschul et al., J. Mol. Biol. 215:403-410, 1990 and Altschul et al., Nucleic Acids Res. 25:3389-3402, 1977. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (ncbi.nlm.nih.gov). The BLASTN program (for nucleotide sequences) uses as defaults a word length (W) of 11, alignments (B) of 50, expectation (E) of 10, M=5, N=−4, and a comparison of both strands. The BLASTP program (for amino acid sequences) uses as defaults a word length (W) of 3, and expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89:10915, 1989).
In an embodiment an HIV-1 Env protein is from strain/clade BaL. DU422, YU-2, 25711, Q769.d22. Q842.d12, BG505, 191084, AD8, B41, SF162P3, 001428, SHIV327C, Hu_A10, AC10, ZM197M, CH110, H031, CH111, 257-31, PVO, CH115, or any other HIV-1 strain or clade and comprises one or more of the mutations or sets of mutations described herein.
An embodiment provides a library comprising two or more (e.g., 2, 5, 20, 10, 50, 75, 100 or more) of the mutated HIV-1 Env proteins disclosed herein. Another embodiment provides a display library of one or more of the mutated HIV-1 Env proteins disclosed herein. The mutated HIV-1 Env proteins can comprise a fragment of the mutated HIV-1 Env protein or a full-length mutated HIV-1 Env protein. A fragment of the mutated HIV-1 Env protein comprises at least one of the disclosed mutations and is about 10, 20, 50, 100, 200, 300, 400, 500, 600, 700, 800, or more amino acids in length. In an embodiment, a fragment can induce an immune response.
A mutated HIV-1 Env protein or fragment thereof can be displayed on the surface of a library, by for example the C terminus. This is a display library.
A mutant HIV-1 Env display library can comprise a phage display library. A phage display library can be a collection of phage that has been genetically engineered to express one or more mutant HIV-1 Env proteins or fragments thereof on their outer surface. In an embodiment nucleic acid molecules encoding the mutant HIV-1 Env proteins or fragments thereof are inserted in frame into a gene encoding a phage capsule protein. In another embodiment, a phage display library is a collection of phage that displays one or more mutant HIV-1 Env proteins or fragments thereof on their outer surface.
A display library can be, for example, a phage display library, a phagemid display library, a virus display library, a bacterial cell display library, a mammalian cell display library, yeast display library, a λgt11 library, an in vitro library selection system (CIS display), an in vitro compartmentalization library, an antibody-ribosome-mRNA (ARM) ribosome display library, or a ribosome display library.
Methods of making and screening such display libraries are well known to those of skill in the art and described in, e.g., Molek et al. (2011) Molecules 16, 857-887; Boder et al., (1997) Nat Biotechnol 15, 553-557; Scott et al. (1990) Science 249, 386-390; Brisette et al. (2007) Methods Mol Biol 383, 203-213; Kenrick et al. (2010) Protein Eng Des Sel 23, 9-17; Freudl et al. (1986) J Mol Biol 188, 491-494; Getz et al. (2012) Methods Enzymol 503, 75-97; Smith et al. (2014) Curr Drug Discov Technol 11, 48-55; Hanes, et al. (1997) Proc Natl Acad Sci USA 94, 4937-4942; Lipovsek et al., (2004) J Imm Methods 290, 51-67; Ullman et al. (2011) Brief. Funct. Genomics, 10, 125-134; Odegrip et al. (2004) Proc Natl Acad Sci USA 101, 2806-2810; and Miller et al. (2006) Nat Methods 3, 561-570.
A mutant HIV-1 Env library or display library can be screened for biological activity.
There is a need for stabilized HIV-1 Env trimeric complexes that have improved percentage of trimeric complex formation (e.g., about 10, 20, 30, 40, 50 60, 70% or more improved trimeric complex formation as compared to wild-type HIV-1 Env trimeric complex formation), improved trimeric complex yield (e.g., about 10, 20, 30, 40, 50 60, 70% or more improved trimeric complex yield as compared to wild-type HIV-1 trimeric complex yield), and/or improved trimeric complex stability (e.g., about 10, 20, 30, 40, 50 60, 70% or more better HIV-1 Env trimeric complex stability as compared to wild-type HIV-1 Env trimeric complex stability).
An HIV-1 Env trimeric complex comprises HIV-1 Env proteins comprising at least one mutation described in a mature trimeric conformation. Three HIV-1 Env proteins come together to form one HIV-1 Env trimeric complex Therefore, each HIV-1 Env trimeric complex has three gp120 subunits and three gp41 subunits. A portion of an HIV-1 Env trimeric complex can be about 100, 200, 300, 400, 500, 600, 700 or more amino acids, as long as it contains an amino acid mutation listed in Tables A-H, can induce an immunogenic response in a subject (e.g., a mammal such as a human), and includes at least a portion of three gp120 subunits and at least a portion of three gp41 subunits.
An HIV-1 Env trimeric complex can be a chimeric HIV-1 Env trimeric complex that comprises amino acid sequences from two or more different HIV-1 clades or amino acid sequences from two or more different mutant HIV-1 Env proteins.
HIV-1 trimeric complex formation can be measured by an antibody binding assay using antibodies that bind specifically to the trimeric form of the HIV-1 Env protein. Examples of trimeric complex specific antibodies that can be used to detect a trimer form include, but are not limited to, monoclonal antibodies (mAbs) PGT145, PGDM1400, PG16, and PGT151. Any antibody binding assay known in the art can be used to measure the percentage of trimer formation of a recombinant HIV-1 Env protein of the invention, such as ELISA, AlphaLISA, etc.
The amount of HIV-1 Env trimeric complexes formed and the total amount of envelope protein expressed can also be determined using, for example, chromatographic techniques (e.g., size exclusion chromatography multi-angle light scattering (SEC-MALS) that can separate the trimeric complex form from any other forms of the HIV-1 Env protein, e.g., the monomer form.
An HIV-1 Env trimeric complex can comprise three gp120-gp41 protomers comprising a gp120 polypeptide and a gp41 extracellular domain. Mature gp120 includes HIV-1 Env residues from about 31-511 (wherein the amino acids are numbered by HXB2 numbering), and contains most of the extemal, surface exposed, domains of the HIV-1 Env trimeric complex. The gp120 portion of the trimeric complex can bind to cellular CD4 receptors and can bind to cellular chemokine receptors. A mature gp120 polypeptide is an extracellular polypeptide that interacts with the gp41 extracellular domain (approximately HIV-1 Env positions 512-644) to form an HIV-1 Env protomer that trimerizes to form an HIV-1 Env trimeric complex. Mature gp41 comprises approximately HIV-1 Env amino acids 512-856, and includes cytosolic, transmembrane, and extracellular-domains. The gp41 extracellular domain comprises HIV-1 Env residues from about 512-644.
In an embodiment an HIV-1 Env protein comprises one or more of the mutations disclosed herein in an extracellular domain. In an embodiment an HIV1-Env protein comprises a fragment of an HIV-1 Env protein, which includes at least one extracellular domain or a portion of an extracellular domain having one or more of the mutations described herein. An extracellular fragment can be about 20, 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, or more amino acids in length.
In an embodiment, the N-terminal residue of the gp120 polypeptide is one of HIV-1 Env positions 1-35 (i.e., one or more of amino acids 1-34 can be absent). In an embodiment, the C-terminal residue of the gp120 polypeptide is one of HIV-1 Env positions 503-512 (i.e., one or more of amino acids 504-512 can be truncated or removed). In an embodiment, the N-terminal residue of the gp41 extracellular domain is one of HIV-1 Env positions 512-522 (i.e., one or more of amino acids 512-521 are absent). In an embodiment the C-terminal residue of the gp41 extracellular domain is one of HIV-1 Env positions 640-683 (i.e., one or more of amino acids 640-683 is removed). All numbering refers to HXB2 numbering.
Truncations of the C-terminus of HIV-1 Env can be useful to (i) increase expression and/or (ii) make a soluble extracellular fragment.
An HIV-1 Env trimeric complex can be stable in a mature closed conformation. An HIV-1 Env trimeric complex comprises at least one mutant HIV-1 Env protein or fragment described herein and can exhibit increased retention of the mature closed conformation upon CD4 binding compared to a corresponding wild-type or naturally occurring HIV-1 Env trimeric complex.
A HIV-1 Env trimeric complex stabilized in the mature closed conformation can have at least about 60, 70, 80, 90, 95, 98, 99% or more reduced transition to the CD4-bound open conformation upon CD4 binding compared to a corresponding native HIV-1 Env trimeric complex. The stabilization of the mature closed conformation by one or more mutations described herein can be, for example, energetic stabilization (for example, reducing the energy of the mature closed conformation relative to the CD4-bound open conformation) and/or kinetic stabilization (for example, reducing the rate of transition from the mature closed conformation to the open conformation) and/or reduced conformational heterogeneity (for example, a greater fraction of the expressed protein is in the closed conformation). Additionally, stabilization of the HIV-1 Env trimeric complex in the mature closed conformation can include an increase in resistance to denaturation compared to a corresponding native HIV-1 Env trimeric complex.
In an embodiment, the inclusion of one or more mutations described herein increases the pool of HIV-1 Env trimeric complexes present in the closed state as compared to wild-type or naturally occurring HIV-1 Env trimeric complexes. That is, use of HIV-1 Env proteins or fragments thereof having one or more mutations described herein can result in about 5, 10, 20, 30, 40, 50, 60, 70, 80, 90% or more HIV-1 Env trimeric complexes present in the closed state as compared to the use of wild-type or naturally occurring HIV-1 proteins or fragments thereof.
Methods of determining if a HIV-1 Env trimeric complex is in the mature closed conformation include, for example, negative stain electron microscopy and antibody binding assays using a mature closed conformation specific antibody, such as VRC26 or PGT145. Methods of determining if a HIV-1 Env trimeric complex is in the CD4-bound open conformation include for example, negative stain electron microscopy and antibody binding assays using a CD4-bound open conformation specific antibody, such as 17b, which binds to a CD4-induced epitope.
In an embodiment an HIV-1 Env trimeric complex can comprise a non-natural disulfide bond between cysteine substitutions at positions 201 and 433, a non-natural disulfide bond between cysteine substitutions at positions 501 and 605, and a proline substitution at position 559.
In an embodiment an HIV-1 Env trimeric complex does not specifically bind to a CD4-induced antibody when incubated with a molar excess of soluble CD4.
An embodiment provides an immunogen comprising one or more of the HIV-1 Env proteins, fragments thereof, trimeric complexes or portions thereof described herein. An immunogen can also be a vector, nucleic acid molecule, or host cell as described herein. An immunogen can induce an immune response in a mammal, including for example, humans infected with HIV-1 or at risk of HIV-1 infection. Administration of an immunogen can lead to protective immunity and/or proactive immunity against HIV-1.
An immune response is a response of a cell of the immune system, such as a B cell, T cell, or monocyte, to a stimulus. In one embodiment, the response is specific for a particular HIV-1 Env antigen (an “antigen-specific response”). In one embodiment, an immune response is a T cell response, such as a CD4+ response or a CD8+ response. In another embodiment, the response is a B cell response, and results in the production of specific antibodies. In an embodiment, an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex or portion thereof as described herein comprises an epitope or other construct such that the protein or fragment thereof will bind an MHC molecule and induce an immune response, such as a cytotoxic T lymphocyte (“CTL”) response, and/or a B cell response (for example, antibody production), and/or a T-helper lymphocyte response against the antigen from which the protein or fragment thereof is derived.
Embodiments include nucleic acid molecules encoding the mutant HIV-1 Env protein, fragments thereof, HIV-1 Env trimeric complex or portion thereof disclosed herein. Polynucleotides contain less than an entire viral genome and can be single- or double-stranded nucleic acids. A polynucleotide can be RNA, DNA, cDNA, genomic DNA, chemically synthesized RNA or DNA or combinations thereof. A polynucleotide can comprise, for example, a gene, open reading frame, non-coding region, or regulatory element.
A gene is any polynudeotide molecule that encodes a protein or fragments thereof, optionally including one or more regulatory elements preceding (5′ non-coding sequences) and following (3′ non-coding sequences) the coding sequence. In one embodiment, a gene does not include regulatory elements preceding and following the coding sequence. A naturally occurring or wild-type gene refers to a gene as found in nature, optionally with its own regulatory elements preceding and following the coding sequence. A chimeric or recombinant gene refers to any gene that is not a naturally occurring or wild-type gene, optionally comprising regulatory elements preceding and following the coding sequence, wherein the coding sequences and/or the regulatory elements, in whole or in part, are not found together in nature. Thus, a chimeric gene or recombinant gene comprise regulatory elements and coding sequences that are derived from different sources or regulatory elements and coding sequences that are derived from the same source, but arranged differently than is found in nature. A gene can encompass full-length gene sequences (e.g., as found in nature and/or a gene sequence encoding a full-length polypeptide or protein) and can also encompass partial gene sequences (e.g., a fragment of the gene sequence found in nature and/or a gene sequence encoding a protein or fragment of a polypeptide or protein). A gene can include modified gene sequences (e.g., modified as compared to the sequence found in nature). Thus, a gene is not limited to the natural or full-length gene sequence found in nature.
Polynucleotides can be purified free of other components, such as proteins, lipids and other polynucleotides. For example, the polynucleotide can be 50%, 75%, 90%, 95%, 96%, 97%, 98%, 99%, or 100% purified. A polynucleotide existing among hundreds to millions of other polynucleotide molecules within, for example, cDNA or genomic libraries, or gel slices containing a genomic DNA restriction digest are not to be considered a purified polynucleotide. Polynucleotides can encode the proteins described herein (e.g., mutant HIV-1 Env proteins and fragments thereof).
Polynucleotides can comprise additional heterologous nucleotides that do not naturally occur contiguously with the polynucleotides. As used herein the term “heterologous” refers to a combination of elements that are not naturally occurring or that are obtained from different sources.
Polynucleotides can comprise other nucleotide sequences, such as sequences coding for linkers, signal sequences, TMR stop transfer sequences, transmembrane domains, or ligands useful in protein purification such as glutathione-S-transferase, histidine tag, and Staphylococcal protein A.
Polynucleotides can be isolated. An isolated polynucleotide is a naturally-occurring polynucleotide that is not immediately contiguous with one or both of the 5′ and 3′ flanking genomic sequences that it is naturally associated with. An isolated polynucleotide can be, for example, a recombinant DNA molecule of any length, provided that the nucleic acid sequences naturally found immediately flanking the recombinant DNA molecule in a naturally-occurring genome is removed or absent. Isolated polynucleotides also include non-naturally occurring nucleic acid molecules. Polynucleotides can encode full-length proteins, polypeptide fragments, and variant or fusion polypeptides.
Degenerate polynucleotide sequences encoding polypeptides described herein, as well as homologous nucleotide sequences that are at least about 80, or about 85, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99% identical to polynucleotides described herein and the complements thereof are also polynucleotides. Degenerate nucleotide sequences are polynucleotides that encode a protein described herein or fragments thereof but differ in nucleic acid sequence from the wild-type polynucleotide sequence, due to the degeneracy of the genetic code. Complementary DNA (cDNA) molecules, species homologs, and variants of polynucleotides that encode biologically functional polypeptides also are polynucleotides.
Polynucleotides can be obtained from nucleic acid sequences present in, for example, an HIV virion. Polynucleotides can also be synthesized in the laboratory, for example, using an automatic synthesizer. An amplification method such as PCR can be used to amplify polynucleotides from either genomic DNA or cDNA encoding the polypeptides.
Polynucleotides can comprise coding sequences for naturally occurring polypeptides or can encode altered sequences that do not occur in nature. Unless otherwise indicated, the term polynucleotide or gene includes reference to the specified sequence as well as the complementary sequence thereof.
An embodiment includes a vector comprising an HIV-1 Env polynucleotide that encodes a mutant HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex or portion thereof as disclosed herein. A vector is used to introduce a nucleic acid molecule into a host cell, thereby producing a transformed host cell. Recombinant vectors are vectors having recombinant DNA. A vector can include nucleic acid sequences that permit it to replicate in a host cell, such as an origin of replication. A vector can also include one or more selectable marker genes and other genetic elements known in the art.
If desired, polynucleotides can be cloned into an expression vector comprising expression control elements, including for example, origins of replication, promoters, enhancers, or other regulatory elements that drive expression of the polynucleotides in host cells. A vector can be, for example, a virus (e.g., adenovirus or poxvirus), naked DNA, oligonucleotide, cationic lipid (e.g., liposome), cationic polymer (e.g., polysome), virosome, nanoparticle, or dentrimer. Other viral vectors include adeno-associated virus vectors, retrovirus vectors, poxviruses, vaccinia virus, herpesviruses, togaviruses, picomaviruses, and baculoviruses. Other vectors include bacteriophages, phagemids, yeast artificial chromosomes, bacterial artificial chromosomes, virus-like particles, cosmids (plasmids into which phage lambda cos sites have been inserted) and replicons (genetic elements that are capable of replication under their own control in a cell).
The nucleic acid molecules of a vector can be encapsulated in a lipid membrane or by structural proteins (e.g., capsid proteins), that can include one or more viral polypeptides (e.g., an HIV-1 Env protein or portion thereof). A vector can be used to infect cells of a subject, such that translation of the heterologous gene(s) of the vector occurs. In an embodiment an HIV-1 Env trimeric complex is formed.
Naked DNA or oligonudeotides encoding one or more of the HIV-1 Env proteins, fragments thereof, HIV-1 trimeric complexes or portions thereof described herein can also be used to express HIV-1 Env proteins in a cell or a subject to promote formation of HIV-1 Env trimeric complexes. See, e.g., Cohen, Science 259:1691-1692 (1993); Fynan et al., Proc. Natl. Acad. Sci. USA, 90:11478 (1993); and Wolff et al., BioTechniques 11:474485 (1991).
A virus-like particle (VLP) can comprise one or more HIV-1 Env proteins, fragments thereof, or HIV-1 Env trimeric complexes as described herein. VLPs lack the viral components that are required for virus replication and thus represent a highly attenuated form of a virus. The VLP can display an HIV-1 Env protein, fragment thereof, trimeric complex or portion thereof that is capable of eliciting an immune response to HIV-1 Env when administered to a subject. Virus like particles and methods of their production are known to the person of ordinary skill in the art, and viral proteins from several viruses are known to form VLPs, including human papillomavirus, HIV, Semliki-Forest virus, human polyomavirus, rotavirus, and others. The virus like particle can include any of the recombinant HIV-1 Env trimeric complexes or immunogenic fragments thereof that are disclosed herein.
An embodiment provides a host cell comprising one or more vectors or nucleic acid molecules as described above. A recombinant host cell, transgenic host cell, or transformed host cell is a cell into which one or more foreign or exogenous nucleic acid molecules, synthetic nucleic acid molecules, or plasmids have been introduced or inserted into the cell. The one or more foreign nucleic acid molecules, synthetic nucleic acid molecules, or plasmids do not occur in the host cell in nature. The cell may be prokaryotic or eukaryotic. The term also includes any progeny of the host cell. It is understood that all progeny may not be identical to the parental cell since there may be mutations that occur during replication. However, such progeny are included when the term “host cell” is used.
An embodiment provides methods of producing HIV-1 Env proteins, fragments thereof, HIV-1 trimeric complexes or portions thereof comprising one or more of the mutations described herein. In an embodiment nucleic acid molecules capable of expressing HIV-1 Env proteins, fragments thereof, HIV-1 Env trimeric complexes or portions thereof are cloned into one or more vectors. In an embodiment nucleic acid molecules encoding HIV-1 Env gp120 subunits are cloned into a first vector and nucleic acid molecules encoding HIV-1 Env gp140 are cloned into a second vector. Both vectors can be introduced into host cells. In an embodiment the nucleic acid molecules encoding HIV-1 proteins or fragments thereof are codon optimized for expression in human cells.
One or more host cells can be cultured in an appropriate medium to produce HIV-1 Env proteins, fragments thereof, HIV-1 trimeric complexes or portions thereof. A host cell can be a mammalian cell having the ability to glycosylate proteins.
An embodiment provides a method of producing a stable HIV-1 Env trimer in a closed conformation. The method comprises making one or more of the amino acid mutations or sets of mutations disclosed herein in an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex or portion thereof and expressing the protein in a host cell.
An embodiment provides methods of screening a compound or test agent for binding to one or more HIV-1 Env proteins, fragments thereof, HIV-1 Env complexes or portions thereof comprising one or more of the mutations described herein. The one or more mutant HIV-1 Env proteins, fragments thereof, trimeric complexes or portions thereof are contacted with one or more test agents or compounds. The ability of the test agent or compound to bind to the one or more mutant HIV-1 Env proteins, fragments, trimeric complexes or portions thereof is determined. The test agent can be any type of compound, molecule, biological molecule, or drug.
Binding assays such as competitive binding assays and direct binding assays are well known to those of skill in the art.
A plurality of mutant HIV-1 Env proteins (e.g., 2, 5, 10, 15, or more mutant HIV-1 Env proteins, fragments thereof, trimeric complexes or portions thereof disclosed herein) can be screened.
A compound or test agent can be tested for inhibition of an HIV-1-mediated activity. An HIV-1-mediated activity can be, for example, viral spread, infection, or cell fusion. Cell fusion may be, for example, target cell entry or syncytial formation. In an embodiment, the compound or test agent inhibits an HIV-mediated activity. The compound or test agent can be provided in a library or display library.
In an embodiment, a screening step can also serve as the step of recovering a test agent or compound that binds to the mutant HIV-1 Env protein, fragment thereof, or trimeric compound.
Methods well known to those skilled in the art can be applied to screening of libraries or display libraries of test agents or compounds. Examples include solid-phase screening methods and liquid-phase screening methods. Solid-phase screening methods can involve, for example, immobilizing test agents or compounds onto a solid phase, and contacting mutant HIV-1 Env proteins, fragments, trimeric complexes or portions thereof contained in a liquid phase with the test agents or compounds and removing unbound mutant HIV-1 Env proteins, fragments, trimeric complexes or portions thereof and nonspecifically bound mutant HIV-1 Env proteins, fragments thereof, trimeric complexes or portions thereof and then selectively separating mutant HIV-1 Env proteins, fragments, trimeric complexes or portions thereof bound with the test agent or compound to screen for a protein, fragment, trimeric complex or portion thereof having, for example, a desired binding activity. A liquid-phase screening method can involve, for example, contacting mutant HIV-1 Env proteins, fragments thereof, trimeric complexes or portions thereof with test agents in a solution, removing unbound mutant HIV-1 Env proteins, fragments thereof, trimeric complexes and portions thereof and nonspecifically bound HIV-1 Env proteins, fragments thereof, trimeric complexes and portions thereof and then selectively separating the HIV-1 Env proteins, fragments thereof, trimeric complexes or portions thereof bound with test agents or compounds.
In an embodiment, an HIV-1 Env protein, fragment thereof, HIV-1 trimeric complex, portion thereof, or combinations thereof can be presented as a membrane protein in nanodiscs. See, e.g., Bayburt et al., J. Struct. Biol. (1998); 123:37; Civjan et al., BioTechniques (2003) 35:556; Hagn et al., J. Am. Chem. Soc. (2013) 135:1919.
Nanodiscs have a phospholipid bilayer system held together by membrane scaffold proteins (MSPs), which wrap around a patch of a lipid bilayer to form a disc-like particle or nanodisc. MSPs have a hydrophobic surface facing the lipids, and a hydrophilic surface facing outward. Nanodiscs are therefore highly soluble in aqueous solutions. Once assembled into nanodiscs, membrane proteins can be kept in solution in the absence of detergents.
MSPs can be, for example, truncated forms of apolipoprotein (apo) A-I, MSP1D1, MSP1E3D1, MSP2N2, MSP2N3, or MSP1D1dH5. Nanodiscs can be about 7-17 nm in diameter depending on the type of MSP used. MSPs can be derived from for example, mouse, rat or human apo A-1 proteins. Use of mouse or rat apo A-1 proteins can improve antibody specificity when human HIV-1 Env target protein-nanodisc complexes are used for immunization.
Nanodiscs can be used to reconstitute HIV-1 Env in an artificial environment resembling the native membrane. These nanodisc-stabilized proteins can be directly purified by standard chromatographic procedures. The resulting purified membrane protein:nanodisc complex can be used in screening applications that require access to both the physiologically intracellular and extracellular surfaces of the protein and thus allows unrestricted access of antagonists, agonists, and other interaction partners. The nanodiscs can also be used as an HIV-1 Env immunogen or HIV-1 Env vaccine.
Nanodiscs can be made using cell-free expression systems. An HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex, portions thereof or combinations thereof can be expressed from, for example, a plasmid. Pre-assembled nanodiscs are supplied in the mixture that integrate the nascent HIV-1 Env protein. Nanodiscs can also be made using a two-step reconstitution of detergent-solubilized proteins. Purified HIV-1 Env proteins, fragments thereof, HIV-1 Env trimeric complexes, portions thereof or combinations thereof are combined with a suitable detergent, and membrane scaffold proteins and phospholipids are added. Nanodiscs containing the membrane protein form spontaneously, and can be purified by affinity or size exclusion chromatography. Nanodiscs can also be made via direct solubilization from membranes expressing HIV-1 Env proteins, fragments thereof. HIV-1 Env trimeric complexes, portions thereof, or combinations thereof. Starting from membranes expressing the HIV-1 Env protein, detergent and membrane scaffold protein are added. Membrane phospholipids, HIV-1 Env protein and MSP assemble to form the nanodisc complex.
Phospholipids such as dimyristoyl-glycero-phosphocholine (DMPC), palmitoyl-oleoyl-phosphatidylcholine (POPC), and many other phospholipids can be used when making nanodiscs.
An embodiment provides a nanodisc comprising one or more of HIV-1 Env proteins, fragments thereof, HIV-1 Env trimeric complexes, portions thereof, or combinations thereof as described herein, a membrane scaffold protein; and one or more phospholipids. In an embodiment, the nanodiscs can be used to screen test agents or compounds for binding or biological activity. In an embodiment, the nanodiscs can be used as an immunogen or vaccine.
A pharmaceutical composition can comprise a mutant HIV-1 Env protein or fragment thereof (which comprise at least one mutation described herein), an HIV-1 Env trimeric complex comprising at least on mutant HIV-1 Env protein or portion thereof (which comprises at least one mutation described herein), nucleic acid molecule as described herein, a vector as described herein, a host cell as described herein, or a combination thereof combined with a pharmaceutically acceptable carrier.
Pharmaceutically acceptable carriers are well known in the art. See e.g., Remington's Pharmaceutical Sciences, by E. W. Martin, Mack Publishing Co., Easton, Pa., 19th Edition, 1995, which describes compositions and formulations suitable for pharmaceutical delivery.
In general, the nature of the carrier will depend on the particular mode of administration being employed. For instance, parenteral formulations usually comprise injectable fluids that include pharmaceutically and physiologically acceptable fluids such as water, physiological saline, balanced salt solutions, aqueous dextrose, glycerol or the like as a vehicle. For solid compositions (e.g., powder, pill, tablet, or capsule forms), conventional non-toxic solid carriers can include, for example, pharmaceutical grades of mannitol, lactose, starch, or magnesium stearate. In addition to biologically neutral carriers, pharmaceutical compositions to be administered can contain minor amounts of non-toxic auxiliary substances, such as wetting or emulsifying agents, preservatives, and pH buffering agents and the like, for example sodium acetate or sorbitan monolaurate. In particular embodiments, a carrier may be sterile, and/or suspended or otherwise contained in a unit dosage form containing one or more measured doses of the composition suitable to induce the desired anti-HIV-1 immune response. It may also be accompanied by medications for its use for treatment purposes. The unit dosage form may be, for example, in a sealed vial that contains sterile contents or a syringe for injection into a subject, or lyophilized for subsequent solubilization and administration or in a solid or controlled release dosage.
A pharmaceutical composition can comprise an adjuvant to enhance antigenicity. An adjuvant can comprise, for example, a suspension of minerals (alum, aluminum hydroxide, or phosphate); or water-in-oil emulsion, for example, in which antigen solution is emulsified in mineral oil (Freund incomplete adjuvant), sometimes with the inclusion of killed mycobacteria (Freund's complete adjuvant) to further enhance antigenicity (inhibits degradation of antigen and/or causes influx of macrophages). Immunostimulatory oligonucleotides (such as those including a CpG motif) can also be used as adjuvants. Adjuvants can include biological molecules, such as costimulatory molecules. Exemplary adjuvants include IL-2, RANTES, GM-CSF, TNF-α, IFN-γ, G-CSF, LFA-3, CD72, B7-1, B7-2, OX-40L, SA-4-1 BBL and toll-like receptor (TLR) agonists, such as TLR-9 agonists. Adjuvants are well known in the art. See, e.g., Singh (ed.) Vaccine Adjuvants and Delivery Systems. Wiley-Interscience, 2007).
A pharmaceutical composition can comprise a detergent, such as a non-ionic detergent (e.g., a polyethylene type detergent).
An embodiment provides a method for eliciting an immune response against an HIV-1 infected cell in a subject (e.g., a mammal such as a human) comprising administering to the subject a therapeutically effective amount of an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex, or portion thereof comprising one or more mutations described herein such that an immune response is elicited in the subject. In an embodiment a pharmaceutical composition as described herein can be used to elicit an immune response against an HIV-1 infected cell in a subject.
An embodiment provides a method for delaying the onset of, or slowing the rate of progression of, an HIV-1-related disease or symptom in an HIV-1-infected subject. Examples of symptoms of diseases (e.g., AIDS) include, for example, fever, muscle aches, coughing, sneezing, runny nose, sore throat, headache, chills, diarrhea, vomiting, rash, weakness, dizziness, bleeding under the skin, in internal organs, or from body orifices like the mouth, eyes, or ears, shock, nervous system malfunction, delirium, seizures, renal failure, personality changes, neck stiffness, dehydration, seizures, lethargy, paralysis of the limbs, confusion, back pain, loss of sensation, impaired bladder and bowel function, and sleepiness that can progress into coma or death.
The method comprises administering to the subject (e.g., a mammal such as a human) a therapeutically effective amount of an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex or portion thereof comprising one or more mutations described herein such that the onset of an HIV-1 disease or symptom is delayed or the progression of HIV-1 symptoms are slowed in comparison to an HIV-1 positive subject who does not receive the administration. In an embodiment a pharmaceutical composition as described herein can be used in the methods.
A therapeutically effective amount refers to an amount of a composition described herein that, when administered to a subject for treating a disease or disorder or at least one symptom of the disease or disorder, is sufficient to affect such disease, disorder, or symptom. A therapeutically effective amount can vary depending, for example, on the composition that is administered, the disease, disorder, and/or symptoms of the disease or disorder, severity of the disease, disorder, and/or symptoms of the disease or disorder, the age, weight, and/or health of the subject to be treated, the capacity of the individual's immune system to synthesize antibodies, the degree of protection desired, the formulation of the composition, the treating doctor's assessment of the medical situation, and other relevant factors. An appropriate amount in any given instance can be readily ascertained by those skilled in the art or capable of determination by routine experimentation. It is expected that the amount will fall in a relatively broad range that can be determined through routine trials. For example, in the context of administering a therapeutic agent (e.g., HIV-Env protein, fragments thereof, HIV-1 Env trimeric complexes, portions thereof, and pharmaceutical compositions described herein), the therapeutically effective amount is an amount sufficient to achieve a reduction in the level of HIV (e.g., as measured by a stabilization or decrease in HIV titer compared to a non-treated control), and/or an increase in the level of neutralizing anti-HIV antisera (e.g., as measured by an increase in serum neutralizing antibody levels relative to a non-treated control in a luciferase-based virus neutralization assay) as compared to a response obtained without administration of a therapeutic agent described herein, and/or to prevent the propagation of a HIV-1 in a subject (e.g., a human) having an increased risk of viral infection. I
In an embodiment, a therapeutically effective amount provides a therapeutic effect without causing a substantial cytotoxic effect in the subject. In general, a therapeutically effective amount of a composition administered to a subject (e.g., a human) will vary depending upon a number of factors associated with that subject, for example the overall health of the subject, the condition to be treated, or the severity of the condition. A therapeutically effective amount of a composition can be determined by varying the dosage of the product and measuring the resulting therapeutic response.
Administering means giving a dosage of a pharmaceutical composition as described herein to a subject. Administration can be done, for example, intramuscularly, intravenously, intradermally, percutaneously, intraarterially, intraperitoneally, intralesionally, intracranially, intraarticulariy, intraprostatically, intrapleurally, intratracheally, intranasally, intravitreally, intravaginally, intrarectally, topically, intratumorally, peritoneally, subcutaneously, subconjunctivally, intravesicularily, mucosally, intrapericardially, intraumbilically, intraocularly, orally, topically, locally, by inhalation, by injection, by infusion, by continuous infusion, by localized perfusion bathing target cells directly, by catheter, by lavage, by gavage, in creams, or in lipid compositions.
In an embodiment an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex, portion thereof, or pharmaceutical composition induces a neutralizing immune response or a broadly neutralizing immune response to HIV-1 Env in the subject
In an embodiment an administration of the compositions described herein to a subject causes a reduction or decrease of an HIV-mediated activity (e.g., infection, fusion (e.g., target cell entry and/or syncytia formation), viral spread, etc.) and/or a decrease in viral titer). HIV-mediated activity and/or HIV titer may be decreased by about 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, 99% or more compared to that of a control subject (e.g., an untreated subject or a subject treated with a placebo).
An embodiment provides a vaccine, which is an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex, portion thereof, or pharmaceutical composition as described herein that can provoke an immune response. Administration of a vaccine to a subject can confer at least some protective immunity against HIV-1 infection (e.g., enhancement of resistance to new infection, complete resistance to new infection, or reduction or elimination of clinical severity of the disease or symptoms).
An embodiment provides a method for preventing a subject (e.g., a mammal such as a human) from becoming infected with HIV-1. The method comprises administering to the subject a prophylactically effective amount of an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex, or portion thereof comprising one or more mutations described herein such that a protective immune response is elicited in the subject. In an embodiment a pharmaceutical composition as described herein can be used to elicit a protective immune response. The subject can be prevented from becoming infected with at least 1, 2, 3, 4, 5, 6, or more strains of HIV-1.
A prophylactically effective amount is any amount of an agent (e.g., an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex, portion thereof or pharmaceutical composition described herein) which, when administered to a subject prone to suffer from a disease or disorder, inhibits or prevents the onset of the disease or disorder. The prophylactically effective amount will vary with the subject being treated, the condition to be treated, the agent delivered, and the route of delivery. A person of ordinary skill in the art can perform routine titration experiments to determine such an amount. A prophylactic treatment is a treatment administered to a subject who does not exhibit signs of a disease or exhibits only early signs for the purpose of decreasing the risk of developing pathology.
A protective immunological response or protective immunity can be demonstrated by either a reduction or lack of clinical signs or symptoms normally displayed by an infected host, a quicker recovery time, and/or a lowered duration of infectivity or lowered pathogen titer in the tissues or body fluids or excretions of the infected host.
An embodiment provides a method for reducing the likelihood of a subject becoming infected with HIV-1. The method comprises administering to the subject an amount effective to reduce the likelihood of the subject becoming infected with HIV-1 of an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex or portion thereof comprising one or more mutations described herein such that an immune response is elicited in the subject. In an embodiment a pharmaceutical composition as described herein can be used to elicit an immune response. In an embodiment, the subject has been exposed to HIV-1.
An embodiment provides a DNA vaccine comprising one or more of the nucleic acid molecules encoding an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex or portion thereof, wherein then protein, fragment, complex or portion comprises one or more of the mutations described herein together with a pharmaceutically acceptable adjuvant. A DNA vaccine comprises genetically engineered DNA that is delivered directly to cells to produce an HIV-1 Env antigen (e.g., an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex or portion thereof as described herein), such that an immune response is generated. In an embodiment, the immune response is a protective immune response.
An embodiment provides a recombinant viral vector vaccine, which comprises a recombinant viral vector comprising one or more of the nucleic acid molecules encoding an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex or portion thereof, wherein then protein, fragment, complex or portion comprises one or more of the mutations described herein and a pharmaceutical acceptable adjuvant. The recombinant viral vector can be, for example, vaccinia vector, adenovirus vector, adena-associated virus vector, sendai virus vector, herpes simplex virus vector, human papillomavirus vector, retroviral vector or other viral vector. A recombinant viral vector can be a replicative viral vector.
An embodiment provides a recombinant bacterial vector vaccine, which comprises a recombinant bacterial vector comprising one or more of the nucleic acid molecules encoding an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex or portion thereof, wherein then protein, fragment, complex or portion comprises one or more of the mutations described herein and a pharmaceutical acceptable adjuvant.
Ex vivo transfection or transduction of cells can also be used to deliver the HIV-1 Env proteins, fragments thereof, HIV-1 Env trimeric complexes or portions thereof to a subject. The cells can be delivered into a subject to allow for the expression of one or more of the HIV-1 Env proteins, fragments thereof, trimeric complexes or portions thereof described herein. In embodiments, cells can be autologous or heterologous to the treated subject. Cells can be transfected or transduced ex vivo with, for example, one or more vectors or nucleic acid molecules described herein to allow for the temporal or permanent expression of one or more of the HIV-1 Env proteins, fragments thereof, HIV-1 Env trimeric complexes, or portions thereof in the treated subject. Once the modified cells are administered to the subject, the one or more vectors or nucleic acid molecules will be expressed, eliciting protective or therapeutic immune responses directed against the HIV-1 Env proteins or HIV-1 Env trimeric complexes.
Cells that can be isolated and transfected or transduced ex vivo can be, for example, blood cells, skin cells, fibroblasts, endothelial cells, skeletal muscle cells, hepatocytes, prostate epithelial cells, vascular endothelial cells, and totipotent, pluripotent, multipotent, or unipotent stem cells.
In an embodiment the HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex, portion thereof, or pharmaceutical composition can be administered as part of a prime-boost regimen. In an embodiment, the immune response triggered by a single administration (prime) of a composition described herein may not be sufficiently potent and/or persistent to provide effective protection. Therefore, repeated administration (boost), such that a prime-boost regimen is established, may significantly enhance humoral and cellular responses to the antigen(s) of the composition.
The booster is administered to the subject after the primer. A skilled artisan will understand a suitable time interval between administration of the primer composition and the booster composition. In some embodiments, the primer composition, the booster composition, or both primer composition and the booster composition additionally include an adjuvant. In one embodiment, the primer composition is a DNA-based vaccine (or other vaccine based on gene delivery), and the booster composition is a protein-based vaccine.
HIV-1 Env proteins, fragments thereof, HIV-1 Env trimeric complexes, portions thereof, or pharmaceutical compositions can be administered in a prophylactically effective amount or a therapeutically effective amount that provides an immunogenic response and/or protective effect against HIV-1. A protein composition can be administered at between about 1 μg and about 1 mg of protein, or between about 50 μg and about 300 μg of protein.
A viral vector capable of expressing HIV-1 Env proteins, fragments thereof, HIV-1 Env trimeric complexes or portions thereof can be administered at least about 1×103 viral particles (vp)/dose, between about 1×101 and about 1×1014 vp/dose, between about 1×103 and about 1×1012 vp/dose, or between about 1×105 and about 1×1011 vp/dose.
Levels of induced immunity or immune response provided by the compositions can be monitored by, for example, measuring amounts of HIV-1 neutralizing anti-HIV antibodies. The dosages may then be adjusted or repeated as necessary to trigger the desired level of immune response.
Where a subject is administered an HIV-1 protein, fragment thereof, HIV-1 Env trimeric complex, portion thereof, or pharmaceutical composition the efficacy of treatment can be determined by monitoring the level of the HIV-1 Env proteins, fragments thereof, HIV-1 Env trimeric complexes or portions thereof expressed by or present in a subject following administration. For example, the blood or lymph of a subject can be tested for the HIV-1 Env proteins, using for example, standard assays known in the art (see, e.g., Human Interferon-Alpha Multi-Species ELISA kit and Human Interferon-Alpha Serum Sample kit from Pestka Biomedical Laboratories (PBL), Piscataway, N.J.).
A single dose of one or more of the pharmaceutical compositions described herein can achieve therapy in subjects. Multiple doses (e.g., 2, 3, 4, 5, or more doses) can also be administered to subjects.
HIV-1 proteins, fragments thereof, HIV-1 Env trimeric complexes, portions thereof or pharmaceutical compositions can be administered, for example, every 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20, 30, 40, 45, 50, 55, or 60 minutes, 2, 4, 6, 10, 15, or 24 hours, 2, 3, 5, or 7 days, 2, 4, 6 or 8 weeks, or even 3, 4, or 6 months pre-exposure or pre-diagnosis, or may be administered to the subject every 15-30 minutes or 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 20, 24, 48, or 72 hours, 2, 3, 5, or 7 days, 2, 4, 6 or 8 weeks, 3, 4, 6, or 9 months, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 15, 20 years or longer post-diagnosis or post-exposure or to HIV-1. A subject can be administered one or more doses of the HIV-1 proteins, fragments thereof, HIV-1 Env trimeric complexes, portions thereof or pharmaceutical compositions once daily, weekly, monthly, or yearly. When treating an HIV-1 infection, the compositions can be administered to the subject either before the occurrence of symptoms of an HIV infection or disease/syndrome (e.g., acquired immune deficiency syndrome (AIDS)) or a definitive diagnosis, or after diagnosis or symptoms become evident. HIV-1 proteins, fragments thereof, HIV-1 Env trimeric complexes, portions thereof or pharmaceutical compositions can be administered, for example, immediately after diagnosis or the clinical recognition of symptoms or every 2, 4, 6, 10, 15, or 24 hours, 2, 3, 5, or 7 days, 2, 4, 6 or 8 weeks, or even 3, 4, or 6 months after diagnosis or detection of symptoms.
An embodiment provides antibodies (i.e., an immunoglobulin or an antigen-binding fragment) that specifically bind and recognize an HIV-1 Env protein, an antigenic fragment thereof, or an HIV-1 Env trimeric complex or antigenic fragment thereof. The HIV-1 Env protein, antigenic fragment thereof or HIV-1 Env trimeric complex or antigenic fragment thereof comprise one or more of the mutations described herein. In an embodiment, antibodies are provided that specifically bind to an HIV-1 Env trimeric complex in a closed conformation, wherein the HIV-1 Env trimeric complex comprises one or more of the mutations described herein.
Antibodies include monoclonal antibodies, polyclonal antibodies, multispecific antibodies (e.g., bispecific antibodies), and antibody fragments, as long as they have antigen-binding activity. In an embodiment, antibodies and fragments thereof can be chimeric antibodies (see, e.g., U.S. Pat. No. 5,482,856), humanized antibodies (see, e.g., Jones et al., Nature 321:522 (1986); Reichmann et al., Nature 332:323 (1988)); Presta, Curr. Op. Struct. Biol. 2:593 (1992)), or human antibodies. Human antibodies can be made by, for example, direct immortalization, phage display, transgenic mice, or a Trimera methodology, see e.g., Reisener et al., Trends Biotechnol. 16:242-246 (1998).
Examples of antibody fragments include but are not limited to Fv, Fab, Fab′, Fab′-SH, F(ab′)2; diabodies; linear antibodies; single-chain antibody molecules (e.g. scFv); and multispecific antibodies formed from antibody fragments. Antibody fragments include antigen binding fragments either produced by the modification of whole antibodies or those synthesized de novo using recombinant DNA methodologies. See, e.g., Kontermann & Dubel (Ed), Antibody Engineering, Vols. 1-2, 2nd Ed., Springer Press, 2010).
In an embodiment, an antibody or specific binding fragment thereof that specifically binds HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex, or portion thereof having one or more mutations described herein has a broader and higher neutralization activity to HIV-1 virus when compared to an antibody or fragment produced by induction with a wild-type (i.e., naturally occurring) HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex, or portion thereof.
An embodiment provides methods of identifying antibodies that specifically bind to HIV-1 Env proteins or HIV-1 Env trimeric complexes described herein. An effective amount of an immunogen is administered to B cells in an in vitro cell culture system to generate antibodies that neutralize HIV-1 virus. An effective amount is an amount that is sufficient to generate antibodies specific for HIV-1 Env protein, fragment thereof, HIV-1 trimeric complex, or fragment thereof. In an embodiment the HIV-1 virus is heterologous to the virus strain or subtype from which the immunogen was derived. That is, use of the immunogen can generate antibodies that specifically bind HIV-1 Env proteins and HIV-1 Env trimeric complexes from more than one clade. An immunogen can be an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex, or portion thereof described herein. An immunogen can also be a nucleic acid molecule, vector, host cell, or pharmaceutical composition described herein.
In an embodiment, antibodies neutralize HIV-1 or broadly neutralize HIV-1. Broadly neutralizing Abs (BnAbs) can neutralize infection of a large spectrum of genetically diverse HIV-1 viruses. BnAbs can reduce the infectious titer of HIV-1 by binding to and inhibiting the function of related HIV-1 Env antigens. Related HIV-1 Env antigens share at least about 85%, 90%, 95%, 96%, 97%, 98%, 99% or more sequence identity with an antigenic surface HIV-1 Env. In some embodiments, broadly neutralizing antibodies to HIV-1 Env are distinct from neutralizing antibodies to HIV-1 Env in that they neutralize with an ID50>40 a high percentage (e.g., about 50%, 60%, 70%, 80% or more) of the many types of HIV-1 in circulation. In an embodiment, a BnAb can neutralize the function of HIV-1 Env from more than one clade. Therefore, broadly neutralizing antibodies to HIV-1 Env are distinct from other antibodies to HIV-1 Env in that they neutralize a high percentage of the many types of HIV in circulation.
Examples of broadly neutralizing antibodies include b12 and VRC01, which bind to the CD4 binding site of gp120; 2F5 and 4E10, which bind to the membrane-proximal external region (MPER) of gp41; and PG9 and PG16, which bind to the V1V2 domains of the trimer. Other examples include PGT122 and 35022.
Methods to assay for neutralization activity are well known in the art and include, but are not limited to, plaque reduction neutralization (PRNT) assays, microneutralization assays, flow cytometry based assays, single-cycle infection assays (see e.g., Martin et al. (2003) Nature Biotechnology 21:71-76), and pseudovirus neutralization assays (see e.g., Georgiev et al. (Science, 340, 751-756, 2013), Seaman et al. (J. Virol., 84, 1439-1452, 2005), Mascola et al. (J. Virol., 79, 10103-10107, 2005).
An antibody can be made in vivo in suitable laboratory animals or in vitro using recombinant DNA techniques. Means for preparing and characterizing antibodies are well known in the art. See, e.g., Dean, Methods Mol. Biol. 80:23-37 (1998); Dean, Methods Mol. Biol. 32:361-79 (1994); Baileg, Methods Mol. Biol. 32:381-88 (1994); Gullick, Methods Mol. Biol. 32:389-99 (1994); Drenckhahn et al. Methods Cell. Biol. 37:7-56 (1993); Morrison, Ann. Rev. Immunol. 10:239-65 (1992); Wright et al. Crit. Rev. Immunol. 12:125-68 (1992).
“Specifically binds” or “specific for” means that a first antigen, e.g., HIV-1 Env, a portion thereof, a HIV-1 Env trimeric complex or portions thereof recognizes and binds to an antibody or antigen binding fragment thereof with greater affinity than other non-specific molecules. A non-specific molecule is an antigen that shares no common epitope with the first antigen. In embodiments a non-specific molecule is not an HIV-1 Env and is not related to HIV-1 Env. For example, an antibody raised against a first antigen (e.g., HIV-1 Env) to which it binds more efficiently than to a non-specific antigen can be described as specifically binding to the first antigen. In embodiments an antibody or antigen-binding fragment thereof specifically binds to an HIV-1 Env, HIV-1 Env trimeric complex, or portion thereof when it binds with a binding affinity Ka of 107 l/mol or more. An antibody or binding fragment thereof can specifically bind to an HIV-1 Env protein, portion thereof, HIV-1 Env trimeric complex or portion thereof when the interaction has a KD of less than 10−6 Molar, such as less than 10−8 Molar, less than 10−8 Molar, less than 10−9, or even less than 10−10 Molar.
In an embodiment an antibody or antigen binding fragment can specifically bind to HIV-1 Env or trimeric complexes of HIV-1 Env from two or more clades.
Specific binding can be tested using, for example, an enzyme-linked immunosorbant assay (ELISA), a radioimmunoassay (RIA), or a western blot assay using methodology well known in the art.
In an embodiment an antibody or fragment thereof specifically binds to an HIV-1 Env protein, portion thereof, HIV-1 Env trimeric complex or portion thereof in the presence of a heterogeneous population of proteins and other biologics. Thus, an antibody or fragment thereof specifically binds to a particular target HIV-1 Env protein and does not bind in a significant amount to other proteins or polysaccharides present in the sample or subject.
An embodiment provides a method of treatment of HIV-1 related disease or symptom thereof comprising administrating one or more of the neutralizing or broadly neutralizing antibodies described above to a subject (e.g., a mammal such as a human).
An embodiment provides a method of enhancing the binding of an antibody to an HIV-1 Env trimeric complex or portion thereof. The method comprises making one or more amino acid mutations described herein to one or more HIV-1 Env proteins such that the HIV-1 Env trimeric complex or portion thereof comprises one or more of the mutations. An antibody specific for HIV-1 Env is contacted with the HIV-1 Env trimeric complex or portion thereof comprising one or more of the mutations. Binding of the antibody is enhanced as compared to binding of the antibody to an HIV-1 Env trimeric complex or portion thereof that does not comprise one or more of the mutations described herein.
An embodiment provides methods for isolating antibodies that specifically bind to a HIV-1 Env protein or fragment thereof comprising one or more of the mutations described herein or an HIV-1 trimeric complex or portion thereof comprising one or more of the mutations described herein. An effective amount of an immunogen is administered to a subject, such as a mammal and antibodies are isolated. An effective amount is an amount sufficient to elicit antibodies to the HIV-1 Env immunogen.
An embodiment provides a method of identifying antibodies that specifically bind to an HIV-1 Env protein or fragment thereof comprising one or more of the mutations described herein or an HIV-1 Env trimeric complex or fragment thereof comprising one or more of the mutations described herein. Methods comprise, for example, administering an effective amount of an immunogen to B cells in an in vitro cell culture system to generate antibodies that specifically bind to the HIV-1 Env protein or the HIV-1 Env trimeric complex and isolating antibodies specific for the administered complex or composition. An effective amount is an amount sufficient to generate antibodies to the HIV-1 Env immunogen.
An embodiment provides a method of making or screening for an isolated hybridoma that produces a broadly neutralizing antibody that specifically binds to an HIV-1 Env protein or fragment thereof comprising one or more of the mutations described herein or an HIV-Env trimeric complex or portion thereof comprising one or more of the mutations described herein. The method comprises immunizing a mammal with an effective amount of an immunogen as described herein and isolating splenocytes from the immunized mammal. An effective amount is an amount sufficient to elicit antibodies to the HIV-1 Env immunogen. The isolated splenocytes are fused with an immortalized cell line to form hybridomas. Individual hybridomas are screened for production of an antibody that specifically binds with said trimeric complex or protein thereof to isolate the hybridoma.
In all methods described above, an immunogen can be, for example, an HIV-1 Env trimeric complex or portion thereof comprising one or more of the mutations described herein; an HIV-1 Env protein or fragment thereof comprising one or more mutations described herein; a nucleic acid molecule, vector, or host cell encoding or capable of expressing an HIV-1 Env protein, fragment thereof, HIV-1 Env trimeric complex or portion thereof comprising one or more of the mutations described herein.
The compositions and methods are more particularly described below and the Examples set forth herein are intended as illustrative only, as numerous modifications and variations therein will be apparent to those skilled in the art. As used in the description herein and throughout the claims that follow, the meaning of “a”, “an”, and “the” includes plural reference unless the context clearly dictates otherwise. The term “about” in association with a numerical value means that the value varies up or down by 5%. For example, for a value of about 100, means 95 to 105 (or any value between 95 and 105).
The terms used in the specification generally have their ordinary meanings in the art, within the context of the compositions and methods described herein, and in the specific context where each term is used. Some terms have been more specifically defined below to provide additional guidance to the practitioner regarding the description of the compositions and methods.
All patents, patent applications, and other scientific or technical writings referred to anywhere herein are incorporated by reference herein in their entirety. The embodiments illustratively described herein suitably can be practiced in the absence of any element or elements, limitation or limitations that are not specifically disclosed herein. Thus, for example, in each instance herein any of the terms “comprising”, “consisting essentially of”, and “consisting of” may be replaced with either of the other two terms, while retaining their ordinary meanings. The terms and expressions which have been employed are used as terms of description and not of limitation, and there is no intention that in the use of such terms and expressions of excluding any equivalents of the features shown and described or portions thereof, but it is recognized that various modifications are possible within the scope of the invention claimed. Thus, it should be understood that although the present invention has been specifically disclosed by embodiments, optional features, modification and variation of the concepts herein disclosed may be resorted to by those skilled in the art, and that such modifications and variations are considered to be within the scope of this invention as defined by the description and the appended claims.
Whenever a range is given in the specification, for example, a temperature range, a time range, or a composition or concentration range, all intermediate ranges and subranges, as well as all individual values included in the ranges given are intended to be included in the disclosure. It will be understood that any subranges or individual values in a range or subrange that are included in the description herein can be excluded from the aspects herein.
In addition, where features or aspects of the invention are described in terms of Markush groups or other grouping of alternatives, those skilled in the art will recognize that the invention is also thereby described in terms of any individual member or subgroup of members of the Markush group or other group.
The following are provided for exemplification purposes only and are not intended to limit the scope of the invention described in broad terms above.
A. Tissue Culture
A derivative of Expi293F cells was used in which CXCR4 expression was knocked out47; these cells do not express CD4, CCR5, or CXCR4 receptors. Cells were cultured in Expi293 Expression Medium (Life Technologies), 8% CO2, 37° C., at 125 rpm, and transfected with Expifectamine (Life Technologies). Unless otherwise stated, for testing of targeted mutants, 500 ng plasmid DNA was transfected per 2×106/mL of cells, and ExpiFectamine Transfection Enhancers 1 and 2 (Life Technologies) were added 18 h later. Cells were analyzed 24-30 h post-transfection. For transfecting libraries such that, on average, cells acquire no more than one coding sequence47, 1 ng/ml library DNA was diluted with 1.5 μg/ml pCEP4-ΔCMV carrier DNA, and transfected using Expifectamine into CXCR4-knockout Expi293F cells at a density of 2×106 cells/ml. The medium was replaced 2 h post-transfection, and cells were prepared for sorting 24-26 h post-transfection. The pCEP4-ΔCMV carrier plasmid was generated by digesting pCEP4 with Sail and ligating the vector backbone back together, effectively removing the CMV promoter, multiple cloning site, and SV40 polyadenylation sequence, but maintaining the EBNA1 gene and oriP replication origin.
B. BaL gp160 Library Generation
A synthetic, codon-optimized Env gene from the BaL HIV-1 isolate (GenBank Accession No. AAA44191.1) was generated from gBlocks (Integrated DNA Technologies) and cloned into the Nhel-XhoI sites of pCEP4 (Invitrogen). The gene encodes EnvBaL residues E31-L856 (numbering based on the HXB2 reference strain) fused to an N-terminal CD5 leader peptide (sequence MPMGSLQPLATLYLLGMLVASVLA). When transfecting cells under conditions that yielded a single coding variant per cell, a pCEP4 derivative vector (pCEP4-intron) containing a strong 5′ chimeric intron was used for enhanced expression. This was created by cloning the intron from plasmid pRL-SV40 (Promega) into the Kpnl-Nhel sites of pCEP4. These plasmids are deposited with Addgene.
Single site-saturation mutagenesis (SSM) libraries were generated by overlap extension PCR76. Three separate SSM libraries were constructed focused on the Env N-terminus (a.a. 31-265; Library A), center (a.a. 266-529; Library B), and C-terminus (a.a. 530-856; Library C). The PCR products were cloned by restriction enzyme digestion and ligation into the Nhel-Bglll (Library A), BamHI-NotI (Library B), and PstI-XhoI (Library C) sites of BaLgp160 inserted into the Nhel-XhoI sites of pcDNA3.1(+) (Invitrogen), with the vector PstI and BgIII sites removed by QuikChange (Agilent) mutagenesis. Ligations were transformed into NEB 5-α electrocompetent E. coli (New England Biolabs), and plasmid DNA for each library was prepared using GeneJET Maxiprep Kit (Thermo Scientific). Following library construction in the pcDNA3.1(+) vector, the full-length diversified BaLgp160 library inserts was subcloned into the Nhel-XhoI sites of pCEP4-intron. At all cloning steps, the number of transformants was at least an order of magnitude greater than the possible library diversity. Combined, the three BaL gp160 SSM libraries covered 16,332 out of 16,520 possible single amino acid mutations, based on a minimum frequency of 5.7×10−6 (corresponding to approximately 10 reads) in the deep sequenced plasmid libraries.
C. DU422 gp140 Library Construction
A synthetic, codon-optimized gp160 gene from the DU422 HIV-1 isolate (GenBank Accession No. ABD83641.1) was cloned into the Nhel-XhoI sites of pCEP4 (Invitrogen). This plasmid is deposited with Addgene (#100926). Using PCR-based assembly, the gp140 ectodomain (a.a. N31-N677, HXB2 reference numbering) was fused to a C-terminal gly/ser-rich linker, 6his tag, and the transmembrane helix of HLA class I α chain for surface display, and inserted into the Nhel-XhoI sites of pCEP4.
Overlap extension PCR was used to create the SSM libraries76. Three separate SSM libraries were constructed focused on the gp140DU422 N-terminus (a.a. 31-279; NT library), center (a.a. 280-577; central library), and C-terminus (a.a. 578-677; CT library). Mutagenized PCR segments were ligated into the Nhel-Pfl23II (NT library), SbfI-HindIII (central library), or Pfl23II-Xho1 (CT library) sites of pCEP4-gp140DU422. Ligations were electroporated into NEB 5-β E. coli (New England Biolabs), and library plasmid DNA was purified using a GeneJET Maxiprep Kit (Thermo Scientific). Based on a minimum frequency of 5×10−6, all possible 12,820 single amino acid substitutions were present in the three SSM libraries.
D. Sorting BaL gp160 Libraries for Binding to sCD4, VRC01 and PG16
To evolve Env variants for high binding signals to soluble CD4 (sCD4), cells were harvested 24-26 hours following transfection with library DNA as described above. Cells were centrifuged at 500 g for 1 min at 4° C., the pellets were washed with cold phosphate buffered saline supplemented with 0.2% BSA (PBS-BSA), and incubated on ice for 40 minutes with 200 nM sCD4-183 (domains D1-D2). Soluble recombinant sCD4-183 was obtained through the NIH AIDS Reagent Program, Division of AIDS, NIAID, from Progenics. After incubation, cells were washed twice with cold PBS-BSA, incubated on ice for 30 minutes with FITC-conjugated anti-CD4 (clone M-T441, LifeSpan BioSciences, 1/200 dilution), washed twice, and resuspended in PBS-BSA.
To evolve the Env libraries for high binding signals to VRC01 or PG16, cells were washed with cold PBS-BSA 24-26 hours post-transfection, incubated on ice for 40 minutes with either 5 nM VRC01 (obtained through the NIH AIDS Reagent Program from John Mascola)44 or 2 nM PG16 (obtained through the NIH AIDS Reagent Program)39, washed twice, incubated on ice for 30 minutes with APC-conjugated anti-human IgG Fc antibody (BioLegend, clone HP6017, 1/300 dilution), washed twice more, and resuspended in PBS-BSA.
Labeled cells were sorted on a BD FACS Aria II at the Roy J. Carver Biotechnology Center. Immediately prior to sorting, 1 μg/ml propidium iodide was added to the cells, and dead cells that were propidium iodide-positive were removed during gating. Auto-fluorescent cells in either the APC channel (after staining with FITC-conjugated anti-CD4) or FITC channel (after staining with APC-conjugated anti-IgG Fc) were also removed during gating. In the sort gates for sCD4- or VRC01-bound cells, the highest 0.5% (Libraries A and B) and 1.0% (Library C) of FITC- or APC-positive cells were collected, respectively. In the sort gates for PG16-bound cells, the highest 0.6% (Libraries A and B) and 0.9% (Library C) of APC-positive cells were collected. These gated cell populations in the respective libraries had similar binding signals. Sort conditions are listed in Table 1. Sorted cells were collected by centrifugation (400 g, 3 min, 4° C.) and frozen at −80° C. To maintain cell viability and mRNA quality during the experiment, samples were sorted for a maximum of 4 hours into tubes that had been coated overnight with fetal bovine serum. To collect greater numbers of cells than one 4-hour sort provided, libraries were prepared again and frozen sorted cell pellets from multiple days' experiments were pooled during RNA extraction. Each replicate typically required 8 hours of sorting per library.
E. FACS-Based Selection of the Cell Libraries Expressing DU422 gp140
Recombinant sCD4-183 (provided by Progenics) and PG1639 were obtained from the NIH AIDS Reagent Program. Division of AIDS. NIAID. To select gp140DU422 variants based on sCD4 binding, cells were centrifuged at 500 g, 2 min, 4° C., 24-26 h post-transfection. Cells were washed with cold phosphate buffered saline supplemented with 0.2% BSA (PBS-BSA), incubated on ice for 40 minutes with 10 nM sCD4 in PBS-BSA, washed twice, incubated on ice for 30 minutes with fluorescein isothiocyanate (FITC)-conjugated anti-CD4 (clone M-T441, LifeSpan BioSciences, 1/200 dilution), washed twice, and resuspended in PBS-BSA. To select gp140DU422 variants based on PG16 binding, transfected cells were washed with cold PBS-BSA, incubated on ice for 40 minutes with 3 nM PG16, washed twice, incubated for 30 minutes with allophycocyanin (APC)-conjugated anti-human IgG Fc antibody (BioLegend, clone HP6017, 1/250 dilution), washed twice more, and resuspended in PBS-BSA.
Labeled cells were sorted on a BD FACS Aria II at the Roy J. Carver Biotechnology Center, University of Illinois. Single cells were gated based on FSC/SSC, and dead cells that were positive for propidium iodide (added to a final concentration of 1 μg/ml) were excluded. Autofluorescent cells in the APC channel (after staining with FITC-conjugated anti-CD4) or FITC channel (after staining with APC-conjugated anti-IgG Fc) were also excluded. In the final sort gates, the highest 0.3% (NT library) or 0.4% (central and CT libraries) of FITC-positive cells were collected during the sCD4 binding selections. For sorting PG16-bound cells, the highest 0.6% of APC-positive cells were collected. Sorted cells were centrifuged at 400 g, 3 min, 4° C., and pellets were frozen at −80° C. To maintain cell viability and mRNA quality, samples were sorted for a maximum of 4 h into tubes that had been coated overnight with fetal bovine serum. To collect greater numbers of cells than one sort provided (a typical replicate required 6-8 h of sort time), libraries were freshly prepared and frozen cell pellets from multiple sort runs were pooled during RNA extraction. Sorting conditions are summarized in Table 6.
F. Deep Sequencing the BaL gp160 Libraries
Total RNA was extracted from sorted cells using a GeneJET RNA Purification Kit (Thermo Scientific), and first strand cDNA was synthesized with high-fidelity AccuScript reverse transcriptase (Agilent Technologies) primed with oligonucleotides that annealed downstream of the diversified regions: BALgp160_libA_RT_rev (5′-CGTTCAGCTGAACAATGATG-3) (SEQ ID NO:4) annealing to a.a. 284-289 of Library A, BALgp160_libB_RT_rev (5′-TGTTGGACAATACCAGACAG-3) (SEQ ID NO:5) annealing to a.a. 545-550 of Library B, and the EBV-Reverse sequencing primer annealing to the 3′-UTR for Library C. To generate fragments for deep sequencing, the cDNA was PCR amplified in two rounds. In the first round (18 thermocycles), primer overhangs added complementary sequences to the Illumina sequencing primers. In the second round (15 thermocycles), primer overhangs added barcodes and adaptor sequences for annealing to the Illumina flow cell. Thermocycling was kept to a minimum to reduce the introduction of PCR biases and errors. Each of the gp160 libraries was amplified as three overlapping fragments to achieve full sequencing coverage. DNA was sequenced at the UIUC Roy J. Carver Biotechnology Center on an Illumina MiSeq v3 (2×300nt kit) or HiSeq 2500 (2×250nt kit).
G. Deep Sequencing the DU422 gp140 Libraries
Total RNA was extracted from sorted cells using a GeneJET RNA Purification Kit (Thermo Scientific). For the central and CT libraries, first strand cDNA was synthesized with high-fidelity AccuScript (Agilent Technologies) primed with a gene-specific oligonucleotide and the EBV-Reverse sequencing primer (5′-GTGGTTTGTCCAAACTCATC-3′ (SEQ ID NO:6); anneals to the 3′-UTR). For the NT library, cDNA was reverse transcribed with SuperScript IV Vilo Master Mix (Thermo Scientific). The cDNA was PCR amplified in two rounds to generate fragments for Illumina sequencing. In the first round (18 thermocycles), primer overhangs added complementary sequences to the Illumina sequencing primers. In the second round (9 thermocycles for the NT library, and 15 thermocycles for the central and CT libraries), primer overhangs added barcodes and adaptor sequences for annealing to the Illumina flow cell. Thermocycling was minimized to reduce PCR biases and errors. The NT and central gp140 libraries were each amplified as two overlapping fragments to achieve full sequencing coverage. DNA was sequenced at the UIUC Roy J. Carver Biotechnology Center on an Illumina HiSeq 2500 (2×250nt kit).
H. Deep Sequencing Analysis
Deep sequencing data were analyzed with Enrich77. Log2 enrichment ratios of mutants were normalized by subtracting the enrichment of the wildtype sequence. Commands are available in the data deposition with NCBI's Gene Expression Omnibus78 under series accession number GSE102276.
I. Directed Evolution of an EnvBaL Combinatorial Library
The ectodomain of EnvBaL (a.a. E31-K677) was cloned downstream of a CD5 leader peptide and upstream from a gly/ser-rich linker fused to a 6his tag and the TM helix of HLA class I α chain for surface display. A combinatorial library, containing core mutations enriched in the sequence-activity landscape for PG16 binding, was synthesized by oligo assembly and cloned into the Nhel-XhoI sites of pCEP4-intron. Expi293F CXCR4-KO cells (2×105 cells/ml) were transfected using Expifectamine with 1 ng/ml library DNA and 1.5 μg/ml pCEP4-ΔCMV. Cells were harvested 24-26 h post-transfection, and stained with PG16 as described for the SSM library selection. Cells with the highest APC fluorescence signal (top 0.3%) were collected on a BD FACS Aria II sorter and frozen at −80° C. Total RNA was extracted from the frozen cell pellet using a GeneJET RNA Purification Kit (Thermo Scientific), and first strand cDNA was synthesized with SuperScript IV VILO Master Mix (Thermo Scientific) reverse transcriptase. The Env insert was PCR-amplified from the cDNA using Phusion (Thermo Scientific), and re-cloned into pCEP4-intron for another round of enrichment. The numbers of transformants during cloning steps were orders of magnitude greater than the possible library diversity of 9.216 variants, ensuring that all possible mutant combinations in the library were adequately sampled. The library selection was repeated three times. To increase stringency, the PG16 concentration was decreased from 2 nM (sort 1; 21,500 cells collected) to 1 nM (sort 2; 39,100 cells) to 0.5 nM (sort 3; 31,600 cells). After the third round of directed evolution, plasmid DNA from individual clones was purified and tested.
J. EnvBaL Mutants Binding to Antibodies
Highly enriched EnvBaL mutants identified in both replicate evolution experiments for PG16 binding were transfected into Expi293F CXCR4-knockout cells. Transfected cells were washed with cold PBS-BSA 24 h post-transfection, incubated on ice for 40 minutes with PG16 (2 nM or 0.5 nM), washed twice, incubated on ice for 30 minutes with APC-conjugated anti-human IgG Fc antibody (BioLegend, clone HP6017, 1/300 dilution), washed twice and analyzed on a BD LSR II flow cytometer. For titrating different antibodies, a 1:3 serial dilution of the antibody was prepared in a 96-well round-bottomed plate. Cells were incubated with the antibody at 4° C. on a rocker and washed in the 96-well plate as described above. 39F79,80 was provided by the NIH AIDS Reagent Program from Dr. James E. Robinson. 268-D IV81, 244282 and 307483,84 were provided by Dr. Susan Zolla-Pazner through the NIH AIDS Reagent Program.
K. EnvBaL Glycosylation Site Mutants Binding to sCD4 and VRC01
Transfected EnvBaL (WT, N262Q, S264T, and S264A) cells were washed with cold PBS-BSA 24 h post-transfection. 1:3 Serial dilutions of sCD4 and VRC01 in PBS-BSA were prepared in 96-well round-bottomed trays. The washed cells were incubated with the ligands at 4° C. on a rocker for 40 minutes, washed twice, incubated for 30 minutes with secondary antibody (1/200 FITC-anti-CD4 clone M-T441 from LifeSpan BioSciences, or 1/300 APC-anti-IgG clone HP6017 from BioLegend), washed twice, and analyzed on a BD LSR II flow cytometer.
L. EnvYU2, Env25711, EnvDU422, EnvQ769.d12 and EnvQ642.d12 Binding to Antibodies and sCD4
Human codon-optimized Env sequences from HIV-1 strains YU2, 25711, DU422, Q769.d22 and Q842.d12 were cloned into the Nhel-XhoI sites of pCEP4 from gBlocks (Integrated DNA Technologies). In all cases, the native signal peptides were substituted with the strong CD5 leader sequence. These plasmids are deposited with Addgene. Mutations were made by overlap extension PCR.
Expi293F CXCR4-knockout cells were transfected at a density of 2×106/ml, with 1 ml of cells per well of a 12-well tissue culture treated plate. Cells were transfected with Expifectamine (Life Technologies) using 1000 ng DNA per well. Eighteen hours post-transfection ExpiFectamine Transfection Enhancers 1 and 2 (Life Technologies) were added. Cells were harvested 42 h post-transfection, washed with cold PBS-BSA, incubated on ice for 40 minutes with PG16 at the indicated concentrations, washed twice, incubated on ice for 30 minutes with APC-anti-human IgG Fc antibody (BioLegend, clone HP6017, 1/300 dilution), washed twice, and analyzed on a BD LSR II flow cytometer. For titration curves with different antibodies, 1:3 serial dilutions of the antibody were prepared and binding was assayed in 96-well round-bottomed plates. PGT12158, PGT12868, and PGT14568 were provided by IAVI through the NIH AIDS Reagent Program.
For binding assays to soluble CD4, cells were transfected as just described and collected 42 h post-transfection. Cells were washed with cold PBS-BSA, incubated on ice for 40 minutes with 1:3 serial dilutions of sCD4-183 prepared in 96-well round-bottomed plates, washed twice, incubated on ice for 30 minutes with FITC-anti-CD4 antibody (clone M-T441, LifeSpan BioSciences, 1/200 dilution), washed twice, and analyzed on a BD LSR II flow cytometer.
M. Structural Modeling
Homology modeling of BaL Env was based on crystal structures of sequences from other HIV-1 strains. VRC01-bound gp140BaL in the closed conformation was modeled by threading the BaL strain sequence onto PDB 5FYK85 with glycans removed, and rebuilding missing loops and minimizing side chain conformations in FoldIt86. CD4-bound gp120BaL, was generated by threading the BaL sequence onto PDB 1GC156, and minimizing side chain and backbone conformations with FoldIt. For modeling open-state gp140BaL bound to CD4, the sequence of BaL Env was threaded onto PDB 5VN313, which was then minimized with C3 symmetry using ROSETTA87. The model of the PG16-bound apical epitope was generated by threading the BaL sequence onto PDB 4DQO60. Coordinates for the glycan on N160 were kept, and a single N-acetyl-D-glucosamine sugar was added to N156. The structure in
Homology modeling of DU422 Env was based on the crystal structures of JR-FL SOSIP.664 (PDB 5FYK85) and CD4-bound B41 SOSIP.664 (PDB 5VN313). Glycans were removed, the DU422 sequence was threaded on, and missing loops were rebuilt using FoldIt86. Side chain and backbone conformations were then minimized with C3 symmetry imposed around the trimer axis using xml scripting in ROSETTA87. Point substitutions were modeled using FoldIt with local side chain minimization. Images were rendered with the PyMOL Molecular Graphics System, Schrödinger, LLC.
N. Western Blot
Cells transfected with BaL gp160 (WT, QES01, or QES02) were harvested, centrifuged and resuspended in reducing SDS dye. Sonicated samples were run on a 4-20% gradient polyacrylamide gel (Bio-Rad). Gels were transferred to PVDF membrane using a BioRad Mini Trans-Blot apparatus. Blots were blocked in TBST (Tris-buffered saline, 0.1% TWEEN® polysorbate 20) containing 3% BSA, washed with TBST, stained with goat polydonal anti-HIV gp160 (1/500: Abcam ab117122) for 30 minutes at room temperature, washed in TBST, stained with donkey ant-goat IgG H&L alkaline phosphatase-conjugate (1/1000; Abcam ab97112) for 30 minutes at room temperature, washed and visualized with Thermo Scientific 1-Step NBT/BCIP Substrate Solution.
O. Syncytia Formation Assay
Expi293F CXCR4-knockout cells (2×106 cells/ml) were co-transfected using Expifectamine with untagged CD4 (50 ng/ml pCMV3-CD4, Sino Biological HG10400-UT) plus N-terminal myc-tagged CCR5 (450 ng/ml pCEP4-myc-CCR5). Partner cells for fusion were transfected with 500 ng/ml of pCEP4 plasmid encoding Env. For controls, the plasmid was replaced with empty vector. Five hours post-transfedion 0.2×106 receptor- and Env-expressing cells were mixed with 600 μl Expi293 Expression Medium in a poly-L-lysine-coated well of a 12-well tissue culture tray. Wells had been previously coated with 0.01% poly-L-lysine (Sigma) for 30 minutes at room temperature, and washed with water and Expi293 Expression Medium. Plates were incubated with no agitation at 37° C., 8% CO2.
For quantitation by flow cytometry, wells were washed 20 h later with warm PBS, cells were detached for 15 minutes at 37° C. with 0.25% trypsin-2.21 mM EDTA, washed with cold PBS-BSA, and analyzed on a BD LSR II flow cytometer. The positive gate was set at 1% of untransfected cells for high FSC/SSC events.
For qualitative microscopic observation, cells were stained 20 h later with 0.06 μM calcein-AM (BioLegend) and NucBlue Fixed Cell Stain (Thermo Scientific) for ˜10 minutes. Medium was replaced, and cells were visualized on a Leica DMi8 inverted microscope. Image overlays were created in Fiji ImageJ.
M. Protein Purification and Size Exclusion Chromatography
To express gp120 subunits, residues E31-K510 (BaL), A31-K510 (Q769.d22), and V31-K510 (Q842.d12) were cloned downstream of a CD5 leader peptide, and fused at their C-termini to a linker (AGG) and 8-his tag. The EnvBaL gp140 construct encoded the soluble extracellular region upstream of the MPER (residues E31-D664) with a CD5 leader peptide, and fused at the C-terminus to a linker (GSGSGGSG) (SEQ ID NO:3) and 8-his tag. These coding sequences were cloned into the Nhel-XhoI sites of pcDNA3.1 (+) (Invitrogen).
Plasmids were transfected into Expi293F cultures using a protocol adapted from88. For each milliliter of culture, 1 μg DNA was mixed with 5 μg linearized polyethyleneimine (MW 25,000; Polysciences) in 100 μl of OptiMEM (Gibco). The mixture was incubated for 20 minutes at room temperature, then added to the cell culture at a density of 2×106/ml. Expi293 Transfection Enhancers (Life Technologies) were added 18 h post-transfection. Cells were centrifuged (1200×g, 15 minutes) 4 days post-transfection, and secreted protein was purified from the culture supernatant.
Protein was purified at 4° C. The supernatant was dialyzed against 20 mM Tris pH 8.0/225 mM NaCl for 6-8 h, followed by dialysis overnight against 20 mM Tris pH 8.0/20 mM imidazole/300 mM NaCl. Equilibrated NiNTA (50% slurry, 500 μl per 40 ml culture, Thermo Scientific) was incubated with the sample for 1 h on a rocker, collected in a gravity column, and washed with 20 ml of purification buffer (20 mM Tris pH 8.0/300 mM NaCl) containing 20 mM imidazole. Protein was eluted using a step gradient of purification buffer containing 50, 100 and 250 mM imidazole (1 ml per fraction). The 100 and 250 mM imidazole fractions were found to contain 8-his-tagged gp120 or gp140 based on SDS-PAGE analysis, and were pooled and concentrated using a 30 kD MWCO centrifugal device (Sartorius). Samples were separated by size exclusion chromatography using a Superose 6 increase 10/300 GL column on an ÄKTA pure system (GE Healthcare) with PBS as the running buffer.
N. BG505 SOSIP.664 Mutant Screening, Purification and ELISA Analysis
A codon-optimized gene fragment of BG505 SOSIP.664 (T332N) was cloned into the Nhel-XhoI sites of pCEP4. A CD5 leader peptide was placed at the N-terminus, and the sequence was fused C-terminally via a gly/ser-rich linker to a 6his tag followed by the TM helix of HLA class I α chain for tethering to the cell membrane. Mutations were made by overlap extension PCR and screened for PG16 binding in transfected Expi293F CXCR4-KO cells as described above.
For purification, BG505 SOSIP.664 (T332N) was subcloned into pCEP4 (Nhel-XhoI sites) with the C-terminal TM tether replaced by gly/ser-rich linker and 8his tag. The protein was expressed and purified as described above.
For PG16 binding analysis by ELISA, purified BG505 SOSIP protein (50 μl per well at 2 μg/ml in PBS) was incubated for 1 h at room temperature in a copper-coated 96-well plate (Thermo Scientific). Wells were washed 6 times with PBS containing 0.2% TWEEN® polysorbate 20 (PBS-T1), blocked with 100 μl 5% skim milk in PBS-T for 30 min, and then incubated for 2 h with 50 μl PG16 in PBS-T1 containing 2% skim milk. Wells were washed 6 times with PBS-T1, incubated for 1 h with peroxidase-conjugated donkey anti-human IgG-Fc (50 μl of a 1:15000 dilution in PBS-T1; Jackson ImmunoResearch Laboratories), and washed another 6 times with PBS-T1. For VRC01 binding analysis, BG505 SOSIP (50 μl per well at 2 μg/ml in PBS) was incubated for 1 h at room temperature in a copper-coated 96-well plate. Wells were washed 6 times with PBS containing 0.00005% TWEEN® polysorbate 20 (PBS-T2), blocked with 100 μl 3% BSA in PBS for 15 min, and then incubated for 2 h with 50 μl VRC01 in PBS containing 1.2% BSA. Wells were washed 6 times with PBS-T2, incubated for 1 h with peroxidase-conjugated donkey anti-human IgG-Fc (100 μl of a 1:5000 dilution in PBS), and washed another 6 times with PBS-T2. For CD4-IgG2 binding analysis, BG505 SOSIP (50 μl per well at 2 μg/ml in TBS: 50 mM Tris-CI pH 7.5, 150 mM NaCl) was incubated for 1 h at room temperature in a nickel-coated 96-well plate (Thermo Scientific). Wells were washed 5 times with TBS containing 0.05% TWEEN® polysorbate 20 (TBS-T), blocked with 100 μl TBS-T containing 3% BSA for 30 min, and then incubated for 2 h with 50 μl CD4-IgG2 (provided by Progenics Pharmaceuticals through the NIH AIDS Reagent Program) in TBS containing 1.2% BSA and 0.02% TWEEN® polysorbate 20. Wells were washed 5 times with TBS-T, incubated for 1 h with peroxidase-conjugated donkey anti-human IgG-Fc (100 μl of a 1:5000 dilution in TBS-T), and washed another 5 times with TBS-T. All ELISA plates were developed with 1-Step Ultra TMB-ELISA Substrate Solution (Thermo Scientific), and absorbance was measured at 630 nm.
P. Sequences
Native Env sequences with GenBank accession numbers from tested HIV-1 strains are shown below:
The following are HIV-1 Env sequences as shown above (SEQ ID NOs:7-11) but with the N-terminal signal peptides replaced with a CD5 leader sequence (capitalized; MPMGSLQPLATLYLLGMLVASVL (SEQ ID NO:12) for better expression. Experiments were performed using these constructs:
Q. Statistical Reporting
All measurements were from distinct samples. Replicate deep mutational scans were performed independently on different transfected cultures. Central tendency, variation, sample sizes, and statistical tests are described in figure legends.
R. Data Availability
S. Plasmid Availability
Plasmids have been deposited with Addgene: www.addgene.org/Erik_Procko/
Codon-optimized Env of the BaL isolate (a tier 1B virus from clade B)45 with a CD5 leader sequence for enhanced surface expression46 bound soluble CD4 (domains D1-D2), PG16 and VRC01 by flow cytometry when expressed on human Expi293F cells. Previously, deep mutational scanning of HIV-1 receptors CCR5 and CXCR4 found only qualitative agreement between replicate experiments47, likely in part due to under sampling the diverse mutant libraries. To increase sampling of mutations in the much larger Env protein, three separate single site-saturation mutagenesis (SSM) libraries were constructed focused on the Env N-terminus (a.a. 31-265; numbering based on the HXB2 reference strain), center (a.a. 266-529), and C-terminus (a.a. 530-856). Combined data from separate sorting experiments of each SSM library encompass 16,332 of the possible 16,520 single amino acid substitutions.
To maintain a tight link between genotype and phenotype, the Env libraries were transfected into Expi293F cells under conditions that yielded close to one sequence variant per cell, achieved by diluting the plasmid-based libraries with a large excess of carrier DNA47. However, when diluted to a single sequence variant per cell. Env expression was barely detected. It was critical that expression be increased by addition of an artificial intron in the 5′ untranslated region, and cotransfection with carrier DNA hypothesized to promote extra-chromosomal replication of the episomal Env plasmids.
Expi293F cells expressing the Env SSM libraries were bound to soluble CD4, VRC01, or PG16 near the apparent dissociation constants, and were screened by fluorescence-activated cell sorting (FACS) for highest binding signal (Table 1).
1Library A spans a.a. 31-265, Library B a.a. 266-529; Library C a.a. 530-856.
2Of the three SSM libraries, Library C had the most positive cells after ligand staining, as there are few deleterious imitations in the cytosolic Env tail. Therefore to sort cells with similar binding signals, a higher percentage of Library C was gated and collected.
The enrichment or depletion of Env mutants was determined by comparing the frequencies in the näive plasmid libraries with the transcripts in the evolved populations (
The Env sequence-activity landscapes are similar whether screened for CD4, VRC01 or PG16 binding (
Premature stop codons prior to the membrane-spanning helix (a.a. 684-705) are depleted, as expected. However, stop codons are tolerated in the cytosolic C-terminus immediately following the membrane anchor; this region contains a GYSPL motif that interacts with the AP-2 complex for clathrin-mediated endocytosis, normally maintaining low levels of surface Env expression48,49. Premature stop codons are again depleted around the N-terminus of the Kennedy Epitope, but then become highly enriched around residues 731-759. Env C-terminal deletions have previously been shown to increase surface expression50, which has been appropriated for elevated Env levels in virus-like particle vaccines. This indicates that even higher Env surface expression might be achieved in virus-like particles by using alternative premature stop codons (e.g. at position 731) to what have already been tested (for example51,52).
Stop codons are again depleted at a.a. 760-782 and 795-837, approximately corresponding to the lentivirus lytic peptide-2 (LLP-2) and LLP-3 to LLP-1 regions, respectively. Premature stop codons are weakly enriched near the very C-terminus, removing another endocytosis signal at the very end of the protein53 that would otherwise reduce the surface expressed pool. Lysine substitutions are depleted around a.a. 832-851 in LLP-1, in agreement with prior observations that Env expression is decreased when lysine is mutated into the arginine-rich LLP-1 region54, despite arginine and lysine sharing similar physicochemical properties. Overall, there are clearly distinct cytosolic sequence elements that modulate surface expression, and yet there is little sequence conservation after the TM helix, suggesting regulatory cytosolic elements lack folded structure. Disordered cytosolic tails with embedded regulatory motifs are common amongst transmembrane proteins55.
While the Env sequence-activity landscapes for interacting with the three protein ligands are similar when viewed globally, unique surface features are immediately apparent when the conservation scores are plotted on a model of trimeric BaL Env in the closed conformation (
CD4 binds the gp120 outer domain, stabilizing structural elements in the conformationally flexible subunit13,56,57. The defined binding site is highly conserved in the selection for CD4 binding (
VRC01 engages the CD4-binding site on the gp120 outer domain58, and yet this surface is distinctively not conserved in the Env sequence-activity landscape for VRC01 interaction (
PG16 binds the junction between two gp120 subunits at the Env apex, possibly forming bridging contacts to glycans from each subunit and explaining the antibody's strong preference for trimeric quatemary structure40. Crystal structures of PG16 and related bNAb PG9 bound to scaffolded V1-V2 demonstrated extensive contacts to the glycan on Env-N160, with a smaller contact surface to the adjacent glycan on Env-N156 (alternatively N173 in other HIV-1 strains)31,60. This correlates with mutagenesis data demonstrating greater importance of the N160 glycosylation motif39. An unusually long PG16 HCDR3 penetrates between the glycans and forms sequence-independent n-sheet-like interactions with the V1-V2 backbone31,60. Consistent with these prior results, most mutations of Env residues within the epitope are tolerated in the deep mutational scan for PG16 binding, with the critical exception of the conserved N160 glycosylation motif (
When the Env SSM library is evolved for PG16 interactions, trimer interface residues are under selection and show moderate sequence conservation (
Nearly a hundred mutations in the Env sequence-activity landscape for PG16 binding had log2 enrichment ratios greater than 1.5 in both selection replicates, though it is noted that because most substitutions in the N-terminal SSM library were severely depleted, neutral substitutions in V1-V2 could be misleadingly enriched. 42 substitutions that were highly enriched in both replicate selection experiments (Table 2) were individually tested. The screen was biased towards residues located on subunit surfaces in areas less likely to interact with bNAbs, including trimer or gp120-gp41 interfaces, and around the furin cleavage site.
1Expi293F cells expressing Env mutants were stained with 0.5 and 2 nM PG16 (n ≥ 2), and binding was assessed by flow cytometry.
2+ is below 0.75 wildtype signal, ++ is 0.75-0.9x, +++ is 0.9-1.1, ++++ is 1.1-1.25x, +++++ is 1.25-1.5x, ++++++ is 1.5-1.75x, +++++++ is 1.75-2x, and ++++++++ is > 2x wildtype binding signal.
Twenty mutations were validated to cause a slight to moderate increase in PG16 binding, and these are clustered at five sites (
The first site of Env mutations that enhance PG16 binding is located at the trimerization interface near the apex (
The second site of Env mutations for enhanced PG16 binding is a centrally located interfacial region where residues are in contact between Env protomers, and between gp41 and gp120 subunits (
The mutation I595M is found in site 3 at a trimer contact (
Mutations to engineer BaL Env for enhanced PG16 binding (Table 2) were combined. Multiple Env variants with similarly high PG16 binding signals were identified, often combining mutations from separate sites (mutations within a single site likely have epistatic interactions that limit their benefit when combined). Two sequences, termed BaL-QES.i01 and BaL-QES.i02 (“i” for “interfaces”, where most of these mutations are localized), were chosen for further characterization. These variants combine mutations T49D; P124D; I595M and T49D; P124D; L663N, respectively. The CD4 binding potential of BaL-QES.i01 and BaL-QES.i02 is unchanged from wildtype (
Env loses conformational flexibility upon proteolytic processing, increasing affinity for certain conformation-dependent antibodies, including PG1662. Western blot shows that wildtype, QES.i01 and QES.i02 gp160BaL were all cleaved to gp120 (
Most of the QES mutation sites identified in BaL Env are conserved across HIV-1 strains and clades, though in some cases other strains already carry the substituted amino acid, such as a threonine or tyrosine at positions 200 and 223. (Based on this observation, R315 of gp160BaL was substituted for glutamine, which is found at the equivalent position of Env in other HIV-1 strains, and again found PG16-binding was increased; Table 2.) A subset of the mutations in BaL Env that enhance PG16 binding was tested to determine if the mutations are effective in other HIV-1 strains. Q769.d22 and Q842.d12 are tier 2 strains from clade A, YU-2 is a tier 2 strain from clade B, and 25711 and DU422 are tier 1B and 2 strains from clade C, respectively45.
The mutations generally had little positive impact on PG16 binding to YU-2, 25711, or DU422 Env. However, many of the QES mutations increased PG16 binding to Q769.d22 and Q842.d22 Env (Table 3), and combinations of mutations were screened for even higher PG16 binding. Q769-QES.i03 has three mutations (A200E; F223Y; I595M), and bound PG16 with similar affinity (apparent KD of 3.7±0.6 nM and 5±3 nM for Q769-QES.i03 and wildtype Env, respectively) but 2.7-fold higher binding at saturation (
1Assessed by flow cytometry analysis of transfected Expi293F cells. n = 3, except for 25711 where n = 2.
2+ is below 0.75 wildtype signal, ++ is 0.75-0.9x, +++ is 0.9-1.1, ++++ is 1.1-1.25x, +++++ is 1.25-1.5x, ++++++ is 1.5-1.75x, +++++++ is 1.75-2x, and ++++++++ is >2x wildtype binding signal.
Q769-QES.i03 and wildtype Q769.d22 Env bound equivalent levels of soluble CD4 (
Neutralization of positive charge at the BaL, Q769.d22 and Q842.d12 Env apex increased presentation of the PG16 epitope, and excess positive charge within the turret may prime the closed Env trimer for opening by simple electrostatic repulsion. Instability of closed Env due to positive charge at the apex was also proposed in a recent structural study of PGT145-bound BG505 SOSIP, and it was further hypothesized that an arginine at position 315 increases the dynamics of apex opening61. However, no correlation between the identity of residue 315 and which Env sequences can be stabilized by neutralizing apex mutations was identified.
The QES mutations described thus far stabilize the closed conformation recognized by PG16 but do not diminish CD4 binding, and BaL-QES.i01 and QES.i02 variants still bind V3 region antibody clones 2442, 268-D IV, 39F, and 3074 similarly to wildtype Env (
From the deep mutational scan, mutations within the Env core to enhance PG16 binding were investigated. Mutations to 12 buried residues were incorporated into a combinatorial library of extracellular gp140BaL fused to a non-native TM helix for surface display; this served to both increase surface expression for higher PG16 binding signals, and ensured the selection didn't simply enrich for premature stop codons in the cytoplasmic tail. The combinatorial library was sorted for binding to PG16 for three rounds, and then individual clones were screened. An alignment of 7 sequences with elevated PG16 binding highlighted mutations I181L, T202N, V254T and V255M (
1Qualitatively assessed by flow cytometry (n ≥ 2).
2+ is below 0.75 wildtype signal, ++ is 0.75-0.9x, +++ is 0.9-1.1, ++++ is 1.1-1.25x, +++++is 1.25-1.5x, ++++++ is 1.5-1.75x, +++++++ is 1.75-2x, ++++++++ is 2-2.5x, +++++++++ is 2.5-3x, and ++++++++++ is >3x wildtype binding signal.
Mutation I181L is a subtle mutation just below the surface of the apical tip where PG16 binds (
QES mutations within the core could be broadly applied to full-length Env from strains DU422, 25711, Q842.d12 and Q769.d22 (Table 4). Mutations were combined to create Env constructs BaL-QES.i01.c01 (“c for”, “core”, containing I181L and V255M), Q769-QES.i03.V255M, Q842-QES.i04.1181L, 25711-QES.c02 (I181L; V254T), and DU422-QES.c03 (V181L; V254T; V255M), which bound up to 5-fold more PG16 at saturation when expressed on the cell surface (
The Env-QES variants mediated fusion to receptor-decorated membranes at levels similar to or greater than wildtype controls, unless the variant included the V255M substitution to destabilize the CD4-bound conformation, in which case membrane fusion was decreased (
Size exclusion chromatography (SEC) was used to determine whether some of the QES mutations could shift purified soluble gp140 and gp120 towards higher molecular weight species, but observed no decrease in elution volume compared to the wildtype proteins (
1Qualitatively assessed by flow cytometry. Expi293F cells were transfected with mutant BG505 SOSIP.664 tethered to the membrane via the TM helix of HLA class 1α chain, and stained with 1.5 nM PG16. n = 2.
2++ is <0.9x wildtype binding signal, +++ is 0.9-1.1x, ++++ is 1.1-1.3x, +++++ is 1.3-1.5x, ++++++ is 1.5-1.7x, and +++++++ is 1.7-1.9x wildtype signal.
While SOSIP particles are generally purified using antibodies that recognize correctly folded trimers, 8his-tagged BG505-QES.i03.c1 SOSIP was instead purified by nickel affinity chromatography to inspect the full cohort of protein forms. Based on SEC, the protein fraction in the trimer peak was elevated in the QES variant, and monomer/dimer forms also showed slightly reduced elution volume suggestive of increased transient associations (
Thus far, the application of deep mutational scanning to full-length Env from the clade B BaL strain has been described to qualitatively determine the impact of nearly all single amino acid substitutions on interactions with CD4 and the bNAbs VRC0144 and PG1639. By combining mutations (referred to as quatemary epitope stabilized or QES mutants) that enhance expression of a PG16-recognized closed trimer, Env was stabilized in a conformation/s that bound bNAbs PG16, PGT12168, and PGT12829,68. PG16-class antibodies bind at the Env apex between two protomers, explaining their strong preference for closed trimers40. PGT121 and PGT128 recognize the N332 glycan supersite on the gp120 outer domain29,68; antibodies targeting this site are frequently elicited bNAbs from multiple germline precursors, and therefore the N332 supersite may be an especially vulnerable epitope for the human antibody response to target65,68-71. The majority of the QES mutations neutralized positive charge at the apical protomer interfaces, thereby reducing electrostatic repulsion between the apical tips that contributes to dynamic instability61. Furthermore, inclusion of a mutation (V255M) in the gp120 core to destabilize the open state reduced CD4 binding and exposure of V3 region epitopes. These mutations were effective in both full-length Env and soluble BG505 SOSIP.
It would be ideal if there existed a suite of mutations for applying broadly to any HIV-1 strain, which stabilize Env in a closed trimer for improved bNAb elicitation. A HIV-1 vaccine could then be rapidly modified and updated to contain stabilized Env sequences from local prevailing strains. However, the identified QES mutations in BaL Env were only partially transferable to other strains. This was especially true of QES mutations at subunit interfaces. Why is it that mutations to neutralize electropositive charge in the apical cavity of BaL or various clade A Env sequences stabilize a closed trimer, yet the equivalent mutations in Env from the clade C DU422 or 25711 strains do not? Are there different underlying features in primary structure that regulate the closed-to-open Env transition in different strains? To address this, sequence-activity landscapes of Env were determined from the DU422 strain for binding to CD4 and PG16.
A deep mutational scan, which involves using next generation sequencing to track the in vitro evolution or selection of a diverse library of sequence variants34, requires a tight link between genotype and phenotype. For in vitro evolution of sequences under selection in human tissue culture, this can be accomplished by transfecting library DNA with a large excess of carrier DNA, such that a cell typically acquires no more than one coding sequence47. A synthetic codon-optimized gene encoding gp160 from the DU422 strain was fused downstream of a CD5 leader sequence, and expressed from an episomal plasmid that can replicate extra-chromosomally. However, when transfected with a large excess of carrier DNA to limit the copy number of coding sequences acquired per cell, DU422 gp160 surface expression in Expi293F cells was not detected. Expression remained undetectable even when the coding sequence was downstream of an intron to promote transcript processing and nuclear export, or when gp160DU422 was cotransfected with carrier DNA designed to promote episomal plasmid replication. Since the cytoplasmic tail of gp160 features multiple motifs that regulate trafficking and endocytic turnover50, only extracellular gp140DU422 (a.a. N31-N677) anchored to the membrane via a flexible gly/ser-rich linker and canonical transmembrane helix was expressed. Soluble gp140 has increased conformational heterogeneity compared to full-length Env21,72,73, and therefore interesting mutations identified by the deep mutational scan are subsequently validated in the full protein. DU422 gp140 was expressed on the cell surface at high levels based on flow cytometric analysis of PG16 and soluble CD4 (sCD4: domains D1-D2) binding, even when diluted with excess carrier DNA during transfection.
Three SSM libraries were prepared spanning gp140DU422 residues N31-N279 (NT library), N280-Q577 (central library), and T578-N677 (CT library). Together, the three libraries covered the full length of gp140DU422, and encoded all 12,820 single amino acid substitutions. Expi293F cell cultures transiently expressing each library were fluorescently labeled with sCD4 or PG16, and cells expressing gp140 sequences with the highest ligand-binding signals were collected by FACS (Table 6). Cells were incubated with concentrations of sCD4 (10 nM) and PG16 (3 nM) below their apparent dissociation constants (18 and 5 nM, respectively), ensuring that mutants with higher or lower affinity could be distinguished by fluorescence intensity. RNA was extracted from the sorted cell populations, and gene-specific fragments were amplified from cDNA and deep sequenced. The frequencies of all mutations in the sorted populations were compared to the naïve libraries. Beneficial mutations increase in frequency and have positive log2 enrichment ratios, whereas deleterious mutations are depleted and have negative log2 enrichment ratios. The enrichment ratios are plotted in
The sorting experiments were independently replicated, and the log2 enrichment ratios for all mutations within the NT and CT libraries weakly agree; neutral mutations in the correlation plots are clustered near the origin, while deleterious mutations fall in the negative quadrant (
When the gp140DU422 sequence is under selection for sCD4 binding, diversity is tolerated in variable regions V1 to V5, and in residues downstream of heptad repeat HR2 that begin the membrane proximal extemal region or MPER (
High conservation within the trimerization domain of DU422 gp140 starkly contrasts with BaL gp160, where PG16 binding is not only more tolerant of mutations in V1, V2 and V3, but mutations in these regions can also enhance presentation of the PG16 quatemary epitope. This is likely due to both strain-specific sequence constraints, and differences between gp140 versus gp160. Extracellular gp140 has an unstable structure with substantial conformational diversity, and a sizeable protein fraction likely adopts non-native conformations18,21,72,73,75 The trimeric closed state of gp140, already destabilized, may therefore be more sensitive to mutational insults than full-length gp160. That said, a defining feature of the BaL gp160 sequence-activity landscape was that neutralization of electropositive charge at the apical trimer interface stabilized the PG16-bound closed conformation. This is not the case for DU422 gp140. For example, most substitutions to basic residues K117 and R166 at the apex are generally predicted to be neutral, and (excluding the central library where data quality is poor) overall very few substitutions are predicted from the mutational scan to increase PG16 binding. This agrees with prior targeted mutagenesis, where mutations that increased PG16 recognition of BaL gp160 did not have the same effect in DU422 gp160. These strain-specific features justify the decision to deep mutationally scan DU422 Env, as a more comprehensive set of quatemary epitope stabilizing (QES) mutations to favor closed trimers can now be identified from two disparate strains (BaL and DU422), increasing the applicability of QES mutations for engineering Env from any strain of interest for vaccine incorporation.
Based on predictions from the deep mutational scan, four new QES mutations were screened and validated in full-length gp160DU422 that show increased PG16 binding when expressed in Expi293F cells (
This application claims the benefit of U.S. Ser. No. 62/534,191, filed Jul. 18, 2017, which is incorporated herein by reference in its entirety.
This invention was made with government support under contract number R01AI129719 awarded by the National Institutes of Health. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2018/042610 | 7/18/2018 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62534191 | Jul 2017 | US |