The present invention relates to a polypeptide which may be used as an immunogen to provoke an immune response. The invention further relates to a vaccine composition comprising the polypeptide. Aspects of the invention further relate to methods for enhancing a subdominant antibody response in a subject. Yet further aspects of the invention relate to methods for designing a peptide, preferably an immunogen, to mimic a complex and/or discontinuous structural configuration of a target peptide.
Throughout the last decades, vaccination has been a key countermeasure to control and eradicate infectious diseases. However, many pathogens (Respiratory Syncytial Virus (RSV), Influenza, Dengue and others) evade the immune system mounting antibody responses that fail to confer broad and potent protection against reinfection, and, in some cases, mediating disease enhancement. Deep profiling of human B-cells often reveals potent neutralizing antibodies emerging from natural infection, but are generally subdominant. A major challenge for next-generation vaccines is to overcome established immunodominance hierarchies, and focus antibody responses on crucial neutralization epitopes.
Vaccination has proven to be one of the most successful medical interventions to reduce the burden of infectious diseases, and the major correlate of vaccine-induced immunity is induction of neutralizing antibodies that block infection. However, classical vaccine approaches relying on inactivated or attenuated pathogen formulations have failed to induce protective immunity against numerous important pathogens, urging the need for novel vaccine development strategies. Structure-based approaches for immunogen design have emerged as promising strategies to elicit antibody responses focused on structurally defined epitopes sensitive to antibody mediated neutralization.
In recent years, advances in high-throughput B-cell technologies have revealed an impressive wealth of potently neutralizing antibodies (nAbs) for different pathogens which have resisted the traditional means of vaccine development for several decades, including HIV-1, Influenza, Respiratory Syncytial Virus (RSV), Zika, Dengue and others. Many of these antibodies have been structurally characterized in complex with their viral target proteins, unveiling the atomic details of neutralization-sensitive epitopes. The large-scale campaigns in antibody isolation, together with detailed functional and structural studies have provided comprehensive antigenic maps of the viral fusion proteins, which delineate epitopes susceptible to antibody-mediated neutralization and provide a roadmap for rational and structure-based vaccine design approaches.
The conceptual framework to leverage neutralizing antibody-defined epitopes for vaccine development is commonly referred to as reverse vaccinology. Although the reverse vaccinology inspired approaches have yielded a number of exciting advances in the last decade, the design of immunogens that elicit such focused antibody responses remains challenging. Successful examples of structure-based immunogen design approaches include the conformational stabilization of RSVF in its prefusion state (preRSVF), yielding superior serum neutralization titers when compared to its postfusion conformation. In the case of Influenza, several epitopes targeted by broadly neutralizing antibodies (bnAbs) were identified within the hemagglutinin (HA) stem domain, and an HA stem-only immunogen elicited a broader neutralizing antibody response than that of full length HA. Commonly, these approaches have aimed to focus antibody responses on specific conformations or subdomains of viral proteins. In a more aggressive approach, Correia et al. computationally designed a synthetic immunogen presenting the RSV antigenic site II, and provided a proof-of-principle for the induction of RSV neutralizing activity mediated by a single epitope in non-human primates.
Efforts to design novel proteins from first principles have revealed a variety of rules to control the structural features of de novo proteins (1-4). The design of function using computational approaches has been far more challenging as it requires high-precision energy functions and may entail critical parameters that are modelled inaccurately or neglected (e.g. molecular environment, conformational dynamics). Nevertheless, important advances have been made in the design of molecular recognition events by endowing designed proteins with structural motifs which perform their function by binding other proteins. With rare exceptions, the binding motifs transplanted were commonly found in existing protein structures, such as linear helical segments, allowing the grafting of such motifs without extensive backbone adjustments (5-8). Commonly, protein function is not contained within single, regular segments in protein structures but it arises from the 3-dimensional arrangement of several, often irregular structural elements that are supported by defined topological features of the overall structure (9, 10). As such, it is of utmost importance for the field to develop computational approaches to endow de novo designed proteins with irregular and multi-segment complex structural motifs that can perform the desired functions.
An important domain where functional protein design has raised expectations was on the modulation of immune responses, in particular, on the induction of neutralizing antibodies (nAbs) in vivo (11). Inducing nAbs targeting defined epitopes remains an overarching challenge for vaccine development. Our increasing structural understanding of many nAb-antigen interactions has provided templates for the rational design of immunogens for respiratory syncytial virus (RSV), influenza, HIV, dengue and others. Despite this extended structural knowledge, these and other pathogens are still lacking efficacious vaccines, highlighting the need for next-generation vaccines that efficiently guide antibody responses towards key neutralization epitopes in both naïve and pre-exposed immune systems. Indeed, the elicitation of antibody responses with defined epitope specificities has been a long-lasting challenge for immunogens derived from modified viral proteins.
A de novo design approach towards focusing antibody responses was shown by Correia and colleagues (11). Using computational protein design, the RSVF antigenic site II, a linear helix-turn-helix motif that is targeted by a clinically approved monoclonal antibody, was transplanted onto a heterologous protein scaffold. In vivo, the epitope-focused immunogen elicited nAbs in non-human-primates (NHPs) after repeated boosting immunizations. Albeit a proof-of-principle for the induction of functional antibodies with a computationally designed immunogen was demonstrated, several major bottlenecks arose: the lack of applicability of the computational approach to structurally complex epitopes and the inconsistent neutralization titers observed in the immunogenicity studies.
To address these limitations, here we designed epitope-focused immunogens mimicking two irregular and discontinuous RSV neutralization epitopes (site 0 (12) and IV (13) shown in
According to a first aspect of the present invention, there is provided a vaccine composition against a target pathogen, the composition comprising a plurality of non-naturally occurring immunogenic polypeptides; at least a first of said immunogenic polypeptides comprising a mimic peptide having an amino acid sequence having a tertiary structure which, when folded, mimics that of a complex and/or discontinuous neutralisation epitope from said target pathogen.
By “mimics” is meant that the tertiary structure of the amino acid sequence largely replicates that of the complex and/or discontinuous neutralisation epitope from said target pathogen; preferably there is sufficient similarity between the two tertiary structures at least to the extent that the mimic peptide can be bound by a neutralising antibody which targets the complex and/or discontinuous neutralisation epitope from said target pathogen. In most preferred embodiments, either and preferably both of the affinity and avidity of the antibody binding to the mimic peptide are at least 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95% or more of that of the antibody binding to the epitope from the target pathogen.
The invention is based on the design principles and peptides disclosed herein, which permit a complex or discontinuous epitope from a pathogen to be mimicked by a mimic peptide. In the examples herein, the pathogen is RSV. We have found that using a combination of immunogens in a composition—each of which may be designed as described herein, or which may be a combination of said designed immunogens and other immunogens—permits a range of immune responses to be elicited which provides a satisfactory immune response against the pathogen. In certain embodiments, the vaccine composition may be used to enhance an initial subdominant neutralising antibody response (for example, such a subdominant response may occur in response to an initial exposure to the pathogen; as the response is subdominant, it may be insufficient to neutralise the pathogen on subsequent exposure. Enhancing the subdominant response with the vaccine composition described herein may result in a neutralising response on subsequent exposure to the pathogen).
In preferred embodiments, each of said plurality of non-naturally occurring immunogenic polypeptides comprises a mimic peptide having an amino acid sequence which, when folded, mimics a complex and/or discontinuous neutralisation epitope from said target pathogen. Preferably each of said complex and/or discontinuous neutralisation epitopes are non-overlapping. In embodiments where a single mimic peptide is used (and other immunogens are also present), it is preferred that each of the immunogens presents a non-overlapping epitope. It is not necessarily that case, however, that all immunogens comprise a mimic peptide; at least one of the immunogens may be a naturally-occurring immunogen. In this way, multiple separate antibody responses may be elicited against a single pathogen. The combined immune response may be synergistic compared with eliciting individual immune responses to single immunogens.
In preferred embodiments, and as described in the examples herein, said target pathogen is RSV. However, the design principles illustrated herein may be used to prepare vaccines against other pathogens, and in particular against pathogens which may be resistant to conventional vaccine design, for example by virtue of being prone to eliciting subdominant neutralising antibody responses, and/or by virtue of frequent mutation in surface molecules which result in antibody targeting of strain specific epitopes rather than potent neutralising epitopes. Examples of other potentially suitable target pathogens include influenza, HIV, Dengue.
Where the target pathogen is RSV, the complex and/or discontinuous neutralisation epitopes are preferably selected from the group consisting of RSV site 0, site II, and site IV; more preferably epitopes from both sites 0 and IV are used, and most preferably epitopes from all of RSV sites 0, II, and IV are used.
Preferably a mimic peptide targeting RSV site 0 comprises or consists of an amino acid sequence selected from Tables 4 or 6, preferably from table 6, and most preferably comprises or consists of the S0_2.126 peptide sequence. A mimic peptide targeting RSV site IV may comprise of consist of an amino acid sequence selected from Tables 3 or 5, preferably from table 5, and most preferably comprises or consists of the S4_2.45 peptide sequence. A mimic peptide targeting RSV site II may comprise or consist of the FFL_001 or FFLM peptides (and preferably the FFLM peptide) described in Sesterhenn et al 2019 (PLoS Biol. 2019 Feb; 17(2): e3000164, doi: 10.1371/journal.pbio.3000164). The FFLM peptide has the amino acid sequence
ASREDMREEADEDFKSFVEAAKDNFN KFKARLRKGKITREHREMMKKLAKQNANKAKEAV RKRLSELLSKINDMPITNDQKKLMSNQVLQFADDAEAEIDQLAADATKEFTG (SEQ ID NO: 1), and is also referred to herein as S2_1 or S2_1.2.
In certain embodiments of the invention, the immunogenic peptide may comprise a scaffold, preferably a peptide scaffold, which presents the mimic peptide so as to assist the mimicking of the complex and/or discontinuous neutralisation epitope. For example, a designed mimic sequence may be fused to a scaffold sequence in a linear manner. Alternatively, a mimic sequence may be grafted onto or fused to two or more structural framework elements (eg, helices, sheets, etc) in a non-linear manner, so as to present the mimic sequence in a desired structural manner. The mimic sequence itself may comprise multiple sequences, in particular if presented on multiple structural elements. The scaffold may form a nanoparticle comprising multiple immunogenic peptides, with said nanoparticle preferably being soluble. In preferred embodiments, the scaffold may be selected from RSVN and ferritin.
The vaccine composition of the invention may be provided in combination with a vaccine composition comprising a native immunogen from the target pathogen. These may be provided separately (as separate compositions) or together (as a single vaccine composition). Also provided by the invention is a kit comprising multiple vaccine compositions, as described herein. For example, where the pathogen is RSV, the native immunogen may be an additional RSV-derived protein or glycoprotein, and most preferably the RSVF glycoprotein, or an RSVF protein precursor (for example, the core peptide sequence, or preRSVF). An example RSVF protein sequence is given in the UniProt KB database as entry A0A110BF16 (A0A110BF16_HRSV). Note that reference to a “native immunogen” does not require that the immunogen is obtained directly from the pathogen; clearly other expression systems or synthetic sources can be used. The intention is that the immunogen has the same sequence/configuration as the immunogen when present in or on the pathogen. Where the vaccines are provided as separate compositions, the vaccines may be administered in a prime:boost schedule (that is, administration of the native immunogen vaccine as a “prime” administration, followed thereafter by the other vaccine as a “boost”); such a schedule is believed to enhance an initial subdominant neutralising immune response seen in response to the prime vaccine. In certain embodiments of the invention, the vaccine composition (without the native immunogen) may be administered to a subject who has previously been exposed to the native immunogen. Such an administration schedule is believed to have a similar effect to the prime:boost schedule without the need for a separate prime administration. The schedule may comprise administering multiple boost vaccinations. The prime and first boost vaccinations may be administered according to any suitable schedule; for example, the two vaccinations may be administered one, two, three, four or more weeks apart. Where multiple boost vaccinations are administered, these too may be administered according to any suitable schedule; for example, the two vaccinations may be administered one, two, three, four or more weeks apart. Preferably two boost vaccinations are administered.
In one particularly preferred aspect of the invention, there is provided a vaccine composition comprising the S0_2.126 peptide sequence as described herein, and the S4_2.45 peptide sequence as described herein. The composition may further comprise the FFL_001 or FFLM peptides described in Sesterhenn et al 2019 (FFLM is also referred to herein as S2_1 or S2_1.2). Either or preferably both of the S0_2.126 and the S4_2.45 peptide sequences may be conjugated to ferritin. The FFL_001 or FFLM peptide sequence may be conjugated to RSVN.
Vaccine compositions described herein may further comprise one or more pharmaceutically acceptable carriers, and/or adjuvants. The adjuvant may be AS04, AS03, alhydrogel, and so forth.
The vaccine compositions described herein may be administered via any route including, but not limited to, oral, intramuscular, parenteral, subcutaneous, intranasal, buccal, pulmonary, rectal, or intravenous administration.
Also provided is a vaccine composition as described herein, wherein said target pathogen is RSV, for use in a method for immunising a subject against RSV, the method comprising a) administering said vaccine composition to a subject; and b) prior to said administration, administering a further vaccine composition comprising an RSV-derived protein or glycoprotein, preferably the RSVF glycoprotein, or wherein the vaccine composition of any preceding claim is administered to a subject who has previously been exposed to RSV infection.
Yet further provided by the present invention is a peptide sequence as described herein; a nucleic acid sequence encoding a peptide sequence as described herein; and a vector comprising such a nucleic acid sequence. Still further provided is use of a peptide sequence as described herein in the manufacture of a vaccine composition. Also provided is a method of vaccinating a subject, the method comprising administering a vaccine composition as described herein.
A further aspect of the invention provides a method for designing a peptide (preferably an immunogen) to mimic a complex and/or discontinuous structural configuration of a target peptide (preferably also an immunogen), the method comprising the steps of:
The method may further comprise the steps of identifying a plurality of said variants having improvements, and providing a further peptide having a combination of variations from said plurality of variants. The method may further be repeated for further rounds of generation and selection of variants.
The step of identifying a preliminary mimic peptide may comprise selecting a peptide from a peptide database having a structural similarity to the desired target peptide; or said step may comprise combining an amino acid sequence from said target peptide with one or more structural peptide elements such that said preliminary mimic peptide has a structural similarity to the desired target peptide.
Yet further provided herein is a design protocol as described; at least in part with reference to the TopoBuilder design protocol as described herein. In said design protocol, the placement of idealized secondary structure elements are sampled parametrically, and are then connected by loop segments (for example, structural elements such as loops, sheets, helices), to assemble topologies that can stabilize the desired conformation of the structural motif. These topologies are then diversified to enhance structural and sequence diversity with a folding and design stage. This permits two design objectives to be achieved: (1) building stable topologies de novo that stabilize the epitope, while mimicking its native quaternary environment; (2) Fine-tuning the topology's secondary structure arrangement to maximize the fold stability and optimize epitope presentation for high affinity antibody binding.
These and other aspects of the invention will now be described in detail, and with reference to the following figures.
References for legends to
Designing proteins with structurally complex functional sites has remained a largely unmet challenge on the field of computational protein design. We sought to design accurate mimetics of RSV neutralization epitopes, which have been particularly well studied structurally, and evaluate their functionality in immunization studies. We chose antigenic sites 0 and IV (
The computational design of proteins mimicking structural motifs has previously been performed by first identifying compatible protein scaffolds, either from naturally occurring structures or built de novo, which then serve as design templates to graft the motif (5, 6, 8, 15, 16). Given the structural complexity of sites 0 and IV, this approach did not yield any promising matches even with loose structural criteria (
Thus, for site IV, we noticed that a small structural domain that resembles an immunoglobulin fold containing the epitope could be excised from the preRSVF structure, hypothesizing this would be a conservative approach to maintain its native, distorted epitope structure (
The discontinuous structure of site 0 was not amenable for a domain excision and stabilization approach. We searched for template structures that mimicked the helical segment of the epitope, and simultaneously allowed to graft the loop segment, and selected a designed helical repeat protein as design template (PDB 5cwj) (
The primary goals for the designs were achieved in terms of the stabilization of irregular and complex binding motifs in a conformation relevant for antibody binding, however, the overall strategy presented important limitations with respect to its general utility. Despite the large number of structures available to serve as design templates, the number of those that are practically useful for the design of functional proteins becomes increasingly limited with the structural complexity of the motif. As described above, suboptimal design templates require extensive backbone flexibility on the design process and multiple rounds of directed evolution until a sequence with high-affinity binding is identified. Additionally, the starting topology determines the overall shape of the designed protein, which may be suboptimal for the accurate stabilization of the motif, and may oppose unwanted tertiary steric constraints that interfere with the designed function. In particular, for immunogen design it would be advantageous to preserve native-like accessibility of the epitope to maximize the induction of functional antibodies that can cross-react with the proteins presented by the pathogen. An illustrative example on how a template-based design approach can fail to fulfil these criteria is the comparison between the quaternary environment of the site 0 epitope in preRSVF and S0_1.39 showing that this topology does not mimic such environment, albeit allowing the binding of several monoclonal antibodies (
To overcome these limitations, we developed a template-free design protocol—the TopoBuilder—that generates tailor-made topologies to stabilize complex functional motifs. Within the TopoBuilder, we sample parametrically the placement of idealized secondary structure elements which are then connected by loop segments, to assemble topologies that can stabilize the desired conformation of the structural motif. These topologies are then diversified to enhance structural and sequence diversity with a folding and design stage using Rosetta FunFoldDes (see
To present antigenic site IV, we designed a fold composed of a β-sheet with 4 antiparallel strands and one helix (
We screened a defined set of computationally designed sequences using yeast display and applied two selective pressures—binding to 101F and resistance to the unspecific protease chymotrypsin, an effective method to digest partially unfolded proteins (5, 20, 21). Deep sequencing of populations sorted under different conditions revealed that S4_2_bb2-based designs were strongly enriched under stringent selection conditions for folding and 101F binding, showing that subtle topological differences in the design template can have substantial impact on function and stability. We expressed 15 S4_2_bb2 design variants and successfully purified and biochemically characterized 14. The designs showed mixed alpha/beta CD spectra and bound to 101F with affinities ranging from 1 nM to 1 μM (
Similarly, we built a minimal de novo topology to present the tertiary structure of the site 0 epitope. The choice for this topology was motivated by the fact that site 0, in its native environment preRSVF, is accessible for antibody binding from diverse angles (14), in contrast to the S0_39 natural template which topologically constrained site 0 accessibility (
We explored the topological space within the shape constraints of preRSVF and built three different helical orientations that support both epitope segments. Evaluation of the designed sequences with Rosetta abinitio showed that only sequences generated based on one of the three topologies (S0_2_bb3) presented a funnel-shaped energy landscape (
We selected 5 sequences, differing in 3-21 positions, for further biochemical characterization (
Overall, the properties of the designs generated by topological assembly with the TopoBuilder showed improved binding affinities and thermal stabilities as compared to those using available structural templates. To investigate whether this design and screening procedure yielded scaffolds that better mimicked the viral epitope presented, or rather revealed sequences with a highly optimized interface towards the antibodies used during the selection, we determined the affinity of S4_2.45 and S0_2.126 against a panel of site-specific antibodies. Compared to the first-generation designs, S4_2.45 and S0_2.126 showed large affinity improvements to diverse panels of site-specific antibodies, exhibiting a geometric mean affinity closely resembling that of the antibodies to preRSVF (
To evaluate the structural accuracy of the computational design approach, we solved the crystal structure of S4_2.45 in complex with 101F at 2.6 Å resolution. The structure closely matched our design model, with a full-atom RMSD of 1.5 Å. The epitope was mimicked with an RMSD of 0.135 ↑, and retained all essential interactions with 101F (
Next, we solved an unbound structure of S0_2.126 by NMR, confirming the accuracy of the designed fold with a backbone RMSD between the average structure and the model of 2.8 Å (
Lastly, we sought to evaluate the designed antigens for their ability to elicit antibody responses in vivo. Our rationale for combining site 0, II and IV immunogens in a cocktail formulation is that all three sites are non-overlapping, as verified by electron microscopy analysis (
In mice, Trivax1 elicited low levels of RSVF cross-reactive antibodies, and sera did not show
RSV neutralizing activity in most animals (
In parallel, we sought to test the potential of a trivalent immunogen cocktail in NHPs. The previously designed site II immunogen showed promise in NHPs, but induced neutralizing titers were low and inconsistent across animals, requiring up to five immunizations to elicit neutralizing antibodies in 2/4 animals (11). We immunized seven RSV naïve NHPs with Trivax1, as detailed in
While immunization studies in naïve animals are important to test the designed immunogens, an overarching challenge for vaccine development to target pathogens such as RSV, influenza, dengue and others is to focus or reshape pre-existing immunity of broad specificity on defined neutralizing epitopes that may be of higher-quality and mediate long-term protection (23). To mimic a serum response of broad specificity, we immunized 13 NHPs with prefusion RSVF. All animals developed strong preRSVF-specific titers and cross-reactivity with all the epitope-focused immunogens, indicating that epitope-specific antibodies were primed and recognized by the designed immunogens (
Altogether, we concluded that both design strategies yielded antigens for complex neutralization epitopes that induce neutralizing antibodies upon cocktail formulation, providing a strong rationale for including multiple, ideally non-overlapping epitopes in an epitope-focused vaccination strategy. While the first-generation immunogens were inferior according to biophysical parameters and failed to induce neutralization in mice, but were successful under two different immunological scenarios in NHPs, we show that a second generation with improved biophysical properties and proven accurate mimicry of the epitope can now induce neutralizing antibodies in mice. This is an important step as it now allows to optimize and test the different nanoparticles, formulations and delivery routes in a small animal model, and we foresee that these second-generation immunogens will prove superior in inducing neutralizing serum responses in NHPs.
Here, we have showcased computational protein design strategies to design accurate mimetics of structurally complex epitopes, and validated their functionality to elicit neutralizing antibody responses in cocktail formulations both in mice and NHPs.
We have shown that through computational design of pre-existing templates with full backbone flexibility, irregular and discontinuous epitopes were successfully stabilized in heterologous scaffolds. However, this design strategy required extensive in vitro evolution optimization and the resulting scaffolds remained suboptimal regarding their biochemical and biophysical properties. In addition, the lack of precise topological control of the designed proteins is a major limitation for the design of functional proteins that require specific topological similarity on top of the local mimicry of the transplanted site. For instance, the design template of the site 0 immunogen did not mimic the quaternary environment of the epitope of interest, which may have contributed to the low levels of functional antibodies induced in mice. To overcome these limitations, we developed the TopoBuilder, a motif-centric design approach that tailors a protein fold directly to the functional site of interest. Compared to previously employed de novo design approaches, in which a stable scaffold topology was constructed first and endowed with binding motifs in a second step (5), our method has significant advantages for structurally complex motifs. First, it allows to tailor the topology to the structural requirements of the functional motif from the beginning of the design process, rather than through the adaptation (and often destabilization) of a stable protein to accommodate the functional site. Second, the topological assembly and fine-tuning allowed to select for optimal backbone orientations and sequences that stably folded and bound with high affinity in a single screening round, without requiring further optimization through directed evolution, as often used in computational protein design efforts (5, 24, 25). Together, our approach enabled the computational design of de novo proteins presenting irregular and discontinuous structural motifs that are typically required to endow proteins with diverse biochemical functions (e.g. binding or catalysis), thus providing a new means for the de novo design of functional proteins.
On the functional aspect of our design work, we showed in vivo that these immunogens consistently elicited neutralizing serum levels in mice and NHPs as cocktail formulations. The elicitation of focused neutralizing antibody responses by vaccination remains the central goal for vaccines against pathogens that have frustrated conventional vaccine development efforts. Using RSV as a model system, we have shown that cocktails of computationally designed antigens can robustly elicit neutralizing serum levels in naïve animals. These neutralization levels were much superior to any previous report on epitope-focused immunogens (11) and provide a strong rationale for an epitope-focused vaccination strategy involving multiple, non-overlapping epitopes. Also, their capability to dramatically reshape the nature of non-naïve repertoires in NHPs, addresses an important challenge for many next-generation vaccines to target pathogens for which efficacious vaccines are needed. An important pathogen from this category is influenza, where the challenge is to overcome established immunodominance hierarchies (26) that favour strain-specific antibody specificities, rather than cross-protecting nAbs found in the hemagglutinin stem region (27). The ability to selectively boost subdominant nAbs targeting defined, broadly protective epitopes that are surrounded by strain-specific epitopes could overcome a long-standing challenge for vaccine development, given that cross-neutralizing antibodies were shown to persist for years once elicited (28). A tantalizing future application for epitope-focused immunogens could marry this technology with engineered components of the immune system and they could be used to stimulate antibody production of adoptively transferred, engineered B-cells that express monoclonal therapeutic antibodies in vivo (29).
Altogether, this study provides a blueprint for the design of an epitope-focused vaccination strategy against pathogens that have eluded traditional vaccine development approaches. Beyond immunogen design, the design strategy presented opens a door for the de novo design of proteins stabilizing complex binding sites, applicable to the design of novel functional proteins with defined structural properties.
An experimental method for selecting stable proteins from combinatorial libraries. Biochemistry 38, 11604-11612 (1999).
The structural segments entailing the antigenic site 0 were extracted from the prefusion stabilized RSVF Ds-Cav1 crystal structure, bound to the antibody D25 (PDB ID: 4JHVV) (1). The epitope consists of two segments: a kinked helical segment (residues 196-212) and a 7-residue loop (residues 63-69).
The MASTER software (2) was used to perform structural searches over the Protein Data Bank (PDB, from August 2018), containing 141,920 protein structures, to select template scaffolds with local structural similarities to the site 0 motif. A first search with a Cα RMSD threshold below 2.5 Å did not produce any usable structural matches both in terms of local mimicry as well as global topology features. A second search was performed, where extra structural elements that support the epitope in its native environment were included as part of the query motif to bias the search towards matches that favoured motif-compatible topologies rather than those with close local similarities. The extra structural elements included were the two buried helices that directly contact the site 0 in the preRSVF structure (4JHW residues 70-88 and 212-229). The search yielded initially 7,600 matches under 5 Å of backbone RMSD, which were subsequently filtered for proteins with a length between 50 and 160 residues, high secondary structure content, as well as for accessibility of the epitope for antibody binding. Remaining matches were manually inspected to select template-scaffolds suitable to present the native conformation of antigenic site 0. Subsequently, we selected a computationally designed, highly stable, helical repeat protein (3) consisting of 8 regular helices (PDB ID: SCWJ) with an RMSD of 4.4 Å to the query (2.82 A for site 0 segments only). To avoid steric clashes with the D25 antibody, we truncated the SCWJ template structure at the N-terminus by 29 residues, resulting in a structural topology composed of 7 helices.
Using Rosetta FunFolDes (4) the truncated SCWJ topology was folded and designed to stabilize the grafted site 0 epitope recognized by D25. We generated 25,000 designs and selected the top 300 by Rosetta energy score (RE), designed backbones that presented obvious flaws, as low packing scores, distorted secondary structural elements and buried unsatisfied atoms were discarded. From the top 300 designs, 3 were retained for follow-up iterative cycles of structural relaxation and design using Rosetta FastDesign (5), generating a total of 100 designed sequences.
The best 9 designs by Rosetta energy score were recombinantly expressed in E. coli. 2 designed sequences derived from the same backbone, were successfully expressed and purified. The best variant was named S0_1.1, and subjected to experimental optimization using yeast surface display (
When the design simulations were carried out, there was no structure available of the full RSVF protein in complex with a site IV-specific nAb, nevertheless a peptide epitope of this site recognized by the 101F nAb had been previously reported (PDB ID: 3O41) (7).
The crystallized peptide-epitope corresponds to the residues 429-434 of the RSVF protein. Structurally the 101F-bound peptide-epitope adopts a bulged strand and several studies suggest that 101F recognition extends beyond the linear β-strand, contacting other residues located in antigenic site IV (8). Despite the apparent structural simplicity of the epitope, structural searches for designable scaffolds failed to yield promising starting templates. However, we noticed that the antigenic site IV of RSVF is self-contained within an individual domain that could potentially be excised and designed as a soluble folded protein. To maximize these contacts, we first truncated the seemingly self-contained region from RSVF pre-fusion structure (PDB ID: 4JHW, residue: 402-459) forming a β-sandwich and containing site IV. We used Rosetta FastDesign to optimize the core positions of this minimal topology, obtaining our initial design: S4_wt. However, S4_wt did not show a funnel-shaped energy landscape in Rosetta ab initio simulations, and we were unable to obtain expression in E. coli.
In an attempt to improve the conformation and stabilization of S4_wt, we used Rosetta FunFolDes to fold and design this topology, while keeping the conformation of the site IV epitope fixed. Out of 25,000 simulations, the top 1% decoys according to RE score and overall RMSD were selected for manual inspection, and 12 designed sequences were selected for recombinant expression in E. coli.
Given the limited availability of suitable starting templates to host structurally complex motifs such as site 0 and site IV, we developed a template-free design protocol, which we named
TopoBuilder. In contrast to adapting an existing topology to accommodate the epitope, the design goal is to build protein scaffolds around the epitope from scratch, using idealized secondary structures (beta strands and alpha helices). The length, orientation and 3D-positioning are defined by the user for each secondary structure with respect to the epitope, which is extracted from its native environment. The topologies built were designed to meet the following criteria: (1) Small, globular proteins with a high contact order between secondary structures and the epitope, to allow for stable folding and accurate stabilization of the epitope in its native conformation (2) Context mimicry, i.e. respecting shape constraints of the epitope in its native context (
For site 0, the short helix of S0_1.39 preceding the epitope loop segment was kept, and a third helix was placed on the backside of the epitope to: (1) provide a core to the protein and (2) allow for the proper connectivity between the secondary structures.
A total of three different orientations (45°, 0° and −45° degrees to the plane formed by site 0) were tested for the designed supporting alpha helix (
In the case of site IV, the known binding region to 101F (residues 428F-434F) was extracted from prefusion RSVF (PDB 4JWH). Three antiparallel beta strands, pairing with the epitope, plus an alpha helix on the buried side, were assembled around the 101F epitope. Three different configurations (45°, (−45°, 0°, 10°) and −45° degrees with respect to the β-sheet) were sampled parametrically for the alpha helix (
The structural sketches were used to generate C! distance constraints to guide Rosetta FunFolDes (4) folding trajectories. Around 25,000 trajectories were generated for each sketch. The newly generated backbones were further subjected to layer-based FastDesign (5), meaning that each amino acid position was assigned a layer (combining ‘core’, ‘boundary’, ‘surface’ and ‘sheet’ or ‘helix’) on the basis of its exposure and secondary structure type, that dictated the allowed amino acid types at that position.
After iterative cycles of sequence design, unconstraint FastRelax (9) (i.e sidechain repacking and backbone minimization) was applied over the designs to evaluate their conformational stability of the epitope region. After each relax cycle, structural changes of the epitope region were evaluated (epitope RMSD drift). Designs with epitope RMSD drifts higher than 1.2 Å were discarded. Designs were also ranked and selected according hydrophobic core packing (packstat score), with a cutoff of 0.5 for site 0 and 0.6 for the site IV design series, and a cavity volume of <50 Å3. Between 1,000 and 10,000 of the designed sequences were generated from this computational protocol. We evaluated sequence profiles for the designs, and encoded the critical positions combinatorially by assembling overlapping oligos. Upon PCR assembly, libraries were transformed in yeast and screened for antibody binding and stability as assessed by protease digestion assays (10-12).
All animal experiments were approved by the Veterinary Authority of the Canton of Vaud (Switzerland) according to Swiss regulations of animal welfare (animal protocol number 3074). Female Balb/c mice (6-week old) were purchased from Janvier labs. Immunogens were thawed on ice, mixed with equal volumes of adjuvant (2% Alhydrogel, Invivogen or Sigma Adjuvant System, Sigma) and incubated for 30 minutes. Mice were injected subcutaneously with 100 μl vaccine formulation, containing in total 10 μg of immunogen (equimolar ratios of each immunogen for Trivax immunizations). Immunizations were performed on day 0, 21 and 42. 100-200 μl blood were drawn on day 0, 14 and 35. Mice were euthanized at day 56 and blood was taken by cardiac puncture.
Twenty-one african green monkeys (AGM, 3-4 years) were divided into three experimental groups with at least two animals of each sex. AGMs were pre-screened as seronegative against prefusion RSVF (preRSVF) by ELISA. Vaccines were prepared 1 hour before injection, containing 50 μg preRSVF or 300 μg Trivax1 in 0.5 ml PBS, mixed with 0.5 ml alum adjuvant (Alhydrogel, Invivogen) for each animal. AGMs were immunized intramuscularly at day 0, 28, 56, and 84. Blood was drawn at days 14, 28, 35, 56, 63, 84, 91, 105 and 119.
The RSV neutralization assay was performed as described previously (13). Briefly, Hep2 cells were seeded in Corning 96-well tissue culture plates (Sigma) at a density of 40,000 cells/well in 100 μl of Minimum Essential Medium (MEM, Gibco) supplemented with 10% FBS (Gibco), L-glutamine 2 mM (Gibco) and penicillin-streptomycin (Gibco), and grown overnight at 37° C. with 5% CO2. Serial dilutions of heat-inactivated sera were prepared in MEM without phenol red (M0, Life Technologies, supplemented with 2 mM L-glutamine and penicillin/streptomycin) and were incubated with 800 pfu/well (final MOI 0.01) RSV-Luc (A2 strain carrying a luciferase gene) for 1 hour at 37° C. Serum-virus mixture was added to Hep2 cell layer, and incubated for 48 hours. Cells were lysed in lysis buffer supplemented with 1 μg/ml luciferin (Sigma) and 2 mM ATP (Sigma), and luminescence signal was read on a Tecan Infinite 500 plate reader. The neutralization curve was plotted and fitted using the GraphPad variable slope fitting model, weighted by 1/Y2.
Monomeric Trivax1 immunogens (S2_1, S0_1.39 and S4_1.5) were used to deplete the site 0, II and IV specific antibodies in immunized sera. HisPurTM Ni-NTA resin slurry (Thermo Scientific) was washed with PBS containing 10 mM imidazole. Approximately 1 mg of each immunogen was immobilized on Ni-NTA resin, followed by two wash steps to remove unbound scaffold. 60 μl of sera pooled from all animals within the same group were diluted to a final volume of 600 μl in wash buffer, and incubated overnight at 4° C. with 500 μl Ni-NTA resin slurry. As control, the same amount of sera was incubated with Ni-NTA resin that did not contain scaffolds. Resin was pelleted down at 13,000 rpm for 5 minutes, and the supernatant (depleted sera) was collected and then used for neutralization assays.
A SSM library was assembled by overhang PCR, in which 11 selected positions surrounding the epitope in the S4_1.1 design model were allowed to mutate to all 20 amino acids, with one mutation allowed at a time. Each of the 11 libraries was assembled by primers (Table 1) containing the degenerate codon ‘NNK’ at the selected position. All 11 libraries were pooled, and transformed into EBY-100 yeast strain with a transformation efficiency of 1×106 transformants.
Combinatorial sequence libraries were constructed by assembling multiple overlapping primers (Table 2) containing degenerate codons at selected positions for combinatorial sampling of hydrophobic amino acids in the protein core. The theoretical diversity was between 1×106 and 5×106. Primers were mixed (10 μM each), and assembled in a PCR reaction (55° C. annealing for 30 sec, 72° C. extension time for 1 min, 25 cycles). To amplify full-length assembled products, a second PCR reaction was performed, with forward and reverse primers specific for the full-length product. The PCR product was desalted, and transformed into EBY-100 yeast strain with a transformation efficiency of at least 1×107 transformants (14).
All genes of designed proteins were purchased as DNA fragments from Twist Bioscience, and cloned via Gibson assembly into either pET11b or pET21b bacterial expression vectors. Plasmids were transformed into E. coli BL21 (DE3) (Merck) and grown overnight in LB media. For protein expression, precultures were diluted 1:100 and grown at 37° C. until the OD600 reached 0.6, followed by the addition of 1 mM IPTG to induce expression. Cultures were harvested after 12-16 hours at 22° C. Pellets were resuspended in lysis buffer (50 mM Tris, pH 7.5, 500 mM NaCl, 5% Glycerol, 1 mg/ml lysozyme, 1 mM PMSF, 1 μg/ml DNase) and sonicated on ice for a total of 12 minutes, in intervals of 15 seconds sonication followed by 45 seconds pause. Lysates were clarified by centrifugation (20,000 rpm, 20 minutes) and purified via Ni-NTA affinity chromatography followed by size exclusion on a HiLoad 16/600 Superdex 75 column (GE Healthcare) in PBS buffer.
Plasmids encoding cDNAs for the heavy chain of IgG were purchased from Genscript and cloned into the pFUSE-CHIg-hG1 vector (Invivogen). Heavy and light chain DNA sequences of antibody fragments (Fab) were purchased from Twist Bioscience and cloned separately into the pHLsec mammalian expression vector (Addgene, #99845) via Gibson assembly. HEK293-F cells were transfected in a 1:1 ratio with heavy and light chains, and cultured in FreeStyle medium (Gibco) for 7 days. Supernatants were collected by centrifugation and purified using a 1 ml HiTrap Protein A HP column (GE Healthcare) for IgG expression and 5 ml kappa-select column (GE Healthcare) for Fab purification. Bound antibodies/Fabs were eluted with 0.1 M glycine buffer (pH 2.7), immediately neutralized by 1 M Tris ethylamine buffer (pH 9), and buffer exchanged to PBS.
The construct encoding the thermostabilized the preRSVF protein corresponds to the sc9-10 DS-Cav1 A149C Y458C S46G E92D S215P K465Q variant (referred to as DS2) reported by Joyce and colleagues (15). The sequence was codon-optimized for mammalian cell expression and cloned into the pHCMV-1 vector flanked with two C-terminal Strep-Tag II and one 8x His tag. Expression and purification were performed as described previously (13).
The full-length N gene derived from the human RSV strain Long, ATCC VR-26 (GenBank accession number AY911262.1) was PCR amplified and cloned into pET28a+ at Ncol-Xhol sites to obtain the pET-N plasmid. Immunogens presenting sites 0, II and IV epitopes were cloned into the pET-N plasmid at Ncol site as pET-S0_1.39-N, pET-S2_1.2-N and pET-S4_1.5-N, respectively. Expression and purification of the nanoring fusion proteins was performed as described previously (13).
The gene encoding Helicobacter pylori ferritin (GenBank ID: QAB33511.1) was cloned into the pHLsec vector for mammalian expression, with an N-terminal 6x His Tag. The sequence of the designed immunogens (S0_2.126 and S4_2.45) were cloned upstream of the ferritin gene, spaced by a GGGGS linker. Ferritin particulate immunogens were produced by co-transfecting a 1:1 stochiometric ratio of “naked” ferritin and immunogen-ferritin in HEK-293F cells, as previously described for other immunogen-nanoparticle fusion constructs (16). The supernatant was collected 7-days post transfection and purified via Ni-NTA affinity chromatography and size exclusion on a Superose 6 increase 10/300 GL column (GE).
RSVN and Ferritin-based nanoparticles were diluted to a concentration of 0.015 mg/ml. The samples were absorbed on carbon-coated copper grid (EMS, Hatfield, Pa., United States) for 3 mins, washed with deionized water and stained with freshly prepared 0.75% uranyl formate.
The samples were viewed under an F20 electron microscope (Thermo Fisher) operated at 200 kV. Digital images were collected using a direct detector camera Falcon III (Thermo Fisher) with the set-up of 4098×4098 pixels. The homogeneity and coverage of staining samples on the grid was first visualized at low magnification mode before automatic data collection. Automatic data collection was performed using EPU software (Thermo Fisher) at a nominal magnification of 50,000×, corresponding to pixel size of 2 Å, and defocus range from −1 μm to −2 μm.
CTFFIND4 program (17) was used to estimate the contrast transfer function for each collected image. Around 1000 particles were manually selected using the installed package XMIPP within SCIPION framework (18). Manually picked particles were served as input for XMIPP auto-picking utility, resulting in at least 10,000 particles. Selected particles were extracted with the box size of 100 pixels and subjected for three rounds of reference-free 2D classification without CTF correction using RELION-3.0 Beta suite (19).
20 μg of RSVF trimer was incubated overnight at 4° C. with 80 μg of Fabs (Motavizumab, D25 or 101F). For complex formation with all three monoclonal Fabs, 80 μg of each Fab was used. Complexes were purified on a Superose 6 Increase 10/300 column using an Akta Pure system (GE Healthcare) in TBS buffer. The main fraction containing the complex was directly used for negative stain EM. Purified complexes of RSVF and Fabs were deposited at approximately 0.02 mg/ml onto carbon-coated copper grids and stained with 2% uranyl formate. Images were collected with a field-emission FEI Tecnai F20 electron microscope operating at 200 kV. Images were acquired with an Orius charge-coupled device (CCD) camera (Gatan Inc.) at a calibrated magnification of ×34,483, resulting in a pixel size of 2.71 A. For the complexes of RSVF with a single Fab, approximately 2,000 particles were manually selected with Cryosparc2 (20). Two rounds of 2D classification of particle images were performed with 20 classes allowed. For the complexes of RSVF with D25, Motavizumab and 101F Fabs, approximately 330,000 particles were picked using Relion 3.0 (19) and subsequently imported to Cryosparc2 for two rounds of 2D classification with 50 classes allowed.
SPR measurements were performed on a Biacore 8K (GE Healthcare) with HBS-EP+ as running buffer (10 mM HEPES pH 7.4, 150 mM NaCl, 3 mM EDTA, 0.005% v/v Surfactant
P20, GE Healthcare). Ligands were immobilized on a CM5 chip (GE Healthcare #29104988) via amine coupling. Approximately 2000 response units (RU) of IgG were immobilized, and designed monomeric proteins were injected as analyte in two-fold serial dilutions. The flow rate was 30 μl/min for a contact time of 120 seconds followed by 400 seconds dissociation time. After each injection, surface was regenerated using 3 M magnesium chloride (101F as immobilized ligand) or 0.1 M Glycine at pH 4.0 (Motavizumab and D25 IgG as an immobilized ligand). Data were fitted using 1:1 Langmuir binding model within the Biacore 8K analysis software (GE Healthcare #29310604).
To quantify the epitope-specific antibody responses in bulk serum from immunized animals, we performed an SPR competition assay with the monoclonal antibodies (D25, Motavizumab and 101F) as described previously (13). Briefly, approximately 400 RU of prefusion RSVF were immobilized on a CM5 chip via amine coupling, and serum diluted 1:10 in running buffer was injected to measure the total response ((RUnon-blocked surface). After chip regeneration using 50 mM NaOH, the site 0/II/IV epitopes were blocked by injecting saturating amounts of either D25, Motavizumab, or 101F IgG, and serum was injected again to quantify residual response (RUblocked surface). The delta serum response (ΔSR) was calculated as follows:
ΔSR=RU(non-)blocked surface−RUBaseline
Percent blocking was calculated for each site as:
Size exclusion chromatography with an online multi-angle light scattering (MALS) device (miniDAWN TREOS, Wyatt) was used to determine the oligomeric state and molecular weight for the protein in solution. Purified proteins were concentrated to 1 mg/ml in PBS (pH 7.4), and 100 μl of sample was injected into a Superdex 75 300/10 GL column (GE Healthcare) with a flow rate of 0.5 ml/min, and UV280 and light scattering signals were recorded. Molecular weight was determined using the ASTRA software (version 6.1, Wyatt).
Far-UV circular dichroism spectra were measured using a Jasco-815 spectrometer in a 1 mm path-length cuvette. The protein samples were prepared in 10 mM sodium phosphate buffer at a protein concentration of 30 μM. Wavelengths between 190 nm and 250 nm were recorded with a scanning speed of 20 nm min−1 and a response time of 0.125 sec. All spectra were averaged 2 times and corrected for buffer absorption. Temperature ramping melts were performed from 25 to 90° C. with an increment of 2° C./min in presence or absence of 2.5 mM TCEP reducing agent. Thermal denaturation curves were plotted by the change of ellipticity at the global curve minimum to calculate the melting temperature (Tm).
Libraries of linear DNA fragments encoding variants of the designed proteins were transformed together with linearized pCTcon2 vector (Addgene #41843) based on the protocol previously described by Chao and colleagues (14). Transformation procedures generally yielded ˜107 transformants. The transformed cells were passaged twice in SDCAA medium before induction. To induce cell surface expression, cells were centrifuged at 7,000 r.p.m. for 1 min, washed with induction media (SGCAA) and resuspended in 100 ml SGCAA with a cell density of 1×107 cells/ml SGCAA. Cells were grown overnight at 30° C. in SGCAA medium. Induced cells were washed in cold wash buffer (PBS+0.05% BSA) and labelled with various concentration of target IgG or Fab (101F, D25, and 5C4) at 4° C. After one hour of incubation, cells were washed twice with wash buffer and then incubated with FITC-conjugated anti-cMyc antibody and PE-conjugated anti-human Fc (BioLegend, #342303) or PE-conjugated anti-Fab (Thermo Scientific, #MA1-10377) for an additional 30 min. Cells were washed and sorted using a SONY SH800 flow cytometer in ‘ultra-purity’ mode. The sorted cells were recovered in SDCAA medium, and grown for 1-2 days at 30° C. In order to select stably folded proteins, we washed the induced cells with TBS buffer (20 mM Tris, 100 mM NaCl, pH 8.0) three times and resuspended in 0.5 ml of TBS buffer containing 1 μM of chymotrypsin. After incubating five-minutes at 30° C., the reaction was quenched by adding 1 ml of wash buffer, followed by five wash steps. Cells were then labelled with primary and secondary antibodies as described above.
96-well plates (Nunc MediSorp platesf Thermo Scientific) were coated overnight at 4° C. with 50 ng/well of purified antigen (recombinant RSVF or designed immunogen) in coating buffer (100 mM sodium bicarbonate, pH 9) in 100 μl total volume. Following overnight incubation, wells were blocked with blocking buffer (PBS +0.05% Tween 20 (PBST) containing 5% skim milk (Sigma)) for 2 hours at room temperature. Plates were washed five times with PBST. 3-fold serial dilutions were prepared and added to the plates in duplicates, and incubated at room temperature for 1 hour. After washing, anti-mouse (abcam, #99617) or anti-monkey (abcam, #112767) HRP-conjugated secondary antibody were diluted 1:1,500 or 1:10,000, respectively, in blocking buffer and incubated for 1 hour. An additional five wash steps were performed before adding 100 μl/well Pierce TMB substrate (Thermo Scientific). The reaction was terminated by adding an equal volume of 2 M sulfuric acid. The absorbance at 450 nm was measured on a Tecan Safire 2 plate reader, and the antigen specific titers were determined as the reciprocal of the serum dilution yielding a signal two-fold above the background.
Protein samples for NMR were prepared in 10 mM sodium phosphate buffer, 50 mM sodium chloride at pH 7.4 with the protein concentration of 500 μM. All NMR experiments were carried out in a 18.8T (800 MHz proton Larmor frequency) Bruker spectrometer equipped with a CPTC 1H,13C,15N 5 mm cryoprobe and an Avance III console. Experiments for backbone resonance assignment consisted in standard triple resonance spectra HNCA, HN(CO)CA, HNCO, HN(CO)CA, CBCA(CO)NH and HNCACB acquired on a 0.5 mM sample doubly labelled with 13C and 15N (21). Sidechain assignments were obtained from HCCH-TOCSY experiments acquired on the same sample plus HNHA, NOESY-15N-HSQC and TOCSY-15N-HSQC acquired on a 15N-labeled sample. The NOESY-15N-HSQC was used together with a 2D NOESY collected on an unlabelled sample for structure calculations.
Spectra for backbone assignments were acquired with 40 increments in the 15N dimension and 128 increments in the 13C dimension, and processed with 128 and 256 points by using linear prediction. HCCH-TOCSY were recorded with 64-128 increments in the 13C dimensions and processed with twice the number of points. 15N-resolved NOESY and TOCSY spectra were acquired with 64 increments in 15N dimension and 128 in the indirect 1H dimension, and processed with twice the number of points. 1H-1H 2D-NOESY and 2D TOCSY spectra were acquired with 256 increments in the indirect dimension, processed with 512 points. Mixing times for NOESY spectra were 100 ms and TOCSY spin locks were 60 ms. Heteronuclear 1H-15N NOE was measured with 128 15N increments processed with 256 points, using 64 scans and a saturation time of 6 seconds. All samples were prepared in 20 mM phosphate buffer pH 7, with 10% 2H2O and 0.2% sodium azide to prevent sample degradation.
All spectra were acquired and processed with Bruker's TopSpin 3.0 (acquisition with standard pulse programs) and analyzed manually with the program CARA (http://cara.nmr.ch/doku.php/home) to obtain backbone and sidechain resonance assignments. Peak picking and assignment of NOESY spectra (a 15N-resolved NOESY and a 2D NOESY) were performed automatically with the program UNIO-ATNOS/CANDID (22, 23) coupled to Cyana 2.1 (24), using standard settings in both programs. The run was complemented with dihedral angles derived from chemical shifts with Talos-n (25).
After overnight incubation at 4° C., the S0_2.126/D25 Fab complex was purified by size exclusion chromatography using a Superdex200 26 600 (GE Healthcare) equilibrated in 10 mM Tris pH 8, 100 mM NaCl and subsequently concentrated to ˜10 mg/ml (Amicon Ultra-15, MWCO 3,000). Crystals were grown at 291K using the sitting-drop vapor-diffusion method in drops containing 1 μl purified protein mixed with 1 μl reservoir solution containing 10% PEG 8000, 100 mM HEPES pH 7.5, and 200 mM calcium acetate.
For cryo protection, crystals were briefly swished through mother liquor containing 20% ethylene glycol.
Diffraction data was recorded at ESRF beamline ID30B. Data integration was performed by XDS (26) and a high-resolution cut at I/σ=1 was applied. The dataset contained a strong off-origin peak in the Patterson function (88% height rel. to origin) corresponding to a pseudo translational symmetry of 1/2, 0, 1/2. The structure was determined by the molecular replacement method using PHASER (27) using the D25 structure (1) (PDB ID 4JHVV) as a search model. Manual model building was performed using Coot (28), and automated refinement in Phenix (29). After several rounds of automated refinement and manual building, paired refinement (30) determined the resolution cut-off for final refinement.
The complex of S4_2.45 with the F101 Fab was prepared by mixing two proteins in 2:1 molar ratio for 1 hour at 4° C., followed by size exclusion chromatography using a Superdex-75 column. Complexes of S4_2.45 with the 101F Fab were verified by SDS-PAGE. Complexes were subsequently concentrated to 6-8 mg/ml. Crystals were grown using hanging drops vapor-diffusion method at 20° C. The S4_2.45/101F protein complex was mixed with equal volume of a well solution containing 0.2 M Magnesium acetate, 0.1 M Sodium cacodylate pH 6.5, 20%(w/v) PEG 8000. Native crystals were transferred to a cryoprotectant solution of 0.2 M Magnesium acetate, 0.1 M Sodium cacodylate pH 6.5, 20% (w/v) PEG 8000 and 15% glycerol, followed by flash-cooling in liquid nitrogen.
Diffraction data were collected at SSRL facility, BL9-2 beamline at the SLAC National Accelerator Laboratory. The crystals belonged to space group P3221. The diffraction data were initially processed to 2.6 Å with X-ray Detector Software (XDS) (Table 9). Molecular replacement searches were conducted with the program PHENIX PHASER using 101F Fab model (PDB ID: 3041) and S4_2.45/101F Fab computational model generated from superimposing epitope region of S4_2.45 with the peptide-bound structure (PDB ID: 3041), and yielded clear molecular replacement solutions. Initial refinement provided a Rfree of 42.43% and Rwork of 32.25% and a complex structure was refined using Phenix Refine, followed by manual rebuilding with the program COOT. The final refinement statistics, native data and phasing statistics are summarized in Table 9.
After sorting, yeast cells were grown overnight, pelleted and plasmid DNA was extracted using Zymoprep Yeast Plasmid Miniprep II (Zymo Research) following the manufacturer's instructions. The coding sequence of the designed variants was amplified using vector-specific primer pairs, Illumina sequencing adapters were attached using overhang PCR, and PCR products were desalted (Qiaquick PCR purification kit, Qiagen). Next generation sequencing was performed using an Illumina MiSeq 2×150 bp paired end sequencing (300 cycles), yielding between 0.45-0.58 million reads/sample.
For bioinformatic analysis, sequences were translated in the correct reading frame, and enrichment values were computed for each sequence. We defined the enrichment value E as follows:
The high selective pressure corresponds to low labelling concentration of the respective target antibodies (100 μM D25, 10 nM 5C4 or 20 μM 101F, as shown in
indicates data missing or illegible when filed
indicates data missing or illegible when filed
indicates data missing or illegible when filed
Number | Date | Country | Kind |
---|---|---|---|
19183026.4 | Jun 2019 | GB | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/GB2020/051581 | 7/1/2020 | WO |