All patents, patent applications and publications cited herein are hereby incorporated by reference in their entirety. The disclosures of these publications in their entireties are hereby incorporated by reference into this application in order to more fully describe the state of the art as known to those skilled therein as of the date of the invention described and claimed herein.
This patent disclosure contains material that is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure as it appears in the U.S. Patent and Trademark Office patent file or records, but otherwise reserves any and all copyright rights.
The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on May 21, 2021, is named 2933311-041WO1_SL and is 1,540,060 bytes in size.
The invention relates, in general, to modified SARS-CoV-2 proteins, nucleic acids encoding these, methods of making recombinant proteins and nucleic acids, compositions comprising these and their use in vaccination regimens, and diagnostic assays.
The ongoing global pandemic of the new SARS-CoV-2 coronavirus presents an urgent need for the development of effective preventative and treatment therapies.
Development of an effective vaccine for prevention of coronavirus (SARS-2) infection is a global priority.
In certain aspects the invention provides SARS-CoV-2 (“SARS-2”) spike protein designs. In certain embodiments, the protein design provides a stabilized protein conformation(s) of the SARS-2 spike protein trimer. In non-limiting embodiments, the modified SARS-2 spike protein comprising S383C D985C (rS2d) amino acid changes as described in
In certain embodiments, the inventive designs are recombinant proteins. In certain embodiments, the inventive designs are nucleic acids. Nucleic acids include without limitation modified mRNAs.
In certain aspects, the invention provides modified SARS-2 spike proteins, for example but not limited in a stabilized conformation, nucleic acid molecules and vectors encoding these proteins, and methods of their use and production are disclosed. In several embodiments, the modified SARS-2 spike proteins and/or nucleic acid molecules can be used to generate an immune response to coronavirus in a subject. In some embodiments, the proteins and/or nucleic acid molecules can be used to generate an immune response to SARS-2 in a subject. In additional embodiments, the therapeutically effective amount of the modified SARS-2 spike proteins and/or nucleic acid molecules can be administered to a subject in a method of treating or preventing coronavirus infection. In some embodiments, the proteins and/or nucleic acid molecules can be administered to a subject in a method of treating or preventing SARS-2 infection. In certain embodiments, the proteins of the invention can be used in diagnostic assays.
In certain aspects, the invention provides coronavirus (e.g. SARS-2) S protein ectodomain trimers in a stabilized conformation, nucleic acid molecules and vectors encoding these proteins, and methods of their use and production. In several embodiments, the coronavirus (e.g. SARS-2) S protein ectodomain trimers and/or nucleic acid molecules can be used to generate an immune response to coronavirus in a subject. In some embodiments, the proteins and/or nucleic acid molecules can be used to generate an immune response to SARS-2 in a subject. In additional embodiments, the therapeutically effective amount of the coronavirus (e.g. SARS-2) S protein ectodomain trimers and/or nucleic acid molecules can be administered to a subject in a method of treating or preventing coronavirus infection. In some embodiments, the proteins and/or nucleic acid molecules can be administered to a subject in a method of treating or preventing SARS-2 infection. In certain embodiments, the proteins of the invention can be used in diagnostic assays.
In certain embodiments, the modified SARS-2 spike proteins do not include modification as described in US Patent Publication 20200061185. In certain embodiments the modified SARS-2 spike proteins do not include the two proline modification (K986P+V987P (2P)) substitutions in the S2 domain. See Edwards et al. Nature Structural & Molecular Biology volume 28, pages 128-131(2021) and references therein.
The invention provides amino acid or nucleic acids sequences encoding such spike protein designs. Provided are also nucleic acids, including modified mRNAs which are stable and can be used as immunogens. Non-limiting embodiments include recombinant proteins, trimers, multimerized proteins, e.g. but not limited to nanoparticles. Provided also are nucleic acids optionally designed as vectors, for example for recombinant expression and/or stable integration, e.g. but not limited to, a DNA encoding trimer for stable expression, or virus-like particle (VLP) incorporation. In non-limiting embodiments a DNA encodes a SARS-2 spike protein for stable expression. In non-limiting embodiments a DNA encodes a SARS-2 spike protein for stable expression as a protomer which trimerizes to form a SARS-2 spike protein trimer. In non-limiting embodiments, nucleic acids are mRNA, including but not limited to modified mRNA which are used immunogens. Modified mRNAs can be formulated in any suitable formulation, including but not limited to lipidnanoparticles (LNPs) and/or liposomes.
In some embodiments a protein design is based on SARS-2 spike protein if it is characterized as having 99%, 98%, 97%, 96%, 95%, 94%, 93%, 92%, 91%, 90%, 89%, 88%, 87%, 86%, 85%, 84%, 83%, 82%, 81%, 80%, 79%, 78%, 77%, 76%, or 75% similarity or identity to the designs described herein.
In non-limiting embodiments the invention provides SARS-2 S protein trimers stabilized in a prefusion conformation, nucleic acid molecules and vectors encoding these proteins, and methods of their use and production. In several embodiments, the SARS-2 S protein trimers and/or nucleic acid molecules can be used to generate an immune response to coronavirus in a subject. In some embodiments, the protein trimers and/or nucleic acid molecules can be used to generate an immune response to SARS-2 in a subject. In additional embodiments, the therapeutically effective amount of the SARS-2 S protein trimers and/or nucleic acid molecules can be administered to a subject in a method of treating or preventing coronavirus infection. In some embodiments, the protein trimers and/or nucleic acid molecules can be administered to a subject in a method of treating or preventing SARS-2 infection.
In certain aspects, the invention provides a modified SARS-2 spike protein comprising a sequence modified with amino acid changes as described herein. In certain aspects, the invention provides a recombinant, non-naturally occurring SARS-2 spike protein comprising a sequence modified with amino acid changes as described herein. Non-limiting embodiments of sequences are shown in
In certain aspects, the invention provides a recombinant SARS-2 spike protein comprising all the consecutive amino acids after the signal peptide of the amino acid sequences described herein. For specific non-limiting embodiments of sequences see
In certain aspects, the invention provides a nucleic acid encoding the modified SARS-2 spike protein described herein. In non-limiting embodiments, the nucleic acid is a modified mRNA. In certain embodiments, the mRNA is in a composition comprising LNPs. In certain embodiments, the mRNA is in a composition comprising liposomes.
In certain embodiments, the nucleic acid is comprised in a vector and is operably linked to a promoter.
In certain embodiments the sequence is modified with modifications described as Clusters 1-11. In certain embodiments the design can comprise any combination of modifications within any one Cluster, and/or combination of modifications from any of the modifications from any one of Clusters 1-11 (
In certain embodiments, the invention provides modified SARS-2 spike protein comprising any combination of modifications within any one Cluster and further comprising N165A variation or the N234A variation as described in Example 4. Additional stabilizing mutations can be added to these modified SARS-2 spike designs.
Any one of the modifications described herein can be engineered in a full length SARS-2 S sequence or in a fragment, e.g. but not limited to the ectodomain.
In certain aspects the invention provides a composition comprising a recombinantly produced modified SARS-2 spike protein of any one of the claims and a carrier. In certain embodiments, the compositions are immunogenic. In certain embodiments the compositions comprised an adjuvant. Any suitable adjuvant can be used.
In certain aspects the invention provides a composition comprising a nucleic acid encoding any of the modified SARS-2 spike proteins and a carrier. Non-limiting embodiments of nucleic acids are shown in
In certain embodiments the designs produce a soluble protein. In certain embodiments the designs are comprised in a protomer which can form a trimer. In certain embodiments the designs comprise a transmembrane (TM) domain.
In certain embodiments the compositions comprise a SARS-2 S ectodomain trimer comprising protomers comprising sequence modification as described here in.
In non-limiting embodiments, the designs comprise additional modifications to allow multimerization. In non-limiting embodiments, wherein the design comprises a soluble ectodomain, additional modifications can be included to allow multimerization. In a non-limiting embodiment, a C-terminal residue of the protomers in the ectodomain of the modified SARS-2 spike protein is linked to a trimerization domain by a peptide linker, or is directly linked to the trimerization domain. In some embodiments, the trimerization domain is a T4 fibritin trimerization domain. In one example, a T4 fibritin trimerization domain comprises the amino acid sequence set forth as GYIPEAPRDGQAYVRKDGEWVLLSTF (SEQ ID NO: 1). In some embodiments, a protease cleavage site (such as a thrombin cleavage site) can be included between the C-terminus of the recombinant SARS-2 spike protein ectodomain and the T4 fibritin trimerization domain to facilitate removal of the trimerization domain as needed, for example, following expression and purification of the recombinant SARS-2 S ectodomain.
In certain embodiments, the modified SARS-2 spike protein further comprises one or more additional amino acid substitutions that stabilize the recombinant ectodomain trimer in the prefusion conformation.
In certain embodiments, the modified SARS-2 spike designs further comprise furin protease cleavage sites and/or a cathepsin L cleavage site. In certain embodiments, the modified SARS-2 spike protein trimer is soluble.
In certain embodiments, a C-terminal residue of the protomers in the ectodomain of the modified SARS-2 spike protein is linked to a transmembrane domain by a peptide linker, or is directly linked to the transmembrane domain. In certain embodiments, the modified SARS-2 spike protein is linked to form a protein nanoparticle subunit by a peptide linker, or is directly linked to the protein nanoparticle subunit. In certain embodiments, the protein nanoparticle subunit is a ferritin nanoparticle subunit.
In certain aspects the invention provides a protein nanoparticle, comprising any one of the protein immunogens of the invention.
In certain aspects the invention provides a virus-like particle comprising any one of the immunogens of the invention.
In certain aspects the invention provides an isolated nucleic acid molecule encoding a protomer of the modified SARS-2 spike protein of the invention. In certain embodiments, the nucleic acid molecule is operably linked to a promoter. In certain embodiments, the nucleic acid molecule is an RNA molecule.
In certain aspects, the invention provides a vector comprising a nucleic acid molecule encoding any one of the inventive proteins. In certain embodiments, the vector is a viral vector.
In certain aspects, the invention provides an immunogenic composition comprising any one of the proteins and/or nucleic acids of the invention, and a pharmaceutically acceptable carrier.
In certain aspects, the invention provides a method of producing a recombinant SARS-2 spike protein of the invention, comprising: expressing the nucleic acid molecule or vector comprising a nucleic acid encoding in a host cell to produce the recombinant protein, which in certain embodiments is a trimer; and purifying the recombinant protein.
In certain aspects, the invention provides a recombinant cell comprising a nucleic acid encoding the modified SARS-2 spike protein of the invention.
In certain aspects the invention provides a method for generating an immune response to an SARS-CoV-2 in a subject, comprising administering to the subject an effective amount of any one of the immunogens, wherein the immunogen is a recombinant protein, a nucleic acid, and/or a combination thereof to induce an immune response. In certain embodiments, the recombinant protein is formulated with any suitable adjuvant. In certain embodiments, the nucleic acid is DNA which can be administered by any suitable method. In certain embodiments, the nucleic acid is an mRNA, which can be administered by any suitable methods. In certain embodiments, the mRNA is formulated in an LNP. In certain embodiments, the mRNA is formulated in a liposome LNP. A skilled artisan can readily determine the dose and number of immunizations needed to induce immune response. Various assays are known and used in the art to measure to level, breadth and durability of the induced immune response.
In certain aspects the invention provides modified coronavirus spike proteins designs including but not limited to protein designs comprising spike protein and/or various spike portions/domains from SARS-CoV-2 (SARS-2), SARS-CoV-1 (CoV1), MERS, or any other coronavirus spike protein, wherein in certain embodiments these proteins are designed to form multimeric complexes. The invention provides amino acid and nucleic acid sequences of recombinant coronavirus spike proteins or portions thereof, wherein in certain embodiments these spike proteins or portions/domains are multimerized, and can be used as an antigen to induce an immunogenic response. In some embodiments the antigen comprises any suitable portion from a spike protein. In non-limiting embodiments are portions of the spike protein which comprise epitopes conserved between different coronaviruses. In some embodiments the antigen comprises RBD domain from a spike protein. In some embodiments the antigen comprises NTD domain from a spike protein. In some embodiments the antigen comprises FP domain from a spike protein. The sequence of the spike protein is any suitable sequence coronavirus sequence including without limitation SARS-CoV1, SARS-CoV2, MERS, bat coronavirus, pangolin or other animal coronaviruses. In non-limiting embodiments the spike protein sequences comprise any variation in amino acid sequences, including without limitation Wuhan SARS-CoV2 sequence, UK SARS-CoV2 variant B.1.1.7, South African variant 1.351, US SARS-CoV-2 variants with L452R mutations and Brazilian variant P.1. Additional SARS-2 spike protein sequences from circulating viruses are found in the GISAID EpiFlu™ Database. These sequences can also be designed with any of the modifications described herein. In certain embodiments, the immune response treats, prevents or inhibits infection with the SARS-CoV-2. In certain embodiments, the immune response generated by the immunogens inhibits replication of the SARS-CoV-2 in the subject.
In certain aspects the invention provides a modified SARS-2 spike protein sequence or amino acid sequence encoding the same, wherein the protein sequence comprising amino acid changes as described in
In certain aspects the invention provides a recombinant SARS-2 spike protein comprising all the consecutive amino acids after the signal peptide of the polypeptide sequences in
In certain aspects the invention provides a nucleic acid encoding the modified SARS-2 spike protein of any of the preceding claims.
In certain embodiments, the nucleic acid of any of the preceding claims is a modified mRNA. In certain embodiments, the mRNA is in a composition comprising LNPs.
In certain embodiments, the nucleic acid is comprised in a vector and is operably linked to a promoter.
In certain aspects, the invention provides a composition comprising a recombinantly produced modified SARS-2 spike protein of any one of the claims and a carrier. In certain embodiments, the compositions comprise a trimer comprising protomers with amino changes as described herein. In certain embodiments, the compositions are immunogenic. In certain embodiments the compositions comprised an adjuvant. Any suitable adjuvant can be used.
In certain aspects, the invention provides a composition comprising a nucleic acid encoding any of the modified SARS-2 spike proteins and a carrier.
In certain aspects, the invention provides a protein nanoparticle, comprising any one of the protein immunogens of the invention. In certain embodiments, the protein nanoparticle subunit is a ferritin nanoparticle subunit.
In certain aspects, the invention provides a virus-like particle comprising any one of the immunogens of the invention.
In certain aspects, the invention provides a host cell comprising a nucleic acid molecule encoding a modified SARS-2 spike protein of the invention.
In certain aspects, the invention provides a method of producing a recombinant SARS-2 protein of the invention, comprising: expressing the nucleic acid molecule or vector comprising a nucleic acid encoding in a host cell to produce the recombinant protein, which in certain embodiments is a trimer; and purifying the recombinant protein.
In certain aspects, the invention provides an immunogenic composition comprising any one of the proteins, nucleic acids, nanoparticle or VLP of the preceding claims and a pharmaceutically acceptable carrier. The immunogenic composition of the preceding claim, further comprising an adjuvant.
In certain aspects, the invention provides a method for inducing an immune response to an SARS-2 in a subject, comprising administering to the subject an effective amount of any one of the immunogens and/or the immunogenic composition of the preceding claims to induce an immune response.
In certain aspects, the invention provides a modified SARS-2 spike protein comprising the amino acid sequence of the N165A variant or the N234A variant.
In certain aspects, the invention provides a recombinant SARS-2 spike protein comprising all the consecutive amino acids after the signal peptide of a modified SARS-2 spike protein comprising the amino acid sequence of the N165A variant or the N234A variant.
In certain aspects, the invention provides a nucleic acid encoding a modified SARS-2 spike protein of the invention or a recombinant SARS-2 spike protein of the invention.
In certain embodiments, the nucleic acid of the invention is comprised in a vector and is operably linked to a promoter. In certain embodiments, the nucleic acid of the invention is operably linked to a promoter suitable for in vitro mRNA expression.
In certain embodiments, a nucleic acid of the invention is a modified mRNA. In certain embodiments, the mRNA is in a composition comprising LNPs.
In certain aspects, the invention provides a composition comprising a recombinantly produced modified SARS-2 spike protein, or a nucleic acid encoding a recombinant protein of the invention and a carrier. In certain embodiments, the compositions comprise a trimer comprising protomers with amino changes as described herein. In certain embodiments, the compositions are immunogenic. In certain embodiments the compositions comprised an adjuvant. Any suitable adjuvant can be used.
In certain aspects the invention provides a protein nanoparticle, comprising a modified recombinant SARS-2 spike protein of the invention. In certain embodiments, the protein nanoparticle subunit is a ferritin nanoparticle subunit.
In certain aspects the invention provides a virus-like particle, comprising a modified recombinant SARS-2 spike protein of the invention.
In certain aspects the invention provides a host cell comprising a nucleic acid molecule encoding the modified SARS-2 spike protein of the invention. In certain aspects the invention provides an in vitro transcription reaction comprising a nucleic acid encoding anyone of the modified SARS-2 spike protein of the invention and reagents suitable for carrying out the in vitro transcription reaction to produce mRNA, including without limitation modified mRNA.
In certain aspects, the invention provides methods of producing a modified SARS-2 spike protein of the invention, comprising: expressing the nucleic acid molecule or vector comprising a nucleic acid encoding in a host cell to produce the recombinant protein, which in certain embodiments is a trimer; and purifying the recombinant protein.
In certain aspects, the invention provides an immunogenic composition comprising any one of the proteins, nucleic acids, nanoparticle or VLP of the invention and a pharmaceutically acceptable carrier. In certain embodiments, the immunogenic composition further comprises an adjuvant.
In certain aspects the invention provides, a method for inducing an immune response to an SARS- 2 in a subject, comprising administering to the subject an effective amount of any one of the immunogens and/or the immunogenic composition of the invention in an amount and manner sufficient to induce an immune response.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
Vaccine-induced neutralizing antibodies against pseudotyped (top panels) or live (bottom panels) SARS-CoV-2 viruses. ID50, inhibitory dilutions at which 50% viruses were neutralized. Each dot indicates one animal, and the bars show geometric means. Pseudovirus assays were performed in 293T/ACE2 cells, and live SARS-CoV-2 microneutralization assays were performed in Vero cells.
The invention provides proteins and nucleic acids, including modified mRNAs which are stable and can be used as immunogens. Provided also are nucleic acids optionally designed as vectors, for example for recombinant expression and/or stable integration, e.g. but not limited, full-length S protein DNA encoding trimer for stable expression, or VLP incorporation.
Detailed descriptions of one or more preferred embodiments are provided herein. It is to be understood, however, that the present invention may be embodied in various forms. Therefore, specific details disclosed herein are not to be interpreted as limiting, but rather as a basis for the claims and as a representative basis for teaching one skilled in the art to employ the present invention in any appropriate manner.
The singular forms “a”, “an” and “the” include plural reference unless the context clearly dictates otherwise. The use of the word “a” or “an” when used in conjunction with the term “comprising” in the claims and/or the specification may mean “one,” but it is also consistent with the meaning of “one or more,” “at least one,” and “one or more than one.”
Wherever any of the phrases “for example,” “such as,” “including” and the like are used herein, the phrase “and without limitation” is understood to follow unless explicitly stated otherwise. Similarly, “an example,” “exemplary” and the like are understood to be nonlimiting.
The term “substantially” allows for deviations from the descriptor that do not negatively impact the intended purpose. Descriptive terms are understood to be modified by the term “substantially” even if the word “substantially” is not explicitly recited.
The terms “comprising” and “including” and “having” and “involving” (and similarly “comprises”, “includes,” “has,” and “involves”) and the like are used interchangeably and have the same meaning. Specifically, each of the terms is defined consistent with the common United States patent law definition of “comprising” and is therefore interpreted to be an open term meaning “at least the following,” and is also interpreted not to exclude additional features, limitations, aspects, etc. Thus, for example, “a process involving steps a, b, and c” means that the process includes at least steps a, b and c. Wherever the terms “a” or “an” are used, “one or more” is understood, unless such interpretation is nonsensical in context.
The term “about” is used herein to mean approximately, roughly, around, or in the region of. When the term “about” is used in conjunction with a numerical range, it modifies that range by extending the boundaries above and below the numerical values set forth. In general, the term “about” is used herein to modify a numerical value above and below the stated value by a variance of 20 percent up or down (higher or lower).
SARS-2 Coronavirus S Protein Designs
The ongoing global pandemic of the new SARS-CoV-2 coronavirus presents an urgent need for the development of effective preventative and treatment therapies. The viral-host cell fusion (S) protein spike is a prime target for such therapies owing to its critical role in the virus lifecycle. The S protein is divided into two regions: the N-terminal S1 domain that caps the C-terminal S2 fusion domain. Binding to host receptor via the Receptor Binding Domain (RBD) in S1 is followed by proteolytic cleavage of the spike by host proteases. Large conformational changes in the S-protein result in S1 shedding and exposure of the fusion machinery in S2. Class I fusion proteins such as the coronavirus (CoV) S protein that undergo large conformational changes during the fusion process must, by necessity, be highly flexible and dynamic. Indeed, cryo-EM structures of the SARS-CoV-2 (SARS-2) spike protein reveal considerable flexibility and dynamics in the S1 domain1,2, especially around the RBD that exhibits two discrete conformational states—a “down” state that is shielded from receptor binding, and an “up” state that is receptor-accessible. We will use our robust, high-throughput computational and experimental pipeline to define the detailed trajectory of the “down” to “up” transition of the SARS-2 S protein, identify early metastable intermediates in the fusion pathway, and exploit their structures and dynamics for identifying drug and vaccine candidates that target SARS-2.
A wealth of structural information on CoV spike proteins, including recently determined cryo-EM structures of the SARS-2 spike1-11, provides a rich source of detailed data from which to begin precise examination of macromolecular transitions underlying triggering of this fusion machine. In certain aspects the invention provides that(a) analysis quantifying CoV S1 domain movements around which structurally conserved domains undergo rigid body motions, (b) in silico, prescreened panel of differentially domain position stabilizing mutations, and (c) integrated computational and experimental approach with unprecedented, dedicated access to >300 accelerated compute devices (GPUs), rapid and priority access to a K3 direct electron detector equipped Titan Krios electron microscope, and high-throughput structural determination pipeline. Together, this puts us in a unique position to provide atomically detailed mechanistic insight into the fusion mechanism of the SARS-2 virus. The scientific premise of this study is that understanding the structural dynamics and early transition kinetics of mobile regions of the SARS-2 spike will allow optimal control of vaccine and drug responses, and facilitate the development of new antiviral drugs and protective vaccines. The goal of this study is to define mechanistically-derived transition states of the pre-fusion SARS-2 spike that can be exploited for vaccine and drug design.
The invention is based on work to define domain motions in the pre-fusion SARS-2 spike. The idea is that while the RBD undergoes a dramatic “up” and “down” hinge motion, other subtle movements in the pre-fusion SARS-2 spike play an important role in defining antibody and ligand binding specificity. Analysis of CoV S protein structures revealed subtle shifts in S1 that make and break interactions with adjacent domains, resulting in multistate or disordered behavior of the RBD in its “down” position. Here, we identify a set of mutations that lock and stabilize the SARS-2 S protein with the RBD in discrete “down” positions, each with different but specific RBD positions rather than the usual multistate behavior observed in all CoV spike structures determined to date. Deploying rapid assays to assess protein expression, thermostability, and antigenicity, we will generate a set of stabilized SARS-2 spike variants with defined reactivity to patient-sera. We will determine high-resolution cryo-EM structures to define the metastable RBD “down” state orientations in these mutants, and use the combined experimental information from structures, biochemistry and biophysics to iterate the structure-guided computational design cycle.
The invention is based on work to define the trajectory of the transition between the “down” and “up” states of the SARS-2 S protein. The idea is that the SARS-2 S protein transitions through multiple metastable intermediate states between the known “down” and “up” states. Using an integrated approach, we will interrogate the mechanism by which the SARS-2 S protein transitions from its “down” state to the receptor-accessible “up” state. Our initial examination of the available CoV S protein structures quantifies specific rigid body domain movements within each state. Using a combination of path finding and adaptive sampling molecular dynamics (MD) simulation techniques, we will develop a theoretical model of this initial triggering event. Structural details from the putative path will be used to stabilize predicted intermediate states. Provided are experiments to study the biochemical and biophysical properties of these putative intermediates, and determine their structures using high-resolution cryo-EM. We will assess the reactivity of these structures to patient-sera and known SARS-2 spike ligands to define state antigenicity.
In certain aspects the invention provides methods to determined structures of multiple “down”, “up”, and intermediate states of the SARS-2 S protein. Given the current global health emergency we will prioritize rapid dissemination of results to the community. Importantly, we will make available coordinates from the experimentally refined transition ensemble determined via MD simulation to enable close examination of the presented transition by researchers in the fields of drug and vaccine design. Overall, these studies will provide atomically detailed structural and mechanistic information that can be exploited for vaccine and therapeutics design.
On Mar. 11, 2020, the World Health Organization (WHO) characterized the ongoing spread of COVID-19, a highly contagious respiratory disease caused by the new betacoronavirus SARS-CoV-2 (SARS-2), a pandemic. Originating in the Wuhan province of China, now spread to over 100 countries, the virus has infected >150,000 individuals and caused >8000 deaths world-wide. As the virus continues to spread, there is an urgent need to understand as much as possible, as rapidly as possible, about this new virus.
The transmembrane SARS-2 S protein spike trimer (
The SARS-2 S protein displays striking structural similarities with the S proteins of the previously identified SARS-CoV, MERS-CoV, and other human and murine CoV viruses. However, most S-targeting antibodies to SARS and MERS do not cross-react with SARS-2. Conformational evasion is among the many host immune evasion tools available to viruses. Dramatic shifts in the conformational ensemble of states for CoVs have in fact been demonstrated1,2. Therefore, a detailed understanding of structure and dynamics of the SARS-2 S protein in comparison to is orthologs will reveal how genetic drift can give rise to the large phenotypic differences that drive viral evolution and host immune evasion.
Thus, the urgent need to understand the SARS-2 virus that is responsible for the ongoing pandemic makes this study significant and relevant to public health.
Provided are studies that use an integrated structural biology approach to harness the latest innovations in high-throughput cryo-electron microscopy and computational methodologies to approach this urgent global healthcare problem. These studies include use of Titan Krios microscope fitted with a K3 camera for rapid determination of high-resolution structures, access to a Philips EM420 microscope, as well as to a Talos Arctica for cryo-screening and data collection at the National Institutes for Environmental Health Sciences (NIEHS), NIH.
Studies will be able to test immunogenicity in mice and rabbits of any promising SARS-2 spike variants generated in this study.
In non-limiting embodiments, aspects of the invention are based on the idea that protein dynamics impact its antigenic and immunogenic properties. Coronavirus designs are based on an integrated approach that closely couples structure and molecular dynamics-driven protein engineering with biophysical, biochemical, virological and immunological studies.
Conformationally distinct structural states of the CoV S-protein spike are well defined. The transmembrane CoV S protein spike trimer is composed of interleaved protomers that include an N-terminal receptor binding S1 domain and a C-terminal S2 domain that contains the fusion elements (
A detailed structural schematic defining the geometry and internal rearrangements of movable domains. An understanding of macromolecular structural dynamics requires a precise definition of structural states. Examination of the available SARS and MERS S-protein structures revealed: 1) the NTD, RBD, subdomains and internal S2 domains move as rigid bodies and 2) these domains display a remarkable array of relative shifts between the S1 region's domains and the S2 region's β-sheet motif and CD (
SARS-2 S Protein Production, Purification and Structural Characterization
The SARS-2 S protein ectodomain2 was expressed in 293F cells and purified using published methods to yield ˜4 mg/L purified spike (
Advanced molecular simulation results for the SARS ectodomain spike indicate rapid exchange between metastable states. In order to examine the breadth and time scales of the dynamics of CoV spike protein structure, we initiated an adaptive sampling simulation of the symmetric all ‘down’, closed state of the SARS CoV soluble S-protein (PDB ID 6ACC6). To overcome the sampling problem in MD, the adaptive scheme periodically monitors multiple simultaneous simulations and launches additional simulations in regions of the coordinate space along transition paths. In this way, difficult to observe slow processes become accessible. In total, we obtained 539 independent 50 ns simulations totaling ˜27 μs of simulation time. Monitoring contacts between each protomer's RBD and their adjacent RBD, NTD, and HR1 C-terminus, we further projected the data using the time-lagged independent component analysis (TICA) approach. TICA components point in the direction of the slowest processes in the simulation data which means that transitions along the so call TICs can correspond to transitions between metastable states (
In certain aspects the invention provides methods to define symmetric and asymmetric down state domain arrangements in the SARS-2 S protein. Our analysis of the available CoV S-protein structures reveals a wide breadth of conformational states. We therefore ask the following questions: 1) Is it possible to eliminate or markedly reduce S1 flexibility? 2) How does the stabilizing strategy affect distant domain arrangements? 3) The MERS spike domain arrangement is distinct from SARS and SARS- 2; is it possible to insert MERS residue substitutions in SARS-2 to induce this arrangement? 4) Does a change in domain arrangement impact ectodomain antigenicity? In order to answer these questions, we designed differentially stabilized the S-protein domains. To this end, we have screened in silico, using the Schrödinger software suite, a large panel of mutations designed to stabilize specific regions of the S-protein (
Approach and Methods:
Small-scale transfections of plasmids encoding the mutated S-protein (
Testing of cell-culture supernatants for binding to 1) Streptavidin, in a biolayer interferometry (BLI)-based screen similar to that performed in our recent HIV-1 stabilization manuscript12, to determine expression levels and 2) to the ACE-2 receptor, and other RBD-reactive ligands such as antibodies CR3022 and 47D11 13-16, that will report on the disposition of the RBD within the spike. Supernatants from untransfected cells will be used as control.
Constructs showing optimal expression and certainACE-2 binding phenotypes will be purified using the PureSpeed (Mettler Toledo) IMAC based high-throughput purification system.
Purified proteins will then be characterized using SDS-PAGE, western blotting, rapid fluorescence-based thermostability assays 17-19 size exclusion chromatography (SEC) and NSEM.
Constructs with confirmed expression of the trimeric spike protein (SDS-PAGE, SEC and NSEM), and improved properties, e.g. but not limited to melting temperature at least 5° C. higher than the unmutated construct, will be selected as candidates for the next round of selection. For these constructs we will determine 1) ACE2 binding and affinity, 2) thermal stability via differential scanning calorimetry, and 3) high-resolution structures via cryo-EM. Collecting large cryo-EM datasets of at least 2 million particles for each mutated construct will allow heterogenous 3D classification. We will compare the structures with that of the unmutated construct, determining changes in residue-residue contacts, epitope shape and accessibility, shifts in the probability of finding the construct in any particular state, and measures of conformational shifts using our vector-based analysis.
Finally, we will test for differential changes in antigenicity of these constructs using sera from infected patient. This will provide selection criteria for subsequent studies.
Constructs will also be tested for immunogenicity in any suitable animal model, including without limitation mouse studies, NHP studies, and so forth.
Small-scale transfections can yield relatively small quantities of protein for some constructs. NSEM and fluorescence-based thermostability measurements require very small amount of protein (less than 10 μg), increasing the chances that most of the constructs will yield sufficient protein for the small-scale screens. For those constructs that do not, we will use a larger transfection volume. The ones that fail to express we will remove from our list. Structural determination by cryo-EM also requires very small amounts of protein thus ensuring for most constructs we will be able to obtain high resolution feedback on the designs. 2. Failed designs: Some of the designs may not show expected phenotypes, a risk inherent in this type approach. The large number of in silico designs we are starting with along with the high throughput assays, can allow us to quickly select the designs that show promise and rapidly iterate the experimental and design cycles. This approach has been successful in structure-guided vaccine design20,21. If a particular set fail to provide a stabilized construct, we will turn to the high-resolution cryo-EM structures determined in our heterogenous refinement of the unmutated construct to initiate additional design iterations.
At the successful conclusion of this study, without wishing to be bound by theory, we will provide a detailed, high-resolution mapping of conformational states occupied by the SAR-2 S-protein and shifts in conformational distribution with changes in domain interface interactions. Further, we will demonstrate the degree to which changes in the conformational distribution alter S-protein antigenicity. These results will provide a framework from which to consider how genetic drift in the SARS-2 can affect the spread of the disease and how containment by vaccination can be affected by the selection of conformationally varying mutants.
In certain aspects the invention provides methods to define, in atomic detail, the transition between the down and up states of the SARS-2 S protein spike. While the HIV-1 Env utilizes a complex network of allosteric machinery to signal receptor binding, the CoV S protein appears to use a kinetic strategy toward receptor recognition and triggering (
Approach and Methods:
An in-house developed projection method calculating the pairwise relative angles between the NTD, NTD′, RBD, subdomains 1 and 2, the S2-sheet motif, and the CD of each protomer will used in the adaptive scheme.
Converged Markov model transition intermediates will be used as “bait” to isolate minor populations of intermediates by heterogenous classification of cryo-EM data
These MD based particle sets will be unbiased via independent ab initio map reconstruction and subsequent high-resolution refinement for comparison against the MD state.
The equilibrium distribution of states determined by cryo-EM will be compared to the MD predicted equilibrium. Upon validation, we will analyze the MD transition kinetics, thermodynamics, and path(s).
All atom simulations will be carried out using HTMD24 and Amberz1825 for the adaptive sampling protocol. The Amber ff14SB26 and Glycam27 forcefields using a truncated octahedral TIP3P28 water box and a time-step of 4 fs using hydrogen mass repartitioning29 in the NVT ensemble will be used throughout for production runs. Simulations will be lengthened and the number of iterations increased if model validation demonstrate a need. Markov models will be built using the PyEMMA22 software package. Markov model convergence will be monitored based upon linearity in the implied timescale plots and the Chapman-Kolmogorov test and uncertainty will be determined via bootstrapping of the simulated data.
As blind sampling of the adaptive states can lead to significant simulation time spent in irrelevant states, we will use the FAST′ algorithm to focus sampling in the direction of the known open, ‘up’ state. Due to the size and complexity of the CoV structure, a divide and conquer approach toward simulating the opening process can be necessary. We will split the approach into several distinct modelling steps via a combination of proven approaches31-33 as needed. The coordinate projection method and the Markov model lag time must be optimized as well. We will test multiple projection methods and compare using the so-called VAMP scoring criteria34. Finally, if inconsistencies between the simulated and experimental results arise our path forward will involve a sequential shift toward relying upon the experimental data to drive the description of the transition. Even if the model rates and equilibrium values disagree with the cryo-EM data we will still be able to discern possible paths and identify mutations that affect the distribution. Determination of transition kinetics via ACE2 binding, thermal melting temperatures, and cryo-EM state distributions can instead be used to define the transition while still retaining the utility of the MD approach.
These studies can provide key details important for understanding the transition from the prefusion, closed to the post fusion open structures of the SARS-2 fusion protein. This will include a detailed description of metastable intermediate states, transition states, transition kinetics, and transition free energies. This mechanism will be supported by high-resolution structures. Together, this information will provide atomic details important for both drug and vaccine design as well as in the prediction of conformational evasion mutations in the evolving SARS-2 virus.
The SARS-2 S protein includes the receptor binding domain and is a target for neutralizing antibodies. We have designed recombinant DNA constructs that express SARS-2 coronavirus S protein (GenBank Accession number: YP_009724390.1, which is incorporated by reference) as the full-length, transmembrane S protein or a truncated version of the S protein that lacks the C-terminal transmembrane domain and cytoplasmic tail. The truncated S protein is secreted from expressing cells, whereas the full-length version of the plasmid is expressed on the cell surface. Additional SARS-2 S protein sequences from circulating viruses are found in the GISAID EpiFlu™ Database. These sequences can also be modified with any of the modifications described herein.
The S protein designs have several modifications from the wildtype reference sequence from GenBank. First, the SARS-2 protein sequence encodes furin cleavage sites and a cathepsin L cleavage site. The recombinant protein will be made with and without these protease cleavage sites to see if they affect protein quality, yield, and immunogenicity. Second, the natural signal peptide that directs intracellular trafficking of the S protein will be exchanged for the bovine prolactin signal peptide. The bovine prolactin signal peptide is a strong signal peptide that directs proteins into the secretory pathway. This signal peptide is predicted by the SignalP 5.0 program to be cleaved off of the mature S protein more efficiently than the natural virus signal peptide sequence. Third, the secreted S protein can trimerize in order to resemble the native, membrane-bound S protein on coronavirus virions. However, the truncated, secreted S protein lacks the transmembrane domain and thus may not form a stable trimeric protein. To facilitate trimerization, we added a trimerization domain to the C-terminus of some truncated S proteins. The trimerization domain can be a 29 amino acid sequence called foldon for T4 bacteriophage fibritin protein (Strelkov S V et al. Biochemistry. 1999; Frank S et al. J Mol Biol. 2001). Fourth, we have encoded de novo cysteines to the protein sequence to create new intramolecular and intermolecular disulfide bonds. The bonds prevent conformational changes within the S protein. Non-limiting examples are represented by Cluster modifications 1-11. Fifth, we have encoded two new prolines in between HR1 and the central helix in the S protein to stabilize the polypeptide turns in the S2 protein (Pallesen et al. PNAS. 2017). Sixth, we have added an AviTag to the truncated S proteins to facilitate functionalization by streptavidin binding.
For development as a vaccine immunogen, we have also created multimeric nanoparticles that display SARS-CoV-2 S protein on their surface. The rationale for creating such immunogens is that presenting multiple copies of the immunogen allows for a more avid interaction between the immunogen and naïve B cell receptors during the immune response. Thus, weak affinity interactions between the B cell receptor and immunogen are enhanced due to the multiple interactions that work in concert. This improved interaction with B cells can underlies the improved uptake of multimeric immunogens by B cells. The internalized immunogen is then presented to T cells in the context of MHC molecules. The T cells in turn provide the required costimulatory signals to the B cells to promote B cell maturation. Additionally, the SARS-CoV-2 S protein has 22 glycosylation sites, which can interact with lectins to facilitate trafficking to secondary lymphoid organs. Multimerization of viral spike glycoproteins improves their interaction with mannose binding lectin, thereby increasing antigen trafficking to sites with abundant immune cells.
The nanoparticle immunogens are composed of various fragments of SARS-CoV-2 S protein and self-assembling ferritin protein derived from Helicobacter pylori. Each nanoparticle displays 24 copies of the S protein on its surface. The S protein is displayed as a soluble spike trimer that has the transmembrane domain and cytoplasmic tail removed and a foldon trimerization domain added. To focus antibodies to neutralizing targets, the S protein will be truncated down to only the receptor binding domain (RBD), which is a known target for neutralizing antibodies. This construct has the potential to generate neutralizing antibodies, while not eliciting binding antibodies to other sites that mediate antibody-dependent enhancement of virus infectivity (Wang et al. Biochem Biophys Res Commun. 2014 Aug. 22; 451(2):208-14; Jaume et al. J Virol. 2011 October; 85(20):10582-97).
Nucleic Acid Sequences
In certain aspects, the invention provides nucleic acids comprising sequences encoding proteins of the invention. In certain embodiments, the nucleic acids are DNAs. In certain embodiments, the nucleic acids are mRNAs. In certain aspects, the invention provides expression vectors comprising the nucleic acids of the invention.
In certain aspects, the invention provides a pharmaceutical composition comprising mRNAs encoding the inventive antibodies. In certain embodiments, these are optionally formulated in lipid nanoparticles (LNPs) or liposomes. In certain embodiments, the mRNAs are modified. Modifications include without limitations modified ribonucleotides, poly-A tail, and/or 5′ cap.
In certain aspects the invention provides nucleic acids encoding the inventive protein designs. In non-limiting embodiments, the nucleic acids are mRNA, modified or unmodified, suitable for use any use, e.g but not limited to use as pharmaceutical compositions. In certain embodiments, the nucleic acids are formulated in lipid, such as but not limited to LNPs or liposomes.
In some embodiments the antibodies are administered as nucleic acids, including but not limited to mRNAs which can be modified and/or unmodified. See US Pub 20180028645A1, US Pub 20170369532, US Pub 20090286852, US Pub 20130111615, US Pub 20130197068, US Pub 20130261172, US Pub 20150038558, US Pub 20160032316, US Pub 20170043037, US Pub 20170327842, US Pub 20180344838A1 at least at paragraphs [0260]-[0281] for non-limiting embodiments of chemical modifications, wherein the content of each is incorporated by reference in its entirety.
mRNAs delivered in LNP formulations have advantages over non-LNPs formulations. See US Pub 20180028645A1.
In certain embodiments the nucleic acid encoding a protein is operably linked to a promoter inserted an expression vector. In certain aspects the compositions comprise a suitable carrier. In certain aspects the compositions comprise a suitable adjuvant.
In certain aspects the invention provides an expression vector comprising any of the nucleic acid sequences of the invention, wherein the nucleic acid is operably linked to a promoter. In certain aspects the invention provides an expression vector comprising a nucleic acid sequence encoding any of the polypeptides of the invention, wherein the nucleic acid is operably linked to a promoter. In certain embodiments, the nucleic acids are codon optimized for expression in a mammalian cell, in vivo or in vitro. In certain aspects the invention provides nucleic acids comprising any one of the nucleic acid sequences of invention. In certain aspects the invention provides nucleic acids consisting essentially of any one of the nucleic acid sequences of invention. In certain aspects the invention provides nucleic acids consisting of any one of the nucleic acid sequences of invention. In certain embodiments the nucleic acid of the invention, is operably linked to a promoter and is inserted in an expression vector. In certain aspects the invention provides an immunogenic composition comprising the expression vector.
In certain aspects the invention provides a composition comprising at least one of the nucleic acid sequences of the invention. In certain aspects the invention provides a composition comprising any one of the nucleic acid sequences of invention. In certain aspects the invention provides a composition comprising at least one nucleic acid sequence encoding any one of the polypeptides of the invention.
In one embodiment, the nucleic acid is an RNA molecule. In one embodiment, the RNA molecule is transcribed from a DNA sequence described herein. In some embodiments, the RNA molecule is encoded by one of the inventive sequences. In another embodiment, the nucleotide sequence comprises an RNA sequence transcribed by a DNA sequence encoding the polypeptide sequences described herein, or a variant thereof or a fragment thereof. Accordingly, in one embodiment, the invention provides an RNA molecule encoding one or more of inventive antibodies. The RNA can be plus-stranded. Accordingly, in some embodiments, the RNA molecule can be translated by cells without needing any intervening replication steps such as reverse transcription.
In some embodiments, an RNA molecule of the invention can have a 5′ cap (e.g. but not limited to a 7-methylguanosine, 7mG(5′)ppp(5′)NlmpNp). This cap can enhance in vivo translation of the RNA. The 5′ nucleotide of an RNA molecule useful with the invention can have a 5′ triphosphate group. In a capped RNA this can be linked to a 7-methylguanosine via a 5′-to-5′ bridge. An RNA molecule may have a 3′ poly-A tail. It can also include a poly-A polymerase recognition sequence (e.g. AAUAAA) near its 3′ end. In some embodiments, a RNA molecule useful with the invention can be single-stranded. In some embodiments, a RNA molecule useful with the invention can comprise synthetic RNA.
The recombinant nucleic acid sequence can be an optimized nucleic acid sequence. Such optimization can increase or alter the immunogenicity of the protein. Optimization can also improve transcription and/or translation. Optimization can include one or more of the following: low GC content leader sequence to increase transcription; mRNA stability and codon optimization; addition of a kozak sequence (e.g., GCC ACC) for increased translation; addition of an immunoglobulin (Ig) leader sequence encoding a signal peptide; and eliminating to the extent possible cis-acting sequence motifs (i.e., internal TATA boxes).
Methods for in vitro transfection of mRNA and detection of protein expression are known in the art.
Methods for expression and immunogenicity determination of nucleic acid encoded proteins are known in the art.
A non-limiting embodiment of a neutralization assay is described in Zhao, G., Du, L., Ma, C. et al. A safe and convenient pseudovirus-based inhibition assay to detect neutralizing antibodies and screen for viral entry inhibitors against the new human coronavirus MERS-CoV. Virol J 10, 266 (2013). doi.org/10.1186/1743-422X-10-266, which content is incorporated by reference in its entirety. This assay can be adapted for use for SARS CoV-2.
Non-limiting embodiments of determining antibody responses are described in the following publication: “SARS-CoV-2 specific antibody responses in COVID-19 patients” Okba et al. doi.org/10.1101/2020.03.18.20038059. See also US Patent Publication 20200061185 which is incorporated by reference in its entirety.
Non-limiting embodiments of various assays, reagents, and technologies for evaluating the immunogens of the invention are described in Muthumani et al. Science Translational Medicine 19 Aug. 2015: Vol. 7, Issue 301, pp. 301ra132, DOI: 10.1126/scitranslmed.aac7462. The assays, reagents, and techniques can be adapted for use for SARS CoV-2.
Recombinant protein production of coronavirus proteins is known. See e.g. in US Patent Pub 20200061185 which disclosure is incorporated by reference in its entirety.
In some embodiments the SARS-2 S proteins of the invention are in a trimeric configuration. In some embodiments the SARS-2 S proteins of the invention are expressed as protomers which form trimers. These designs can comprise any suitable trimerization domain.
Non-limiting examples of exogenous multimerization domains that promote stable trimers of soluble recombinant proteins include: the GCN4 leucine zipper (Harbury et al. 1993 Science 262:1401-1407), the trimerization motif from the lung surfactant protein (Hoppe et al. 1994 FEBS Lett 344:191-195), collagen (McAlinden et al. 2003 J Biol Chem 278:42200-42207), and the phage T4 fibritin Foldon (Miroshnikov et al. 1998 Protein Eng 11:329-414), any of which can be linked to a recombinant coronavirus (e.g. SARS-2) S protein ectodomain described herein (e.g., by linkage to the C-terminus of S2) to promote trimerization of the recombinant coronavirus (e.g. SARS-2) S protein ectodomain.
In some examples, the C-terminus of the S2 subunit of the SARS-2 S protein ectodomain can be linked to a T4 fibritin Foldon domain. In specific examples, the T4 fibritin Foldon domain can include the amino acid sequence GYIPEAPRDGQAYVRKDGEWVLLSTF (SEQ ID NO: 1), which adopts a .beta.-propeller conformation, and can fold and trimerize in an autonomous way (Tao et al. 1997 Structure 5:789-798). Optionally, the heterologous trimerization is connected to the recombinant coronavirus (e.g. SARS-2) S protein ectodomain via a peptide linker, such as an amino acid linker. Non-limiting examples of peptide linkers that can be used include glycine, serine, and glycine-serine linkers.
In some embodiments, the SARS-2 spike protein ectodomain trimer can be membrane anchored, for example, for embodiments where the coronavirus (e.g. SARS-2) S protein ectodomain trimer is expressed on an attenuated viral vaccine, or a virus like particle. In such embodiments, the protomers in the trimer can each comprise a C-terminal linkage to a transmembrane domain, such as the transmembrane domain (and optionally the cytosolic tail) of the corresponding coronavirus. For example, the protomers of a disclosed SARS-2 S protein ectodomain trimer can be linked to a SARS-2 S protein transmembrane and cytosolic tail. In some embodiments, one or more peptide linkers (such as a gly-ser linker, for example, a 10 amino acid glycine-serine peptide linker can be used to link the recombinant SARS-2 S protein ectodomain protomer to the transmembrane domain.
The protomers linked to the transmembrane domain can include any of the modifications provided herein (or combinations thereof) as long as the recombinant coronavirus (e.g. SARS-2) S protein ectodomain trimer formed from the protomers linked to the transmembrane domain retains certain properties (e.g., the coronavirus S protein prefusion conformation).
The inventive protein or fragments thereof can be produced using recombinant techniques, or chemically or enzymatically synthesized.
In some embodiments a protein nanoparticle is provided that includes one or more of the disclosed recombinant SARS-2 S proteins, including but not limited to SARS-2 S protein trimers. Non-limiting example of nanoparticles include ferritin nanoparticles, encapsulin nanoparticles, Sulfur Oxygenase Reductase (SOR) nanoparticles, and lumazine synthase nanoparticles, which are comprised of an assembly of monomeric subunits including ferritin proteins, encapsulin proteins, SOR proteins, and lumazine synthase, respectively. Additional protein nanoparticle structures are described by Heinze et al., J Phys Chem B., 120(26):5945-52, 2016; Hsia et al., Nature, 535(7610):136-9, 2016; and King et al., Nature, 510(7503):103-8, 2014; each of which is incorporated by reference herein. To construct such protein nanoparticles a protomer of the SARS-2 S protein ectodomain trimer can be linked to a subunit of the protein nanoparticle (such as a ferritin protein, an encapsulin protein, a SOR protein, or a lumazine synthase protein) and expressed in cells under appropriate conditions. The fusion protein self-assembles into a nanoparticle and can be purified.
In some embodiments, a protomer of a disclosed recombinant SARS-2 S protein ectodomain trimer can be linked to a ferritin subunit to construct a ferritin nanoparticle. Ferritin nanoparticles and their use for immunization purposes (e.g., for immunization against influenza antigens) have been disclosed in the art (see, e.g., Kanekiyo et al., Nature, 499:102-106, 2013, incorporated by reference herein in its entirety). Ferritin is a globular protein that is found in all animals, bacteria, and plants, and which acts primarily to control the rate and location of polynuclear Fe(III)2O3 formation through the transportation of hydrated iron ions and protons to and from a mineralized core. The globular form of the ferritin nanoparticle is made up of monomeric subunits, which are polypeptides having a molecule weight of approximately 17-20 kDa. In certain embodiments, the modified coronavirus spike protein or the portion thereof is linked to form a protein multimerizing/nanoparticle subunit by a peptide linker in a sortase reaction, or is directly linked to the protein multimerizing/nanoparticle subunit. In certain embodiments, the protein nanoparticle subunit is a ferritin nanoparticle subunit.
In non-limiting embodiments the multimeric complexes comprising a ferritin sequence are designed and are assembled via sortase reaction. In non-limiting embodiments the multimeric complexes comprise encapsulin.
Following production, these monomeric subunit proteins self-assemble into the globular ferritin protein. Thus, the globular form of ferritin comprises 24 monomeric, subunit proteins, and has a capsid-like structure having 432 symmetry. Methods of constructing ferritin nanoparticles are known to the person of ordinary skill in the art and are further described herein (see, e.g., Zhang, Int. J. Mol. Sci., 12:5406-5421, 2011, which is incorporated herein by reference in its entirety).
In non-specific examples, the ferritin polypeptide is E. coli ferritin, Helicobacter pylori ferritin, human light chain ferritin, bullfrog ferritin or a hybrid thereof, such as E. coli-human hybrid ferritin, E. coli-bullfrog hybrid ferritin, or human-bullfrog hybrid ferritin. Exemplary amino acid sequences of ferritin polypeptides and nucleic acid sequences encoding ferritin polypeptides for use to make a ferritin nanoparticle including a recombinant SARS-2 S protein can be found in GENBANK, for example at accession numbers ZP_03085328, ZP_06990637, EJB64322.1, AAA35832, NP 000137 AAA49532, AAA49525, AAA49524 and AAA49523, which are specifically incorporated by reference herein in their entirety. In some embodiments, a recombinant protein of the invention can be linked to a ferritin subunit to form a nanoparticle.
Polynucleotides encoding a protomer of any of the disclosed recombinant proteins are also provided. These polynucleotides include DNA, cDNA and RNA sequences which encode the protomer, as well as vectors including the DNA, cDNA and RNA sequences, such as a DNA or RNA vector used for immunization. The genetic code to construct a variety of functionally equivalent nucleic acids, such as nucleic acids which differ in sequence but which encode the same protein sequence, or encode a conjugate or fusion protein including the nucleic acid sequence.
Another approach to multimerize expression constructs uses staphylococcus Sortase A transpeptidase ligation to conjugate inventive spike ectodomain trimers or spike subunits, for e.g. but not limited to cholesterol or self multimerizing protein. The trimers can be embedded into liposomes via the conjugated cholesterol.
To conjugate the trimer a C-terminal LPXTG tag (SEQ ID NO: 2) or a N-terminal pentaglycine repeat tag is added to the spike trimer gene, where X signifies any amino acid, such as Ala, Ser, Glu. Cholesterol is also synthesized with these two tags. Sortase A is then used to covalently bond the tagged spike subunit to the cholesterol. The sortase A-tagged spike trimer protein or portion thereof can also be used to conjugate the trimer to other peptides, proteins, or fluorescent labels. In non-limiting embodiments, the sortase A tagged trimers or spike portions are conjugated to ferritin to form nanoparticles.
In several embodiments, the nucleic acid molecule encodes a precursor of the protomer, that, when expressed in an appropriate cell, is processed into a recombinant SARS-2 S protein protomer that can self-assemble into the corresponding recombinant trimer. For example, the nucleic acid molecule can encode a recombinant SARS-2 S protein ectodomain including a N-terminal signal sequence for entry into the cellular secretory system that is proteolytically cleaved in the during processing of the recombinant protein in the cell. Recombinant proteins with different signal peptide sequences are embodied by the invention.
In certain embodiments, amino acid sequences of the invention described herein comprise a signal peptide. A skilled artisan can readily determine the signal peptide sequences. Signal peptide sequences can be removed during recombinant production of proteins. In non-limiting embodiments, provided are amino acid sequences of recombinant proteins which do not include amino acids of comprising a signal peptide.
In some embodiments, the nucleic acid molecule encodes a precursor SARS-2 S polypeptide that, when expressed in an appropriate cell, is processed into a recombinant SARS-2 S protomer including S1 and S2 polypeptides, wherein the recombinant protein includes any of the appropriate modifications described herein, and optionally can be linked to a trimerization domain, such as a T4 Fibritin trimerization domain.
Exemplary nucleic acids can be prepared by molecular and cloning techniques. A wide variety of cloning methods, host cells, and in vitro amplification methodologies are well known to persons of skill, and can be used to make the nucleic acids and proteins of the invention.
The polynucleotides encoding a disclosed recombinant protomer can include a recombinant DNA which is incorporated into a vector (such as an expression vector) into an autonomously replicating plasmid or virus or into the genomic DNA of a prokaryote or eukaryote, or which exists as a separate molecule (such as a cDNA) independent of other sequences. The nucleotides can be ribonucleotides, deoxyribonucleotides, or modified forms of either nucleotide. The term includes single and double forms of DNA.
Polynucleotide sequences encoding a disclosed recombinant protomer can be operatively linked to expression control sequences. An expression control sequence operatively linked to a coding sequence is ligated such that expression of the coding sequence is achieved under conditions compatible with the expression control sequences. The expression control sequences include, but are not limited to, appropriate promoters, enhancers, transcription terminators, a start codon (i.e., ATG) in front of a protein-encoding gene, splicing signal for introns, maintenance of the correct reading frame of that gene to permit proper translation of mRNA, and stop codons.
DNA sequences encoding the disclosed recombinant protomer can be expressed in vitro by DNA transfer into a suitable host cell. The cell can be prokaryotic or eukaryotic. The term also includes any progeny of the subject host cell. All progeny need not be identical to the parental cell since there can be mutations that occur during replication. Methods of stable transfer, meaning that the foreign DNA is continuously maintained in the host, are known in the art.
Host systems for recombinant production can include microbial, yeast, insect and mammalian organisms. Methods of expressing DNA sequences having eukaryotic or viral sequences in prokaryotes are well known in the art. Non-limiting examples of suitable host cells include bacteria, archea, insect, fungi (for example, yeast), plant, and animal cells (for example, mammalian cells, such as human). Exemplary cells of use include Escherichia coli, Bacillus subtilis, Saccharomyces cerevisiae, Salmonella typhimurium, SF9 cells, C129 cells, 293 cells, Neurospora, and immortalized mammalian myeloid and lymphoid cell lines. Techniques for the propagation of mammalian cells in culture are well-known (see, e.g., Helgason and Miller (Eds.), 2012, Basic Cell Culture Protocols (Methods in Molecular Biology), 4.sup.th Ed., Humana Press). Examples of mammalian host cell lines are VERO and HeLa cells, CHO cells, and WI38, BHK, and COS cell lines, although cell lines can be used, such as cells designed to provide higher expression, desirable glycosylation patterns, or other features. In some embodiments, the host cells include HEK293 cells or derivatives thereof, such as GnTI−/− cells, or HEK-293F cells.
In some embodiments, the disclosed recombinant coronavirus (e.g. SARS-2) S protein ectodomain protomer can be expressed in cells under conditions where the recombinant coronavirus (e.g. SARS-2) S protein ectodomain protomer can self-assemble into trimers which are secreted from the cells into the cell media. In such embodiments, each recombinant coronavirus (e.g. SARS-2) S protein ectodomain protomer contains a leader sequence (signal peptide) that causes the protein to enter the secretory system, where the signal peptide is cleaved and the protomers form a trimer, before being secreted in the cell media. The medium can be centrifuged and recombinant coronavirus (e.g. SARS-2) S protein ectodomain trimer can be purified from the supernatant.
A nucleic acid molecule encoding a protomer can be included in a viral vector, for example, for expression of the immunogen in a host cell, or for immunization of a subject as disclosed herein. In some embodiments, the viral vectors are administered to a subject as part of a prime-boost vaccination. In several embodiments, the viral vectors are included in a vaccine, such as a primer vaccine or a booster vaccine for use in a prime-boost vaccination.
In several examples, the viral vector can be replication-competent. For example, the viral vector can have a mutation in the viral genome that does not inhibit viral replication in host cells. The viral vector also can be conditionally replication-competent. In other examples, the viral vector is replication-deficient in host cells.
A number of viral vectors have been constructed, that can be used to express the disclosed antigens, including polyoma, i.e., SV40 (Madzak et al., 1992, J. Gen. Virol., 73:15331536), adenovirus (Berkner, 1992, Cur. Top. Microbiol. Immunol., 158:39-6; Berliner et al., 1988, Bio Techniques, 6:616-629; Gorziglia et al., 1992, J. Virol., 66:4407-4412; Quantin et al., 1992, Proc. Natl. Acad. Sci. USA, 89:2581-2584; Rosenfeld et al., 1992, Cell, 68:143-155; Wilkinson et al., 1992, Nucl. Acids Res., 20:2233-2239; Stratford-Perricaudet et al., 1990, Hum. Gene Ther., 1:241-256), vaccinia virus (Mackett et al., 1992, Biotechnology, 24:495-499), adeno-associated virus (Muzyczka, 1992, Curr. Top. Microbiol. Immunol., 158:91-123; On et al., 1990, Gene, 89:279-282), herpes viruses including HSV and EBV (Margolskee, 1992, Curr. Top. Microbiol. Immunol., 158:67-90; Johnson et al., 1992, J. Virol., 66:29522965; Fink et al., 1992, Hum. Gene Ther. 3:11-19; Breakfield et al., 1987, Mol. Neurobiol., 1:337-371; Fresse et al., 1990, Biochem. Pharmacol., 40:2189-2199), Sindbis viruses (H. Herweijer et al., 1995, Human Gene Therapy 6:1161-1167; U.S. Pat. Nos. 5,091,309 and 5,2217,879), alphaviruses (S. Schlesinger, 1993, Trends Biotechnol. 11:18-22; I. Frolov et al., 1996, Proc. Natl. Acad. Sci. USA 93:11371-11377) and retroviruses of avian (Brandyopadhyay et al., 1984, Mol. Cell Biol., 4:749-754; Petropouplos et al., 1992, J. Virol., 66:3391-3397), murine (Miller, 1992, Curr. Top. Microbiol. Immunol., 158:1-24; Miller et al., 1985, Mol. Cell Biol., 5:431-437; Sorge et al., 1984, Mol. Cell Biol., 4:1730-1737; Mann et al., 1985, J. Virol., 54:401-407), and human origin (Page et al., 1990, J. Virol., 64:5370-5276; Buchschalcher et al., 1992, J. Virol., 66:2731-2739). Baculovirus (Autographa californica multinuclear polyhedrosis virus; AcMNPV) vectors are also known in the art, and can be obtained from commercial sources (such as PharMingen, San Diego, Calif.; Protein Sciences Corp., Meriden, Conn.; Stratagene, La Jolla, Calif).
In several embodiments, the viral vector can include an adenoviral vector that expresses a protomer of the invention. Adenovirus from various origins, subtypes, or mixture of subtypes can be used as the source of the viral genome for the adenoviral vector. Non-human adenovirus (e.g., simian, chimpanzee, gorilla, avian, canine, ovine, or bovine adenoviruses) can be used to generate the adenoviral vector. For example, a simian adenovirus can be used as the source of the viral genome of the adenoviral vector. A simian adenovirus can be of serotype 1, 3, 7, 11, 16, 18, 19, 20, 27, 33, 38, 39, 48, 49, 50, or any other simian adenoviral serotype. A simian adenovirus can be referred to by using any suitable abbreviation known in the art, such as, for example, SV, SAdV, SAV or sAV. In some examples, a simian adenoviral vector is a simian adenoviral vector of serotype 3, 7, 11, 16, 18, 19, 20, 27, 33, 38, or 39. In one example, a chimpanzee serotype C Ad3 vector is used (see, e.g., Peruzzi et al., Vaccine, 27:1293-1300, 2009). Human adenovirus can be used as the source of the viral genome for the adenoviral vector. Human adenovirus can be of various subgroups or serotypes. For instance, an adenovirus can be of subgroup A (e.g., serotypes 12, 18, and 31), subgroup B (e.g., serotypes 3, 7, 11, 14, 16, 21, 34, 35, and 50), subgroup C (e.g., serotypes 1, 2, 5, and 6), subgroup D (e.g., serotypes 8, 9, 10, 13, 15, 17, 19, 20, 22, 23, 24, 25, 26, 27, 28, 29, 30, 32, 33, 36-39, and 42-48), subgroup E (e.g., serotype 4), subgroup F (e.g., serotypes 40 and 41), an unclassified serogroup (e.g., serotypes 49 and 51), or any other adenoviral serotype. The person of ordinary skill in the art is familiar with replication competent and deficient adenoviral vectors (including singly and multiply replication deficient adenoviral vectors). Examples of replication-deficient adenoviral vectors, including multiply replication-deficient adenoviral vectors, are disclosed in U.S. Pat. Nos. 5,837,511; 5,851,806; 5,994,106; 6,127,175; 6,482,616; and 7,195,896, and International Patent Application Nos. WO 94/28152, WO 95/02697, WO 95/16772, WO 95/34671, WO 96/22378, WO 97/12986, WO 97/21826, and WO 03/02231 1.
In some embodiments, a virus-like particle (VLP) is provided that comprises a recombinant protomer of the invention. In some embodiments, a virus-like particle (VLP) is provided that includes a recombinant trimer of the invention. Such VLPs can include a recombinant coronavirus (e.g. SARS-2) S protein ectodomain trimer that is membrane anchored by a C-terminal transmembrane domain, for example the recombinant coronavirus (e.g. SARS-2) S protein ectodomain protomers in the trimer each can be linked to a transmembrane domain and cytosolic tail from the corresponding coronavirus. VLPs lack the viral components that are required for virus replication and thus represent a highly attenuated, replication-incompetent form of a virus. However, the VLP can display a polypeptide (e.g., a recombinant coronavirus (e.g. SARS-2) S protein ectodomain trimer) that is analogous to that expressed on infectious virus particles and can eliciting an immune response to the corresponding coronavirus (e.g. SARS-2) when administered to a subject. Virus like particles and methods of their production are known and familiar to the person of ordinary skill in the art, and viral proteins from several viruses are known to form VLPs, including human papillomavirus, HIV (Kang et al., Biol. Chem. 380: 353-64 (1999)), Semliki-Forest virus (Notka et al., Biol. Chem. 380: 341-52 (1999)), human polyomavirus (Goldmann et al., J. Virol. 73: 4465-9 (1999)), rotavirus (Jiang et al., Vaccine 17: 1005-13 (1999)), parvovirus (Casal, Biotechnology and Applied Biochemistry, Vol 29, Part 2, pp 141-150 (1999)), canine parvovirus (Hurtado et al., J. Virol. 70: 5422-9 (1996)), hepatitis E virus (Li et al., J. Virol. 71: 7207-13 (1997)), and Newcastle disease virus. The formation of such VLPs can be detected by any suitable technique. Examples of suitable techniques known in the art for detection of VLPs in a medium include, e.g., electron microscopy techniques, dynamic light scattering (DLS), selective chromatographic separation (e.g., ion exchange, hydrophobic interaction, and/or size exclusion chromatographic separation of the VLPs) and density gradient centrifugation.
The immunogens of the invention can be combined with any suitable adjuvant.
A skilled artisan can readily determine the dose and number of immunizations needed to induce immune response. Various assays are known and used in the art to measure to level, breadth and durability of the induced immune response. In non-limiting embodiments the methods comprise two immunizations. The interval between immunizations can be readily determined by a skilled artisan. In non-limiting embodiments, the first and second immunization are about 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 weeks apart.
In certain embodiments the protein dose is in the range of 1-1000 micrograms. In certain embodiments the protein dose is in the range of 10-1000 micrograms. In certain embodiments the protein dose is in the range of 100-1000 micrograms. In certain embodiments the protein dose is in the range of 100-200 micrograms. In certain embodiments the protein dose is in the range of 100-300 micrograms. In certain embodiments the protein dose is in the range of 100-400 micrograms. In certain embodiments the protein dose is in the range of 100-500 micrograms. In certain embodiments the protein dose is in the range of 100-600 micrograms. In certain embodiments the protein dose is in the range of 50-100 micrograms. In certain embodiments the protein dose is in the range of 50-150 micrograms. In certain embodiments the protein dose is in the range of 50-200 micrograms. In certain embodiments the protein dose is in the range of 50-250 micrograms. In certain embodiments the protein dose is in the range of 50-300 micrograms. In certain embodiments the protein dose is in the range of 50-350 micrograms. In certain embodiments the protein dose is in the range of 50-400 micrograms. In certain embodiments the protein dose is in the range of 50-450 micrograms. In certain embodiments the protein dose is in the range of 50-500 micrograms. In certain embodiments the protein dose is in the range of 50-550 micrograms. In certain embodiments the protein dose is in the range of 50-600 micrograms. In certain embodiments the protein dose is in the range of 75-100 micrograms. In certain embodiments the protein dose is in the range of 75-125 micrograms. In certain embodiments the protein dose is in the range of 75-150 micrograms. In certain embodiments the protein dose is in the range of 75-175 micrograms. In certain embodiments the protein dose is in the range of 75-200 micrograms. In certain embodiments the protein dose is in the range of 75-225 micrograms. In certain embodiments the protein dose is in the range of 75-250 micrograms. In certain embodiments the protein dose is 10, 25, 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 525, 550, 575, 600, 625, 650, 700, 750, 800, 850, 900, 950 or 1000 micrograms.
In certain embodiments adjuvant dose is in the range of 1-200 micrograms. In certain embodiments adjuvant dose is in the range of 1-100 micrograms. In certain embodiments the adjuvant dose is 1-50 micrograms. In certain embodiments the adjuvant dose is 1-25 micrograms. In certain embodiments the adjuvant dose is 1-50 micrograms. In certain embodiments the adjuvant dose is 1-20 micrograms. In certain embodiments the adjuvant dose is 1-50 micrograms. In certain embodiments the adjuvant dose is 1-15 micrograms. In certain embodiments the adjuvant dose is 1-50 micrograms. In certain embodiments the adjuvant dose is 1-10 micrograms. In certain embodiments the adjuvant dose is 1-5 micrograms. In certain embodiments the adjuvant dose is 5-10 micrograms. In certain embodiments the adjuvant dose is 5-15 micrograms. In certain embodiments the adjuvant dose is 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, or 45-50 micrograms. Non-limiting examples of evaluating the immunogenicity and effectiveness of the immunogens of the invention are shown in US Patent Pub 20200061185 which disclosure is incorporated by reference in its entirety.
Any of the SARS-2 designs, including without limitation as listed in
SARS-2 designs expressed as nucleic acids or proteins will be expressed, characterized and tested for antigenicity and immunogenicity. Immuonogenicity studies include animal challenge studies. A non-limiting embodiment of an animal study is outlined in Example 2.
Animal study NHP #174: non-human primates (NHPs) are immunized with SARS-2 immunogen designs of the invention. Immune response was evaluated and animals were challenge with SARS-2 stock. The animal study design and immunogen are summarized in
Data from the animal study are summarized in
These results show that immunization with the disulfide-stabilized spike ectodomain mRNA-LNP in rhesus macaques elicited IgG antibodies against the receptor binding, N-terminal, and S2 domains of SARS-CoV-2 spike protein. The serum from disulfide-stabilized spike ectodomain mRNA-LNP-immunized macaques blocked ACE2 binding to the receptor domain of SARS-CoV-2 spike protein. Consistent with blocking the ACE2 receptor binding to SARS-CoV-2 spike, the serum neutralized both pseudotyped virus and replication-competent SARS-CoV-2. The vaccine-induced immunity suppressed SARS-CoV-2 replication in the lower respiratory tract and to a lesser extent in the upper respiratory tract. Additionally, inflammatory cytokine production in the lung was decreased in disulfide-stabilized spike ectodomain mRNA-LNP-immunized compared to macaques that received mRNA-LNP encoding an irrelevant protein. Thus, immunization with disulfide-stabilized spike ectodomain mRNA-LNP-immunized generated immunity that protected against SARS-CoV-2 infection.
Further analyses of the animal study include immunogenicity, levels of antibodies, types of antibodies—neutralizing or not, serum neutralization of pseudo-virus, diversity of epitopes targeted by the induced antibodies, protection after challenge with virus, and any other suitable assay.
Controlling the SARS-CoV-2 Spike Glycoprotein Conformation
Abstract
The coronavirus (CoV) viral host cell fusion spike (S) protein is the primary immunogenic target for virus neutralization and the current focus of many vaccine design efforts. The highly flexible S-protein, with its mobile domains, presents a moving target to the immune system. Here, to better understand S-protein mobility, we implemented a structure-based vector analysis of available β-CoV S-protein structures. We found that despite overall similarity in domain organization, different β-CoV strains display distinct S-protein configurations. Based on this analysis, we developed two soluble ectodomain constructs in which the highly immunogenic and mobile receptor binding domain (RBD) is locked in the all-RBDs ‘down’ position or is induced to display a previously unobserved in SARS-CoV-2 2-RBDs ‘up’ configuration. These results demonstrate that the conformation of the S-protein can be controlled via rational design and provide a framework for the development of engineered coronavirus spike proteins for vaccine applications.
The ongoing global pandemic of the new SARS-CoV-2 (SARS-2) coronavirus presents an urgent need for the development of effective preventative and treatment therapies. The viral S-protein is a prime target for such therapies owing to its critical role in the virus lifecycle. The S-protein is divided into two regions: an N-terminal S1 domain that caps the C-terminal S2 fusion domain. Binding to host receptor via the Receptor Binding Domain (RBD) in S1 is followed by proteolytic cleavage of the spike by host proteases1. Large conformational changes in the S-protein result in S1 shedding and exposure of the fusion machinery in S2. Class I fusion proteins, such as the CoV-2 S-protein, undergo large conformational changes during the fusion process and must, by necessity, be highly flexible and dynamic. Indeed, cryo-electron microscopy (cryo-EM) structures of SARS-2 spike reveal considerable flexibility and dynamics in the S1 domain1,2, especially around the RBD that exhibits two discrete conformational states—a ‘down’ state that is shielded from receptor binding, and an ‘up’ state that is receptor-accessible.
The wealth of structural information for β-CoV spike proteins, including the recently determined cryo-EM structures of the SARS-2 spike1-11, has provided a rich source of detailed geometric information from which to begin precise examination of the macromolecular transitions underlying triggering of this fusion machine. The transmembrane CoV S-protein spike trimer is composed of interwoven protomers that include an N-terminal receptor binding S1 domain and a C-terminal S2 domain that contains the fusion elements (
In this study we have quantified the variability in the S1 and S2 geometric arrangements to reveal important regions of flexibility to consider and to target for structure-based immunogen design. Based on these analyses, we have designed mutations that alter the conformational distribution of the domains in the S-protein. We visualized the effect of our designs using a structural determination pipeline relying first on single particle analysis by negative stain electron microscopy (NSEM) for rapid and low-cost assessment of the spike ectodomains at low resolution, followed by cryo-EM for high-resolution information on the changes introduced by these mutations. Our results reveal a heterogeneous conformational landscape of the SARS-CoV-2 spike that is highly susceptible to modification by the introduction of mutations at sites of contact between the S1 and S2 domains. We also present data on modified SARS-2 ectodomain constructs stabilized in conformations that have not yet been seen in the current available structures, with great interest and direct application in vaccine design.
Results
Detailed Structural Schema Defining the Geometry and Internal Rearrangements of Movable Domains of the SARS-2 Spike.
To characterize the unique arrangement of distinct domains in the CoV spike, we first aimed to develop a precise quantitative definition of their relative positions. Examination of available SARS and MERS S-protein structures revealed: 1) the NTD and RBD subdomains and internal S2 domain move as rigid bodies, and 2) these domains display a remarkable array of relative shifts between the domains in the S1 region and the S2 region's β-sheet motif and connector domain (CD) (
Identification of Sites for Differential Stabilization of the SARS-2 Ectodomain Spike RBD Orientation.
Based on the observed variability in the geometric analysis of β-CoV spikes, we asked whether the propensity for the RBD to display the ‘down’ and ‘up’ states can be modified via mutations without altering exposed antigenic surfaces. To this end, we identified protomer to protomer interactive sites amenable to modification and down selected mutations at these sites using the Schrödinger Biologics suite. In an effort to eliminate exposure of the receptor binding site of the RBD, we examined the potential for disulfide linkages between the RBD and its contact with S2 near the C-terminus of HR1 to prevent RBD exposure. We identified a double cysteine mutant, S383C and D985C (RBD to S2 double mutant; rS2d;
NSEM Analysis of the SARS-2 Spike Ectodomain Proteins.
To assess the quality of the purified spike proteins and to obtain low resolution visualization of the structures, we performed NSEM analysis. The micrographs showed a reasonably uniform distribution of particles consistent with the size and shape of the SARS-2 spike ectodomain (
Cryo-EM Analysis of the SARS-2 Spike Ectodomain Proteins.
To visualize the mutations and their effect on the structure of the spike, we collected cryo-EM datasets for the rS2d and u1S2q constructs (
In contrast to the rS2d design, the u1S2q design displayed widespread rearrangement of the S1 domains (
#
$Resolutions are reported according to the FSC 0.143 gold-standard criterion
Conformational plasticity is a hallmark of enveloped-virus fusion-protein structure, owing to the necessity of protecting the conserved viral fusion elements from host immune responses while retaining a sufficiently steep free-energy gradient to enable host cell fusion16. Exposed elements can be well conditioned to be permissive and responsive to mutations through genetic drift and host immune adaptation. Conformational plasticity, however, presents an important difficulty in the context of vaccine and drug design. Indeed, lessons learned in the continued effort to produce a broadly protective HIV-1 vaccine have demonstrated the importance of a detailed understanding and control of fusion protein dynamics17-28. The new SARS-CoV-2 is no exception in this regard and indeed the conformational plasticity of the SARS-2 S-protein appeared greater than that of the HIV-1 Env. We aimed to develop a quantitative understanding of β-CoV structural states between strains and within each RBD down and up state configuration. The wide breadth of domain arrangements along with the relatively small contact area between the S1 and S2 subunits observed here indicated that, despite a relatively low mutation rate, dramatic changes in S-protein structure can occur from few mutations. Indeed, recent evidence for a mutation in the SD2 to S2 contact region indicates a fitness gain for acquisition of such interfacial residues29. Based upon our results, this mutant, D614G, can indeed alter the conformational landscape of the SARS-CoV-2 S-protein.
From the perspective of immunogen development, the constructs developed here present an opportunity to examine the ability of differentially stabilized S-protein particles to induce two different, yet important antibody responses. First, without wishing to be bound by theory, the disulfide linked ‘down’ state locked double mutant (rS2d) can eliminate receptor binding site targeting antibodies which make up the majority of observed responses30,31. Indeed, a study of MERS responses indicate non-RBD responses (such as NTD and S2 epitopes) will play an important role in vaccine induced protection32. From a theoretical perspective, the wide control over the RBD ‘up’/‘down’ distribution available to the virus indicates that, by analogy to known difficult to neutralize HIV-1 strains, conformational blocking of antibody responses is not be unusual. Although this can result in a fitness cost to the virus, it does not necessarily make the virion non-infectious. Using the double mutant rS2d as an immunogen provides a platform from which to induce such non-RBD responses that can be needed to protect against such an evasion. The second area of interest comprises cryptic pocket targeting antibodies which have proven effective in the neutralization of SARS. These antibodies target an epitope presented only in the ‘up’ state RBDs and appear to require a two RBD ‘up’ configuration33. The current stabilized ectodomain construct in wide use in SARS-CoV-2 clinical trials was demonstrated previously, and recapitulated here by NSEM, to display only the ‘down’ and one RBD ‘up’ states. However, the u1S2q, SD1/S2 targeting design developed here display a prominent two RBD ‘up’ state distribution compatible with these cryptic-epitope targeting MAbs. This indicates it can induce such antibodies. While complicating factors, such as vaccine enhancement, can favor the use of truncated, single domain constructs which can display fewer weakly or non-neutralizing epitopes, these, along with the designs presented here will allow for a detailed characterization of not only vaccine immunogenicity but also antigenicity, paving the way for next generation vaccines for the new SARS-CoV-2 and the development of a broadly neutralizing β-CoV vaccine. Thus, while the previous generation of stabilizing mutations ensure well folded trimer, the rational design approach developed here provides a means by which precisely controlling the RBD orientation distribution, thus allowing exploratory efforts to understand the role of conformational dynamics from the perspective of vaccine and drug development.
Methods
Vector Based Analysis
Vector analysis was performed using available cryo-EM structures for SARS-213,14, SARS4,5,7,8, MERS4,12, and other human2,10 and murine11 β-CoV spike proteins. Domains for the vector analysis were selected based upon visual inspection of alignments between SARS, MERS, and SARS-CoV-2 structures. Specifically, Cα centroids for the S1 NTD, RBD, SD1, SD2 (SARS-CoV-2 residues, 27-43 and 54-271, 330-443 and 503-528, 323-329 and 529-590, 294-322 and 591-696, respectively; equivalent SARS/MERS/Murine/HKU1/OC43 residues selected based upon structural alignment with SARS-CoV-2) as well as a β-sheet motif in the NTD (residues 116-129 and 169-172) and a helix motif in the RBD (residues 403-410) were determined. The NTD was split into two regions with the SD1 contacting, SD2 adjacent portion referred to here as the NTD′ (residues 44-53 and 272-293). Cα centroids in the S2 domain were obtained for a β-sheet motif (residues 717-727 and 1047-1071) and the CD domain (711-716 and 1072-1122). Vector magnitudes, angles, and dihedrals between these centroids were determined and used in the subsequent analysis. Vector analysis was performed using the VMD34 Tcl interface. Principal component analysis performed in R with the vector data centered and scaled35.
Rational, Structure-Based Design
Structures for SARS (PDB ID 5X584), MERS (PDB ID 6Q0436), and SARS-CoV-2 (PDB ID 6VXX15) were prepared in Maestro37 using the protein preparation wizard38 followed by in silico mutagenesis using Schrödinger's cysteine mutation39 and residue scanning40 tools. Residue scanning was first performed for individual selected sites allowing mutations to Leu, Ile, Trp, Tyr, and Val followed by scanning of combinations for those which yielded a negative overall score. Scores and visual inspection were used in the selection of the prepared constructs.
Protein Expression and Purification
The SARS-CoV-2 ectodomain constructs were produced and purified as described previously. Briefly, a gene encoding residues 1-1208 of the SARS-CoV-2 S (GenBank: MN908947) with proline substitutions at residues 986 and 987, a “GSAS” (SEQ ID NO: 3) substitution at the furin cleavage site (residues 682-685), a C-terminal T4 fibritin trimerization motif, an HRV3C protease cleavage site, a TwinStrepTag and an 8XHisTag (SEQ ID NO: 4) was synthesized and cloned into the mammalian expression vector pαH. All mutants were introduced in this background. expression plasmids encoding the ectodomain sequence were used to transiently transfect FreeStyle293F cells using Turbo293 (SpeedBiosystems). Protein was purified on the sixth day post transfection from the filtered supernatant using StrepTactin resin (IBA).
Cryo-EM Sample Preparation and Data Collection
Purified SARS-CoV-2 spike preparations were diluted to a concentration of ˜1 mg/mL in 2 mM Tris pH 8.0, 200 mM NaCl and 0.02% NaN3. 2.5 uL of protein was deposited on a CF-1.2/1.3 grid that had been glow discharged for 30 seconds in a PELCO easiGlow™ Glow Discharge Cleaning System. After a 30 s incubation in >95% humidity, excess protein was blotted away for 2.5 seconds before being plunge frozen into liquid ethane using a Leica EM GP2 plunge freezer (Leica Microsystems). Frozen grids were imaged in a Titan Krios (Thermo Fisher) equipped with a K3 detector (Gatan). Data were acquired using the Leginon system41. The dose was fractionated over 50 raw frames and collected at 50 ms framerate. This dataset was energy-filtered with a slit width of 30 eV. Individual frames were aligned and dose-weighted. CTF estimation, particle picking, 2D classifications, ab initio model generation, heterogeneous refinements, and homogeneous 3D refinements were carried out in cryoSPARC42.
Cryo-EM Structure Fitting
Structures of the all ‘down’ state (PDB ID 6VXX) and single RBD ‘up’ state (PDB ID 6VYB) from the previously published SARS-CoV-2 ectodomain were used to fit the cryo-EM maps in Chimera43. The 2 RBD ‘up’ state was generated in PyMol using the single RBD ‘up’ state structure. Mutations were made in PyMol44. Coordinates were then fit manually in Coot45 following iterative refinement using Phenix46 real space refinement and subsequent manual coordinate fitting in Coot. Structure and map analysis was performed using PyMol and Chimera.
Without being bound by theory, the results in Example 2 indicate that the rs2d can require further modifications.
Rs2d designs comprising hexapro mutations are evaluated and discussed in Edwards et al. Nature Structural & Molecular Biology volume 28, pages128-131(2021).
Any of the SARS-2 designs will be expressed as nucleic acids or proteins will be expressed, characterized and tested for antigenicity and immunogenicity. Immuonogenicity studies include animal challenge studies.
Glycans on the SARS-CoV-2 Spike Control the Receptor Binding Domain
Conformation
Abstract
The glycan shield of the beta-coronavirus (β-CoV) Spike (S) glycoprotein provides protection from host immune responses, acting as a steric block to potentially neutralizing antibody responses. The conformationally dynamic S-protein is the primary immunogenic target of vaccine design owing to its role in host-cell fusion, displaying multiple receptor binding domain (RBD) ‘up’ and ‘down’ state configurations. Here, we investigated the potential for RBD adjacent, N-terminal domain (NTD) glycans to influence the conformational equilibrium of these RBD states. Using a combination of antigenic screens and high-resolution cryo-EM structure determination, we show that an N-glycan deletion at position 234 results in a dramatically reduced population of the ‘up’ state RBD position. Conversely, glycan deletion at position N165 results in a discernable increase in ‘up’ state RBDs. This indicates the glycan shield acts not only as a passive hinderance to antibody meditated immunity but also as a conformational control element. Together, our results demonstrate this highly dynamic conformational machine is responsive to glycan modification with implications in viral escape and vaccine design.
Introduction
The ongoing SARS-CoV-2 (SARS-2) pandemic presents an urgent need for the development of a protective vaccine. The primary immunogenic target for the vaccines in development is the viral transmembrane S-protein trimer. Each protomer of the trimer is split into an N-terminal receptor binding S1 subunit and a C-terminal fusion element containing S2 subunit, demarcated by the presence of a host protease cleavage site. The S1 subunit is further split into an N-terminal domain (NTD), two subdomains (SD1 and SD2) as well as the receptor binding domain (RBD) that together cap the conserved elements of the S2 subunit. The fusion event is marked by the shedding of the S1 subunit and large conformational transitions in the S2 subunit. The necessity to maintain a large free energy gradient between the prefusion, immune protective state of the molecule and the post-fusion state results in a highly dynamic macromolecular structure. The S1 subunit is dynamic, presenting the RBD in two distinct states: a receptor binding site occluded ‘down’ state in which the RBDs rest against their adjacent protomer's NTD, and a receptor binding site exposed ‘up’ state. It is this RBD ‘up’ state to which the majority of neutralizing responses are observed in convalescent SARS-2 infected individuals′. As conformational evasion is a well-known virus escape mechanism, it is critical to understand the mechanism by which the dynamics are controlled.
Structural studies of the β-CoV S-protein have focused primarily on a soluble, ectodomain construct with and without stabilizing proline mutations (2P). This includes structures for SARS-21,2, SARS3-7, MERS3,8, and other human9,10 and murine11 β-CoV ectodomains. Structures for the SARS and MERS ectodomains revealed the presence of one and two RBD ‘up’ states with a three RBD ‘up’ state observed in the MERS ectodomain demonstrating the breadth of RBD configurations available to the spike. Interestingly, these states were not observed in the human β-CoVs HKU1 and OC43 nor in a Murine β-CoV, indicating mutations in the spike protein can confer dramatic differences in the propensity of the RBD to sample its available conformational space.
Our quantitative examination of the available β-CoV S-protein structures recently revealed the S1 and S2 subunit domains of different β-CoV viruses occupy a diverse array of configurations12. Based upon this analysis we predicted the S-protein conformation was sensitive to mutations at the interfaces between domains and subunits. Indeed, mutations at these sites had major impacts on the configuration of the protein, especially on the RBD ‘up’/‘down’ distribution12. While these and other studies13,14,15 have demonstrated the role of protein-protein contacts in determining the conformation of the S-protein, the influence on RBD configuration of glycosylation at or near interfacial domain regions is poorly understood.
Like other class I viral fusion proteins, the β-CoV S-proteins are heavily glycosylated, obscuring the spike surface and limiting the targetable area for immune responses. A recent site-specific analysis of the glycosylation patterns of the SARS-2 S-protein revealed variation in the glycan type, indicating marked differences in processing enzyme accessibility at each site16. Together, the wide variation in spike conformation coupled with the presence of glycans adjacent to the RBD indicates among the many factors affecting the RBD position, glycosylation patterns can provide a means by which to control its conformational equilibrium.
In this study we have investigated the potential for two SARS-2 NTD glycans in close proximity to the RBD to influence the conformational distribution of the RBD ‘up’ and ‘down’ states. Analysis of the available SARS-2 ‘up’ state structures indicated N165 and N234 glycans can interact with the ‘up’ state RBD acting as both direct stabilizers of the ‘up’ state and as steric blocks to transitions to the ‘down’ state. We combined binding studies by surface plasmon resonance, with structural studies using negative stain electron microscopy (NSEM) and single-particle cryo-electron microscopy (cryo-EM) to define shifts in the ‘up’/‘down’ state equilibrium in glycan-deleted mutants of the SARS-2 spike ectodomain. Together, our results demonstrate that RBD proximal glycans can influence the propensity of the S-protein adopt multiple configurations indicating a means for viral escape and therefore the need to consider non-RBD neutralizing responses in vaccine design.
Results
Structure Analysis Identifies Glycans with the Potential to Modify the S-Protein Conformation
In order to establish whether glycans can indeed alter the RBD orientation, we first examined the SARS-2 glycan density at positions 165 and 234 in the cryo-EM maps from three previously published SARS-2 structures. In the ‘down’ state, the N234 glycan resides in a cleft formed by the NTD and RBD (
RBD Conformation and Antigenicity of the N-Glycan Deleted S-Proteins Reveals Differential Stabilization of RBD ‘Up’ and ‘Down’ States
In order to examine the extent to which the N234 and N165 glycans influence the conformational distribution of the S-protein, we produced di-proline (2P) stabilized8 S-protein ectodomain2 N234A and N165A mutants.
The parent nCoV sequence (“nCoV-1 nCoV-2P”) is shown in
The N165A mutant sequence is shown in
The N234A mutant sequence is shown in
The protein yields after StrepTactin purification were 2.0 mg and 0.8 mg per 1 L culture supernatant, respectively for the N234A and the N165A mutant. (
We next examined the ‘up’/‘down’ state distribution of both mutants via negative stain electron microscopy (NSEM). Heterogenous classification of the N234A mutant particles revealed a dramatic shift from a ˜1:1 ‘up’ v. ‘down’ state distribution in the unmutated 2P2,12,17 to a ratio of ˜1:4 in the down state (
High-Resolution Cryo-EM Structures of the N-Glycan Deleted Constructs Indicates Modest Perturbation to S-Protein Configuration
We next turned to cryo-EM for high resolution structure determination to visualize the impact of the glycan deletions on the local and global configuration of the S-protein domains. We collected and processed 7,269 and 8,068 images for the N165A and N234A mutant, respectively, to yield particle stacks cleaned up by 2D classification, that were then subjected to multiple rounds of ab initio classification and heterogenous refinement in cryoSPARC18 using 20 Å low pass filtered ‘up’ state and ‘down’ state maps generated from available SARS-2 structures. Initial maps for high resolution refinement were generated from sorted particles via ab initio reconstruction (
We next examined the high-resolution details of the cryo-EM maps. Refinement of the N234A mutant ‘down’ state using C3 symmetry resulted in a 3.0 Å map with coordinates fit to this map yielding a structure aligning to the unmutated 2P structure (PDB ID 6VXX) with a ˜0.6 Å RMSD. Alignment of the S2 subunit revealed the structures to be nearly identical in these regions (RMSD ˜0.4 Å). Examination of the NTD to RBD interface using this alignment revealed a shift of the NTD toward the RBD (
Refinement of the N165A ‘up’ and ‘down’ states resulted in maps with resolutions of 3.6 Å using C1 symmetry and 3.3 Å using C3 symmetry, respectively. Similar to the N234A mutant, the N165A mutant structures showed an overall similar arrangement of the various domains. Alignment of the ‘down’ state structure of the N165A mutant with that of the unmutated spike yielded an RMSD of 0.81 Å with an S2 subunit alignment RMSD of 0.36 Å. Unlike the N234A mutant, the N165A mutant NTD is shifted away from the adjacent RBD (
Discussion
Viral fusion proteins are often heavily glycosylated with the SARS-2 S-protein being no exception. Though decorated with fewer glycans than the HIV-1 Envelope protein, with 22 glycans per protomer16, the SARS-2 spike is well shielded from immune surveillance. The SARS-2 spike protein has proven remarkably sensitive to domain-domain interfacial mutations12-15,19 which led us to ask whether glycans near the NTD-RBD interface can also impact the configuration of the spike. Here we have investigated the role of two NTD glycans at positions 234 and 165 in modulating S protein conformational dynamics by tracking the shift of RBD disposition in glycan-deleted mutants using binding to ACE-2 receptor, NSEM and cryo-EM analysis. While the specific magnitudes of differences vary between the different analysis methods, all the results track in the same direction to show that deletion of glycan 234 shifts the RBD dynamics more toward the “down” state, whereas deletion of glycan 165, retains or slightly enhances the distribution toward more “up” states. The 2-RBD “up” state observed in the NSEM analysis was not found in the cryo-EM data, indicating that the RBD up/down configuration in this construct can be sensitive to its environment. The shift in the position of the NTD toward the RBD in the ‘down’ state N234A mutant indicates the N234 glycan plays a direct role in destabilizing the ‘down’ state RBD position such that removal allows tighter packing of the RBD to the NTD. Additionally, the observed shift in the position of the ‘up’ state RBD indicates a role for the N234 glycan in modulating RBD stability. This is consistent with a recently released theoretical study investigating ‘up’ state RBD sensitivity to the presence of N165/N234 glycans via molecular simulation20. This investigation found that the absence of these glycans resulted in a comparatively unstable ‘up’ state RBD. The results here confirm the prediction from these simulations that loss of the N234 glycan results in an increased prevalence of the ‘down’ state. Deletion of the glycan at position 165 here indicates an opposite effect on the conformation of the spike relative to the N234A mutant, with the NTD shifting away from the adjacent RBD. Though this appears to relieve strain caused by the restriction imposed by the N234 glycan, the resultant lack of packing between the RBD and NTD can be sufficient to favor transitions to the ‘up’ state. Further, this shift indicates the N165 glycan interacts directly with the RBD. Though direct interactions are not observed in the cryo-EM densities here or in previously published SARS-2 structures, the presence of ‘down’ state conformational heterogeneity evinced by the poor resolution of the RBD and NTD elements of the spike is consistent with the possibility of such an interaction. A more detailed examination of this heterogeneity and the influence of these glycans on the various states of the spike will require large datasets with improved orientational sampling to better resolve these apical regions. Nevertheless, the results here demonstrate that the conformational ensemble of the SARS-2 spike and β-CoV spikes are sensitive to glycosylation patterns, especially near the NTD-RBD interface.
Our results from this study lend insights into two key questions—what role do the glycans at positions 165 and 234 play in modulating RBD dynamics and the biology of the native SARS-2 spike and how do these findings impact vaccine design? Toward the first question, we recognize that the results we describe are in the context of a stabilized, ectodomain construct and differences between these and what occurs on the spike in its native context can be determined. Indeed, a recent report for a detergent solubilized, full-length SARS-2 spike indicated greater stability in the ‘down’ state RBD21. Yet our experimental results revealing the role of the N165 and N234 glycans in modulating the conformational landscape of the S protein, taken together with the findings from the computational analysis performed in the context of the full-length spike20, and our analysis of the RBD-proximal NTD glycans of diverse β-CoVs (
Studies have shown that the NTD and RBD are quite mobile. We therefore asked whether the observed shifts in the NTD of the N165A and N234A mutants in the ‘down’ state are related to changes in the propensity of the domain to occupy positions or due to access to new states. We first classified the ‘down’ state 2P, N165A, and N234A particles using C1 symmetry yielding 4, 4, and 3 states, respectively. In order to quantify differences in the positions of the S1 domains, we generated a set of vectors between protomer RBD's and SD1's centroids and their adjacent NTD's centroids (
Vector magnitudes and relevant angles and dihedrals were determined for each of the three RBD-NTD pairings. Examination of the distance between adjacent RBDs and NTDs revealed markedly shifted positions between the three constructs (
The results indicate the 2P and N165A constructs display similar angles with geometric means of 52.9 And 53.0 Å, respectively (
A previous molecular dynamics-based study of the one ‘up’ state RBD indicated the N165 glycan “props up” the RBD. We therefore classified the 2P and N165A construct ‘up’ states in order to determine the extent to which the RBD positions. Each classified into four states with some overlap in the relative position of the RBDs. However, the 2P construct displayed an RBD more distant from the primary trimer axis as compared to those of the N165A construct while the N165A construct displayed a state much closer to the primary axis (
Methods
Vector Based Analysis
Vector analysis was performed as previously described. Specifically, Cα centroids for the S1 NTD, RBD, S131, SD2 (SARS-CoV-2 residues, 27-43 and 54-271, 330-443 and 503-528, 323-329 and 529-590, 294-322 and 591-696, respectively) as well as a β-sheet motif in the NTD (residues 116-129 and 169-172) and a helix motif in the RBD (residues 403-410) were determined. The NTD was split into two regions with the SD1 contacting, SD2 adjacent portion referred to here as the NTD′ (residues 44-53 and 272-293). Cα centroids in the S2 subunit were obtained for a β-sheet motif (residues 717-727 and 1047-1071) and the CD domain (711-716 and 1072-1122). Vector magnitudes, angles, and dihedrals between these centroids were determined and used in the subsequent analysis. Vector analysis was performed using the VMD22 Tcl interface.
Protein Expression and Purification
The SARS-CoV-2 ectodomain constructs were produced and purified as described previously2. Briefly, a gene encoding residues 1-1208 of the SARS-CoV-2 S (GenBank: MN908947) with proline substitutions at residues 986 and 987, a “GSAS” substitution at the furin cleavage site (residues 682-685), a C-terminal T4 fibritin trimerization motif, an HRV3C protease cleavage site, a TwinStrepTag and an 8XHisTag (SEQ ID NO: 4) was synthesized and cloned into the mammalian expression vector pall. All mutants were introduced in this background. Expression plasmids encoding the ectodomain sequence were used to transiently transfect FreeStyle293F cells using Turbo293 (SpeedBiosystems). Protein was purified on the sixth day post-transfection from the filtered supernatant using StrepTactin resin (IBA).
The ACE-2 gene was cloned as a fusion protein with a mouse Fc region attached to its C-terminal end. A 6× His-tag (SEQ ID NO: 5) was added to the C-terminal end of the Fc domain. ACE-2 with mouse FC tag was purified by Ni-NTA chromatography.
Thermal Shift Assay
The thermal shift assay was performed using Tycho NT. 6 (NanoTemper Technologies). Spike variants were diluted (0.15 mg/ml) in nCoV buffer (2 mM Tris, pH 8.0, 200 mM NaCl, 0.02% sodium azide) and run in duplicates in capillary tubes. Intrinsic fluorescence was recorded at 330 nm and 350 nm while heating the sample from 35-95° C. at a rate of 3° C./min. The ratio of fluorescence (350/330 nm) and the Ti were calculated by Tycho NT. 6.
Cryo-EM Sample Preparation, Data Collection and Processing
Purified SARS-CoV-2 spike preparations were diluted to a concentration of ˜1 mg/mL in 2 mM Tris pH 8.0, 200 mM NaCl and 0.02% NaN3. 2.5 μL of protein was deposited on a CF-1.2/1.3 grid that had been glow discharged for 30 seconds in a PELCO easiGlow™ Glow Discharge Cleaning System. After a 30 s incubation in >95% humidity, excess protein was blotted away for 2.5 seconds before being plunge frozen into liquid ethane using a Leica EM GP2 plunge freezer (Leica Microsystems). Frozen grids were imaged in a Titan Krios (Thermo Fisher) equipped with a K3 detector (Gatan). Data were acquired using the Leginon system23. The dose was fractionated over 50 raw frames and collected at 50 ms framerate. This dataset was energy-filtered with a slit width of 30 eV. Individual frames were aligned and dose-weighted′. CTF estimation, particle picking, 2D classifications, ab initio model generation, heterogeneous refinements, homogeneous 3D refinements and local resolution calculations were carried out in cryoSPARC25.
Cryo-EM Structure Fitting and Analysis
Structures of the all ‘down’ state (PDB ID 6VXX) and single RBD ‘up’ state (PDB ID 6VYB) from the previously published SARS-CoV-2 ectodomain were used to fit the cryo-EM maps in Chimera26. Mutations were made in PyMol27. Coordinates were fit to the maps first using ISOLDE28 followed by iterative refinement using Phenix29 real space refinement and subsequent manual coordinate fitting in Coot as needed. Structure and map analysis were performed using PyMol, Chimera26 and ChimeraX30.
Surface Plasmon Resonance
The binding of ACE-2 to the SARS-2 spike constructs was assessed by surface plasmon resonance on Biacore T-200 (GE-Healthcare) at 25° C. with HBS-EP+ (10 mM HEPES, pH 7.4, 150 mM NaCl, 3 mM EDTA, and 0.05% surfactant P-20) as the running buffer. ACE-2 tagged at its C-terminal end to a mouse Fc region was captured on an anti-Fc surface. Binding was assessed by flowing over different concentrations of the spike constructs over the ACE-2 surface. The surface was regenerated between injections by flowing over 3M MgCl2 solution for 10s with flow rate of 100 μl/min. Blank sensorgrams were obtained by injection of the same volume of HBS-EP+ buffer in place of IgGs and Fab solutions. Sensorgrams were corrected with corresponding blank curves. Sensorgram data were analyzed using the BiaEvaluation software (GE Healthcare).
This application claims the benefit of and priority to U.S. Patent Application No. 63/009,969 filed Apr. 14, 2020, U.S. Patent Application No. 63/026,588 filed May 18, 2020 and U.S. Patent Application No. 63/044,629 filed Jun. 26, 2020, the content of each application is herein incorporated by reference in its entirety.
This invention was made with government support under administrative supplement to NIH R01 AI145687 for coronavirus research and a grant from the State of North Carolina from the Federal CARES Act. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2021/027242 | 4/14/2021 | WO |
Number | Date | Country | |
---|---|---|---|
63009969 | Apr 2020 | US | |
63026588 | May 2020 | US | |
63044629 | Jun 2020 | US |