This application is being filed along with a Sequence Listing in electronic format. The Sequence Listing is provided as a file entitled SeqList-DURC048.012APC, created Jan. 23, 2023, which is approximately 85 kilobytes in size, which is replaced by a Replacement Electronic Sequence Listing submitted herewith as a file entitled HOFM028001APCREPLACEMENTSEQLIST.txt, which is 138,440 bytes in size and was created on Aug. 14, 2023. The information in the electronic format of the Sequence Listing is incorporated herein by reference in its entirety.
This application relates to the medical field of COVID-19 diagnosis or treatment, and in particular, it relates to fusion proteins comprising severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) receptor binding domain (RBD) or a fragment thereof. Said fusion proteins are useful for the development of assays capable of screening reagents that inhibit binding of the viral spike (S) protein to the angiotensin converting enzyme 2 (ACE2).
SARS-COV-2 is an enveloped RNA virus from the Coronaviridae family (Gorbalenya, A.E, et al., 2020, Nature Microbiology, 5(4):p.536-544) that has several structural components, including Spike (S), Envelope (E), Membrane (M) and Nucleocapsid (N) proteins (Lu, R., et al., 2020, Lancet 395(10224): p.565-574). The S protein consists of two subunits (S1 and S2) that form a trimer on the viral membrane; S1 contains the RBD which is responsible for binding to the ACE2 host cell receptor (Hoffmann, M., et. al., 2020, Cell, 181(2):p.271-280.e8), while S2 enables the fusion between the host and viral membranes (Lan, J., et al., 2020, Nature, 581(7807):215-220; Wrapp, D., et al., 2020, Science, 367(6483) :p. 1260-1263).
SARS-COV-2 has caused a widespread COVID-19 pandemic that infected millions worldwide and claimed hundreds of thousands of lives. Currently, the main and most accurate method of diagnosis is by PCR testing of nasopharyngeal swabs (Peng et al., 2020, J Med Virol. 24;10.1002/jmv.25936); yet, there is an urgent need to develop reliable, highly sensitive and specific antibody tests capable of identifying all infected individuals, irrespective of clinical symptoms. This information will be critical to establish community surveillance and implement policies that contain the viral spread.
The US Food and Drug Administration (FDA) has granted Emergency Use Authorizations (EUA) to multiple immunoassay tests in the market, but none of those assays has been fully validated. Because of the lack of validated immunoassays, key to understand risk, epidemiological factors, pathogenesis and mortality, the present inventors developed fusion proteins that comprise RBD molecular designs aimed at being a reagent in SARS-COV-2 immunoassays.
The spike RBD represents a promising antigen for the detection of anti-SARS-COV-2 IgGs aimed at identifying current and past infections; and because the RBD is poorly conserved among other SARS-CoVs and pathogenic human coronavirus, it shows an enhanced capacity to recognize total anti-SARS-COV-2 Igs and IgMs (Premkumar, L. et al., 2020, Science Immunology, (10):p1126-1140). The concerns of lower assay sensitivity due to the small size of the RBD protein may be overcome by the molecular fusion of RBD and N proteins. The goal of this invention is to improve assay specificity (RBD truncations and RBD mutations) and sensitivity (RBD-N fusions, RBD-multimerization domains; RBD-horseradish peroxidase (HRP)).
The inventors of the present invention have developed RBD fusion proteins and molecular designs that facilitate the identification of hyperimmune human sera to be used as a therapeutic or for therapeutic development. A large fraction of antibodies developed against RBD show neutralizing properties, the rationale being that these mAbs disrupt the interaction between S and hACE2 proteins, preventing viral entry. As of Jun. 29, 2020 the FDA has not approved convalescent plasma therapy, however it recommends under investigational studies and clinical trials, to use a titer of at least 1:160 for human passive immunization studies. Because RBD elicits the development of antibodies with antiviral activity, these proteins will be essential for the development of inhibitory assays that identify neutralizing antibodies against SARS-COV-2.
The present invention describes a new composition of matter for the production of RBD fusion proteins. This invention embodies the methods for producing RBD fusion proteins as well as the nucleic acid molecules encoding RBD, their expression vectors and host cells. It also covers RBD truncations, multimerization domains and fusions to N protein. This novel composition of matter also embodies mutations identified by molecular dynamics simulations and affinity maturation that have been described as enhancers of expression or affinity to ACE2. The described molecular designs can be used as key reagents in antibody titer, inhibitor/neutralization screening assays, vaccine development or as agents to elicit the production of therapeutic antibodies with antiviral activity. These fusion proteins can also be fused to HRP for enabling SARS-COV-2 detection and quantification.
The present inventors have developed non-obvious RBD molecular designs containing IgG1, IgG2aFc and p53 dimerization and tetramerization domains, with the goal of increasing assay avidity and sensitivity; while also producing high quality, well characterized and reproducible material. In addition, embodiments where the RBD with N proteins are fused together were designed, as well as RBD and HRP with the goal of improving assay sensitivity during the acute phase of infection, as N protein is detected early during the infection.
The described molecules are specifically recognized by anti-SARS-COV-2 S/S1/RBD polyclonal rabbits antibodies and can be used as single entities in capturing anti-SARS-COV-2 total IgG or IgM antibodies in immunoassay platforms. When a full assay is developed, these molecules can be immobilized in a solid support such as a microtiter plate, a membrane, a bead, a polypeptide chip, or a chromatography column. A subset of the presented designs has been experimentally tested with similar or better performance (measured as affinity to hACE2) than other commercial counterparts.
Finally, due to the strong antiviral activity of RBD-specific antibodies, the RBD proteins herein described can be used as vaccine candidates to elicit broadly effective anti-SARS-COV-2 antibodies (Robbiani, D. et al., 2020, Nature, doi: https://doi.org/10.1101/2020.05.13.092619; Huo, J. et al., 2020, Cell Host & Microbe, (28):p1-10).
In a first aspect, the present invention relates to a fusion protein comprising the SARS-COV-2 receptor binding domain (RBD) of the SARS-COV-2 spike protein or a fragment thereof, and a N-terminal signal peptide, and at least one of a polyhistidine tag, a linker, an oligomerization tag, a region in spike protein outside RBD, a horseradish peroxidase binding domain or a protease cleavage site.
In one embodiment, said N-terminal signal peptide is selected from a SARS-COV-2 spike endogenous signal peptide, or a tissue plasminogen activator (tPa) signal peptide. In one, embodiment, said N-terminal signal peptide has an amino acid sequence selected from SEQ ID NO:1 and SEQ ID NO:2.
In one embodiment, said polyhistidine tag consists of 8 or 10 histidine residues. In one, embodiment, said polyhistidine tag has an amino acid sequence selected from SEQ ID NO:7 and SEQ ID NO:8.
In one embodiment, said oligomerization tag is selected from a murine IgG1-Fc (CH2, CH3 only), a murine IgG1-Fc dimerization domain, a murine IgG-2a-Fc (CH2, CH3 only), a murine IgG-2a-Fc dimerization domain, a p53 tetramerization domain, a SARS-COV-2 nucleocapsid N-terminal domain and a SARS-COV-2 nucleocapsid C-terminal domain. In one embodiment, said oligomerization tag has an amino acid sequence selected from SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14 and SEQ ID NO: 15.
In one embodiment, said linker is a flexible linker. In one embodiment, said linker has an amino acid sequence selected from SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5 and SEQ ID NO:6.
In one embodiment, the streptavidin binding peptide tag has or comprises the amino acid sequence of the SEQ ID NO: 17. In one embodiment, said horseradish peroxidase binding domain has an amino acid sequence selected from SEQ ID NO:18.
In one embodiment, said protease cleavage site is selected from a tobacco etch virus cleavage site (TEV). In one embodiment, said protease cleavage site has an amino acid sequence selected from SEQ ID NO: 19.
In one embodiment, said receptor binding domain (RBD) of the SARS-COV-2 spike protein or a fragment thereof has an amino acid sequence of at least about 90%, or at least 95% sequence identity with SEQ ID NO:20.
In one embodiment, said fusion protein has an amino acid sequence of at least 90%, or at least 95% sequence identity with SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, or SEQ ID NO:57.
In one embodiment, said SARS-COV-2 RBD protein comprises a mutation in one or more of the following positions: G404, A475, T478, N481, G485, F490, Q493, G496, Q498, N501, or V503.
In a further aspect, the present invention refers to a cell, comprising the fusion protein as described above.
In a further aspect, the present invention refers to a nucleic acid comprising a nucleotide sequence encoding the fusion protein, a promoter operably linked to the nucleotide sequence and a selectable marker.
In another aspect, the present invention refers to a cell comprising the above-mentioned nucleic acid.
Finally, the present invention refers to a composition comprising the above-mentioned fusion protein and a solid support, wherein the fusion protein is covalently or non-covalently bound to the solid support.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art pertinent to the methods and compositions described. As used herein, the following terms and phrases have the meanings ascribed to them unless specified otherwise.
The terms “a,” “an,” and “the” include plural referents, unless the context clearly indicates otherwise.
Throughout this specification, unless the context requires otherwise, the word “comprise”, or variations such as “comprises” or “comprising”, will be understood to imply the inclusion of a stated element or integer or group of elements or integers but not the exclusion of any other element or integer or group of elements or integers.
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention pertains. Exemplary methods and materials are described below, although methods and materials similar or equivalent to those described herein can also be used and will be apparent to those of skill in the art. All publications and other references mentioned herein are incorporated by reference in their entirety. In case of conflict, the present specification, including definitions, will control. The materials, methods, and examples are illustrative only and not intended to be limiting.
Each embodiment in this specification is to be applied mutatis mutandis to every other embodiment unless expressly stated otherwise.
The following terms, unless otherwise indicated, shall be understood to have the following meanings:
As used herein, the term “nucleic acid” refers to any materials comprised of DNA or RNA. Nucleic acids can be made synthetically or by living cells.
As used herein, the term “protein” refers to large biological molecules, or macromolecules, consisting of one or more chains of amino acid residues. Many proteins are enzymes that catalyze biochemical reactions and are vital to metabolism. Proteins also have structural or mechanical functions, such as actin and myosin in muscle and the proteins in the cytoskeleton, which form a system of scaffolding that maintains cell shape. Other proteins are important in cell signaling, immune responses, cell adhesion, and the cell cycle. However, proteins may be completely artificial or recombinant, i.e., not existing naturally in a biological system.
As used herein, the term “polypeptide” refers to both naturally-occurring and non-naturally-occurring proteins, and fragments, mutants, derivatives and analogs thereof. A polypeptide may be monomeric or polymeric. A polypeptide may comprise a number of different domains (peptides) each of which has one or more distinct activities.
As used herein, the term “recombinant” refers to a biomolecule, e.g., a gene or protein, that (1) has been removed from its naturally occurring environment, (2) is not associated with all or a portion of a polynucleotide in which the gene is found in nature, (3) is operatively linked to a polynucleotide which it is not linked to in nature, or (4) does not occur in nature. The term “recombinant” can be used in reference to cloned DNA isolates, chemically synthesized polynucleotide analogs, or polynucleotide analogs that are biologically synthesized by heterologous systems, as well as proteins and/or mRNAs encoded by such nucleic acids.
As used herein, the term “fusion protein” refers to proteins comprising two or more amino acid sequences that do not co-exist in naturally-occurring proteins. A fusion protein may comprise two or more amino acid sequences from the same or from different organisms. The two or more amino acid sequences of a fusion protein are typically in frame without stop codons between them and are typically translated from mRNA as part of the fusion protein.
The term “fusion protein” and the term “recombinant” can be used interchangeably herein.
As used herein, the term “antigen” refers to a biomolecule that binds specifically to the respective antibody. An antibody from the diverse repertoire binds a specific antigenic structure by means of its variable region interaction.
The terms “antibody” or “immunoglobulin”, as used herein, have the same meaning, and will be used equally in the present invention. The term “antibody” as used herein refers to immunoglobulin molecules and immunologically active portions of immunoglobulin molecules, i.e., molecules that contain an antigen binding site that specifically binds an antigen. As such, the term antibody encompasses not only whole antibody molecules, but also antibody fragments or derivatives.
The term “binding affinity”, as used herein, refers to the strength of interaction between an antigen's epitope and an antibody's antigen binding site.
As used herein, a “promoter” is a specific nucleic acid sequence that is recognized by a DNA-dependent RNA polymerase (“transcriptase”) as a signal to bind to the nucleic acid and begin the transcription of RNA at a specific site.
The terms “modified sequence” and “modified genes” are used interchangeably herein to refer to a sequence that includes a deletion, insertion or interruption of naturally occurring nucleic acid sequence. In some preferred embodiments, the expression product of the modified sequence is a truncated protein (e.g., if the modification is a deletion or interruption of the sequence). In some particularly preferred embodiments, the truncated protein retains biological activity. In alternative embodiments, the expression product of the modified sequence is an elongated protein (e.g., modifications comprising an insertion into the nucleic acid sequence). In some embodiments, an insertion leads to a truncated protein (e.g., when the insertion results in the formation of a stop codon). Thus, an insertion may result in either a truncated protein or an elongated protein as an expression product.
As used herein, the terms “mutant sequence” and “mutant gene” are used interchangeably and refer to a sequence that has an alteration in at least one codon occurring in a host cell's wild-type sequence. The expression product of the mutant sequence is a protein with an altered amino acid sequence relative to the wild-type. The expression product may have an altered functional capacity (e.g., enhanced binding affinity).
The term “region” or “fragment” as used herein, refers to a portion of an amino acid sequence wherein said portion is smaller than the entire amino acid sequence. In some embodiments, refers to a portion of the receptor-binding domain (RBD) of the SARS-COV-2 with a sequence identity of at least about 90% to the amino acid sequence of the RBD. In some embodiments, refers to a portion of the spike protein outside the RBD of the SARS-COV-2 with a sequence identity of at least about 90% to the amino acid sequence of the spike protein outside the RBD.
The term “receptor-binding domain” or “RBD” refers to a protein in SARS-COV-2 S that bound strongly to human and bat angiotensin-converting enzyme 2 (ACE2) receptors.
The term “spike protein”, “S protein” or “S” refers to a large type I transmembrane protein ranging from 1,160 amino acids for avian infectious bronchitis virus (IBV) and up to 1,400 amino acids for feline coronavirus (FCoV). In addition, this protein is highly glycosylated as it contains 21 to 35 N-glycosylation sites. Spike proteins assemble into trimers on the virion surface to form the distinctive “corona”, or crown-like appearance. The ectodomain of all CoV spike proteins share the same organization in two domains: a N-terminal domain named S1 that is responsible for receptor binding and a C-terminal S2 domain responsible for fusion. CoV diversity is reflected in the variable spike proteins (S proteins), which have evolved into forms differing in their receptor interactions and their response to various environmental triggers of virus-cell membrane fusion. It's been reported that 2019-nCOV can infect the human respiratory epithelial cells through interaction with the human ACE2 receptor. Indeed, the recombinant Spike protein can bind with recombinant ACE2 protein.
The term “angiotensin converting enzyme 2” or “ACE2” refers to an enzyme attached to the cell membranes of cells in the lungs, arteries, heart, kidney, and intestines. ACE2 lowers blood pressure by catalysing the hydrolysis of angiotensin II (a vasoconstrictor peptide) into angiotensin (1-7) (a vasodilator). ACE2 counters the activity of the related angiotensin-converting enzyme (ACE) by reducing the amount of angiotensin-II and increasing Ang(1-7) making it a promising drug target for treating cardiovascular diseases. ACE2 also serves as the entry point into cells for some coronaviruses, including HCoV-NL63, SARS-COV, and SARS-COV-2. The human version of the enzyme is often referred to as hACE2.
The term “horseradish peroxidase” or “HRP” is used extensively in biochemistry applications. It is a metalloenzyme with many isoforms, of which the most studied type is C. It catalyzes the oxidation of various organic substrates by hydrogen peroxide.
As used herein, the term “N-terminal signal peptide” is a short peptide (usually 10-30 amino acids long) present at the N-terminus of the majority of newly synthesized proteins that are destined toward the secretory pathway. These proteins include those that reside either inside certain organelles (the endoplasmic reticulum, Golgi or endosomes), secreted from the cell, or inserted into most cellular membranes. Although most type I membrane-bound proteins have signal peptides, the majority of type II and multi-spanning membrane-bound proteins are targeted to the secretory pathway by their first transmembrane domain, which biochemically resembles a signal sequence except that it is not cleaved. They are a kind of target peptide.
As used herein, the term “purification tag” or “affinity tag” refers to a polypeptide used to purify proteins that simplifies purification and enables use of standard protocols. In the present invention, the purification tag is a polyhistidine tag of 4, 6, 7, 8, 9, 10, 11 or 12 histidine residues. Preferably, the histidine tag has 8 or 10 histidine residues.
As used herein, the term “linker” refers to a polypeptide comprising of 1-10 amino acids, preferably 3-6 amino acids. The amino acids of the linker may be selected from the group consisting of leucine (Leu, L), isoleucine (Ile, I), alanine (Ala, A), glycine (Gly, G), valine (Val, V), proline (Pro, P), lysine (Lys, K), arginine (Arg, R), Serine (Ser, S), asparagine (Asn, N), and glutamine (Gln, Q), tryptophan (Trp, W), methionine (Met, M), aspartic acid (Asp, D), cysteine (Cys, C), glutamic acid (Glu, E), histidine (His, H), phenylalanine (Phe, F), threonine (The, T), and tyrosine (Tyr, Y). In some preferred embodiments, the linker is a flexible linker that may consist of a sequence of consecutive amino acids that typically include at least one glycine and at least one serine. Exemplary flexible linkers include the amino acid sequences set forth in SEQ ID NO: 3 (GGGS), SEQ ID NO: 4 (GGGP), SEQ ID NO: 5 (GGSGG) or SEQ ID NO: 6 (GGSGGGGS), although the precise amino acid sequence of the linker is not particularly limiting. As used herein, the term “oligomerization tag” refers to a polypeptide for increasing assay avidity and sensitivity. In the present invention, the oligomerization tag are selected from a murine IgG1-Fc (CH2, CH3 only), a murine IgG1-Fc dimerization domain, a murine IgG-2a-Fc (CH2, CH3 only), a murine IgG-2a-Fc dimerization domain, a p53 tetramerization domain, a SARS-COV-2 nucleocapsid N-terminal domain and a SARS-COV-2 nucleocapsid C-terminal domain.
As used herein, the term “region in spike protein outside RBD” refers to a polypeptide comprising of 1-30 amino-acids of SARS-COV-2 which are not part of the RBD protein.
As used herein, the term “horseradish peroxidase binding domain” refers to an enzyme used in conjugates (molecules that have been joined genetically or chemically) to determine the presence of a molecular target.
As used herein, the term “tobacco etch virus cleavage site” or “TEV” refers to a highly site-specific cysteine protease that is found in the tags from fusion proteins. The optimal temperature for cleavage is 30° C.; also it can be used at temperature as low as 4° C. It is recommended that the cleavage for each fusion protein be optimized by varying the amount of recombinant viral TEV protease, reaction time, or incubation temperature. It can be removed by Ni2+ affinity resin. The optimum recognition site for this enzyme is the sequence Glu-Asn-Leu-Tyr-Phe-Gln-(Gly/Ser) [ENLYFQ(G/S)] and cleavage occurs between the Gln and Gly/Ser residues. The most commonly used sequence is ENLYFQG. The protease is used to cleave affinity tags from fusion proteins.
The term “diagnostic” or “diagnosed”, as used herein, means identifying the presence or nature of a pathologic condition or a patient susceptible to a disease. Diagnostic methods differ in their sensitivity and specificity. The “sensitivity” of a diagnostic assay is the percentage of diseased individuals who test positive (percent of “true positives”). Diseased individuals not detected by the assay are “false negatives”. Subjects who are not diseased and who test negative in the assay, are termed “true negatives.” The “specificity” of a diagnostic assay is 1 minus the false positive rate, where the “false positive” rate is defined as the proportion of those without the disease who test positive. While a particular diagnostic method may not provide a definitive diagnosis of a condition, it suffices if the method provides a positive indication that aids in diagnosis.
As used herein, the term “Biolayer interferometry (BLI)” is a label-free technology for measuring biomolecular interactions. It is an optical analytical technique that analyzes the interference pattern of white light reflected from two surfaces: a layer of immobilized protein on the biosensor tip, and an internal reference layer. Any change in the number of molecules bound to the biosensor tip causes a shift in the interference pattern that can be measured in real-time.
The present invention relates to a fusion protein comprising the SARS-COV-2 receptor binding domain (RBD) of the SARS-COV-2 spike protein or a fragment thereof, and a N-terminal signal peptide, and at least one of a polyhistidine tag, a linker, a oligomerization tag, a region in spike protein outside RBD, a horseradish peroxidase binding domain or a protease cleavage site.
The SARS-COV-2 full length Spike (FLS, GenBank MN908947.3), comprises two domains, namely S1 and S2, are responsible for the binding step. S1 contains the RBD, which directly binds to the peptidase domain (PD) of ACE2, whereas S2 is responsible for membrane fusion. When S1 binds to the host receptor ACE2, another cleavage site on S2 is exposed and is cleaved by host proteases, a process that is critical for viral infection. The S protein of SARS-COV-2 may also exploit ACE2 for host infection.
The fusion proteins of the present invention can be obtained by methods well-known to the skilled person. For example, said fusion proteins can be obtained recombinantly in bacteria, yeasts, fungi, or mammalian cells. In one embodiment, the fusion proteins of the present invention are produced in prokaryotic cells, such as Escherichia coli, but other prokaryotic cells can be used. In another embodiment, the fusion proteins of the present inventions are produced in human embryotic kidney (HEK) or Chinese hamster ovary (CHO) cells, but other eukaryotic cells can be used.
The fusion proteins of the present invention can be purified from the cells by methods well known to the skilled person. Said methods include, without limitation, filtration, conjugation, affinity chromatography, ion exchange chromatography, hydrophobic interaction chromatography, and size exclusion chromatography.
Regarding the signal peptides included in the fusion proteins of the present invention, these signal peptides could result in improved expression and/or secretion of the protein during recombinant production. Moreover, inclusion of different signal peptides can alter post translational modification (PTMs) and potentially the function of the protein. Therefore, it is non-obvious that the fusion proteins of the present invention can be produced or be functional. In one embodiment, said N-terminal signal peptide is selected from a spike endogenous signal peptide and a tissue plasminogen activator (tPa). Said N-terminal signal peptide has an amino acid sequence selected from SEQ ID NO:1 and SEQ ID NO:2.
As previously described, the use of polyhistidine tag simplifies purification and enables use of standard protocols in the production of fusion proteins. For example, the histidine (His) tag (also known as polyhistidine or polyHis) is known to be useful, for example, in the purification by Immobilized Metal Affinity Chromatography (IMAC). Other uses of the polyhistidine tag are also well-known by the skilled person and therefore the polyhistidine tag of the present invention is not limited to the purification functionality. In the present invention, said polyhistidine tag can be of 6, 8 or 10 histidine residues. It is important to evaluate the impact of a tag at both the N and C termini of the protein both to produce the protein but also for the functionality and aggregation states of the protein. The impact the location of the tag will have is non-obvious. Moreover, the utility of the tag in purification or any assay development is unknown. The inclusion of the TEV cleavage site was done with N-terminal tagging. If an N-terminally tagged construct were chosen, it would be possible to generate a tag free version. Additionally, the promiscuity of the TEV tag was utilized to support the possible production of a scar-free protein. Preferably said polyhistidine tag has an amino acid sequence selected from SEQ ID NO:7 and SEQ ID NO:8.
In another embodiment, oligomerization tags or domains have been included in the fusion proteins of the present invention which are selected from a murine IgG1-Fc (CH2, CH3 only), a murine IgG1-Fc dimerization domain, a murine IgG-2a-Fc (CH2, CH3 only), a murine IgG-2a-Fc dimerization domain, a p53 tetramerization domain, a SARS-COV-2 nucleocapsid N-terminal domain and a SARS-COV-2 nucleocapsid C-terminal domain. Said oligomerization tag has an amino acid sequence selected from SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14 and SEQ ID NO: 15. The RBD molecular designs contain IgG1, IgG2aFc and p53 dimerization and tetramerization domains with the goal of increasing assay avidity and sensitivity.
Linkers can be also present in the fusion proteins of the present invention. In one embodiment, said linker can be a flexible linker. Flexible linkers are included when fusing domains of different proteins together. Most of these linkers are a combination of glycine and serine while in some cases proline was added to kink the protein. These flexible linkers may help to improve the tolerance for assembly of those domains, and are often a combination of glycine and serine. However, it is not obvious to the skilled person if the inclusion of the selected linkers would produce functional fusion proteins. In one embodiment, said linker is a flexible linker to add flexibility. Said linker has an amino acid sequence selected from SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5 and SEQ ID NO:6.
The use of streptavidin binding domain (SBP) (SEQ ID NO: 17) to support assay development in either plate coating or conjugation of fluorophores or HRP tags for readout. The goal was to avoid labelling residues key to the protein interaction with hACE2 receptor or antibodies. A horseradish peroxidase (HRP) binding domain refers to an enzyme used in conjugates (molecules that have been joined genetically or chemically) to determine the presence of a molecular target. In some embodiments, said horseradish peroxidase binding domain has an amino acid sequence selected from SEQ ID NO: 18.
In some embodiments, said protease cleavage site is a tobacco etch virus cleavage site (TEV). Said protease cleavage site has an amino acid sequence selected from SEQ ID NO: 19.
In some embodiments, said receptor binding domain (RBD) of the SARS-COV-2 spike protein or a fragment thereof has an amino acid sequence of at least 90%, or at least 95% sequence identity with SEQ ID NO:20.
This invention also encompasses high affinity RBD mutations in specific RBD formats, in order to cover emergent SARS-COV-2 mutations that enhance binding to hACE2. Some of these novel protein designs harbor SARS-COV-2 mutations that emerged in nature (Pango lineage variants: B1.1.7, B. 1.351, B1.617.2, B.1.427 and P.1). In addition, molecular dynamic simulation and affinity maturation software from Schrodinger (Bio luminate) was used to predict the AA mutations in the primary sequence of RBD that would confer higher affinity to hACE2. Among those mutations we found that in silico, and in light to what has been described in the literature mutations V367F and G502D which increase expression of RBD and N501F, N501T and Q498Y.
In some embodiments, said fusion protein has an amino acid sequence of at least 90%, or at least 95% sequence identity with SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42, SEQ ID NO: 43, SEQ ID NO: 44, SEQ ID NO: 45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO: 48, SEQ ID NO: 49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, or SEQ ID NO:57.
The present inventors also designed embodiments where the RBD with N proteins are fused together, as well as RBD and HRP with the goal of improving assay sensitivity during the acute phase of infection, as N protein is detected early during the infection.
In some embodiments, the invention also embodies high affinity RBD mutations that enhance binding to human ACE2. The present inventors used molecular dynamic simulation and affinity maturation software from Schrodinger (Bio luminate) to predict the AA mutations in the primary sequence of RBD that would confer higher affinity to hACE2. Among those mutations are V367F and G502D which increases expression of RBD and N501F, N501T and Q498Y. In some embodiments, said SARS-COV-2 RBD protein comprises a mutation in one or more of the following positions: G404, A475, T478, N481, G485, F490, Q493, G496, Q498, N501, or V503.
The present invention also relates to nucleic acids comprising a nucleotide sequence encoding the fusion proteins described herein. The nucleic acid may be DNA or RNA. DNA comprising a nucleotide sequence encoding a fusion protein described herein typically comprises a promoter that is operably-linked to the nucleotide sequence. The promoter is preferably capable of driving constitutive or inducible expression of the nucleotide sequence in an expression cell of interest. Said nucleic acid may also comprise a selectable marker useful to select the cell containing said nucleic acid of interest. Useful selectable markers are well known by the skilled person. The precise nucleotide sequence of the nucleic acid is not particularly limiting so long as the nucleotide sequence encodes a fusion protein described herein. Codons may be selected, for example, to match the codon bias of an expression cell of interest (e.g., a mammalian cell such as a human cell) and/or for convenience during cloning. DNA may be a plasmid, for example, which may comprise an origin of replication (e.g., for replication of the plasmid in a prokaryotic cell).
In one embodiment described herein, the present invention refers to a nucleic acid comprising a nucleotide sequence encoding the fusion protein, a promoter operably linked to the nucleotide sequence and a selectable marker.
Various aspects of the present invention also relate to a cell comprising a nucleic acid comprising a nucleotide sequence that encodes a fusion protein as described herein. The cell may be an expression cell or a cloning cell. Nucleic acids are typically cloned in E. coli, although other cloning cells may be used.
If the cell is an expression cell, the nucleic acid is optionally a nucleic acid of a chromosome, i.e., wherein the nucleotide sequence is integrated into the chromosome, although then nucleic acid may be present in an expression cell, for example, as extrachromosomal DNA or vectors, such as plasmids, cosmids, phages, etc. The format of the vector should not be considered limiting.
In one embodiment described herein, the cell is typically an expression cell. The nature of the expression cell is not particularly limiting. Expression cells which may be used are prokaryotic cells such as E. coli and Bacillus spp. and eukaryotic cells such as yeast cells (e.g. S. cerevisiae, S. pombe, P. pastoris, K lactis, H polymorpha), insect cells (e.g. Sf9), fungal, plant cells or mammalian cells. Mammalian expression cells may allow for favorable folding, post-translational modifications, and/or secretion of a fusion protein, although other eukaryotic cells or prokaryotic cells may be used as expression cells. Exemplary expression cells include TunaCHO, ExpiCHO, Expi293, BHK, NS0, Sp2/0, COS, C127, HEK, HT-1080, PER.C6, HeLa, and Jurkat cells. The cell may also be selected for integration of a vector, more preferably for integration of a plasmid DNA.
The fusion proteins of the present invention can be produced by appropriate transfection strategy of the nucleic acids comprising a nucleotide sequence that encodes the fusion proteins into mammalian cells. The skilled person is aware of the different techniques available for transfection of nucleic acids into the cell line of choice (lipofection, electroporation, etc). Thus, the choice of the mammalian cell line and transfection strategy should not be considered limiting. The cell line could be further selected for integration of the plasmid DNA.
Various aspects of the present invention also relate to a cell comprising the fusion proteins described herein.
Various aspects of the present invention relate to compositions comprising a fusion protein as described herein. In some embodiments, the composition may comprise a pharmaceutically-acceptable carrier and/or a pharmaceutically-acceptable excipient. The composition may be, for example, a vaccine.
Various embodiments of the present invention relate to a method of treating or preventing a SARS-COV-2 infection in a human patient comprising administering to the patient a composition comprising a fusion protein as described herein. The term “preventing” as used herein refers to prophylaxis, which includes the administration of a composition to a patient to reduce the likelihood that the patient will become infected with SARS-COV-2 relative to an otherwise similar patient who does not receive the composition. The term preventing also includes the administration of a composition to a group of patients to reduce the number of patients in the group who become infected with SARS-COV-2 relative to an otherwise similar group of patients who do not receive the composition.
Various embodiments of the invention relate to a method of treating or preventing a SARS-COV-2 infection in a human patient comprising administering to the patient a vaccine according to the embodiments described herein.
A patient may be infected with SARS-COV-2, a patient may have been exposed to SARS-COV-2, or a patient may present with an elevated risk for exposure to and/or infection with SARS-COV-2.
In one embodiment described herein, the composition comprises the fusion protein of the present invention and a solid support.
In other embodiment, the composition comprises the fusion protein of the present invention and a solid support, wherein the fusion protein is covalently or non-covalently bound to the solid support. The term “non-covalently bound,” as used herein, refers to specific binding such as between an antibody and its antigen, a ligand and its receptor, or an enzyme and its substrate, exemplified, for example, by the interaction between streptavidin binding protein and streptavidin or an antibody and its antigen.
In other embodiment, the composition comprises the fusion protein of the present invention and a solid support, wherein the fusion protein is directly or indirectly bound to a solid support. The term “direct” binding, as used herein, refers to the direct conjugation of a molecule to a solid support, e.g., a gold-thiol interaction that binds a cysteine thiol of a fusion protein to a gold surface. The term “indirect” binding, as used herein, includes the specific binding of a fusion protein to another molecule that is directly bound to a solid support, e.g., a fusion protein may bind an antibody that is directly bound to a solid support thereby indirectly binding the fusion protein to the solid support. The term “indirect” binding is independent of the number of molecules between the fusion protein and the solid support so long as (a) each interaction between the daisy chain of molecules is a specific or covalent interaction and (b) a terminal molecule of the daisy chain is directly bound to the solid support.
A solid support may comprise a particle, a bead, a membrane, a surface, a polypeptide chip, a microtiter plate, or the solid-phase of a chromatography column.
A composition may comprise a plurality of beads or particles, wherein each bead or particle of the plurality of beads or particles are directly or indirectly bound to at least one fusion protein as described herein. A composition may comprise a plurality of beads or particles, wherein each bead or particle of the plurality of beads or particles are covalently or non-covalently bound to at least one fusion protein as described herein.
Various aspects of the embodiments relate to a kit for detecting the presence of antibodies against the fusion protein of the present invention, and/or fragment thereof in a sample, said kit comprising a fusion protein and a solid support or composition as described herein.
The compositions and kits described herewith can be either for use in an assay or in compositions that are generated during the performance of an assay. Various aspects of the invention relate to a diagnostic medical device comprising a composition as described herein.
Various aspects of the invention relate to assays for detection of anti-SARS-COV-2 antibodies.
An assay may be an assay for measuring the relative binding affinity of the fusion protein of the present invention to anti-RBD, fragment anti-RBD and/or fragment anti-RBD in a sample (e.g., relative to one or more control samples or standards). An assay may be an assay for measuring the relative binding affinity of the fusion protein of the present invention to any anti-RBD (e.g., relative to one or more control samples or standards).
Assays typically feature a solid support that either allows for measurement, such as by turbidimetry, nephelometry, UV/Vis/IR spectroscopy (e.g., absorption, transmission), fluorescence or phosphorescence spectroscopy, or surface plasmon resonance, or aids in the separation of components that directly or indirectly bind the solid support from components that do not directly or indirectly bind the solid support, or both. For example, an assay may include a composition comprising particles or beads and/or that aid in the mechanical separation of components that directly or indirectly bind the particles or beads.
Other exemplary assays that may include the fusion protein or the composition of the present invention includes but it is not limited to ELISA, lateral flow, single molecule counting (SMC), viscoelastic tests such as Sonoclot, gel technologies, fluorescence assay and other point-of-care testing using any of these techniques.
The fusion proteins of the present invention will be further illustrated by the following non-limiting examples.
The RBD proteins were produced in Expi293 cells and affinity purified from the supernatant. The affinity purification was carried out according to IMAC standard protocols that include imidazole washes and elution. After spin concentration and buffer exchange, the proteins were subjected to functional evaluation by SDS-PAGE Western-blot under reducing and non-reducing conditions.
Evaluation of pxENB14-RBD and pxENB17-RBD proteins by SDS-PAGE Western-blot revealed existence of RBD monomers, dimers and tetramers. This data was corroborated by SECMALS. Both proteins were recognized by rabbit polyclonal antibodies on a Western blot, demonstrating bioactivity. Intact mass analysis was performed using N— and O, D-, glycosylation and reducing conditions (Table 1). Both pxENB14-RBD and pxENB17-RBD showed the shame MW shift suggesting the existence of a non-identified PTM by intact mass spectrometry analysis.
The diversity of SARS-COV-2 pandemic RBD sequences remains low. However, a subset of mutations has been observed, with 10 particular mutants appearing to be under high positive selection pressure to spread across the world. According to some studies, three RBD mutants emerged in Wuhan, Shenzhen, Hong Kong and France and these mutants showed higher affinity to the ACE2 receptor when in comparison with to the prototype Wuhan-Hu-1 strain. Two mutations (F342L, R408I) showed similar affinity to ACE2 as the original Wuhan strain but four mutations were identified (N354D, D364Y, V367F, W436R) (Ou, J. et al. 2020, bioRxiv, doi: https://doi.org/10.1101/2020.03.15.991844).
In light of the emergent RBD mutations, protein modelling was performed with residue scanning and affinity maturation of a structure of SARS-COV-2 receptor-binding domain in complex with the human ACE2 receptor. These studies were performed using Schrodinger's BioLuminate Software and were focused on the RBD-ACE2 interaction (
The goal of this study was to identify novel and potential emergent mutations that could result in stronger binding to ACE2. The results from the study are summarized in Table 2. These mutations can be utilized individually or in combination and the number of mutations is not limiting for any of the designs proposed in the present invention.
In order to find high affinity RBD mutations that enhance binding to human ACE2, the present inventors used molecular dynamic simulation and affinity maturation software from Schrodinger (Bio luminate) to predict the AA mutations in the primary sequence of RBD that would confer higher affinity to hACE2. Among those mutations are V367F and G502D, which increased expression of RBD and N501F, N501T and Q498Y.
The functionality of pxENB14-RBD and pxENB17-RBD was evaluated by BLI. Briefly, biotinylated hACE2 was immobilized on the surface of a streptavidin biosensor and incubated with RBD proteins at concentrations ranging from 12.5 to 0.38 nM (
The inventors evaluated the expression of a subset of RBD truncations and fusions in Expi293. The RBD truncations and multimeric versions were produced in Expi293 cells (
In addition, multimeric RBD proteins were incubated at protein concentrations ranging from 25 to 0.38 nM and tested by binding to biotinylated hACE2 immobilized on the surface of streptavidin biosensors, similarly to what has been described in
Binding curves of immobilized hACE2 with SARS-COV-2 multimeric RBD proteins in
Functionality of the RBD mutant proteins by BLI based on pxENB14RBD (
Mutants pxENBRBD14-B1.617 (SEQ ID NO:52) shows a particular low affinity to the rhACE2 receptor, as seen by the increase observed in the affinity based on KD values from 17 nM to 76.1 nM. All RBD mutants, except pxENB-RBD14 B1.617 (SEQ ID NO:52) show a lower rate of dissociation than pxENB14RBD, suggesting that these bind to the rhACE2 stronger than the original protein.
VPRDCGCKPCICT
VPEVSSVFIFPPKPKDVLTITLTPKVT
PRGPTIKPCPPCKCPAPNLLGGPSVFIFPPKIKDVLMISLS
This application is the U.S. National Phase Application under 35 U.S.C. § 371 of International Application No. PCT/IB2021/057546, filed Aug. 17, 2021, designating the U.S. and published as WO 2022/038501 A1 on Feb. 24, 2022, which claims the benefit of Provisional Application No. 63066684, filed Aug. 17, 2020. Any and all applications for which a foreign or a domestic priority is claimed is/are identified in the Application Data Sheet filed herewith and is/are hereby incorporated by reference in their entireties under 37 C.F.R. § 1.57.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/IB2021/057546 | 8/17/2021 | WO |
Number | Date | Country | |
---|---|---|---|
63066684 | Aug 2020 | US |