Protein M fusion proteins and uses

Abstract
Fusion proteins with immunoglobulin binding properties, their uses and related methods and compositions are disclosed. The fusion proteins are comprised of an antibody-binding fragment of protein M from Mycoplasma spp. conjugated to a receptor fragment. The receptor fragment is a protein fragment to which a pathogen, a toxin or a cancer cell can specifically bind. The fusion proteins can be used to neutralize or eradicate a wide group of pathogens, toxins or cancer cells.
Description
STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

Not Applicable.


SEQUENCE LISTING ON ASCII TEXT

This patent application file contains a Sequence Listing submitted in computer readable ASCII text format (file name: DELA-02-US-Sequence-Listing.txt, date recorded: Aug. 30, 2021, size: 267,396 bytes). The Sequence Listing, which is a part of the present disclosure, includes a computer readable form and a written sequence listing comprising nucleotide and/or amino acid sequences of the present invention. The sequence listing information recorded in computer readable form is identical to the written sequence listing. The content of the Sequence Listing file is incorporated herein by reference in its entirety.


TECHNICAL FIELD

The present teachings relate to methods and compositions that utilize Protein M fusion proteins. Some of the disclosed methods and compositions relate to methods of neutralizing or eradicating various human pathogens and toxins.


INTRODUCTION

Many emerging and known pathogens continue to present a serious threat to human health and safety. In the past few decades, many infectious diseases, such as those caused by the SARS-CoV-2 virus, the human immunodeficiency virus (HIV) and others have effectively migrated from animal to human hosts and devastated entire populations and economies. Despite some successes in treatment of these pathogens, the options remain limited or not available, like in the case of the SARS-CoV-2 virus. Thus, there remains a need in the art for an efficient general method for neutralizing pathogens or clearing out pathogens from human body.


SUMMARY

The present teachings include a method for neutralizing a pathogen, wherein the pathogen has a specific binding affinity for a receptor fragment, the method comprising: providing conditions for interaction between the pathogen and a fusion protein that comprises a polypeptide having at least 90% identity over its entire length with either the sequence SEQ ID NO: 1 or the sequence SEQ ID NO: 2 conjugated to the receptor fragment, whereby the fusion protein binds to and neutralizes the pathogen.


In accordance with a further aspect, the receptor fragment is a protein fragment of a cellular receptor.


In accordance with a further aspect, the pathogen is SARS-CoV-2 virus and the receptor fragment has the sequence SEQ ID NO:15.


In accordance with a further aspect, conjugation of the polypeptide and the receptor fragment is made through a spacer.


In accordance with a further aspect, the spacer is a peptide having one of the following sequences: SEQ ID NO: 12-14.


In accordance with a further aspect, the receptor fragment comprises one of the following sequences: SEQ ID NO: 16-36.


In accordance with a further aspect, the receptor fragment has one of the following sequences: SEQ ID NO: 16-36.


In accordance with a further aspect, the fusion protein neutralizes the pathogen via recruitment of C1q protein.


The present teachings also include a method for eradicating a bloodborne pathogen in a subject, wherein the pathogen has a specific binding affinity for a receptor fragment inside the subject's body, the method comprising:


receiving a sample of blood, serum or plasma from the subject or from a donor compatible with the subject, wherein the sample comprises immunoglobulins;


adding a fusion protein that comprises a polypeptide having at least 90% identity over its entire length with either the sequence SEQ ID NO:1 or the sequence SEQ ID NO:2 conjugated to the receptor fragment to the sample, wherein the fusion protein binds to the immunoglobulins present in the sample;


administrating the sample having the fusion protein bound to the immunoglobulins into the subject's body, in an amount sufficient to eradicate the pathogen in the subject.


In accordance with a further aspect, the receptor fragment comprises one of the following sequences: SEQ ID NO: 16-36.


In accordance with a further aspect, the fusion protein bound to the immunoglobulins eradicates the pathogen via recruitment of C1q protein.


The present teachings also include a fusion protein having a specific binding affinity for an immunoglobulin molecule, comprising a polypeptide having at least 90% identity over its entire length with either the sequence SEQ ID NO:1 or the sequence SEQ ID NO:2 conjugated to a fusion partner, wherein the fusion partner has a sequence that is at least 90% identical to one of the following sequences: SEQ ID NO: 15-36.


In accordance with a further aspect, conjugation of the polypeptide and the fusion partner is made through a spacer.


In accordance with a further aspect, the spacer is a cleavable peptide having one of the following sequences: SEQ ID NO: 96-98.


The present teachings also include a method for neutralizing a toxin in a subject, wherein the toxin has a specific binding affinity for a receptor fragment, the method comprising:


receiving a sample of blood, serum or plasma from the subject or from a donor compatible with the subject, wherein the sample comprises immunoglobulins;


adding a conjugated protein that comprises a polypeptide having at least 90% identity over its entire length with either the sequence SEQ ID NO:1 or the sequence SEQ ID NO:2 conjugated to the receptor fragment to the sample, wherein the conjugated protein binds to the immunoglobulins present in the sample;


administrating the sample having the conjugated protein bound to the immunoglobulins into the subject's body, in an amount sufficient to eradicate the toxin in the subject.


In accordance with a further aspect, the receptor fragment comprises one of the following sequences: SEQ ID NO: 16-36.


The present teachings also include a method for detecting immunoglobulins that are present in a solution or on a solid support matrix, but not bound to their cognate antigen, the method comprising: contacting immunoglobulins with conjugated proteins in the solution, wherein each conjugated protein comprises a polypeptide having at least 90% identity over its entire length with either the sequence SEQ ID NO:1 or the sequence SEQ ID NO:2 conjugated to a detectable probe, whereby the conjugated proteins bind to immunoglobulins that are not bound to their cognate antigen; separating conjugated proteins that are bound to immunoglobulins from conjugated proteins that are not bound to immunoglobulins; detecting the conjugated proteins that are bound to immunoglobulins by utilizing the detectable probe, thereby detecting immunoglobulins that are not bound to their cognate antigen. Examples of solid support matrix include: blots, beads, microplate well, resin.


In accordance with a further aspect, conjugation of the polypeptide and the detectable probe is made through a spacer.


In accordance with a further aspect, the spacer is a cleavable peptide having one of the following sequences: SEQ ID NO: 96-98.


In accordance with a further aspect, the detectable probe is an enzyme that has a fluorogenic, luminescent or chromogenic substrate.


In accordance with a further aspect, the detectable probe is a protein having a sequence chosen from SEQ ID NO:67-69.


In accordance with a further aspect, the detectable probe is a fluorescent or a luminescent or a radioactive molecule.


In accordance with a further aspect, the detectable probe is an epitope tag having a sequence chosen from SEQ ID NO: 70-81.


In accordance with a further aspect, the detectable probe is a polypeptide having a sequence chosen from SEQ ID NO:82-85 and configured to bind streptavidin and/or avidin.


In accordance with a further aspect, the detectable probe is a polypeptide having a sequence chosen from SEQ ID NO:86-92 or from SEQ ID NO:93-94, and configured to attach to its cognate binding partner, either covalently or non-covalently.


In accordance with a further aspect, the detectable probe is a fluorescent protein having the sequence SEQ ID NO:95.


The present teachings also include a codon-optimized polynucleotide that encodes the fusion protein according to claim 14.


In accordance with a further aspect, the codon-optimized polynucleotide according to claim 29 has a sequence that is at least 95% identical to one of the following nucleic acid sequences: SEQ ID NO: 40-61.


In accordance with a further aspect, the codon-optimized polynucleotide according to claim 29 is inserted in a vector configured for replication and protein expression in mammalian cells.


These and other features, aspects and advantages of the present teachings will become better understood with reference to the following description, examples and appended claims.





BRIEF DESCRIPTION OF THE DRAWINGS

Those of skill in the art will understand that the drawings, described below, are for illustrative purposes only. The drawings are not intended to limit the scope of the present teachings in any way.



FIG. 1. A schematic representation of plasma antibodies of differing isotypes and differing specificities mixed with armY-ACE2 fusion protein. When armY-ACE2 is added to the plasma, armY-ACE2 binds and blocks the antibody antigen-binding region via the armY component (Protein M). The ACE2 component of the fusion protein endows the antibodies with a new binding specificity to the spike protein of SARS-CoV-2, allowing the antibodies to bind SARS-CoV-2 and mark for eradication.



FIG. 2. A schematic representation of one proposed route of administration of armY-ACE2 therapeutic, whereby plasma (containing antibodies) is obtained from the patient or a ABO-compatible donor through apheresis. A measured amount of armY-ACE2 is added to the plasma antibodies, which then acquire SARS-CoV-2 specificity as described in FIG. 1 and as demonstrated in Example 16, FIG. 14 below. A patient with active COVID-19 is treated with armY-ACE2 plasma antibodies and allowed to fully recover.



FIG. 3. Myc-tagged Protein M binds to human IgG antibody coated on wells in a dose-dependent fashion. Bound Protein M is detected via its myc-tag using a mouse IgG1 anti-myc antibody followed by an HRP-labeled goat anti-mouse IgG. Neither antibodies bind the human IgG antibody coated on the well (assay buffer, no Protein M).



FIG. 4A-B. Goat anti-human IgG (FIG. 4A) or Chicken anti-human IgG (FIG. 4B) is neutralized or blocked by Protein M in a dose-dependent fashion and prevented from binding their antigen human IgG, immobilized on the wells.



FIG. 5A-D. Protein M-HRP fusion protein detection of antibodies. Detection of goat F(ab′)2 antibody fragment (FIG. 5A), dose-dependent detection of human IgG antibody (FIG. 5B) and detection of two mouse monoclonal IgG1 antibodies: anti-myc and anti-CD28 (FIG. 5C) coated on wells. FIG. 5D. Protein M-HRP indirect detection of mouse IgG1 antibody in solution. Protein M-HRP incubated with increasing concentration of mouse IgG1 antibody resulted in the loss of Protein M binding to the human IgG antibody coated on the wells. In the absence of mouse IgG1 antibody in solution, binding of Protein M-HRP to the human IgG antibody coated on the wells is not hindered.



FIG. 6A-B. Antibodies bound to antigen are not detected by Protein M. Myc-specific mouse antibody bound to myc-tagged protein coated on wells in (FIG. 6A) or ASIP-specific rabbit antibody bound to ASIP protein coated on wells in (FIG. 6B) failed to be detected by Protein M-HRP. This is consistent with the described function of Protein M and its inability to bind antibodies already engaged in a complex with their cognate antigen. Presence of antibodies bound to their coated antigen was confirmed using detecting HRP-labeled anti-mouse IgG (A) and anti-rabbit IgG-biotin/streptavidin-HRP (B).



FIG. 7A-B). FIG. 7A. Biotinylated irrelevant (non-specific) antibody binds to SARS-CoV-2 spike protein coated on the wells only when the antibody is in a complex with armY-ACE2. ACE2 domain mediates binding between the spike protein and armY-bound antibody since antibody with Protein M lacking the ACE2 domain is unable to bind the spike protein coated on the well. As expected, antibody alone does not bind the SARS-CoV-2 spike protein, which requires a physical association with armY-ACE2 that binds the spike protein coated on the well. Moreover, armY-ACE2 alone does not produce any signals since the presence of signal requires the interaction between the biotinylated antibody and armY-ACE2 bound to the spike protein coated on the well. FIG. 7 B. Anti-histidine (his) tag was used to detect the his (6× histidine)-tagged SARS-CoV-2 spike protein in a complex with armY-ACE2. Although, not included in this experiment, SARS-CoV-2 spike protein does not bind to human IgG (previous observation), a complex between armY-ACE2 and SARS-CoV-2 spike protein was required for detection by anti-his tag antibody. Myc-specific antibody detected myc-tagged armY-ACE2 binding to human IgG coated wells regardless of whether it was bound or not bound to SARS-CoV-2 spike protein.



FIG. 8. Purified human IgG or antibodies in human serum formed a complex with armY-ACE2 in solution preventing army-ACE2 binding to antibody coated on the wells. As expected, in the absence of antibody in solution binding of armY-ACE2 to the antibody coated wells was not prevented.



FIG. 9. armY-ACE2 engaged antibody binds to K562 cells expressing FcγRII receptor (right panel, army-ACE2+antibody). No binding to K562 cells was observed in the absence of antibody (left panel, army-ACE2 alone, no antibody), therefore, the observed binding of armY-ACE2 to K562 cells is dependent upon the association between the antibody and armY-ACE2.



FIG. 10. Binding of [armY-ACE2+antibody] to K562 cells is prevented by blocking FcγRII receptor using anti-CD32 (IV.3) (left panel). Binding is not blocked by the isotype-match antibody (right panel), demonstrating that armY-ACE2 engaged antibody maintains Fc-receptor binding activity.



FIG. 11A-B. FIG. 11A. Only a complex between armY-ACE2 and mouse IgM resulted in binding to immobilized human C1q complement component. FIG. 11B. Binding of [armY-ACE2+mouse IgG1] complexes to immobilized human C1q complement component is inhibited by pre-incubation of the complex with soluble human C1q, which also suggest that army-ACE2 primes mouse IgG1 antibody to bind C1q in solution. A complex between Protein M (lacking ACE2) with mouse IgG1 did not result in binding to C1q, suggesting the requirement of the fusion partner domain, ACE2, to induce a conformation resulting in antibody binding to C1q.



FIG. 12A-B. FIG. 12A. armY-ACE2 exhibits ACE2 activity in a dose dependent fashion. FIG. 12B. [armY-ACE2+antibody] complexes exhibit ACE2 activity comparable to armY-ACE2 alone, suggesting that binding of armY-ACE2 to antibodies does not interfere with the enzymatic function of ACE2.



FIG. 13A-B. FIG. 13A. Diagram of armY-ACE2 construct showing a myc-tag at the N-terminus, followed by the human ACE2, a linker and Protein M “armY” at the C-terminus. FIG. 13B. Photograph of the SDS-PAGE gel of purified non-reduced (left lane) and reduced (right lane) of armY-ACE2 showing a ˜180 kDa protein band.



FIG. 14A-C. FIG. 14A. Non-immune serum antibodies armed with armY-ACE2 gain the ability to bind to SARS-CoV-2 spike protein. Unarmed non-immune serum (pre-vaccine) does not bind to SARS-CoV-2 spike protein (a) but gain the ability to bind after incubation with armY-ACE2 (b). Approximately one month post-vaccination with the Moderna Covid19 vaccine, serum antibodies bind to the SARS-CoV-2 spike protein coated on the well as expected and served as an assay positive control (c). The assay does not detect armY-ACE2 alone when added to the SARS-CoV-2 spike protein coated wells (d), suggesting a requirement for serum antibodies to be in a stable complex with armY-ACE2 to bind the SARS-CoV-2 spike protein for assay detection. The photo to the right, representative of the duplicate wells, shows the corresponding SARS-CoV-2 spike protein coated wells 20 minutes after the addition of the mixtures, followed by addition of detecting antibody and addition of the TMB substrate, which give rise to the appearance of the blue color product indicative of antibody presence bound to the SARS-CoV-2 spike protein. FIG. 14B. Non-immune plasma (anticoagulant: ACD-A) antibodies armed with armY-ACE2 also gain the ability bind to SARS-CoV-2 spike protein, comparable to non-immune serum antibodies armed with armY-ACE2. As expected, unarmed serum or plasma antibodies do not bind to SARS-CoV-2 spike protein. FIG. 14C. Less than 1 ug/ml of free-unengaged army-ACE2 remain detectable after a 60 minutes incubation with either serum- or plasma-antibodies at 37° C., suggesting at least 95% of armY-ACE2 added (20 ug/ml) readily engage and arm antibodies in solution.



FIG. 15A-B. FIG. 15A. Monoclonal antibody (mAb) armed with armY-ACE2 gains the ability to bind the SARS-CoV-2 spike protein, while in FIG. 15B mAb is no longer able to bind its natural target antigen.



FIG. 16. Specific detection of antibody light-chain (LC, ˜25 KDa), but not heavy-chain (HC, ˜50 KDa) by mono-biotinylated protein M on 1D gel electrophoresis by Western blot analysis. Antibody sample was loaded alone (right lane, Ab) or in a mixture with E. coli lysate (middle lane, E+Ab). E. coli lysate alone was also loaded (left lane, E) as control. The molecular weight standard values (KDa, left) are derived from the Coomassie blue stained blots.



FIG. 17. Biotinylated Protein M is immobilized in streptavidin-coated wells and serves as a surrogate antigen for a monoclonal antibody. Increasing amount of antibody is added to the wells and the level of bound antibody is measured in an ELISA-based method.



FIG. 18. Protein M fusion blocks the binding of a monoclonal antibody (mAb) to its natural antigen thereby a) shows that antigen binding is Fab dependent and b) confirms the antibody's binding specificity.





DETAILED DESCRIPTION
Abbreviations and Definitions

Unless otherwise noted, technical terms are used according to conventional usage. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety for all purposes. To facilitate understanding of the invention, a number of terms and abbreviations as used herein are defined below as follows.


The terms “polypeptide”, “protein” and “peptide” are used herein interchangeably to refer to amino acid chains in which the amino acid residues are linked by peptide bonds or modified peptide bonds. The amino acid chains can be of any length of greater than two amino acids. Unless otherwise specified, the terms “polypeptide”, “protein” and “peptide” also encompass various modified forms thereof. Such modified forms may be naturally occurring modified forms or chemically modified forms. Examples of modified forms include, but are not limited to, glycosylated forms, phosphorylated forms, myristoylated forms, palmitoylated forms, ribosylated forms, acetylated forms, and the like. Modifications also include intra-molecular crosslinking and covalent attachment of various moieties such as lipids, flavin, biotin, polyethylene glycol or derivatives thereof, and the like. In addition, modifications may also include cyclization, branching and cross-linking. Further, amino acids other than the conventional twenty amino acids encoded by genes may also be included in a polypeptide. The term “polypeptide” or “protein” may also encompass a “purified” polypeptide that is substantially separated from other polypeptides in a cell or organism in which the polypeptide naturally occurs (e.g., 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, 100% free of contaminants).


Conservative changes: As used herein, when referring to mutations in a nucleic acid molecule, “conservative changes” are those in which at least one codon in the protein-coding region of the nucleic acid has been changed such that at least one amino acid of the polypeptide encoded by the nucleic acid sequence is substituted with another amino acid having similar characteristics. Examples of conservative amino acid substitutions are ser for ala, thr, or cys; lys for arg; gin for asn, his, or lys; his for asn; glu for asp or lys; asn for his or gin; asp for glu; pro for gly; leu for ile, phe, met, or val; val for ile or leu; ile for leu, met, or val; arg for lys; met for phe; tyr for phe or trp; thr for ser; trp for tyr; and phe for tyr.


Isolated polypeptide: The term “isolated polypeptide” as used herein means a polypeptide molecule is present in a form other than found in nature in its original environment with respect to its association with other molecules. The term “isolated polypeptide” encompasses a “purified polypeptide”, which is used herein to mean that a specified polypeptide is in a substantially homogenous preparation, substantially free of other cellular components, other polypeptides, viral materials, or culture medium, or when the polypeptide is chemically synthesized, substantially free of chemical precursors or byproducts associated with the chemical synthesis. For a purified polypeptide, preferably the specified polypeptide molecule constitutes at least 15 percent of the total polypeptide in the preparation. A “purified polypeptide” can be obtained from natural or recombinant host cells by standard purification techniques, or by chemical synthesis.


An “isolated” biological component (such as a nucleic acid molecule, protein, or virus) has been substantially separated or purified away from other biological components (e.g., other chromosomal and extra-chromosomal DNA and RNA, proteins and/or organelles). Nucleic acids, proteins, and/or viruses that have been “isolated” include nucleic acids, proteins, and viruses purified by standard purification methods. The term also embraces nucleic acids, proteins, and viruses prepared by recombinant expression in a host cell, as well as chemically synthesized nucleic acids or proteins. The term “isolated” (or purified) does not require absolute purity; rather, it is intended as a relative term. Thus, for example, an isolated or purified nucleic acid, protein, virus, or other active compound is one that is isolated in whole or in part from associated nucleic acids, proteins, and other contaminants.


As used herein, the term “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. The term “vector” comprises an “expression vector”, e.g. a vector that is capable of directing the expression of genes to which they are operatively linked. The vector often includes sequences that effect the expression of a desirable molecule, e.g., a promoter, a coding region and a transcriptional termination sequence. An expression vector can be an integrative vector (i.e., a vector that can integrate into the host genome), or a vector that does not integrate but self-replicates, in which case, the vector includes an origin of replication which permits the entire vector to be reproduced once it is within the host cell. Preferred vectors are those capable of autonomous replication and/expression of nucleic acids to which they are linked.


Nucleic acid molecules encoding fusion proteins are also within the scope of the invention. Such nucleic acids can be made by preparing a construct (e g., an expression vector) that expresses a fusion protein when introduced into a suitable host. For example, such a construct can be made by ligating a first polynucleotide encoding a single-domain antibody, or fragment or variant thereof, fused in frame with a second polynucleotide encoding another protein such that expression of the construct in a suitable expression system yields a fusion protein. Polynucleotides that encode fusion proteins can be present in isolation, or can be inserted in a vector for expression in cells. Such vector may be suitable for replication and protein expression in bacterial, mammalian or insect cells. Polynucleotides that encode fusion proteins can be codon-optimized for expression in particular type of cells by standard methods known in the art.


A “codon-optimized” nucleic acid or polynucleotide refers to a nucleic acid sequence that has been altered such that the codons are optimal for expression in a particular system (such as a particular species or group of species). For example, a nucleic acid sequence can be optimized for expression in mammalian cells or in a particular mammalian species (such as human cells). Codon optimization does not alter the amino acid sequence of the encoded protein.


The term “neutralizing a pathogen” used herein is synonymous to “inactivating a pathogen” and means that the pathogen will no longer be able to interact with a specific receptor molecule either in vitro or in vivo, or will no longer be able to infect cells of an organism.


The term “neutralizing a toxin” used herein is synonymous to “inactivating a toxin” and means that the toxin will no longer be able to interact with its target, either in vitro, or in a subject's body.


The term “eradicating a pathogen” used herein refers to neutralizing the pathogen in a subject.


As used herein, the term “Protein M” or “armY” refers to antibody-binding fragment of protein from Mycoplasma genitalium that has an amino acid sequence SEQ ID NO:1 (Grover R K, et al., Science, 2014), or to antibody-binding fragment of protein from Mycoplasma pneumoniae that has an amino acid sequence SEQ ID NO:2 (Blötz C, et al., Front Microbiol. 2020), or to a polypeptide with immunoglobulin-binding activity having a sequence with at least 90% identity over its entire length to one of the following sequences: SEQ ID NO: 3-8. In some embodiments, the term “Protein M” or “armY” also includes an immunoglobulin-binding fragment of Protein M from Mycoplasma genitalium or Mycoplasma pneumoniae.


As used herein, the term “ACE2” refers to the human cellular angiotensin-converting enzyme 2 receptor.


As used herein, the term “fusion protein” refers to an artificial, non-natural polypeptide that consists of at least two unrelated covalently linked polypeptides. The linkage between these polypeptides can be of different nature, including a peptide bond, a short flexible amino acid spacer, or a spacer of another type. The spacer joins the polypeptides together, yet preserves some distance between the polypeptides such that both polypeptides can properly fold independently.


The term “immunoglobulin,” “Ig” or “antibody” (used interchangeably herein) refers to a glycoprotein formed in response to administration of bacteria, viruses or other antigens to a mammalian organism, said glycoprotein has the ability to specifically bind cognate antigen and consists of two heavy (H) chains and two light (L) chains connected and stabilized by interchain disulfide bonds. Immunoglobulins or antibodies may be monoclonal or polyclonal and may exist in monomeric or polymeric form, for example. IgM antibodies which exist in pentameric form and/or IgA antibodies which exist in monomeric, dimeric or multimeric form. The term “fragment” refers to a part or portion of an antibody or antibody chain comprising fewer amino acid residues than an intact or complete antibody or antibody chain. Fragments can be obtained via chemical or enzymatic treatment of an intact or complete antibody or antibody chain. Fragments can also be obtained by recombinant means. Exemplary fragments include Fab, Fab′, F(ab′)2, Fc and/or Fv fragments.


The term “antigen-binding fragment” refers to a polypeptide portion of an immunoglobulin or antibody that binds an antigen or competes with intact antibody (i.e. with the intact antibody from which they were derived) for antigen binding (i.e. specific binding). Binding fragments can be produced by recombinant DNA techniques, or by enzymatic or chemical cleavage of intact immunoglobulins. Binding fragments include Fab, Fab′, F(ab′)2, Fv, single chains, and single-chain antibodies.


As used herein, the term “toxin” refers to an endogenous entity or exogenous substance that is harmful to a subject (preferably, human subject). Examples of harmful endogenous entities are excessive inflammatory cytokines that may be produced during a cytokine storm in the subject. A harmful endogenous entity can be soluble or membrane bound. Examples of harmful exogenous substances are Botulinum neurotoxin A, Botulinum neurotoxin B, Staphylococcal enterotoxin A and B, Staphylococcal enterotoxin A, Staphylococcal enterotoxin B, Clostridium perfringens Epsilon toxin (ETX), Ricin, Anthrax.


As used herein, the term “donor compatible with the subject” refers to a human subject having compatibility for a blood transfusion (compatibility based on ABO blood groups, Rh Type).


As used herein, the term “receptor fragment” refers to a fragment of a protein to which a pathogen (usually, a protein from the pathogen's coat) or a toxin has a specific binding affinity, or can specifically bind. Preferably, receptor fragment is a protein fragment of a cellular receptor that the pathogen or toxin binds to and utilizes to enter the cell. Preferably, receptor fragment is located inside a subject's body.


Unless otherwise defined, technical and scientific terms used in the present teachings described herein shall have the meanings that are commonly understood by those of ordinary skill in the art. Further, unless otherwise required by context, plural terms shall include the singular and singular terms shall include pluralities. Generally, nomenclatures utilized in connection with molecular biology, cell and tissue culture, protein and oligo- or polynucleotide chemistry described herein are well-known and commonly used in the art. Standard techniques are used, for example, for recombinant nucleic acid and protein preparation, purification and analysis, for oligonucleotide synthesis. Purification techniques and enzymatic reactions are performed according to manufacturer's specifications or as described herein or as commonly accomplished in the art. The techniques and procedures described herein are generally performed according to conventional methods well known in the art and as described in various general references that are cited and discussed throughout the instant specification. See, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual (Third ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. 2000). The nomenclatures utilized in connection with, and the laboratory procedures and techniques described herein are those well-known and commonly used in the art.


The present invention is directed to methods and compositions for inactivating or eliminating a pathogen, preferably a bloodborne pathogen having a specific binding affinity for a receptor fragment, by utilizing a fusion protein that comprises Protein M and the receptor fragment. Preferably, Protein M is chosen from an extracellular domain of Mycoplasma genitalium protein (Grover R K, et al., Science, 2014; SEQ ID NO: 1) or an extracellular domain of Mycoplasma pneumoniae protein (Blötz C, et al., Front Microbiol. 2020; SEQ ID NO: 2) that strongly bind to immunoglobulin molecules (antibodies). Typical binding affinities (Kd) of Protein M to immunoglobulin molecules are from 1.2 to 5.2 nM (Grover R K, et al., Science, 2014).


Orthologs of Protein M can be found in several related species of Mycoplasma: M. penetrans, Mycoplasma tullyi, Mycoplasma iowae, Mycoplasma imitans, Mycoplasma alvi and M. gallisepticum (disclosed herein in the Sequence listing). These sequences are also disclosed herein and can be used to create fusions or fusion proteins according to the present invention. Protein M is functionally similar to other bacterial-derived proteins that bind antibodies (e.g., protein A, protein G and protein L) with the exception that Protein M blocks the antibody's binding site and prevent it from binding its cognate antigen. Therefore, harnessing the antibody binding property of Protein M, it can be used to couple any attached compounds (e.g., genetic fusion or chemical conjugation) to an antibody regardless of the antibody's specificity. Consequently, interaction with the Protein M fusion protein will result in the loss of the antibody's specificity and acquire the specificity as that of the attached compound. The properties of Protein M fusion protein with the compound will be a combination of the antibody's stability, antibody's functional properties (such as ability to engage Fc receptors on immune cells, activate the complement system, an increased binding avidity and the compound properties (affinity to a pathogen).


Preferred nucleic acid molecules for use in the invention are polynucleotides that encode fusion proteins shown herein in the appended Sequence Listing. Nucleic acid molecules utilized in the present invention may be in the form of RNA or in the form of DNA (e.g., cDNA, genomic DNA, and synthetic DNA). The nucleic acid molecule may be double-stranded or single-stranded, and if single-stranded may be the coding (sense) strand or non-coding (anti-sense) strand. The coding sequence which encodes a fusion Protein May be identical to one of the nucleotide sequences provided in the appendices, or it may also be a different coding sequence which, as a result of the redundancy or degeneracy of the genetic code, encodes the provided fusion protein.


In some embodiments, variant fusion proteins displaying substantial differences in structure can be generated by making nucleotide substitutions that cause less than conservative changes in the encoded polypeptide. Examples of such nucleotide substitutions are those that cause changes in (a) the structure of the polypeptide backbone; (b) the charge or hydrophobicity of the polypeptide; or (c) the bulk of an amino acid side chain. Nucleotide substitutions generally expected to produce the greatest changes in protein properties are those that cause non-conservative changes in codons. Examples of codon changes that are likely to cause major changes in protein structure are those that cause substitution of (a) a hydrophilic residue, e.g., serine or threonine, for (or by) a hydrophobic residue, e.g., leucine, isoleucine, phenylalanine, valine or alanine; (b) a cysteine or proline for (or by) any other residue; (c) a residue having an electropositive side chain, e.g., lysine, arginine, or histidine, for (or by) an electronegative residue, e.g., glutamic acid or aspartic acid; or (d) a residue having a bulky side chain, e.g., phenylalanine, for (or by) one not having a side chain, e g., glycine.


Sequence Identity: As used herein, the term “sequence identity” means the percentage of identical subunits at corresponding positions in two sequences when the two sequences are aligned to maximize subunit matching, i.e., taking into account gaps and insertions. Sequence identity is present when a subunit position in both of the two sequences is occupied by the same nucleotide or amino acid, e.g., if a given position is occupied by an adenine in each of two DNA molecules, then the molecules are identical at that position. For example, if 7 positions in a sequence of 10 nucleotides in length are identical to the corresponding positions in a second 10-nucleotide sequence, then the two sequences have 70% sequence identity. Sequence identity of a polynucleotide is typically measured using sequence analysis software (e.g., the Sequence Analysis Software Package of the Genetics Computer Group, University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, Wis. 53705).


In preferred embodiments, variant fusion proteins displaying only non-substantial or negligible differences in structure can be generated by making nucleotide substitutions that cause only conservative amino acid changes in the encoded polypeptide. By doing this, fusion protein variants that comprise a sequence having at least 90% (90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, and 99%) sequence identity with the fusion protein sequences provided in the attached appendices, and retain at least one functional activity, e g., immunoglobulin binding activity. The invention also covers non-naturally occurring polynucleotides or variants that encode the fusion protein variants having at least 90% sequence identity over the entire length with the fusion protein sequences provided in the attached appendices, and retain at least one functional activity, e g., immunoglobulin binding activity. Methods of making targeted amino acid substitutions, deletions, truncations, and insertions are generally known in the art. For example, amino acid sequence variants can be prepared by mutations in the DNA. Methods for polynucleotide alterations are well known in the art, for example, Kunkel et al. (1987) Methods in Enzymol. 154:367-382; U.S. Pat. No. 4,873,192 and the references cited therein.


Therapeutically Effective Amount: As used herein, the term “therapeutically effective amount” refers to those amounts that, when administered to a particular subject in view of the nature and severity of that subject's disease or condition, will have a desired therapeutic effect, e.g., an amount which will cure, prevent, inhibit, or at least partially arrest or partially prevent a target disease or condition. By other words, this is an amount of an agent or composition that alone, or together with a pharmaceutically acceptable carrier or one or more additional agents, induces the desired response. Effective amounts of a therapeutic agent can be determined in many different ways, such as assaying for a reduction in symptoms or improvement of physiological condition of a subject. Effective amounts also can be determined through various in vitro, in vivo, or in situ assays.


In some embodiments, variants of fusion proteins having a reduced immunogenicity in humans may be generated by making amino acid substitutions in the fusion proteins that remove or modify human T-cell or B-cell epitopes present in said fusion protein. Fusion proteins that have less potential human T-cell or B-cell epitopes in the sequence are less prone to activate an unwanted immune response in a subject. The unwanted immune response includes development of anti-fusion protein antibodies that may neutralize said fusion protein. Several methods for identifying, modifying and removing potential human T-cell or B-cell epitopes in protein sequences are known and disclosed in, for example, Jawa V, Terry F, Gokemeijer J, et al. T-Cell Dependent Immunogenicity of Protein Therapeutics Pre-clinical Assessment and Mitigation-Updated Consensus and Review 2020. Front Immunol. 2020; 11:1301; Mazor R, Crown D, Addissie S, Jang Y, Kaplan G, Pastan I. Elimination of murine and human T-cell epitopes in recombinant immunotoxin eliminates neutralizing and anti-drug antibodies in vivo. Cell Mol Immunol. 2017; 14(5):432-442; U.S. Ser. No. 10/751,397 B2, US2018161419A1, the contents of which are incorporated herein by reference in its entirety.


Disclosed herein are methods for making and using fusion proteins that comprises amino acid sequences of Protein M or amino acid sequences that are at least 90% identical over the entire length with the sequences of Protein M. An example of such fusion protein is armY-ACE2, which consists of the Protein M sequence fused to the sequence of the ACE2 receptor, or to a fragment of the ACE2 receptor to which the envelope spike S protein of the SARS-CoV-2 virus is bound. Fusion protein armY-ACE2 can bind to immunoglobulin molecules of different classes, blocking their original specificity and instead directing them to interact with the envelope spike S protein of the SARS-CoV-2 virus (FIG. 1). As a result, the SARS-CoV-2 virus will be no longer capable of infecting human cells via its envelope spike S protein, and will be eliminated by macrophages that recognize immunoglobulin-bound targets, and by engaging via a complement factor activated by the bound immunoglobulins, or by other mechanisms. By utilizing knowledge of specific cellular receptors recognized by pathogens (virus or microorganism) and toxins for cellular entry, various armY-fusion proteins may be created and utilized according to the present invention. To make armY-fusion proteins, various fragments of the receptor may be used, including, without restriction, a full extracellular domain of the receptor or a fragment of the receptor which is necessary and sufficient for interaction with the pathogen or toxin.


Non-limiting examples of pathogens and toxins and their cellular attachment receptors suitable to make armY-fusion proteins are listed as follows: (a) armY-ACE2 (Angiotensin-converting enzyme 2) for the SARS-CoV and SARS-CoV-2, as well as human coronavirus NL63/HCoV-NL6; (b) armY-CD209 (DC-SIGN) for HIV-1, HIV-2, Ebolavirus, Cytomegalovirus, HCV, Dengue virus, Measles virus, Herpes simplex virus 1, Influenza virus, SARS-CoV, Japanese encephalitis virus, Lassa virus, Respiratory syncytial virus, Rift valley fever virus, West-nile virus, Marburg virus, Uukuniemi virus, and Yersinia Pestis; (c) armY-C-type lectin domain family 4 member M for Ebolavirus, Hepatitis C virus, HIV-1, Human coronavirus 229E, Human cytomegalovirus/HHV-5, Influenza virus, SARS-CoV, West-nile virus, Japanese encephalitis virus, Marburg virus glycoprotein, and M. bovis; (d) armY-CD4 for HIV; (e) armY-Synaptic vesicle glycoprotein 2A for the C. botulinum neurotoxin type A2 (BoNT/A, botA); (f) armY-Synaptic vesicle glycoprotein 2B for the C. botulinum neurotoxin type A2 (BoNT/A, botA). Probably also for the closely related C. botulinum neurotoxin type A1; (g) armY-Synaptic vesicle glycoprotein 2C for C. botulinum neurotoxin type A (BoNT/A, botA) and C. botulinum neurotoxin type A2; (h) armY-Synaptotagmin I for C. botulinum neurotoxin type B (BoNT/B, botB); (i) armY-Synaptotagmin II for C. botulinum neurotoxin type B (BoNT/B, botB); (j) armY-HLA class II histocompatibility antigen, DRB1 beta chain for Epstein-Barr virus and Staphylococcal enterotoxin A and B; (k) armY-HLA class II histocompatibility antigen, DR alpha chain for Epstein-Barr virus BZLF2/gp42, Staphylococcus aureus enterotoxin A/entA, enterotoxin B/entB, enterotoxin C1/entC1, enterotoxin D/entD, and enterotoxin H/entH; (1) armY-T cell receptor beta variable 7-9 for Staphylococcus aureus enterotoxin A/entA; (m) armY-T cell receptor beta variable 19 for Staphylococcus aureus enterotoxin B/entB; (n) armY-Hepatitis A virus cellular receptor 1 for Hepatitis A virus, Ebola virus, Marburg virus and Dengue virus and Clostridium perfringens Epsilon toxin (ETX); (o) armY-Myelin and lymphocyte protein for Clostridium perfringens Epsilon toxin (ETX); (p) armY-Complement factor H for Streptococcus pneumoniae, Neisseria meningitides, Staphylococcus aureus, Borrelia burgdorferi and West nile virus; (q) armY-Hepatocyte growth factor receptor for Listeria monocytogenes internalin InlB; (r) armY-Membrane cofactor protein (CD46) for Adenovirus subgroup B2 and Ad3, Measles virus, Herpesvirus 6/HHV-6, Neisseria and Streptococcus pyogenes; (s) armY-Glycophorin-A for Plasmodium falciparum, Influenza virus, Hepatitis A virus (HAV), Streptococcus gordonii; (t) armY-C-type lectin domain family 4 member K (Langerin, CD207) for Candida species, Saccharomyces species, Malassezia furfur, human immunodeficiency virus-1 (HIV-1) and Yesinia pestis; (u) armY-Anthrax toxin receptor 1 for Anthrax toxin; and (v) armY-Anthrax toxin receptor 2 for Anthrax toxin.


In some embodiments, codon-optimized polynucleotides are disclosed that contain a nucleic acid sequence at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical to SEQ ID NO: 40-61. These polynucleotides are codon-optimized for expression in human cells.


Taking as an example armY-ACE2 fusion protein and SARS-CoV-2 as a pathogen, several advantages of the armY-ACE2 approach can be shown over the other known potential virus inactivating strategies, such as (a) monoclonal antibody (mAb) therapy; (b) ACE2 or ACE2-Fc fusion proteins therapy; (c) Convalescent plasma antibody therapy, and (d) anti-viral vaccine.


As to (a), mAb therapy is subject to viral escape due to a mutation in a targeted viral epitope. Most viruses possess a high mutation rate; after a mutation in the mAb-recognizing area the mAb therapy is no longer effective, and mutated viruses will proliferate and eventually will be enriched. Instead, armY-ACE2 will bind the SARS-CoV-2 virus regardless of any mutation, because all SARS-CoV-2 viruses bind ACE2 for entry into human cells. Also, since armY can bind all antibody isotypes, armY-ACE2 can arm all antibody isotypes with the capacity to target SARS-CoV-2 viruses, hence mimicking a generalized antibody-mediated immune response.


As to (b), ACE2 monotherapy suffers from rapid renal clearance due to the small size of ACE2. ACE2-Fc fusion proteins is of a single isotype, usually, an IgG. It is known that other isotypes e.g., IgM, IgA are also efficacious in pathogen clearance. Thus, armY-ACE2 can arm all isotypes or a specific isotype with the capacity to target SARS-CoV-2. In addition, Fc fusion proteins do not activate the complement system. Instead, armY-ACE2 complex with antibody maintains Fc functionality, and is able to prime the antibody to bind the C1q complement factor, a required step for complement activation. Being able to harness the full effector potential of antibodies may be critical in the overall eradication of the targeted pathogen, such as SARS-CoV-2.


As to (c), convalescent plasma therapy requires blood from donors previously exposed to SARS-CoV-2, and no longer with COVID-19 symptoms. It might take as long as 7-10 days to test for lack of blood-borne pathogens, anti-SARS-CoV-2 titer levels and ABO blood type matching requirements. Instead, armY-ACE2 could arm the patient's own plasma antibodies, and can be available to the patient in less than 2-4 hours. Donor plasma can also be used, but these can be from regular donors that have already been screened, so this could be made available to the patient even faster as long as ABO blood type and Rh type match is achieved.


As to (d), SARS-CoV-2 vaccine is prophylactic in its use and the uninfected person will require time to develop a level of protective immunity. Vaccines cannot be a therapeutic for those with on-going COVID-19. Moreover, vaccine efficacy is subject to many variables including state of health of the individual and potential side-effects, e.g., anaphylactic reaction that might hinder completion of immunization protocol. armY-ACE2 is applicable to subjects with on-going COVID-19.


The abovementioned advantages apply to other fusion proteins that are disclosed herein.


Treatment with Protein M fusion proteins changes the specificity of antibodies in plasma to a new target (e.g, a virus, bacterium or a toxin) for immune recognition and elimination; provide more optimal pharmacokinetics and activity of a compound attached to a larger more stable antibody, and improvement of bioavailability of compounds; deliver therapeutic or diagnostic compounds to an antibody-binding target (e.g., antibody binding bacteria, tissue or cell); disrupt interaction between two or more entities required for pathogenicity.


Possible routes of administration for Protein M fusion proteins include parenteral, oral and/or inhalation. In a preferred embodiment, ex-vivo plasma/serum (patient-derived or from a compatible donor) is mixed with Protein M-fusion protein and administered to patient. Preferably, Protein M fusion proteins are administered in the form of a pharmaceutical composition, comprising additional pharmaceutically acceptable excipients.


In some embodiments, Protein M fusion proteins are stored or administered in a suitable formulation that provides stability to the fusion proteins. Such formulation includes one or several pharmaceutically acceptable excipients. By “pharmaceutically acceptable” it is meant the excipient is compatible with the other ingredients of the formulation and not deleterious to the recipient thereof. Excipients for protein formulations may be picked up by methods known in the art, and may include buffers, stabilizers, antioxidants, salts, polysorbates, amino acids, among others.


Other potential uses of Protein M fusion proteins include detecting presence of antibodies and/or antibody-binding factors found in blood, tissue or cells. For example, fusing a reporter enzyme (e.g., Horseradish peroxidase, HRP) or attaching a detectable probe or label (e.g., biotin-avidin, biotin-streptavidin) to Protein M can be used to detect antibodies that are present but not bound to their cognate antigen as observed in immunoassays that exhibit “false-positive” activity and thus serve as a false-positive detection tool.


In some embodiments, Protein M can be conjugated with the following detectable probes: HRP (chromogenic), alkaline phosphatase (chromogenic), biotin (for example, via Avi-Tag peptide), myc epitope antigen, Luciferase (bioluminescence), avidin (attachment of biotin conjugates), streptavidin (attachment of biotin conjugates), streptavidin-binding peptide, phycoerythrin (fluorescence), GFP (fluorescence), a radioactive label. Protein M-radiolabel peptide can be produced by fusing Protein M to the short peptide KGRPLVY (SEQ ID NO:62). As disclosed in Mebrahtu et al. 2013, the KGRPLVY peptide contains a metal chelate attachment [K-lysine for labeling Protein M with Cu-64 and DOTA] and radio-halogen attachment (Y-tyrosine for labeling Protein M with I-125, I-123 or I-131).


In some embodiments, Protein M-detectable probe fusions can be used in ELISA, western blotting, lateral flow assays, multiplex bead array assays, pull down assays, SPR (biacore, octet) assays, flow cytometry assays, for purification or for delivery of a cargo.


Protein M fusion proteins can also be used to: 1) neutralize antibodies by occupying their antigen binding site (Useful in decreasing non-specific signals in immunoassays, useful in in-vitro cell assays as well as in in-vivo settings to determine the role of antibodies or a specific antibody by essentially blocking its binding activity); 2) eliminate antibodies by increasing clearance from circulation or tissue by directing antibodies to immune cells or delivering degrading enzymes or compounds to antibodies; 3) deplete antibodies in solution by promoting clearance of unengaged antibodies, which are not bound to antigen. Protein M can be attached to a resin (e.g., agarose beads), added to a solution to pull down/remove or harvest such antibodies for use in process, for analysis or for elimination.


Protein M fusion proteins can also be used to protect antibodies from degradation by enzymes, microbes and cellular mechanisms; protect antibodies from bacterial escape mechanisms (e.g., protein A of S. aureus binds to antibodies and avoid antibody detection and clearance); deliver cargo to an antibody.


In some embodiments, the receptor fragment is a protein fragment of a cellular receptor, which is a target used by a pathogen for cell entry. In some embodiments, the pathogen is a virus, a bacterium or a fungus that can cause illnesses. In one embodiment, an antigen is a cell surface molecule of a pathogen, or antigenic parts or fragments thereof.


A fusion protein can be made by creating a nucleic acid molecule encoding the fusion protein and expressing the fusion protein from such nucleic acid in a recombinant expression system. The nucleic acid molecule encoding the fusion can be generated by linking a nucleic acid sequence encoding Protein M in frame with a nucleic acid sequence encoding a receptor fragment of a pathogen or a ligand of a toxin. Methods for constructing a fusion protein are known in the art (see Sambrook J. et al., Molecular Cloning, Cold Spring Harbor Press, New York (2001)).


In some embodiments, Protein M is fused to the N-terminus of the receptor fragment of a pathogen or the ligand of a toxin. In this orientation, an N-terminal tag can be attached for detection and purification of the fusion protein. In addition, the leader sequence (secretory signal peptide) can be attached for facilitating the secretion of the fusion protein. Alternatively, other appropriate leader sequences, suitable for guiding the fusion protein to the ER and the secretory pathway in the host cell, can be used. In other embodiments, Protein M is fused to the C-terminus of the receptor fragment of a pathogen or the ligand of a toxin.


In still another embodiment, a spacer can be incorporated between the Protein M sequence and the receptor fragment of a pathogen or the ligand of a toxin. In preferred embodiments, spacer is a short peptide sequence that joins both polypeptides, yet preserves some distance between the polypeptides such that both polypeptides can properly fold independently. Generally, the spacer consists of between 2 or 3 amino acids to 50 amino acids, typically between 3 to 25, or 3 to 20, or 3 to 15 amino acids. In a specific embodiment, the space consists of 3-10 amino acids. Although there is no specific restriction on the selection of amino acids for the spacer region, the amino acids can be selected to accommodate the folding, net charge, hydrophobicity or other properties of the fusion protein. Typical amino acids for use in a spacer region include Gly, Ala, Ser, Thr and Asp.


One of skill would recognize that modifications can be made to a fusion protein without diminishing their biological activities. Some modifications may be made to facilitate the cloning, expression, or incorporation of the constituent molecules into a fusion protein. For example, amino acids can be placed on either terminus to create conveniently located restriction sites or termination codons; and a methionine can be added at the amino terminus to provide an initiation site.


Recombinant Expression of the Fusion Proteins.


For recombinant expression of a fusion protein, a nucleic acid molecule encoding the fusion protein is generally placed in an expression vector in an operable linkage to a promoter (such as the T7, trp, or lambda promoters for expression in bacteria, or a CMV promoter for expression in mammalian cells) and a 3′ transcription termination sequence, and optionally additional suitable transcriptional and/or translational regulatory elements such as a transcription enhancer sequence and a sequence encoding suitable mRNA ribosomal binding sites. Additional sequences that can be included in the expression vector include an origin of replication, and a selection marker gene to facilitate identification of transformants such as genes conferring resistance to antibiotics (e.g., the amp, kana, gpt, neo, and hyg genes).


Host cells suitable for use in the recombinant expression of the fusion protein include bacterial cells such as E. coli, and eukaryotic cells including but not limited to yeast, insect cells (e.g. SF9 cells), and mammalian cells such COS, CHO, HeLa cells and HEK293.


The expression vectors can be introduced into a host cell by well-known methods such as calcium chloride transformation for bacterial cells, and calcium phosphate treatment or electroporation for mammalian cells. Cells transformed by the expression vectors can be selected based on the phenotype provided by the selectable marker gene.


Once expressed, the recombinant fusion proteins can be purified according to standard methods available in the art, such as ammonium sulfate precipitation, affinity columns, chromatography, gel electrophoresis, among others. In one embodiment, the fusion protein is purified based on affinity chromatography using antibodies that bound to Protein M. In another embodiment, a purification tag is inserted at the N-terminus or the C-terminus of the fusion protein and is used for purification. The examples of such tags are: 6 His-tag, myc-tag, strep-tag and others.


In some embodiments, uses for Protein M fusion proteins include the following.


The present teachings include a pharmaceutical composition comprising: a Protein M fusion protein having an antibody-binding domain and an ACE2 cellular receptor (referred to as armY-ACE2), serum or plasma from the subject or from a compatible donor, and a pharmaceutically acceptable carrier or a pharmaceutically acceptable excipient, wherein the fusion protein acts to eradicate SARS-CoV and SARS-COV-2 coronaviruses in patients infected with the virus, wherein the fusion protein arms immunoglobulins to recognize and bind with high affinity to the S1 spike protein expressed by the SARS-CoV and SARS-COV-2 coronaviruses. In some embodiments, the Protein M fusion protein optionally comprises a linker, the antibody-binding domain comprises the Protein M protein from Mycoplasma sp., the antibody-binding domain comprises Protein M that binds with high affinity to the antibody Fab domain and blocks the antibody's antigen binding site; the antibody-binding domain comprises Protein M that does not bind to the antibody whose Fab domain binding site is engaged with its cognate antigen; the antigen-binding domain comprises a cellular receptor, ACE2, that binds with high affinity to the S1 spike protein expressed by the SARS-CoV and SARS-COV-2 coronaviruses; the antibody-binding domain comprises Protein M that binds with high affinity to the antibody Fab domain and blocks the antibody's cognate antigen binding site. In some preferred embodiments, immunoglobulins bound with the disclosed fusion proteins retain at least partially Fc-linked functional activities (effector functions), such as Fc-receptor binding and complement activation.


In some embodiments, Protein M fusion proteins comprise a linker between Protein M and receptor fragment. Non-limiting examples of such linkers include











(SEQ ID NO: 12)



GGGGSGGGGSGGGGS,






(SEQ ID NO: 13)



GGGGSGGGGS



or






(SEQ ID NO: 14)



GGGGS.






The present teachings also include a pharmaceutical composition comprising: a Protein M fusion protein having an antibody-binding domain and a fusion domain comprising a protein, peptide or chemical group able to bind a pathogen, a toxin, any biologic entity or a chemical group, serum or plasma from the subject or from a compatible donor, and a pharmaceutically acceptable carrier or a pharmaceutically acceptable excipient. In some embodiments, the antigen to be bound by the fusion protein comprises an antigen arising from a pathogen, a toxin, a subject, arising from a disease state within the subject, or arising from a disease related organism within the subject and the disease state within the subject is caused by a virus, bacteria, tumor, abnormal cell or by exposure to an external disease-causing agent, wherein the antigen-binding domain comprises one or more protein or peptide or chemical group (collectively referred to as molecules) chosen from the group consisting of: a soluble molecule, a soluble molecule bound to a matrix, an insoluble molecule bound to a matrix, an insoluble aggregate of molecules, a molecule comprising one or more epitopes, a nonviable cell-associated molecule, a nonviable organism-associated molecule, or a molecule conjugated with a liposome.


The present teachings also include a Protein M fusion protein having an antibody-binding domain and a fusion partner domain comprising a protein, peptide or chemical group, wherein the antibody-binding domain comprises Protein M that does not bind to the antibody whose Fab domain binding site is engaged with its cognate antigen. In some embodiments, the fusion partner domain may be an endogenous protein or peptide; the fusion partner domain may be an exogenous protein or peptide; the fusion partner domain may be an enzyme, wherein the enzyme is a reporter enzyme horseradish peroxidase fusion protein (HRP). Protein M-HRP may be used to detect immunoglobulins in solution or in a matrix, wherein the immunoglobulins detected are not engaged with their cognate antigen. Thus, Protein M-HRP may be used to identify or rule out false positive test results in antibody-based detection of antigen. The fusion partner domain may permit for a chemical modification, wherein the chemical modification is, for example, an addition of biotin by an enzymatic conjugation of a single biotin on a unique 15 amino acid peptide tag using the biotin ligase (BirA).


The present teachings also include a Protein M fusion protein having an antibody-binding domain and a fusion partner domain comprising a protein, peptide or chemical group, wherein the antibody-binding domain comprises Protein M that binds with high affinity to the antibody Fab domain and blocks the antibody's antigen binding site, wherein the fusion partner domain may be a cytokine, chemokine, hormone, growth factor, receptor, ligand, neurotransmitters or a synthesized molecule. In some embodiments, the fusion partner domain is made to increase or decrease the bioavailability of bound antibodies, or the fusion partner domain immunogenicity is increased or decreased when bound to antibodies; or the fusion partner domain is made to increase or decrease the immunogenicity of bound antibodies.


In some embodiments, Protein M fusion proteins arm free non-antigen bound immunoglobulin to bind a pathogen or toxin (both referred heretofore as “target”) with a high affinity. This is made possible through (a) Protein M component of the fusion protein that engages the immunoglobulin rendering it no longer able to bind its cognate antigen, and (b) the fused receptor or ligand, which is the same attachment receptor or ligand found on cells that the target uses to attach and gain entry. Protein M fusion protein-armed immunoglobulins (referred heretofore as “armY-fusion”) binding to their target is the initial step in the mechanism of target eradication. Once bound to target, armY-fusion will block the interaction between the target and the attachment receptor found on host cells, thereby, neutralizing the target and prevent it from infecting the cell. Whereas Protein M fusions serve to associate immunoglobulins with the target and neutralize the target, the immunoglobulin serves to mark the target for destruction and clearance by the innate immune system including cells that bear Fc receptors (e.g., macrophages) and complement factors.


Complement is part of the innate surveillance system involved in the first line of defense against pathogens. One mechanism to direct complement to a specific pathogen is via the classical complement pathway, which is initiated by antibodies that are bound to antigen. C1q recruitment to antibodies is an essential first step in the activation of the complement cascade. Antibody binding to antigen (found on the pathogen or in solution as an immune complex) induces a change in the antibody's three-dimensional structure that exposes a C1q binding site found within the CH2 portion of the antibody Fc region. Upon C1q binding and activation, additional complement factors are recruited resulting in the formation of other effector molecules such as C3b, the main effector of the complement system. These events culminate in the formation of the membrane attack complex (MAC) that forms holes or pores on the surface of pathogens including bacteria, viruses and cancer cells resulting in subsequent clearance. C3b also serves as a potent opsin able to tag pathogens, immune complexes (antigen-antibody), and apoptotic cells for phagocytosis by immune cells that express C3b receptors. Together, MAC and C3b serve to effectively eradicate pathogens targeted by antibodies that recruit C1q. C1q is composed of 18 polypeptide chains: six A-chains, six B-chains, and six C-chains. Each chain contains a collagen-like region located near the N terminus and a C-terminal globular region.


In some embodiments, Protein M fusion protein in complex with an antibody can engage C1q and activate classical complement pathway that would contribute to eradication of the pathogen or a cancer cell, to which the Protein M fusion protein is targeted. Normally, for C1 q to bind the antibody, the antibody must first bind its antigen (immobilized on a cell or pathogen or in solution as an immune complex), and then the antibody undergoes a conformational change that permits C1q binding. However, Protein M fusion protein-IgG complex can specifically recruit C1q as demonstrated, for example, in Example 10, FIG. 11 below. Thus, Protein M-fusions can be considered a tool to specifically induce a conformational change in antibodies (while in solution) resulting in its ability to engage C1q and activate the complement pathway.


EXAMPLES

Aspects of the present teachings may be further understood in light of the following examples, which should not be construed as limiting the scope of the present teachings in any way. Below, exemplary methods to develop and characterize Avi-/myc-tagged Protein M, myc-tagged Protein M-HRP fusion protein and Protein M-ACE2 fusion protein (aka, armY-ACE2) are disclosed. These and similar methods can be applied to generate and use different Protein M fusion proteins.


Example 1. Gene Construction of Protein M

Protein M (also referred to as armY) (SEQ ID NO:10 and 38) was constructed using the mature amino acid sequence of Protein M (37-556 amino acid) containing a myc-tag (EQKLISEEDLLRKR) and linker sequence (AANGGGGSGGGGS) and a mono-biotinylation sequence “Avi-Tag” (MAGGLNDIFEAQKIEWHEGG) at its N-terminal end. The linear amino acid sequence was reverse translated to its corresponding DNA sequence using the free GenSmart™ Codon Optimization Tool by GenScript for expression in human cells (gensmart-free-gene-codon-optimization). This sequence was submitted for gene synthesis and inserted into the plasmid cloning vector pUC57 (GenScript, Inc.). The insert was amplified and cloned into a previously constructed mammalian cell expression vector pcDNA3(−) containing a myc-tag-Protein M sequence by replacing the myc-tag-Protein M sequence with the above myc-tag sequence that included a mono-biotinylation sequence, producing a final Protein M construct (IL-2 leader sequence—biotinylation tag—myc tag—linker—Protein M). The plasmid expression vector construct was verified by restriction enzyme analysis, amplified in E. coli and purified using a maxiprep kit (GenScript Inc. and Eton Bioscience, Inc.).


Example 2A. Characterization of Protein M as an Antibody Neutralizer and Blocking Reagent Tool. Binding of Protein M to Immobilized Antibody (FIG. 3)

Protein M binding to plate bound antibody was demonstrated by measuring the amount of myc-tagged Protein M bound to the antibody coated on a 96-well plate by an ELISA-based method.


Briefly, 5 ug/ml of human IgG (Sigma) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. After washing twice with PBS+Tween 20 (wash buffer, Pierce), Protein M in expression medium diluted in assay buffer (0.5% BSA in PBS+Tween 20) or assay buffer was added to antibody-coated wells in duplicate. After approximately 30 minutes at room temperature, the wells were washed and mouse IgG1 anti-myc antibody (clone: 9E10) in assay buffer was added to detect the myc-tagged Protein M. After approximately 30 minutes, the wells were washed 3×s and anti-mouse IgG labeled with HRP was added to the wells. After approximately 30 minutes, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).


Example 2B. Characterization of Protein M as an Antibody Neutralizer and Blocking Reagent Tool. Protein M Neutralizes/Blocks Antibody Binding to Cognate Antigen (FIG. 4)

The ability of Protein M to block binding to its cognate antigen was demonstrated by measuring the amount of unblocked, free antibody bound to its antigen coated on a 96-well plate by an ELISA-based method.


Briefly, 5 ug/ml of human IgG (Sigma) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. RP-labeled goat anti-human IgG antibody (GenScript, Inc.) or a RP-labeled chicken anti-human IgG antibody (Aves Labs, Inc.) was added to Protein M in expression medium or to expression medium alone and allowed to form complexes at room temperature for approximately 2 hours. After washing twice with PBS+Tween 20 (wash buffer, Pierce), samples were added to human IgG coated wells in duplicate. After approximately 45 minutes at room temperature, the wells were washed 3×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).


Example 3. PROTEIN M-HRP Fusion Protein (SEQ ID NO:11 and 39). Gene Construction of Protein M-HRP

The mature amino acid sequence of horseradish peroxidase HRP (31-338 amino acid) was generated containing a myc-tag (EQKLISEEDL) and linker (AAN) sequence at its N-terminal end. The amino acid sequence encoding 3 sets of 4 glycine residues and 1 serine residue (e.g., GGGGS)3 linker followed by the mature amino acid sequence of Protein M (37-556 amino acid) was added to its C-terminal end producing a final Protein M-HRP construct containing (IL-2 leader sequence—myc tag—HRP—linker—Protein M). The linear amino acid sequence was reverse translated to its corresponding DNA sequence using the free GenSmart™ Codon Optimization Tool by GenScript for expression in human cells (gensmart-free-gene-codon-optimization). This sequence was submitted for gene synthesis and inserted into the plasmid cloning vector pUC57 (GenScript USA Inc.). The insert was amplified and cloned into a mammalian cell expression vector, pcDNA3(−). The plasmid expression vector construct was verified by restriction enzyme analysis, amplified in E. coli and purified using a maxiprep kit (GenScript Inc. and Eton Bioscience, Inc.).


Example 4. Characterization of Protein M-HRP as a Novel Antibody Detection Reagent Tool. Detection of Immobilized F(Ab′)2, Antibody or Antibody in Solution by Protein M-HRP Fusion Protein

Protein M-HRP direct detection of plate bound F(ab′)2, antibody or indirect detection of antibody in solution was demonstrated by measuring the amount of Protein M-HRP bound to antibody coated on a 96-well plate by an ELISA-based method (FIG. 5). Direct detection of antibody by Protein M-HRP fusion protein: Briefly, 2 ug/ml of goat F(ab′)2 or 5 ug/ml of human IgG (Sigma) or 2 ug/ml of mouse IgG1 isotype anti-myc or anti-human CD28 antibodies was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. After washing twice with PBS+Tween 20 (wash buffer, Pierce), Protein M-HRP in expression medium or expression medium diluted in assay buffer (0.5% BSA in PBS+Tween 20) was added to F(ab′)2 or antibody-coated wells in duplicate. After approximately 30 minutes at room temperature, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software). Indirect detection of antibody in solution by Protein M-HRP fusion protein: Briefly, Protein M-HRP was incubated with varying amounts of mouse IgG1 antibody in assay buffer and allowed to form complexes at room temperature for approximately 30 minutes. Protein M-HRP alone was also included as a positive control. After washing twice with PBS+Tween 20 (wash buffer, Pierce), samples were added to human IgG-coated wells in duplicate. After approximately 30 minutes at room temperature, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).


Example 5. Absence of Detection of Antibody Bound to an Immobilized Antigen by Protein M-HRP Fusion Protein

Protein M does not bind to antibodies already bound to antigen. The absence of detection of antibody bound to an immobilized antigen by Protein M-HRP fusion protein was demonstrated by measuring the amount of Protein M-HRP bound to the antibody engaged with its antigen on a 96-well plate by an ELISA-based method (FIG. 6).


Briefly, 1 ug/ml of a myc-tagged protein or 2 ug/ml of human ASIP (agouti-signaling protein, RnD Systems) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. After washing twice with PBS+Tween 20 (wash buffer, Pierce), mouse IgG1 anti-myc antibody (clone: 9E10) in assay buffer, rabbit anti-ASIP antibody (Thermofisher) or assay buffer alone was added to the myc-tagged protein or ASIP, coated wells, respectively, in duplicates. After approximately 60 minutes, the wells were washed 3×s and Protein M-HRP was added. To show that mouse anti-myc and rabbit anti-ASIP bound to myc-tagged protein or ASIP coated wells, anti-mouse IgG labeled with HRP or biotinylated anti-rabbit IgG+SA-RP was added to another set of coated wells, respectively. After approximately 30 minutes, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).


Example 6. PROTEIN M-ACE2 Fusion Protein (Referred to as armY-ACE2) (SEQ ID NO:9 and 37). Gene Construction of armY-ACE2 (Protein M Fused to Angiotensin I-Converting Enzyme 2 (ACE2))

The mature amino acid sequence of human ACE2 (18-740 amino acid) was generated containing a myc-tag (EQKLISEEDLLRKR) and linker (GSPGGA) sequence at its N-terminal end. The linear amino acid sequence was reverse translated to its corresponding DNA sequence using the free GenSmart™ Codon Optimization Tool by GenScript for expression in human cells (gensmart-free-gene-codon-optimization). This sequence was submitted for gene synthesis and inserted into the plasmid cloning vector pUC57. The insert was amplified and cloned into the mammalian cell expression vector pcDNA3(−) containing the myc-tag-Protein M-HRP sequence (see above) by replacing the myc-tag-HRP sequence with the above myc-tag-ACE2 sequence, upstream of the sequence encoding 3 sets of 4 glycine residues and 1 serine residue (e.g., GGGGS)3 linker followed by the mature amino acid sequence of Protein M (37-556 amino acid), producing a final armY-ACE2 construct containing (IL-2 leader sequence—myc tag—ACE2—linker—Protein M). The plasmid expression vector construct was verified by restriction enzyme analysis, amplified in E. coli and purified using a maxiprep kit (GenScript Inc. and Eton Bioscience, Inc.).


Example 7A. Protein M, Protein M-HRP and armY-ACE2 Gene Expression

The human 293T kidney cell line was transfected with the expression vector encoding the Protein M, Protein M-HRP or armY-ACE2 (Protein M-ACE2) fusion protein sequences, by calcium phosphate transfection method. After 7-16 hours, the transfection solution was replaced with protein expression medium and the supernatant harvested after approximately 48 hours. To purify the proteins, the supernatant was harvested and pass through an anti-myc antibody-coupled agarose resin and the captured proteins eluted using 0.1M Glycine pH 2.5 and neutralized by 1M Tris-HCl pH 8.0. The eluted proteins were dialyzed against a phosphate buffered saline solution and stored in 4° C.


Example 7B. Characterization of armY-ACE2 as a Novel Therapeutic in the Treatment of Coronavirus Infection. Targeting of [armY-ACE2+Antibody] to SARS-CoV-2 Spike Protein and Binding of [armY-ACE2+SARS-CoV-2 Spike Protein] Complex to Antibody

Complex [armY-ACE2+antibody] targeting of SARS-CoV-2 spike protein was demonstrated by measuring the amount of [armY-ACE2+antibody] complexes bound to the SARS-CoV-2 spike protein coated on a 96-well plate by an ELISA-based method. Binding of [armY-ACE2+ to SARS-CoV-2 spike protein] complexes to immobilized antibody was demonstrated by measuring the amount of [armY-ACE2+SARS-CoV-2 spike protein] complexes bound to the antibody coated on a 96-well plate by an ELISA-based method (FIG. 7).


Briefly, 50 ul of 1 ug/ml histidine (his)-tagged SARS-CoV-2 spike protein (GenScript, Inc.) or 5 ug/ml of human IgG (Sigma) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (phosphate buffered saline pH 7.4) (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. [armY-ACE2+antibody] complexes were allowed to form at room temperature by adding 0.25 ug/ml biotinylated goat IgG (Jackson ImmunoResearch Inc.) to armY-ACE2 in expression medium for 60 minutes. Biotinylated antibody was also added to Protein M (lacking ACE2 domain) or expression medium as negative controls. armY-ACE2 alone in expression medium was prepared as an additional negative control. The samples were diluted in assay buffer (0.5% BSA in PBS+Tween 20) and added to SARS-CoV-2 spike protein coated wells, washed twice with PBS+Tween 20 (wash buffer, Pierce), in duplicate. After approximately 30 minutes at room temperature, the wells were washed 3×s and streptavidin-horseradish peroxidase (SA-HRP) (Biolegend, Inc.) in assay buffer was added to the wells and allowed to incubate at room temperature for approximately 20 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software). Binding of [armY-ACE2+ to SARS-CoV-2 spike protein] complexes to immobilized human IgG: armY-ACE2+SARS-CoV-2 spike protein complexes were allowed to form at room temperature by adding 2 ug/ml of SARS-CoV-2 spike protein to armY-ACE2 in expression medium. armY-ACE2 alone in expression medium was prepared as a negative control. After approximately 60 minutes at room temperature, the wells were washed and mouse IgG1 anti-histidine tag (GenScript, Inc.) or mouse IgG1 anti-myc antibody (clone: 9E10) in assay buffer was added to detect the histidine-tagged SARS-CoV-2 spike protein or myc-tagged armY-ACE2 bound to human IgG coated on the well, respectively. After approximately 30 minutes, the wells were washed 3×s and anti-mouse IgG labeled with HRP was added to the wells. After approximately 30 minutes, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).


Example 8. ArmY-ACE2 Binding to Immobilized Antibody or Antibody in Solution

armY-ACE2 binding to immobilized antibody or antibody in solution was demonstrated by measuring the amount of free or antibody-bound armY-ACE2 in an ELISA based method (FIG. 8).


Briefly, 5 ug/ml of human IgG (Sigma) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. Binding of antibody in solution by armY-ACE2: Briefly, armY-ACE2 was incubated with purified human IgG, 2% human serum (containing antibodies) or PBS in assay buffer and allowed to form complexes at room temperature for approximately 2 hours. After washing twice with PBS+Tween 20 (wash buffer, Pierce), samples were added to human IgG-coated wells in duplicate. After approximately 60 minutes at room temperature, the wells were washed and mouse IgG1 anti-myc antibody (clone: 9E10) in assay buffer was added to detect the myc-tagged armY-ACE2 bound to human IgG coated on the well. After approximately 30 minutes, the wells were washed 3×s and anti-mouse IgG labeled with HRP was added to the wells. After approximately 30 minutes, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).


Example 9. [armY-ACE2+Antibody] Complexes Engage Fc Receptors on K562 Erythroleukemic Cell Line

Binding of antibodies to Fc-receptor expressed on cells (e.g., innate immune cells, antigen presenting cells) requires interaction with the antibody Fc region. [armY-ACE2+antibody] complex engagement of Fc receptors was demonstrated by measuring the amount of [armY-ACE2+antibody] complexes bound to the human FcγRII (CD32) expressed on K562, a human erythroleukemic cell line, by flow cytometry (FIG. 9).


Briefly, K562 cells were taken from cell culture medium and centrifuged (3000 rpm for 3 minutes) and supernatant removed by vacuum aspiration. After a wash with chilled FACS buffer (0.5% BSA in PBS+0.1% sodium azide), 100,000 cells was transferred to 1.5 ml microcentrifuge tubes in FACS buffer and the supernatant removed after centrifugation and the cells kept on ice. 5 ug/ml of human IgG (Sigma Aldrich) was added to armY-ACE2 in expression medium and kept at room temperature for approximately 30 minutes to form complexes, and tubes transferred to ice to chill. armY-ACE2 alone in expression medium was also prepared as a negative control. 100 ul of [army-ACE2+antibody] complexes or armY-ACE2 alone was added to K562 cells and allowed to incubate on ice for approximately 30 minutes. After two washes in FACS buffer, anti-myc (clone 9E10 mouse antibody) was added to detect the myc-tagged army-ACE2 and allowed to incubate for approximately 20 minutes. After two washes, anti-mouse IgG-Alexafluor-488 (Biolegend, Inc.) was added to detect anti-myc antibody and allowed to incubate for approximately 20 minutes. After two washes, cells were resuspended in FACS buffer and analyzed by flow cytometry (BD FACS Calibur and CellQuest Pro analysis software). At least 5,000 events were acquired per sample. Cells incubated with negative controls as described above served as source of background basal percent value. The percentage of cells staining positive for [army-ACE2+antibody] complexes was determined by the percentage of cells present within a gate established such that <6% of the positive events of cells incubated with negative control samples measured represented background fluorescence.


To demonstrate that binding of [army-ACE2+antibody] complexes to K562 was through a specific interaction with Fc-receptors expressed on the cells, K562 cells were pre-incubated with FcγRII blocking anti-CD32 (clone IV.3, mouse IgG2b, kappa) (FIG. 10).


Briefly, 1 ug of anti-CD32 or an isotype-matched mouse IgG2b, kappa control antibody was added to 100 ul of FACS buffer and added to approximately 100,000 K562 cells and placed on ice for approximately 15 minutes. After 2 washes, 100 ul of [army-ACE2+antibody] complexes prepared as described above was added to the cells and kept on ice for approximately 20 minutes. After two washes, fluorescein (FITC)-labeled anti-myc (Biotium, Inc.) was added to K562 cells and allowed to incubate on ice for approximately 15 minutes. After two washes, cells were resuspended in FACS buffer and analyzed by flow cytometry (BD FACS Calibur and CellQuest Pro analysis software). At least 5,000 events were acquired per sample. Cells incubated with negative control served as source of background basal percent value. The percentage of cells staining positive for [army-ACE2+antibody] complexes was determined by the percentage of cells present within a gate established such that <2% of the positive events of cells incubated with negative control samples measured represented background fluorescence.


Example 10. [armY-ACE2+Antibody] Complex Binding to Purified Human C1q Complement Component

The binding of the C1q complement component to antibody is the initial step towards the activation of the classical complement pathway. The [armY-ACE2+antibody] complex binding to C1q complement component was demonstrated by measuring the amount of [armY-ACE2+antibody] complexes bound to the purified C1q coated on a 96-well plate by an ELISA-based method (FIG. 11).


Briefly, 50 ul of 5 ug/ml purified human C1q (>95% pure by SDS-PAGE analysis, Complement Technology, Inc.) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. armY-ACE2+antibody complexes were allowed to form at room temperature by adding 5 ug/ml FITC-labeled mouse IgM (Biolegend, Inc. Cat #401607) or 10 ug/ml FITC-labeled mouse IgG1 (Biolegend, Inc. Cat #200305) to armY-ACE2 or Protein M (lacking ACE2 domain) in expression medium. FITC-labeled antibody or armY-ACE2 added to expression medium served as negative controls. To block binding of [armY-ACE2+antibody] complex to immobilized C1q coated on the well, 10 ug/ml of soluble C1q was added to the [armY-ACE2+antibody] complexes and allowed to incubate at room temperature for 30 minutes. C1q-coated wells were washed twice with PBS+Tween 20 (wash buffer, Pierce), and the samples were added in duplicate. After approximately 30 minutes at room temperature, the wells were washed 3×s and biotinylated anti-FITC (Biolegend, Inc.) in assay buffer was added to the wells and allowed to incubate at room temperature for approximately 45 minutes. The wells were washed 3×s and SA-HRP (Biolegend, Inc.) in assay buffer was added to the wells and allowed to incubate at room temperature for approximately 25 minutes. After 4 washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).


Example 11. armY-ACE2 or [armY-ACE2+Antibody] Complexes Exhibit ACE2 Activity

ACE2 activity in armY-ACE2 or [armY-ACE2+antibody] complexes was demonstrated by measuring the fluorescence emitted after cleavage of the ACE2 fluorogenic substrate MCA-APK-(Dnp). ACE2-dependent removal of the quenching Dnp group induces fluorescence, which is measured by a fluorescence plate reader (FIG. 12).


Briefly, armY-ACE2 or [armY-ACE2+antibody] complexes were diluted in ACE2 buffer (1 mol/L NaCl, 75 mmol/L Tris HCl, pH 7.5, and 50 μmol/L ZnCl2) and 30 μl of diluted samples were combined with 170 μl the ACE2 fluorogenic substrate MCA-APK(Dnp) (AnaSpec, Inc. Cat #AS-60757) in ACE2 buffer. The final concentration of ACE2 substrate was 20 μM in a final volume of 200 μl. The samples were kept in the dark for 16 hours at room temperature. 100 μl of samples were transferred to a flat bottom NUNC Black 96 Microwell strip plate and fluorescence measured using a fluorescence plate reader (Cytofluor 4000, Gain 75, Ex 360/40, Em 460/40).


Example 12. SDS-PAGE Analysis of Purified armY-ACE2

SDS-PAGE analysis of purified armY-ACE2 was performed under non-reducing and reducing conditions and showed the expected band of ˜180 kDa (theoretical molecular weight: 150 kDa) (FIG. 13).


Briefly, 8 ul of sample buffer (Invitrogen) was added to 24 ul of eluted fractions and mixed. The sample were heated in 80° C. water bath for 10 minutes. Reducing agent (10×, Invitrogen) was added to some of the tubes containing the samples and mixed. Non-reduced and reduced samples were loaded onto a 4-12% NuPAGE pre-cast SDS-PAGE gel and separated at 175V for 30 minutes in MES-SDS running buffer (Invitrogen). PageRuler unstained protein ladder (10-200 kDa, Invitrogen) was also included. After electrophoresis, the gel was rinsed in distilled water and the protein bands stained using SimplyBlue Safe Stain (Invitrogen) and the gel photographed.


Example 13. Evaluating Neutralization of SARS-CoV-2 by armY-ACE2 In-Vitro

Live SARS-CoV-2 virus has to be handled under biosafety level 3 conditions due to its high pathogenicity and infectivity and the lack of effective vaccines and therapeutics. Recently, a VSV pseudovirus production system, a pseudovirus-based neutralization assay has been developed for evaluating neutralizing antibodies against SARS-CoV-2 in biosafety level 2 facilities (Nie et.al., 2020). Pseudoviruses are useful tools because of their safety and versatility, especially for emerging and re-emerging viruses. This example utilizes a validated Pseudovirus neutralization protocol slightly modified from Nie et.al., to test the efficacy of armY-ACE2 by measuring the ability of armY-ACE2 to inhibit SARS-CoV-2 pseudovirus binding and infection of ACE2 expressing cells.


Briefly, the vesicular stomatitis virus (VSV) pseudovirus system (G*AG-VSV) is used, which packages expression cassettes for firefly luciferase instead of VSV-G in the VSV genome. The SARS-CoV-2 pseudovirus is produced by transfecting human 293T cells with the expression plasmid pcDNA3.1 containing the codon-optimized SARS-CoV-2 spike protein sequence, followed by infection with G*AG-VSV pseudovirus. Post infection, SARS-CoV-2 pseudoviruses is harvested and stored until use.


Huh7 human hepatocellular cell line naturally express the human ACE2 receptor protein and is an ideal cell line for SARS-CoV-2 pseudovirus infection as it demonstrates high luciferase activity upon infection. Viral inocula of approximately 650 TCID50 (the 50% tissue culture infectious dose of SARS-CoV-2 pseudovirus) is used for the assay.


Neutralization of SARS-CoV-2 pseudovirus infection of Huh7 is confirmed by the reduction in luciferase gene expression upon infection. Neutralization condition: SARS-CoV-2 pseudovirus is incubated with serial dilutions of armY-ACE2+human plasma containing immunoglobulins (six, 1:3 dilutions, or half-log dilutions) in duplicate. Human plasma added Protein M or human plasma alone are included as negative controls. Recombinant ACE2-Ig fusion protein (commercially available from GenScript Inc., catalog #Z03484) has been demonstrated to neutralize SARS-CoV-2 pseudovirus infection previously (Lei et.al., 2020) and is used in this assay as a positive control. After incubation for 1 hour at 37° C. in a 96-well plate format, 5×10{circumflex over ( )}4 Huh7 cells is added to each well. After 24 hours of incubation in a 5% CO2 chamber at 37° C., luminescence is measured by adding luciferase substrate and the luminescence measured using a 96-well plate luminescence plate reader. Upon subtraction of background luminescence, relative light units (RLU) versus the concentration of test sample and controls is plotted to generate an inhibitory dose response curve from which the IC50 is calculated. Human plasma added armY-ACE2 is neutralizing SARS-CoV-2 pseudovirus infection of Huh7 in a dose-dependent fashion. Human plasma added Protein M or human plasma alone is not neutralizing SARS-CoV-2 pseudovirus infection in this assay.


Example 14. Evaluating Eradication of SARS-CoV-2 by armY-ACE2 In Vivo

While the Example 13 evaluates the efficacy of armY-ACE2 engaged immunoglobulins to neutralize SARS-CoV-2 in vitro, this Example will demonstrate the efficacy of armY-ACE2 to promote eradication of SARS-CoV-2 in vivo, thereby protecting the animal from a severe clinical disease and succumbing to a lethal infection.


Protein M binds to immunoglobulin of various species including those of man and mice. Commercially available human ACE2 transgenic mice K18-hACE2 (The Jackson Laboratory, Stock #034860) develops severe clinical disease upon infection with SARS-CoV (McCray et.al., 2007) to a similar degree observed in patients with severe Covid-19. According to CDC, “Among patients who developed severe disease, the median time to dyspnea from the onset of illness or symptoms ranged from 5 to 8 days, the median time to acute respiratory distress syndrome (ARDS) from the onset of illness or symptoms ranged from 8 to 12 days, and the median time to ICU admission from the onset of illness or symptoms ranged from 10 to 12 days.”


According to JAX laboratory, “These K18-hACE2 mice develop a rapidly lethal infection after intranasal inoculation with a human strain of SARS-CoV. Infection begins in airway epithelia, with subsequent alveolar involvement and extrapulmonary virus spread to the brain. Infection results in macrophage and lymphocyte infiltration in the lungs and upregulation of proinflammatory cytokines and chemokines in both the lung and the brain. By days 3 to 5 postinfection, K18-hACE2 mice begin to lose weight and become lethargic with labored breathing.” K18-hACE2 mice become moribund 4 days after inoculation, and all mice are dead 7 days after inoculation.


Recently, it was determined that K18-hACE2 mice “present with more symptomatic disease than other hACE2 mouse models of SARS-CoV-2 infection.” (Moreau et.al., 2020) For this reason and because the k18-hACE2 mice are readily commercially available, we employed the k18-hACE2 SARS-CoV model to evaluate the efficacy of army-ACE2 engaged immunoglobulins to eradicate SARS-CoV or SARS-CoV-2 in vivo following the methods as described (McCray et.al., 2007) with slight modifications.


Infection of K18-hACE2 mice with SARS-CoV or SARS-CoV-2. SARS-CoV and SARS-CoV-2 strains is obtained from the Centers for Disease Control, Atlanta, Ga. The virus is propagated and titered on Vero E6 cells in a biosafety level 3 laboratory and the virus titer is determined by a plaque assay.


Mice are lightly anesthetized with isoflurane and infected intranasally with the indicated dosage of SARS-CoV or SARS-CoV-2 in 30 ul of Dulbecco's modified Eagle medium. Infected mice are examined, weighed and evaluated for severe clinical disease on a daily basis monitoring for appearances of lethargy, labored breathing, moribund and death.


Treatment with armY-ACE2. Plasma from mice of the same background (C57BL/6J×SJL/J) as K18-hACE2 mice is harvested and mixed with armY-ACE2 and allowed to incubate at 37° C. for 1-2 hours to permit arming of plasma immunoglobulins. Infected mice (n=6) receive daily injections of 0.2 ml of armY-ACE2 plasma beginning one day after infection for 7 days. Two other cohorts of mice receive Protein M+plasma or plasma alone and serve as negative control treatment groups.


To obtain specimens for virus titers, a few animals are sacrificed before injection and after 1, 2, 3, 4, 5 and 6 days after infection, and organs are aseptically removed into sterile phosphate-buffered saline. In some cases, blood is obtained via catheterization of the inferior vena cava. Tissues are homogenized using a manual homogenizer, and the 50% tissue culture infective dose (TCID) is determined as described previously (Subbarao et.al., 2004) to determine the amount of virus per gram of tissue. Mice treated with army-ACE2 plasma do not succumb to infection, whereas mice in the negative control groups succumb. Surviving mice are permitted to continue in the study over an additional 2 months. These mice developed immunity to the virus and are protected from a subsequent challenge with the virus. Surviving mice are re-infected and examined, weighed daily and evaluated.


At termination, whole-lung lavage is performed and the lavage is evaluated for cellular and biochemical changes using standard techniques. Significantly lower cellular infiltrates and inflammatory markers in armY-ACE2 plasma treated mice are found as compared to mice in the negative control groups. Lungs and other organs are examined by histology and immunohistochemistry to evaluate the degree of disease pathology and detect viral antigen. Significantly lower severe disease pathology and viral presence in the lungs and organs or armY-ACE2 plasma treated mice are found as compared to mice in the negative control groups, indicating effective viral clearance and eradication. Similar findings are observed in armY-ACE2 plasma treated mice that had developed immunity to the virus and re-challenged with the virus.


Extraction of total RNA and quantitative reverse transcription-PCR (RTPCR) are performed to measure levels of viral RNA in various tissue specimen. An aliquot of cDNA is subjected to PCR using a MyiQ single-color real-time PCR detection system with iQ SYBR green Supermix. A set of primers is used for the SARS-CoV or SARS-CoV-2 nucleocapsid (N) gene or a house-keeping gene. Significantly lower viral genes in specimens acquired from armY-ACE2 plasma treated mice are found as compared to mice in the negative control groups, indicating effective viral clearance and eradication. Similar findings are found in armY-ACE2 plasma treated mice that had developed immunity to the virus and re-challenged with the virus.


Example 15. Identification of Two Potential Immunogenic Peptide Regions in Protein M (a.a. 469-556)

The online B-cell epitope prediction tools (IEDB Analysis Resource) were used to determine potential immunogenic peptide regions in Protein M (a.a. 469-556). Using six online prediction tools [Bepipred Linear Epitope Prediction 2.0, Bepipred Linear Epitope Prediction, Chou & Fasman Beta-Turn Prediction, Emini Surface Accessibility Prediction, Karplus & Schulz Flexibility Prediction and Parker Hydrophilicity Prediction] two peptide regions in protein M c-terminal end (469-556 amino acid) were determined to be potentially immunogenic. The following peptide substitutions are proposed for these two regions to mitigate immunogenicity of Protein M C-terminal end (469-556 amino acid), which are listed below in a) an b), and additionally shown in the following Table 1. Complete Protein M amino acid sequences with substitutions are presented as SEQ ID NO: 63-66.











a) 494-507 amino acid of protein M



(based on SEQ ID NO: 1)



1. QALANATASALAAM






2. AKLANATASALARM






3. QALEADADSALEAM






4. AKLANDTASSAERA






b) 527-540 amino acid of protein M



(based on SEQ ID NO: 1)



1. AIAGVASATNAVAS






2. AIAGVASATNAVKS






3. DIAGVSADTAEVAS






4. AITGASSATNAVKA













TABLE 1





Proposed alanine substitutions for Protein M c-terminal


end (469-556 amino acid) to mitigate immunogenicity.





























a)
494
495
496
497
498
499
500
501
502
503
504
505
506
507





Original
Q
K
L
E
N
D
T
D
S
S
L
E
R
M


Subs #1
Q
A
L
A
N
A
T
A
S
A
L
A
A
M


Subs #2
A
K
L
A
N
A
T
A
S
A
L
A
R
M


Subs #3
Q
A
L
E
A
D
A
D
S
A
L
E
A
M


Subs #4
A
K
L
A
N
D
T
A
S
S
A
E
R
A





b)
527
528
529
530
531
532
533
534
535
536
537
538
539
540





Original
D
I
T
G
V
S
S
D
T
N
E
V
K
S


Subs #1
A
I
A
G
V
A
S
A
T
N
A
V
A
S


Subs #2
A
I
A
G
V
A
S
A
T
N
A
V
K
S


Subs #3
D
I
A
G
V
S
A
D
T
A
E
V
A
S


Subs #4
A
I
T
G
A
S
S
A
T
N
A
V
K
A









Example 16. Non-Immune Human Serum or Plasma Antibodies Armed with armY-ACE2 Bind to SARS-CoV-2 Spike Protein (FIG. 14A-C)

Binding to SARS-CoV-2 spike protein by armY-ACE2 armed non-immune serum antibodies was demonstrated by measuring the amount of armed antibodies that bind the SARS-CoV-2 spike protein coated on a 96-well plate by an ELISA-based method.


Briefly, 50 ul of 5 ug/ml SARS-CoV-2 spike protein was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed 2×s with PBS (phosphate buffered saline pH 7.4) (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. [armY-ACE2+antibody] complexes were allowed to form by mixing armY-ACE2 with non-immune serum (pre-vaccine) diluted 1:200 in assay medium for 60 minutes in a 37° C. incubator. Mixtures containing pre-vaccine serum diluted 1:200 in assay medium, post-vaccine (Moderna SARS-CoV-2 spike mRNA vaccine) serum diluted 1:200 in assay medium or armY-ACE2 in assay medium, were included as controls and placed in a 37° C. incubator for 60 minutes. The final concentration of armY-ACE2 was 20 ug/ml. Binding of armY-ACE2 armed non-immune plasma (ACD-A) antibodies to SARS-CoV-2 spike protein was demonstrated following the same procedure described above.


After the incubation period, the samples were added to SARS-CoV-2 spike protein coated wells that had been washed 2×s with PBS+Tween 20 (wash buffer, Pierce), in duplicate. After approximately 120 minutes at room temperature, the wells were washed 4×s and anti-human IgG labeled with HRP (Genscript) was added to the wells and allowed to incubate at room temperature for approximately 25 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and the blue color absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software) and a photo taken with a digital camera.


After incubating serum- or plasma-antibodies with armY-ACE2, the amount of free armY-ACE2 was determined by measuring the amount of unengaged armY-ACE2 that can bind to immobilized human IgG. Briefly, serum- or plasma samples incubated with armY-ACE2 as described above, were added to human IgG coated wells, in duplicate. 1 ug/ml of armY-ACE2 alone in assay buffer was added to separate wells as a reference positive control. After approximately 100 minutes at room temperature, the wells were washed and mouse IgG1 anti-myc antibody (clone: 9E10) in assay buffer was added to detect the myc-tagged armY-ACE2 bound to human IgG coated on the wells. After approximately 30 minutes, the wells were washed 3×s and anti-mouse IgG labeled with HRP was added to the wells. After approximately 20 minutes, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).


Example 17. Monoclonal Antibody (mAb) Armed with armY-ACE2 Gains the Ability to Bind the SARS-CoV-2 Spike Protein, but is No Longer Able to Bind its Natural Target Antigen (FIG. 15 A-B)

Binding to SARS-CoV-2 spike protein by armY-ACE2 armed mAb (originally anti-selectin) was demonstrated by measuring the amount of armed mAbs that bind to the SARS-CoV-2 spike protein coated on a 96-well plate by an ELISA-based method.


Briefly, 50 ul of 5 ug/ml SARS-CoV-2 spike protein was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed 2×s with PBS (phosphate buffered saline pH 7.4) (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. [armY-ACE2+mAb] complexes were allowed to form by mixing armY-ACE2 with the mAb in assay medium for 120 minutes in a 37° C. incubator. Mixtures containing mAb in assay medium, armY-ACE2 in assay medium, or assay medium alone were included as controls and placed in a 37° C. incubator for 120 minutes. The final concentration of mAb and armY-ACE2 were 1 ug/ml and 30 ug/ml, respectively.


After the incubation period, the samples were added to SARS-CoV-2 spike protein coated wells, in duplicate, that had been washed 2×s with PBS+Tween 20 (wash buffer, Pierce). After approximately 120 minutes at room temperature, the wells were washed 3×s and anti-human IgG labeled with HRP (Southern Biotech) was added to the wells and allowed to incubate at room temperature for approximately 45 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).


The inability of armY-ACE2 armed mAb (originally anti-selectin) to bind to its natural antigen was demonstrated by measuring the amount of armed mAb that bind to selectin protein coated on a 96-well plate by an ELISA-based method.


Briefly, 100 ul of 2 ug/ml biotinylated selectin protein was prepared in PBS and added to a flat bottom 96-well plate (streptavidin coated wells) after 2 washes with PBS+Tween 20 (wash buffer, Pierce).


[armY-ACE2+mAb] complexes were allowed to form by mixing armY-ACE2 with the mAb in assay medium for 120 minutes in a 37° C. incubator. Mixtures containing mAb in assay medium or assay medium alone were included as controls and placed in a 37° C. incubator for 120 minutes. The final concentration of mAb and armY-ACE2 were 63 ng/ml and 15 ug/ml (50× molar excess) or 7.5 ug/ml (25× molar excess), respectively.


After 2 hours, the selectin-coated wells were washed 2×s with wash buffer and the mixtures added in duplicate wells and allowed to incubate for 1 hour at room temperature. After the incubation period, the wells were washed 3×s. After approximately 60 minutes at room temperature, the wells were washed 3×s and anti-human IgG labeled with HRP (Southern Biotech)+2% mouse serum was added to the wells and allowed to incubate at room temperature for approximately 60 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).


Example 18. Biotinylated Protein M Detects the Light-Chain of Antibody, but not the Heavy-Chain on Western Blot (FIG. 16)

Protein M containing an N-terminal biotinylation “Avi-Tag” sequence was mono-biotinylated using the Accelagen TurboBiotinylation kit following the reaction protocol (Accelagen, TurboBiotinylation-protocol). The use of mono-biotinylated protein M fusion as an immunologic research tool for detection of antibody light-chain was demonstrated using a 1D gel electrophoresis Western blot method.


Briefly, the antibody sample was diluted to 1.0 mg/mL with sodium dodecyl sulfate (SDS) boiling buffer and heated to 95° C. for 10 minutes, and further diluted to 0.01 mg/mL. The E. coli (K12 MG1655) lysate sample was diluted to 2.5 mg/mL in SDS boiling buffer. SDS slab gel electrophoresis was carried out under reducing conditions according to the method of Laemmli, U. (Nature 227: 680-685, 1970) as modified by O'Farrell (J Biol. Chem. 250: 4007-4021). The samples were loaded in wells in 10% acrylamide slab gels (0.75 mm thick). SDS slab gel electrophoresis was carried out for about 4 hours at 15 mA/gel. The following proteins (Millipore Sigma) were used as molecular weight standards: myosin (220,000), phosphorylase A (94,000), catalase (60,000), actin (43,000), carbonic anhydrase (29,000), and lysozyme (14,000, not shown). After slab gel electrophoresis, the gel for blotting was placed in transfer buffer (10 mM CAPS, pH 11.0, 10% methanol) and transblotted onto PVDF membranes overnight at 145 mA and approximately 100 volts/two gels. The blots were stained with Coomassie Brilliant Blue R-250, cut into pieces at the dark lines and flatbed scanned (not shown).


Western Blot analysis. The membrane sections were destained in 100% MeOH and rinsed briefly in Tween-20 tris buffer saline (TTBS). The blot was blocked for two hours in Superblock with 0.05% Tween-20 (Superblock-T). The blot was then incubated overnight in Superblock-T and rinsed 3×10 minutes in TTBS. The blot was then placed in mono-biotinylated protein M diluted to 1.0 μg/ml in Superblock-T for two hours and rinsed as above. The blot was then placed in poly-HRP streptavidin (ThermoFisher, Cat #N200) diluted 1:500,000 in Superblock-T for two hours, rinsed as above, treated with ThermoFisher Pierce ECL, and exposed to x-ray film for 3 minutes.


Example 19. Protein M Fusion Serves as a Surrogate Antigen and May be Used to Confirm that a) Antigen Binding is Via the Fab Domain of the Antibody and b) the Antibody's Target Antigen and/or Specificity (FIG. 17)

Binding of antibody to antigen is mediated by the Fab arm of the antibody, which contains the variable region where the antigen binding site is found. Protein M binds specifically to the light-chain variable region in the Fab and blocks the antigen binding site. Therefore, protein M may serve as a surrogate antigen and as an immunologic research tool, and used to a) confirm that the antibody binds via its Fab domain and b) confirms its specificity as it loses its antigen binding ability when bound by protein M. A 96-well ELISA-based method was used to demonstrate such protein M uses.


Protein M fusion serves as a surrogate antigen.


Briefly, 100 ul of 2 ug/ml mono-biotinylated protein M was prepared in PBS and added to a flat bottom 96-well plate (streptavidin coated wells) after 2 washes with PBS+Tween 20 (wash buffer, Pierce).


After approximate 2 hours at room temperature, the coated wells were washed 2×s with wash buffer and the varying amounts of monoclonal antibody (originally anti-selectin) were added in duplicate wells and allowed to incubate for approximately 2 hours at room temperature. After the incubation period, the wells were washed 3×s and anti-human IgG labeled with HRP (Southern Biotech) was added to the wells and allowed to incubate at room temperature for approximately 60 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).


Example 20. Protein M is Used to Confirm that a) the Antibody Binds Via its Fab Domain and b) it Loses its Antigen Binding Ability when Bound by Protein M (FIG. 18)

Briefly, 100 ul of 2 ug/ml biotinylated selectin protein was prepared in PBS and added to a flat bottom 96-well plate (streptavidin coated wells) after 2 washes with PBS+Tween 20 (wash buffer, Pierce).


Protein M+monoclonal antibody (mAb) complexes were allowed to form by mixing protein M with the mAb in assay medium for 120 minutes in a 37° C. incubator. Mixtures containing mAb in assay medium or assay medium alone were included as controls and placed in a 37° C. incubator for 120 minutes. The final concentration of mAb and protein M were 125 ng/ml and 3.4 ug/ml (50× molar excess) or 1.7 ug/ml (25× molar excess), respectively.


After 2 hours, the selectin-coated wells were washed 2×s with wash buffer and the mixtures added in duplicate wells and allowed to incubate for approximately 90 minutes at room temperature. After the incubation period, the wells were washed 3×s. After approximately 60 minutes at room temperature, the wells were washed 3×s and anti-human IgG labeled with HRP (Southern Biotech) was added to the wells and allowed to incubate at room temperature for approximately 60 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).


Other Embodiments

The detailed description set-forth above is provided to aid those skilled in the art in practicing the present invention. However, the invention described and claimed herein is not to be limited in scope by the specific embodiments herein disclosed because these embodiments are intended as illustration of several aspects of the invention. Any equivalent embodiments are intended to be within the scope of this invention. Indeed, various modifications of the invention in addition to those shown and described herein will become apparent to those skilled in the art from the foregoing description which do not depart from the spirit or scope of the present inventive discovery. Such modifications are also intended to fall within the scope of the appended claims.


REFERENCES CITED

All publications, patents, patent applications and other references cited in this application are incorporated herein by reference in their entirety for all purposes to the same extent as if each individual publication, patent, patent application or other reference was specifically and individually indicated to be incorporated by reference in its entirety for all purposes. Citation of a reference herein shall not be construed as an admission that such is prior art to the present invention.


Specifically intended to be within the scope of the present invention, and incorporated herein by reference in its entirety, is the following publication:

  • Grover R K, Zhu X, Nieusma T, et al. A structurally distinct human mycoplasma protein that generically blocks antigen-antibody union. Science. 2014; 343(6171):656-661.
  • Blötz C, Singh N, Dumke R, Stülke J. Characterization of an Immunoglobulin Binding Protein (IbpM) From Mycoplasma pneumoniae. Front Microbiol. 2020; 11:685. Lei C, Qian K, Li T, et al. Neutralization of SARS-CoV-2 spike pseudotyped virus by recombinant ACE2-Ig. Nat Commun. 2020; 11(1):2070.
  • Nie J, Li Q, Wu J, et al. Establishment and validation of a pseudovirus neutralization assay for SARS-CoV-2. Emerg Microbes Infect. 2020; 9(1):680-686.
  • Mccray P B, Pewe L, et al. Lethal infection of K18-hACE2 mice infected with severe acute respiratory syndrome coronavirus. J Virol. 2007; 81(2):813-21.
  • G. Brett Moreau, Stacey L. Burgess, Jeffrey M. Sturek, Alexandra N. Donlan, William A. Petri Jr., Barbara J. Mann Evaluation of K18-hACE2 mice as a model of SARS-CoV-2 infection. bioRxiv 2020.06.26.171033.
  • Subbarao K, Mcauliffe J, Vogel L, et al. Prior infection and passive transfer of neutralizing antibody prevent replication of severe acute respiratory syndrome coronavirus in the respiratory tract of mice. J Virol. 2004; 78(7):3572-7.
  • Mebrahtu E, Zheleznyak A, Hur M A, Laforest R, Lapi S E. Initial characterization of a dually radiolabeled peptide for simultaneous monitoring of protein targets and enzymatic activity. Nucl Med Biol. 2013; 40(2):190-6.











SEQUENCE LISTING



1)




Mycoplasma genitalium




The mature protein M sequence



(37-556 amino acid)



SEQ ID NO: 1



TNLVNQSGYALVASGRSGNLGFKLFSTQSPSAEVK






LKSLSLNDGSYQSEIDLSGGANFREKFRNFANELS






EAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFK






AGYYDHVASDGSLLSYYQSTEYFNNRVLMPILQTT






NGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTS






NNLTYDKWTYFAAKGSPLYDSYPNHFFEDVKTLAI






DAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNEL






QLPESVKKVSLYGDYTGVNVAKQIFANVVELEFYS






TSKANSFGFNPLVLGSKTNVIYDLFASKPFTHIDL






TQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQF






QGYFAGGYIDKYLVKNVNTNKDSDDDLVYRSLKEL






NLHLEEAYREGDNTYYRVNENYYPGASIYENERAS






RDSEFQNEILKRAEQNGVTFDENIKRITASGKYSV






QFQKLENDTDSSLERMTKAVEGLVTVIGEEKFETV






DITGVSSDTNEVKSLAKELKTNALGVKLKL






2)




Mycoplasma pneumoniae




IgG-blocking mature protein M



sequence (36-582 amino acid)



SEQ ID NO: 2



AVLIVNEVLRLQSGETLIASGRSGNLSFQLYSKVN






QNAKSKLNSISLTDGGYRSEIDLGDGSNFREDFRN






FANNLSEAITDAPKDLLRPVPKVEVSGLIKTSSTF






ITPNFKAGYYDQVAADGKTLKYYQSTEYFNNRVVM






PILQTTNGTLTANNRAYDDIFVDQGVPKFPGWFHD






VDKAYYAGSNGQSEYLFKEWNYYVANGSPLYNVYP






NFIHFKQIKTIAFDAPRIKQGNTDGINLNLKQRNP






DYVIINGLTGDGSTLKDLELPESVKKVSIYGDYHS






INVAKQIFKNVLELEFYSTNQDNNFGFNPLVLGDH






TNIIYDLFASKPFNYIDLTSLELKDNQDNIDASKL






KRAVSDIYIRRRFERQMQGYWAGGYIDRYLVKNTN






EKNVNKDNDTVYAALKDINLHLEETYTHGGNTMYR






VNENYYPGASAYEAERATRDSEFQKEIVQRAELIG






VVFEYGVKNLRPGLKYTVKFESPQEQVALKSTDKF






QPVIGSVTDMSKSVTDLIGVLRDNAEILNITNVSK






DETVVAELKEKLDRENVFQEIRT






3)




Mycoplasma iowae




IgG-blocking mature protein M



sequence (21-497 amino acid)



SEQ ID NO: 3



VGVYVATTNTQNTSVNVNNNENINYKTNGTVVTGD






KLTFSAVVQQNSNISTQAFINDGTKPVGTYNKEIN






LGKDSITPKYTSGYVETYLESGDTVSRYSSSEYHN






NRTLMPILDTKEHYYTSERTYSEIQKGIYRGWEIS






TKSINYGEQFAYSASPVLKTVFRDLKQETIKAVQF






NLGLSDTSIESINSFLKTNTGIQFVTIKGISQDTD






LSKLVLPESVQKLTLLGQRNTINDLKLPSELQEIE






IYLGSSLKSIDPLIFPKSANIISDVVMNNTSSVFT






EIKLSDSTIDNNSPKLQKAIDDVYTYRIKERAFQG






LVPGGYIASWDLTGTKVTSFNNVNIPPLNDGTGRF






YIAHVEVKTDGNFGNSQNESIGSKPSNDSQINDWF






DWGGGWQKVQEVVVSSSENVSLETATQEIMGFIAK






YPNVKKINIVNVKLTDGSTHEQLKDNVIKAITAKY






GEESQYKDIEFVLPETVPSPVA






4)




Mycoplasma tullyi




IgG-blocking mature protein M



sequence (30-517 amino acid)



SEQ ID NO: 4



IVYTSVKISNTLNQDKQIAGSNLSPTQSNRLIGFQ






TLTKFKIQDLDFELQRKIYSSRLNSAELITKSAVV






LDQSTLQNHDGEVASGQPAPQVPPPVRIPAKEQTG






HTSDFISGYSENNLYYQTPYYYNDRVYMPILDSRK






TYLRNERTTTDIGLNNYEGWITSDHSRVNNRVNVF






NYRPSPELLAKYTDLAADKLIFTMTIDLYQANPEM






INEILKEYSPDFVILSNADSQVMKQLVFPSSVKKL






TIKSNLLDRFDFSLANTEIQELELYTPRLTEYNPF






ALNPNTHLIFDSNYSKPFTSINLYGVPLTHQQVLS






ALEDVFVRRHYERALQGSFSGGYISSLDLSNTGIT






SLSNLMIKNINPYYDSYTMSVKYNSNKNGEIELLK






TNSWKNPNPAPVSTPAASSPTTPTVPSTPGDSTIN






VQDKDLGLLVSSEVKVDPQVLINVVSKYLHNNPRV






NVLDISKVSLKSGSLVDVATNLKAKIDYLNVTI






5)




Mycoplasma imitans




IgG-blocking mature protein M



sequence (29-507 amino acid)



SEQ ID NO: 5



GIIYTSVKISSSQFNKQISNPIEVPKRNNTLIGFQ






TLARFKIENLDFELQKNIYSQNENALVNKAAVVQD






NSIINHDGEPTGQNERQVPAPVKILAKEQTGHTSD






FISGYTDNNSYYQSPFYYNDRVFMPILDSHSIYLK






NERTSKEIGLDSYEGWDKIGYSTINSRVSFVQYRA






TDQLIAKFNPSNKQIFAMMINLYQADPAVINNTLR






NYLPDFVILSNADNQIIKRLVFPSSVKKLTIKSNL






LDRFDFSLANSNIQELELYTPNLTEYNPLALNPDT






HLIFDTAYSKPFTSINLYGAKLTTQETQEAFNDIF






VRRYYERYLQGAFVGGYISLLDLSNTGINSVNDYV






VKNINPAYSSYTLSVTYNPGDPGQISILRTTTSIP






SETQPTNPSNNTPSQPTDPNITTQIDAKEKDLKLV






VSSTIQVDTQVVINVVGKYLLNNPRVNNVDISRIQ






LKSGTLVDIANNFKTKMSYLNVSV






6)



Mycoplasma gallisepticum



IgG-blocking mature protein M



sequence (29-417 amino acid)



SEQ ID NO: 6



GIIYTSVKISNSLYQDKLISGQNQPLAPVNRLIGF






QTLAKFRIEGLDFELKKKIYSSTVESVELVNRSAV






LVDDSVLENHDGELTSVQSDPQVPAPVKILAKEQT






GHTSDFVSGYSDDNKYYQSPYYYNDRVYMPILDSP






TIYLKNERTSSDIGLNNYQGWIAVGHARVNSRVSV






FNYRATDELLAKFNNLPDRLIFTMSIDLYQANPAM






INETLKEYSPDFVILSNADSQTMKQLVFPSSVKKL






TIKSNILDRFDFSLVNSEIQELELYTPNLTEYNPL






ALNPKTHLIFDADYSTRFLSINLYGAQLTNQQALA






ALEDVFVHRYYERALQGSFVDGYISSLVLSDTGIT






SLNNLVIKNINPNYDSYIMSVKYHSNDSGQIELLK






TTAW






7)




Mycoplasma alvi




IgG-blocking protein M sequence,



signal peptide included (28-540



amino acid)



SEQ ID NO: 7



ISIPFIIQSTHTNNANSTIPNVSKPSGSSLAPINY






SYDNFVNNYDGTLTSNSLVFSASGSKEVKSSLQTR






AITVDGLNDIDSSMGLVDAMSQGLLDNSYDPKYNE






VREVIDMDGAHRKIVTTKCFDNNRKYMPILTYNND






TYYSYSESRTWDDVNRSIYPGWNLNRSNLSSHNQN






KMIGVDILVYTPTEVLKTAYPSVTDKIIGLSISLS






NLISTYGDQTKQVLSQLIDAVNPSLVNFWGVSDSN






LDKLPDLSSNTNIKKISIRGDYSNLNGFVFPSSVL






ELEFSSQNYKAVDPLQIPESAAIIYEQGYSSYFTS






IDLSTHKGMSNEDLQKAVNVVYQQRIHERAFQGDF






AGGYIYSWNLRNTGIYSFNNVTIPMLTDGTGRFYI






AYVAVETDGNQGPIANEVISDNSSKPSNDSQINEW






FDWNQNGWSTITEVKITAKDNVKLNFNNTVQEILG






FINKYPNIKVVDISALQFSNDETLDELIDAVNKAI






ADKYTGMDGTPTVKLDFIKVNYL






8)




Mycoplasma penetrans




IgG-blocking mature protein M



sequence (31-505 amino acid)



SEQ ID NO: 8



LVTSNNNHENSLNNSSSNNGSNLKVNGSVISTDNL






NIVATGLSSNVSSQVSRQSLSSSSSSESTVDSKYT






AKKKLTTVSGQEKEYLVSTVYENNRKFMPILAYDE






DISYNNYQQSREYKDVVYGNFPGWDKKVAVVHQID






NVDLSKAYASVAEFTPTEILKKHFQVLQTSVKQLY






VALDSKTMTADVITKLVDRYQPDYLRIESVDDTSI






KQLPDMKYFSTVKKVDLGGAFTTIKGVSFPTTTQE






LKISSDNIKSIDPLQIPESAAIITETVHDARFTEI






DLSSHTDLTTDQLQKAVNIVYKDRIKERAFQGNFA






GGYIYSWNLQNTGITSFNDVSIPKLNDGTDRFYIA






YVAVSSGNSNGTANETITGGKEPSNDSQIGEWWDS






SSDGWSKVSKVTVTAKNGASLDYNKTLTEIMGFLA






KYPNVKTIDISLLKFEDASKTLDGLKTELTNQIKS






KYGEDSSYAKIDFIITSQSN






9)



Artificial armY-ACE2 fusion protein



sequence (1,298 amino acids).



Including the human IL-2 signal



sequence, human myc-peptide epitope



tag, linker, human



ACE2, linker, Mycoplasmagenitalium



protein M



SEQ ID NO: 9




MYRMQLLSCIALSLALVTNSEQKLISEEDLLRKRG








SPGGAQSTIEEQAKTFLDKFNHEAEDLFYQSSLAS







WNYNTNITEENVQNMNNAGDKWSAFLKEQSTLAQM






YPLQEIQNLTVKLQLQALQQNGSSVLSEDKSKRLN






TILNTMSTIYSTGKVCNPDNPQECLLLEPGLNEIM






ANSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLK






NEMARANHYEDYGDYWRGDYEVNGVDGYDYSRGQL






IEDVEHTFEEIKPLYEHLHAYVRAKLMNAYPSYIS






PIGCLPAHLLGDMWGRFWTNLYSLTVPFGQKPNID






VTDAMVDQAWDAQRIFKEAEKFFVSVGLPNMTQGF






WENSMLTDPGNVQKAVCHPTAWDLGKGDFRILMCT






KVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGAN






EGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDN






ETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGE






IPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPAS






LFHVSNDYSFIRYYTRTLYQFQFQEALCQAAKHEG






PLHKCDISNSTEAGQKLFNMLRLGKSEPWTLALEN






VVGAKNMNVRPLLNYFEPLFTWLKDQNKNSFVGWS






TDWSPYADQSIKVRISLKSALGDKAYEWNDNEMYL






FRSSVAYAMRQYFLKVKNQMILFGEEDVRVANLKP






RISFNFFVTAPKNVSDIIPRTEVEKAIRMSRSRIN






DAFRLNDNSLEFLGIQPTLGPPNQPPVSGGGGSGG







GGSGGGGSTNLVNQSGYALVASGRSGNLGFKLFST







QSPSAEVKLKSLSLNDGSYQSEIDLSGGANFREKF






RNFANELSEAITNSPKGLDRPVPKTEISGLIKTGD






NFITPSFKAGYYDHVASDGSLLSYYQSTEYFNNRV






LMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNT






KATTVSTSNNLTYDKWTYFAAKGSPLYDSYPNHFF






EDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSG






NGSQLNELQLPESVKKVSLYGDYTGVNVAKQIFAN






VVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFAS






KPFTHIDLTQVTLQNSDNSAIDANKLKQAVGDIYN






YRRFERQFQGYFAGGYIDKYLVKNVNTNKDSDDDL






VYRSLKELNLHLEEAYREGDNTYYRVNENYYPGAS






IYENERASRDSEFQNEILKRAEQNGVTFDENIKRI






TASGKYSVQFQKLENDTDSSLERMTKAVEGLVTVI






GEEKFETVDITGVSSDTNEVKSLAKELKTNALGVK






LKL






10)



Artificial



Protein M with peptide tags



(aka: armY) protein sequence



(587 amino acids).



Including the human IL-2 signal



sequence. Avi-Tag, human myc-



peptide epitope tag, linker.




Mycoplasmagenitalium protein M




SEQ ID NO: 10




MYRMQLLSCIALSLALVTNSMAGGLNDIFEAQKIE







WHEGGEQKLISEEDLLRKRAANGGGGSGGGGSTNL







VNQSGYALVASGRSGNLGFKLFSTQSPSAEVKLKS








LSLNDGSYQSEIDLSGGANFREKFRNFANELSEAI








TNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGY








YDHVASDGSLLSYYQSTEYFNNRVLMPILQTTNGT








LMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNL








TYDKWTYFAAKGSPLYDSYPNHFFEDVKTLAIDAK








DISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLP








ESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSK








ANSFGFNPLVLGSKTNVIYDLFASKPFTHIDLTQV








TLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGY








FAGGYIDKYLVKNVNTNKDSDDDLVYRSLKELNLH








LEEAYREGDNTYYRVNENYYPGASIYENERASRDS








EFQNEILKRAEQNGVTFDENIKRITASGKYSVQFQ








KLENDTDSSLERMTKAVEGLVTVIGEEKFETVDIT








GVSSDTNEVKSLAKELKTNALGVKLKL







11)



Artificial



Protein M horseradish peroxidase



(HRP) fusion protein sequence



(876 amino acids).



Including the human IL-2 signal



sequence, human myc-peptide



epitope, linker. HRP, linker,




Mycoplasmagenitalium protein M




SEQ ID NO: 11




MYRMQLLSCIALSLALVTNSEQKLISEEDLAANQL







TPTFYDNSCPNVSNIVRDTIVNELRSDPRIAASIL






RLHFHDCFVNGCDASILLDNTTSFRTEKDAFGNAN






SARGFPVIDRMKAAVESACPRTVSCADLLTIAAQQ






SVTLAGGPSWRVPLGRRDSLQAFLDLANANLPAPF






FTLPQLKDSFRNVGLNRSSDLVALSGGHTFGKNQC






RFIMDRLYNFSNTGLPDPTLNTTYLQTLRGLCPLN






GNLSALVDFDLRTPTIFDNKYYVNLEEQKGLIQSD






QELFSSPNATDTIPLVRSFANSTQTFFNAFVEAMD






RMGNITPLTGTQGQIRLNCRVVNSNSGGGGSGGGG







SGGGGSTNLVNQSGYALVASGRSGNLGFKLFSTQS







PSAEVKLKSLSLNDGSYQSEIDLSGGANFREKFRN






FANELSEAITNSPKGLDRPVPKTEISGLIKTGDNF






ITPSFKAGYYDHVASDGSLLSYYQSTEYFNNRVLM






PILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKA






TTVSTSNNLTYDKWTYFAAKGSPLYDSYPNHFFED






VKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNG






SQLNELQLPESVKKVSLYGDYTGVNVAKQIFANVV






ELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKP






FTHIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYR






RFERQFQGYFAGGYIDKYLVKNVNTNKDSDDDLVY






RSLKELNLHLEEAYREGDNTYYRVNENYYPGASIY






ENERASRDSEFQNEILKRAEQNGVTFDENIKRITA






SGKYSVQFQKLENDTDSSLERMTKAVEGLVTVIGE






EKFETVDITGVSSDTNEVKSLAKELKTNALGVKLK






L






12)



Artificial



Set of three Glycine (G4)-Serine



(S1) linker sequence



(1-15 amino acid)



SEQ ID NO: 12



GGGGSGGGGSGGGGS






13)



Artificial



Set of two Glycine (G4)-Serine



(S1) linker sequence



(1-10 amino acid)



SEQ ID NO: 13



GGGGSGGGGS






14)



SEQ ID NO: 14



Artificial



Set of one Glycine (G4)-Serine



(S1) linker sequence



(1-5 amino acid)



GGGGS






15)



SEQ ID NO:  15






Human


Angiotensin-Converting Enzyme 2 (ACE2) Extracellular Domain Protein Sequence (18-740 Amino Acid)


Essential counter-regulatory carboxypeptidase of the renin-angiotensin hormone system that is a critical regulator of blood volume, systemic vascular resistance, and thus cardiovascular homeostasis. This receptor acts as an attachment receptor for human coronaviruses SARS-CoV and SARS-CoV-2, as well as human coronavirus NL63/HCoV-NL63











16)



SEQ ID NO: 16



QSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNT






NITEENVQNMNNAGDKWSAFLKEQSTLAQMYPLQE






IQNLTVKLQLQALQQNGSSVLSEDKSKRLNTILNT






MSTIYSTGKVCNPDNPQECLLLEPGLNEIMANSLD






YNERLWAWESWRSEVGKQLRPLYEEYVVLKNEMAR






ANHYEDYGDYWRGDYEVNGVDGYDYSRGQLIEDVE






HTFEEIKPLYEHLHAYVRAKLMNAYPSYISPIGCL






PAHLLGDMWGRFWTNLYSLTVPFGQKPNIDVTDAM






VDQAWDAQRIFKEAEKFFVSVGLPNMTQGFWENSM






LTDPGNVQKAVCHPTAWDLGKGDFRILMCTKVTMD






DFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHE






AVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEIN






FLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKDQ






WMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVS






NDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLHKC






DISNSTEAGQKLFNMLRLGKSEPWTLALENVVGAK






NMNVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSP






YADQSIKVRISLKSALGDKAYEWNDNEMYLFRSSV






AYAMRQYFLKVKNQMILFGEEDVRVANLKPRISFN






FFVTAPKNVSDIIPRTEVEKAIRMSRSRINDAFRL






NDNSLEFLGIQPTLGPPNQPPVS






Human


CD209 (DC-SIGN) Extracellular Domain Protein Sequence (59-404 Amino Acid).


A pathogen-recognition receptor expressed on the surface of immature dendritic cells (DCs) and involved in initiation of primary immune response. This receptor acts as an attachment receptor for HIV-1, HIV-2, Ebolavirus, Cytomegalovirus, HCV, Dengue virus, Measles virus, Herpes simplex virus 1, Influenza virus, SARS-CoV, Japanese encephalitis virus, Lassa virus, Respiratory syncytial virus, Rift valley fever virus, West-nile virus, Marburg virus, Uukuniemi virus, and Yersinia Pestis











17)



SEQ ID NO: 17



QVSKVPSSISQEQSRQDAIYQNLTQLKAAVGELSE






KSKLQEIYQELTQLKAAVGELPEKSKLQEIYQELT






RLKAAVGELPEKSKLQEIYQELTWLKAAVGELPEK






SKMQEIYQELTRLKAAVGELPEKSKQQEIYQELTR






LKAAVGELPEKSKQQEIYQELTRLKAAVGELPEKS






KQQEIYQELTQLKAAVERLCHPCPWEWTFFQGNCY






FMSNSQRNWHDSITACKEVGAQLVVIKSAEEQNFL






QLQSSRSNRFTWMGLSDLNQEGTWQWVDGSPLLPS






FKQYWNRGEPNNVGEEDCAEFSGNGWNDDKCNLAK






FWICKKSAASCSRDEEQFLSPAPATPNPPPA






Human


C-Type Lectin Domain Family 4 Member M Extracellular Domain Protein Sequence (71-399 Amino Acid).


Probable pathogen-recognition receptor involved in peripheral immune surveillance in liver. This receptor acts as an attachment receptor for Ebolavirus, Hepatitis C virus, HIV-1, Human coronavirus 229E, Human cytomegalovirus/HHV-5, Influenza virus, SARS-CoV, West-nile virus, Japanese encephalitis virus, Marburg virus glycoprotein, and M. bovis.











18)



SEQ ID NO: 18



QVSKVPSSLSQEQSEQDAIYQNLTQLKAAVGELSE






KSKLQEIYQELTQLKAAVGELPEKSKLQEIYQELT






RLKAAVGELPEKSKLQEIYQELTRLKAAVGELPEK






SKLQEIYQELTRLKAAVGELPEKSKLQEIYQELTE






LKAAVGELPEKSKLQEIYQELTQLKAAVGELPDQS






KQQQIYQELTDLKTAFERLCRHCPKDWTFFQGNCY






FMSNSQRNWHDSVTACQEVRAQLVVIKTAEEQNFL






QLQTSRSNRFSWMGLSDLNQEGTWQWVDGSPLSPS






FQRYWNSGEPNNSGNEDCAEFSGSGWNDNRCDVDN






YWICKKPAACFRDE






Human


CD4 Extracellular Domain Protein Sequence (26-396 Amino Acid).


Integral membrane glycoprotein that plays an essential role in the immune response and serves multiple functions in responses against both external and internal offenses. In T-cells, functions primarily as a coreceptor for MHC class II molecule:peptide complex. This coreceptor acts as an attachment receptor for HIV.











19)



SEQ ID NO: 19



KKVVLGKKGDTVELTCTASQKKSIQFHWKNSNQIK






ILGNQGSFLTKGPSKLNDRADSRRSLWDQGNFPLI






IKNLKIEDSDTYICEVEDQKEEVQLLVFGLTANSD






THLLQGQSLTLTLESPPGSSPSVQCRSPRGKNIQG






GKTLSVSQLELQDSGTWTCTVLQNQKKVEFKIDIV






VLAFQKASSIVYKKEGEQVEFSFPLAFTVEKLTGS






GELWWQAERASSSKSWITFDLKNKEVSVKRVTQDP






KLQMGKKLPLHLTLPQALPQYAGSGNLTLALEAKT






GKLHQEVNLVVMRATQLQKNLTCEVWGPTSPKLML






SLKLENKEAKVSKREKAVWVLNPEAGMWQCLLSDS






GQVLLESNIKVLPTWSTPVQP






Human


Synaptic Vesicle Glycoprotein 2A Extracellular Domain Protein Sequence (469-598 Amino Acid).


Plays a role in the control of regulated secretion in neural and endocrine cells, enhancing selectively low-frequency neurotransmission. This protein acts as an attachment receptor for the C. botulinum neurotoxin type A2 (BoNT/A, botA).











20)



SEQ ID NO: 20



PDMIRHLQAVDYASRTKVFPGERVEHVTFNFTLEN






QIHRGGQYFNDKFIGLRLKSVSFEDSLFEECYFED






VTSSNTFFRNCTFINTVFYNTDLFEYKFVNSRLIN






STFLHNKEGCPLDVTGTGEGAYMVY






Human


Synaptic Vesicle Glycoprotein 2B Extracellular Domain Protein Sequence (412-535 Amino Acid).


Probably plays a role in the control of regulated secretion in neural and endocrine cells. This protein acts as an attachment receptor for the C. botulinum neurotoxin type A2 (BoNT/A, botA). Probably also serves as a receptor for the closely related C. botulinum neurotoxin type A1.











21)



SEQ ID NO: 21



PDMIRYFQDEEYKSKMKVFFGEHVYGATINFTMEN






QIHQHGKLVNDKFTRMYFKHVLFEDTFFDECYFED






VTSTDTYFKNCTIESTIFYNTDLYEHKFINCRFIN






STFLEQKEGCHMDLEQDND






Human


Synaptic Vesicle Glycoprotein 2C Extracellular Domain Protein Sequence (459-578 Amino Acid).


Plays a role in the control of regulated secretion in neural and endocrine cells, enhancing selectively low-frequency neurotransmission. This protein acts as an attachment receptor for C. botulinum neurotoxin type A (BoNT/A, botA). Also serves as a receptor for the closely related C. botulinum neurotoxin type A2.











22)



SEQ ID NO: 22



KPLQSDEYALLTRNVERDKYANFTINFTMENQIHT






GMEYDNGRFIGVKFKSVTFKDSVFKSCTFEDVTSV






NTYFKNCTFIDTVFDNTDFEPYKFIDSEFKNCSFF






HNKTGCQITFDDDYS






Human


Synaptotagmin I Extracellular Domain Protein Sequence (1-57 Amino Acid).


Calcium sensor that participates in triggering neurotransmitter release at the synapse. This protein acts as an attachment receptor for C. botulinum neurotoxin type B (BoNT/B, botB)











23)



SEQ ID NO: 23



MVSESHHEALAAPPVTTVATVLPSNATEPASPGEG






KEDAFSKLKEKFMNELHKIPLP






Human


Synaptotagmin II Extracellular Domain Protein Sequence (1-62 Amino Acid).


Exhibits calcium-dependent phospholipid and inositol polyphosphate binding properties. This protein acts as an attachment receptor for C. botulinum neurotoxin type B (BoNT/B, botB)









24) SEQ ID NO: 24


MRNIFKRNQEPIVAPATTTATMPIGPVDNSTESGGAGESQEDMFAKLKE


KLFNEINKIPLPP


Human






HLA Class II Histocompatibility Antigen, DRB1 Beta Chain Extracellular Domain Protein Sequence (30-227 Amino Acid).


A beta chain of antigen-presenting major histocompatibility complex class II (MHCII) molecule. This protein acts as an attachment receptor for Epstein-Barr virus and Staphylococcal enterotoxin A and B.









25) SEQ ID NO: 25


GDTRPRFLWQPKRECHFFNGTERVRFLDRYFYNQEESVRFDSDVGEFRA





VTELGRPDAEYWNSQKDILEQARAAVDTYCRHNYGVVESFTVQRRVQPK





VTVYPSKTQPLQHHNLLVCSVSGFYPGSIEVRWFLNGQEEKAGMVSTGL





IQNGDWTFQTLVMLETVPRSGEVYTCQVEHPSVTSPLTVEWRARSESAQ





SK





Human






Human


HLA Class II Histocompatibility Antigen, DR Alpha Chain Extracellular Domain Protein Sequence (26-216 Amino Acid).


Binds peptides derived from antigens that access the endocytic route of antigen presenting cells (APC) and presents them on the cell surface for recognition by the CD4 T-cells. This protein acts as an attachment receptor for Epstein-Barr virus BZLF2/gp42, Staphylococcus aureus enterotoxin A/entA, enterotoxin B/entB, enterotoxin C1/entC1, enterotoxin D/entD, and enterotoxin H/entH.









26) SEQ ID NO: 26


IKEEHVIIQAEFYLNPDQSGEFMFDFDGDEIFHVDMAKKETVWRLEEFG





RFASFEAQGALANIAVDKANLEIMTKRSNYTPITNVPPEVTVLTNSPVE





LREPNVLICFIDKFTPPVVNVTWLRNGKPVTTGVSETVFLPREDHLFRK





FHYLPFLPSTEDVYDCRVEHWGLDEPLLKHWEFDAPSPLPETTE






Human


T Cell Receptor Beta Variable 7-9 Mature Protein Sequence (22-115 Amino Acid).


V region of the variable domain of T cell receptor (TR) beta chain that participates in the antigen recognition. This protein acts as an attachment receptor for Staphylococcus aureus enterotoxin A/entA.









27) SEQ ID NO: 27 


GVSQNPRHKITKRGQNVTFRCDPISEHNRLYWYRQTLGQGPEFLTYFQN


EAQLEKSRLLSDRFSAERPKGSFSTLEIQRTEQGDSAMYLCASSL






Human


T Cell Receptor Beta Variable 19 Mature Protein Sequence (22-114 Amino Acid).


V region of the variable domain of T cell receptor (TR) beta chain that participates in the antigen recognition. This protein acts as an attachment receptor for Staphylococcus aureus enterotoxin B/entB.









28) SEQ ID NO: 28


GITQSPKYLFRKEGQNVTLSCEQNLNHDAMYWYRQDPGQGLRLIYYSQI


VNDFQKGDIAEGYSVSREKKESFPLTVTSAQKNPTAFYLCASSI






Human


Hepatitis a Virus Cellular Receptor 1 Extracellular Domain Protein Sequence (21-364 Amino Acid).


May play a role in T-helper cell development and the regulation of asthma and allergic diseases. This protein acts as an attachment receptor for Hepatitis A virus, Ebola virus, Marburg virus and Dengue virus and Clostridium perfringens Epsilon toxin (ETX).









29) SEQ ID NO: 29


SVKVGGEAGPSVTLPCHYSGAVTSMCWNRGSCSLFTCQNGIVWTNGTHV





TYRKDTRYKLLGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGWFNDMKI





TVSLEIVPPKVTTTPIVTTVPTVTTVRTSTTVPTTTTVPMTTVPTTTVP





TTMSIPTTTTVLTTMTVSTTTSVPTTTSIPTTTSVPVTTTVSTFVPPMP





LPRQNHEPVATSPSSPQPAETHPTTLQGAIRREPTSSPLYSYTTDGNDT





VTESSDGLWNNNQTQLFLEHSLLTANTTKGIYAGVCISVLVLLALLGVI





IAKKYFFKKEVQQLSVSFSSLQIKALQNAVEKEVQAEDNIYIENSLYAT





D






Human


Myelin and Lymphocyte Protein Protein Sequence (1-153 Amino Acid).


Could be an important component in vesicular trafficking cycling between the Golgi complex and the apical plasma membrane. This protein acts as an attachment receptor for Clostridium perfringens Epsilon toxin (ETX).









30) SEQ ID NO: 30


MAPAAATGGSTLPSGFSVFTTLPDLLFIFEFIFGGLVWILVASSLVPWP





LVQGWVMFVSVFCFVATTTLIILYIIGAHGGETSWVTLDAAYHCTAALF





YLSASVLEALATITMQDGFTYRHYHENIAAVVFSYIATLLYVVHAVFSL





IRWKSS






Human


Complement Factor H Mature Protein Sequence (19-1231 Amino Acid).


Glycoprotein that plays an essential role in maintaining a well-balanced immune response by modulating complement activation. This protein binds to Streptococcus pneumoniae, Neisseria meningitides, Staphylococcus aureus, Borrelia burgdorferi and West nile virus.









31) SEQ ID NO: 31


EDCNELPPRRNTEILTGSWSDQTYPEGTQAIYKCRPGYRSLGNVIMVCR





KGEWVALNPLRKCQKRPCGHPGDTPFGTFTLTGGNVFEYGVKAVYTCNE





GYQLLGEINYRECDTDGWTNDIPICEVVKCLPVTAPENGKIVSSAMEPD





REYHFGQAVRFVCNSGYKIEGDEEMHCSDDGFWSKEKPKCVEISCKSPD





VINGSPISQKIIYKENERFQYKCNMGYEYSERGDAVCTESGWRPLPSCE





EKSCDNPYIPNGDYSPLRIKHRTGDEITYQCRNGFYPATRGNTAKCTST





GWIPAPRCTLKPCDYPDIKHGGLYHENMRRPYFPVAVGKYYSYYCDEHF





ETPSGSYWDHIHCTQDGWSPAVPCLRKCYFPYLENGYNQNYGRKFVQGK





SIDVACHPGYALPKAQTTVTCMENGWSPTPRCIRVKTCSKSSIDIENGF





ISESQYTYALKEKAKYQCKLGYVTADGETSGSITCGKDGWSAQPTCIKS





CDIPVFMNARTKNDFTWFKLNDTLDYECHDGYESNTGSTTGSIVCGYNG





WSDLPICYERECELPKIDVHLVPDRKKDQYKVGEVLKFSCKPGFTIVGP





NSVQCYHFGLSPDLPICKEQVQSCGPPPELLNGNVKEKTKEEYGHSEVV





EYYCNPRFLMKGPNKIQCVDGEWTTLPVCIVEESTCGDIPELEHGWAQL





SSPPYYYGDSVEFNCSESFTMIGHRSITCIHGVWTQLPQCVAIDKLKKC





KSSNLIILEEHLKNKKEFDHNSNIRYRCRGKEGWIHTVCINGRWDPEVN





CSMAQIQLCPPPPQIPNSHNMTTTLNYRDGEKVSVLCQENYLIQEGEEI





TCKDGRWQSIPLCVEKIPCSQPPQIEHGTINSSRSSQESYAHGTKLSYT





CEGGFRISEENETTCYMGKWSSPPQCEGLPCKSPPEISHGVVAHMSDSY





QYGEEVTYKCFEGFGIDGPAIAKCLGEKWSHPPSCIKTDCLSLPSFENA





IPMGEKKDVYKAGEQVTYTCATYYKMDGASNVTCINSRWTGRPTCRDTS





CVNPPTVQNAYIVSRQMSKYPSGERVRYQCRSPYEMFGDEEVMCLNGNW





TEPPQCKDSTGKCGPPPPIDNGDITSFPLSVYAPASSVEYQCQNLYQLE





GNKRITCRNGQWSEPPKCLHPCVISREIMENYNIALRWTAKQKLYSRTG





ESVEFVCKRGYRLSSRSHTLRTTCWDGKLEYPTCAKR






Human


Hepatocyte Growth Factor Receptor Extracellular Domain Protein Sequence (25-932 Amino Acid).


Receptor tyrosine kinase that transduces signals from the extracellular matrix into the cytoplasm by binding to hepatocyte growth factor/HGF ligand. This receptor acts as an attachment receptor for Listeria monocytogenes internalin InlB, mediating entry of the pathogen into cells.









32) SEQ ID NO: 32


ECKEALAKSEMNVNMKYQLPNFTAETPIQNVILHEHHIFLGATNYIYVL





NEEDLQKVAEYKTGPVLEHPDCFPCQDCSSKANLSGGVWKDNINMALVV





DTYYDDQLISCGSVNRGTCQRHVFPHNHTADIQSEVHCIFSPQIEEPSQ





CPDCVVSALGAKVLSSVKDRFINFFVGNTINSSYFPDHPLHSISVRRLK





ETKDGFMFLTDQSYIDVLPEFRDSYPIKYVHAFESNNFIYFLTVQRETL





DAQTFHTRIIRFCSINSGLHSYMEMPLECILTEKRKKRSTKKEVFNILQ





AAYVSKPGAQLARQIGASLNDDILFGVFAQSKPDSAEPMDRSAMCAFPI





KYVNDFFNKIVNKNNVRCLQHFYGPNHEHCFNRTLLRNSSGCEARRDEY





RTEFTTALQRVDLFMGQFSEVLLTSISTFIKGDLTIANLGTSEGRFMQV





VVSRSGPSTPHVNFLLDSHPVSPEVIVEHTLNQNGYTLVITGKKITKIP





LNGLGCRHFQSCSQCLSAPPFVQCGWCHDKCVRSEECLSGTWTQQICLP





AIYKVFPNSAPLEGGTRLTICGWDFGFRRNNKFDLKKTRVLLGNESCTL





TLSESTMNTLKCTVGPAMNKHFNMSIIISNGHGTTQYSTFSYVDPVITS





ISPKYGPMAGGTLLTLTGNYLNSGNSRHISIGGKTCTLKSVSNSILECY





TPAQTISTEFAVKLKIDLANRETSIFSYREDPIVYEIHPTKSFISGGST





ITGVGKNLNSVSVPRMVINVHEAGRNFTVACQHRSNSEIICCTTPSLQQ





LNLQLPLKTKAFFMLDGILSKYFDLIYVHNPVFKPFEKPVMISMGNENV





LEIKGNDIDPEAVKGEVLKVGNKSCENIHLHSEAVLCTVPNDLLKLNSE





LNIEWKQAISSTVLGKVIVQPDQNFT






Human


Membrane Cofactor Protein (CD46) Extracellular Domain Protein Sequence (35-343 Amino Acid).


Acts as a cofactor for complement factor I, a serine protease which protects autologous cells against complement-mediated injury by cleaving C3b and C4b deposited on host tissue. This protein acts as an attachment receptor for Adenovirus subgroup B2 and Ad3, Measles virus, Herpesvirus 6/HHV-6, Neisseria and Streptococcus pyogenes.









33) SEQ ID NO: 33


CEEPPTFEAMELIGKPKPYYEIGERVDYKCKKGYFYIPPLATHTICDRN





HTWLPVSDDACYRETCPYIRDPLNGQAVPANGTYEFGYQMHFICNEGYY





LIGEEILYCELKGSVAIWSGKPPICEKVLCTPPPKIKNGKHTFSEVEVF





EYLDAVTYSCDPAPGPDPFSLIGESTIYCGDNSVWSRAAPECKVVKCRF





PVVENGKQISGFGKKFYYKATVMFECDKGFYLDGSDTIVCDSNSTWDPP





VPKCLKVLPPSSTKPPALSHSVSTSSTTKSPASSASGPRPTYKPPVSNY





PGYPKPEEGILDSLD






Human


Glycophorin-A Extracellular Domain Protein Sequence (20-91 Amino Acid).


Glycophorin A is the major intrinsic membrane protein of the erythrocyte. This protein acts as an attachment receptor for Plasmodium falciparum, Influenza virus, Hepatitis A virus (HAV), Streptococcus gordonii.









34) SEQ ID NO: 34


SSTTGVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAHEVSEISVR


TVYPPEEETGERVQLAHHFSEPE






Human


C-Type Lectin Domain Family 4 Member K (Langerin, CD207) Extracellular Domain Protein Sequence (65-328 Amino Acid).


Calcium-dependent lectin displaying mannose-binding specificity. This protein binds to Candida species, Saccharomyces species, Malassezia furfur, human immunodeficiency virus-1 (HIV-1) and Yesinia pestis.









35) SEQ ID NO: 35


PRFMGTISDVKTNVQLLKGRVDNISTLDSEIKKNSDGMEAAGVQIQMVN





ESLGYVRSQFLKLKTSVEKANAQIQILTRSWEEVSTLNAQIPELKSDLE





KASALNTKIRALQGSLENMSKLLKRQNDILQVVSQGWKYFKGNFYYFSL





IPKTWYSAEQFCVSRNSHLTSVTSESEQEFLYKTAGGLWWIGLTKAGME





GDWSWVDDTPFNKVQSVRFWIPGEPNNAGNNEHCGNIKAPSLQAWNDAP





CDKTFLFICKRPYVPSEP






Human


Anthrax Toxin Receptor 1 Mature Protein Sequence (33-564 Amino Acid).


Plays a role in cell attachment and migration. Interacts with extracellular matrix proteins and with the actin cytoskeleton. This protein acts as an attachment receptor for Anthrax toxin.









36) SEQ ID NO: 36


EDGGPACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMS





FIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQ





IYYENRQGYRTASVIIALTDGELHEDLFFYSEREANRSRDLGAIVYCVG





VKDFNETQLARIADSKDHVFPVNDGFQALQGIIHSILKKSCIEILAAEP





STICAGESFQVVVRGNGFRHARNVDRVLCSFKINDSVTLNEKPFSVEDT





YLLCPAPILKEVGMKAALQVSMNDGLSFISSSVIITTTHCSDGSILAIA





LLILFLLLALALLWWFWPLCCTVIIKEVPPPPAEESEEEDDDGLPKKKW





PTVDASYYGGRGVGGIKRMEVRWGEKGSTEEGAKLEKAKNARVKMPEQE





YEFPEPRNLNNNMRRPSSPRKWYSPIKGKLDALWVLLRKGYDRVSVMRP





QPGDTGRCINFTRVKNNQPAKYPLNNAYHTSSPPPAPIYTPPPPAPHCP





PPPPSAPTPPIPSPPSTLPPPPQAPPPNRAPPPSRPPPRPSV






Human


Anthrax Toxin Receptor 2 Extracellular Domain Protein Sequence (34-318 Amino Acid).


Necessary for cellular interactions with laminin and the extracellular matrix. This protein acts as an attachment receptor for Anthrax toxin.









37) SEQ ID NO: 37


QEQPSCRRAFDLYFVLDKSGSVANNWIEIYNFVQQLAERFVSPEMRLSF





IVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQI





QKAGGLKTSSIIIALTDGKLDGLVPSYAEKEAKISRSLGASVYCVGVLD





FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEILELQPSSV





CVGEEFQIVLSGRGFMLGSRNGSVLCTYTVNETYTTSVKPVSVQLNSML





CPAPILNKAGETLDVSVSFNGGKSVISGSLIVTATECSNG














Artificial



armY-Angiotensin-converting enzyme 2 (ACE2) fusion protein with N-terminal Myc-tag


codon-optimized (for human) nucleotide sequence (3,897 bp)


38) SEQ ID NO: 38



atgtacaggatgcaactcctgtcttgcattgcactaagtcttgcacttgtcacaaacagtgagcaaaagcttatctctgaagaggacttact






aagaaagcggggcagcccaggcggagcgcagagcacaatcgaggaacaggccaagaccttcctggacaagttcaaccacgaagctgaagac





ctgttctaccaatctagcctggctagttggaactacaacaccaacattacagaagagaacgtgcagaacatgaacaacgcaggcgacaagtggtcc





gccttccttaaagagcagtctacactggcccagatgtaccctctgcaagagattcagaatctgaccgtgaagctgcagctgcaggctctccagcaga





atgggtccagcgtgctgtctgaggataagagcaagcggctgaacaccatcctgaatacaatgagcaccatctacagcaccggcaaagtgtgtaac





cctgacaacccccaggagtgtctgctgctggaacctggcctgaacgaaatcatggccaactccctggactacaacgagagactgtgggcctggga





gagctggcgtagcgaggtgggaaaacagctgcgccccctgtatgaggagtacgtggtgctgaagaatgagatggccagagccaaccactacga





ggactacggcgactattggagaggcgattatgaagtcaacggcgttgacggctacgactacagccggggacagctgatcgaagacgtggaacat





acgtagaggagatcaagcctctgtacgagcacctgcacgcctacgtaagagccaaactgatgaatgcctaccccagctacatctcccctatcggct





gcctgcccgcccatctgctcggcgacatgtggggcagattctggaccaacctgtattctctgacagtgcctttcggccagaaacctaacatcgacgt





gacagatgccatggtggaccaggcctgggatgcccaaagaatcttcaaggaagccgagaaattcttcgtgtccgtggggctgcctaatatgaccca





gggcttctgggaaaacagcatgctcaccgatcctggcaacgtgcagaaggcagtgtgccaccccaccgcctgggaccttggaaagggcgacttc





cggattctgatgtgcaccaaggtgaccatggacgacttcctgaccgctcaccacgagatgggccacatccagtacgacatggcctacgccgctca





gcctttcctcctgagaaacggcgctaatgaaggcttccacgaggccgtgggcgaaatcatgagcctgagcgccgccacccctaagcacctgaagt





ctatcggactgctgagccccgactttcaggaggacaacgaaactgagatcaacttcttgctgaaacaggccctgacaatcgttggcaccctgccctt





tacctacatgctggaaaagtggagatggatggtctttaagggcgaaatccccaaggaccaatggatgaagaagtggtgggagatgaagcgggaa





atcgtgggcgtggtggaacctgtgccccacgacgagacatactgcgatcctgctagcctctttcacgtgagcaatgattactcattcatccggtacta





caccagaactctgtaccagttccagttccaggaggccctgtgccaggccgccaagcacgagggccctctgcacaagtgcgacatctctaacagca





ccgaggccggccagaagctgttcaacatgctgagactgggcaagagcgaaccttggacactggccctggagaacgtggtcggagccaagaaca





tgaacgtgagaccactgctgaactacttcgagcccctgttcacctggctgaaggatcaaaacaagaacagcttcgtgggctggtccacagactgga





gcccatacgctgatcagagcatcaaagtgaggatctctctgaagagcgccctgggagataaggcctacgagtggaacgataatgagatgtacctgt





tcagaagcagcgtggcctacgccatgcggcagtacttcctgaaagtgaagaaccagatgatcctgtttggcgaggaggatgtgagagtggccaat





ctgaaaccaagaatcagctttaactttttcgttaccgctcctaagaacgtgtctgatatcatccctagaaccgaggtggaaaaggccatcagaatgag





ccggtccagaatcaacgatgccttccgactgaatgacaactccctggagttcctgggaatccagcccaccctgggccctcctaaccagcctccagt





cagcggcggaggaggatctggcggtggaggctctggcggcggcggttcaacaaatctggtgaaccagagcggctacgccctggtggccagcg





gcagatccggcaatctgggcttcaagctgttcagcacccagtctccatctgccgaggtgaagctgaagagcctgagccttaacgacggcagctac





cagtccgagatcgacctgtcaggcggcgccaacttccgagaaaagttcagaaacttcgccaatgagctgagcgaggccatcacaaacagcccta





aaggcctggacagacctgtgcccaagacggaaatcagcggcctgatcaagacaggcgacaactttatcacccctagcttcaaggccggatattat





gaccacgtggcctctgatggctccctactgagctactaccagtccaccgagtacttcaacaacagagttctgatgcctatcctgcagacaacaaacg





gcactctgatggccaacaaccggggctacgacgacgttttcagacaagtgccctctttcagcggctggagcaacacaaaggccaccactgtgtcc





acaagcaacaatctgacatacgataagtggacctatttcgccgccaaaggcagccccctgtacgacagctaccccaaccacttcttcgaggacgtg





aagacactggccattgacgctaaggacatcagcgccctgaaaaccaccatcgacagcgagaagcctacctacctgattatccggggactgagcg





gaaacggcagccagctgaacgagctgcaactgcctgagtccgtgaaaaaggtgagcctgtacggcgactacaccggcgtgaacgtggctaagc





agatcttcgccaacgttgtggaactggaattctacagcaccagcaaggctaactcttttggctttaaccccctggtcctgggatctaaaacgaacgtga





tctacgacctgttcgcaagcaagcccttcacccacatcgacctgacacaggtgaccctgcaaaacagcgataattccgccatcgatgccaacaagc





tgaagcaagctgtgggcgatatctacaactacaggcggttcgagagacagtttcagggctacttcgccggaggctacatcgacaagtacctggtga





agaacgtcaataccaacaaggatagcgatgacgatctggtctaccggagcctgaaagagctgaacctccacctggaggaagcctacagagaagg





cgataacacctactacagagtgaatgagaactattaccctggagctagcatctacgagaacgagagagccagcagagacagcgagttccagaac





gagatcctgaagcgagccgagcagaacggcgtgacatttgacgagaacatcaaaagaatcacagccagcggcaagtatagcgtgcagttccaaa





agctagaaaatgataccgattccagcctggaaagaatgaccaaggccgtggaaggccttgtgaccgtgatcggcgaggaaaagttcgagacagt





ggatatcaccggcgtgtctagcgataccaatgaagtgaaaagcctggccaaggaactgaagaccaacgccctgggcgtcaagctgaaactctaa





Artificial


Protein M with N-terminal peptide Avi- and Myc-tags (aka: armY) codon-optimized (for


human) nucleotide sequence (1,764 bp)


39) SEQ ID NO: 39



atgtacaggatgcaactcctgtcttgcattgcactaagtcttgcacttgtcacaaacagtatggctggtggcctgaatgacatctttgaggc






ccagaagatcgagtggcatgagggaggagagcagaagctgatctccgaggaagatctgctgagaaagcgggccgccaacggcggaggagga





tctggcggtggaggctctaccaatctggtgaaccagagcggatacgccctggtggcctctgggagaagcggaaatctgggatttaagctgttcagt





acccagtctccaagcgctgaagtgaagctgaaaagcctctccctgaacgacggctcttatcagagcgagatcgacctgagcggcggcgctaactt





ccgggagaagttccgcaacttcgctaatgagctgtctgaagccatcacaaacagccctaagggcctggatagacctgtgcccaagacagaaatca





gcggcctgatcaagactggagataactttatcacccctagctttaaggccggctactacgaccatgtggctagcgacggttcactgctgtcctactac





cagtctacagagtactttaacaaccgggtgctgatgcctatactgcagaccaccaacggcaccctgatggccaataacagaggctacgatgacgtg





ttccggcaggtgcccagcttcagcggctggagcaacacaaaggccacaaccgtgagcacctccaacaacctgacctacgacaagtggacctact





tcgccgccaagggctctccactgtatgacagctatcctaaccacttcttcgaggacgtgaagacactggccatcgacgccaaggacatctctgccct





gaagaccaccatcgacagtgagaaacctacatacctgattatcagaggactgtccggcaacggcagccagctgaacgagcttcagctgcctgaga





gcgtgaaaaaggtgagcctgtacggcgactacacaggcgtcaatgtagctaagcaaatcttcgccaacgtggtggaactcgaattctacagcacat





ccaaggccaacagcttcggcttcaaccccctggtgctgggcagcaagaccaacgtgatctacgacctgttcgccagcaagcctttcacccacatcg





acctgacacaagtgaccctgcagaacagcgataacagcgccattgatgccaacaagctcaaacaggccgtgggcgatatctacaactacagaag





attcgagaggcagtttcagggctacttcgccggaggctatatcgataagtacctggtcaagaacgtgaacaccaacaaggactccgacgacgacct





ggtgtaccggagcctgaaggaactgaacctgcacctggaagaggcctacagagagggcgataatacctactacagagtgaacgagaactactac





cccggagctagcatctacgagaacgagagagcctctagagatagcgagttccagaacgagatcctgaagcgggccgagcagaatggcgtgaca





ttcgacgagaacatcaagcggatcaccgccagcggcaagtactccgtgcagttccaaaaactggaaaatgacaccgacagcagcctggaaagaa





tgaccaaggctgtggaaggcctggttacagttatcggcgaggagaagtttgaaaccgtggacatcaccggcgtgagctccgataccaatgaggtg





aaatctctggccaaagaactgaagacaaatgccctgggcgtcaaattaaaactgtaa





Artificial


Protein M horseradish peroxidase (HRP) fusion protein with N-terminal Myc-tag codon-


optimized (for human) nucleotide sequence (2,631 bp)


40) SEQ ID NO: 40



atgtacaggatgcaactcctgtcttgcattgcactaagtcttgcacttgtcacaaacagtgagcagaaactcatctcagaagaggatctgg






cagcaaatcagctgaccccaaccttctacgacaattcttgtccaaacgtctccaacatcgtgcgggacaccattgtgaacgagctgagaagcgacc





ctagaatcgccgcttctatcctgagactgcatttccacgactgcttcgtgaatggctgcgacgcctccatcctgctggacaacaccaccagcttccgg





acagagaaagacgccttcggaaatgccaacagcgctagaggcttccccgttatcgacagaatgaaggctgccgtggaatctgcctgccctcggac





cgtgagctgtgccgacctgctgaccatcgccgcccagcagagcgtgaccctggccggcggtcctagctggcgggtgcctctgggccggagaga





tagtctgcaggccttcctggatctggctaatgctaacctccccgctcctttctttaccctgcctcagctgaaggacagctacggaacgtcggcctaaa





cagaagcagcgacctggtggccctgtccggaggccacaccttcggcaagaaccagtgcagattcatcatggaccggctgtacaacttcagcaata





ccggcctgccagatcctacactgaacacaacctacctgcagacactgagaggcctgtgccccctcaacgggaatctgagcgccttggtggacttc





gacctgagaacccctaccatcttcgacaacaagtactacgtgaacctggaagaacagaagggcctgatccaaagcgatcaggagctgttctcttcc





cctaatgccacagacaccatccccctggtgcggtcattcgccaacagtacccagaccttttttaacgcttttgtggaagccatggatagaatgggcaa





catcacccctctgaccggaacacagggacagatcagactgaattgcagagtggtgaacagcaactctggcggaggaggatctggcggtggagg





ctctggcggcggcggttcaacaaatctggtgaaccagagcggctacgccctggtggccagcggcagatccggcaatctgggcttcaagctgttca





gcacccagtctccatctgccgaggtgaagctgaagagcctgagccttaacgacggcagctaccagtccgagatcgacctgtcaggcggcgccaa





cttccgagaaaagttcagaaacttcgccaatgagctgagcgaggccatcacaaacagccctaaaggcctggacagacctgtgcccaagacggaa





atcagcggcctgatcaagacaggcgacaactttatcacccctagcttcaaggccggatattatgaccacgtggcctctgatggctccctactgagct





actaccagtccaccgagtacttcaacaacagagttctgatgcctatcctgcagacaacaaacggcactctgatggccaacaaccggggctacgac





gacgttttcagacaagtgccctctttcagcggctggagcaacacaaaggccaccactgtgtccacaagcaacaatctgacatacgataagtggacc





tatttcgccgccaaaggcagccccctgtacgacagctaccccaaccacttcttcgaggacgtgaagacactggccattgacgctaaggacatcagc





gccctgaaaaccaccatcgacagcgagaagcctacctacctgattatccggggactgagcggaaacggcagccagctgaacgagctgcaactg





cctgagtccgtgaaaaaggtgagcctgtacggcgactacaccggcgtgaacgtggctaagcagatcttcgccaacgttgtggaactggaattctac





agcaccagcaaggctaactcttttggctttaaccccctggtcctgggatctaaaacgaacgtgatctacgacctgttcgcaagcaagcccttcaccca





catcgacctgacacaggtgaccctgcaaaacagcgataattccgccatcgatgccaacaagctgaagcaagctgtgggcgatatctacaactaca





ggcggttcgagagacagtttcagggctacttcgccggaggctacatcgacaagtacctggtgaagaacgtcaataccaacaaggatagcgatgac





gatctggtctaccggagcctgaaagagctgaacctccacctggaggaagcctacagagaaggcgataacacctactacagagtgaatgagaact





attaccctggagctagcatctacgagaacgagagagccagcagagacagcgagttccagaacgagatcctgaagcgagccgagcagaacggc





gtgacatttgacgagaacatcaaaagaatcacagccagcggcaagtatagcgtgcagttccaaaagctagaaaatgataccgattccagcctggaa





agaatgaccaaggccgtggaaggccttgtgaccgtgatcggcgaggaaaagttcgagacagtggatatcaccggcgtgtctagcgataccaatg





aagtgaaaagcctggccaaggaactgaagaccaacgccctgggcgtcaagctgaaactctaa





Artificial


armY-Angiotensin-converting enzyme 2 (ACE2) fusion protein codon-optimized (for human)


nucleotide sequence (3,837 bp)


41) SEQ ID NO: 41



atgtacaggatgcaactcctgtcttgcattgcactaagtcttgcacttgtcacaaacagtcagagcacaatcgaggaacaggccaagac






cttcctggacaagttcaaccacgaagctgaagacctgttctaccaatctagcctggctagttggaactacaacaccaacattacagaagagaacgtg





cagaacatgaacaacgcaggcgacaagtggtccgccttccttaaagagcagtctacactggcccagatgtaccctctgcaagagattcagaatctg





accgtgaagctgcagctgcaggctctccagcagaatgggtccagcgtgctgtctgaggataagagcaagcggctgaacaccatcctgaatacaat





gagcaccatctacagcaccggcaaagtgtgtaaccctgacaacccccaggagtgtctgctgctggaacctggcctgaacgaaatcatggccaact





ccctggactacaacgagagactgtgggcctgggagagctggcgtagcgaggtgggaaaacagctgcgccccctgtatgaggagtacgtggtgc





tgaagaatgagatggccagagccaaccactacgaggactacggcgactattggagaggcgattatgaagtcaacggcgttgacggctacgacta





cagccggggacagctgatcgaagacgtggaacatacgtagaggagatcaagcctctgtacgagcacctgcacgcctacgtaagagccaaactg





atgaatgcctaccccagctacatctcccctatcggctgcctgcccgcccatctgctcggcgacatgtggggcagattctggaccaacctgtattctct





gacagtgcctacggccagaaacctaacatcgacgtgacagatgccatggtggaccaggcctgggatgcccaaagaatcttcaaggaagccgag





aaattcttcgtgtccgtggggctgcctaatatgacccagggcttctgggaaaacagcatgctcaccgatcctggcaacgtgcagaaggcagtgtgc





caccccaccgcctgggaccttggaaagggcgacttccggattctgatgtgcaccaaggtgaccatggacgacttcctgaccgctcaccacgagat





gggccacatccagtacgacatggcctacgccgctcagcctacctcctgagaaacggcgctaatgaaggcttccacgaggccgtgggcgaaatca





tgagcctgagcgccgccacccctaagcacctgaagtctatcggactgctgagccccgactacaggaggacaacgaaactgagatcaacttcttgc





tgaaacaggccctgacaatcgttggcaccctgccattacctacatgctggaaaagtggagatggatggtattaagggcgaaatccccaaggacc





aatggatgaagaagtggtgggagatgaagcgggaaatcgtgggcgtggtggaacctgtgccccacgacgagacatactgcgatcctgctagcct





ctttcacgtgagcaatgattactcattcatccggtactacaccagaactctgtaccagttccagttccaggaggccctgtgccaggccgccaagcacg





agggccctctgcacaagtgcgacatctctaacagcaccgaggccggccagaagctgttcaacatgctgagactgggcaagagcgaaccttggac





actggccctggagaacgtggtcggagccaagaacatgaacgtgagaccactgctgaactacttcgagcccctgttcacctggctgaaggatcaaa





acaagaacagcttcgtgggctggtccacagactggagcccatacgctgatcagagcatcaaagtgaggatctctctgaagagcgccctgggagat





aaggcctacgagtggaacgataatgagatgtacctgttcagaagcagcgtggcctacgccatgcggcagtacttcctgaaagtgaagaaccagat





gatcctgtttggcgaggaggatgtgagagtggccaatctgaaaccaagaatcagctttaactttttcgttaccgctcctaagaacgtgtctgatatcatc





cctagaaccgaggtggaaaaggccatcagaatgagccggtccagaatcaacgatgccttccgactgaatgacaactccctggagttcctgggaat





ccagcccaccctgggccctcctaaccagcctccagtcagcggcggaggaggatctggcggtggaggctctggcggcggcggttcaacaaatct





ggtgaaccagagcggctacgccctggtggccagcggcagatccggcaatctgggcttcaagctgttcagcacccagtctccatctgccgaggtga





agctgaagagcctgagccttaacgacggcagctaccagtccgagatcgacctgtcaggcggcgccaacttccgagaaaagttcagaaacttcgc





caatgagctgagcgaggccatcacaaacagccctaaaggcctggacagacctgtgcccaagacggaaatcagcggcctgatcaagacaggcg





acaactttatcacccctagcttcaaggccggatattatgaccacgtggcctctgatggctccctactgagctactaccagtccaccgagtacttcaaca





acagagttctgatgcctatcctgcagacaacaaacggcactctgatggccaacaaccggggctacgacgacgttttcagacaagtgccctctttcag





cggctggagcaacacaaaggccaccactgtgtccacaagcaacaatctgacatacgataagtggacctatttcgccgccaaaggcagccccctgt





acgacagctaccccaaccacttcttcgaggacgtgaagacactggccattgacgctaaggacatcagcgccctgaaaaccaccatcgacagcga





gaagcctacctacctgattatccggggactgagcggaaacggcagccagctgaacgagctgcaactgcctgagtccgtgaaaaaggtgagcctg





tacggcgactacaccggcgtgaacgtggctaagcagatcttcgccaacgttgtggaactggaattctacagcaccagcaaggctaactcttttggct





ttaaccccctggtcctgggatctaaaacgaacgtgatctacgacctgttcgcaagcaagcccttcacccacatcgacctgacacaggtgaccctgca





aaacagcgataattccgccatcgatgccaacaagctgaagcaagctgtgggcgatatctacaactacaggcggttcgagagacagtttcagggcta





cttcgccggaggctacatcgacaagtacctggtgaagaacgtcaataccaacaaggatagcgatgacgatctggtctaccggagcctgaaagagc





tgaacctccacctggaggaagcctacagagaaggcgataacacctactacagagtgaatgagaactattaccctggagctagcatctacgagaac





gagagagccagcagagacagcgagttccagaacgagatcctgaagcgagccgagcagaacggcgtgacatttgacgagaacatcaaaagaat





cacagccagcggcaagtatagcgtgcagttccaaaagctagaaaatgataccgattccagcctggaaagaatgaccaaggccgtggaaggcctt





gtgaccgtgatcggcgaggaaaagttcgagacagtggatatcaccggcgtgtctagcgataccaatgaagtgaaaagcctggccaaggaactga





agaccaacgccctgggcgtcaagctgaaactctaa





Artificial


armY-CD209 (DC-SIGN) fusion protein codon-optimized (for human) nucleotide sequence


(2,706 bp)


42) SEQ ID NO: 42



atgtaccgaatgcagctgctgtcttgtattgccctgtccctggccctggttaccaattctcaagtgagcaaggtgcccagcagcatctctc






aggagcagagcagacaggacgccatctaccagaacctgactcaactgaaggcggctgtgggcgaactgagcgagaagtctaagctgcaggag





atctatcaggaactgacacaactgaaggctgccgtgggggaattacccgagaagagcaagctgcaggaaatctaccaggagctgaccagactca





aagccgccgtgggcgagctgccagagaagtctaaactgcaggaaatctaccaggaattgacatggctgaaggcagctgttggcgagctgcctga





gaaaagcaagatgcaggagatttaccaggagctcacacggctgaaggccgccgtcggcgaactccccgagaaaagcaagcagcaggagatct





accaggagcttacaagacttaaggccgctgtgggagagctgcctgagaagtccaaacaacaggaaatctaccaagaactgaccagactgaaagc





cgccgtgggagaactgccagaaaaaagcaagcagcaggagatctaccaagaactgacacagcttaaagcagctgttgagcggctgtgtcaccca





tgcccttgggagtggacattcttccagggcaactgctacttcatgagcaatagccaaaggaactggcacgacagcatcacagcctgcaaggaagt





gggggcccagctggtggtgatcaagtccgccgaagaacaaaatttcctgcagctgcagtcctccagaagcaacagattcacatggatgggcctgt





cagacctgaaccaagaaggcacctggcagtgggtcgatggcagccccctgctgccctattcaagcagtactggaaccgcggcgagcctaacaa





tgtgggcgaggaagattgcgccgagtttagcggcaacggctggaatgacgacaagtgcaacctcgccaagttctggatctgtaaaaagtccgccg





cctcctgcagccgcgacgaggagcagtttctgtcccctgcccccgccacccctaatcctcctcccgccggcggtggcggaagcggcggcggcg





gcagcggaggaggcggcagcaccaacctggtgaatcagagcggctacgccctggtggcctctggtagatctggcaacctgggattcaagctgtt





cagcacacagtctcctagtgccgaagtgaagctgaagtcactgagcctgaacgacggcagctaccagagcgaaatcgacctgtctggcggtgcta





acttcagagagaagttccggaacttcgccaacgagctgtccgaggccattaccaacagtcccaagggcctggaccggcctgtgcctaagaccga





gatcagcggcctgatcaagaccggcgacaacttcatcacccctagattaaggctggctactacgaccacgtggcctccgatggctctctgctgtcc





tattatcagagcacagagtacttcaacaatagagtgctgatgcctatcctgcaaacaaccaacggcaccctgatggccaataataggggatacgac





gacgtattcggcaggtgcctagcttctccggctggagcaacaccaaggccacaaccgtgtctacaagcaacaacctgacatacgacaagtggac





ctactttgccgccaaggggagccctctgtacgactcttatcctaatcatttcttcgaggacgtgaagaccctggccatcgatgccaaggatatcagcg





ccctgaagaccaccatcgacagcgaaaaacccacctacctgatcatccggggcctgagcggcaatggcagccagctgaacgaactgcagctgc





cagaaagcgtgaagaaggtgtctctgtacggcgactacaccggcgtgaacgtggctaagcagatcttcgccaatgttgttgagcttgagttctacag





cacgagcaaggccaactcattcggatcaaccccctggtgctgggaagtaagacaaacgtgatctatgacctgtttgccagcaaacctttcacccac





atcgacctgacccaggtgaccctgcagaacagcgacaacagcgccattgatgctaacaagctgaaacaggccgtgggagacatctacaactacc





ggagattcgagagacagttccaaggctacttcgccggcggctatatcgataagtacctggtgaaaaacgtgaacaccaacaaggatagcgatgac





gacctggtgtacagaagcctgaaggaactgaacctgcacctggaggaagcctacagagaaggcgataacacatactacagagtgaacgagaact





actaccctggagccagcatctacgagaacgagagagcctctcgggactccgagttccagaacgaaatcctgaaacgggccgagcagaacggcg





tgacatttgatgaaaacatcaagagaatcaccgctagcggcaagtacagcgtgcagtttcagaagctggagaacgacactgattctagcctggaaa





gaatgaccaaggcggtcgagggcctggtgaccgtgatcggcgaggagaagttcgaaaccgtggacatcaccggcgtgtccagcgacaccaatg





aggtgaaatctctggccaaagagctgaagaccaacgccctcggagtgaagctgaagctgtaa





Artificial


armY-C-type lectin domain family 4 member M fusion protein codon-optimized (for human)


nucleotide sequence (2,655 bp)


43) SEQ ID NO: 43



atgtaccggatgcagctgctgtcttgtatcgccctgagcctggccctggtcaccaattctcaggtgtctaaggtgccttctagcctgagcc






aggagcagtctgagcaggacgctatctaccagaacctgacacagcttaaggccgctgtgggcgaactgtcagaaaagtctaagctccaagagatc





taccaggagcttacacagctgaaagccgccgtgggcgagctgcctgagaagtccaagttgcaagagatctaccaggagctgacccggctgaaag





ccgccgtgggagagctgcccgagaagagcaaactgcaggaaatctatcaggagctgaccagactgaaggccgccgtgggagagctgcccgag





aaatccaagctacaggagatctaccaggagctgacaagactgaaggccgcagtgggcgagctgccagaaaagagcaagctgcaggagatctac





caggaactgacagagctgaaggccgccgttggagaactgcctgaaaagtccaaactgcaggaaatctatcaggagctgacacagctgaaggctg





ccgtgggcgaactccctgaccagtccaagcagcagcagatttaccaggaactgaccgacctgaaaacagccttcgagagactgtgtagacactgc





cctaaggactggacattcttccagggcaactgctacttcatgagcaacagccagcggaactggcacgacagcgtgaccgcctgtcaggaggtgc





gggcccagctggtggtcatcaagaccgccgaagagcaaaacttcctgcagctgcaaacaagcagaagcaacagattcagctggatgggcctga





gcgatctgaaccaggagggcacctggcagtgggtggatggaagccctctgtctccaagcttccaaagatactggaacagcggagagcctaacaa





ctctggaaatgaggactgcgccgagttcagcggttctggctggaatgacaacagatgcgacgtggacaactactggatctgcaagaaacccgccg





cctgcttccgagatgagggcggtggcggaagcggcggcggaggcagcggaggcggcgggagtaccaacctggtgaatcagagcggctacgc





cctggtcgcctcgggcagatccggcaatctgggcttcaagctgttcagcacacaaagcccttctgctgaagtgaaactgaagagcctgagcctgaa





tgatggctcttaccagagcgagatcgacttatccgggggagccaactttcgggaaaaattcagaaacttcgctaacgagctgagcgaggccatcac





caactcccccaagggcctggatagacctgtgcccaagacagagatcagcggcctgatcaagaccggcgataacttcatcacccctagctttaagg





ccggatactacgaccacgtggcttccgatggcagcctgctgagctactaccagagcaccgagtacttcaacaacagagtactgatgcctatcctgc





agacaacaaatggcaccctgatggccaacaataggggctacgatgacgtgttcagacaggttccttcattcagcggctggagcaatacgaaggct





acaaccgtgtcgaccagcaacaacctgacctatgacaagtggacctacttcgccgctaagggcagccctctgtacgacagctaccccaaccacttc





ttcgaggatgtgaaaaccctggccattgacgccaaggacatcagcgccctgaaaaccaccatcgacagcgagaagcctacatacctgatcatcag





aggcctgtcaggcaacggctcccagctgaacgaactgcaactgccagagagtgttaagaaggtgagcctgtacggcgactatacaggagtgaac





gtggctaagcagatcttcgctaatgtggtggaactggaattctacagcaccagcaaagccaacagcttcggctttaaccccctggtgctgggcagca





agaccaacgtgatctacgaccttttcgccagcaagcccttcacccacatcgacctgacccaggtgaccctgcagaatagcgacaattctgccattga





cgccaacaagctgaaacaggccgtgggcgatatctacaactacaggcggttcgaaagacagttccaaggctattttgccggcggctacatcgaca





agtacctggtcaagaacgtgaacaccaacaaggattccgacgacgatctagtgtaccggagcttgaaggaactcaacctgcatctggaagaggcc





tacagagaaggcgacaacacatactaccgcgtgaacgagaactactaccctggcgccagcatctacgagaacgaacgggcttctagagatagcg





agtttcagaatgaaatcctgaagagagccgaacagaacggcgtgaccttcgacgagaacattaagcggatcacagcctctggcaagtacagcgtg





cagtttcagaagctggaaaacgacaccgacagctctctcgagagaatgaccaaggccgttgagggcctggtgacagtgatcggcgaggaaaagt





tcgaaaccgtggacatcaccggcgtgtcctctgataccaacgaggtgaagagcctggcaaaggaactgaagaccaacgccctgggcgtgaagct





gaagctgtaa





Artificial


armY-CD4 fusion protein codon-optimized (for human) nucleotide sequence (2,781 bp)


44) SEQ ID NO: 44



atgtacagaatgcagctgctgagctgcatcgccctgtccctggccctggttacaaacagcaagaaggtggtgctgggaaaaaagggc






gacaccgtggaactgacctgcaccgctagccagaagaagagcatccaatttcactggaagaacagcaaccagatcaaaatcctggggaaccagg





gctctttcctgacaaagggcccctctaagctgaatgatagagccgacagccggagatcgctgtgggaccagggcaacttccccctgatcatcaaga





acctgaagatcgaggatagtgacacatacatctgcgaggtggaagatcagaaggaagaggtgcaactgctggtgttcggactgaccgccaacag





cgacactcacctgctgcagggccagtctctcacactaaccctggaaagccctcctggaagctctccaagcgtccagtgtagatctcctagaggcaa





gaacatccagggcggcaagaccctactgtgtctcagctggagctgcaggactcaggcacctggacatgtaccgtactgcaaaatcagaaaaagg





tggaattcaagatcgacatcgttgtgctggccttccagaaggccagcagcatcgtgtacaagaaggaaggagagcaggtggagttttctttccctct





cgcctttaccgtggaaaaactgaccggttcaggcgagctgtggtggcaggccgagcgcgcaagctccagcaagagctggatcacattcgacctta





agaacaaagaggtgagcgtgaagagagtgacccaggaccccaagctgcagatgggcaagaagctgcccctgcacctgaccctcccgcaagcc





ctgcctcagtacgccggatccggcaacctgacactggccctcgaagccaaaaccggaaagctgcaccaggaggtgaacctggtggtgatgaga





gccacccagctgcagaaaaatctgacctgcgaagtgtggggccctacaagccctaagctcatgctgagtcttaaactggagaacaaggaggctaa





agtgagcaagcgggaaaaggccgtgtgggtgctgaatcctgaggccggcatgtggcagtgcctgctgtctgacagcgggcaagtgctgctggaa





tctaacatcaaggtcctgcccacctggtccacccctgtgcagccaggcggcggaggatctggcggcggcggcagcggaggcggcggctccac





caacctggtgaatcagagcggctacgccctggtggctagcggtagatccggcaatctgggattcaagcttttctccacacagagccctagcgccga





agtgaagttgaaatctctgagcctgaacgacggctcctaccagtccgagatcgacctgagcggcggcgctaatttcagagagaagtacggaactt





cgccaatgagctgtctgaagctatcaccaacagccctaaaggacttgatcgcccagtgcccaagaccgagattagcggcctgatcaagacaggcg





ataactttatcacccctagtttcaaggctggctattatgaccacgtggccagcgacggaagcctgctgagctactaccagagcacagagtacttcaac





aaccgggtgctgatgcctatcctgcagaccaccaacggcacgctgatggccaacaacagaggctacgacgacgtgttccggcaggtgcctagctt





tagcggatggagcaacaccaaggctacaactgtgagcaccagcaacaacctgacctacgataagtggacctacttcgccgccaaaggcagccct





ctgtacgatagctaccctaaccacttcttcgaggacgtgaagacactggctatcgacgccaaggacattagcgccctgaaaaccacaattgactctg





aaaagcccacctacctgatcatcagaggactgagcggcaacggcagccagctgaacgagctgcagctgcctgaatctgtgaaaaaagtcagcctt





tacggcgactacaccggcgtgaacgtggccaagcagatcttcgccaatgtggtggaactggagttctacagcacctctaaagccaacagtacggc





ttcaaccccctggtgctgggctctaaaaccaatgtaatttatgacctcttcgctagcaagcctacacacacatcgatctgacccaggtgacactgcag





aactctgacaacagcgccatcgatgccaataagctgaagcaggccgtgggcgacatctacaactaccggagattcgagagacagtacagggcta





ctttgccggcggctacatcgataagtacctggttaagaacgtgaataccaacaaggactctgatgacgacctggtgtacagaagcctgaaggaact





gaacctgcatctggaagaggcctacagagaaggcgacaacacctactatcgggtgaatgagaactactatcccggcgcttctatctacgagaatga





gcgggccagcagagatagtgagttccaaaatgagatcctgaagcgggcagagcaaaacggcgtgaccttcgacgagaacatcaagagaatcac





cgcctccggcaaatacagcgtgcagttccagaaactggaaaacgacactgatagcagcctggaacggatgaccaaggccgtagagggcctggt





caccgtgatcggcgaggagaagtttgagacagtggacatcacaggcgtgagctccgataccaacgaggtgaagagcctggccaaggaactgaa





gaccaacgccctgggagtgaagctgaagctataa





Artificial


armY-Synaptic vesicle glycoprotein 2A fusion protein codon-optimized (for human)


nucleotide sequence (2,058 bp)


45) SEQ ID NO: 45



atgtacagaatgcagctgctgtcatgcatcgccctctccctcgccctggtgaccaacagccccgacatgatcagacacctgcaggccg






tcgactacgccagcagaaccaaagtgttccccggagaacgggtggaacacgtgacatttaacttcaccctggaaaaccagatccacagaggcgg





ccagtacttcaacgacaagttcatcggcctgagactgaagtccgtgtccttcgaggatagcctgtagaggaatgctactagaggacgtgacatcta





gcaatacctttttccggaactgcacattcatcaacaccgtgttctacaacaccgatctgtttgaatacaagttcgtgaacagcagactgatcaacagca





cctactgcacaacaaggagggctgtcctttagatgtgaccggaacgggcgagggcgcctacatggtgtacggcggcggaggctccggcggcg





gtggcagcggtggaggaggcagcaccaatctggtcaaccaatctggctatgccctggtcgccagtggcagaagcgggaacctgggcttcaagct





gttcagcacacagagccctagcgctgaagtgaaactgaagagcctgtctctgaacgacggctcttatcagagcgagatcgacctgtccggaggcg





ccaatttcagagagaagttcaggaacttcgccaacgagctgagcgaggccatcaccaattcccctaagggactggatagacctgtgccaaaaacc





gagattagcggcctgattaagaccggagataatttcatcacacccagctttaaggccggatattacgaccacgtggcctctgacggcagcctgctga





gctactaccagagcaccgagtacttcaacaaccgggtgctgatgcctatcctgcaaacaacaaatggcacactgatggccaacaaccggggatat





gacgacgtgttccgccaggtgcccagcttcagcggctggagcaacacaaaggctacaaccgtgtctaccagcaacaacctgacctacgataagtg





gacctacttcgccgctaaaggcagccctctgtacgacagctaccccaaccacttcttcgaggacgtcaagaccctggcgatagacgccaaagaca





tcagcgctctgaagaccaccatcgacagcgaaaagccaacatacctgatcatcagaggcctgagcggcaacggctcacagctgaacgagctgca





gctgcctgagagcgtgaaaaaggtgtcactgtacggcgattacaccggcgtgaacgtggccaagcagatcttcgcaaacgttgtggaactggaatt





ctactctacaagcaaggccaacagcttcggctttaatcctctggtgctggggtctaagacaaacgtgatctacgacctgttcgccagtaagcctacac





ccacatcgacctgacccaggttacactgcagaactccgacaacagcgccatcgacgccaacaagctgaaacaggccgtgggcgacatctacaac





tacaggagattcgaaagacagttccagggctattttgccggcggctacatcgacaagtacctggtgaagaacgtgaataccaacaaggactctgat





gacgatctcgtgtaccggagcctgaaggaactgaatctgcatctggaagaagcttaccgggaaggcgacaatacctactacagagtgaacgagaa





ctactaccctggcgctagcatctacgagaacgaacgggccagcagagattctgagttccaaaacgagatcctgaagcgggccgagcagaatggc





gtcaccttcgacgagaacatcaagagaatcaccgcctctggcaaatacagcgtgcagttccaaaaactggaaaacgatactgatagctcccttgag





agaatgaccaaggccgtggaaggactggtgaccgtgatcggcgaagagaagttcgagacagtggacatcacaggcgtgtccagcgataccaat





gaggtgaagagcctggccaaggagctgaaaaccaacgccctcggcgtgaagctgaagctgtaa





Artificial


armY-Synaptic vesicle glycoprotein 2B fusion protein codon-optimized (for human)


nucleotide sequence (2,040 bp)


46) SEQ ID NO: 46



atgtacagaatgcagttgctgtcttgtatcgccctcagcctggctctggtgacgaatagcccagacatgatccgctacttccaggacgag






gaatacaagagcaagatgaaggtgttctaggcgagcatgtgtacggcgccaccatcaacttcaccatggaaaaccagatccaccagcacggcaa





gctggttaatgacaagtttacaagaatgtactttaagcacgtgctgttcgaggataccttttttgatgagtgctacttcgaggacgtgacaagcaccgac





acatacttcaagaactgcaccatcgagagcaccatcttctacaacaccgacctgtatgagcacaagttcatcaactgcagatttatcaacagcaccttc





ctggaacagaaagagggctgccacatggacctggaacaagacaatgatggaggcggaggaagcggcggcggaggcagcggcggcggggg





aagcaccaatctggtgaatcaaagcggctacgccctggtggctagcggcagaagcggcaacctgggcttcaagctgtttagcacacagagcccta





gcgctgaagtgaagctgaagtctctctctctgaatgacggctcctaccagtctgagatcgacctcagcggaggcgccaacttcagggaaaagttcc





ggaacttcgccaacgagctgagcgaggccattacaaacagccctaagggcctggacagacctgtgcccaagaccgagatcagcggcctgatca





agactggagataattttattacccctagcttcaaggcaggctactacgaccacgtggcctccgatggctctctgctgtcctattatcagagcacagagt





actttaacaacagagtgctgatgcctatcctgcagaccacaaacggcaccctgatggccaacaatagaggctatgatgatgtgttcagacaggtgcc





ttctttcagcggatggtccaacacaaaggccacaacagtttctacaagcaacaacctgacctacgataagtggacatacttcgccgccaagggctct





ccactgtacgacagctaccctaaccacttcttcgaagatgtgaagaccctggccatcgacgccaaggacatcagcgcccttaaaacaaccattgac





agcgagaagcctacctacctgatcatcagaggactgagcggaaacggctcccagctgaacgaactgcaactgcctgagtctgtgaaaaaggtga





gcctgtacggcgattacaccggcgttaacgtggctaaacagatcttcgccaacgtggtggaactggagttctacagcaccagcaaggccaatagct





tcgggttcaaccccctggtccttggctccaaaaccaacgtcatctacgacctgttcgcttctaagcccttcacacacatcgacctgacccaggttaccc





tgcagaactcagacaacagtgctatcgacgccaacaaactgaagcaggccgtgggcgatatctataactaccggagattcgagcggcagttccaa





ggctacttcgccggcggatatatcgacaagtacctggtcaagaacgtgaacaccaacaaggacagcgatgacgacctggtgtaccggagcctga





aggaactgaacctgcacctggaagaagcctaccgggaaggcgacaacacctactaccgggtgaacgagaattactaccccggcgctagcatcta





cgagaacgagagagcctccagagattcagagttccagaacgagatcctgaaaagagccgagcagaatggcgtgaccttcgacgagaacatcaa





gcggatcacagcctctggcaaatacagcgtgcagttccagaagctggaaaatgataccgatagcagcctggaaagaatgaccaaggcggtggaa





ggcttggtcaccgtgatcggcgaggagaagttcgagacagtggacatcaccggcgtgtccagcgacaccaacgaggtgaaaagcctggccaag





gaactgaagaccaacgccctgggcgtgaagctgaagctgtaa





Artificial


armY-Synaptic vesicle glycoprotein 2C fusion protein codon-optimized (for human)


nucleotide sequence (2,028 bp)


47) SEQ ID NO: 47



atgtaccgcatgcagctgctgagctgcatcgccctgagcctggctctggtgacaaacagcaaacctctgcagagcgacgagtacgcc






ctgctgacaagaaacgtcgagcgggacaagtacgccaattttaccatcaactttaccatggaaaaccagatccacaccggaatggaatacgataat





ggcagattcattggcgttaagttcaaaagcgtgacattcaaagatagcgtgttcaagagctgtacattcgaagatgtgaccagcgtaaatacctacttc





aaaaactgcaccttcatcgacaccgtgttcgacaacaccgatttcgagccttacaagttcatcgacagcgagttcaagaactgcagctttttccacaac





aaaaccggatgtcagatcaccttcgacgacgactacagcggcggcggcggctcgggcggaggaggctctggtggcggcggcagcacaaacct





ggtcaaccagagcgggtatgccctggtggccagcggcagaagcggcaatctgggcttcaagctgttcagcacacagtccccaagcgctgaggtg





aagctcaaatctctgtcccttaacgacggcagttaccaaagcgagatcgacctgagcggcggagccaacttccgggaaaagttcagaaatttcgct





aatgaactgagcgaggccatcacgaatagccctaagggcctggatagacccgtgcccaagactgagatcagcggcctgattaagacaggagata





acttcatcacacctagcttcaaggccggctattacgaccacgtggcctcagacggctccctgctgagctactaccagagcacagagtacttcaacaa





ccgggtgctgatgcctatcctgcagaccaccaacggaacactgatggccaacaacagaggctatgacgatgtgtttagacaggtcccctcttttagc





ggatggtccaacaccaaggctacaacagtgtccaccagcaacaacctgacctacgacaagtggacatatttcgccgccaagggaagccctctgta





cgacagctacccaaaccacttcttcgaggacgtgaagaccctggccattgacgccaaagacatcagcgccctgaagaccacaatcgattctgaga





aacctacctatctgatcatcagaggactctctggcaacggcagccagctgaacgagctgcagctgcctgagagcgtgaaaaaggtgtccctgtac





ggcgattacaccggcgtgaacgtggccaagcagatcttcgccaacgtggtggaacttgagttctacagcaccagcaaggccaattctttcggcttca





accccctggtcctgggcagcaagacaaatgtgatctacgacctgttcgcctctaagcctttcacccacatcgacctgacccaggtgacactgcaaaa





ttccgataacagcgccatcgacgctaacaagctgaagcaggccgtgggcgacatctacaactaccggcggtttgagcggcagtttcagggctactt





tgctggcggatacatcgacaagtacctggtgaagaacgtgaacacaaacaaggactctgatgacgacctggtttaccggtctctgaaggaactgaa





cctccatctggaagaagcctacagagaaggcgacaacacctactacagggtgaacgagaactactaccccggcgctagcatctacgagaacgaa





agagcctctagagatagcgaatttcagaacgagatcctgaagagagctgaacagaatggcgtgacctttgatgagaacatcaagcggatcaccgc





ctccggcaagtacagcgtgcagttccaaaagctggagaatgataccgactccagcctggaaagaatgaccaaggcagtggagggcctggtgacc





gtgatcggcgaggaaaagttcgagacagtggacatcaccggcgttagcagcgacaccaacgaggtgaagtctctggccaaggaactgaagacc





aacgccctgggagtgaaactgaagctgtaa





Artificial


armY-Synaptotagmin I fusion protein codon-optimized (for human) nucleotide sequence


(1,839 bp)


48) SEQ ID NO: 48



atgtacagaatgcagctgctgagctgcatcgccctgagcctggccctggttacaaacagcatggtgtccgagagccaccacgaggcc






ttagcagctcctcctgtgaccaccgtggctacagtgctgcccagcaatgccaccgagcctgccagccctggagagggaaaagaggacgcctttag





caagctgaaggagaagttcatgaacgagctgcataagatccctctgcctggaggtggcggcagcggaggaggtggctccggcggcggcggctc





caccaacctggtgaaccagagcggctacgccctggtggccagcggaagaagcggcaacctgggcttcaagctgttttctacgcagagccccagc





gccgaagtgaagctgaagagcctgtcactgaacgacggcagctatcagtctgagatcgacctgtctggcggggccaatttcagagagaaatttag





aaacttcgctaatgagctgagcgaggccatcaccaactcgcccaagggcctggacagacctgtgcccaagaccgaaatcagcggcctgattaaa





acaggcgataacttcatcaccccttcttttaaggctggctactacgaccacgtggccagcgatggcagcctgctgtcttactaccagagcacagagt





actttaacaacagagtgctgatgcctatcctgcagaccaccaacggaacactgatggccaacaaccggggctacgacgacgtcttcagacaggtg





cctagcttctctggctggtccaacaccaaggcgacaaccgtgtccaccagcaacaatctgacatacgataagtggacctacttcgctgccaagggc





tccccactgtacgactcttatccaaaccacttcttcgaggatgtgaaaactctggctatcgacgccaaggacatcagcgctctgaagaccacaatcga





cagcgaaaagcccacctacctgatcatcagaggactgagcggaaatggctcacagctgaacgaactgcagctgcctgagtctgtgaagaaggtgt





ccctctacggcgactacaccggcgtcaacgtggccaagcaaatcttcgccaatgtggtggaactggaattctacagcaccagcaaggccaacagc





ttcggcttcaaccccctggtgctggggagcaaaacaaacgtgatctatgacctgttcgccagcaagcctttcacccacatcgatctgacccaagtga





ccctgcagaacagcgataatagcgccatcgacgccaacaagctcaagcaggccgtgggcgatatctacaactacaggcggttcgagagacagtt





tcagggctacttcgccggcggctacatcgacaaatacctggtcaagaacgtgaacaccaacaaagactctgatgacgacctggtctaccggagcc





tgaaagagcttaatctgcacctggaagaggcctaccgggaaggcgacaacacatactacagagtgaacgagaactactacccaggcgccagtatt





tacgagaacgaacgcgcctctagagatagcgagttccaaaatgagattttaaaaagagccgagcagaacggcgtgacattcgacgagaacatcaa





gcggatcaccgcctccggcaagtacagcgtgcagttccagaagctggaaaatgataccgacagcagcctggaacggatgaccaaggccgtgga





aggcctggtgaccgtgatcggcgaggaaaagttcgaaaccgtcgacatcacaggcgtgtctagcgacaccaatgaggtgaagagccttgctaag





gaactgaagacaaacgccctgggcgtgaaactgaagctgtaa





Artificial


armY-Synaptotagmin II fusion protein codon-optimized (for human) nucleotide sequence


(1,854 bp)


49) SEQ ID NO: 49



atgtaccggatgcagctgctgagctgcatcgccctgtccctggccctggtgacaaacagcatgagaaacattttcaagagaaaccagg






agcctatcgtggcccctgctacaaccacagccacaatgcctatcggccctgtggataattcgactgaaagcggcggagccggcgagtcccaaga





agatatgttcgccaagctgaaagagaaactgttcaacgagatcaacaagatccccctgcctccaggcggcggcggcagcggaggaggcggcag





cggtggcggcggcagcacaaatctggtaaaccagagcggctacgccctggttgcctccggaagaagcggaaacctgggatttaagctgttcagc





acccagtccccatctgctgaagtgaaactgaagagcctgagcctgaatgacggctcttaccagagcgagatcgacctgagtggaggcgccaattt





cagagagaaattccgcaacttcgccaatgagctgagcgaggccatcaccaacagccctaagggcctggacagacctgtgcccaagaccgaaatc





agcggactgatcaagaccggcgacaacttcatcaccccttcttttaaggctggatattacgaccacgtggcctctgacggatctctgctgagctacta





ccagtctaccgagtacttcaacaaccgggtgctgatgccaattcttcagacaaccaacggcaccctgatggccaacaatagaggctacgacgatgt





gttccggcaagtgcctagcttttctggctggagcaacaccaaggccaccaccgtgtccaccagcaacaacctcacctatgataagtggacctacttt





gctgctaaaggcagccccctgtacgactcttatcctaaccacttcttcgaagatgtgaagaccctggctatcgatgccaaggacatcagcgccctga





aaaccaccatcgacagcgagaagcccacctacctgatcatcagaggcctatctggcaacggcagccagctgaacgagctgcagctccctgagag





cgtgaagaaggtgtctctgtacggcgattacaccggcgttaatgtggctaaacagatcttcgccaacgtggtggaactggaattctacagcacatcta





aagcaaacagttttggcttcaatcctctggtgctgggcagcaagaccaacgtgatctacgacctgtttgctagcaagcccttcacacacatcgatctg





acccaggtgaccctgcaaaactccgataatagcgccattgacgccaacaaactcaagcaggccgtgggcgatatctacaactacaggcggttcga





gagacagttccagggctacttcgccggcggatatatcgacaagtacctggtcaagaacgtcaacacaaacaaggacagcgatgacgacctggtct





accggagcctgaaggaactgaacctgcatctggaggaagcctacagagaaggcgacaacacctactacagagtgaacgagaactactaccccg





gcgccagcatctacgagaatgaaagagcctcaagagattccgagttccagaacgagatcctgaagcgggccgagcagaacggcgtgacattcg





acgagaacatcaagcggatcaccgccagcggcaagtacagcgtgcagtttcagaagctggaaaacgacaccgactcaagcctggaaagaatga





caaaggccgtggaaggcctggtgactgtgatcggcgaagagaagttcgagacagtggacatcacaggcgtgtctagcgacaccaacgaggtga





aaagcctggccaaggaactgaagacaaacgccctgggcgtgaagctgaagctataa





Artificial


armY-HLA class II histocompatibility antigen, DRB1 beta chain fusion protein codon-


optimized (for human) nucleotide sequence (2,262 bp)


50) SEQ ID NO: 50



atgtaccggatgcagctgctgagctgcatcgccctgtctcttgccctggtgaccaactctggagacaccagacctagattcctgtggca






gcccaagagggaatgtcactttttcaacggtacagagcgggtgagattcctggaccggtacttctacaaccaggaggaaagcgtgcggtttgatag





cgacgtgggcgagttccgggctgtgactgaactgggccggcccgatgccgagtactggaacagccagaaggatatcctggagcaggccagag





ccgcagtggacacctactgcagacacaactacggcgttgtggaaagcttcaccgtgcaaagaagagtgcagcctaaagtgaccgtgtacccatct





aaaacacagcctctgcagcaccacaatctgctggtatgcagcgtgtccggcttctaccctggcagcatcgaggtgcggtggttcctgaacggccag





gaggaaaaagccggcatggtgtctaccggcctgatccagaatggcgactggaccttccagaccctggtgatgctggaaacagtgcctagatccgg





cgaggtgtacacctgccaggtggagcaccccagcgtcaccagcccactgaccgtggaatggcgggccagatctgagagcgctcagagcaagg





gcggcggcggaagcggcggcggaggaagcggcggcggcggcagcacaaatctggtcaaccagagcggctacgccctggtggccagtggca





gaagcgggaacctgggctttaagctgtttagcacccagagccccagcgccgaagtgaagctgaaaagcctgtccctgaacgacggcagctacca





gagcgagatcgacctgtccggcggagccaacttcagagagaagttcagaaactttgccaacgagctgagcgaggccattacaaatagccctaag





ggcctggatagaccagtgcctaagaccgagattagcggcctgatcaagaccggcgataacttcatcacaccttcctttaaggccggttactatgacc





acgtggccagcgacggctccctcctgagctactatcagtctaccgagtacttcaacaaccgggtgctgatgcctatcctgcaaacaacaaacggca





ccctgatggccaacaacagaggctacgacgatgtgttcagacaagtgccctctttcagcggatggagcaacaccaaggctacaaccgtctccacta





gcaacaacctcacctacgacaagtggacctattttgccgccaagggcagccctctgtacgacagctaccctaaccacttcttcgaggacgtgaaga





ccctggccatcgacgctaaggacatcagcgcccttaagaccacaatcgattctgagaagcctacctacctgatcatccggggcttatctggcaacg





gctctcagctgaatgagctgcagctgccggaaagcgtgaagaaggtgtccctctacggcgactacacaggcgtgaatgttgccaagcagatcttc





gccaacgtggtggaactagaattctactccaccagcaaggctaacagctttggcttcaatcctctggtgctgggcagcaaaaccaatgtgatctatga





tctgttcgcttctaagcccttcacccacatcgatctgacacaggtgaccctgcagaacagcgacaatagcgccatcgacgctaacaagctgaaaca





ggctgtgggcgacatctacaactaccggagattcgagagacaattccagggctacttcgccggaggatatatcgacaagtacctggtgaaaaacgt





gaacaccaacaaggattctgatgacgacctggtttacaggagcctgaaggaactgaaccttcatctggaagaagcctacagagagggcgacaata





catactacagagtgaacgagaattactaccccggcgccagcatctacgagaacgaaagagcctctagagacagcgagttccaaaacgaaatcctc





aagcgcgctgagcagaacggagtgacattcgacgagaacattaagcggatcaccgccagcggcaagtacagcgtccagttccagaaactggaa





aacgacaccgattctagcctggaaaggatgaccaaggccgtggaaggcctggtaacagtgatcggagaggagaaattcgagacagttgacatca





ccggggtgagcagcgatacaaatgaggtgaagtctctggccaaggaactgaaaaccaacgccctgggagtcaagctgaagctgtaa





Artificial


armY-HLA class II histocompatibility antigen, DR alpha chain fusion protein codon-


optimized (for human) nucleotide sequence (2,241 bp)


51) SEQ ID NO: 51



atgtaccggatgcagctgctgtcatgcatcgccctgagcctcgctctggttaccaatagcatcaaggaagagcacgtgatcatccaggc






cgagttctacctgaatcctgatcagagcggagagttcatgttcgacttcgacggcgatgagatctacatgtggacatggccaaaaaggaaaccgtgt





ggcggctggaagagtaggccggttcgcctccttcgaggcccagggagctaggccaatatcgccgtggacaaggccaatctggagatcatgacc





aagcggagcaactacacccctatcaccaacgtgccacctgaggtgacagtgctgaccaatagccccgtggagctgcgggaacctaacgttctgat





ctgcttcatcgacaagtttacaccccccgtggtgaatgttacatggctgagaaacgggaagcctgtgaccacaggagtgtccgagacagtgttcctg





cctagagaagaccacctgttccggaagttccactacctgcccttcctgccttccaccgaggacgtgtacgattgtagagtggaacactggggcctgg





acgagcctctcctgaagcactgggagtttgacgcaccatcccctctgcctgagacaaccgaaggcggaggcggctccggcggcggaggtagcg





gaggcggcggcagcaccaacctggtcaaccagtccggatacgccctggtggccagcggcagatctggcaatctcggcttcaagcttttcagcac





gcagtcccctagcgccgaagtgaaactgaaatctctgtctctgaacgacggcagctaccagagcgagatcgacctgagcggcggcgccaatttca





gagagaagtttcggaacttcgccaacgagctgtccgaggctattaccaacagtccaaagggactggatagacctgtgcccaagaccgagatcagc





ggcctgatcaagacaggcgacaacttcatcacccctagcttcaaggccggctactacgaccacgtggcttctgatggctctctactgagctactacc





agagcacagaatactttaacaatagagtgctgatgcctatcctgcagaccactaacggcaccctgatggccaacaacagaggctacgacgacgtgt





tcagacaagtgccttcttttagcggatggtccaacacgaaggccaccacagtgtctacatctaacaacctgacatatgacaagtggacctacttcgcc





gccaagggcagccctctgtacgacagctatcctaatcacttcttcgaggatgtgaaaacactggctatcgacgcgaaagacattagcgccctgaag





accaccatcgatagcgaaaagcccacctacctgatcatcagaggcctctctggcaacggctctcagctgaacgagctgcaacttccggagagcgt





gaagaaagtgtccctgtacggcgactacaccggcgtgaacgtcgctaaacagatctttgccaacgtcgtggaactggaattctatagcaccagcaa





ggccaacagcttcggcttcaaccccctggtgctgggaagcaagaccaacgtgatctatgacctctttgcttctaaacctttcacccacatcgacctga





cccaggtcacactgcagaacagcgacaacagcgccatcgacgccaacaagctgaagcaggctgtgggcgatatctacaactaccgtagattcga





gcgccagttccagggctatttcgccggcggctacatcgacaagtacctggtgaagaacgtgaacacaaacaaggacagcgacgatgatctggtct





acagaagcctgaaggagctgaacctgcacctggaagaagcctacagagagggcgataacacctactacagggttaatgagaattactaccccgg





cgctagcatctacgagaacgagcgcgccagcagagattctgaattccaaaacgagatcctgaaaagagccgaacagaacggcgtgacattcgat





gagaacatcaagcggatcacagccagcggcaagtacagtgtgcagtttcagaaactggaaaacgacaccgacagcagcctggagagaatgacc





aaggccgtggaaggcctggtgaccgtgatcggcgaggaaaagttcgaaaccgttgacattaccggcgtgtctagcgataccaacgaggtgaaga





gcctggccaaggagctgaagacaaacgccctgggggtgaagctgaagttataa





Artificial


armY-T cell receptor beta variable 7-9 fusion protein codon-optimized (for human) nucleotide


sequence (1,950 bp)


52) SEQ ID NO: 52



atgtaccgcatgcagctgctgagctgcatcgccctgagcctcgccctggtgaccaacagcggcgttagccagaacccccggcacaa






gattaccaagoggggccagaacgtgaccttcagatgtgaccccatcagcgaacacaaccggctgtactggtacagacagacactgggccaagga





cctgagttcctgacctacttccagaacgaagcccagctggagaaatctagactgctttccgatagattcagcgccgagaggcctaagggctcttttag





cacactggagatccagagaacagagcagggcgatagcgcaatgtacctgtgcgccagcagcctgggcggcggcggcagcggcggaggcgg





ctccggcggcggcggatctaccaacctggtgaaccagtctggctacgccctggtggcctctggtagaagcggcaacctgggctttaagctgtttag





cacacagagtccctctgccgaggtgaagctgaagagcctgtccctgaacgacggcagctatcagtccgagatcgatctgagtggcggagctaact





tccgggaaaagttcagaaacttcgccaatgagctgtctgaagccatcaccaatagccctaagggcctggacagacctgtgcctaagaccgagattt





ctggcctgatcaagacaggtgataatttcatcacccctagctttaaggctggctactacgaccacgtggccagcgatggaagcctgctgagctacta





ccagtccaccgagtacttcaacaacagagtgctcatgcctatcctgcaaaccacaaacggaacactgatggccaacaacagaggatatgatgacgt





gttcagacaggtgccatctttttccggctggagcaacaccaaggccaccaccgtgtctacaagcaacaacctgacatatgacaagtggacctacttc





gccgccaagggctccccactgtacgacagctaccctaaccacttcttcgaggacgtaaagacactggctatcgatgccaaagacatcagcgcctta





aagaccaccatcgacagcgagaagcccacctacctgatcatcagaggactgagtggcaacggcagccagctgaatgaactgcagctgcctgaat





ctgtgaagaaggtgtccctgtacggcgactacaccggagtgaacgtggccaagcagatcttcgctaatgtggtcgagctggaattctacagcacca





gcaaggccaatagcttcggcttcaaccctctggtcctcggctctaagaccaacgtcatctacgacctattcgctagcaagcctttcacccacatcgac





ctgacccaggtgaccctgcagaacagtgacaatagcgccatcgacgccaacaagctgaagcaagccgtgggggacatctacaactaccggaga





tttgagcggcagttccagggctatttcgctggcggatacatcgacaagtacctggtgaaaaacgtgaatacaaacaaggacagcgacgacgatctg





gtgtaccgctctctgaaggaactgaacctgcatctggaagaggcctacagagagggcgataatacctactaccgggtgaacgagaactactaccc





cggcgcctccatctacgagaacgaacgggccagccgggacagcgaattccaaaacgagatcctgaaaagagctgaacagaatggcgtgacctt





cgacgagaacatcaagagaatcaccgcctccggcaagtacagcgtgcagttccagaagctggaaaatgacactgattctagcttggaaagaatga





caaaagccgtggaaggcctggtcacagtgatcggcgaggaaaagttcgagacagtggacatcacaggcgtgagcagcgataccaacgaggtg





aaaagcctggctaaagagctgaagaccaacgccctgggcgttaaactgaaactgtaa





Artificial


armY-T cell receptor beta variable 19 fusion protein codon-optimized (for human) nucleotide


sequence (1,947 bp)


53) SEQ ID NO: 53



atgtatagaatgcagctgctgtcctgcatagccctgtctctggctctggtgaccaactctgggatcacccagtccccaaagtacttgtttag






aaaggagggccagaacgtcaccctgtcttgtgaacagaacctcaaccacgacgccatgtactggtaccggcaggaccctggacagggcctgag





actgatctactacagccaaatcgttaatgatttccaaaagggagatattgctgagggctacagcgtgtccagagaaaagaaagaaagcttccctctg





accgtgaccagcgcccagaagaaccctaccgccttctacctgtgcgcctccagcattggcggcggcggcagcggaggcggaggcagcggagg





cggcggctcaacaaacctggttaaccagtccggctacgccctggtcgcctccggaagaagcggcaacctcggcttcaagctgttcagcacccag





agcccttccgccgaggtgaagctgaagagcctgagcctgaacgacggcagctaccagagcgagatcgacctgtctggcggagctaatttccgcg





agaagttcagaaacttcgccaacgagctgagcgaggccatcacaaacagccctaagggcctggacagacctgtgcctaagacagagatcagcg





gcctgatcaagaccggcgataatttcatcacaccatcttttaaggccggatattacgaccacgtggccagcgatggcagcctgctgagctactacca





gtctaccgagtactttaacaacagggtccttatgccaatcctgcaaacaacaaacggcacactgatggccaacaatcggggctatgatgatgtgttca





gacaggtgccctctttcagcggatggtccaacaccaaggccaccacagtgtctaccagcaacaacctgacctacgataagtggacttacttcgccg





ccaagggctcacccctgtacgacagctaccctaaccatttcttcgaagatgtgaagacgctggccatcgacgcaaaggacatcagcgccctgaag





accaccatcgacagcgaaaaacccacctacctgatcatccggggcctaagcgggaatggtagccagctgaacgagctgcagctgcctgagagc





gtgaaaaaggtgagcctgtacggcgactacacaggcgtgaacgtggccaaacagatcttcgctaatgtggtggaactggaattctattctacatcca





aggccaacagcttcggcttcaaccccctggtgctgggctctaaaacaaacgtgatctacgacctgttcgctagcaagcctttcacccacatcgacct





gacccaagtgaccctgcagaatagcgataacagcgctatcgacgccaacaagctgaagcaggccgtgggagacatctacaattacagaagatttg





aaagacagttccagggctacttcgccggcggctacatcgacaaatacctggtgaagaacgtgaataccaacaaggattctgacgacgacctggtct





accggtctctgaaagagctgaacctgcacctggaagaggcctaccgggagggagataacacctattaccgggtgaacgagaattactaccccgg





cgcctccatctatgagaacgagagagccagcagagacagcgagttccagaacgagatcctgaaaagagccgagcagaacggcgtgaccttcga





cgagaacatcaagcggatcaccgccagtggcaagtacagcgtgcagtttcaaaagctagaaaacgacacagatagcagcctggaaagaatgac





caaggctgtggaaggcctggtgaccgtgatcggcgaggaaaagtttgagacagtggacatcaccggcgtgagctctgacaccaatgaggtcaaa





agcctggctaaggaactgaagaccaacgccctgggcgtgaagctgaaactctaa





Artificial


armY-Hepatitis A virus cellular receptor 1 fusion protein codon-optimized (for human)


nucleotide sequence (2,700 bp)


54) SEQ ID NO: 54



atgtaccgcatgcagcttctgtcttgtatcgccctgagcctggcgctggtcaccaacagcagcgtgaaagttggcggagaggccggtc






ctagcgtcaccctgccttgccactactctggcgctgtgaccagcatgtgctggaaccggggcagctgtagcctgttcacctgccagaatggcatcgt





gtggacaaacggtacacacgtgacatacagaaaggacacaagatacaagctgctgggcgacctgtcaagacgggatgtgtctctgaccatcgag





aacaccgctgtaccgacagcggcgtgtactgctgcagagtggagcacagaggctggttcaatgacatgaagatcaccgtgagcctggagatcgt





gcctccaaaggtgaccaccacgcctatcgtgacaaccgtacctacagtgaccaccgtgcggaccagcacaaccgtgcctaccaccaccaccgtg





cccatgaccacggtgcccaccacaaccgtgccaaccaccatgagcatccccaccacgacaacagtgctgacaaccatgaccgtactacaacaac





atcagtgcctaccacaacaagcattcccacaaccacaagcgtgcctgtcacaacaaccgtgtccacattcgtgcctcctatgcccctgcctagacag





aatcacgagcctgtggctacctctcctagctcccctcagcctgccgagacacaccctactaccctgcagggcgccatccggagagaacccaccag





cagccctctgtatagttacaccaccgacggcaatgataccgtgaccgaaagcagcgatggactgtggaacaacaaccaaacacagctgttcctgg





aacattccctgctgacagccaatacaaccaagggcatctacgccggagtgtgcatctccgtgctggtcctgctggcactgctgggagttatcatcgc





caagaagtactttttcaagaaggaagtgcagcagctgagcgtgagcttctccagcctgcagatcaaagattgcagaacgccgtggaaaaggaagt





gcaagccgaagataacatctacatcgagaactccctgtacgccaccgatggcggcggaggctccggcggcggaggaagcggcggcggcggc





tccacaaatctggtgaaccagagcgggtacgccctggtggccagcggcagaagcggaaatctgggcttcaagctgtttagcacccagagcccttc





tgccgaggtgaaactgaaaagcctgtccctcaacgacggcagctaccagagcgagattgacctgagcggcggagccaatttcagagagaagttc





cgcaacttcgctaacgagctgtctgaagcaatcacaaactcccctaagggactggatagacccgtgcctaaaaccgagatcagcggcctgatcaa





gactggagacaatttcatcacccctagctttaaggccggctactatgaccacgttgcctccgacggcagcctgctgagctactaccagtctacagag





tactttaacaacagagtgctgatgcctattctgcagacaactaacggcacactgatggccaacaatcggggctacgatgacgtgttcagacaagtgc





ccagctttagcggctggagcaacaccaaggctactaccgtgtctaccagcaacaacctgacctacgacaagtggacctacttcgccgctaagggct





ccccactgtatgacagttaccccaaccacttcttcgaggacgtaaagaccctggccattgacgccaaggatatcagcgccctgaaaaccaccatcg





acagtgagaagcccacctacctgatcatccggggcctgagcggcaacggctctcagcttaacgagctgcagctgcctgagagcgtgaaaaaggt





gagtctatacggcgactacaccggcgtgaacgtggccaaacagatcttcgccaacgtggtggagctggaattctacagcaccagcaaggccaact





ctttcggcttcaaccccctcgtgctgggctccaagacaaacgtgatctacgacctgtttgcttctaaacctttcacccacatcgacctcacccaggtga





ccctgcaaaatagcgataacagcgccatcgacgccaacaagctgaagcaggctgttggagatatctataactaccggagattcgaaagacagttcc





aaggctatttcgccggcggctacatcgacaaatacctggtgaaaaacgtgaataccaacaaggacagcgacgatgacctggtgtacagatctctga





aggagctgaacctgcacctggaagaggcctacagagaaggcgacaacacatactacagagtgaacgagaactactacccaggagcttctatcta





cgagaatgaaagagccagcagagactctgagttccagaacgagatcctgaagcgggccgagcagaacggcgtgaccttcgacgagaatatcaa





gagaatcaccgcctccggcaagtacagcgtgcagtttcagaagctggaaaacgatacagactccagcctggaacggatgacaaaggccgtgga





gggcctggtgaccgtgatcggcgaggaaaaattcgaaaccgtggacatcaccggcgtctccagcgataccaacgaggtgaagagcctggccaa





ggaactgaagaccaacgccctgggagtgaagctgaagctataa





Artificial


armY-Myelin and lymphocyte protein fusion protein codon-optimized (for human) nucleotide


sequence (2,127 bp)


55) SEQ ID NO: 55



atgtacagaatgcagctgctgagctgcatcgccctgtccctggccctggtgaccaatagcatggcccctgccgccgctaccggcggta






gcacactgcctagcggcttcagcgtgtttacaacactgcctgacctgctctttatcttcgagttcatcttcggcggcctggtgtggatcctggtggcctc





tagcctggtcccttggcccctggtgcagggctgggtcatgttcgtgtccgtgttctgcttcgtggcaacaaccacactgatcatcctgtacattatcgg





cgcccacggtggcgagacaagctgggtgacactggacgccgcttatcattgtaccgccgctctgttttacctgtcagcaagcgtgctggaagccctt





gccaccatcaccatgcaggatggctttacctacaggcactaccacgagaacatcgccgccgtggtgttctcctacatcgccacactgctgtatgtcgt





gcacgccgtgttcagcctgattagatggaagtccagcggcggcggcggatctggcggaggcggaagcggcggcggaggctctaccaacctgg





tgaaccagagcggatacgccctggtggcctctggcagaagcggaaacctgggcttcaaactgttcagcacccagtccccaagcgccgaggtgaa





actgaagagcctgagcctgaatgacggcagctaccagagcgagattgacctctctggtggagccaatttcagagagaagttccggaacttcgcca





acgaactgtctgaagccatcaccaacagcccaaaaggcctcgatagaccagtgcccaagaccgaaatcagcggactgatcaagaccggcgata





atttcattacccctagctttaaggctggctattacgaccacgtggcttctgacggcagcctgctgagctactaccagagcaccgagtactttaacaata





gagtgctgatgcctatcctgcagaccaccaacggcaccctgatggccaacaacagaggttacgacgacgtgttcagacaggtgcctagcttcagc





ggctggtccaacaccaaggcgactaccgtctccacaagcaacaacctgacctacgataagtggacctacttcgccgcaaagggctctcctctgtac





gacagctaccccaaccacttcttcgaagatgtgaagaccctggctatcgatgctaaagatatcagtgccctgaagacaacaatcgacagcgagaaa





cctacctacctgatcatcagaggcctgagcggaaatggctcgcagctgaacgagctgcagctgcctgagtccgtgaaaaaggtgtccctctacgg





cgactataccggcgtgaacgttgccaagcagatctttgctaatgtggttgagctggagttctacagcacctctaaggccaattcttttggcttcaacccc





ctggtgctgggcagcaagaccaacgtgatctacgacctgttcgccagcaagcccttcacccacatcgatctcacccaagtgacactgcaaaactcc





gacaacagcgccatcgacgccaacaagctgaagcaggccgtgggcgatatctacaactacagacggttcgagagacagttccagggatatttcg





ccggcggctacatcgacaagtacctggtcaagaacgtgaacacgaacaaggatagcgatgacgacctggtgtaccggagcctgaaggaactga





acctgcacctggaagaggcttaccgggaaggcgacaacacctactaccgcgtgaatgaaaactactaccctggcgccagcatctacgagaacga





gcgggcctcccgggacagcgaattccagaatgaaatcctgaaaagagccgagcagaacggggtgaccttcgacgagaacatcaagcggatcac





cgccagcggcaagtactccgtgcagttccaaaagctggaaaacgataccgacagcagcctggaaagaatgactaaggccgtcgagggcctggtt





acagtgatcggcgaggaaaaatttgagacagtggacatcacaggcgtcagcagcgacacaaacgaggtgaagtctctggccaaggagctgaag





accaacgcccttggagttaagctgaagttataa





Artificial


armY-Complement factor H fusion protein codon-optimized (for human) nucleotide sequence


(5,307 bp)


56) SEQ ID NO: 56



atgtacagaatgcagctgctgtcctgcatcgccctgtctctggccctggttaccaattcagaagattgcaacgagctgcctcctcggcgg






aacaccgaaatcctgaccggatcctggagcgaccagacataccccgagggcacccaggccatttacaagtgtcggcctggctacaggtcactgg





ggaacgttatcatggtgtgccggaaaggcgagtgggtggccctgaaccctctgcggaagtgccagaaacggccatgtggccaccctggcgaca





cccctttcggaaccttcaccctcacaggtggcaacgtctttgagtacggcgtgaaagccgtttacacatgcaatgagggataccagctgctcggaga





gatcaactacagagagtgtgataccgacggatggaccaacgacatccccatctgtgaagtggtgaagtgcctccctgtcacagcccctgaaaacg





gcaagatcgtgtcttctgctatggagcctgatagagaatatcactttggccaggccgtgagattcgtgtgcaactctgggtacaaaatcgagggagat





gaggaaatgcactgctctgatgacggcttctggagcaaggaaaagcctaagtgcgtggagatcagctgcaagagtcctgacgtgatcaacggctc





ccctatctcacagaagatcatttacaaggagaacgaaagattccagtacaaatgtaacatgggatacgagtactctgaaagaggtgatgccgtttgta





ctgaatccggctggcggcctctgcctagctgcgaggagaagagctgtgacaatccttacatccccaatggagattacagccctctcagaatcaagc





accgcaccggcgacgagatcacctaccagtgtcgcaacggattttaccccgctacccggggcaacaccgccaagtgtacctccacaggctggat





ccctgcccccagatgcaccctgaaaccctgcgactaccctgatatcaagcacggcggcctgtatcacgagaacatgagaagaccttacttccctgt





ggccgtgggcaagtactactcttattactgcgatgaacactttgaaacccctagcggcagctactgggatcacatccactgtacccaggatggctgg





tctccagctgtgccatgtctgcgcaagtgctacttcccctacctggaaaacggctacaaccagaactacggtagaaagttcgtgcagggcaagtcta





tcgacgtggcatgccaccccggctacgccctacctaaggctcagaccacagtgacctgtatggaaaacggttggtctcccaccccacgctgcatcc





gggtgaagacctgctccaagtcttctatcgatattgaaaacggcttcatctctgaatcccaatacacctatgctctgaaggaaaaggccaagtaccag





tgtaagctgggatacgtgaccgccgacggcgagacatctggctccatcacctgtggcaaggacggctggagcgcacagcccacatgcattaagt





cttgcgacatcccggtgttcatgaacgccagaaccaagaacgatttcacctggttcaagctgaacgacacactggattacgagtgtcacgacggata





tgaaagcaataccggcagcaccaccggcagcatagtgtgcggctacaacggctggagcgatctgcccatctgctacgaaagagaatgcgagctg





cctaagatcgatgtgcacctggtgcccgatcggaagaaggaccagtacaaggtgggcgaagtgctgaagtttagctgcaagcccggattcacaat





cgtgggaccaaattctgtgcagtgctaccacttcggcctgagccccgacctgcccatctgcaaggaacaagtgcagagctgtggacctcctcctga





gctgctgaacggaaacgtgaaagagaagacaaaggaggagtacggccattctgaggtggtcgagtactactgtaaccctagattcctgatgaagg





gccctaacaagatccaatgcgtggacggagagtggaccaccctgcccgtttgcatagtggaggaaagcacctgtggcgacatcccggaactgga





acacggctgggcccagctgagcagccctccctactactacggcgattctgtcgaatttaactgtagcgagtcattcaccatgatcggccatagaagc





attacttgcatccacggagtgtggactcagttacctcagtgcgttgccatcgacaagctgaagaagtgtaaatctagcaacctgatcattctggaaga





acacctgaagaacaagaaagaattcgaccacaattcaaacatcagatacagatgccggggcaaagagggctggatccacaccgtgtgcatcaac





ggcagatgggaccccgaggtgaactgcagcatggcccagatccagctgtgtcctcctcccccccagatcccaaacagccacaacatgaccacca





cgctgaactaccgagacggcgagaaggtgagcgtgctgtgccaggagaactacctgatccaggagggcgaagagatcacatgtaaggacggtc





gttggcagagcatccccctgtgcgttgaaaagatcccctgcagccagcctcctcaaatcgagcacggcaccatcaacagctccagatcctcccag





gagtcctacgcccacggcacaaaactgagctacacatgcgaaggcggattccggatttctgaagagaacgagaccacctgctacatgggcaagt





ggagctctccccctcaatgtgagggcctgccttgcaagagccctcctgagatcagccacggcgtggttgcccacatgtctgatagctaccaatacg





gcgaggaagtgacttataagtgcttcgaggggtttgggatcgatggtcccgccattgccaagtgcctgggagaaaaatggtctcatccaccatcatg





tatcaagaccgactgcctgagtttgcctagctttgagaatgctatccctatgggcgagaagaaggacgtatacaaagccggcgagcaggtgacata





cacatgtgccacctactacaaaatggacggcgccagcaatgtaacgtgtataaatagcagatggacaggcagacctacctgcagagatacaagct





gcgtgaatcctcccacagtccaaaatgcttatatcgtgagtcggcagatgagcaagtaccctagcggcgagagagtgagataccagtgcaggtcc





ccctacgagatgttcggcgacgaggaggtgatgtgcctaaacggcaactggacggaacctcctcagtgcaaagacagcaccggaaaatgcggc





cctcctcctcctattgacaacggcgatatcaccagattccactgagcgtgtacgctcctgcttcatctgtcgagtaccaatgccagaatctgtaccag





ctggaaggtaataagagaatcacctgcagaaacggacagtggagcgaacctcctaagtgcctgcacccttgcgtgatctccagagagatcatgga





aaactacaacatcgccctgagatggaccgccaaacagaagctgtacagccggaccggcgagagcgtcgagttcgtgtgtaagagaggttaccga





ctgtcctctagaagccataccctgcggaccacctgctgggacggcaaactagagtaccctacgtgcgccaagcggggcggaggtggctcagga





ggcggcggctctggcggcggcggctctacaaacctggtgaaccagagcggttatgccctggtggccagcggcaggtctggaaatctgggcttta





agctgttttcaacgcagagcccttccgccgaagttaagctgaaatcactgagcctgaatgacggctcctaccagagcgagatcgacctgtctggag





gagctaactttagagagaagttcaggaacttcgctaacgagctgagcgaagccatcaccaatagccctaaaggcttggacagacctgtgcccaag





actgagatcagcggcttgatcaagaccggcgacaacttcatcaccccatcttttaaggccggctactacgaccacgtggcctctgacggaagcctg





ctatcctactatcagtctactgagtacttcaacaacagagtgctgatgcctatcttgcagaccaccaatggcaccctgatggccaacaaccggggata





tgacgatgtgttcagacaggtgcctagcttcagcggatggagcaacaccaaggcgacaaccgtgagcacatccaacaacctgacatacgacaagt





ggacatattttgcggccaagggctctccactgtatgatagctaccccaatcacttcttcgaggacgtgaagaccctggccatcgacgccaaagacat





cagcgcccttaagacaacgatcgattccgagaagcctacctacctgatcattagaggcctgagcggcaacggcagccagctgaacgagctgcag





ctgccagagtccgtgaagaaagtgtccctgtatggcgactacacaggcgtcaacgtggccaagcaaatcttcgctaatgtggtggaacttgagttct





acagcacatcgaaggctaactctacggcttcaaccccctggtgctgggcagcaagaccaatgtgatttacgacctgttcgccagcaagcccttcac





acacatcgacctgacccaagtgacactgcaaaacagcgataacagcgccatcgacgccaacaagctgaagcaggctgtgggcgacatctacaa





ctaccggagattcgagagacagttccagggctacttcgccggcggctacatcgataagtacctggtgaagaacgtgaataccaacaaagactctga





tgacgacctggtgtacagaagcctgaaagagctgaacctgcatctggaagaagcctaccgggagggcgataacacctactaccgggtgaacgaa





aactactatcctggcgctagcatctacgagaacgaacgagccagcagggattctgaattccagaacgagatcctgaagcgggccgagcagaacg





gagtgacatttgatgagaacatcaaacggatcaccgccagcggcaaatactccgttcagttccaaaaactggaaaatgatacagacagcagcctgg





agagaatgaccaaggccgtggaaggcctggtgacggtgatcggcgaagagaaattcgagaccgtggacatcaccggcgtaagctctgacacca





acgaagtgaagagcctggctaaggaactgaagaccaacgccctgggggtcaagctgaagctgtaa





Artificial


armY-Hepatocyte growth factor receptor fusion protein codon-optimized (for human)


nucleotide sequence (4,392 bp)


57) SEQ ID NO: 57



atgtaccggatgcaactgctgagctgcatagccttatctctggcactggtgaccaacagcgagtgcaaggaagccctcgccaagagtg






aaatgaacgtgaatatgaaataccagctgcctaacttcaccgccgaaacccctatccagaacgtcatcctgcatgagcaccacatcttcctgggcgc





tacaaattacatctacgtgctgaatgaggaggacttgcagaaagtcgccgaatacaagaccggacccgtgctggagcacccggactgcttcccatg





tcaggattgcagttctaaggccaacctgagtggtggcgtttggaaggacaacatcaacatggccctggtggtcgacacatattacgacgatcagctg





attagctgtggcagcgtgaaccggggcacctgccagagacacgtgttccctcacaaccacactgccgacatccagagcgaagtgcactgcatctt





cagcccccagatcgaggagcctagccagtgtcctgactgcgtggtgtcagccctgggtgctaaggtactgtccagcgttaaggacagattcatcaa





ctttttcgtgggtaacacaatcaacagcagctacttccccgatcaccctctgcacagcatatccgtgcggagactcaaggaaacaaaggacggcttc





atgttcctgacagaccagagctatatcgatgtgctgcctgagttcagagattcttaccccatcaagtacgtgcacgccttcgagagcaacaattttatct





atttcctgacagtccaaagggagacactcgatgcccagaccttccacaccagaatcatccggttctgcagcattaacagtggactgcactcttatatg





gaaatgcccctggaatgtatcctcacagagaaaaggaagaaaagaagcactaagaaggaggtgttcaacattctgcaggctgcttacgtgtccaag





cctggcgctcagctggccagacagatcggcgccagcctgaacgatgacatcctgttcggcgtcttcgcccaatctaagcctgacagcgccgagcc





catggacagatctgctatgtgcgctttccccatcaagtacgtgaatgacttcttcaacaagatcgtgaacaagaacaacgtgcggtgcctgcaacact





tctacggccctaaccacgagcactgttttaatagaaccctactgcggaactcctctggttgtgaagctagaagagacgaataccggaccgagttcac





caccgccctgcagagggtggacctgttcatgggccaattcagcgaggtcctgctgacatctataagcaccttcatcaagggagatctgacaatcgc





caacctgggcaccagtgagggcagattcatgcaggtggtggtgagtagatccggccctagtacaccccatgttaacttcctgctggactcacaccc





cgtgtcccctgaggtgatcgtggaacatacactgaaccagaatggctatacactggtgatcaccggaaagaagattaccaagattcctctgaacgg





cctgggctgcagacacttccagagctgtagccagtgcctgagcgcccctccttttgtgcagtgcggctggtgccacgacaagtgcgtgcgcagcg





aggagtgcctgagcggcacctggacacagcagatctgtctgcctgccatctacaaggtctttccaaacagcgccccattggaaggcggaactcgg





ctgacaatctgcggctgggacttcggctttcggcggaacaacaagtttgacctgaagaagacccgggtgctgctgggcaacgagagctgtaccct





gaccctgagcgaaagcaccatgaacacgctgaaatgcaccgtgggcccagccatgaacaaacacttcaacatgtctatcatcatcagcaatggcc





acggcacaacccagtacagcacgttcagctacgtggaccctgtgatcaccagcatctcaccgaagtacggccctatggccggcggcacattgctg





accctgaccggaaattatctgaactcgggcaacagccgtcacatctccataggcggaaagacatgcacgctgaagtcggtgtctaacagcatcctg





gagtgctacacaccagcccagaccatctcgacagaattcgctgtaaagctgaagatcgatctcgctaatcgagagacaagcatcttttcttacagag





aggatcctatcgtgtacgagatccaccctacaaagtctttcatcagcggcggcagcaccatcacaggcgtgggaaaaaacctgaactctgtgtctgt





gccgagaatggtgatcaacgtgcacgaggctggcagaaacttcacagtggcctgccagcatagaagcaacagcgaaatcatctgctgcaccacc





ccctcgctgcagcagcttaatctgcagctgcccctgaaaacgaaggccttcttcatgctggatgggatcctgtctaagtacttcgatctcatctacgtg





cacaatcctgtgtttaagccattcgagaagcccgtcatgatctctatgggcaacgagaacgtgctcgagatcaagggcaatgatatcgaccctgagg





ccgtgaaaggcgaggtgctgaaagtgggcaacaaaagctgcgaaaacatccacctgcacagcgaagccgtgctgtgcaccgtgcctaacgactt





gctgaagctgaactccgagctgaatatcgagtggaagcaggccatcagctctaccgtcctgggcaaggtgattgtgcaacctgaccagaacttcac





cggcggtggcggtagtggaggcggcgggagcggaggcggaggaagcaccaacctggtgaaccagagcgggtacgccctggtagctagcgg





cagaagcggcaacctgggctttaagctgttttctacccagagccctagcgccgaagtgaagctgaagagcctgagcctgaacgacggcagttacc





aatccgagatcgacctgtctggcggcgccaacttcagagagaagttcagaaacttcgctaatgagctgtctgaggccatcaccaacagccctaagg





gcctggatagacctgtgccaaagaccgagatctccggcctgatcaaaaccggcgataactttatcacacctagctttaaggccggctactacgacc





acgtggcctccgacggctccctgctgtcctactaccagagcacagaatacttcaacaacagagtgctgatgcctatcctgcaaaccacaaacggca





ccctgatggccaacaacagaggctacgacgatgtgttccggcaggtgcctagcttctccggctggagcaacaccaaggccactaccgtttctacca





gtaacaacctgacctacgataagtggacctactttgccgccaagggcagccccctgtacgactcataccccaatcacttctttgaagatgtgaagac





cctggccatcgatgccaaagatatcagcgctctgaaaacaaccatcgactccgagaagcccacctaccttattatcagaggcctgtccggcaacgg





ctctcagctgaatgagctgcagctcccagaaagcgtgaagaaggtgtcgctgtacggcgactacaccggcgtcaatgtggccaaacagatatttgc





caacgtagtagaattggaattctactctacaagcaaagccaactcttttggatttaaccccttagtgctaggatctaagacaaacgtgatctacgacctg





ttcgccagcaaacctttcacccacatcgacctgacccaagtgaccctgcagaacagcgacaacagcgctatcgacgccaacaagctgaagcagg





ccgtcggcgatatatacaattaccggcggttcgagagacagttccagggctacttcgccggaggatacatcgacaagtacctggtgaagaacgtga





acactaataaggacagcgacgacgacctcgtgtacagaagcctgaaagaactgaatctgcacctggaagaagcctaccgggaaggagacaaca





cctactacagagtgaacgaaaactactaccctggcgccagcatctatgagaacgagagagccagcagagattctgaattccagaacgagattctga





aacgggccgagcagaatggcgtgaccttcgacgagaatattaagcgcatcaccgccagcggcaaatattccgtccagtttcagaagctcgagaac





gacaccgacagcagcctggaaagaatgaccaaggccgtggaaggcctggtgaccgtgatcggcgaggaaaaattcgagaccgtggatatcacc





ggcgtgagcagcgacacaaacgaagtgaagagcctggccaaggaactgaagaccaacgccctgggagtgaagctcaagctgtaa





Artificial


armY-Membrane cofactor protein (CD46) fusion protein codon-optimized (for human)


nucleotide sequence (2,595 bp)


58) SEQ ID NO: 58



atgtaccgcatgcagctgctgagctgcatcgccctgtctctggctctggtgaccaacagctgcgaggaacctccaaccttcgaggccat






ggaactgatcggcaagccaaagccctactatgagattggcgaaagagtggattacaaatgcaagaaaggctacttttacatcccccccctggccac





ccacaccatctgtgatagaaaccacacatggctgcctgtctccgacgacgcctgttaccgggagacatgcccttacatccgagaccctctcaatgga





caggccgtgcctgctaatggcacatatgagttcggataccaaatgcacttcatctgcaacgagggctactacctgatcggcgaagaaatcctgtact





gcgagctgaaaggctcggtggctatttggtccggcaaacctcctatctgtgaaaaggtgctgtgcacccctcctcctaagatcaaaaacggcaagc





acacctttagcgaggtggaagtgttcgagtacctggatgccgtgacatatagctgtgaccccgcccctggccctgatcccttcagcctgattggcga





gagcaccatctattgcggcgataactctgtgtggagccgggccgcccctgaatgcaaggtggtgaagtgcagattccctgtggtggaaaacggaa





agcagatctccggctttggcaaaaagttctactataaggctaccgtgatgttcgagtgcgacaagggattctacctggacggctctgatacaatcgtg





tgcgacagcaactctacgtgggaccctccagtgcctaagtgtctgaaagttctgcctcctagctctacaaagccccccgccctgagccacagcgtgt





ccaccagcagcacaaccaagtccccagccagcagcgccagcggacctagacccacctacaagcctcctgtgtccaactaccctggctaccccaa





gcctgaggaaggcatcctggatagcctggatggcggcggcggctccggcggtggaggatctggcggcggaggaagcacattatctggtgaatc





agagcggctacgccctggttgccagcggcagaagcggcaacctgggcttcaagctgtttagcacacagagccccagcgccgaggtgaagctga





agagcttgtcgctaaatgatggctcctaccagtctgagatcgatctgagcgggggcgccaattttagagagaagttccggaacttcgcaaacgagct





gtctgaagccatcaccaacagccctaaggggctggacagacctgtgccaaagaccgagattagcggcctcatcaagacaggcgacaatttcatca





cacctagcttcaaggccggatactatgaccacgtggcctccgacggcagcctgctgagctactaccagagcacagagtacttcaacaacagagtg





ctgatgcctatcctgcagaccaccaacggcaccctcatggccaacaatcggggctatgacgacgtgttcaggcaggtgcctagcttcagcggctg





gagcaacaccaaggccaccactgtgtctacctccaacaacctgacctacgacaagtggacctacttcgcagctaaaggctctccactgtacgatag





ctacccaaaccacttcttcgaggacgtgaagaccctggctattgacgccaaggacatctctgccctgaagaccacaatcgacagcgagaagccta





cctacctgatcatccggggcctgagcggaaacggcagccagctgaacgagctgcagctgcccgagtccgtgaaaaaagtgtccctgtacggcga





ctacaccggcgtgaacgtggccaagcagatcttcgctaatgtggtggaacttgagttctactctaccagtaaggccaactcctttggatttaaccccct





ggtgctgggcagcaagaccaacgtgatctacgacctgttcgcctctaaacctttcacccatatcgacctgacccaggttacactgcaaaacagcgat





aactctgccatcgatgccaacaagctgaagcaagccgtgggcgacatctacaactaccgcagatttgaacggcagttccagggctacttcgccgg





cggctacatcgacaagtacttggtcaagaacgtgaataccaacaaggatagcgacgatgacctggtctaccggagcctgaaggaactgaacctgc





acctggaagaagcctacagagaaggtgacaatacctactatagagtgaacgagaactactacccgggagccagtatctacgagaacgaaagagc





ctctagagatagcgagttccaaaacgagatcctgaaaagagctgaacagaacggcgtgaccttcgacgagaacatcaagagaatcaccgccagc





ggcaagtacagcgtgcagtttcagaagctggaaaacgacaccgacagctccctggaacggatgaccaaggctgttgagggcctggtcacagtga





tcggagaggaaaagttcgaaacagtggatatcacgggcgttagcagcgacaccaacgaggtcaagagcctggccaaagagctgaagacaaac





gccctgggcgtgaagctgaagctgtaa





Artificial


armY-Glycophorin-A fusion protein codon-optimized (for human) nucleotide sequence


(1,884 bp)


59) SEQ ID NO: 59



atgtaccgtatgcagctgctgtcttgcatcgccctcagcctggctctggtgaccaacagctctagcacaacaggcgttgccatgcacac






cagcaccagctctagcgtgaccaagagttacatctcttctcagaccaacgatacccacaagagagacacgtacgccgccaccccaagagcccatg





aggtgtctgaaatcagcgtgcggaccgtgtacccccccgaggaagaaaccggcgagcgggtgcagctggcccaccacttttctgagcctgaggg





aggtggaggcagcggcggcggcggcagcggcggaggcggcagcaccaacctggttaaccagtccggctatgccctggtggctagcggcaga





tccggcaacctgggctttaagctgttcagcacccagagccccagcgccgaggtgaaactgaagagtctgagcctgaatgacggctcttatcagag





cgagatcgacctgagcggcggcgccaatttcagagagaagtttcggaacttcgccaatgaactgtccgaagccatcaccaacagcccaaagggc





ctggacagacccgtgcctaaaacagaaatcagcggactgatcaagaccggcgataatttcatcacacctagcttcaaggccggctactacgacca





cgtggccagcgacggctccctcctgagctactaccaaagcacagagtacttcaacaaccgggtgctgatgcctatcctgcagaccacaaatggca





ccctcatggccaataacagaggctatgatgacgtgttccggcaggtgcccagctttagcggatggagcaacaccaaggccacaaccgtgtccaca





tccaacaacctgacctacgacaagtggacctacttcgctgctaagggcagccctctgtacgactcttaccctaaccacttcttcgaggatgtgaagac





gctggctatcgacgccaaggacatctcggccctgaagaccacaatcgacagcgagaagcctacatacctgatcatcagaggactgagcggcaac





ggcagccaactgaatgagctgcagctgcctgagagcgtgaaaaaggtgagcctgtacggcgactataccggcgtgaatgtggctaagcagatctt





cgccaacgtcgtggaactggaattctacagcaccagcaaggctaactccttcggctttaaccccctggtgctgggctccaaaacaaacgtgatctac





gacctgttcgcctccaaacctttcacccacatcgacctgacacaagtgacactgcaaaatagcgataacagcgccatcgacgccaacaagcttaag





caggccgtgggcgacatctacaactacagaagattcgagagacagtttcagggctatttcgccggaggctatattgataaatacctggtgaagaac





gtgaacaccaacaaagatagcgacgacgatctggtgtacagatctctgaaagagctgaacctgcacctggaagaggcctaccgggaaggagata





acacctactacagggtcaacgagaactactaccctggagccagcatctacgagaacgagagagcttctagagatagcgagttccagaatgaaatc





ctgaagcgggccgaacagaacggagtgacattcgacgagaacattaagcggatcaccgcctctgggaagtacagcgtgcagttccagaagctg





gagaacgacaccgattcttctctggaaagaatgaccaaggcagtcgagggcctggtgaccgtgatcggagaggaaaagttcgagacagtcgaca





tcactggcgtgagctcggacaccaacgaggtaaagagcctggccaaggaactgaagaccaacgccctgggcgtgaagctcaaactgtaa





Artificial


armY-C-type lectin domain family 4 member K (Langerin, CD207) fusion protein codon-


optimized (for human) nucleotide sequence (2,460 bp)


60) SEQ ID NO: 60



atgtatcggatgcagctgctgagctgcatcgccttatccctggctctggtgacaaactcccctagattcatgggcaccatcagcgacgtg






aaaacgaacgtgcagctgctgaagggaagagtggacaacatctctaccctggattctgagatcaaaaagaactccgatggcatggaagctgctgg





cgtgcaaatccagatggtgaatgagagcctgggctacgtgcggtcccagttcctgaagctgaagaccagcgtggaaaaggccaacgcccagatt





cagatcctgacaagaagctgggaggaagtgtctacactgaatgctcagatccccgagctgaaaagcgatctcgagaaggctagcgccctgaaca





ccaagatccgggccttgcaaggctctctggaaaacatgagcaagctgctgaagagacagaacgatatcctgcaggtcgtgtctcagggctggaag





tacttcaagggcaacttctactacctctgatccctaagacctggtactctgccgagcagttctgcgtgtccagaaacagccacctgaccagcgtta





ccagtgagagcgagcaggagttcctgtataagacagccggaggcctgatctattggatcggcctgaccaaggccggcatggagggcgattggag





ctgggtcgacgacacccctacaacaaagtgcagagcgtgcggttaggatccccggcgagcctaacaacgccggcaacaacgagcactgcggc





aatatcaaagcccctagcctgcaggcctggaacgatgccccgtgcgacaagacatactgttcatctgtaaaaggccttacgtgcccagcgaaccc





ggcggcggcggcagcggaggcggcggctctggcggaggaggaagcaccaacctggtgaaccagagcggctacgccctggtcgccagcggc





agaagcggaaatctgggcttcaagctgtttagcacacagagcccatctgcagaggtgaaactgaagagcctgagcctgaacgacggcagctacc





agtctgagatcgacctgtctggcggggccaatttccgggaaaagttccggaacttcgctaacgagctgtctgaagccatcaccaatagtccaaagg





gcctggaccggcctgtgcctaagactgagatactggccttatcaagacaggcgacaacttcatcacccctagattaaggccggctactacgacca





cgtggccagcgatgggtctctgctgagctactaccagagcacagagtacttcaacaatagagtgctgatgccaatcctgcaaacaacaaatggcac





actgatggccaacaaccggggctacgacgatgtgttcagacaggttcctagcttcagcggctggtccaacaccaaggccaccaccgtgagcaca





agcaacaacctgacatatgataagtggacctacttcgccgctaagggcagccctctgtacgacagctaccctaaccatacttcgaggacgtgaaga





cgctggccattgacgccaaagacatctcggccctgaagaccaccatcgacagcgaaaaacctacctacctgatcatcagaggcctgagcggcaa





cggatctcagctgaacgagctgcagctgcccgagagcgtgaagaaggtgagcctctacggcgactacaccggcgtgaacgtggccaagcagat





tttcgcaaacgtggtggaactggaattttacagcacctccaaggctaacagcttcggctttaaccccctggtgctgggatctaagaccaatgtgatcta





cgacctcttcgcttccaagccctttacccacatcgacctgacccaggtgaccctgcaaaattcagataatagcgccatcgacgccaacaagctgaaa





caagccgtgggcgacatctacaactacagaagattcgagcgccagttccagggctattttgctggcggttacatcgacaagtacctggtgaaaaac





gtgaacaccaacaaggacagcgacgatgacctggtgtacagatccctgaaagagctgaacctgcacctggaagaggcctacagagagggcgat





aatacctactatagagtgaatgagaactactaccctggcgccagtatctacgagaacgaaagagctagcagagacagcgagttccagaacgagat





cctgaagcgggccgagcagaatggcgtgaccttcgacgagaacatcaagcggatcacagccagcggcaagtacagcgtgcagttccagaaact





ggaaaacgacacagatagcagcctcgagagaatgaccaaggccgtggaaggactggtgaccgtcatcggcgaagaaaagttcgaaacggtgg





acatcaccggagtgtcctccgacaccaatgaggtgaagtccctggccaaggaactgaagaccaatgccctcggagtgaagctgaagctataa





Artificial


armY-Anthrax toxin receptor 1 fusion protein codon-optimized (for human) nucleotide


sequence (3,264 bp)


61) SEQ ID NO: 61



atgtacagaatgcagctgttgagctgtatcgccctgagcctggccctggtgaccaacagcgaggacggtggccctgcctgctacggc






gggtagacctgtacttcatcctggataagtccggttctgtgctgcaccactggaacgaaatctactacttcgtggaacagctggcccacaagtttatct





cccctcagctgcggatgagcttcatcgtgttctccacaagaggcaccaccctgatgaagctgaccgaggatcgcgagcagatcagacagggactg





gaagagctgcagaaagtgctgcctggcggcgatacatacatgcacgagggatttgagagagcctccgagcagatctattacgagaacagacagg





gctaccgcaccgccagcgtgatcattgccctgacagacggcgagctgcatgaagatctgttcttctacagcgagcgcgaggccaacagaagccg





ggacctgggcgccatcgtgtactgtgtgggcgtgaaggacttcaacgaaacccagctggccagaatcgccgatagcaaggatcacgtgttccctg





tgaacgacggattccaggccctgcagggcatcatccacagcattctaaagaagtcctgcatcgagatcctggctgctgaacccagcaccatctgcg





ccggcgagagcttccaggtggtggtgcggggcaacggcttccggcacgccagaaacgtggacagagttctgtgcagctttaagatcaatgatagc





gtgacacttaacgagaagcccttcagcgtggaagatacctacctgctgtgtcctgctccaatcttaaaagaggtgggaatgaaagccgccctgcaa





gtgtccatgaacgatggcctctcttttatcagttccagcgtgatcatcaccacaacccactgttctgatggtagcatcctggccatcgccctgctcatcc





tgtttctgctgctggccttggccctgctgtggtggttctggcctctgtgctgcaccgtgatcatcaaagaagtgcctcctcctcccgctgaagagagcg





aagaggaggacgacgacggcctgcctaagaaaaagtggcccacagtcgatgcttcttactacggcggcagaggcgttggcgggatcaagcgga





tggaagtgcggtggggagaaaagggcagtaccgaggaaggagctaagctggaaaaggccaagaatgccagagtgaagatgcctgagcagga





gtacgagttccccgagcctcggaacctgaacaacaacatgagacggccctcctctccaagaaagtggtacagccctatcaagggcaagctggac





gccctctgggtcctgctgagaaagggctacgacagagtgagcgtgatgcggccccagcctggcgacactggcagatgcatcaactttaccaggg





tgaagaacaaccagcctgccaagtaccccctgaacaacgcctaccacacaagctctcctcctcccgctcccatctacactccgccccccccagcc





ccacactgccctcccccaccaccctctgcccctacccctcccatccccagccccccttcaaccctgcctccccctccgcaagcccctccaccaaac





agagcacctccacctagcagaccccctcctagaccttctgtgggcggcggcggcagcggcggaggcggcagcggcggaggcgggagcacca





acctggtgaaccagagcggctacgccctggtggcctccggcagaagcggcaacctgggcttcaagctgttctcgacccagagcccttctgccga





ggtgaagctgaaaagcctgtcactgaatgacggctcttaccagagcgagatcgacctgagcggcggagctaacttcagagaaaagttccggaact





tcgccaacgagctgtctgaggccatcaccaacagccctaagggcctggacagacccgtacccaagaccgagatcagcggactgattaagacgg





gcgacaacttcatcacaccttccttcaaggctggatactacgatcatgtggccagcgacggcagcctgctgagctactaccagtccacagagtactt





caacaacagagtcctgatgcctatcctccagaccaccaatggcaccctgatggccaacaatagaggctacgacgacgtgttcaggcaggttccttc





tttctccggctggagcaacacaaaggccaccacagtgagcacaagcaataacctcacctacgacaaatggacctacttcgctgccaagggcagcc





ccctctacgactcttatcctaaccactttttcgaggatgtgaaaacactggctatcgatgccaaggacatcagcgcccttaaaacaacaatcgactccg





agaaacctacctacctgatcatcagaggcctgtccggcaatggcagccagctgaacgagctgcaactgcctgaaagcgtgaaaaaagtgagcct





gtatggggactacaccggcgtgaacgtggccaagcagatcttcgccaatgtggtggaactggagttctacagcactagcaaggccaattctttcgg





ctttaaccccctggtgctgggcagcaagacaaacgtgatctacgatctgttcgccagcaagcctttcacccacatcgacctgacacaggtgacgctg





cagaacagcgacaacagcgccatcgacgccaacaagctgaagcaggccgtgggcgacatttacaactaccggagattcgagagacaatttcag





ggctatttcgccggcggatacatcgacaagtatctggtcaaaaatgtgaataccaacaaggatagcgacgacgacctggtataccggtccctgaaa





gaactgaacctgcacttggaggaagcctacagagagggcgacaatacctactatagagtcaacgagaactactaccctggcgcctccatctacga





aaatgaacgggcctctagagactctgagttccaaaacgagatcctgaaaagagcagagcagaatggcgtcaccttcgacgagaacatcaagcgc





attaccgccagcggaaagtactccgtgcagttccagaagctggagaacgataccgacagctctctggaacggatgaccaaggccgtggagggac





tggtcaccgtgatcggcgaagagaagttcgaaaccgtggacatcaccggcgtgtcttctgacacaaacgaagtgaaaagcctggctaaagagctg





aagacaaacgccctgggagtgaagctgaagctgtaa





Artificial


armY-Anthrax toxin receptor 2 fusion protein codon-optimized (for human) nucleotide


sequence (2,523 bp)


62) SEQ ID NO: 62



atgtacagaatgcagctgctctcttgcattgccctgagcctggccctggtgaccaatagccaggagcaacctagctgcagaagagcctt






cgacctctacttcgtgctggataagtccggcagcgtcgccaacaattggatcgagatctacaacttcgtacagcagctggccgaacgcttcgtgagc





cccgagatgagactgagcttcatcgtgttctcttcccaggccaccatcatcctgcctctgaccggcgacagaggcaaaatctcaaagggcctggaa





gatctgaaaagagtgtcccccgtcggcgagacatacatccacgagggcctgaagctggccaatgaacagatccagaaggccggcggactgaag





accagcagcatcatcattgccctgaccgacggcaaactggacggcctggtccctagctacgccgagaaggaagccaagatcagccggagcctg





ggcgcttctgtgtactgcgtgggagtgctggacttcgagcaggctcaactggagaggatcgctgatagcaaggagcaggttttcccagtgaaagg





cggctttcaagccctgaaaggcatcatcaacagcatcctggcccagagctgtacagagatcctggaactccagcctagcagcgtgtgcgtcggcg





aagagttccagatcgtgttaagcggcagaggcttcatgctgggcagcagaaacggcagcgtgctgtgcacatacaccgtcaatgagacctacaca





acaagcgtgaagcccgtgtccgtgcagctgaatagcatgctgtgtcctgcccctatcctcaacaaggccggcgaaaccctggacgtgtccgtgtctt





tcaatggcggcaagagcgtaatctccggctctctgatcgtgacagccaccgagtgcagcaacggaggcggaggcggatctggtggcggaggat





cgggcggtggcggtagcaccaacctggtgaaccagtcaggctacgcccttgtggccagcggaagatccggcaacctgggctttaagctgttttcta





cacagagcccatctgctgaagtgaagctgaagtctctcagcctgaacgacggctcttatcagtccgagatcgatctgagcggaggagccaatttcc





gggagaagttcagaaactttgctaatgagctgagcgaagccatcacaaacagccctaagggcctggatagacctgtgcccaagaccgagatcag





cggactgatcaagacaggcgacaacttcatcaccccaagcttcaaggctggctactatgaccacgtggcctctgatggatccctgctgtcttattacc





agagcacagaatacttcaacaacagagtgctgatgcctatcctgcaaaccaccaatggaacgctgatggccaacaaccggggctacgatgacgtg





ttcagacaggtgcctagcttcagcggatggagcaacaccaaggccacaacagtcagcacctctaacaacctgacctacgacaagtggacctacttt





gccgctaagggctctccactgtacgatagctaccccaaccacttctttgaggacgtgaagacactggccatcgatgccaaagacatatctgcgctga





agaccaccatcgacagcgagaagcctacatatctgatcatcagaggcttgagcggcaacgggtctcagctgaacgagcttcagctgcctgagagc





gtgaaaaaggtgagcctgtacggcgactacaccggcgtgaacgtggccaagcagatcttcgctaacgtggtggaattagagttctacagcaccag





caaggccaacagcttcggcttcaaccccctggtgctgggctctaagacaaacgtgatctacgatctgttcgccagcaaacccttcacccacatcgat





ctgacccaggtgaccctgcagaactccgacaacagcgccatcgacgccaacaagctgaaacaggccgtgggcgacatctacaattaccggagat





tcgagcggcaattccagggctactttgcgggcggctacatcgacaagtacctggtgaagaacgtgaacacgaacaaggacagcgacgacgacct





ggtgtaccggagccttaaggagctgaacctgcatctggaagaagcctaccgggagggcgataacacatattaccgggtgaatgagaactactacc





ctggcgccagcatctacgagaacgagagagccagcagagatagcgaattccaaaacgaaatcctgaagcgggccgagcagaacggcgtgactt





tcgacgagaatattaagagaatcaccgcctccggaaagtacagcgtgcagtttcagaaactggaaaacgatacagactcaagcttggagcgcatg





accaaggccgtggaaggcctggtgaccgtaatcggcgaggaaaaattcgaaaccgtggacattaccggcgtgtcttctgacaccaacgaggtga





agagcctggctaaagagctgaagaccaacgccctgggcgtcaagctgaagctgtaa





Artificial


Protein M with radiolabel peptide tag (KGRPLVY) protein sequence (555 amino acids).


Including the human IL-2 signal sequence, radiolabel tag, linker, Mycoplasma genitalium


protein M


63) SEQ ID NO: 63




MYRMQLLSCIALSLALVTNSKGRPLVYGGSGGGGSTNLVNQSGYALVASGRSGNLG







FKLFSTQSPSAEVKLKSLSLNDGSYQSEIDLSGGANFREKFRNFANELSEAITNSPKGLDRPVP





KTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYYQSTEYFNNRVLMPILQTTNGTLMANN





RGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKWTYFAAKGSPLYDSYPNHFFEDVKTLAI





DAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLPESVKKVSLYGDYTGVNVAKQIFANV





VELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFTHIDLTQVTLQNSDNSAIDANKLKQA





VGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKDSDDDLVYRSLKELNLHLEEAYREGD





NTYYRVNENYYPGASIYENERASRDSEFQNEILKRAEQNGVTFDENIKRITASGKYSVQFQKL





ENDTDSSLERMTKAVEGLVTVIGEEKFETVDITGVSSDTNEVKSLAKELKTNALGVKLKL





Substitution #1: Alanine mutagenesis (underlined “A”) of a) 494-507 amino acid (highlighted


in green) and b) 527-540 amino acid (highlighted in green) predicted to be immunogenic in


Protein M (469-556 amino acid). See SEQ ID NO: 1 for the original sequence (37-556 amino acids).


64) SEQ ID NO: 64



TNLVNQSGYALVASGRSGNLGFKLFSTQSPSAEVKLKSLSLNDGSYQSEIDLSGGANF






REKFRNFANELSEAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYY





QSTEYFNNRVLMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKW





TYFAAKGSPLYDSYPNHFFEDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLP





ESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFT





HIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKD





SDDDLVYRSLKELNLHLEEAYREGDNTYYRVNENYYPGASIYENERASRDSEFQNEILKRAE





QNGVTFDENIKRITASGKYSVQFQALANATASALAAMTKAVEGLVTVIGEEKFETVAIAGVA





SATNAVASLAKELKTNALGVKLKL





Substitution #2: Alanine mutagenesis (underlined “A”) of a) 494-507 amino acid (highlighted


in green) and b) 527-540 amino acid (highlighted in green) predicted to be immunogenic in


Protein M (469-556 amino acid). See SEQ ID NO: 1 for the original sequence (37-556 amino acids).


65) SEQ ID NO: 65



TNLVNQSGYALVASGRSGNLGFKLFSTQSPSAEVKLKSLSLNDGSYQSEIDLSGGANF






REKFRNFANELSEAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYY





QSTEYFNNRVLMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKW





TYFAAKGSPLYDSYPNHFFEDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLP





ESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFT





HIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKD





SDDDLVYRSLKELNLHLEEAYREGDNTYYRVNENYYPGASIYENERASRDSEFQNEILKRAE





QNGVTFDENIKRITASGKYSVQFAKLANATASALARMTKAVEGLVTVIGEEKFETVAIAGVA





SATNAVKSLAKELKTNALGVKLKL





Substitution #3: Alanine mutagenesis (underlined “A”) of a) 494-507 amino acid (highlighted


in green) and b) 527-540 amino acid (highlighted in green) predicted to be immunogenic in


Protein M (469-556 amino acid). See SEQ ID NO: 1 for the original sequence (37-556 amino acids).


66) SEQ ID NO: 66



TNLVNQSGYALVASGRSGNLGFKLFSTQSPSAEVKLKSLSLNDGSYQSEIDLSGGANF






REKFRNFANELSEAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYY





QSTEYFNNRVLMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKW





TYFAAKGSPLYDSYPNHFFEDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLP





ESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFT





HIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKD





SDDDLVYRSLKELNLHLEEAYREGDNTYYRVNENYYPGASIYENERASRDSEFQNEILKRAE





QNGVTFDENIKRITASGKYSVQFQALEADADSALEAMTKAVEGLVTVIGEEKFETVDIAGVS






ADTAEVASLAKELKTNALGVKLKL






Substitution #4: Alanine mutagenesis (underlined “A”) of a) 494-507 amino acid (highlighted


in green) and b) 527-540 amino acid (highlighted in green) predicted to be immunogenic in


Protein M (469-556 amino acid). See SEQ ID NO: 1 for the original sequence (37-556 amino acids).


67) SEQ ID NO: 67



TNLVNQSGYALVASGRSGNLGFKLFSTQSPSAEVKLKSLSLNDGSYQSEIDLSGGANF






REKFRNFANELSEAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYY





QSTEYFNNRVLMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKW





TYFAAKGSPLYDSYPNHFFEDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLP





ESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFT





HIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKD





SDDDLVYRSLKELNLHLEEAYREGDNTYYRVNENYYPGASIYENERASRDSEFQNEILKRAE





QNGVTFDENIKRITASGKYSVQFAKLANDTASSAERATKAVEGLVTVIGEEKFETVAITGASS






ATNAVKALAKELKTNALGVKLKL







Armoracia rusticana



Horseradish peroxidase mature protein sequence (31-338 amino acids).


68) SEQ ID NO: 68



QLTPTFYDNSCPNVSNIVRDTIVNELRSDPRIAASILRLHFHDCFVNGCDASILLDNTTS






FRTEKDAFGNANSARGFPVIDRMKAAVESACPRTVSCADLLTIAAQQSVTLAGGPSWRVPLG





RRDSLQAFLDLANANLPAPFFTLPQLKDSFRNVGLNRSSDLVALSGGHTFGKNQCRFIMDRL





YNFSNTGLPDPTLNTTYLQTLRGLCPLNGNLSALVDFDLRTPTIFDNKYYVNLEEQKGLIQSD





QELFSSPNATDTIPLVRSFANSTQTFFNAFVEAMDRMGNITPLTGTQGQIRLNCRVVNSNS






Escherichia coli



Alkaline phosphatase mature protein sequence (22-471 amino acids).


69) SEQ ID NO: 69



RTPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDS






EITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYN





GALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCP





GNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSD





AASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLA





QMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNT





LVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYG





PHAANVVGLTDQTDLFYTMKAALGLK






Photinus pyralis



Luciferase protein sequence (1-550 amino acid).


70) SEQ ID NO: 70



MEDAKNIKKGPAPFYPLEDGTAGEQLHKAMKRYALVPGTIAFTDAHIEVNITYAEYF






EMSVRLAEAMKRYGLNTNHRIVVCSENSLQFFMPVLGALFIGVAVAPANDIYNERELLNSM





NISQPTVVFVSKKGLQKILNVQKKLPIIQKIIIMDSKTDYQGFQSMYTFVTSHLPPGFNEYDFV





PESFDRDKTIALIMNSSGSTGLPKGVALPHRTACVRFSHARDPIFGNQIIPDTAILSVVPFHEIGF





GMFTTLGYLICGFRVVLMYRFEEELFLRSLQDYKIQSALLVPTLFSFFAKSTLIDKYDLSNLHE





IASGGAPLSKEVGEAVAKRFHLPGIRQGYGLTETTSAILITPEGDDKPGAVGKVVPFFEAKVV





DLDTGKTLGVNQRGELCVRGPMIMSGYVNNPEATNALIDKDGWLHSGDIAYWDEDEHFFIV





DRLKSLIKYKGYQVAPAELESILLQHPNIFDAGVAGLPDDDAGELPAAVVVLEHGKTMTEKE





IVDYVASQVTTAKKLRGGVVFVDEVPKGLTGKLDARKIREILIKAKKGGKSKL





Artificial


Xpress tag, a peptide recognized by an antibody


71) SEQ ID NO: 71



DLYDDDDK






Artificial


E-tag, a peptide recognized by an antibody (13 amino acid)


72) SEQ ID NO: 72



GAPVPYPDPLEPR






Artificial


FLAG-tag, a peptide recognized by an antibody (8 amino acid)


73) SEQ ID NO: 73



DYKDDDDK






Artificial


HA-tag, a peptide recognized by an antibody (9 amino acid)


74) SEQ ID NO: 74



YPYDVPDYA






Artificial


HA-tag, a peptide recognized by an antibody (9 amino acid)


75) SEQ ID NO: 75



YPYDVPDYA






Artificial


His-tag, 5-10 histidines bound by a nickel or cobalt chelate or antibody (6 amino acid)


76) SEQ ID NO: 76



HHHHHH






Artificial


Myc-tag, a short peptide recognized by an antibody (14 amino acid)


77) SEQ ID NO: 77



EQKLISEEDLLRKR






Artificial


S-tag, a short peptide recognized by an antibody (15 amino acid)


78) SEQ ID NO: 78



KETAAAKFERQHMDS






Artificial


Softag 1, for mammalian expression, a short peptide recognized by an antibody (13 amino


acid)


79) SEQ ID NO: 79



SLAELLNAGLGGS






Artificial


VSV-tag, a peptide recognized by an antibody (11 amino acid)


80) SEQ ID NO: 80



YTDIEMNRLGK






Artificial


Softag 3, for prokaryotic expression, a short peptide recognized by an antibody (8 amino acid)


81) SEQ ID NO: 81



TQDPSRVG






Artificial


VS tag, a peptide recognized by an antibody (14 amino acid)


82) SEQ ID NO: 82



GKPIPNPLLGLDST






Artificial


Avi-Tag, a peptide allowing biotinylation by the enzyme BirA and so the protein can be


isolated by streptavidin and/or avidin (20 amino acid)


83) SEQ ID NO: 83



MAGGLNDIFEAQKIEWHEGG






Artificial


SBP-tag, a peptide which binds to streptavidin (38 amino acid)


84) SEQ ID NO: 84



MDEKTTGWRGGHVVEGLAGELEQLRARLEHHPQGQREP






Artificial


Strep-tag (Strep-tag II), a peptide which binds to streptavidin or the modified streptavidin


called streptactin (8 amino acid)


85) SEQ ID NO: 85



WSHPQFEK







Escherichia coli



BCCP (Biotin Carboxyl Carrier Protein), a protein domain biotinylated by BirA enabling


recognition by streptavidin (73-156 amino acids)


86) SEQ ID NO: 86



PAAAEISGHIVRSPMVGTFYRTPSPDAKAFIEVGQKVNVGDTLCIVEAMKMMNQIEA



DKSGTVKAILVESGQPVEFDEPLVVIE





Artificial


TC tag, a tetracysteine tag that is recognized by FlAsH and ReAsH biarsenical compounds (6


amino acid)


87) SEQ ID NO: 87



CCPGCC






Artificial


Calmodulin-tag, a peptide bound by the protein calmodulin (26 amino acid)


88) SEQ ID NO: 88



KRRWKKNFIAVSAANRFKKISSSGAL






Artificial


Polyglutamate tag, a peptide binding efficiently to anion-exchange resin such as Mono-Q (6


amino acids)


89) SEQ ID NO: 89



EEEEEE




Rhodococcus sp./Artificial



Halo-tag, a mutated hydrolase that covalently attaches to the HaloLin Resin (297 amino acid)


90) SEQ ID NO: 90



MAEIGTGFPFDPHYVEVLGERMHYVDVGPRDGTPVLFLHGNPTSSYVWRNIIPHVAP






THRCIAPDLIGMGKSDKPDLGYFFDDHVRFMDAFIEALGLEEVVLVIHDWGSALGFHWAKR





NPERVKGIAFMEFIRPIPTWDEWPEFARETFQAFRTTDVGRKLIIDQNVFIEGTLPMGVVRPLT





EVEMDHYREPFLNPVDREPLWRFPNELPIAGEPANIVALVEEYMDWLHQSPVPKLLFWGTPG





VLIPPAEAARLAKSLPNCKAVDIGPGLNLLQEDNPDLIGSEIARWLSTLEISG






Escherichia coli



Maltose binding protein-tag, a protein which binds to amylose agarose (27-396 amino acid)


91) SEQ ID NO: 91



KIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGP






DIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKD





LLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVG





VDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNY





GVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVAL





KSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQ





TRITK






Escherichia coli



Nus-tag, recognized by an antibody (1-495)


92) SEQ ID NO: 92



MNKEILAVVEAVSNEKALPREKIFEALESALATATKKKYEQEIDVRVQIDRKSGDFDT






FRRWLVVDEVTQPTKEITLEAARYEDESLNLGDYVEDQIESVTFDRITTQTAKQVIVQKVREA





ERAMVVDQFREHEGEIITGVVKKVNRDNISLDLGNNAEAVILREDMLPRENFRPGDRVRGVL





YSVRPEARGAQLFVTRSKPEMLIELFRIEVPEIGEEVIEIKAAARDPGSRAKIAVKTNDKRIDPV





GACVGMRGARVQAVSTELGGERIDIVLWDDNPAQFVINAMAPADVASIVVDEDKHTMDIAV





EAGNLAQAIGRNGQNVRLASQLSGWELNVMTVDDLQAKHQAEAHAAIDTFTKYLDIDEDF





ATVLVEEGFSTLEELAYVPMKELLEIEGLDEPTVEALRERAKNALATIAQAQEESLGDNKPAD





DLLNLEGVDRDLAFKLAARGVCTLEDLAEQGIDDLADIEGLTDEKAGALIMAARNICWFGDE





A






Escherichia coli



Thioredoxin-tag is commonly used in expression and purification of recombinant proteins. It


improves the solubility of that protein of interest. Recognized by an antibody (2-109 amino acid)


93) SEQ ID NO: 93



SDKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPCKMIAPILDEIADEYQGKLTVAKL



NIDQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLA





Artificial


Isopeptag, a peptide which binds covalently to pilin-C protein (16 amino acid)


94) SEQ ID NO: 94



TDKDMTITFTNKKDAE






Artificial


SpyTag, a peptide which binds covalently to SpyCatcher protein (13 amino acids)


95) SEQ ID NO: 95



AHIVMVDAYKPTK







Aequorea victoria



Green fluorescent protein-tag, a protein which is spontaneously fluorescent and can be bound


by antibodies (1-238 amino acid)


96) SEQ ID NO: 96



MSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPW






PTLVTTFSYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTL





VNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKVNFKIRHNIEDGSVQLADH





YQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITHGMDELYK





Artificial


Allows for cleavage by TEV protease between the Gln and Ser residues (7 amino acid)


97) SEQ ID NO: 97



ENLYFQS






Artificial


Allows for cleavage by Thrombin protease between Arg and Gly residues (6 amino acid)


98) SEQ ID NO: 98



LVPRGS






Artificial


Allows for cleavage by PreScission protease between the Gln and Gly residues (8 amino acid)


99) SEQ ID NO: 99



LEVLFQGP






Human


C1q A-chain mature amino acid sequence (23-245amino acid)


100) SEQ ID NO: 100



EDLCRAPDGKKGEAGRPGRRGRPGLKGEQGEPGAPGIRTGIQGLKGDQGEPGPSGNP






GKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQPRPAFSAIRRNPPMGGNVVIFDTVITNQEEP





YQNHSGRFVCTVPGYYYFTFQVLSQWEICLSIVSSSRGQVRRSLGFCDTTNKGLFQVVSGGM





VLQLQQGDQVWVEKDPKKGHIYQGSEADSVFSGFLIFPSA





Human


C1q B-chain mature amino acid sequence (28-253 amino acid)


101) SEQ ID NO: 101



QLSCTGPPAIPGIPGIPGTPGPDGQPGTPGIKGEKGLPGLAGDHGEFGEKGDPGIPGNP






GKVGPKGPMGPKGGPGAPGAPGPKGESGDYKATQKIAFSATRTINVPLRRDQTIRFDHVITN





MNNNYEPRSGKFTCKVPGLYYFTYHASSRGNLCVNLMRGRERAQKVVTFCDYAYNTFQVT





TGGMVLKLEQGENVFLQATDKNSLLGMEGANSIFSGFLLFPDMEA





Human


C1q C-chain mature amino acid sequence (29-245 amino acid)


NTGCYGIPGMPGLPGAPGKDGYDGLPGPKGEPGIPAIPGIRGPKGQKGEPGLPGHPGK





NGPMGPPGMPGVPGPMGIPGEPGEEGRYKQKFQSVFTVTRQTHQPPAPNSLIRFNAVLTNPQ





GDYDTSTGKFTCKVPGLYYFVYHASHTANLCVLLYRSGVKVVTFCGHTSKTNQVNSGGVLL





RLQVGEEVWLAVNDYYDMVGIQGSDSVFSGFLLFPD





Claims
  • 1. A conjugate comprising: a first polypeptide having a specific binding affinity for an immunoglobulin molecule, wherein the first polypeptide has at least 99% identity over its entire length to one of SEQ ID NOS: 1, 2, and 63-66; conjugated to a second polypeptide having a specific binding affinity for a pathogen, a toxin or a cancer cell.
  • 2. The conjugate of claim 1, wherein the second polypeptide is a protein fragment of a cellular receptor.
  • 3. The conjugate of claim 2, wherein the second polypeptide has at least 95% identity over its entire length to one of SEQ ID NOS: 15-36.
  • 4. The conjugate of claim 3, wherein the pathogen is SARS-CoV-2 virus and the second polypeptide comprises the sequence of SEQ ID NO: 15.
  • 5. The conjugate of claim 1, wherein the second polypeptide is a single-domain antibody.
  • 6. The conjugate of claim 1, further comprising a spacer between the first polypeptide and the second polypeptide.
  • 7. The conjugate of claim 6, wherein the spacer is a cleavable peptide having a sequence of one of SEQ ID NOS: 96-98.
  • 8. The conjugate of claim 1, wherein the conjugate protein bound to an immunoglobulin is capable of recruiting the C1q complement component.
  • 9. A fusion protein comprising: a first polypeptide having a specific binding affinity for an immunoglobulin molecule, wherein the first polypeptide has at least 99% identity over its entire length to one of SEQ ID NOS: 1, 2, and 63-66; genetically fused toa second polypeptide having a specific binding affinity for a pathogen, a toxin or a cancer cell.
  • 10. The fusion protein of claim 9, wherein the second polypeptide is a protein fragment of a cellular receptor.
  • 11. The fusion protein of claim 10, wherein the second polypeptide has at least 95% identity over its entire length to one of SEQ ID NOS: 15-36.
  • 12. The fusion protein of claim 11, wherein the pathogen is SARS-CoV-2 virus and the second polypeptide comprises the sequence of SEQ ID NO: 15.
  • 13. The fusion protein of claim 9, wherein the second polypeptide is a single-domain antibody.
  • 14. The fusion protein of any one of claim 9, further comprising a spacer between the first polypeptide and the second polypeptide.
  • 15. The fusion protein of claim 14, wherein the spacer is a cleavable peptide having a sequence of one of SEQ ID NOS: 96-98.
  • 16. The fusion protein of claim 9, wherein the fusion protein bound to an immunoglobulin is capable of recruiting the C1q complement component.
  • 17. A method for inactivating, eliminating or eradicating a bloodborne pathogen or cancer cell in a subject, comprising: obtaining a sample of blood, serum or plasma from the subject or from a donor compatible with the subject, wherein the sample comprises immunoglobulins;adding the conjugate of claim 1 or the fusion protein of claim 9 to the sample,wherein the first polypeptide binds to the immunoglobulins present in the sample, andwherein the second polypeptide has a specific binding affinity for the pathogen or cancer cell; andadministrating the sample comprising the conjugate or fusion protein bound to the immunoglobulins to the subject;thereby inactivating, eliminating or eradicating the bloodborne pathogen or cancer cell in the subject.
  • 18. The method of claim 17, wherein the second polypeptide is a protein fragment of a cellular receptor.
  • 19. The method of claim 18, wherein the second polypeptide has at least 95% identity over its entire length to one of SEQ ID NOS: 15-18, 24-25, 28, and 30-34.
  • 20. The method of claim 19, wherein the pathogen is SARS-CoV-2 virus and the second polypeptide comprises the sequence of SEQ ID NO: 15.
  • 21. The method of claim 17, wherein the second polypeptide is a single-domain antibody.
  • 22. The method of claim 17, the fusion protein comprises a spacer between the first polypeptide and the second polypeptide.
  • 23. The method of claim 22, wherein the spacer is a cleavable peptide having a sequence of one of SEQ ID NOS: 96-98.
  • 24. The method of claim 17, wherein the conjugate or fusion protein bound to the immunoglobulins inactivates, eliminates or eradicates the bloodborne pathogen or cancer cell via recruitment of the C1q complement component.
  • 25. A method for neutralizing a toxin in a subject, comprising: obtaining a sample of blood, serum or plasma from the subject or from a donor compatible with the subject, wherein the sample comprises immunoglobulins;adding the conjugate of claim 1 or the fusion protein of claim 9 to the sample,wherein the first polypeptide binds to the immunoglobulins present in the sample, andwherein the second polypeptide has a specific binding affinity for the toxin; andadministrating the sample comprising the conjugate or fusion protein bound to the immunoglobulins to the subject;thereby neutralizing the toxin in the subject.
  • 26. The method of claim 25, wherein the second polypeptide is a protein fragment of a cellular receptor.
  • 27. The method of claim 26, wherein the second polypeptide has at least 95% identity over its entire length to one of SEQ ID NOS: 19-29 and 35-36.
  • 28. The method of claim 25, wherein the second polypeptide is a single-domain antibody.
  • 29. The method of claim 25, the fusion protein comprises a spacer between the first polypeptide and the second polypeptide.
  • 30. The method of claim 29, wherein the spacer is a cleavable peptide having a sequence of one of SEQ ID NOS: 96-98.
CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims priority to U.S. provisional application 63/079,815 filed Sep. 17, 2020, the content of which is incorporated herein.

Non-Patent Literature Citations (4)
Entry
Arfi et al. (2021) mBio 12(6)e0197421: 1-13.
Blötz et al. (2020) Front. Microbiol. 11(685): 1-12.
Grover et al. (2014) Science 343(6171): 656-661.
Rentero et al. (2011) Chimia 65: 843-845.
Related Publications (1)
Number Date Country
20220089656 A1 Mar 2022 US
Provisional Applications (1)
Number Date Country
63079815 Sep 2020 US