Protein M fusion proteins and uses

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR DEVELOPMENT

Not Applicable.

SEQUENCE LISTING ON ASCII TEXT

This patent application file contains a Sequence Listing submitted in computer readable ASCII text format (file name: DELA-02-US-Sequence-Listing.txt, date recorded: Aug. 30, 2021, size: 267,396 bytes). The Sequence Listing, which is a part of the present disclosure, includes a computer readable form and a written sequence listing comprising nucleotide and/or amino acid sequences of the present invention. The sequence listing information recorded in computer readable form is identical to the written sequence listing. The content of the Sequence Listing file is incorporated herein by reference in its entirety.

TECHNICAL FIELD

The present teachings relate to methods and compositions that utilize Protein M fusion proteins. Some of the disclosed methods and compositions relate to methods of neutralizing or eradicating various human pathogens and toxins.

INTRODUCTION

Many emerging and known pathogens continue to present a serious threat to human health and safety. In the past few decades, many infectious diseases, such as those caused by the SARS-CoV-2 virus, the human immunodeficiency virus (HIV) and others have effectively migrated from animal to human hosts and devastated entire populations and economies. Despite some successes in treatment of these pathogens, the options remain limited or not available, like in the case of the SARS-CoV-2 virus. Thus, there remains a need in the art for an efficient general method for neutralizing pathogens or clearing out pathogens from human body.

SUMMARY

The present teachings include a method for neutralizing a pathogen, wherein the pathogen has a specific binding affinity for a receptor fragment, the method comprising: providing conditions for interaction between the pathogen and a fusion protein that comprises a polypeptide having at least 90% identity over its entire length with either the sequence SEQ ID NO: 1 or the sequence SEQ ID NO: 2 conjugated to the receptor fragment, whereby the fusion protein binds to and neutralizes the pathogen.

In accordance with a further aspect, the receptor fragment is a protein fragment of a cellular receptor.

In accordance with a further aspect, the pathogen is SARS-CoV-2 virus and the receptor fragment has the sequence SEQ ID NO:15.

In accordance with a further aspect, conjugation of the polypeptide and the receptor fragment is made through a spacer.

In accordance with a further aspect, the spacer is a peptide having one of the following sequences: SEQ ID NO: 12-14.

In accordance with a further aspect, the receptor fragment comprises one of the following sequences: SEQ ID NO: 16-36.

In accordance with a further aspect, the receptor fragment has one of the following sequences: SEQ ID NO: 16-36.

In accordance with a further aspect, the fusion protein neutralizes the pathogen via recruitment of C1q protein.

The present teachings also include a method for eradicating a bloodborne pathogen in a subject, wherein the pathogen has a specific binding affinity for a receptor fragment inside the subject's body, the method comprising:

receiving a sample of blood, serum or plasma from the subject or from a donor compatible with the subject, wherein the sample comprises immunoglobulins;

adding a fusion protein that comprises a polypeptide having at least 90% identity over its entire length with either the sequence SEQ ID NO:1 or the sequence SEQ ID NO:2 conjugated to the receptor fragment to the sample, wherein the fusion protein binds to the immunoglobulins present in the sample;

administrating the sample having the fusion protein bound to the immunoglobulins into the subject's body, in an amount sufficient to eradicate the pathogen in the subject.

In accordance with a further aspect, the receptor fragment comprises one of the following sequences: SEQ ID NO: 16-36.

In accordance with a further aspect, the fusion protein bound to the immunoglobulins eradicates the pathogen via recruitment of C1q protein.

The present teachings also include a fusion protein having a specific binding affinity for an immunoglobulin molecule, comprising a polypeptide having at least 90% identity over its entire length with either the sequence SEQ ID NO:1 or the sequence SEQ ID NO:2 conjugated to a fusion partner, wherein the fusion partner has a sequence that is at least 90% identical to one of the following sequences: SEQ ID NO: 15-36.

In accordance with a further aspect, conjugation of the polypeptide and the fusion partner is made through a spacer.

In accordance with a further aspect, the spacer is a cleavable peptide having one of the following sequences: SEQ ID NO: 96-98.

The present teachings also include a method for neutralizing a toxin in a subject, wherein the toxin has a specific binding affinity for a receptor fragment, the method comprising:

receiving a sample of blood, serum or plasma from the subject or from a donor compatible with the subject, wherein the sample comprises immunoglobulins;

adding a conjugated protein that comprises a polypeptide having at least 90% identity over its entire length with either the sequence SEQ ID NO:1 or the sequence SEQ ID NO:2 conjugated to the receptor fragment to the sample, wherein the conjugated protein binds to the immunoglobulins present in the sample;

administrating the sample having the conjugated protein bound to the immunoglobulins into the subject's body, in an amount sufficient to eradicate the toxin in the subject.

In accordance with a further aspect, the receptor fragment comprises one of the following sequences: SEQ ID NO: 16-36.

The present teachings also include a method for detecting immunoglobulins that are present in a solution or on a solid support matrix, but not bound to their cognate antigen, the method comprising: contacting immunoglobulins with conjugated proteins in the solution, wherein each conjugated protein comprises a polypeptide having at least 90% identity over its entire length with either the sequence SEQ ID NO:1 or the sequence SEQ ID NO:2 conjugated to a detectable probe, whereby the conjugated proteins bind to immunoglobulins that are not bound to their cognate antigen; separating conjugated proteins that are bound to immunoglobulins from conjugated proteins that are not bound to immunoglobulins; detecting the conjugated proteins that are bound to immunoglobulins by utilizing the detectable probe, thereby detecting immunoglobulins that are not bound to their cognate antigen. Examples of solid support matrix include: blots, beads, microplate well, resin.

In accordance with a further aspect, conjugation of the polypeptide and the detectable probe is made through a spacer.

In accordance with a further aspect, the spacer is a cleavable peptide having one of the following sequences: SEQ ID NO: 96-98.

In accordance with a further aspect, the detectable probe is an enzyme that has a fluorogenic, luminescent or chromogenic substrate.

In accordance with a further aspect, the detectable probe is a protein having a sequence chosen from SEQ ID NO:67-69.

In accordance with a further aspect, the detectable probe is a fluorescent or a luminescent or a radioactive molecule.

In accordance with a further aspect, the detectable probe is an epitope tag having a sequence chosen from SEQ ID NO: 70-81.

In accordance with a further aspect, the detectable probe is a polypeptide having a sequence chosen from SEQ ID NO:82-85 and configured to bind streptavidin and/or avidin.

In accordance with a further aspect, the detectable probe is a polypeptide having a sequence chosen from SEQ ID NO:86-92 or from SEQ ID NO:93-94, and configured to attach to its cognate binding partner, either covalently or non-covalently.

In accordance with a further aspect, the detectable probe is a fluorescent protein having the sequence SEQ ID NO:95.

The present teachings also include a codon-optimized polynucleotide that encodes the fusion protein according to claim 14.

In accordance with a further aspect, the codon-optimized polynucleotide according to claim 29 has a sequence that is at least 95% identical to one of the following nucleic acid sequences: SEQ ID NO: 40-61.

In accordance with a further aspect, the codon-optimized polynucleotide according to claim 29 is inserted in a vector configured for replication and protein expression in mammalian cells.

These and other features, aspects and advantages of the present teachings will become better understood with reference to the following description, examples and appended claims.

BRIEF DESCRIPTION OF THE DRAWINGS

Those of skill in the art will understand that the drawings, described below, are for illustrative purposes only. The drawings are not intended to limit the scope of the present teachings in any way.

FIG. 1. A schematic representation of plasma antibodies of differing isotypes and differing specificities mixed with armY-ACE2 fusion protein. When armY-ACE2 is added to the plasma, armY-ACE2 binds and blocks the antibody antigen-binding region via the armY component (Protein M). The ACE2 component of the fusion protein endows the antibodies with a new binding specificity to the spike protein of SARS-CoV-2, allowing the antibodies to bind SARS-CoV-2 and mark for eradication.

FIG. 2. A schematic representation of one proposed route of administration of armY-ACE2 therapeutic, whereby plasma (containing antibodies) is obtained from the patient or a ABO-compatible donor through apheresis. A measured amount of armY-ACE2 is added to the plasma antibodies, which then acquire SARS-CoV-2 specificity as described in FIG. 1 and as demonstrated in Example 16, FIG. 14 below. A patient with active COVID-19 is treated with armY-ACE2 plasma antibodies and allowed to fully recover.

FIG. 3. Myc-tagged Protein M binds to human IgG antibody coated on wells in a dose-dependent fashion. Bound Protein M is detected via its myc-tag using a mouse IgG1 anti-myc antibody followed by an HRP-labeled goat anti-mouse IgG. Neither antibodies bind the human IgG antibody coated on the well (assay buffer, no Protein M).

FIG. 4A-B. Goat anti-human IgG (FIG. 4A) or Chicken anti-human IgG (FIG. 4B) is neutralized or blocked by Protein M in a dose-dependent fashion and prevented from binding their antigen human IgG, immobilized on the wells.

FIG. 5A-D. Protein M-HRP fusion protein detection of antibodies. Detection of goat F(ab′)₂antibody fragment (FIG. 5A), dose-dependent detection of human IgG antibody (FIG. 5B) and detection of two mouse monoclonal IgG1 antibodies: anti-myc and anti-CD28 (FIG. 5C) coated on wells. FIG. 5D. Protein M-HRP indirect detection of mouse IgG1 antibody in solution. Protein M-HRP incubated with increasing concentration of mouse IgG1 antibody resulted in the loss of Protein M binding to the human IgG antibody coated on the wells. In the absence of mouse IgG1 antibody in solution, binding of Protein M-HRP to the human IgG antibody coated on the wells is not hindered.

FIG. 6A-B. Antibodies bound to antigen are not detected by Protein M. Myc-specific mouse antibody bound to myc-tagged protein coated on wells in (FIG. 6A) or ASIP-specific rabbit antibody bound to ASIP protein coated on wells in (FIG. 6B) failed to be detected by Protein M-HRP. This is consistent with the described function of Protein M and its inability to bind antibodies already engaged in a complex with their cognate antigen. Presence of antibodies bound to their coated antigen was confirmed using detecting HRP-labeled anti-mouse IgG (A) and anti-rabbit IgG-biotin/streptavidin-HRP (B).

FIG. 7A-B). FIG. 7A. Biotinylated irrelevant (non-specific) antibody binds to SARS-CoV-2 spike protein coated on the wells only when the antibody is in a complex with armY-ACE2. ACE2 domain mediates binding between the spike protein and armY-bound antibody since antibody with Protein M lacking the ACE2 domain is unable to bind the spike protein coated on the well. As expected, antibody alone does not bind the SARS-CoV-2 spike protein, which requires a physical association with armY-ACE2 that binds the spike protein coated on the well. Moreover, armY-ACE2 alone does not produce any signals since the presence of signal requires the interaction between the biotinylated antibody and armY-ACE2 bound to the spike protein coated on the well. FIG. 7 B. Anti-histidine (his) tag was used to detect the his (6× histidine)-tagged SARS-CoV-2 spike protein in a complex with armY-ACE2. Although, not included in this experiment, SARS-CoV-2 spike protein does not bind to human IgG (previous observation), a complex between armY-ACE2 and SARS-CoV-2 spike protein was required for detection by anti-his tag antibody. Myc-specific antibody detected myc-tagged armY-ACE2 binding to human IgG coated wells regardless of whether it was bound or not bound to SARS-CoV-2 spike protein.

FIG. 8. Purified human IgG or antibodies in human serum formed a complex with armY-ACE2 in solution preventing army-ACE2 binding to antibody coated on the wells. As expected, in the absence of antibody in solution binding of armY-ACE2 to the antibody coated wells was not prevented.

FIG. 9. armY-ACE2 engaged antibody binds to K562 cells expressing FcγRII receptor (right panel, army-ACE2+antibody). No binding to K562 cells was observed in the absence of antibody (left panel, army-ACE2 alone, no antibody), therefore, the observed binding of armY-ACE2 to K562 cells is dependent upon the association between the antibody and armY-ACE2.

FIG. 10. Binding of [armY-ACE2+antibody] to K562 cells is prevented by blocking FcγRII receptor using anti-CD32 (IV.3) (left panel). Binding is not blocked by the isotype-match antibody (right panel), demonstrating that armY-ACE2 engaged antibody maintains Fc-receptor binding activity.

FIG. 11A-B. FIG. 11A. Only a complex between armY-ACE2 and mouse IgM resulted in binding to immobilized human C1q complement component. FIG. 11B. Binding of [armY-ACE2+mouse IgG1] complexes to immobilized human C1q complement component is inhibited by pre-incubation of the complex with soluble human C1q, which also suggest that army-ACE2 primes mouse IgG1 antibody to bind C1q in solution. A complex between Protein M (lacking ACE2) with mouse IgG1 did not result in binding to C1q, suggesting the requirement of the fusion partner domain, ACE2, to induce a conformation resulting in antibody binding to C1q.

FIG. 12A-B. FIG. 12A. armY-ACE2 exhibits ACE2 activity in a dose dependent fashion. FIG. 12B. [armY-ACE2+antibody] complexes exhibit ACE2 activity comparable to armY-ACE2 alone, suggesting that binding of armY-ACE2 to antibodies does not interfere with the enzymatic function of ACE2.

FIG. 13A-B. FIG. 13A. Diagram of armY-ACE2 construct showing a myc-tag at the N-terminus, followed by the human ACE2, a linker and Protein M “armY” at the C-terminus. FIG. 13B. Photograph of the SDS-PAGE gel of purified non-reduced (left lane) and reduced (right lane) of armY-ACE2 showing a ˜180 kDa protein band.

FIG. 14A-C. FIG. 14A. Non-immune serum antibodies armed with armY-ACE2 gain the ability to bind to SARS-CoV-2 spike protein. Unarmed non-immune serum (pre-vaccine) does not bind to SARS-CoV-2 spike protein (a) but gain the ability to bind after incubation with armY-ACE2 (b). Approximately one month post-vaccination with the Moderna Covid19 vaccine, serum antibodies bind to the SARS-CoV-2 spike protein coated on the well as expected and served as an assay positive control (c). The assay does not detect armY-ACE2 alone when added to the SARS-CoV-2 spike protein coated wells (d), suggesting a requirement for serum antibodies to be in a stable complex with armY-ACE2 to bind the SARS-CoV-2 spike protein for assay detection. The photo to the right, representative of the duplicate wells, shows the corresponding SARS-CoV-2 spike protein coated wells 20 minutes after the addition of the mixtures, followed by addition of detecting antibody and addition of the TMB substrate, which give rise to the appearance of the blue color product indicative of antibody presence bound to the SARS-CoV-2 spike protein. FIG. 14B. Non-immune plasma (anticoagulant: ACD-A) antibodies armed with armY-ACE2 also gain the ability bind to SARS-CoV-2 spike protein, comparable to non-immune serum antibodies armed with armY-ACE2. As expected, unarmed serum or plasma antibodies do not bind to SARS-CoV-2 spike protein. FIG. 14C. Less than 1 ug/ml of free-unengaged army-ACE2 remain detectable after a 60 minutes incubation with either serum- or plasma-antibodies at 37° C., suggesting at least 95% of armY-ACE2 added (20 ug/ml) readily engage and arm antibodies in solution.

FIG. 15A-B. FIG. 15A. Monoclonal antibody (mAb) armed with armY-ACE2 gains the ability to bind the SARS-CoV-2 spike protein, while in FIG. 15B mAb is no longer able to bind its natural target antigen.

FIG. 16. Specific detection of antibody light-chain (LC, ˜25 KDa), but not heavy-chain (HC, ˜50 KDa) by mono-biotinylated protein M on 1D gel electrophoresis by Western blot analysis. Antibody sample was loaded alone (right lane, Ab) or in a mixture with E. coli lysate (middle lane, E+Ab). E. coli lysate alone was also loaded (left lane, E) as control. The molecular weight standard values (KDa, left) are derived from the Coomassie blue stained blots.

FIG. 17. Biotinylated Protein M is immobilized in streptavidin-coated wells and serves as a surrogate antigen for a monoclonal antibody. Increasing amount of antibody is added to the wells and the level of bound antibody is measured in an ELISA-based method.

FIG. 18. Protein M fusion blocks the binding of a monoclonal antibody (mAb) to its natural antigen thereby a) shows that antigen binding is Fab dependent and b) confirms the antibody's binding specificity.

DETAILED DESCRIPTION
Abbreviations and Definitions

Unless otherwise noted, technical terms are used according to conventional usage. All publications, patent applications, patents, and other references mentioned herein are incorporated by reference in their entirety for all purposes. To facilitate understanding of the invention, a number of terms and abbreviations as used herein are defined below as follows.

The terms “polypeptide”, “protein” and “peptide” are used herein interchangeably to refer to amino acid chains in which the amino acid residues are linked by peptide bonds or modified peptide bonds. The amino acid chains can be of any length of greater than two amino acids. Unless otherwise specified, the terms “polypeptide”, “protein” and “peptide” also encompass various modified forms thereof. Such modified forms may be naturally occurring modified forms or chemically modified forms. Examples of modified forms include, but are not limited to, glycosylated forms, phosphorylated forms, myristoylated forms, palmitoylated forms, ribosylated forms, acetylated forms, and the like. Modifications also include intra-molecular crosslinking and covalent attachment of various moieties such as lipids, flavin, biotin, polyethylene glycol or derivatives thereof, and the like. In addition, modifications may also include cyclization, branching and cross-linking. Further, amino acids other than the conventional twenty amino acids encoded by genes may also be included in a polypeptide. The term “polypeptide” or “protein” may also encompass a “purified” polypeptide that is substantially separated from other polypeptides in a cell or organism in which the polypeptide naturally occurs (e.g., 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99%, 100% free of contaminants).

Conservative changes: As used herein, when referring to mutations in a nucleic acid molecule, “conservative changes” are those in which at least one codon in the protein-coding region of the nucleic acid has been changed such that at least one amino acid of the polypeptide encoded by the nucleic acid sequence is substituted with another amino acid having similar characteristics. Examples of conservative amino acid substitutions are ser for ala, thr, or cys; lys for arg; gin for asn, his, or lys; his for asn; glu for asp or lys; asn for his or gin; asp for glu; pro for gly; leu for ile, phe, met, or val; val for ile or leu; ile for leu, met, or val; arg for lys; met for phe; tyr for phe or trp; thr for ser; trp for tyr; and phe for tyr.

Isolated polypeptide: The term “isolated polypeptide” as used herein means a polypeptide molecule is present in a form other than found in nature in its original environment with respect to its association with other molecules. The term “isolated polypeptide” encompasses a “purified polypeptide”, which is used herein to mean that a specified polypeptide is in a substantially homogenous preparation, substantially free of other cellular components, other polypeptides, viral materials, or culture medium, or when the polypeptide is chemically synthesized, substantially free of chemical precursors or byproducts associated with the chemical synthesis. For a purified polypeptide, preferably the specified polypeptide molecule constitutes at least 15 percent of the total polypeptide in the preparation. A “purified polypeptide” can be obtained from natural or recombinant host cells by standard purification techniques, or by chemical synthesis.

An “isolated” biological component (such as a nucleic acid molecule, protein, or virus) has been substantially separated or purified away from other biological components (e.g., other chromosomal and extra-chromosomal DNA and RNA, proteins and/or organelles). Nucleic acids, proteins, and/or viruses that have been “isolated” include nucleic acids, proteins, and viruses purified by standard purification methods. The term also embraces nucleic acids, proteins, and viruses prepared by recombinant expression in a host cell, as well as chemically synthesized nucleic acids or proteins. The term “isolated” (or purified) does not require absolute purity; rather, it is intended as a relative term. Thus, for example, an isolated or purified nucleic acid, protein, virus, or other active compound is one that is isolated in whole or in part from associated nucleic acids, proteins, and other contaminants.

As used herein, the term “vector” refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. The term “vector” comprises an “expression vector”, e.g. a vector that is capable of directing the expression of genes to which they are operatively linked. The vector often includes sequences that effect the expression of a desirable molecule, e.g., a promoter, a coding region and a transcriptional termination sequence. An expression vector can be an integrative vector (i.e., a vector that can integrate into the host genome), or a vector that does not integrate but self-replicates, in which case, the vector includes an origin of replication which permits the entire vector to be reproduced once it is within the host cell. Preferred vectors are those capable of autonomous replication and/expression of nucleic acids to which they are linked.

Nucleic acid molecules encoding fusion proteins are also within the scope of the invention. Such nucleic acids can be made by preparing a construct (e g., an expression vector) that expresses a fusion protein when introduced into a suitable host. For example, such a construct can be made by ligating a first polynucleotide encoding a single-domain antibody, or fragment or variant thereof, fused in frame with a second polynucleotide encoding another protein such that expression of the construct in a suitable expression system yields a fusion protein. Polynucleotides that encode fusion proteins can be present in isolation, or can be inserted in a vector for expression in cells. Such vector may be suitable for replication and protein expression in bacterial, mammalian or insect cells. Polynucleotides that encode fusion proteins can be codon-optimized for expression in particular type of cells by standard methods known in the art.

A “codon-optimized” nucleic acid or polynucleotide refers to a nucleic acid sequence that has been altered such that the codons are optimal for expression in a particular system (such as a particular species or group of species). For example, a nucleic acid sequence can be optimized for expression in mammalian cells or in a particular mammalian species (such as human cells). Codon optimization does not alter the amino acid sequence of the encoded protein.

The term “neutralizing a pathogen” used herein is synonymous to “inactivating a pathogen” and means that the pathogen will no longer be able to interact with a specific receptor molecule either in vitro or in vivo, or will no longer be able to infect cells of an organism.

The term “neutralizing a toxin” used herein is synonymous to “inactivating a toxin” and means that the toxin will no longer be able to interact with its target, either in vitro, or in a subject's body.

The term “eradicating a pathogen” used herein refers to neutralizing the pathogen in a subject.

As used herein, the term “Protein M” or “armY” refers to antibody-binding fragment of protein from Mycoplasma genitalium that has an amino acid sequence SEQ ID NO:1 (Grover R K, et al., Science, 2014), or to antibody-binding fragment of protein from Mycoplasma pneumoniae that has an amino acid sequence SEQ ID NO:2 (Blötz C, et al., Front Microbiol. 2020), or to a polypeptide with immunoglobulin-binding activity having a sequence with at least 90% identity over its entire length to one of the following sequences: SEQ ID NO: 3-8. In some embodiments, the term “Protein M” or “armY” also includes an immunoglobulin-binding fragment of Protein M from Mycoplasma genitalium or Mycoplasma pneumoniae.

As used herein, the term “ACE2” refers to the human cellular angiotensin-converting enzyme 2 receptor.

As used herein, the term “fusion protein” refers to an artificial, non-natural polypeptide that consists of at least two unrelated covalently linked polypeptides. The linkage between these polypeptides can be of different nature, including a peptide bond, a short flexible amino acid spacer, or a spacer of another type. The spacer joins the polypeptides together, yet preserves some distance between the polypeptides such that both polypeptides can properly fold independently.

The term “immunoglobulin,” “Ig” or “antibody” (used interchangeably herein) refers to a glycoprotein formed in response to administration of bacteria, viruses or other antigens to a mammalian organism, said glycoprotein has the ability to specifically bind cognate antigen and consists of two heavy (H) chains and two light (L) chains connected and stabilized by interchain disulfide bonds. Immunoglobulins or antibodies may be monoclonal or polyclonal and may exist in monomeric or polymeric form, for example. IgM antibodies which exist in pentameric form and/or IgA antibodies which exist in monomeric, dimeric or multimeric form. The term “fragment” refers to a part or portion of an antibody or antibody chain comprising fewer amino acid residues than an intact or complete antibody or antibody chain. Fragments can be obtained via chemical or enzymatic treatment of an intact or complete antibody or antibody chain. Fragments can also be obtained by recombinant means. Exemplary fragments include Fab, Fab′, F(ab′)₂, Fc and/or Fv fragments.

The term “antigen-binding fragment” refers to a polypeptide portion of an immunoglobulin or antibody that binds an antigen or competes with intact antibody (i.e. with the intact antibody from which they were derived) for antigen binding (i.e. specific binding). Binding fragments can be produced by recombinant DNA techniques, or by enzymatic or chemical cleavage of intact immunoglobulins. Binding fragments include Fab, Fab′, F(ab′)₂, Fv, single chains, and single-chain antibodies.

As used herein, the term “toxin” refers to an endogenous entity or exogenous substance that is harmful to a subject (preferably, human subject). Examples of harmful endogenous entities are excessive inflammatory cytokines that may be produced during a cytokine storm in the subject. A harmful endogenous entity can be soluble or membrane bound. Examples of harmful exogenous substances are Botulinum neurotoxin A, Botulinum neurotoxin B, Staphylococcal enterotoxin A and B, Staphylococcal enterotoxin A, Staphylococcal enterotoxin B, Clostridium perfringens Epsilon toxin (ETX), Ricin, Anthrax.

As used herein, the term “donor compatible with the subject” refers to a human subject having compatibility for a blood transfusion (compatibility based on ABO blood groups, Rh Type).

As used herein, the term “receptor fragment” refers to a fragment of a protein to which a pathogen (usually, a protein from the pathogen's coat) or a toxin has a specific binding affinity, or can specifically bind. Preferably, receptor fragment is a protein fragment of a cellular receptor that the pathogen or toxin binds to and utilizes to enter the cell. Preferably, receptor fragment is located inside a subject's body.

Unless otherwise defined, technical and scientific terms used in the present teachings described herein shall have the meanings that are commonly understood by those of ordinary skill in the art. Further, unless otherwise required by context, plural terms shall include the singular and singular terms shall include pluralities. Generally, nomenclatures utilized in connection with molecular biology, cell and tissue culture, protein and oligo- or polynucleotide chemistry described herein are well-known and commonly used in the art. Standard techniques are used, for example, for recombinant nucleic acid and protein preparation, purification and analysis, for oligonucleotide synthesis. Purification techniques and enzymatic reactions are performed according to manufacturer's specifications or as described herein or as commonly accomplished in the art. The techniques and procedures described herein are generally performed according to conventional methods well known in the art and as described in various general references that are cited and discussed throughout the instant specification. See, e.g., Sambrook et al., Molecular Cloning: A Laboratory Manual (Third ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. 2000). The nomenclatures utilized in connection with, and the laboratory procedures and techniques described herein are those well-known and commonly used in the art.

The present invention is directed to methods and compositions for inactivating or eliminating a pathogen, preferably a bloodborne pathogen having a specific binding affinity for a receptor fragment, by utilizing a fusion protein that comprises Protein M and the receptor fragment. Preferably, Protein M is chosen from an extracellular domain of Mycoplasma genitalium protein (Grover R K, et al., Science, 2014; SEQ ID NO: 1) or an extracellular domain of Mycoplasma pneumoniae protein (Blötz C, et al., Front Microbiol. 2020; SEQ ID NO: 2) that strongly bind to immunoglobulin molecules (antibodies). Typical binding affinities (K_d) of Protein M to immunoglobulin molecules are from 1.2 to 5.2 nM (Grover R K, et al., Science, 2014).

Orthologs of Protein M can be found in several related species of Mycoplasma: M. penetrans, Mycoplasma tullyi, Mycoplasma iowae, Mycoplasma imitans, Mycoplasma alvi and M. gallisepticum (disclosed herein in the Sequence listing). These sequences are also disclosed herein and can be used to create fusions or fusion proteins according to the present invention. Protein M is functionally similar to other bacterial-derived proteins that bind antibodies (e.g., protein A, protein G and protein L) with the exception that Protein M blocks the antibody's binding site and prevent it from binding its cognate antigen. Therefore, harnessing the antibody binding property of Protein M, it can be used to couple any attached compounds (e.g., genetic fusion or chemical conjugation) to an antibody regardless of the antibody's specificity. Consequently, interaction with the Protein M fusion protein will result in the loss of the antibody's specificity and acquire the specificity as that of the attached compound. The properties of Protein M fusion protein with the compound will be a combination of the antibody's stability, antibody's functional properties (such as ability to engage Fc receptors on immune cells, activate the complement system, an increased binding avidity and the compound properties (affinity to a pathogen).

Preferred nucleic acid molecules for use in the invention are polynucleotides that encode fusion proteins shown herein in the appended Sequence Listing. Nucleic acid molecules utilized in the present invention may be in the form of RNA or in the form of DNA (e.g., cDNA, genomic DNA, and synthetic DNA). The nucleic acid molecule may be double-stranded or single-stranded, and if single-stranded may be the coding (sense) strand or non-coding (anti-sense) strand. The coding sequence which encodes a fusion Protein May be identical to one of the nucleotide sequences provided in the appendices, or it may also be a different coding sequence which, as a result of the redundancy or degeneracy of the genetic code, encodes the provided fusion protein.

In some embodiments, variant fusion proteins displaying substantial differences in structure can be generated by making nucleotide substitutions that cause less than conservative changes in the encoded polypeptide. Examples of such nucleotide substitutions are those that cause changes in (a) the structure of the polypeptide backbone; (b) the charge or hydrophobicity of the polypeptide; or (c) the bulk of an amino acid side chain. Nucleotide substitutions generally expected to produce the greatest changes in protein properties are those that cause non-conservative changes in codons. Examples of codon changes that are likely to cause major changes in protein structure are those that cause substitution of (a) a hydrophilic residue, e.g., serine or threonine, for (or by) a hydrophobic residue, e.g., leucine, isoleucine, phenylalanine, valine or alanine; (b) a cysteine or proline for (or by) any other residue; (c) a residue having an electropositive side chain, e.g., lysine, arginine, or histidine, for (or by) an electronegative residue, e.g., glutamic acid or aspartic acid; or (d) a residue having a bulky side chain, e.g., phenylalanine, for (or by) one not having a side chain, e g., glycine.

Sequence Identity: As used herein, the term “sequence identity” means the percentage of identical subunits at corresponding positions in two sequences when the two sequences are aligned to maximize subunit matching, i.e., taking into account gaps and insertions. Sequence identity is present when a subunit position in both of the two sequences is occupied by the same nucleotide or amino acid, e.g., if a given position is occupied by an adenine in each of two DNA molecules, then the molecules are identical at that position. For example, if 7 positions in a sequence of 10 nucleotides in length are identical to the corresponding positions in a second 10-nucleotide sequence, then the two sequences have 70% sequence identity. Sequence identity of a polynucleotide is typically measured using sequence analysis software (e.g., the Sequence Analysis Software Package of the Genetics Computer Group, University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, Wis. 53705).

In preferred embodiments, variant fusion proteins displaying only non-substantial or negligible differences in structure can be generated by making nucleotide substitutions that cause only conservative amino acid changes in the encoded polypeptide. By doing this, fusion protein variants that comprise a sequence having at least 90% (90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, and 99%) sequence identity with the fusion protein sequences provided in the attached appendices, and retain at least one functional activity, e g., immunoglobulin binding activity. The invention also covers non-naturally occurring polynucleotides or variants that encode the fusion protein variants having at least 90% sequence identity over the entire length with the fusion protein sequences provided in the attached appendices, and retain at least one functional activity, e g., immunoglobulin binding activity. Methods of making targeted amino acid substitutions, deletions, truncations, and insertions are generally known in the art. For example, amino acid sequence variants can be prepared by mutations in the DNA. Methods for polynucleotide alterations are well known in the art, for example, Kunkel et al. (1987) Methods in Enzymol. 154:367-382; U.S. Pat. No. 4,873,192 and the references cited therein.

Therapeutically Effective Amount: As used herein, the term “therapeutically effective amount” refers to those amounts that, when administered to a particular subject in view of the nature and severity of that subject's disease or condition, will have a desired therapeutic effect, e.g., an amount which will cure, prevent, inhibit, or at least partially arrest or partially prevent a target disease or condition. By other words, this is an amount of an agent or composition that alone, or together with a pharmaceutically acceptable carrier or one or more additional agents, induces the desired response. Effective amounts of a therapeutic agent can be determined in many different ways, such as assaying for a reduction in symptoms or improvement of physiological condition of a subject. Effective amounts also can be determined through various in vitro, in vivo, or in situ assays.

In some embodiments, variants of fusion proteins having a reduced immunogenicity in humans may be generated by making amino acid substitutions in the fusion proteins that remove or modify human T-cell or B-cell epitopes present in said fusion protein. Fusion proteins that have less potential human T-cell or B-cell epitopes in the sequence are less prone to activate an unwanted immune response in a subject. The unwanted immune response includes development of anti-fusion protein antibodies that may neutralize said fusion protein. Several methods for identifying, modifying and removing potential human T-cell or B-cell epitopes in protein sequences are known and disclosed in, for example, Jawa V, Terry F, Gokemeijer J, et al. T-Cell Dependent Immunogenicity of Protein Therapeutics Pre-clinical Assessment and Mitigation-Updated Consensus and Review 2020. Front Immunol. 2020; 11:1301; Mazor R, Crown D, Addissie S, Jang Y, Kaplan G, Pastan I. Elimination of murine and human T-cell epitopes in recombinant immunotoxin eliminates neutralizing and anti-drug antibodies in vivo. Cell Mol Immunol. 2017; 14(5):432-442; U.S. Ser. No. 10/751,397 B2, US2018161419A1, the contents of which are incorporated herein by reference in its entirety.

Disclosed herein are methods for making and using fusion proteins that comprises amino acid sequences of Protein M or amino acid sequences that are at least 90% identical over the entire length with the sequences of Protein M. An example of such fusion protein is armY-ACE2, which consists of the Protein M sequence fused to the sequence of the ACE2 receptor, or to a fragment of the ACE2 receptor to which the envelope spike S protein of the SARS-CoV-2 virus is bound. Fusion protein armY-ACE2 can bind to immunoglobulin molecules of different classes, blocking their original specificity and instead directing them to interact with the envelope spike S protein of the SARS-CoV-2 virus (FIG. 1). As a result, the SARS-CoV-2 virus will be no longer capable of infecting human cells via its envelope spike S protein, and will be eliminated by macrophages that recognize immunoglobulin-bound targets, and by engaging via a complement factor activated by the bound immunoglobulins, or by other mechanisms. By utilizing knowledge of specific cellular receptors recognized by pathogens (virus or microorganism) and toxins for cellular entry, various armY-fusion proteins may be created and utilized according to the present invention. To make armY-fusion proteins, various fragments of the receptor may be used, including, without restriction, a full extracellular domain of the receptor or a fragment of the receptor which is necessary and sufficient for interaction with the pathogen or toxin.

Non-limiting examples of pathogens and toxins and their cellular attachment receptors suitable to make armY-fusion proteins are listed as follows: (a) armY-ACE2 (Angiotensin-converting enzyme 2) for the SARS-CoV and SARS-CoV-2, as well as human coronavirus NL63/HCoV-NL6; (b) armY-CD209 (DC-SIGN) for HIV-1, HIV-2, Ebolavirus, Cytomegalovirus, HCV, Dengue virus, Measles virus, Herpes simplex virus 1, Influenza virus, SARS-CoV, Japanese encephalitis virus, Lassa virus, Respiratory syncytial virus, Rift valley fever virus, West-nile virus, Marburg virus, Uukuniemi virus, and Yersinia Pestis; (c) armY-C-type lectin domain family 4 member M for Ebolavirus, Hepatitis C virus, HIV-1, Human coronavirus 229E, Human cytomegalovirus/HHV-5, Influenza virus, SARS-CoV, West-nile virus, Japanese encephalitis virus, Marburg virus glycoprotein, and M. bovis; (d) armY-CD4 for HIV; (e) armY-Synaptic vesicle glycoprotein 2A for the C. botulinum neurotoxin type A2 (BoNT/A, botA); (f) armY-Synaptic vesicle glycoprotein 2B for the C. botulinum neurotoxin type A2 (BoNT/A, botA). Probably also for the closely related C. botulinum neurotoxin type A1; (g) armY-Synaptic vesicle glycoprotein 2C for C. botulinum neurotoxin type A (BoNT/A, botA) and C. botulinum neurotoxin type A2; (h) armY-Synaptotagmin I for C. botulinum neurotoxin type B (BoNT/B, botB); (i) armY-Synaptotagmin II for C. botulinum neurotoxin type B (BoNT/B, botB); (j) armY-HLA class II histocompatibility antigen, DRB1 beta chain for Epstein-Barr virus and Staphylococcal enterotoxin A and B; (k) armY-HLA class II histocompatibility antigen, DR alpha chain for Epstein-Barr virus BZLF2/gp42, Staphylococcus aureus enterotoxin A/entA, enterotoxin B/entB, enterotoxin C1/entC1, enterotoxin D/entD, and enterotoxin H/entH; (1) armY-T cell receptor beta variable 7-9 for Staphylococcus aureus enterotoxin A/entA; (m) armY-T cell receptor beta variable 19 for Staphylococcus aureus enterotoxin B/entB; (n) armY-Hepatitis A virus cellular receptor 1 for Hepatitis A virus, Ebola virus, Marburg virus and Dengue virus and Clostridium perfringens Epsilon toxin (ETX); (o) armY-Myelin and lymphocyte protein for Clostridium perfringens Epsilon toxin (ETX); (p) armY-Complement factor H for Streptococcus pneumoniae, Neisseria meningitides, Staphylococcus aureus, Borrelia burgdorferi and West nile virus; (q) armY-Hepatocyte growth factor receptor for Listeria monocytogenes internalin InlB; (r) armY-Membrane cofactor protein (CD46) for Adenovirus subgroup B2 and Ad3, Measles virus, Herpesvirus 6/HHV-6, Neisseria and Streptococcus pyogenes; (s) armY-Glycophorin-A for Plasmodium falciparum, Influenza virus, Hepatitis A virus (HAV), Streptococcus gordonii; (t) armY-C-type lectin domain family 4 member K (Langerin, CD207) for Candida species, Saccharomyces species, Malassezia furfur, human immunodeficiency virus-1 (HIV-1) and Yesinia pestis; (u) armY-Anthrax toxin receptor 1 for Anthrax toxin; and (v) armY-Anthrax toxin receptor 2 for Anthrax toxin.

In some embodiments, codon-optimized polynucleotides are disclosed that contain a nucleic acid sequence at least 95%, at least 96%, at least 97%, at least 98% or at least 99% identical to SEQ ID NO: 40-61. These polynucleotides are codon-optimized for expression in human cells.

Taking as an example armY-ACE2 fusion protein and SARS-CoV-2 as a pathogen, several advantages of the armY-ACE2 approach can be shown over the other known potential virus inactivating strategies, such as (a) monoclonal antibody (mAb) therapy; (b) ACE2 or ACE2-Fc fusion proteins therapy; (c) Convalescent plasma antibody therapy, and (d) anti-viral vaccine.

As to (a), mAb therapy is subject to viral escape due to a mutation in a targeted viral epitope. Most viruses possess a high mutation rate; after a mutation in the mAb-recognizing area the mAb therapy is no longer effective, and mutated viruses will proliferate and eventually will be enriched. Instead, armY-ACE2 will bind the SARS-CoV-2 virus regardless of any mutation, because all SARS-CoV-2 viruses bind ACE2 for entry into human cells. Also, since armY can bind all antibody isotypes, armY-ACE2 can arm all antibody isotypes with the capacity to target SARS-CoV-2 viruses, hence mimicking a generalized antibody-mediated immune response.

As to (b), ACE2 monotherapy suffers from rapid renal clearance due to the small size of ACE2. ACE2-Fc fusion proteins is of a single isotype, usually, an IgG. It is known that other isotypes e.g., IgM, IgA are also efficacious in pathogen clearance. Thus, armY-ACE2 can arm all isotypes or a specific isotype with the capacity to target SARS-CoV-2. In addition, Fc fusion proteins do not activate the complement system. Instead, armY-ACE2 complex with antibody maintains Fc functionality, and is able to prime the antibody to bind the C1q complement factor, a required step for complement activation. Being able to harness the full effector potential of antibodies may be critical in the overall eradication of the targeted pathogen, such as SARS-CoV-2.

As to (c), convalescent plasma therapy requires blood from donors previously exposed to SARS-CoV-2, and no longer with COVID-19 symptoms. It might take as long as 7-10 days to test for lack of blood-borne pathogens, anti-SARS-CoV-2 titer levels and ABO blood type matching requirements. Instead, armY-ACE2 could arm the patient's own plasma antibodies, and can be available to the patient in less than 2-4 hours. Donor plasma can also be used, but these can be from regular donors that have already been screened, so this could be made available to the patient even faster as long as ABO blood type and Rh type match is achieved.

As to (d), SARS-CoV-2 vaccine is prophylactic in its use and the uninfected person will require time to develop a level of protective immunity. Vaccines cannot be a therapeutic for those with on-going COVID-19. Moreover, vaccine efficacy is subject to many variables including state of health of the individual and potential side-effects, e.g., anaphylactic reaction that might hinder completion of immunization protocol. armY-ACE2 is applicable to subjects with on-going COVID-19.

The abovementioned advantages apply to other fusion proteins that are disclosed herein.

Treatment with Protein M fusion proteins changes the specificity of antibodies in plasma to a new target (e.g, a virus, bacterium or a toxin) for immune recognition and elimination; provide more optimal pharmacokinetics and activity of a compound attached to a larger more stable antibody, and improvement of bioavailability of compounds; deliver therapeutic or diagnostic compounds to an antibody-binding target (e.g., antibody binding bacteria, tissue or cell); disrupt interaction between two or more entities required for pathogenicity.

Possible routes of administration for Protein M fusion proteins include parenteral, oral and/or inhalation. In a preferred embodiment, ex-vivo plasma/serum (patient-derived or from a compatible donor) is mixed with Protein M-fusion protein and administered to patient. Preferably, Protein M fusion proteins are administered in the form of a pharmaceutical composition, comprising additional pharmaceutically acceptable excipients.

In some embodiments, Protein M fusion proteins are stored or administered in a suitable formulation that provides stability to the fusion proteins. Such formulation includes one or several pharmaceutically acceptable excipients. By “pharmaceutically acceptable” it is meant the excipient is compatible with the other ingredients of the formulation and not deleterious to the recipient thereof. Excipients for protein formulations may be picked up by methods known in the art, and may include buffers, stabilizers, antioxidants, salts, polysorbates, amino acids, among others.

Other potential uses of Protein M fusion proteins include detecting presence of antibodies and/or antibody-binding factors found in blood, tissue or cells. For example, fusing a reporter enzyme (e.g., Horseradish peroxidase, HRP) or attaching a detectable probe or label (e.g., biotin-avidin, biotin-streptavidin) to Protein M can be used to detect antibodies that are present but not bound to their cognate antigen as observed in immunoassays that exhibit “false-positive” activity and thus serve as a false-positive detection tool.

In some embodiments, Protein M can be conjugated with the following detectable probes: HRP (chromogenic), alkaline phosphatase (chromogenic), biotin (for example, via Avi-Tag peptide), myc epitope antigen, Luciferase (bioluminescence), avidin (attachment of biotin conjugates), streptavidin (attachment of biotin conjugates), streptavidin-binding peptide, phycoerythrin (fluorescence), GFP (fluorescence), a radioactive label. Protein M-radiolabel peptide can be produced by fusing Protein M to the short peptide KGRPLVY (SEQ ID NO:62). As disclosed in Mebrahtu et al. 2013, the KGRPLVY peptide contains a metal chelate attachment [K-lysine for labeling Protein M with Cu-64 and DOTA] and radio-halogen attachment (Y-tyrosine for labeling Protein M with I-125, I-123 or I-131).

In some embodiments, Protein M-detectable probe fusions can be used in ELISA, western blotting, lateral flow assays, multiplex bead array assays, pull down assays, SPR (biacore, octet) assays, flow cytometry assays, for purification or for delivery of a cargo.

Protein M fusion proteins can also be used to: 1) neutralize antibodies by occupying their antigen binding site (Useful in decreasing non-specific signals in immunoassays, useful in in-vitro cell assays as well as in in-vivo settings to determine the role of antibodies or a specific antibody by essentially blocking its binding activity); 2) eliminate antibodies by increasing clearance from circulation or tissue by directing antibodies to immune cells or delivering degrading enzymes or compounds to antibodies; 3) deplete antibodies in solution by promoting clearance of unengaged antibodies, which are not bound to antigen. Protein M can be attached to a resin (e.g., agarose beads), added to a solution to pull down/remove or harvest such antibodies for use in process, for analysis or for elimination.

Protein M fusion proteins can also be used to protect antibodies from degradation by enzymes, microbes and cellular mechanisms; protect antibodies from bacterial escape mechanisms (e.g., protein A of S. aureus binds to antibodies and avoid antibody detection and clearance); deliver cargo to an antibody.

In some embodiments, the receptor fragment is a protein fragment of a cellular receptor, which is a target used by a pathogen for cell entry. In some embodiments, the pathogen is a virus, a bacterium or a fungus that can cause illnesses. In one embodiment, an antigen is a cell surface molecule of a pathogen, or antigenic parts or fragments thereof.

A fusion protein can be made by creating a nucleic acid molecule encoding the fusion protein and expressing the fusion protein from such nucleic acid in a recombinant expression system. The nucleic acid molecule encoding the fusion can be generated by linking a nucleic acid sequence encoding Protein M in frame with a nucleic acid sequence encoding a receptor fragment of a pathogen or a ligand of a toxin. Methods for constructing a fusion protein are known in the art (see Sambrook J. et al., Molecular Cloning, Cold Spring Harbor Press, New York (2001)).

In some embodiments, Protein M is fused to the N-terminus of the receptor fragment of a pathogen or the ligand of a toxin. In this orientation, an N-terminal tag can be attached for detection and purification of the fusion protein. In addition, the leader sequence (secretory signal peptide) can be attached for facilitating the secretion of the fusion protein. Alternatively, other appropriate leader sequences, suitable for guiding the fusion protein to the ER and the secretory pathway in the host cell, can be used. In other embodiments, Protein M is fused to the C-terminus of the receptor fragment of a pathogen or the ligand of a toxin.

In still another embodiment, a spacer can be incorporated between the Protein M sequence and the receptor fragment of a pathogen or the ligand of a toxin. In preferred embodiments, spacer is a short peptide sequence that joins both polypeptides, yet preserves some distance between the polypeptides such that both polypeptides can properly fold independently. Generally, the spacer consists of between 2 or 3 amino acids to 50 amino acids, typically between 3 to 25, or 3 to 20, or 3 to 15 amino acids. In a specific embodiment, the space consists of 3-10 amino acids. Although there is no specific restriction on the selection of amino acids for the spacer region, the amino acids can be selected to accommodate the folding, net charge, hydrophobicity or other properties of the fusion protein. Typical amino acids for use in a spacer region include Gly, Ala, Ser, Thr and Asp.

One of skill would recognize that modifications can be made to a fusion protein without diminishing their biological activities. Some modifications may be made to facilitate the cloning, expression, or incorporation of the constituent molecules into a fusion protein. For example, amino acids can be placed on either terminus to create conveniently located restriction sites or termination codons; and a methionine can be added at the amino terminus to provide an initiation site.

Recombinant Expression of the Fusion Proteins.

For recombinant expression of a fusion protein, a nucleic acid molecule encoding the fusion protein is generally placed in an expression vector in an operable linkage to a promoter (such as the T7, trp, or lambda promoters for expression in bacteria, or a CMV promoter for expression in mammalian cells) and a 3′ transcription termination sequence, and optionally additional suitable transcriptional and/or translational regulatory elements such as a transcription enhancer sequence and a sequence encoding suitable mRNA ribosomal binding sites. Additional sequences that can be included in the expression vector include an origin of replication, and a selection marker gene to facilitate identification of transformants such as genes conferring resistance to antibiotics (e.g., the amp, kana, gpt, neo, and hyg genes).

Host cells suitable for use in the recombinant expression of the fusion protein include bacterial cells such as E. coli, and eukaryotic cells including but not limited to yeast, insect cells (e.g. SF9 cells), and mammalian cells such COS, CHO, HeLa cells and HEK293.

The expression vectors can be introduced into a host cell by well-known methods such as calcium chloride transformation for bacterial cells, and calcium phosphate treatment or electroporation for mammalian cells. Cells transformed by the expression vectors can be selected based on the phenotype provided by the selectable marker gene.

Once expressed, the recombinant fusion proteins can be purified according to standard methods available in the art, such as ammonium sulfate precipitation, affinity columns, chromatography, gel electrophoresis, among others. In one embodiment, the fusion protein is purified based on affinity chromatography using antibodies that bound to Protein M. In another embodiment, a purification tag is inserted at the N-terminus or the C-terminus of the fusion protein and is used for purification. The examples of such tags are: 6 His-tag, myc-tag, strep-tag and others.

In some embodiments, uses for Protein M fusion proteins include the following.

The present teachings include a pharmaceutical composition comprising: a Protein M fusion protein having an antibody-binding domain and an ACE2 cellular receptor (referred to as armY-ACE2), serum or plasma from the subject or from a compatible donor, and a pharmaceutically acceptable carrier or a pharmaceutically acceptable excipient, wherein the fusion protein acts to eradicate SARS-CoV and SARS-COV-2 coronaviruses in patients infected with the virus, wherein the fusion protein arms immunoglobulins to recognize and bind with high affinity to the S1 spike protein expressed by the SARS-CoV and SARS-COV-2 coronaviruses. In some embodiments, the Protein M fusion protein optionally comprises a linker, the antibody-binding domain comprises the Protein M protein from Mycoplasma sp., the antibody-binding domain comprises Protein M that binds with high affinity to the antibody Fab domain and blocks the antibody's antigen binding site; the antibody-binding domain comprises Protein M that does not bind to the antibody whose Fab domain binding site is engaged with its cognate antigen; the antigen-binding domain comprises a cellular receptor, ACE2, that binds with high affinity to the S1 spike protein expressed by the SARS-CoV and SARS-COV-2 coronaviruses; the antibody-binding domain comprises Protein M that binds with high affinity to the antibody Fab domain and blocks the antibody's cognate antigen binding site. In some preferred embodiments, immunoglobulins bound with the disclosed fusion proteins retain at least partially Fc-linked functional activities (effector functions), such as Fc-receptor binding and complement activation.

In some embodiments, Protein M fusion proteins comprise a linker between Protein M and receptor fragment. Non-limiting examples of such linkers include

(SEQ ID NO: 12)

GGGGSGGGGSGGGGS,

(SEQ ID NO: 13)

GGGGSGGGGS

or

(SEQ ID NO: 14)

GGGGS.

The present teachings also include a pharmaceutical composition comprising: a Protein M fusion protein having an antibody-binding domain and a fusion domain comprising a protein, peptide or chemical group able to bind a pathogen, a toxin, any biologic entity or a chemical group, serum or plasma from the subject or from a compatible donor, and a pharmaceutically acceptable carrier or a pharmaceutically acceptable excipient. In some embodiments, the antigen to be bound by the fusion protein comprises an antigen arising from a pathogen, a toxin, a subject, arising from a disease state within the subject, or arising from a disease related organism within the subject and the disease state within the subject is caused by a virus, bacteria, tumor, abnormal cell or by exposure to an external disease-causing agent, wherein the antigen-binding domain comprises one or more protein or peptide or chemical group (collectively referred to as molecules) chosen from the group consisting of: a soluble molecule, a soluble molecule bound to a matrix, an insoluble molecule bound to a matrix, an insoluble aggregate of molecules, a molecule comprising one or more epitopes, a nonviable cell-associated molecule, a nonviable organism-associated molecule, or a molecule conjugated with a liposome.

The present teachings also include a Protein M fusion protein having an antibody-binding domain and a fusion partner domain comprising a protein, peptide or chemical group, wherein the antibody-binding domain comprises Protein M that does not bind to the antibody whose Fab domain binding site is engaged with its cognate antigen. In some embodiments, the fusion partner domain may be an endogenous protein or peptide; the fusion partner domain may be an exogenous protein or peptide; the fusion partner domain may be an enzyme, wherein the enzyme is a reporter enzyme horseradish peroxidase fusion protein (HRP). Protein M-HRP may be used to detect immunoglobulins in solution or in a matrix, wherein the immunoglobulins detected are not engaged with their cognate antigen. Thus, Protein M-HRP may be used to identify or rule out false positive test results in antibody-based detection of antigen. The fusion partner domain may permit for a chemical modification, wherein the chemical modification is, for example, an addition of biotin by an enzymatic conjugation of a single biotin on a unique 15 amino acid peptide tag using the biotin ligase (BirA).

In some embodiments, Protein M fusion proteins arm free non-antigen bound immunoglobulin to bind a pathogen or toxin (both referred heretofore as “target”) with a high affinity. This is made possible through (a) Protein M component of the fusion protein that engages the immunoglobulin rendering it no longer able to bind its cognate antigen, and (b) the fused receptor or ligand, which is the same attachment receptor or ligand found on cells that the target uses to attach and gain entry. Protein M fusion protein-armed immunoglobulins (referred heretofore as “armY-fusion”) binding to their target is the initial step in the mechanism of target eradication. Once bound to target, armY-fusion will block the interaction between the target and the attachment receptor found on host cells, thereby, neutralizing the target and prevent it from infecting the cell. Whereas Protein M fusions serve to associate immunoglobulins with the target and neutralize the target, the immunoglobulin serves to mark the target for destruction and clearance by the innate immune system including cells that bear Fc receptors (e.g., macrophages) and complement factors.

Complement is part of the innate surveillance system involved in the first line of defense against pathogens. One mechanism to direct complement to a specific pathogen is via the classical complement pathway, which is initiated by antibodies that are bound to antigen. C1q recruitment to antibodies is an essential first step in the activation of the complement cascade. Antibody binding to antigen (found on the pathogen or in solution as an immune complex) induces a change in the antibody's three-dimensional structure that exposes a C1q binding site found within the CH2 portion of the antibody Fc region. Upon C1q binding and activation, additional complement factors are recruited resulting in the formation of other effector molecules such as C3b, the main effector of the complement system. These events culminate in the formation of the membrane attack complex (MAC) that forms holes or pores on the surface of pathogens including bacteria, viruses and cancer cells resulting in subsequent clearance. C3b also serves as a potent opsin able to tag pathogens, immune complexes (antigen-antibody), and apoptotic cells for phagocytosis by immune cells that express C3b receptors. Together, MAC and C3b serve to effectively eradicate pathogens targeted by antibodies that recruit C1q. C1q is composed of 18 polypeptide chains: six A-chains, six B-chains, and six C-chains. Each chain contains a collagen-like region located near the N terminus and a C-terminal globular region.

In some embodiments, Protein M fusion protein in complex with an antibody can engage C1q and activate classical complement pathway that would contribute to eradication of the pathogen or a cancer cell, to which the Protein M fusion protein is targeted. Normally, for C1 q to bind the antibody, the antibody must first bind its antigen (immobilized on a cell or pathogen or in solution as an immune complex), and then the antibody undergoes a conformational change that permits C1q binding. However, Protein M fusion protein-IgG complex can specifically recruit C1q as demonstrated, for example, in Example 10, FIG. 11 below. Thus, Protein M-fusions can be considered a tool to specifically induce a conformational change in antibodies (while in solution) resulting in its ability to engage C1q and activate the complement pathway.

EXAMPLES

Aspects of the present teachings may be further understood in light of the following examples, which should not be construed as limiting the scope of the present teachings in any way. Below, exemplary methods to develop and characterize Avi-/myc-tagged Protein M, myc-tagged Protein M-HRP fusion protein and Protein M-ACE2 fusion protein (aka, armY-ACE2) are disclosed. These and similar methods can be applied to generate and use different Protein M fusion proteins.

Example 1. Gene Construction of Protein M

Protein M (also referred to as armY) (SEQ ID NO:10 and 38) was constructed using the mature amino acid sequence of Protein M (37-556 amino acid) containing a myc-tag (EQKLISEEDLLRKR) and linker sequence (AANGGGGSGGGGS) and a mono-biotinylation sequence “Avi-Tag” (MAGGLNDIFEAQKIEWHEGG) at its N-terminal end. The linear amino acid sequence was reverse translated to its corresponding DNA sequence using the free GenSmart™ Codon Optimization Tool by GenScript for expression in human cells (gensmart-free-gene-codon-optimization). This sequence was submitted for gene synthesis and inserted into the plasmid cloning vector pUC57 (GenScript, Inc.). The insert was amplified and cloned into a previously constructed mammalian cell expression vector pcDNA3(−) containing a myc-tag-Protein M sequence by replacing the myc-tag-Protein M sequence with the above myc-tag sequence that included a mono-biotinylation sequence, producing a final Protein M construct (IL-2 leader sequence—biotinylation tag—myc tag—linker—Protein M). The plasmid expression vector construct was verified by restriction enzyme analysis, amplified in E. coli and purified using a maxiprep kit (GenScript Inc. and Eton Bioscience, Inc.).

Example 2A. Characterization of Protein M as an Antibody Neutralizer and Blocking Reagent Tool. Binding of Protein M to Immobilized Antibody (FIG. 3)

Protein M binding to plate bound antibody was demonstrated by measuring the amount of myc-tagged Protein M bound to the antibody coated on a 96-well plate by an ELISA-based method.

Briefly, 5 ug/ml of human IgG (Sigma) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. After washing twice with PBS+Tween 20 (wash buffer, Pierce), Protein M in expression medium diluted in assay buffer (0.5% BSA in PBS+Tween 20) or assay buffer was added to antibody-coated wells in duplicate. After approximately 30 minutes at room temperature, the wells were washed and mouse IgG1 anti-myc antibody (clone: 9E10) in assay buffer was added to detect the myc-tagged Protein M. After approximately 30 minutes, the wells were washed 3×s and anti-mouse IgG labeled with HRP was added to the wells. After approximately 30 minutes, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 2B. Characterization of Protein M as an Antibody Neutralizer and Blocking Reagent Tool. Protein M Neutralizes/Blocks Antibody Binding to Cognate Antigen (FIG. 4)

The ability of Protein M to block binding to its cognate antigen was demonstrated by measuring the amount of unblocked, free antibody bound to its antigen coated on a 96-well plate by an ELISA-based method.

Briefly, 5 ug/ml of human IgG (Sigma) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. RP-labeled goat anti-human IgG antibody (GenScript, Inc.) or a RP-labeled chicken anti-human IgG antibody (Aves Labs, Inc.) was added to Protein M in expression medium or to expression medium alone and allowed to form complexes at room temperature for approximately 2 hours. After washing twice with PBS+Tween 20 (wash buffer, Pierce), samples were added to human IgG coated wells in duplicate. After approximately 45 minutes at room temperature, the wells were washed 3×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 3. PROTEIN M-HRP Fusion Protein (SEQ ID NO:11 and 39). Gene Construction of Protein M-HRP

The mature amino acid sequence of horseradish peroxidase HRP (31-338 amino acid) was generated containing a myc-tag (EQKLISEEDL) and linker (AAN) sequence at its N-terminal end. The amino acid sequence encoding 3 sets of 4 glycine residues and 1 serine residue (e.g., GGGGS)3 linker followed by the mature amino acid sequence of Protein M (37-556 amino acid) was added to its C-terminal end producing a final Protein M-HRP construct containing (IL-2 leader sequence—myc tag—HRP—linker—Protein M). The linear amino acid sequence was reverse translated to its corresponding DNA sequence using the free GenSmart™ Codon Optimization Tool by GenScript for expression in human cells (gensmart-free-gene-codon-optimization). This sequence was submitted for gene synthesis and inserted into the plasmid cloning vector pUC57 (GenScript USA Inc.). The insert was amplified and cloned into a mammalian cell expression vector, pcDNA3(−). The plasmid expression vector construct was verified by restriction enzyme analysis, amplified in E. coli and purified using a maxiprep kit (GenScript Inc. and Eton Bioscience, Inc.).

Example 4. Characterization of Protein M-HRP as a Novel Antibody Detection Reagent Tool. Detection of Immobilized F(Ab′)₂, Antibody or Antibody in Solution by Protein M-HRP Fusion Protein

Protein M-HRP direct detection of plate bound F(ab′)₂, antibody or indirect detection of antibody in solution was demonstrated by measuring the amount of Protein M-HRP bound to antibody coated on a 96-well plate by an ELISA-based method (FIG. 5). Direct detection of antibody by Protein M-HRP fusion protein: Briefly, 2 ug/ml of goat F(ab′)₂or 5 ug/ml of human IgG (Sigma) or 2 ug/ml of mouse IgG1 isotype anti-myc or anti-human CD28 antibodies was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. After washing twice with PBS+Tween 20 (wash buffer, Pierce), Protein M-HRP in expression medium or expression medium diluted in assay buffer (0.5% BSA in PBS+Tween 20) was added to F(ab′)₂or antibody-coated wells in duplicate. After approximately 30 minutes at room temperature, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software). Indirect detection of antibody in solution by Protein M-HRP fusion protein: Briefly, Protein M-HRP was incubated with varying amounts of mouse IgG1 antibody in assay buffer and allowed to form complexes at room temperature for approximately 30 minutes. Protein M-HRP alone was also included as a positive control. After washing twice with PBS+Tween 20 (wash buffer, Pierce), samples were added to human IgG-coated wells in duplicate. After approximately 30 minutes at room temperature, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 5. Absence of Detection of Antibody Bound to an Immobilized Antigen by Protein M-HRP Fusion Protein

Protein M does not bind to antibodies already bound to antigen. The absence of detection of antibody bound to an immobilized antigen by Protein M-HRP fusion protein was demonstrated by measuring the amount of Protein M-HRP bound to the antibody engaged with its antigen on a 96-well plate by an ELISA-based method (FIG. 6).

Briefly, 1 ug/ml of a myc-tagged protein or 2 ug/ml of human ASIP (agouti-signaling protein, RnD Systems) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. After washing twice with PBS+Tween 20 (wash buffer, Pierce), mouse IgG1 anti-myc antibody (clone: 9E10) in assay buffer, rabbit anti-ASIP antibody (Thermofisher) or assay buffer alone was added to the myc-tagged protein or ASIP, coated wells, respectively, in duplicates. After approximately 60 minutes, the wells were washed 3×s and Protein M-HRP was added. To show that mouse anti-myc and rabbit anti-ASIP bound to myc-tagged protein or ASIP coated wells, anti-mouse IgG labeled with HRP or biotinylated anti-rabbit IgG+SA-RP was added to another set of coated wells, respectively. After approximately 30 minutes, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 6. PROTEIN M-ACE2 Fusion Protein (Referred to as armY-ACE2) (SEQ ID NO:9 and 37). Gene Construction of armY-ACE2 (Protein M Fused to Angiotensin I-Converting Enzyme 2 (ACE2))

The mature amino acid sequence of human ACE2 (18-740 amino acid) was generated containing a myc-tag (EQKLISEEDLLRKR) and linker (GSPGGA) sequence at its N-terminal end. The linear amino acid sequence was reverse translated to its corresponding DNA sequence using the free GenSmart™ Codon Optimization Tool by GenScript for expression in human cells (gensmart-free-gene-codon-optimization). This sequence was submitted for gene synthesis and inserted into the plasmid cloning vector pUC57. The insert was amplified and cloned into the mammalian cell expression vector pcDNA3(−) containing the myc-tag-Protein M-HRP sequence (see above) by replacing the myc-tag-HRP sequence with the above myc-tag-ACE2 sequence, upstream of the sequence encoding 3 sets of 4 glycine residues and 1 serine residue (e.g., GGGGS)3 linker followed by the mature amino acid sequence of Protein M (37-556 amino acid), producing a final armY-ACE2 construct containing (IL-2 leader sequence—myc tag—ACE2—linker—Protein M). The plasmid expression vector construct was verified by restriction enzyme analysis, amplified in E. coli and purified using a maxiprep kit (GenScript Inc. and Eton Bioscience, Inc.).

Example 7A. Protein M, Protein M-HRP and armY-ACE2 Gene Expression

The human 293T kidney cell line was transfected with the expression vector encoding the Protein M, Protein M-HRP or armY-ACE2 (Protein M-ACE2) fusion protein sequences, by calcium phosphate transfection method. After 7-16 hours, the transfection solution was replaced with protein expression medium and the supernatant harvested after approximately 48 hours. To purify the proteins, the supernatant was harvested and pass through an anti-myc antibody-coupled agarose resin and the captured proteins eluted using 0.1M Glycine pH 2.5 and neutralized by 1M Tris-HCl pH 8.0. The eluted proteins were dialyzed against a phosphate buffered saline solution and stored in 4° C.

Example 7B. Characterization of armY-ACE2 as a Novel Therapeutic in the Treatment of Coronavirus Infection. Targeting of [armY-ACE2+Antibody] to SARS-CoV-2 Spike Protein and Binding of [armY-ACE2+SARS-CoV-2 Spike Protein] Complex to Antibody

Complex [armY-ACE2+antibody] targeting of SARS-CoV-2 spike protein was demonstrated by measuring the amount of [armY-ACE2+antibody] complexes bound to the SARS-CoV-2 spike protein coated on a 96-well plate by an ELISA-based method. Binding of [armY-ACE2+ to SARS-CoV-2 spike protein] complexes to immobilized antibody was demonstrated by measuring the amount of [armY-ACE2+SARS-CoV-2 spike protein] complexes bound to the antibody coated on a 96-well plate by an ELISA-based method (FIG. 7).

Briefly, 50 ul of 1 ug/ml histidine (his)-tagged SARS-CoV-2 spike protein (GenScript, Inc.) or 5 ug/ml of human IgG (Sigma) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (phosphate buffered saline pH 7.4) (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. [armY-ACE2+antibody] complexes were allowed to form at room temperature by adding 0.25 ug/ml biotinylated goat IgG (Jackson ImmunoResearch Inc.) to armY-ACE2 in expression medium for 60 minutes. Biotinylated antibody was also added to Protein M (lacking ACE2 domain) or expression medium as negative controls. armY-ACE2 alone in expression medium was prepared as an additional negative control. The samples were diluted in assay buffer (0.5% BSA in PBS+Tween 20) and added to SARS-CoV-2 spike protein coated wells, washed twice with PBS+Tween 20 (wash buffer, Pierce), in duplicate. After approximately 30 minutes at room temperature, the wells were washed 3×s and streptavidin-horseradish peroxidase (SA-HRP) (Biolegend, Inc.) in assay buffer was added to the wells and allowed to incubate at room temperature for approximately 20 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software). Binding of [armY-ACE2+ to SARS-CoV-2 spike protein] complexes to immobilized human IgG: armY-ACE2+SARS-CoV-2 spike protein complexes were allowed to form at room temperature by adding 2 ug/ml of SARS-CoV-2 spike protein to armY-ACE2 in expression medium. armY-ACE2 alone in expression medium was prepared as a negative control. After approximately 60 minutes at room temperature, the wells were washed and mouse IgG1 anti-histidine tag (GenScript, Inc.) or mouse IgG1 anti-myc antibody (clone: 9E10) in assay buffer was added to detect the histidine-tagged SARS-CoV-2 spike protein or myc-tagged armY-ACE2 bound to human IgG coated on the well, respectively. After approximately 30 minutes, the wells were washed 3×s and anti-mouse IgG labeled with HRP was added to the wells. After approximately 30 minutes, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 8. ArmY-ACE2 Binding to Immobilized Antibody or Antibody in Solution

armY-ACE2 binding to immobilized antibody or antibody in solution was demonstrated by measuring the amount of free or antibody-bound armY-ACE2 in an ELISA based method (FIG. 8).

Briefly, 5 ug/ml of human IgG (Sigma) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. Binding of antibody in solution by armY-ACE2: Briefly, armY-ACE2 was incubated with purified human IgG, 2% human serum (containing antibodies) or PBS in assay buffer and allowed to form complexes at room temperature for approximately 2 hours. After washing twice with PBS+Tween 20 (wash buffer, Pierce), samples were added to human IgG-coated wells in duplicate. After approximately 60 minutes at room temperature, the wells were washed and mouse IgG1 anti-myc antibody (clone: 9E10) in assay buffer was added to detect the myc-tagged armY-ACE2 bound to human IgG coated on the well. After approximately 30 minutes, the wells were washed 3×s and anti-mouse IgG labeled with HRP was added to the wells. After approximately 30 minutes, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 9. [armY-ACE2+Antibody] Complexes Engage Fc Receptors on K562 Erythroleukemic Cell Line

Binding of antibodies to Fc-receptor expressed on cells (e.g., innate immune cells, antigen presenting cells) requires interaction with the antibody Fc region. [armY-ACE2+antibody] complex engagement of Fc receptors was demonstrated by measuring the amount of [armY-ACE2+antibody] complexes bound to the human FcγRII (CD32) expressed on K562, a human erythroleukemic cell line, by flow cytometry (FIG. 9).

Briefly, K562 cells were taken from cell culture medium and centrifuged (3000 rpm for 3 minutes) and supernatant removed by vacuum aspiration. After a wash with chilled FACS buffer (0.5% BSA in PBS+0.1% sodium azide), 100,000 cells was transferred to 1.5 ml microcentrifuge tubes in FACS buffer and the supernatant removed after centrifugation and the cells kept on ice. 5 ug/ml of human IgG (Sigma Aldrich) was added to armY-ACE2 in expression medium and kept at room temperature for approximately 30 minutes to form complexes, and tubes transferred to ice to chill. armY-ACE2 alone in expression medium was also prepared as a negative control. 100 ul of [army-ACE2+antibody] complexes or armY-ACE2 alone was added to K562 cells and allowed to incubate on ice for approximately 30 minutes. After two washes in FACS buffer, anti-myc (clone 9E10 mouse antibody) was added to detect the myc-tagged army-ACE2 and allowed to incubate for approximately 20 minutes. After two washes, anti-mouse IgG-Alexafluor-488 (Biolegend, Inc.) was added to detect anti-myc antibody and allowed to incubate for approximately 20 minutes. After two washes, cells were resuspended in FACS buffer and analyzed by flow cytometry (BD FACS Calibur and CellQuest Pro analysis software). At least 5,000 events were acquired per sample. Cells incubated with negative controls as described above served as source of background basal percent value. The percentage of cells staining positive for [army-ACE2+antibody] complexes was determined by the percentage of cells present within a gate established such that <6% of the positive events of cells incubated with negative control samples measured represented background fluorescence.

To demonstrate that binding of [army-ACE2+antibody] complexes to K562 was through a specific interaction with Fc-receptors expressed on the cells, K562 cells were pre-incubated with FcγRII blocking anti-CD32 (clone IV.3, mouse IgG2b, kappa) (FIG. 10).

Briefly, 1 ug of anti-CD32 or an isotype-matched mouse IgG2b, kappa control antibody was added to 100 ul of FACS buffer and added to approximately 100,000 K562 cells and placed on ice for approximately 15 minutes. After 2 washes, 100 ul of [army-ACE2+antibody] complexes prepared as described above was added to the cells and kept on ice for approximately 20 minutes. After two washes, fluorescein (FITC)-labeled anti-myc (Biotium, Inc.) was added to K562 cells and allowed to incubate on ice for approximately 15 minutes. After two washes, cells were resuspended in FACS buffer and analyzed by flow cytometry (BD FACS Calibur and CellQuest Pro analysis software). At least 5,000 events were acquired per sample. Cells incubated with negative control served as source of background basal percent value. The percentage of cells staining positive for [army-ACE2+antibody] complexes was determined by the percentage of cells present within a gate established such that <2% of the positive events of cells incubated with negative control samples measured represented background fluorescence.

Example 10. [armY-ACE2+Antibody] Complex Binding to Purified Human C1q Complement Component

The binding of the C1q complement component to antibody is the initial step towards the activation of the classical complement pathway. The [armY-ACE2+antibody] complex binding to C1q complement component was demonstrated by measuring the amount of [armY-ACE2+antibody] complexes bound to the purified C1q coated on a 96-well plate by an ELISA-based method (FIG. 11).

Briefly, 50 ul of 5 ug/ml purified human C1q (>95% pure by SDS-PAGE analysis, Complement Technology, Inc.) was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed twice with PBS (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. armY-ACE2+antibody complexes were allowed to form at room temperature by adding 5 ug/ml FITC-labeled mouse IgM (Biolegend, Inc. Cat #401607) or 10 ug/ml FITC-labeled mouse IgG1 (Biolegend, Inc. Cat #200305) to armY-ACE2 or Protein M (lacking ACE2 domain) in expression medium. FITC-labeled antibody or armY-ACE2 added to expression medium served as negative controls. To block binding of [armY-ACE2+antibody] complex to immobilized C1q coated on the well, 10 ug/ml of soluble C1q was added to the [armY-ACE2+antibody] complexes and allowed to incubate at room temperature for 30 minutes. C1q-coated wells were washed twice with PBS+Tween 20 (wash buffer, Pierce), and the samples were added in duplicate. After approximately 30 minutes at room temperature, the wells were washed 3×s and biotinylated anti-FITC (Biolegend, Inc.) in assay buffer was added to the wells and allowed to incubate at room temperature for approximately 45 minutes. The wells were washed 3×s and SA-HRP (Biolegend, Inc.) in assay buffer was added to the wells and allowed to incubate at room temperature for approximately 25 minutes. After 4 washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 11. armY-ACE2 or [armY-ACE2+Antibody] Complexes Exhibit ACE2 Activity

ACE2 activity in armY-ACE2 or [armY-ACE2+antibody] complexes was demonstrated by measuring the fluorescence emitted after cleavage of the ACE2 fluorogenic substrate MCA-APK-(Dnp). ACE2-dependent removal of the quenching Dnp group induces fluorescence, which is measured by a fluorescence plate reader (FIG. 12).

Briefly, armY-ACE2 or [armY-ACE2+antibody] complexes were diluted in ACE2 buffer (1 mol/L NaCl, 75 mmol/L Tris HCl, pH 7.5, and 50 μmol/L ZnCl2) and 30 μl of diluted samples were combined with 170 μl the ACE2 fluorogenic substrate MCA-APK(Dnp) (AnaSpec, Inc. Cat #AS-60757) in ACE2 buffer. The final concentration of ACE2 substrate was 20 μM in a final volume of 200 μl. The samples were kept in the dark for 16 hours at room temperature. 100 μl of samples were transferred to a flat bottom NUNC Black 96 Microwell strip plate and fluorescence measured using a fluorescence plate reader (Cytofluor 4000, Gain 75, Ex 360/40, Em 460/40).

Example 12. SDS-PAGE Analysis of Purified armY-ACE2

SDS-PAGE analysis of purified armY-ACE2 was performed under non-reducing and reducing conditions and showed the expected band of ˜180 kDa (theoretical molecular weight: 150 kDa) (FIG. 13).

Briefly, 8 ul of sample buffer (Invitrogen) was added to 24 ul of eluted fractions and mixed. The sample were heated in 80° C. water bath for 10 minutes. Reducing agent (10×, Invitrogen) was added to some of the tubes containing the samples and mixed. Non-reduced and reduced samples were loaded onto a 4-12% NuPAGE pre-cast SDS-PAGE gel and separated at 175V for 30 minutes in MES-SDS running buffer (Invitrogen). PageRuler unstained protein ladder (10-200 kDa, Invitrogen) was also included. After electrophoresis, the gel was rinsed in distilled water and the protein bands stained using SimplyBlue Safe Stain (Invitrogen) and the gel photographed.

Example 13. Evaluating Neutralization of SARS-CoV-2 by armY-ACE2 In-Vitro

Live SARS-CoV-2 virus has to be handled under biosafety level 3 conditions due to its high pathogenicity and infectivity and the lack of effective vaccines and therapeutics. Recently, a VSV pseudovirus production system, a pseudovirus-based neutralization assay has been developed for evaluating neutralizing antibodies against SARS-CoV-2 in biosafety level 2 facilities (Nie et.al., 2020). Pseudoviruses are useful tools because of their safety and versatility, especially for emerging and re-emerging viruses. This example utilizes a validated Pseudovirus neutralization protocol slightly modified from Nie et.al., to test the efficacy of armY-ACE2 by measuring the ability of armY-ACE2 to inhibit SARS-CoV-2 pseudovirus binding and infection of ACE2 expressing cells.

Briefly, the vesicular stomatitis virus (VSV) pseudovirus system (G*AG-VSV) is used, which packages expression cassettes for firefly luciferase instead of VSV-G in the VSV genome. The SARS-CoV-2 pseudovirus is produced by transfecting human 293T cells with the expression plasmid pcDNA3.1 containing the codon-optimized SARS-CoV-2 spike protein sequence, followed by infection with G*AG-VSV pseudovirus. Post infection, SARS-CoV-2 pseudoviruses is harvested and stored until use.

Huh7 human hepatocellular cell line naturally express the human ACE2 receptor protein and is an ideal cell line for SARS-CoV-2 pseudovirus infection as it demonstrates high luciferase activity upon infection. Viral inocula of approximately 650 TCID50 (the 50% tissue culture infectious dose of SARS-CoV-2 pseudovirus) is used for the assay.

Neutralization of SARS-CoV-2 pseudovirus infection of Huh7 is confirmed by the reduction in luciferase gene expression upon infection. Neutralization condition: SARS-CoV-2 pseudovirus is incubated with serial dilutions of armY-ACE2+human plasma containing immunoglobulins (six, 1:3 dilutions, or half-log dilutions) in duplicate. Human plasma added Protein M or human plasma alone are included as negative controls. Recombinant ACE2-Ig fusion protein (commercially available from GenScript Inc., catalog #Z03484) has been demonstrated to neutralize SARS-CoV-2 pseudovirus infection previously (Lei et.al., 2020) and is used in this assay as a positive control. After incubation for 1 hour at 37° C. in a 96-well plate format, 5×10{circumflex over ( )}4 Huh7 cells is added to each well. After 24 hours of incubation in a 5% CO2 chamber at 37° C., luminescence is measured by adding luciferase substrate and the luminescence measured using a 96-well plate luminescence plate reader. Upon subtraction of background luminescence, relative light units (RLU) versus the concentration of test sample and controls is plotted to generate an inhibitory dose response curve from which the IC50 is calculated. Human plasma added armY-ACE2 is neutralizing SARS-CoV-2 pseudovirus infection of Huh7 in a dose-dependent fashion. Human plasma added Protein M or human plasma alone is not neutralizing SARS-CoV-2 pseudovirus infection in this assay.

Example 14. Evaluating Eradication of SARS-CoV-2 by armY-ACE2 In Vivo

While the Example 13 evaluates the efficacy of armY-ACE2 engaged immunoglobulins to neutralize SARS-CoV-2 in vitro, this Example will demonstrate the efficacy of armY-ACE2 to promote eradication of SARS-CoV-2 in vivo, thereby protecting the animal from a severe clinical disease and succumbing to a lethal infection.

Protein M binds to immunoglobulin of various species including those of man and mice. Commercially available human ACE2 transgenic mice K18-hACE2 (The Jackson Laboratory, Stock #034860) develops severe clinical disease upon infection with SARS-CoV (McCray et.al., 2007) to a similar degree observed in patients with severe Covid-19. According to CDC, “Among patients who developed severe disease, the median time to dyspnea from the onset of illness or symptoms ranged from 5 to 8 days, the median time to acute respiratory distress syndrome (ARDS) from the onset of illness or symptoms ranged from 8 to 12 days, and the median time to ICU admission from the onset of illness or symptoms ranged from 10 to 12 days.”

According to JAX laboratory, “These K18-hACE2 mice develop a rapidly lethal infection after intranasal inoculation with a human strain of SARS-CoV. Infection begins in airway epithelia, with subsequent alveolar involvement and extrapulmonary virus spread to the brain. Infection results in macrophage and lymphocyte infiltration in the lungs and upregulation of proinflammatory cytokines and chemokines in both the lung and the brain. By days 3 to 5 postinfection, K18-hACE2 mice begin to lose weight and become lethargic with labored breathing.” K18-hACE2 mice become moribund 4 days after inoculation, and all mice are dead 7 days after inoculation.

Recently, it was determined that K18-hACE2 mice “present with more symptomatic disease than other hACE2 mouse models of SARS-CoV-2 infection.” (Moreau et.al., 2020) For this reason and because the k18-hACE2 mice are readily commercially available, we employed the k18-hACE2 SARS-CoV model to evaluate the efficacy of army-ACE2 engaged immunoglobulins to eradicate SARS-CoV or SARS-CoV-2 in vivo following the methods as described (McCray et.al., 2007) with slight modifications.

Infection of K18-hACE2 mice with SARS-CoV or SARS-CoV-2. SARS-CoV and SARS-CoV-2 strains is obtained from the Centers for Disease Control, Atlanta, Ga. The virus is propagated and titered on Vero E6 cells in a biosafety level 3 laboratory and the virus titer is determined by a plaque assay.

Mice are lightly anesthetized with isoflurane and infected intranasally with the indicated dosage of SARS-CoV or SARS-CoV-2 in 30 ul of Dulbecco's modified Eagle medium. Infected mice are examined, weighed and evaluated for severe clinical disease on a daily basis monitoring for appearances of lethargy, labored breathing, moribund and death.

Treatment with armY-ACE2. Plasma from mice of the same background (C57BL/6J×SJL/J) as K18-hACE2 mice is harvested and mixed with armY-ACE2 and allowed to incubate at 37° C. for 1-2 hours to permit arming of plasma immunoglobulins. Infected mice (n=6) receive daily injections of 0.2 ml of armY-ACE2 plasma beginning one day after infection for 7 days. Two other cohorts of mice receive Protein M+plasma or plasma alone and serve as negative control treatment groups.

To obtain specimens for virus titers, a few animals are sacrificed before injection and after 1, 2, 3, 4, 5 and 6 days after infection, and organs are aseptically removed into sterile phosphate-buffered saline. In some cases, blood is obtained via catheterization of the inferior vena cava. Tissues are homogenized using a manual homogenizer, and the 50% tissue culture infective dose (TCID) is determined as described previously (Subbarao et.al., 2004) to determine the amount of virus per gram of tissue. Mice treated with army-ACE2 plasma do not succumb to infection, whereas mice in the negative control groups succumb. Surviving mice are permitted to continue in the study over an additional 2 months. These mice developed immunity to the virus and are protected from a subsequent challenge with the virus. Surviving mice are re-infected and examined, weighed daily and evaluated.

At termination, whole-lung lavage is performed and the lavage is evaluated for cellular and biochemical changes using standard techniques. Significantly lower cellular infiltrates and inflammatory markers in armY-ACE2 plasma treated mice are found as compared to mice in the negative control groups. Lungs and other organs are examined by histology and immunohistochemistry to evaluate the degree of disease pathology and detect viral antigen. Significantly lower severe disease pathology and viral presence in the lungs and organs or armY-ACE2 plasma treated mice are found as compared to mice in the negative control groups, indicating effective viral clearance and eradication. Similar findings are observed in armY-ACE2 plasma treated mice that had developed immunity to the virus and re-challenged with the virus.

Extraction of total RNA and quantitative reverse transcription-PCR (RTPCR) are performed to measure levels of viral RNA in various tissue specimen. An aliquot of cDNA is subjected to PCR using a MyiQ single-color real-time PCR detection system with iQ SYBR green Supermix. A set of primers is used for the SARS-CoV or SARS-CoV-2 nucleocapsid (N) gene or a house-keeping gene. Significantly lower viral genes in specimens acquired from armY-ACE2 plasma treated mice are found as compared to mice in the negative control groups, indicating effective viral clearance and eradication. Similar findings are found in armY-ACE2 plasma treated mice that had developed immunity to the virus and re-challenged with the virus.

Example 15. Identification of Two Potential Immunogenic Peptide Regions in Protein M (a.a. 469-556)

The online B-cell epitope prediction tools (IEDB Analysis Resource) were used to determine potential immunogenic peptide regions in Protein M (a.a. 469-556). Using six online prediction tools [Bepipred Linear Epitope Prediction 2.0, Bepipred Linear Epitope Prediction, Chou & Fasman Beta-Turn Prediction, Emini Surface Accessibility Prediction, Karplus & Schulz Flexibility Prediction and Parker Hydrophilicity Prediction] two peptide regions in protein M c-terminal end (469-556 amino acid) were determined to be potentially immunogenic. The following peptide substitutions are proposed for these two regions to mitigate immunogenicity of Protein M C-terminal end (469-556 amino acid), which are listed below in a) an b), and additionally shown in the following Table 1. Complete Protein M amino acid sequences with substitutions are presented as SEQ ID NO: 63-66.

a) 494-507 amino acid of protein M

(based on SEQ ID NO: 1)

1. QALANATASALAAM

2. AKLANATASALARM

3. QALEADADSALEAM

4. AKLANDTASSAERA

b) 527-540 amino acid of protein M

(based on SEQ ID NO: 1)

1. AIAGVASATNAVAS

2. AIAGVASATNAVKS

3. DIAGVSADTAEVAS

4. AITGASSATNAVKA

TABLE 1

Proposed alanine substitutions for Protein M c-terminal

end (469-556 amino acid) to mitigate immunogenicity.

a)
494
495
496
497
498
499
500
501
502
503
504
505
506
507

Original
Q
K
L
E
N
D
T
D
S
S
L
E
R
M

Subs #1
Q
A
L
A
N
A
T
A
S
A
L
A
A
M

Subs #2
A
K
L
A
N
A
T
A
S
A
L
A
R
M

Subs #3
Q
A
L
E
A
D
A
D
S
A
L
E
A
M

Subs #4
A
K
L
A
N
D
T
A
S
S
A
E
R
A

b)
527
528
529
530
531
532
533
534
535
536
537
538
539
540

Original
D
I
T
G
V
S
S
D
T
N
E
V
K
S

Subs #1
A
I
A
G
V
A
S
A
T
N
A
V
A
S

Subs #2
A
I
A
G
V
A
S
A
T
N
A
V
K
S

Subs #3
D
I
A
G
V
S
A
D
T
A
E
V
A
S

Subs #4
A
I
T
G
A
S
S
A
T
N
A
V
K
A

Example 16. Non-Immune Human Serum or Plasma Antibodies Armed with armY-ACE2 Bind to SARS-CoV-2 Spike Protein (FIG. 14A-C)

Binding to SARS-CoV-2 spike protein by armY-ACE2 armed non-immune serum antibodies was demonstrated by measuring the amount of armed antibodies that bind the SARS-CoV-2 spike protein coated on a 96-well plate by an ELISA-based method.

Briefly, 50 ul of 5 ug/ml SARS-CoV-2 spike protein was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed 2×s with PBS (phosphate buffered saline pH 7.4) (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. [armY-ACE2+antibody] complexes were allowed to form by mixing armY-ACE2 with non-immune serum (pre-vaccine) diluted 1:200 in assay medium for 60 minutes in a 37° C. incubator. Mixtures containing pre-vaccine serum diluted 1:200 in assay medium, post-vaccine (Moderna SARS-CoV-2 spike mRNA vaccine) serum diluted 1:200 in assay medium or armY-ACE2 in assay medium, were included as controls and placed in a 37° C. incubator for 60 minutes. The final concentration of armY-ACE2 was 20 ug/ml. Binding of armY-ACE2 armed non-immune plasma (ACD-A) antibodies to SARS-CoV-2 spike protein was demonstrated following the same procedure described above.

After the incubation period, the samples were added to SARS-CoV-2 spike protein coated wells that had been washed 2×s with PBS+Tween 20 (wash buffer, Pierce), in duplicate. After approximately 120 minutes at room temperature, the wells were washed 4×s and anti-human IgG labeled with HRP (Genscript) was added to the wells and allowed to incubate at room temperature for approximately 25 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and the blue color absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software) and a photo taken with a digital camera.

After incubating serum- or plasma-antibodies with armY-ACE2, the amount of free armY-ACE2 was determined by measuring the amount of unengaged armY-ACE2 that can bind to immobilized human IgG. Briefly, serum- or plasma samples incubated with armY-ACE2 as described above, were added to human IgG coated wells, in duplicate. 1 ug/ml of armY-ACE2 alone in assay buffer was added to separate wells as a reference positive control. After approximately 100 minutes at room temperature, the wells were washed and mouse IgG1 anti-myc antibody (clone: 9E10) in assay buffer was added to detect the myc-tagged armY-ACE2 bound to human IgG coated on the wells. After approximately 30 minutes, the wells were washed 3×s and anti-mouse IgG labeled with HRP was added to the wells. After approximately 20 minutes, the wells were washed 4×s and TMB substrate solution (Biolegend, Inc.) was added to the wells and the absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 17. Monoclonal Antibody (mAb) Armed with armY-ACE2 Gains the Ability to Bind the SARS-CoV-2 Spike Protein, but is No Longer Able to Bind its Natural Target Antigen (FIG. 15 A-B)

Binding to SARS-CoV-2 spike protein by armY-ACE2 armed mAb (originally anti-selectin) was demonstrated by measuring the amount of armed mAbs that bind to the SARS-CoV-2 spike protein coated on a 96-well plate by an ELISA-based method.

Briefly, 50 ul of 5 ug/ml SARS-CoV-2 spike protein was prepared in ELISA coating buffer (Biolegend, Inc.) and added to a flat bottom 96-well plate (Immulon 2HB). The next day, the wells were washed 2×s with PBS (phosphate buffered saline pH 7.4) (Gibco) and 100 ul 3% BSA in PBS (Boston Bioproducts, Inc.) was added to block unbound sites on the well. [armY-ACE2+mAb] complexes were allowed to form by mixing armY-ACE2 with the mAb in assay medium for 120 minutes in a 37° C. incubator. Mixtures containing mAb in assay medium, armY-ACE2 in assay medium, or assay medium alone were included as controls and placed in a 37° C. incubator for 120 minutes. The final concentration of mAb and armY-ACE2 were 1 ug/ml and 30 ug/ml, respectively.

After the incubation period, the samples were added to SARS-CoV-2 spike protein coated wells, in duplicate, that had been washed 2×s with PBS+Tween 20 (wash buffer, Pierce). After approximately 120 minutes at room temperature, the wells were washed 3×s and anti-human IgG labeled with HRP (Southern Biotech) was added to the wells and allowed to incubate at room temperature for approximately 45 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

The inability of armY-ACE2 armed mAb (originally anti-selectin) to bind to its natural antigen was demonstrated by measuring the amount of armed mAb that bind to selectin protein coated on a 96-well plate by an ELISA-based method.

Briefly, 100 ul of 2 ug/ml biotinylated selectin protein was prepared in PBS and added to a flat bottom 96-well plate (streptavidin coated wells) after 2 washes with PBS+Tween 20 (wash buffer, Pierce).

[armY-ACE2+mAb] complexes were allowed to form by mixing armY-ACE2 with the mAb in assay medium for 120 minutes in a 37° C. incubator. Mixtures containing mAb in assay medium or assay medium alone were included as controls and placed in a 37° C. incubator for 120 minutes. The final concentration of mAb and armY-ACE2 were 63 ng/ml and 15 ug/ml (50× molar excess) or 7.5 ug/ml (25× molar excess), respectively.

After 2 hours, the selectin-coated wells were washed 2×s with wash buffer and the mixtures added in duplicate wells and allowed to incubate for 1 hour at room temperature. After the incubation period, the wells were washed 3×s. After approximately 60 minutes at room temperature, the wells were washed 3×s and anti-human IgG labeled with HRP (Southern Biotech)+2% mouse serum was added to the wells and allowed to incubate at room temperature for approximately 60 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 18. Biotinylated Protein M Detects the Light-Chain of Antibody, but not the Heavy-Chain on Western Blot (FIG. 16)

Protein M containing an N-terminal biotinylation “Avi-Tag” sequence was mono-biotinylated using the Accelagen TurboBiotinylation kit following the reaction protocol (Accelagen, TurboBiotinylation-protocol). The use of mono-biotinylated protein M fusion as an immunologic research tool for detection of antibody light-chain was demonstrated using a 1D gel electrophoresis Western blot method.

Briefly, the antibody sample was diluted to 1.0 mg/mL with sodium dodecyl sulfate (SDS) boiling buffer and heated to 95° C. for 10 minutes, and further diluted to 0.01 mg/mL. The E. coli (K12 MG1655) lysate sample was diluted to 2.5 mg/mL in SDS boiling buffer. SDS slab gel electrophoresis was carried out under reducing conditions according to the method of Laemmli, U. (Nature 227: 680-685, 1970) as modified by O'Farrell (J Biol. Chem. 250: 4007-4021). The samples were loaded in wells in 10% acrylamide slab gels (0.75 mm thick). SDS slab gel electrophoresis was carried out for about 4 hours at 15 mA/gel. The following proteins (Millipore Sigma) were used as molecular weight standards: myosin (220,000), phosphorylase A (94,000), catalase (60,000), actin (43,000), carbonic anhydrase (29,000), and lysozyme (14,000, not shown). After slab gel electrophoresis, the gel for blotting was placed in transfer buffer (10 mM CAPS, pH 11.0, 10% methanol) and transblotted onto PVDF membranes overnight at 145 mA and approximately 100 volts/two gels. The blots were stained with Coomassie Brilliant Blue R-250, cut into pieces at the dark lines and flatbed scanned (not shown).

Western Blot analysis. The membrane sections were destained in 100% MeOH and rinsed briefly in Tween-20 tris buffer saline (TTBS). The blot was blocked for two hours in Superblock with 0.05% Tween-20 (Superblock-T). The blot was then incubated overnight in Superblock-T and rinsed 3×10 minutes in TTBS. The blot was then placed in mono-biotinylated protein M diluted to 1.0 μg/ml in Superblock-T for two hours and rinsed as above. The blot was then placed in poly-HRP streptavidin (ThermoFisher, Cat #N200) diluted 1:500,000 in Superblock-T for two hours, rinsed as above, treated with ThermoFisher Pierce ECL, and exposed to x-ray film for 3 minutes.

Example 19. Protein M Fusion Serves as a Surrogate Antigen and May be Used to Confirm that a) Antigen Binding is Via the Fab Domain of the Antibody and b) the Antibody's Target Antigen and/or Specificity (FIG. 17)

Binding of antibody to antigen is mediated by the Fab arm of the antibody, which contains the variable region where the antigen binding site is found. Protein M binds specifically to the light-chain variable region in the Fab and blocks the antigen binding site. Therefore, protein M may serve as a surrogate antigen and as an immunologic research tool, and used to a) confirm that the antibody binds via its Fab domain and b) confirms its specificity as it loses its antigen binding ability when bound by protein M. A 96-well ELISA-based method was used to demonstrate such protein M uses.

Protein M fusion serves as a surrogate antigen.

Briefly, 100 ul of 2 ug/ml mono-biotinylated protein M was prepared in PBS and added to a flat bottom 96-well plate (streptavidin coated wells) after 2 washes with PBS+Tween 20 (wash buffer, Pierce).

After approximate 2 hours at room temperature, the coated wells were washed 2×s with wash buffer and the varying amounts of monoclonal antibody (originally anti-selectin) were added in duplicate wells and allowed to incubate for approximately 2 hours at room temperature. After the incubation period, the wells were washed 3×s and anti-human IgG labeled with HRP (Southern Biotech) was added to the wells and allowed to incubate at room temperature for approximately 60 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Example 20. Protein M is Used to Confirm that a) the Antibody Binds Via its Fab Domain and b) it Loses its Antigen Binding Ability when Bound by Protein M (FIG. 18)

Protein M+monoclonal antibody (mAb) complexes were allowed to form by mixing protein M with the mAb in assay medium for 120 minutes in a 37° C. incubator. Mixtures containing mAb in assay medium or assay medium alone were included as controls and placed in a 37° C. incubator for 120 minutes. The final concentration of mAb and protein M were 125 ng/ml and 3.4 ug/ml (50× molar excess) or 1.7 ug/ml (25× molar excess), respectively.

After 2 hours, the selectin-coated wells were washed 2×s with wash buffer and the mixtures added in duplicate wells and allowed to incubate for approximately 90 minutes at room temperature. After the incubation period, the wells were washed 3×s. After approximately 60 minutes at room temperature, the wells were washed 3×s and anti-human IgG labeled with HRP (Southern Biotech) was added to the wells and allowed to incubate at room temperature for approximately 60 minutes. After four washes, TMB substrate solution (Biolegend, Inc.) was added to the wells and absorbance at 650 nm measured using a plate reader (Molecular Devices Thermomax and Softmax Pro software).

Other Embodiments

The detailed description set-forth above is provided to aid those skilled in the art in practicing the present invention. However, the invention described and claimed herein is not to be limited in scope by the specific embodiments herein disclosed because these embodiments are intended as illustration of several aspects of the invention. Any equivalent embodiments are intended to be within the scope of this invention. Indeed, various modifications of the invention in addition to those shown and described herein will become apparent to those skilled in the art from the foregoing description which do not depart from the spirit or scope of the present inventive discovery. Such modifications are also intended to fall within the scope of the appended claims.

REFERENCES CITED

All publications, patents, patent applications and other references cited in this application are incorporated herein by reference in their entirety for all purposes to the same extent as if each individual publication, patent, patent application or other reference was specifically and individually indicated to be incorporated by reference in its entirety for all purposes. Citation of a reference herein shall not be construed as an admission that such is prior art to the present invention.

Specifically intended to be within the scope of the present invention, and incorporated herein by reference in its entirety, is the following publication:

Grover R K, Zhu X, Nieusma T, et al. A structurally distinct human mycoplasma protein that generically blocks antigen-antibody union. Science. 2014; 343(6171):656-661.
Blötz C, Singh N, Dumke R, Stülke J. Characterization of an Immunoglobulin Binding Protein (IbpM) From Mycoplasma pneumoniae. Front Microbiol. 2020; 11:685. Lei C, Qian K, Li T, et al. Neutralization of SARS-CoV-2 spike pseudotyped virus by recombinant ACE2-Ig. Nat Commun. 2020; 11(1):2070.
Nie J, Li Q, Wu J, et al. Establishment and validation of a pseudovirus neutralization assay for SARS-CoV-2. Emerg Microbes Infect. 2020; 9(1):680-686.
Mccray P B, Pewe L, et al. Lethal infection of K18-hACE2 mice infected with severe acute respiratory syndrome coronavirus. J Virol. 2007; 81(2):813-21.
G. Brett Moreau, Stacey L. Burgess, Jeffrey M. Sturek, Alexandra N. Donlan, William A. Petri Jr., Barbara J. Mann Evaluation of K18-hACE2 mice as a model of SARS-CoV-2 infection. bioRxiv 2020.06.26.171033.
Subbarao K, Mcauliffe J, Vogel L, et al. Prior infection and passive transfer of neutralizing antibody prevent replication of severe acute respiratory syndrome coronavirus in the respiratory tract of mice. J Virol. 2004; 78(7):3572-7.
Mebrahtu E, Zheleznyak A, Hur M A, Laforest R, Lapi S E. Initial characterization of a dually radiolabeled peptide for simultaneous monitoring of protein targets and enzymatic activity. Nucl Med Biol. 2013; 40(2):190-6.

SEQUENCE LISTING

1)

Mycoplasma genitalium

The mature protein M sequence

(37-556 amino acid)

SEQ ID NO: 1

TNLVNQSGYALVASGRSGNLGFKLFSTQSPSAEVK

LKSLSLNDGSYQSEIDLSGGANFREKFRNFANELS

EAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFK

AGYYDHVASDGSLLSYYQSTEYFNNRVLMPILQTT

NGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTS

NNLTYDKWTYFAAKGSPLYDSYPNHFFEDVKTLAI

DAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNEL

QLPESVKKVSLYGDYTGVNVAKQIFANVVELEFYS

TSKANSFGFNPLVLGSKTNVIYDLFASKPFTHIDL

TQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQF

QGYFAGGYIDKYLVKNVNTNKDSDDDLVYRSLKEL

NLHLEEAYREGDNTYYRVNENYYPGASIYENERAS

RDSEFQNEILKRAEQNGVTFDENIKRITASGKYSV

QFQKLENDTDSSLERMTKAVEGLVTVIGEEKFETV

DITGVSSDTNEVKSLAKELKTNALGVKLKL

2)

Mycoplasma pneumoniae

IgG-blocking mature protein M

sequence (36-582 amino acid)

SEQ ID NO: 2

AVLIVNEVLRLQSGETLIASGRSGNLSFQLYSKVN

QNAKSKLNSISLTDGGYRSEIDLGDGSNFREDFRN

FANNLSEAITDAPKDLLRPVPKVEVSGLIKTSSTF

ITPNFKAGYYDQVAADGKTLKYYQSTEYFNNRVVM

PILQTTNGTLTANNRAYDDIFVDQGVPKFPGWFHD

VDKAYYAGSNGQSEYLFKEWNYYVANGSPLYNVYP

NFIHFKQIKTIAFDAPRIKQGNTDGINLNLKQRNP

DYVIINGLTGDGSTLKDLELPESVKKVSIYGDYHS

INVAKQIFKNVLELEFYSTNQDNNFGFNPLVLGDH

TNIIYDLFASKPFNYIDLTSLELKDNQDNIDASKL

KRAVSDIYIRRRFERQMQGYWAGGYIDRYLVKNTN

EKNVNKDNDTVYAALKDINLHLEETYTHGGNTMYR

VNENYYPGASAYEAERATRDSEFQKEIVQRAELIG

VVFEYGVKNLRPGLKYTVKFESPQEQVALKSTDKF

QPVIGSVTDMSKSVTDLIGVLRDNAEILNITNVSK

DETVVAELKEKLDRENVFQEIRT

3)

Mycoplasma iowae

IgG-blocking mature protein M

sequence (21-497 amino acid)

SEQ ID NO: 3

VGVYVATTNTQNTSVNVNNNENINYKTNGTVVTGD

KLTFSAVVQQNSNISTQAFINDGTKPVGTYNKEIN

LGKDSITPKYTSGYVETYLESGDTVSRYSSSEYHN

NRTLMPILDTKEHYYTSERTYSEIQKGIYRGWEIS

TKSINYGEQFAYSASPVLKTVFRDLKQETIKAVQF

NLGLSDTSIESINSFLKTNTGIQFVTIKGISQDTD

LSKLVLPESVQKLTLLGQRNTINDLKLPSELQEIE

IYLGSSLKSIDPLIFPKSANIISDVVMNNTSSVFT

EIKLSDSTIDNNSPKLQKAIDDVYTYRIKERAFQG

LVPGGYIASWDLTGTKVTSFNNVNIPPLNDGTGRF

YIAHVEVKTDGNFGNSQNESIGSKPSNDSQINDWF

DWGGGWQKVQEVVVSSSENVSLETATQEIMGFIAK

YPNVKKINIVNVKLTDGSTHEQLKDNVIKAITAKY

GEESQYKDIEFVLPETVPSPVA

4)

Mycoplasma tullyi

IgG-blocking mature protein M

sequence (30-517 amino acid)

SEQ ID NO: 4

IVYTSVKISNTLNQDKQIAGSNLSPTQSNRLIGFQ

TLTKFKIQDLDFELQRKIYSSRLNSAELITKSAVV

LDQSTLQNHDGEVASGQPAPQVPPPVRIPAKEQTG

HTSDFISGYSENNLYYQTPYYYNDRVYMPILDSRK

TYLRNERTTTDIGLNNYEGWITSDHSRVNNRVNVF

NYRPSPELLAKYTDLAADKLIFTMTIDLYQANPEM

INEILKEYSPDFVILSNADSQVMKQLVFPSSVKKL

TIKSNLLDRFDFSLANTEIQELELYTPRLTEYNPF

ALNPNTHLIFDSNYSKPFTSINLYGVPLTHQQVLS

ALEDVFVRRHYERALQGSFSGGYISSLDLSNTGIT

SLSNLMIKNINPYYDSYTMSVKYNSNKNGEIELLK

TNSWKNPNPAPVSTPAASSPTTPTVPSTPGDSTIN

VQDKDLGLLVSSEVKVDPQVLINVVSKYLHNNPRV

NVLDISKVSLKSGSLVDVATNLKAKIDYLNVTI

5)

Mycoplasma imitans

IgG-blocking mature protein M

sequence (29-507 amino acid)

SEQ ID NO: 5

GIIYTSVKISSSQFNKQISNPIEVPKRNNTLIGFQ

TLARFKIENLDFELQKNIYSQNENALVNKAAVVQD

NSIINHDGEPTGQNERQVPAPVKILAKEQTGHTSD

FISGYTDNNSYYQSPFYYNDRVFMPILDSHSIYLK

NERTSKEIGLDSYEGWDKIGYSTINSRVSFVQYRA

TDQLIAKFNPSNKQIFAMMINLYQADPAVINNTLR

NYLPDFVILSNADNQIIKRLVFPSSVKKLTIKSNL

LDRFDFSLANSNIQELELYTPNLTEYNPLALNPDT

HLIFDTAYSKPFTSINLYGAKLTTQETQEAFNDIF

VRRYYERYLQGAFVGGYISLLDLSNTGINSVNDYV

VKNINPAYSSYTLSVTYNPGDPGQISILRTTTSIP

SETQPTNPSNNTPSQPTDPNITTQIDAKEKDLKLV

VSSTIQVDTQVVINVVGKYLLNNPRVNNVDISRIQ

LKSGTLVDIANNFKTKMSYLNVSV

6)

Mycoplasma gallisepticum

IgG-blocking mature protein M

sequence (29-417 amino acid)

SEQ ID NO: 6

GIIYTSVKISNSLYQDKLISGQNQPLAPVNRLIGF

QTLAKFRIEGLDFELKKKIYSSTVESVELVNRSAV

LVDDSVLENHDGELTSVQSDPQVPAPVKILAKEQT

GHTSDFVSGYSDDNKYYQSPYYYNDRVYMPILDSP

TIYLKNERTSSDIGLNNYQGWIAVGHARVNSRVSV

FNYRATDELLAKFNNLPDRLIFTMSIDLYQANPAM

INETLKEYSPDFVILSNADSQTMKQLVFPSSVKKL

TIKSNILDRFDFSLVNSEIQELELYTPNLTEYNPL

ALNPKTHLIFDADYSTRFLSINLYGAQLTNQQALA

ALEDVFVHRYYERALQGSFVDGYISSLVLSDTGIT

SLNNLVIKNINPNYDSYIMSVKYHSNDSGQIELLK

TTAW

7)

Mycoplasma alvi

IgG-blocking protein M sequence,

signal peptide included (28-540

amino acid)

SEQ ID NO: 7

ISIPFIIQSTHTNNANSTIPNVSKPSGSSLAPINY

SYDNFVNNYDGTLTSNSLVFSASGSKEVKSSLQTR

AITVDGLNDIDSSMGLVDAMSQGLLDNSYDPKYNE

VREVIDMDGAHRKIVTTKCFDNNRKYMPILTYNND

TYYSYSESRTWDDVNRSIYPGWNLNRSNLSSHNQN

KMIGVDILVYTPTEVLKTAYPSVTDKIIGLSISLS

NLISTYGDQTKQVLSQLIDAVNPSLVNFWGVSDSN

LDKLPDLSSNTNIKKISIRGDYSNLNGFVFPSSVL

ELEFSSQNYKAVDPLQIPESAAIIYEQGYSSYFTS

IDLSTHKGMSNEDLQKAVNVVYQQRIHERAFQGDF

AGGYIYSWNLRNTGIYSFNNVTIPMLTDGTGRFYI

AYVAVETDGNQGPIANEVISDNSSKPSNDSQINEW

FDWNQNGWSTITEVKITAKDNVKLNFNNTVQEILG

FINKYPNIKVVDISALQFSNDETLDELIDAVNKAI

ADKYTGMDGTPTVKLDFIKVNYL

8)

Mycoplasma penetrans

IgG-blocking mature protein M

sequence (31-505 amino acid)

SEQ ID NO: 8

LVTSNNNHENSLNNSSSNNGSNLKVNGSVISTDNL

NIVATGLSSNVSSQVSRQSLSSSSSSESTVDSKYT

AKKKLTTVSGQEKEYLVSTVYENNRKFMPILAYDE

DISYNNYQQSREYKDVVYGNFPGWDKKVAVVHQID

NVDLSKAYASVAEFTPTEILKKHFQVLQTSVKQLY

VALDSKTMTADVITKLVDRYQPDYLRIESVDDTSI

KQLPDMKYFSTVKKVDLGGAFTTIKGVSFPTTTQE

LKISSDNIKSIDPLQIPESAAIITETVHDARFTEI

DLSSHTDLTTDQLQKAVNIVYKDRIKERAFQGNFA

GGYIYSWNLQNTGITSFNDVSIPKLNDGTDRFYIA

YVAVSSGNSNGTANETITGGKEPSNDSQIGEWWDS

SSDGWSKVSKVTVTAKNGASLDYNKTLTEIMGFLA

KYPNVKTIDISLLKFEDASKTLDGLKTELTNQIKS

KYGEDSSYAKIDFIITSQSN

9)

Artificial armY-ACE2 fusion protein

sequence (1,298 amino acids).

Including the human IL-2 signal

sequence, human myc-peptide epitope

tag, linker, human

ACE2, linker, Mycoplasma genitalium

protein M

SEQ ID NO: 9

MYRMQLLSCIALSLALVTNSEQKLISEEDLLRKRG

SPGGAQSTIEEQAKTFLDKFNHEAEDLFYQSSLAS

WNYNTNITEENVQNMNNAGDKWSAFLKEQSTLAQM

YPLQEIQNLTVKLQLQALQQNGSSVLSEDKSKRLN

TILNTMSTIYSTGKVCNPDNPQECLLLEPGLNEIM

ANSLDYNERLWAWESWRSEVGKQLRPLYEEYVVLK

NEMARANHYEDYGDYWRGDYEVNGVDGYDYSRGQL

IEDVEHTFEEIKPLYEHLHAYVRAKLMNAYPSYIS

PIGCLPAHLLGDMWGRFWTNLYSLTVPFGQKPNID

VTDAMVDQAWDAQRIFKEAEKFFVSVGLPNMTQGF

WENSMLTDPGNVQKAVCHPTAWDLGKGDFRILMCT

KVTMDDFLTAHHEMGHIQYDMAYAAQPFLLRNGAN

EGFHEAVGEIMSLSAATPKHLKSIGLLSPDFQEDN

ETEINFLLKQALTIVGTLPFTYMLEKWRWMVFKGE

IPKDQWMKKWWEMKREIVGVVEPVPHDETYCDPAS

LFHVSNDYSFIRYYTRTLYQFQFQEALCQAAKHEG

PLHKCDISNSTEAGQKLFNMLRLGKSEPWTLALEN

VVGAKNMNVRPLLNYFEPLFTWLKDQNKNSFVGWS

TDWSPYADQSIKVRISLKSALGDKAYEWNDNEMYL

FRSSVAYAMRQYFLKVKNQMILFGEEDVRVANLKP

RISFNFFVTAPKNVSDIIPRTEVEKAIRMSRSRIN

DAFRLNDNSLEFLGIQPTLGPPNQPPVSGGGGSGG

GGSGGGGSTNLVNQSGYALVASGRSGNLGFKLFST

QSPSAEVKLKSLSLNDGSYQSEIDLSGGANFREKF

RNFANELSEAITNSPKGLDRPVPKTEISGLIKTGD

NFITPSFKAGYYDHVASDGSLLSYYQSTEYFNNRV

LMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNT

KATTVSTSNNLTYDKWTYFAAKGSPLYDSYPNHFF

EDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSG

NGSQLNELQLPESVKKVSLYGDYTGVNVAKQIFAN

VVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFAS

KPFTHIDLTQVTLQNSDNSAIDANKLKQAVGDIYN

YRRFERQFQGYFAGGYIDKYLVKNVNTNKDSDDDL

VYRSLKELNLHLEEAYREGDNTYYRVNENYYPGAS

IYENERASRDSEFQNEILKRAEQNGVTFDENIKRI

TASGKYSVQFQKLENDTDSSLERMTKAVEGLVTVI

GEEKFETVDITGVSSDTNEVKSLAKELKTNALGVK

LKL

10)

Artificial

Protein M with peptide tags

(aka: armY) protein sequence

(587 amino acids).

Including the human IL-2 signal

sequence. Avi-Tag, human myc-

peptide epitope tag, linker.

Mycoplasma genitalium protein M

SEQ ID NO: 10

MYRMQLLSCIALSLALVTNSMAGGLNDIFEAQKIE

WHEGGEQKLISEEDLLRKRAANGGGGSGGGGSTNL

VNQSGYALVASGRSGNLGFKLFSTQSPSAEVKLKS

LSLNDGSYQSEIDLSGGANFREKFRNFANELSEAI

TNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGY

YDHVASDGSLLSYYQSTEYFNNRVLMPILQTTNGT

LMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNL

TYDKWTYFAAKGSPLYDSYPNHFFEDVKTLAIDAK

DISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLP

ESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSK

ANSFGFNPLVLGSKTNVIYDLFASKPFTHIDLTQV

TLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGY

FAGGYIDKYLVKNVNTNKDSDDDLVYRSLKELNLH

LEEAYREGDNTYYRVNENYYPGASIYENERASRDS

EFQNEILKRAEQNGVTFDENIKRITASGKYSVQFQ

KLENDTDSSLERMTKAVEGLVTVIGEEKFETVDIT

GVSSDTNEVKSLAKELKTNALGVKLKL

11)

Artificial

Protein M horseradish peroxidase

(HRP) fusion protein sequence

(876 amino acids).

Including the human IL-2 signal

sequence, human myc-peptide

epitope, linker. HRP, linker,

Mycoplasma genitalium protein M

SEQ ID NO: 11

MYRMQLLSCIALSLALVTNSEQKLISEEDLAANQL

TPTFYDNSCPNVSNIVRDTIVNELRSDPRIAASIL

RLHFHDCFVNGCDASILLDNTTSFRTEKDAFGNAN

SARGFPVIDRMKAAVESACPRTVSCADLLTIAAQQ

SVTLAGGPSWRVPLGRRDSLQAFLDLANANLPAPF

FTLPQLKDSFRNVGLNRSSDLVALSGGHTFGKNQC

RFIMDRLYNFSNTGLPDPTLNTTYLQTLRGLCPLN

GNLSALVDFDLRTPTIFDNKYYVNLEEQKGLIQSD

QELFSSPNATDTIPLVRSFANSTQTFFNAFVEAMD

RMGNITPLTGTQGQIRLNCRVVNSNSGGGGSGGGG

SGGGGSTNLVNQSGYALVASGRSGNLGFKLFSTQS

PSAEVKLKSLSLNDGSYQSEIDLSGGANFREKFRN

FANELSEAITNSPKGLDRPVPKTEISGLIKTGDNF

ITPSFKAGYYDHVASDGSLLSYYQSTEYFNNRVLM

PILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKA

TTVSTSNNLTYDKWTYFAAKGSPLYDSYPNHFFED

VKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNG

SQLNELQLPESVKKVSLYGDYTGVNVAKQIFANVV

ELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKP

FTHIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYR

RFERQFQGYFAGGYIDKYLVKNVNTNKDSDDDLVY

RSLKELNLHLEEAYREGDNTYYRVNENYYPGASIY

ENERASRDSEFQNEILKRAEQNGVTFDENIKRITA

SGKYSVQFQKLENDTDSSLERMTKAVEGLVTVIGE

EKFETVDITGVSSDTNEVKSLAKELKTNALGVKLK

L

12)

Artificial

Set of three Glycine (G₄)-Serine

(S₁) linker sequence

(1-15 amino acid)

SEQ ID NO: 12

GGGGSGGGGSGGGGS

13)

Artificial

Set of two Glycine (G₄)-Serine

(S₁) linker sequence

(1-10 amino acid)

SEQ ID NO: 13

GGGGSGGGGS

14)

SEQ ID NO: 14

Artificial

Set of one Glycine (G₄)-Serine

(S₁) linker sequence

(1-5 amino acid)

GGGGS

15)

SEQ ID NO: 15

Human

Angiotensin-Converting Enzyme 2 (ACE2) Extracellular Domain Protein Sequence (18-740 Amino Acid)

Essential counter-regulatory carboxypeptidase of the renin-angiotensin hormone system that is a critical regulator of blood volume, systemic vascular resistance, and thus cardiovascular homeostasis. This receptor acts as an attachment receptor for human coronaviruses SARS-CoV and SARS-CoV-2, as well as human coronavirus NL63/HCoV-NL63

16)

SEQ ID NO: 16

QSTIEEQAKTFLDKFNHEAEDLFYQSSLASWNYNT

NITEENVQNMNNAGDKWSAFLKEQSTLAQMYPLQE

IQNLTVKLQLQALQQNGSSVLSEDKSKRLNTILNT

MSTIYSTGKVCNPDNPQECLLLEPGLNEIMANSLD

YNERLWAWESWRSEVGKQLRPLYEEYVVLKNEMAR

ANHYEDYGDYWRGDYEVNGVDGYDYSRGQLIEDVE

HTFEEIKPLYEHLHAYVRAKLMNAYPSYISPIGCL

PAHLLGDMWGRFWTNLYSLTVPFGQKPNIDVTDAM

VDQAWDAQRIFKEAEKFFVSVGLPNMTQGFWENSM

LTDPGNVQKAVCHPTAWDLGKGDFRILMCTKVTMD

DFLTAHHEMGHIQYDMAYAAQPFLLRNGANEGFHE

AVGEIMSLSAATPKHLKSIGLLSPDFQEDNETEIN

FLLKQALTIVGTLPFTYMLEKWRWMVFKGEIPKDQ

WMKKWWEMKREIVGVVEPVPHDETYCDPASLFHVS

NDYSFIRYYTRTLYQFQFQEALCQAAKHEGPLHKC

DISNSTEAGQKLFNMLRLGKSEPWTLALENVVGAK

NMNVRPLLNYFEPLFTWLKDQNKNSFVGWSTDWSP

YADQSIKVRISLKSALGDKAYEWNDNEMYLFRSSV

AYAMRQYFLKVKNQMILFGEEDVRVANLKPRISFN

FFVTAPKNVSDIIPRTEVEKAIRMSRSRINDAFRL

NDNSLEFLGIQPTLGPPNQPPVS

Human

CD209 (DC-SIGN) Extracellular Domain Protein Sequence (59-404 Amino Acid).

A pathogen-recognition receptor expressed on the surface of immature dendritic cells (DCs) and involved in initiation of primary immune response. This receptor acts as an attachment receptor for HIV-1, HIV-2, Ebolavirus, Cytomegalovirus, HCV, Dengue virus, Measles virus, Herpes simplex virus 1, Influenza virus, SARS-CoV, Japanese encephalitis virus, Lassa virus, Respiratory syncytial virus, Rift valley fever virus, West-nile virus, Marburg virus, Uukuniemi virus, and Yersinia Pestis

17)

SEQ ID NO: 17

QVSKVPSSISQEQSRQDAIYQNLTQLKAAVGELSE

KSKLQEIYQELTQLKAAVGELPEKSKLQEIYQELT

RLKAAVGELPEKSKLQEIYQELTWLKAAVGELPEK

SKMQEIYQELTRLKAAVGELPEKSKQQEIYQELTR

LKAAVGELPEKSKQQEIYQELTRLKAAVGELPEKS

KQQEIYQELTQLKAAVERLCHPCPWEWTFFQGNCY

FMSNSQRNWHDSITACKEVGAQLVVIKSAEEQNFL

QLQSSRSNRFTWMGLSDLNQEGTWQWVDGSPLLPS

FKQYWNRGEPNNVGEEDCAEFSGNGWNDDKCNLAK

FWICKKSAASCSRDEEQFLSPAPATPNPPPA

Human

C-Type Lectin Domain Family 4 Member M Extracellular Domain Protein Sequence (71-399 Amino Acid).

Probable pathogen-recognition receptor involved in peripheral immune surveillance in liver. This receptor acts as an attachment receptor for Ebolavirus, Hepatitis C virus, HIV-1, Human coronavirus 229E, Human cytomegalovirus/HHV-5, Influenza virus, SARS-CoV, West-nile virus, Japanese encephalitis virus, Marburg virus glycoprotein, and M. bovis.

18)

SEQ ID NO: 18

QVSKVPSSLSQEQSEQDAIYQNLTQLKAAVGELSE

KSKLQEIYQELTQLKAAVGELPEKSKLQEIYQELT

RLKAAVGELPEKSKLQEIYQELTRLKAAVGELPEK

SKLQEIYQELTRLKAAVGELPEKSKLQEIYQELTE

LKAAVGELPEKSKLQEIYQELTQLKAAVGELPDQS

KQQQIYQELTDLKTAFERLCRHCPKDWTFFQGNCY

FMSNSQRNWHDSVTACQEVRAQLVVIKTAEEQNFL

QLQTSRSNRFSWMGLSDLNQEGTWQWVDGSPLSPS

FQRYWNSGEPNNSGNEDCAEFSGSGWNDNRCDVDN

YWICKKPAACFRDE

Human

CD4 Extracellular Domain Protein Sequence (26-396 Amino Acid).

Integral membrane glycoprotein that plays an essential role in the immune response and serves multiple functions in responses against both external and internal offenses. In T-cells, functions primarily as a coreceptor for MHC class II molecule:peptide complex. This coreceptor acts as an attachment receptor for HIV.

19)

SEQ ID NO: 19

KKVVLGKKGDTVELTCTASQKKSIQFHWKNSNQIK

ILGNQGSFLTKGPSKLNDRADSRRSLWDQGNFPLI

IKNLKIEDSDTYICEVEDQKEEVQLLVFGLTANSD

THLLQGQSLTLTLESPPGSSPSVQCRSPRGKNIQG

GKTLSVSQLELQDSGTWTCTVLQNQKKVEFKIDIV

VLAFQKASSIVYKKEGEQVEFSFPLAFTVEKLTGS

GELWWQAERASSSKSWITFDLKNKEVSVKRVTQDP

KLQMGKKLPLHLTLPQALPQYAGSGNLTLALEAKT

GKLHQEVNLVVMRATQLQKNLTCEVWGPTSPKLML

SLKLENKEAKVSKREKAVWVLNPEAGMWQCLLSDS

GQVLLESNIKVLPTWSTPVQP

Human

Synaptic Vesicle Glycoprotein 2A Extracellular Domain Protein Sequence (469-598 Amino Acid).

Plays a role in the control of regulated secretion in neural and endocrine cells, enhancing selectively low-frequency neurotransmission. This protein acts as an attachment receptor for the C. botulinum neurotoxin type A2 (BoNT/A, botA).

20)

SEQ ID NO: 20

PDMIRHLQAVDYASRTKVFPGERVEHVTFNFTLEN

QIHRGGQYFNDKFIGLRLKSVSFEDSLFEECYFED

VTSSNTFFRNCTFINTVFYNTDLFEYKFVNSRLIN

STFLHNKEGCPLDVTGTGEGAYMVY

Human

Synaptic Vesicle Glycoprotein 2B Extracellular Domain Protein Sequence (412-535 Amino Acid).

Probably plays a role in the control of regulated secretion in neural and endocrine cells. This protein acts as an attachment receptor for the C. botulinum neurotoxin type A2 (BoNT/A, botA). Probably also serves as a receptor for the closely related C. botulinum neurotoxin type A1.

21)

SEQ ID NO: 21

PDMIRYFQDEEYKSKMKVFFGEHVYGATINFTMEN

QIHQHGKLVNDKFTRMYFKHVLFEDTFFDECYFED

VTSTDTYFKNCTIESTIFYNTDLYEHKFINCRFIN

STFLEQKEGCHMDLEQDND

Human

Synaptic Vesicle Glycoprotein 2C Extracellular Domain Protein Sequence (459-578 Amino Acid).

Plays a role in the control of regulated secretion in neural and endocrine cells, enhancing selectively low-frequency neurotransmission. This protein acts as an attachment receptor for C. botulinum neurotoxin type A (BoNT/A, botA). Also serves as a receptor for the closely related C. botulinum neurotoxin type A2.

22)

SEQ ID NO: 22

KPLQSDEYALLTRNVERDKYANFTINFTMENQIHT

GMEYDNGRFIGVKFKSVTFKDSVFKSCTFEDVTSV

NTYFKNCTFIDTVFDNTDFEPYKFIDSEFKNCSFF

HNKTGCQITFDDDYS

Human

Synaptotagmin I Extracellular Domain Protein Sequence (1-57 Amino Acid).

Calcium sensor that participates in triggering neurotransmitter release at the synapse. This protein acts as an attachment receptor for C. botulinum neurotoxin type B (BoNT/B, botB)

23)

SEQ ID NO: 23

MVSESHHEALAAPPVTTVATVLPSNATEPASPGEG

KEDAFSKLKEKFMNELHKIPLP

Human

Synaptotagmin II Extracellular Domain Protein Sequence (1-62 Amino Acid).

Exhibits calcium-dependent phospholipid and inositol polyphosphate binding properties. This protein acts as an attachment receptor for C. botulinum neurotoxin type B (BoNT/B, botB)

24) SEQ ID NO: 24

MRNIFKRNQEPIVAPATTTATMPIGPVDNSTESGGAGESQEDMFAKLKE

KLFNEINKIPLPP

Human

HLA Class II Histocompatibility Antigen, DRB1 Beta Chain Extracellular Domain Protein Sequence (30-227 Amino Acid).

A beta chain of antigen-presenting major histocompatibility complex class II (MHCII) molecule. This protein acts as an attachment receptor for Epstein-Barr virus and Staphylococcal enterotoxin A and B.

25) SEQ ID NO: 25

GDTRPRFLWQPKRECHFFNGTERVRFLDRYFYNQEESVRFDSDVGEFRA

VTELGRPDAEYWNSQKDILEQARAAVDTYCRHNYGVVESFTVQRRVQPK

VTVYPSKTQPLQHHNLLVCSVSGFYPGSIEVRWFLNGQEEKAGMVSTGL

IQNGDWTFQTLVMLETVPRSGEVYTCQVEHPSVTSPLTVEWRARSESAQ

SK

Human

Human

HLA Class II Histocompatibility Antigen, DR Alpha Chain Extracellular Domain Protein Sequence (26-216 Amino Acid).

Binds peptides derived from antigens that access the endocytic route of antigen presenting cells (APC) and presents them on the cell surface for recognition by the CD4 T-cells. This protein acts as an attachment receptor for Epstein-Barr virus BZLF2/gp42, Staphylococcus aureus enterotoxin A/entA, enterotoxin B/entB, enterotoxin C1/entC1, enterotoxin D/entD, and enterotoxin H/entH.

26) SEQ ID NO: 26

IKEEHVIIQAEFYLNPDQSGEFMFDFDGDEIFHVDMAKKETVWRLEEFG

RFASFEAQGALANIAVDKANLEIMTKRSNYTPITNVPPEVTVLTNSPVE

LREPNVLICFIDKFTPPVVNVTWLRNGKPVTTGVSETVFLPREDHLFRK

FHYLPFLPSTEDVYDCRVEHWGLDEPLLKHWEFDAPSPLPETTE

Human

T Cell Receptor Beta Variable 7-9 Mature Protein Sequence (22-115 Amino Acid).

V region of the variable domain of T cell receptor (TR) beta chain that participates in the antigen recognition. This protein acts as an attachment receptor for Staphylococcus aureus enterotoxin A/entA.

27) SEQ ID NO: 27

GVSQNPRHKITKRGQNVTFRCDPISEHNRLYWYRQTLGQGPEFLTYFQN

EAQLEKSRLLSDRFSAERPKGSFSTLEIQRTEQGDSAMYLCASSL

Human

T Cell Receptor Beta Variable 19 Mature Protein Sequence (22-114 Amino Acid).

28) SEQ ID NO: 28

GITQSPKYLFRKEGQNVTLSCEQNLNHDAMYWYRQDPGQGLRLIYYSQI

VNDFQKGDIAEGYSVSREKKESFPLTVTSAQKNPTAFYLCASSI

Human

Hepatitis a Virus Cellular Receptor 1 Extracellular Domain Protein Sequence (21-364 Amino Acid).

May play a role in T-helper cell development and the regulation of asthma and allergic diseases. This protein acts as an attachment receptor for Hepatitis A virus, Ebola virus, Marburg virus and Dengue virus and Clostridium perfringens Epsilon toxin (ETX).

29) SEQ ID NO: 29

SVKVGGEAGPSVTLPCHYSGAVTSMCWNRGSCSLFTCQNGIVWTNGTHV

TYRKDTRYKLLGDLSRRDVSLTIENTAVSDSGVYCCRVEHRGWFNDMKI

TVSLEIVPPKVTTTPIVTTVPTVTTVRTSTTVPTTTTVPMTTVPTTTVP

TTMSIPTTTTVLTTMTVSTTTSVPTTTSIPTTTSVPVTTTVSTFVPPMP

LPRQNHEPVATSPSSPQPAETHPTTLQGAIRREPTSSPLYSYTTDGNDT

VTESSDGLWNNNQTQLFLEHSLLTANTTKGIYAGVCISVLVLLALLGVI

IAKKYFFKKEVQQLSVSFSSLQIKALQNAVEKEVQAEDNIYIENSLYAT

D

Human

Myelin and Lymphocyte Protein Protein Sequence (1-153 Amino Acid).

Could be an important component in vesicular trafficking cycling between the Golgi complex and the apical plasma membrane. This protein acts as an attachment receptor for Clostridium perfringens Epsilon toxin (ETX).

30) SEQ ID NO: 30

MAPAAATGGSTLPSGFSVFTTLPDLLFIFEFIFGGLVWILVASSLVPWP

LVQGWVMFVSVFCFVATTTLIILYIIGAHGGETSWVTLDAAYHCTAALF

YLSASVLEALATITMQDGFTYRHYHENIAAVVFSYIATLLYVVHAVFSL

IRWKSS

Human

Complement Factor H Mature Protein Sequence (19-1231 Amino Acid).

Glycoprotein that plays an essential role in maintaining a well-balanced immune response by modulating complement activation. This protein binds to Streptococcus pneumoniae, Neisseria meningitides, Staphylococcus aureus, Borrelia burgdorferi and West nile virus.

31) SEQ ID NO: 31

EDCNELPPRRNTEILTGSWSDQTYPEGTQAIYKCRPGYRSLGNVIMVCR

KGEWVALNPLRKCQKRPCGHPGDTPFGTFTLTGGNVFEYGVKAVYTCNE

GYQLLGEINYRECDTDGWTNDIPICEVVKCLPVTAPENGKIVSSAMEPD

REYHFGQAVRFVCNSGYKIEGDEEMHCSDDGFWSKEKPKCVEISCKSPD

VINGSPISQKIIYKENERFQYKCNMGYEYSERGDAVCTESGWRPLPSCE

EKSCDNPYIPNGDYSPLRIKHRTGDEITYQCRNGFYPATRGNTAKCTST

GWIPAPRCTLKPCDYPDIKHGGLYHENMRRPYFPVAVGKYYSYYCDEHF

ETPSGSYWDHIHCTQDGWSPAVPCLRKCYFPYLENGYNQNYGRKFVQGK

SIDVACHPGYALPKAQTTVTCMENGWSPTPRCIRVKTCSKSSIDIENGF

ISESQYTYALKEKAKYQCKLGYVTADGETSGSITCGKDGWSAQPTCIKS

CDIPVFMNARTKNDFTWFKLNDTLDYECHDGYESNTGSTTGSIVCGYNG

WSDLPICYERECELPKIDVHLVPDRKKDQYKVGEVLKFSCKPGFTIVGP

NSVQCYHFGLSPDLPICKEQVQSCGPPPELLNGNVKEKTKEEYGHSEVV

EYYCNPRFLMKGPNKIQCVDGEWTTLPVCIVEESTCGDIPELEHGWAQL

SSPPYYYGDSVEFNCSESFTMIGHRSITCIHGVWTQLPQCVAIDKLKKC

KSSNLIILEEHLKNKKEFDHNSNIRYRCRGKEGWIHTVCINGRWDPEVN

CSMAQIQLCPPPPQIPNSHNMTTTLNYRDGEKVSVLCQENYLIQEGEEI

TCKDGRWQSIPLCVEKIPCSQPPQIEHGTINSSRSSQESYAHGTKLSYT

CEGGFRISEENETTCYMGKWSSPPQCEGLPCKSPPEISHGVVAHMSDSY

QYGEEVTYKCFEGFGIDGPAIAKCLGEKWSHPPSCIKTDCLSLPSFENA

IPMGEKKDVYKAGEQVTYTCATYYKMDGASNVTCINSRWTGRPTCRDTS

CVNPPTVQNAYIVSRQMSKYPSGERVRYQCRSPYEMFGDEEVMCLNGNW

TEPPQCKDSTGKCGPPPPIDNGDITSFPLSVYAPASSVEYQCQNLYQLE

GNKRITCRNGQWSEPPKCLHPCVISREIMENYNIALRWTAKQKLYSRTG

ESVEFVCKRGYRLSSRSHTLRTTCWDGKLEYPTCAKR

Human

Hepatocyte Growth Factor Receptor Extracellular Domain Protein Sequence (25-932 Amino Acid).

Receptor tyrosine kinase that transduces signals from the extracellular matrix into the cytoplasm by binding to hepatocyte growth factor/HGF ligand. This receptor acts as an attachment receptor for Listeria monocytogenes internalin InlB, mediating entry of the pathogen into cells.

32) SEQ ID NO: 32

ECKEALAKSEMNVNMKYQLPNFTAETPIQNVILHEHHIFLGATNYIYVL

NEEDLQKVAEYKTGPVLEHPDCFPCQDCSSKANLSGGVWKDNINMALVV

DTYYDDQLISCGSVNRGTCQRHVFPHNHTADIQSEVHCIFSPQIEEPSQ

CPDCVVSALGAKVLSSVKDRFINFFVGNTINSSYFPDHPLHSISVRRLK

ETKDGFMFLTDQSYIDVLPEFRDSYPIKYVHAFESNNFIYFLTVQRETL

DAQTFHTRIIRFCSINSGLHSYMEMPLECILTEKRKKRSTKKEVFNILQ

AAYVSKPGAQLARQIGASLNDDILFGVFAQSKPDSAEPMDRSAMCAFPI

KYVNDFFNKIVNKNNVRCLQHFYGPNHEHCFNRTLLRNSSGCEARRDEY

RTEFTTALQRVDLFMGQFSEVLLTSISTFIKGDLTIANLGTSEGRFMQV

VVSRSGPSTPHVNFLLDSHPVSPEVIVEHTLNQNGYTLVITGKKITKIP

LNGLGCRHFQSCSQCLSAPPFVQCGWCHDKCVRSEECLSGTWTQQICLP

AIYKVFPNSAPLEGGTRLTICGWDFGFRRNNKFDLKKTRVLLGNESCTL

TLSESTMNTLKCTVGPAMNKHFNMSIIISNGHGTTQYSTFSYVDPVITS

ISPKYGPMAGGTLLTLTGNYLNSGNSRHISIGGKTCTLKSVSNSILECY

TPAQTISTEFAVKLKIDLANRETSIFSYREDPIVYEIHPTKSFISGGST

ITGVGKNLNSVSVPRMVINVHEAGRNFTVACQHRSNSEIICCTTPSLQQ

LNLQLPLKTKAFFMLDGILSKYFDLIYVHNPVFKPFEKPVMISMGNENV

LEIKGNDIDPEAVKGEVLKVGNKSCENIHLHSEAVLCTVPNDLLKLNSE

LNIEWKQAISSTVLGKVIVQPDQNFT

Human

Membrane Cofactor Protein (CD46) Extracellular Domain Protein Sequence (35-343 Amino Acid).

Acts as a cofactor for complement factor I, a serine protease which protects autologous cells against complement-mediated injury by cleaving C3b and C4b deposited on host tissue. This protein acts as an attachment receptor for Adenovirus subgroup B2 and Ad3, Measles virus, Herpesvirus 6/HHV-6, Neisseria and Streptococcus pyogenes.

33) SEQ ID NO: 33

CEEPPTFEAMELIGKPKPYYEIGERVDYKCKKGYFYIPPLATHTICDRN

HTWLPVSDDACYRETCPYIRDPLNGQAVPANGTYEFGYQMHFICNEGYY

LIGEEILYCELKGSVAIWSGKPPICEKVLCTPPPKIKNGKHTFSEVEVF

EYLDAVTYSCDPAPGPDPFSLIGESTIYCGDNSVWSRAAPECKVVKCRF

PVVENGKQISGFGKKFYYKATVMFECDKGFYLDGSDTIVCDSNSTWDPP

VPKCLKVLPPSSTKPPALSHSVSTSSTTKSPASSASGPRPTYKPPVSNY

PGYPKPEEGILDSLD

Human

Glycophorin-A Extracellular Domain Protein Sequence (20-91 Amino Acid).

Glycophorin A is the major intrinsic membrane protein of the erythrocyte. This protein acts as an attachment receptor for Plasmodium falciparum, Influenza virus, Hepatitis A virus (HAV), Streptococcus gordonii.

34) SEQ ID NO: 34

SSTTGVAMHTSTSSSVTKSYISSQTNDTHKRDTYAATPRAHEVSEISVR

TVYPPEEETGERVQLAHHFSEPE

Human

C-Type Lectin Domain Family 4 Member K (Langerin, CD207) Extracellular Domain Protein Sequence (65-328 Amino Acid).

Calcium-dependent lectin displaying mannose-binding specificity. This protein binds to Candida species, Saccharomyces species, Malassezia furfur, human immunodeficiency virus-1 (HIV-1) and Yesinia pestis.

35) SEQ ID NO: 35

PRFMGTISDVKTNVQLLKGRVDNISTLDSEIKKNSDGMEAAGVQIQMVN

ESLGYVRSQFLKLKTSVEKANAQIQILTRSWEEVSTLNAQIPELKSDLE

KASALNTKIRALQGSLENMSKLLKRQNDILQVVSQGWKYFKGNFYYFSL

IPKTWYSAEQFCVSRNSHLTSVTSESEQEFLYKTAGGLWWIGLTKAGME

GDWSWVDDTPFNKVQSVRFWIPGEPNNAGNNEHCGNIKAPSLQAWNDAP

CDKTFLFICKRPYVPSEP

Human

Anthrax Toxin Receptor 1 Mature Protein Sequence (33-564 Amino Acid).

Plays a role in cell attachment and migration. Interacts with extracellular matrix proteins and with the actin cytoskeleton. This protein acts as an attachment receptor for Anthrax toxin.

36) SEQ ID NO: 36

EDGGPACYGGFDLYFILDKSGSVLHHWNEIYYFVEQLAHKFISPQLRMS

FIVFSTRGTTLMKLTEDREQIRQGLEELQKVLPGGDTYMHEGFERASEQ

IYYENRQGYRTASVIIALTDGELHEDLFFYSEREANRSRDLGAIVYCVG

VKDFNETQLARIADSKDHVFPVNDGFQALQGIIHSILKKSCIEILAAEP

STICAGESFQVVVRGNGFRHARNVDRVLCSFKINDSVTLNEKPFSVEDT

YLLCPAPILKEVGMKAALQVSMNDGLSFISSSVIITTTHCSDGSILAIA

LLILFLLLALALLWWFWPLCCTVIIKEVPPPPAEESEEEDDDGLPKKKW

PTVDASYYGGRGVGGIKRMEVRWGEKGSTEEGAKLEKAKNARVKMPEQE

YEFPEPRNLNNNMRRPSSPRKWYSPIKGKLDALWVLLRKGYDRVSVMRP

QPGDTGRCINFTRVKNNQPAKYPLNNAYHTSSPPPAPIYTPPPPAPHCP

PPPPSAPTPPIPSPPSTLPPPPQAPPPNRAPPPSRPPPRPSV

Human

Anthrax Toxin Receptor 2 Extracellular Domain Protein Sequence (34-318 Amino Acid).

Necessary for cellular interactions with laminin and the extracellular matrix. This protein acts as an attachment receptor for Anthrax toxin.

37) SEQ ID NO: 37

QEQPSCRRAFDLYFVLDKSGSVANNWIEIYNFVQQLAERFVSPEMRLSF

IVFSSQATIILPLTGDRGKISKGLEDLKRVSPVGETYIHEGLKLANEQI

QKAGGLKTSSIIIALTDGKLDGLVPSYAEKEAKISRSLGASVYCVGVLD

FEQAQLERIADSKEQVFPVKGGFQALKGIINSILAQSCTEILELQPSSV

CVGEEFQIVLSGRGFMLGSRNGSVLCTYTVNETYTTSVKPVSVQLNSML

CPAPILNKAGETLDVSVSFNGGKSVISGSLIVTATECSNG

Artificial

armY-Angiotensin-converting enzyme 2 (ACE2) fusion protein with N-terminal Myc-tag

codon-optimized (for human) nucleotide sequence (3,897 bp)

38) SEQ ID NO: 38

atgtacaggatgcaactcctgtcttgcattgcactaagtcttgcacttgtcacaaacagtgagcaaaagcttatctctgaagaggacttact

aagaaagcggggcagcccaggcggagcgcagagcacaatcgaggaacaggccaagaccttcctggacaagttcaaccacgaagctgaagac

ctgttctaccaatctagcctggctagttggaactacaacaccaacattacagaagagaacgtgcagaacatgaacaacgcaggcgacaagtggtcc

gccttccttaaagagcagtctacactggcccagatgtaccctctgcaagagattcagaatctgaccgtgaagctgcagctgcaggctctccagcaga

atgggtccagcgtgctgtctgaggataagagcaagcggctgaacaccatcctgaatacaatgagcaccatctacagcaccggcaaagtgtgtaac

cctgacaacccccaggagtgtctgctgctggaacctggcctgaacgaaatcatggccaactccctggactacaacgagagactgtgggcctggga

gagctggcgtagcgaggtgggaaaacagctgcgccccctgtatgaggagtacgtggtgctgaagaatgagatggccagagccaaccactacga

ggactacggcgactattggagaggcgattatgaagtcaacggcgttgacggctacgactacagccggggacagctgatcgaagacgtggaacat

acgtagaggagatcaagcctctgtacgagcacctgcacgcctacgtaagagccaaactgatgaatgcctaccccagctacatctcccctatcggct

gcctgcccgcccatctgctcggcgacatgtggggcagattctggaccaacctgtattctctgacagtgcctttcggccagaaacctaacatcgacgt

gacagatgccatggtggaccaggcctgggatgcccaaagaatcttcaaggaagccgagaaattcttcgtgtccgtggggctgcctaatatgaccca

gggcttctgggaaaacagcatgctcaccgatcctggcaacgtgcagaaggcagtgtgccaccccaccgcctgggaccttggaaagggcgacttc

cggattctgatgtgcaccaaggtgaccatggacgacttcctgaccgctcaccacgagatgggccacatccagtacgacatggcctacgccgctca

gcctttcctcctgagaaacggcgctaatgaaggcttccacgaggccgtgggcgaaatcatgagcctgagcgccgccacccctaagcacctgaagt

ctatcggactgctgagccccgactttcaggaggacaacgaaactgagatcaacttcttgctgaaacaggccctgacaatcgttggcaccctgccctt

tacctacatgctggaaaagtggagatggatggtctttaagggcgaaatccccaaggaccaatggatgaagaagtggtgggagatgaagcgggaa

atcgtgggcgtggtggaacctgtgccccacgacgagacatactgcgatcctgctagcctctttcacgtgagcaatgattactcattcatccggtacta

caccagaactctgtaccagttccagttccaggaggccctgtgccaggccgccaagcacgagggccctctgcacaagtgcgacatctctaacagca

ccgaggccggccagaagctgttcaacatgctgagactgggcaagagcgaaccttggacactggccctggagaacgtggtcggagccaagaaca

tgaacgtgagaccactgctgaactacttcgagcccctgttcacctggctgaaggatcaaaacaagaacagcttcgtgggctggtccacagactgga

gcccatacgctgatcagagcatcaaagtgaggatctctctgaagagcgccctgggagataaggcctacgagtggaacgataatgagatgtacctgt

tcagaagcagcgtggcctacgccatgcggcagtacttcctgaaagtgaagaaccagatgatcctgtttggcgaggaggatgtgagagtggccaat

ctgaaaccaagaatcagctttaactttttcgttaccgctcctaagaacgtgtctgatatcatccctagaaccgaggtggaaaaggccatcagaatgag

ccggtccagaatcaacgatgccttccgactgaatgacaactccctggagttcctgggaatccagcccaccctgggccctcctaaccagcctccagt

cagcggcggaggaggatctggcggtggaggctctggcggcggcggttcaacaaatctggtgaaccagagcggctacgccctggtggccagcg

gcagatccggcaatctgggcttcaagctgttcagcacccagtctccatctgccgaggtgaagctgaagagcctgagccttaacgacggcagctac

cagtccgagatcgacctgtcaggcggcgccaacttccgagaaaagttcagaaacttcgccaatgagctgagcgaggccatcacaaacagcccta

aaggcctggacagacctgtgcccaagacggaaatcagcggcctgatcaagacaggcgacaactttatcacccctagcttcaaggccggatattat

gaccacgtggcctctgatggctccctactgagctactaccagtccaccgagtacttcaacaacagagttctgatgcctatcctgcagacaacaaacg

gcactctgatggccaacaaccggggctacgacgacgttttcagacaagtgccctctttcagcggctggagcaacacaaaggccaccactgtgtcc

acaagcaacaatctgacatacgataagtggacctatttcgccgccaaaggcagccccctgtacgacagctaccccaaccacttcttcgaggacgtg

aagacactggccattgacgctaaggacatcagcgccctgaaaaccaccatcgacagcgagaagcctacctacctgattatccggggactgagcg

gaaacggcagccagctgaacgagctgcaactgcctgagtccgtgaaaaaggtgagcctgtacggcgactacaccggcgtgaacgtggctaagc

agatcttcgccaacgttgtggaactggaattctacagcaccagcaaggctaactcttttggctttaaccccctggtcctgggatctaaaacgaacgtga

tctacgacctgttcgcaagcaagcccttcacccacatcgacctgacacaggtgaccctgcaaaacagcgataattccgccatcgatgccaacaagc

tgaagcaagctgtgggcgatatctacaactacaggcggttcgagagacagtttcagggctacttcgccggaggctacatcgacaagtacctggtga

agaacgtcaataccaacaaggatagcgatgacgatctggtctaccggagcctgaaagagctgaacctccacctggaggaagcctacagagaagg

cgataacacctactacagagtgaatgagaactattaccctggagctagcatctacgagaacgagagagccagcagagacagcgagttccagaac

gagatcctgaagcgagccgagcagaacggcgtgacatttgacgagaacatcaaaagaatcacagccagcggcaagtatagcgtgcagttccaaa

agctagaaaatgataccgattccagcctggaaagaatgaccaaggccgtggaaggccttgtgaccgtgatcggcgaggaaaagttcgagacagt

ggatatcaccggcgtgtctagcgataccaatgaagtgaaaagcctggccaaggaactgaagaccaacgccctgggcgtcaagctgaaactctaa

Artificial

Protein M with N-terminal peptide Avi- and Myc-tags (aka: armY) codon-optimized (for

human) nucleotide sequence (1,764 bp)

39) SEQ ID NO: 39

atgtacaggatgcaactcctgtcttgcattgcactaagtcttgcacttgtcacaaacagtatggctggtggcctgaatgacatctttgaggc

ccagaagatcgagtggcatgagggaggagagcagaagctgatctccgaggaagatctgctgagaaagcgggccgccaacggcggaggagga

tctggcggtggaggctctaccaatctggtgaaccagagcggatacgccctggtggcctctgggagaagcggaaatctgggatttaagctgttcagt

acccagtctccaagcgctgaagtgaagctgaaaagcctctccctgaacgacggctcttatcagagcgagatcgacctgagcggcggcgctaactt

ccgggagaagttccgcaacttcgctaatgagctgtctgaagccatcacaaacagccctaagggcctggatagacctgtgcccaagacagaaatca

gcggcctgatcaagactggagataactttatcacccctagctttaaggccggctactacgaccatgtggctagcgacggttcactgctgtcctactac

cagtctacagagtactttaacaaccgggtgctgatgcctatactgcagaccaccaacggcaccctgatggccaataacagaggctacgatgacgtg

ttccggcaggtgcccagcttcagcggctggagcaacacaaaggccacaaccgtgagcacctccaacaacctgacctacgacaagtggacctact

tcgccgccaagggctctccactgtatgacagctatcctaaccacttcttcgaggacgtgaagacactggccatcgacgccaaggacatctctgccct

gaagaccaccatcgacagtgagaaacctacatacctgattatcagaggactgtccggcaacggcagccagctgaacgagcttcagctgcctgaga

gcgtgaaaaaggtgagcctgtacggcgactacacaggcgtcaatgtagctaagcaaatcttcgccaacgtggtggaactcgaattctacagcacat

ccaaggccaacagcttcggcttcaaccccctggtgctgggcagcaagaccaacgtgatctacgacctgttcgccagcaagcctttcacccacatcg

acctgacacaagtgaccctgcagaacagcgataacagcgccattgatgccaacaagctcaaacaggccgtgggcgatatctacaactacagaag

attcgagaggcagtttcagggctacttcgccggaggctatatcgataagtacctggtcaagaacgtgaacaccaacaaggactccgacgacgacct

ggtgtaccggagcctgaaggaactgaacctgcacctggaagaggcctacagagagggcgataatacctactacagagtgaacgagaactactac

cccggagctagcatctacgagaacgagagagcctctagagatagcgagttccagaacgagatcctgaagcgggccgagcagaatggcgtgaca

ttcgacgagaacatcaagcggatcaccgccagcggcaagtactccgtgcagttccaaaaactggaaaatgacaccgacagcagcctggaaagaa

tgaccaaggctgtggaaggcctggttacagttatcggcgaggagaagtttgaaaccgtggacatcaccggcgtgagctccgataccaatgaggtg

aaatctctggccaaagaactgaagacaaatgccctgggcgtcaaattaaaactgtaa

Artificial

Protein M horseradish peroxidase (HRP) fusion protein with N-terminal Myc-tag codon-

optimized (for human) nucleotide sequence (2,631 bp)

40) SEQ ID NO: 40

atgtacaggatgcaactcctgtcttgcattgcactaagtcttgcacttgtcacaaacagtgagcagaaactcatctcagaagaggatctgg

cagcaaatcagctgaccccaaccttctacgacaattcttgtccaaacgtctccaacatcgtgcgggacaccattgtgaacgagctgagaagcgacc

ctagaatcgccgcttctatcctgagactgcatttccacgactgcttcgtgaatggctgcgacgcctccatcctgctggacaacaccaccagcttccgg

acagagaaagacgccttcggaaatgccaacagcgctagaggcttccccgttatcgacagaatgaaggctgccgtggaatctgcctgccctcggac

cgtgagctgtgccgacctgctgaccatcgccgcccagcagagcgtgaccctggccggcggtcctagctggcgggtgcctctgggccggagaga

tagtctgcaggccttcctggatctggctaatgctaacctccccgctcctttctttaccctgcctcagctgaaggacagctacggaacgtcggcctaaa

cagaagcagcgacctggtggccctgtccggaggccacaccttcggcaagaaccagtgcagattcatcatggaccggctgtacaacttcagcaata

ccggcctgccagatcctacactgaacacaacctacctgcagacactgagaggcctgtgccccctcaacgggaatctgagcgccttggtggacttc

gacctgagaacccctaccatcttcgacaacaagtactacgtgaacctggaagaacagaagggcctgatccaaagcgatcaggagctgttctcttcc

cctaatgccacagacaccatccccctggtgcggtcattcgccaacagtacccagaccttttttaacgcttttgtggaagccatggatagaatgggcaa

catcacccctctgaccggaacacagggacagatcagactgaattgcagagtggtgaacagcaactctggcggaggaggatctggcggtggagg

ctctggcggcggcggttcaacaaatctggtgaaccagagcggctacgccctggtggccagcggcagatccggcaatctgggcttcaagctgttca

gcacccagtctccatctgccgaggtgaagctgaagagcctgagccttaacgacggcagctaccagtccgagatcgacctgtcaggcggcgccaa

cttccgagaaaagttcagaaacttcgccaatgagctgagcgaggccatcacaaacagccctaaaggcctggacagacctgtgcccaagacggaa

atcagcggcctgatcaagacaggcgacaactttatcacccctagcttcaaggccggatattatgaccacgtggcctctgatggctccctactgagct

actaccagtccaccgagtacttcaacaacagagttctgatgcctatcctgcagacaacaaacggcactctgatggccaacaaccggggctacgac

gacgttttcagacaagtgccctctttcagcggctggagcaacacaaaggccaccactgtgtccacaagcaacaatctgacatacgataagtggacc

tatttcgccgccaaaggcagccccctgtacgacagctaccccaaccacttcttcgaggacgtgaagacactggccattgacgctaaggacatcagc

gccctgaaaaccaccatcgacagcgagaagcctacctacctgattatccggggactgagcggaaacggcagccagctgaacgagctgcaactg

cctgagtccgtgaaaaaggtgagcctgtacggcgactacaccggcgtgaacgtggctaagcagatcttcgccaacgttgtggaactggaattctac

agcaccagcaaggctaactcttttggctttaaccccctggtcctgggatctaaaacgaacgtgatctacgacctgttcgcaagcaagcccttcaccca

catcgacctgacacaggtgaccctgcaaaacagcgataattccgccatcgatgccaacaagctgaagcaagctgtgggcgatatctacaactaca

ggcggttcgagagacagtttcagggctacttcgccggaggctacatcgacaagtacctggtgaagaacgtcaataccaacaaggatagcgatgac

gatctggtctaccggagcctgaaagagctgaacctccacctggaggaagcctacagagaaggcgataacacctactacagagtgaatgagaact

attaccctggagctagcatctacgagaacgagagagccagcagagacagcgagttccagaacgagatcctgaagcgagccgagcagaacggc

gtgacatttgacgagaacatcaaaagaatcacagccagcggcaagtatagcgtgcagttccaaaagctagaaaatgataccgattccagcctggaa

agaatgaccaaggccgtggaaggccttgtgaccgtgatcggcgaggaaaagttcgagacagtggatatcaccggcgtgtctagcgataccaatg

aagtgaaaagcctggccaaggaactgaagaccaacgccctgggcgtcaagctgaaactctaa

Artificial

armY-Angiotensin-converting enzyme 2 (ACE2) fusion protein codon-optimized (for human)

nucleotide sequence (3,837 bp)

41) SEQ ID NO: 41

atgtacaggatgcaactcctgtcttgcattgcactaagtcttgcacttgtcacaaacagtcagagcacaatcgaggaacaggccaagac

cttcctggacaagttcaaccacgaagctgaagacctgttctaccaatctagcctggctagttggaactacaacaccaacattacagaagagaacgtg

cagaacatgaacaacgcaggcgacaagtggtccgccttccttaaagagcagtctacactggcccagatgtaccctctgcaagagattcagaatctg

accgtgaagctgcagctgcaggctctccagcagaatgggtccagcgtgctgtctgaggataagagcaagcggctgaacaccatcctgaatacaat

gagcaccatctacagcaccggcaaagtgtgtaaccctgacaacccccaggagtgtctgctgctggaacctggcctgaacgaaatcatggccaact

ccctggactacaacgagagactgtgggcctgggagagctggcgtagcgaggtgggaaaacagctgcgccccctgtatgaggagtacgtggtgc

tgaagaatgagatggccagagccaaccactacgaggactacggcgactattggagaggcgattatgaagtcaacggcgttgacggctacgacta

cagccggggacagctgatcgaagacgtggaacatacgtagaggagatcaagcctctgtacgagcacctgcacgcctacgtaagagccaaactg

atgaatgcctaccccagctacatctcccctatcggctgcctgcccgcccatctgctcggcgacatgtggggcagattctggaccaacctgtattctct

gacagtgcctacggccagaaacctaacatcgacgtgacagatgccatggtggaccaggcctgggatgcccaaagaatcttcaaggaagccgag

aaattcttcgtgtccgtggggctgcctaatatgacccagggcttctgggaaaacagcatgctcaccgatcctggcaacgtgcagaaggcagtgtgc

caccccaccgcctgggaccttggaaagggcgacttccggattctgatgtgcaccaaggtgaccatggacgacttcctgaccgctcaccacgagat

gggccacatccagtacgacatggcctacgccgctcagcctacctcctgagaaacggcgctaatgaaggcttccacgaggccgtgggcgaaatca

tgagcctgagcgccgccacccctaagcacctgaagtctatcggactgctgagccccgactacaggaggacaacgaaactgagatcaacttcttgc

tgaaacaggccctgacaatcgttggcaccctgccattacctacatgctggaaaagtggagatggatggtattaagggcgaaatccccaaggacc

aatggatgaagaagtggtgggagatgaagcgggaaatcgtgggcgtggtggaacctgtgccccacgacgagacatactgcgatcctgctagcct

ctttcacgtgagcaatgattactcattcatccggtactacaccagaactctgtaccagttccagttccaggaggccctgtgccaggccgccaagcacg

agggccctctgcacaagtgcgacatctctaacagcaccgaggccggccagaagctgttcaacatgctgagactgggcaagagcgaaccttggac

actggccctggagaacgtggtcggagccaagaacatgaacgtgagaccactgctgaactacttcgagcccctgttcacctggctgaaggatcaaa

acaagaacagcttcgtgggctggtccacagactggagcccatacgctgatcagagcatcaaagtgaggatctctctgaagagcgccctgggagat

aaggcctacgagtggaacgataatgagatgtacctgttcagaagcagcgtggcctacgccatgcggcagtacttcctgaaagtgaagaaccagat

gatcctgtttggcgaggaggatgtgagagtggccaatctgaaaccaagaatcagctttaactttttcgttaccgctcctaagaacgtgtctgatatcatc

cctagaaccgaggtggaaaaggccatcagaatgagccggtccagaatcaacgatgccttccgactgaatgacaactccctggagttcctgggaat

ccagcccaccctgggccctcctaaccagcctccagtcagcggcggaggaggatctggcggtggaggctctggcggcggcggttcaacaaatct

ggtgaaccagagcggctacgccctggtggccagcggcagatccggcaatctgggcttcaagctgttcagcacccagtctccatctgccgaggtga

agctgaagagcctgagccttaacgacggcagctaccagtccgagatcgacctgtcaggcggcgccaacttccgagaaaagttcagaaacttcgc

caatgagctgagcgaggccatcacaaacagccctaaaggcctggacagacctgtgcccaagacggaaatcagcggcctgatcaagacaggcg

acaactttatcacccctagcttcaaggccggatattatgaccacgtggcctctgatggctccctactgagctactaccagtccaccgagtacttcaaca

acagagttctgatgcctatcctgcagacaacaaacggcactctgatggccaacaaccggggctacgacgacgttttcagacaagtgccctctttcag

cggctggagcaacacaaaggccaccactgtgtccacaagcaacaatctgacatacgataagtggacctatttcgccgccaaaggcagccccctgt

acgacagctaccccaaccacttcttcgaggacgtgaagacactggccattgacgctaaggacatcagcgccctgaaaaccaccatcgacagcga

gaagcctacctacctgattatccggggactgagcggaaacggcagccagctgaacgagctgcaactgcctgagtccgtgaaaaaggtgagcctg

tacggcgactacaccggcgtgaacgtggctaagcagatcttcgccaacgttgtggaactggaattctacagcaccagcaaggctaactcttttggct

ttaaccccctggtcctgggatctaaaacgaacgtgatctacgacctgttcgcaagcaagcccttcacccacatcgacctgacacaggtgaccctgca

aaacagcgataattccgccatcgatgccaacaagctgaagcaagctgtgggcgatatctacaactacaggcggttcgagagacagtttcagggcta

cttcgccggaggctacatcgacaagtacctggtgaagaacgtcaataccaacaaggatagcgatgacgatctggtctaccggagcctgaaagagc

tgaacctccacctggaggaagcctacagagaaggcgataacacctactacagagtgaatgagaactattaccctggagctagcatctacgagaac

gagagagccagcagagacagcgagttccagaacgagatcctgaagcgagccgagcagaacggcgtgacatttgacgagaacatcaaaagaat

cacagccagcggcaagtatagcgtgcagttccaaaagctagaaaatgataccgattccagcctggaaagaatgaccaaggccgtggaaggcctt

gtgaccgtgatcggcgaggaaaagttcgagacagtggatatcaccggcgtgtctagcgataccaatgaagtgaaaagcctggccaaggaactga

agaccaacgccctgggcgtcaagctgaaactctaa

Artificial

armY-CD209 (DC-SIGN) fusion protein codon-optimized (for human) nucleotide sequence

(2,706 bp)

42) SEQ ID NO: 42

atgtaccgaatgcagctgctgtcttgtattgccctgtccctggccctggttaccaattctcaagtgagcaaggtgcccagcagcatctctc

aggagcagagcagacaggacgccatctaccagaacctgactcaactgaaggcggctgtgggcgaactgagcgagaagtctaagctgcaggag

atctatcaggaactgacacaactgaaggctgccgtgggggaattacccgagaagagcaagctgcaggaaatctaccaggagctgaccagactca

aagccgccgtgggcgagctgccagagaagtctaaactgcaggaaatctaccaggaattgacatggctgaaggcagctgttggcgagctgcctga

gaaaagcaagatgcaggagatttaccaggagctcacacggctgaaggccgccgtcggcgaactccccgagaaaagcaagcagcaggagatct

accaggagcttacaagacttaaggccgctgtgggagagctgcctgagaagtccaaacaacaggaaatctaccaagaactgaccagactgaaagc

cgccgtgggagaactgccagaaaaaagcaagcagcaggagatctaccaagaactgacacagcttaaagcagctgttgagcggctgtgtcaccca

tgcccttgggagtggacattcttccagggcaactgctacttcatgagcaatagccaaaggaactggcacgacagcatcacagcctgcaaggaagt

gggggcccagctggtggtgatcaagtccgccgaagaacaaaatttcctgcagctgcagtcctccagaagcaacagattcacatggatgggcctgt

cagacctgaaccaagaaggcacctggcagtgggtcgatggcagccccctgctgccctattcaagcagtactggaaccgcggcgagcctaacaa

tgtgggcgaggaagattgcgccgagtttagcggcaacggctggaatgacgacaagtgcaacctcgccaagttctggatctgtaaaaagtccgccg

cctcctgcagccgcgacgaggagcagtttctgtcccctgcccccgccacccctaatcctcctcccgccggcggtggcggaagcggcggcggcg

gcagcggaggaggcggcagcaccaacctggtgaatcagagcggctacgccctggtggcctctggtagatctggcaacctgggattcaagctgtt

cagcacacagtctcctagtgccgaagtgaagctgaagtcactgagcctgaacgacggcagctaccagagcgaaatcgacctgtctggcggtgcta

acttcagagagaagttccggaacttcgccaacgagctgtccgaggccattaccaacagtcccaagggcctggaccggcctgtgcctaagaccga

gatcagcggcctgatcaagaccggcgacaacttcatcacccctagattaaggctggctactacgaccacgtggcctccgatggctctctgctgtcc

tattatcagagcacagagtacttcaacaatagagtgctgatgcctatcctgcaaacaaccaacggcaccctgatggccaataataggggatacgac

gacgtattcggcaggtgcctagcttctccggctggagcaacaccaaggccacaaccgtgtctacaagcaacaacctgacatacgacaagtggac

ctactttgccgccaaggggagccctctgtacgactcttatcctaatcatttcttcgaggacgtgaagaccctggccatcgatgccaaggatatcagcg

ccctgaagaccaccatcgacagcgaaaaacccacctacctgatcatccggggcctgagcggcaatggcagccagctgaacgaactgcagctgc

cagaaagcgtgaagaaggtgtctctgtacggcgactacaccggcgtgaacgtggctaagcagatcttcgccaatgttgttgagcttgagttctacag

cacgagcaaggccaactcattcggatcaaccccctggtgctgggaagtaagacaaacgtgatctatgacctgtttgccagcaaacctttcacccac

atcgacctgacccaggtgaccctgcagaacagcgacaacagcgccattgatgctaacaagctgaaacaggccgtgggagacatctacaactacc

ggagattcgagagacagttccaaggctacttcgccggcggctatatcgataagtacctggtgaaaaacgtgaacaccaacaaggatagcgatgac

gacctggtgtacagaagcctgaaggaactgaacctgcacctggaggaagcctacagagaaggcgataacacatactacagagtgaacgagaact

actaccctggagccagcatctacgagaacgagagagcctctcgggactccgagttccagaacgaaatcctgaaacgggccgagcagaacggcg

tgacatttgatgaaaacatcaagagaatcaccgctagcggcaagtacagcgtgcagtttcagaagctggagaacgacactgattctagcctggaaa

gaatgaccaaggcggtcgagggcctggtgaccgtgatcggcgaggagaagttcgaaaccgtggacatcaccggcgtgtccagcgacaccaatg

aggtgaaatctctggccaaagagctgaagaccaacgccctcggagtgaagctgaagctgtaa

Artificial

armY-C-type lectin domain family 4 member M fusion protein codon-optimized (for human)

nucleotide sequence (2,655 bp)

43) SEQ ID NO: 43

atgtaccggatgcagctgctgtcttgtatcgccctgagcctggccctggtcaccaattctcaggtgtctaaggtgccttctagcctgagcc

aggagcagtctgagcaggacgctatctaccagaacctgacacagcttaaggccgctgtgggcgaactgtcagaaaagtctaagctccaagagatc

taccaggagcttacacagctgaaagccgccgtgggcgagctgcctgagaagtccaagttgcaagagatctaccaggagctgacccggctgaaag

ccgccgtgggagagctgcccgagaagagcaaactgcaggaaatctatcaggagctgaccagactgaaggccgccgtgggagagctgcccgag

aaatccaagctacaggagatctaccaggagctgacaagactgaaggccgcagtgggcgagctgccagaaaagagcaagctgcaggagatctac

caggaactgacagagctgaaggccgccgttggagaactgcctgaaaagtccaaactgcaggaaatctatcaggagctgacacagctgaaggctg

ccgtgggcgaactccctgaccagtccaagcagcagcagatttaccaggaactgaccgacctgaaaacagccttcgagagactgtgtagacactgc

cctaaggactggacattcttccagggcaactgctacttcatgagcaacagccagcggaactggcacgacagcgtgaccgcctgtcaggaggtgc

gggcccagctggtggtcatcaagaccgccgaagagcaaaacttcctgcagctgcaaacaagcagaagcaacagattcagctggatgggcctga

gcgatctgaaccaggagggcacctggcagtgggtggatggaagccctctgtctccaagcttccaaagatactggaacagcggagagcctaacaa

ctctggaaatgaggactgcgccgagttcagcggttctggctggaatgacaacagatgcgacgtggacaactactggatctgcaagaaacccgccg

cctgcttccgagatgagggcggtggcggaagcggcggcggaggcagcggaggcggcgggagtaccaacctggtgaatcagagcggctacgc

cctggtcgcctcgggcagatccggcaatctgggcttcaagctgttcagcacacaaagcccttctgctgaagtgaaactgaagagcctgagcctgaa

tgatggctcttaccagagcgagatcgacttatccgggggagccaactttcgggaaaaattcagaaacttcgctaacgagctgagcgaggccatcac

caactcccccaagggcctggatagacctgtgcccaagacagagatcagcggcctgatcaagaccggcgataacttcatcacccctagctttaagg

ccggatactacgaccacgtggcttccgatggcagcctgctgagctactaccagagcaccgagtacttcaacaacagagtactgatgcctatcctgc

agacaacaaatggcaccctgatggccaacaataggggctacgatgacgtgttcagacaggttccttcattcagcggctggagcaatacgaaggct

acaaccgtgtcgaccagcaacaacctgacctatgacaagtggacctacttcgccgctaagggcagccctctgtacgacagctaccccaaccacttc

ttcgaggatgtgaaaaccctggccattgacgccaaggacatcagcgccctgaaaaccaccatcgacagcgagaagcctacatacctgatcatcag

aggcctgtcaggcaacggctcccagctgaacgaactgcaactgccagagagtgttaagaaggtgagcctgtacggcgactatacaggagtgaac

gtggctaagcagatcttcgctaatgtggtggaactggaattctacagcaccagcaaagccaacagcttcggctttaaccccctggtgctgggcagca

agaccaacgtgatctacgaccttttcgccagcaagcccttcacccacatcgacctgacccaggtgaccctgcagaatagcgacaattctgccattga

cgccaacaagctgaaacaggccgtgggcgatatctacaactacaggcggttcgaaagacagttccaaggctattttgccggcggctacatcgaca

agtacctggtcaagaacgtgaacaccaacaaggattccgacgacgatctagtgtaccggagcttgaaggaactcaacctgcatctggaagaggcc

tacagagaaggcgacaacacatactaccgcgtgaacgagaactactaccctggcgccagcatctacgagaacgaacgggcttctagagatagcg

agtttcagaatgaaatcctgaagagagccgaacagaacggcgtgaccttcgacgagaacattaagcggatcacagcctctggcaagtacagcgtg

cagtttcagaagctggaaaacgacaccgacagctctctcgagagaatgaccaaggccgttgagggcctggtgacagtgatcggcgaggaaaagt

tcgaaaccgtggacatcaccggcgtgtcctctgataccaacgaggtgaagagcctggcaaaggaactgaagaccaacgccctgggcgtgaagct

gaagctgtaa

Artificial

armY-CD4 fusion protein codon-optimized (for human) nucleotide sequence (2,781 bp)

44) SEQ ID NO: 44

atgtacagaatgcagctgctgagctgcatcgccctgtccctggccctggttacaaacagcaagaaggtggtgctgggaaaaaagggc

gacaccgtggaactgacctgcaccgctagccagaagaagagcatccaatttcactggaagaacagcaaccagatcaaaatcctggggaaccagg

gctctttcctgacaaagggcccctctaagctgaatgatagagccgacagccggagatcgctgtgggaccagggcaacttccccctgatcatcaaga

acctgaagatcgaggatagtgacacatacatctgcgaggtggaagatcagaaggaagaggtgcaactgctggtgttcggactgaccgccaacag

cgacactcacctgctgcagggccagtctctcacactaaccctggaaagccctcctggaagctctccaagcgtccagtgtagatctcctagaggcaa

gaacatccagggcggcaagaccctactgtgtctcagctggagctgcaggactcaggcacctggacatgtaccgtactgcaaaatcagaaaaagg

tggaattcaagatcgacatcgttgtgctggccttccagaaggccagcagcatcgtgtacaagaaggaaggagagcaggtggagttttctttccctct

cgcctttaccgtggaaaaactgaccggttcaggcgagctgtggtggcaggccgagcgcgcaagctccagcaagagctggatcacattcgacctta

agaacaaagaggtgagcgtgaagagagtgacccaggaccccaagctgcagatgggcaagaagctgcccctgcacctgaccctcccgcaagcc

ctgcctcagtacgccggatccggcaacctgacactggccctcgaagccaaaaccggaaagctgcaccaggaggtgaacctggtggtgatgaga

gccacccagctgcagaaaaatctgacctgcgaagtgtggggccctacaagccctaagctcatgctgagtcttaaactggagaacaaggaggctaa

agtgagcaagcgggaaaaggccgtgtgggtgctgaatcctgaggccggcatgtggcagtgcctgctgtctgacagcgggcaagtgctgctggaa

tctaacatcaaggtcctgcccacctggtccacccctgtgcagccaggcggcggaggatctggcggcggcggcagcggaggcggcggctccac

caacctggtgaatcagagcggctacgccctggtggctagcggtagatccggcaatctgggattcaagcttttctccacacagagccctagcgccga

agtgaagttgaaatctctgagcctgaacgacggctcctaccagtccgagatcgacctgagcggcggcgctaatttcagagagaagtacggaactt

cgccaatgagctgtctgaagctatcaccaacagccctaaaggacttgatcgcccagtgcccaagaccgagattagcggcctgatcaagacaggcg

ataactttatcacccctagtttcaaggctggctattatgaccacgtggccagcgacggaagcctgctgagctactaccagagcacagagtacttcaac

aaccgggtgctgatgcctatcctgcagaccaccaacggcacgctgatggccaacaacagaggctacgacgacgtgttccggcaggtgcctagctt

tagcggatggagcaacaccaaggctacaactgtgagcaccagcaacaacctgacctacgataagtggacctacttcgccgccaaaggcagccct

ctgtacgatagctaccctaaccacttcttcgaggacgtgaagacactggctatcgacgccaaggacattagcgccctgaaaaccacaattgactctg

aaaagcccacctacctgatcatcagaggactgagcggcaacggcagccagctgaacgagctgcagctgcctgaatctgtgaaaaaagtcagcctt

tacggcgactacaccggcgtgaacgtggccaagcagatcttcgccaatgtggtggaactggagttctacagcacctctaaagccaacagtacggc

ttcaaccccctggtgctgggctctaaaaccaatgtaatttatgacctcttcgctagcaagcctacacacacatcgatctgacccaggtgacactgcag

aactctgacaacagcgccatcgatgccaataagctgaagcaggccgtgggcgacatctacaactaccggagattcgagagacagtacagggcta

ctttgccggcggctacatcgataagtacctggttaagaacgtgaataccaacaaggactctgatgacgacctggtgtacagaagcctgaaggaact

gaacctgcatctggaagaggcctacagagaaggcgacaacacctactatcgggtgaatgagaactactatcccggcgcttctatctacgagaatga

gcgggccagcagagatagtgagttccaaaatgagatcctgaagcgggcagagcaaaacggcgtgaccttcgacgagaacatcaagagaatcac

cgcctccggcaaatacagcgtgcagttccagaaactggaaaacgacactgatagcagcctggaacggatgaccaaggccgtagagggcctggt

caccgtgatcggcgaggagaagtttgagacagtggacatcacaggcgtgagctccgataccaacgaggtgaagagcctggccaaggaactgaa

gaccaacgccctgggagtgaagctgaagctataa

Artificial

armY-Synaptic vesicle glycoprotein 2A fusion protein codon-optimized (for human)

nucleotide sequence (2,058 bp)

45) SEQ ID NO: 45

atgtacagaatgcagctgctgtcatgcatcgccctctccctcgccctggtgaccaacagccccgacatgatcagacacctgcaggccg

tcgactacgccagcagaaccaaagtgttccccggagaacgggtggaacacgtgacatttaacttcaccctggaaaaccagatccacagaggcgg

ccagtacttcaacgacaagttcatcggcctgagactgaagtccgtgtccttcgaggatagcctgtagaggaatgctactagaggacgtgacatcta

gcaatacctttttccggaactgcacattcatcaacaccgtgttctacaacaccgatctgtttgaatacaagttcgtgaacagcagactgatcaacagca

cctactgcacaacaaggagggctgtcctttagatgtgaccggaacgggcgagggcgcctacatggtgtacggcggcggaggctccggcggcg

gtggcagcggtggaggaggcagcaccaatctggtcaaccaatctggctatgccctggtcgccagtggcagaagcgggaacctgggcttcaagct

gttcagcacacagagccctagcgctgaagtgaaactgaagagcctgtctctgaacgacggctcttatcagagcgagatcgacctgtccggaggcg

ccaatttcagagagaagttcaggaacttcgccaacgagctgagcgaggccatcaccaattcccctaagggactggatagacctgtgccaaaaacc

gagattagcggcctgattaagaccggagataatttcatcacacccagctttaaggccggatattacgaccacgtggcctctgacggcagcctgctga

gctactaccagagcaccgagtacttcaacaaccgggtgctgatgcctatcctgcaaacaacaaatggcacactgatggccaacaaccggggatat

gacgacgtgttccgccaggtgcccagcttcagcggctggagcaacacaaaggctacaaccgtgtctaccagcaacaacctgacctacgataagtg

gacctacttcgccgctaaaggcagccctctgtacgacagctaccccaaccacttcttcgaggacgtcaagaccctggcgatagacgccaaagaca

tcagcgctctgaagaccaccatcgacagcgaaaagccaacatacctgatcatcagaggcctgagcggcaacggctcacagctgaacgagctgca

gctgcctgagagcgtgaaaaaggtgtcactgtacggcgattacaccggcgtgaacgtggccaagcagatcttcgcaaacgttgtggaactggaatt

ctactctacaagcaaggccaacagcttcggctttaatcctctggtgctggggtctaagacaaacgtgatctacgacctgttcgccagtaagcctacac

ccacatcgacctgacccaggttacactgcagaactccgacaacagcgccatcgacgccaacaagctgaaacaggccgtgggcgacatctacaac

tacaggagattcgaaagacagttccagggctattttgccggcggctacatcgacaagtacctggtgaagaacgtgaataccaacaaggactctgat

gacgatctcgtgtaccggagcctgaaggaactgaatctgcatctggaagaagcttaccgggaaggcgacaatacctactacagagtgaacgagaa

ctactaccctggcgctagcatctacgagaacgaacgggccagcagagattctgagttccaaaacgagatcctgaagcgggccgagcagaatggc

gtcaccttcgacgagaacatcaagagaatcaccgcctctggcaaatacagcgtgcagttccaaaaactggaaaacgatactgatagctcccttgag

agaatgaccaaggccgtggaaggactggtgaccgtgatcggcgaagagaagttcgagacagtggacatcacaggcgtgtccagcgataccaat

gaggtgaagagcctggccaaggagctgaaaaccaacgccctcggcgtgaagctgaagctgtaa

Artificial

armY-Synaptic vesicle glycoprotein 2B fusion protein codon-optimized (for human)

nucleotide sequence (2,040 bp)

46) SEQ ID NO: 46

atgtacagaatgcagttgctgtcttgtatcgccctcagcctggctctggtgacgaatagcccagacatgatccgctacttccaggacgag

gaatacaagagcaagatgaaggtgttctaggcgagcatgtgtacggcgccaccatcaacttcaccatggaaaaccagatccaccagcacggcaa

gctggttaatgacaagtttacaagaatgtactttaagcacgtgctgttcgaggataccttttttgatgagtgctacttcgaggacgtgacaagcaccgac

acatacttcaagaactgcaccatcgagagcaccatcttctacaacaccgacctgtatgagcacaagttcatcaactgcagatttatcaacagcaccttc

ctggaacagaaagagggctgccacatggacctggaacaagacaatgatggaggcggaggaagcggcggcggaggcagcggcggcggggg

aagcaccaatctggtgaatcaaagcggctacgccctggtggctagcggcagaagcggcaacctgggcttcaagctgtttagcacacagagcccta

gcgctgaagtgaagctgaagtctctctctctgaatgacggctcctaccagtctgagatcgacctcagcggaggcgccaacttcagggaaaagttcc

ggaacttcgccaacgagctgagcgaggccattacaaacagccctaagggcctggacagacctgtgcccaagaccgagatcagcggcctgatca

agactggagataattttattacccctagcttcaaggcaggctactacgaccacgtggcctccgatggctctctgctgtcctattatcagagcacagagt

actttaacaacagagtgctgatgcctatcctgcagaccacaaacggcaccctgatggccaacaatagaggctatgatgatgtgttcagacaggtgcc

ttctttcagcggatggtccaacacaaaggccacaacagtttctacaagcaacaacctgacctacgataagtggacatacttcgccgccaagggctct

ccactgtacgacagctaccctaaccacttcttcgaagatgtgaagaccctggccatcgacgccaaggacatcagcgcccttaaaacaaccattgac

agcgagaagcctacctacctgatcatcagaggactgagcggaaacggctcccagctgaacgaactgcaactgcctgagtctgtgaaaaaggtga

gcctgtacggcgattacaccggcgttaacgtggctaaacagatcttcgccaacgtggtggaactggagttctacagcaccagcaaggccaatagct

tcgggttcaaccccctggtccttggctccaaaaccaacgtcatctacgacctgttcgcttctaagcccttcacacacatcgacctgacccaggttaccc

tgcagaactcagacaacagtgctatcgacgccaacaaactgaagcaggccgtgggcgatatctataactaccggagattcgagcggcagttccaa

ggctacttcgccggcggatatatcgacaagtacctggtcaagaacgtgaacaccaacaaggacagcgatgacgacctggtgtaccggagcctga

aggaactgaacctgcacctggaagaagcctaccgggaaggcgacaacacctactaccgggtgaacgagaattactaccccggcgctagcatcta

cgagaacgagagagcctccagagattcagagttccagaacgagatcctgaaaagagccgagcagaatggcgtgaccttcgacgagaacatcaa

gcggatcacagcctctggcaaatacagcgtgcagttccagaagctggaaaatgataccgatagcagcctggaaagaatgaccaaggcggtggaa

ggcttggtcaccgtgatcggcgaggagaagttcgagacagtggacatcaccggcgtgtccagcgacaccaacgaggtgaaaagcctggccaag

gaactgaagaccaacgccctgggcgtgaagctgaagctgtaa

Artificial

armY-Synaptic vesicle glycoprotein 2C fusion protein codon-optimized (for human)

nucleotide sequence (2,028 bp)

47) SEQ ID NO: 47

atgtaccgcatgcagctgctgagctgcatcgccctgagcctggctctggtgacaaacagcaaacctctgcagagcgacgagtacgcc

ctgctgacaagaaacgtcgagcgggacaagtacgccaattttaccatcaactttaccatggaaaaccagatccacaccggaatggaatacgataat

ggcagattcattggcgttaagttcaaaagcgtgacattcaaagatagcgtgttcaagagctgtacattcgaagatgtgaccagcgtaaatacctacttc

aaaaactgcaccttcatcgacaccgtgttcgacaacaccgatttcgagccttacaagttcatcgacagcgagttcaagaactgcagctttttccacaac

aaaaccggatgtcagatcaccttcgacgacgactacagcggcggcggcggctcgggcggaggaggctctggtggcggcggcagcacaaacct

ggtcaaccagagcgggtatgccctggtggccagcggcagaagcggcaatctgggcttcaagctgttcagcacacagtccccaagcgctgaggtg

aagctcaaatctctgtcccttaacgacggcagttaccaaagcgagatcgacctgagcggcggagccaacttccgggaaaagttcagaaatttcgct

aatgaactgagcgaggccatcacgaatagccctaagggcctggatagacccgtgcccaagactgagatcagcggcctgattaagacaggagata

acttcatcacacctagcttcaaggccggctattacgaccacgtggcctcagacggctccctgctgagctactaccagagcacagagtacttcaacaa

ccgggtgctgatgcctatcctgcagaccaccaacggaacactgatggccaacaacagaggctatgacgatgtgtttagacaggtcccctcttttagc

ggatggtccaacaccaaggctacaacagtgtccaccagcaacaacctgacctacgacaagtggacatatttcgccgccaagggaagccctctgta

cgacagctacccaaaccacttcttcgaggacgtgaagaccctggccattgacgccaaagacatcagcgccctgaagaccacaatcgattctgaga

aacctacctatctgatcatcagaggactctctggcaacggcagccagctgaacgagctgcagctgcctgagagcgtgaaaaaggtgtccctgtac

ggcgattacaccggcgtgaacgtggccaagcagatcttcgccaacgtggtggaacttgagttctacagcaccagcaaggccaattctttcggcttca

accccctggtcctgggcagcaagacaaatgtgatctacgacctgttcgcctctaagcctttcacccacatcgacctgacccaggtgacactgcaaaa

ttccgataacagcgccatcgacgctaacaagctgaagcaggccgtgggcgacatctacaactaccggcggtttgagcggcagtttcagggctactt

tgctggcggatacatcgacaagtacctggtgaagaacgtgaacacaaacaaggactctgatgacgacctggtttaccggtctctgaaggaactgaa

cctccatctggaagaagcctacagagaaggcgacaacacctactacagggtgaacgagaactactaccccggcgctagcatctacgagaacgaa

agagcctctagagatagcgaatttcagaacgagatcctgaagagagctgaacagaatggcgtgacctttgatgagaacatcaagcggatcaccgc

ctccggcaagtacagcgtgcagttccaaaagctggagaatgataccgactccagcctggaaagaatgaccaaggcagtggagggcctggtgacc

gtgatcggcgaggaaaagttcgagacagtggacatcaccggcgttagcagcgacaccaacgaggtgaagtctctggccaaggaactgaagacc

aacgccctgggagtgaaactgaagctgtaa

Artificial

armY-Synaptotagmin I fusion protein codon-optimized (for human) nucleotide sequence

(1,839 bp)

48) SEQ ID NO: 48

atgtacagaatgcagctgctgagctgcatcgccctgagcctggccctggttacaaacagcatggtgtccgagagccaccacgaggcc

ttagcagctcctcctgtgaccaccgtggctacagtgctgcccagcaatgccaccgagcctgccagccctggagagggaaaagaggacgcctttag

caagctgaaggagaagttcatgaacgagctgcataagatccctctgcctggaggtggcggcagcggaggaggtggctccggcggcggcggctc

caccaacctggtgaaccagagcggctacgccctggtggccagcggaagaagcggcaacctgggcttcaagctgttttctacgcagagccccagc

gccgaagtgaagctgaagagcctgtcactgaacgacggcagctatcagtctgagatcgacctgtctggcggggccaatttcagagagaaatttag

aaacttcgctaatgagctgagcgaggccatcaccaactcgcccaagggcctggacagacctgtgcccaagaccgaaatcagcggcctgattaaa

acaggcgataacttcatcaccccttcttttaaggctggctactacgaccacgtggccagcgatggcagcctgctgtcttactaccagagcacagagt

actttaacaacagagtgctgatgcctatcctgcagaccaccaacggaacactgatggccaacaaccggggctacgacgacgtcttcagacaggtg

cctagcttctctggctggtccaacaccaaggcgacaaccgtgtccaccagcaacaatctgacatacgataagtggacctacttcgctgccaagggc

tccccactgtacgactcttatccaaaccacttcttcgaggatgtgaaaactctggctatcgacgccaaggacatcagcgctctgaagaccacaatcga

cagcgaaaagcccacctacctgatcatcagaggactgagcggaaatggctcacagctgaacgaactgcagctgcctgagtctgtgaagaaggtgt

ccctctacggcgactacaccggcgtcaacgtggccaagcaaatcttcgccaatgtggtggaactggaattctacagcaccagcaaggccaacagc

ttcggcttcaaccccctggtgctggggagcaaaacaaacgtgatctatgacctgttcgccagcaagcctttcacccacatcgatctgacccaagtga

ccctgcagaacagcgataatagcgccatcgacgccaacaagctcaagcaggccgtgggcgatatctacaactacaggcggttcgagagacagtt

tcagggctacttcgccggcggctacatcgacaaatacctggtcaagaacgtgaacaccaacaaagactctgatgacgacctggtctaccggagcc

tgaaagagcttaatctgcacctggaagaggcctaccgggaaggcgacaacacatactacagagtgaacgagaactactacccaggcgccagtatt

tacgagaacgaacgcgcctctagagatagcgagttccaaaatgagattttaaaaagagccgagcagaacggcgtgacattcgacgagaacatcaa

gcggatcaccgcctccggcaagtacagcgtgcagttccagaagctggaaaatgataccgacagcagcctggaacggatgaccaaggccgtgga

aggcctggtgaccgtgatcggcgaggaaaagttcgaaaccgtcgacatcacaggcgtgtctagcgacaccaatgaggtgaagagccttgctaag

gaactgaagacaaacgccctgggcgtgaaactgaagctgtaa

Artificial

armY-Synaptotagmin II fusion protein codon-optimized (for human) nucleotide sequence

(1,854 bp)

49) SEQ ID NO: 49

atgtaccggatgcagctgctgagctgcatcgccctgtccctggccctggtgacaaacagcatgagaaacattttcaagagaaaccagg

agcctatcgtggcccctgctacaaccacagccacaatgcctatcggccctgtggataattcgactgaaagcggcggagccggcgagtcccaaga

agatatgttcgccaagctgaaagagaaactgttcaacgagatcaacaagatccccctgcctccaggcggcggcggcagcggaggaggcggcag

cggtggcggcggcagcacaaatctggtaaaccagagcggctacgccctggttgcctccggaagaagcggaaacctgggatttaagctgttcagc

acccagtccccatctgctgaagtgaaactgaagagcctgagcctgaatgacggctcttaccagagcgagatcgacctgagtggaggcgccaattt

cagagagaaattccgcaacttcgccaatgagctgagcgaggccatcaccaacagccctaagggcctggacagacctgtgcccaagaccgaaatc

agcggactgatcaagaccggcgacaacttcatcaccccttcttttaaggctggatattacgaccacgtggcctctgacggatctctgctgagctacta

ccagtctaccgagtacttcaacaaccgggtgctgatgccaattcttcagacaaccaacggcaccctgatggccaacaatagaggctacgacgatgt

gttccggcaagtgcctagcttttctggctggagcaacaccaaggccaccaccgtgtccaccagcaacaacctcacctatgataagtggacctacttt

gctgctaaaggcagccccctgtacgactcttatcctaaccacttcttcgaagatgtgaagaccctggctatcgatgccaaggacatcagcgccctga

aaaccaccatcgacagcgagaagcccacctacctgatcatcagaggcctatctggcaacggcagccagctgaacgagctgcagctccctgagag

cgtgaagaaggtgtctctgtacggcgattacaccggcgttaatgtggctaaacagatcttcgccaacgtggtggaactggaattctacagcacatcta

aagcaaacagttttggcttcaatcctctggtgctgggcagcaagaccaacgtgatctacgacctgtttgctagcaagcccttcacacacatcgatctg

acccaggtgaccctgcaaaactccgataatagcgccattgacgccaacaaactcaagcaggccgtgggcgatatctacaactacaggcggttcga

gagacagttccagggctacttcgccggcggatatatcgacaagtacctggtcaagaacgtcaacacaaacaaggacagcgatgacgacctggtct

accggagcctgaaggaactgaacctgcatctggaggaagcctacagagaaggcgacaacacctactacagagtgaacgagaactactaccccg

gcgccagcatctacgagaatgaaagagcctcaagagattccgagttccagaacgagatcctgaagcgggccgagcagaacggcgtgacattcg

acgagaacatcaagcggatcaccgccagcggcaagtacagcgtgcagtttcagaagctggaaaacgacaccgactcaagcctggaaagaatga

caaaggccgtggaaggcctggtgactgtgatcggcgaagagaagttcgagacagtggacatcacaggcgtgtctagcgacaccaacgaggtga

aaagcctggccaaggaactgaagacaaacgccctgggcgtgaagctgaagctataa

Artificial

armY-HLA class II histocompatibility antigen, DRB1 beta chain fusion protein codon-

optimized (for human) nucleotide sequence (2,262 bp)

50) SEQ ID NO: 50

atgtaccggatgcagctgctgagctgcatcgccctgtctcttgccctggtgaccaactctggagacaccagacctagattcctgtggca

gcccaagagggaatgtcactttttcaacggtacagagcgggtgagattcctggaccggtacttctacaaccaggaggaaagcgtgcggtttgatag

cgacgtgggcgagttccgggctgtgactgaactgggccggcccgatgccgagtactggaacagccagaaggatatcctggagcaggccagag

ccgcagtggacacctactgcagacacaactacggcgttgtggaaagcttcaccgtgcaaagaagagtgcagcctaaagtgaccgtgtacccatct

aaaacacagcctctgcagcaccacaatctgctggtatgcagcgtgtccggcttctaccctggcagcatcgaggtgcggtggttcctgaacggccag

gaggaaaaagccggcatggtgtctaccggcctgatccagaatggcgactggaccttccagaccctggtgatgctggaaacagtgcctagatccgg

cgaggtgtacacctgccaggtggagcaccccagcgtcaccagcccactgaccgtggaatggcgggccagatctgagagcgctcagagcaagg

gcggcggcggaagcggcggcggaggaagcggcggcggcggcagcacaaatctggtcaaccagagcggctacgccctggtggccagtggca

gaagcgggaacctgggctttaagctgtttagcacccagagccccagcgccgaagtgaagctgaaaagcctgtccctgaacgacggcagctacca

gagcgagatcgacctgtccggcggagccaacttcagagagaagttcagaaactttgccaacgagctgagcgaggccattacaaatagccctaag

ggcctggatagaccagtgcctaagaccgagattagcggcctgatcaagaccggcgataacttcatcacaccttcctttaaggccggttactatgacc

acgtggccagcgacggctccctcctgagctactatcagtctaccgagtacttcaacaaccgggtgctgatgcctatcctgcaaacaacaaacggca

ccctgatggccaacaacagaggctacgacgatgtgttcagacaagtgccctctttcagcggatggagcaacaccaaggctacaaccgtctccacta

gcaacaacctcacctacgacaagtggacctattttgccgccaagggcagccctctgtacgacagctaccctaaccacttcttcgaggacgtgaaga

ccctggccatcgacgctaaggacatcagcgcccttaagaccacaatcgattctgagaagcctacctacctgatcatccggggcttatctggcaacg

gctctcagctgaatgagctgcagctgccggaaagcgtgaagaaggtgtccctctacggcgactacacaggcgtgaatgttgccaagcagatcttc

gccaacgtggtggaactagaattctactccaccagcaaggctaacagctttggcttcaatcctctggtgctgggcagcaaaaccaatgtgatctatga

tctgttcgcttctaagcccttcacccacatcgatctgacacaggtgaccctgcagaacagcgacaatagcgccatcgacgctaacaagctgaaaca

ggctgtgggcgacatctacaactaccggagattcgagagacaattccagggctacttcgccggaggatatatcgacaagtacctggtgaaaaacgt

gaacaccaacaaggattctgatgacgacctggtttacaggagcctgaaggaactgaaccttcatctggaagaagcctacagagagggcgacaata

catactacagagtgaacgagaattactaccccggcgccagcatctacgagaacgaaagagcctctagagacagcgagttccaaaacgaaatcctc

aagcgcgctgagcagaacggagtgacattcgacgagaacattaagcggatcaccgccagcggcaagtacagcgtccagttccagaaactggaa

aacgacaccgattctagcctggaaaggatgaccaaggccgtggaaggcctggtaacagtgatcggagaggagaaattcgagacagttgacatca

ccggggtgagcagcgatacaaatgaggtgaagtctctggccaaggaactgaaaaccaacgccctgggagtcaagctgaagctgtaa

Artificial

armY-HLA class II histocompatibility antigen, DR alpha chain fusion protein codon-

optimized (for human) nucleotide sequence (2,241 bp)

51) SEQ ID NO: 51

atgtaccggatgcagctgctgtcatgcatcgccctgagcctcgctctggttaccaatagcatcaaggaagagcacgtgatcatccaggc

cgagttctacctgaatcctgatcagagcggagagttcatgttcgacttcgacggcgatgagatctacatgtggacatggccaaaaaggaaaccgtgt

ggcggctggaagagtaggccggttcgcctccttcgaggcccagggagctaggccaatatcgccgtggacaaggccaatctggagatcatgacc

aagcggagcaactacacccctatcaccaacgtgccacctgaggtgacagtgctgaccaatagccccgtggagctgcgggaacctaacgttctgat

ctgcttcatcgacaagtttacaccccccgtggtgaatgttacatggctgagaaacgggaagcctgtgaccacaggagtgtccgagacagtgttcctg

cctagagaagaccacctgttccggaagttccactacctgcccttcctgccttccaccgaggacgtgtacgattgtagagtggaacactggggcctgg

acgagcctctcctgaagcactgggagtttgacgcaccatcccctctgcctgagacaaccgaaggcggaggcggctccggcggcggaggtagcg

gaggcggcggcagcaccaacctggtcaaccagtccggatacgccctggtggccagcggcagatctggcaatctcggcttcaagcttttcagcac

gcagtcccctagcgccgaagtgaaactgaaatctctgtctctgaacgacggcagctaccagagcgagatcgacctgagcggcggcgccaatttca

gagagaagtttcggaacttcgccaacgagctgtccgaggctattaccaacagtccaaagggactggatagacctgtgcccaagaccgagatcagc

ggcctgatcaagacaggcgacaacttcatcacccctagcttcaaggccggctactacgaccacgtggcttctgatggctctctactgagctactacc

agagcacagaatactttaacaatagagtgctgatgcctatcctgcagaccactaacggcaccctgatggccaacaacagaggctacgacgacgtgt

tcagacaagtgccttcttttagcggatggtccaacacgaaggccaccacagtgtctacatctaacaacctgacatatgacaagtggacctacttcgcc

gccaagggcagccctctgtacgacagctatcctaatcacttcttcgaggatgtgaaaacactggctatcgacgcgaaagacattagcgccctgaag

accaccatcgatagcgaaaagcccacctacctgatcatcagaggcctctctggcaacggctctcagctgaacgagctgcaacttccggagagcgt

gaagaaagtgtccctgtacggcgactacaccggcgtgaacgtcgctaaacagatctttgccaacgtcgtggaactggaattctatagcaccagcaa

ggccaacagcttcggcttcaaccccctggtgctgggaagcaagaccaacgtgatctatgacctctttgcttctaaacctttcacccacatcgacctga

cccaggtcacactgcagaacagcgacaacagcgccatcgacgccaacaagctgaagcaggctgtgggcgatatctacaactaccgtagattcga

gcgccagttccagggctatttcgccggcggctacatcgacaagtacctggtgaagaacgtgaacacaaacaaggacagcgacgatgatctggtct

acagaagcctgaaggagctgaacctgcacctggaagaagcctacagagagggcgataacacctactacagggttaatgagaattactaccccgg

cgctagcatctacgagaacgagcgcgccagcagagattctgaattccaaaacgagatcctgaaaagagccgaacagaacggcgtgacattcgat

gagaacatcaagcggatcacagccagcggcaagtacagtgtgcagtttcagaaactggaaaacgacaccgacagcagcctggagagaatgacc

aaggccgtggaaggcctggtgaccgtgatcggcgaggaaaagttcgaaaccgttgacattaccggcgtgtctagcgataccaacgaggtgaaga

gcctggccaaggagctgaagacaaacgccctgggggtgaagctgaagttataa

Artificial

armY-T cell receptor beta variable 7-9 fusion protein codon-optimized (for human) nucleotide

sequence (1,950 bp)

52) SEQ ID NO: 52

atgtaccgcatgcagctgctgagctgcatcgccctgagcctcgccctggtgaccaacagcggcgttagccagaacccccggcacaa

gattaccaagoggggccagaacgtgaccttcagatgtgaccccatcagcgaacacaaccggctgtactggtacagacagacactgggccaagga

cctgagttcctgacctacttccagaacgaagcccagctggagaaatctagactgctttccgatagattcagcgccgagaggcctaagggctcttttag

cacactggagatccagagaacagagcagggcgatagcgcaatgtacctgtgcgccagcagcctgggcggcggcggcagcggcggaggcgg

ctccggcggcggcggatctaccaacctggtgaaccagtctggctacgccctggtggcctctggtagaagcggcaacctgggctttaagctgtttag

cacacagagtccctctgccgaggtgaagctgaagagcctgtccctgaacgacggcagctatcagtccgagatcgatctgagtggcggagctaact

tccgggaaaagttcagaaacttcgccaatgagctgtctgaagccatcaccaatagccctaagggcctggacagacctgtgcctaagaccgagattt

ctggcctgatcaagacaggtgataatttcatcacccctagctttaaggctggctactacgaccacgtggccagcgatggaagcctgctgagctacta

ccagtccaccgagtacttcaacaacagagtgctcatgcctatcctgcaaaccacaaacggaacactgatggccaacaacagaggatatgatgacgt

gttcagacaggtgccatctttttccggctggagcaacaccaaggccaccaccgtgtctacaagcaacaacctgacatatgacaagtggacctacttc

gccgccaagggctccccactgtacgacagctaccctaaccacttcttcgaggacgtaaagacactggctatcgatgccaaagacatcagcgcctta

aagaccaccatcgacagcgagaagcccacctacctgatcatcagaggactgagtggcaacggcagccagctgaatgaactgcagctgcctgaat

ctgtgaagaaggtgtccctgtacggcgactacaccggagtgaacgtggccaagcagatcttcgctaatgtggtcgagctggaattctacagcacca

gcaaggccaatagcttcggcttcaaccctctggtcctcggctctaagaccaacgtcatctacgacctattcgctagcaagcctttcacccacatcgac

ctgacccaggtgaccctgcagaacagtgacaatagcgccatcgacgccaacaagctgaagcaagccgtgggggacatctacaactaccggaga

tttgagcggcagttccagggctatttcgctggcggatacatcgacaagtacctggtgaaaaacgtgaatacaaacaaggacagcgacgacgatctg

gtgtaccgctctctgaaggaactgaacctgcatctggaagaggcctacagagagggcgataatacctactaccgggtgaacgagaactactaccc

cggcgcctccatctacgagaacgaacgggccagccgggacagcgaattccaaaacgagatcctgaaaagagctgaacagaatggcgtgacctt

cgacgagaacatcaagagaatcaccgcctccggcaagtacagcgtgcagttccagaagctggaaaatgacactgattctagcttggaaagaatga

caaaagccgtggaaggcctggtcacagtgatcggcgaggaaaagttcgagacagtggacatcacaggcgtgagcagcgataccaacgaggtg

aaaagcctggctaaagagctgaagaccaacgccctgggcgttaaactgaaactgtaa

Artificial

armY-T cell receptor beta variable 19 fusion protein codon-optimized (for human) nucleotide

sequence (1,947 bp)

53) SEQ ID NO: 53

atgtatagaatgcagctgctgtcctgcatagccctgtctctggctctggtgaccaactctgggatcacccagtccccaaagtacttgtttag

aaaggagggccagaacgtcaccctgtcttgtgaacagaacctcaaccacgacgccatgtactggtaccggcaggaccctggacagggcctgag

actgatctactacagccaaatcgttaatgatttccaaaagggagatattgctgagggctacagcgtgtccagagaaaagaaagaaagcttccctctg

accgtgaccagcgcccagaagaaccctaccgccttctacctgtgcgcctccagcattggcggcggcggcagcggaggcggaggcagcggagg

cggcggctcaacaaacctggttaaccagtccggctacgccctggtcgcctccggaagaagcggcaacctcggcttcaagctgttcagcacccag

agcccttccgccgaggtgaagctgaagagcctgagcctgaacgacggcagctaccagagcgagatcgacctgtctggcggagctaatttccgcg

agaagttcagaaacttcgccaacgagctgagcgaggccatcacaaacagccctaagggcctggacagacctgtgcctaagacagagatcagcg

gcctgatcaagaccggcgataatttcatcacaccatcttttaaggccggatattacgaccacgtggccagcgatggcagcctgctgagctactacca

gtctaccgagtactttaacaacagggtccttatgccaatcctgcaaacaacaaacggcacactgatggccaacaatcggggctatgatgatgtgttca

gacaggtgccctctttcagcggatggtccaacaccaaggccaccacagtgtctaccagcaacaacctgacctacgataagtggacttacttcgccg

ccaagggctcacccctgtacgacagctaccctaaccatttcttcgaagatgtgaagacgctggccatcgacgcaaaggacatcagcgccctgaag

accaccatcgacagcgaaaaacccacctacctgatcatccggggcctaagcgggaatggtagccagctgaacgagctgcagctgcctgagagc

gtgaaaaaggtgagcctgtacggcgactacacaggcgtgaacgtggccaaacagatcttcgctaatgtggtggaactggaattctattctacatcca

aggccaacagcttcggcttcaaccccctggtgctgggctctaaaacaaacgtgatctacgacctgttcgctagcaagcctttcacccacatcgacct

gacccaagtgaccctgcagaatagcgataacagcgctatcgacgccaacaagctgaagcaggccgtgggagacatctacaattacagaagatttg

aaagacagttccagggctacttcgccggcggctacatcgacaaatacctggtgaagaacgtgaataccaacaaggattctgacgacgacctggtct

accggtctctgaaagagctgaacctgcacctggaagaggcctaccgggagggagataacacctattaccgggtgaacgagaattactaccccgg

cgcctccatctatgagaacgagagagccagcagagacagcgagttccagaacgagatcctgaaaagagccgagcagaacggcgtgaccttcga

cgagaacatcaagcggatcaccgccagtggcaagtacagcgtgcagtttcaaaagctagaaaacgacacagatagcagcctggaaagaatgac

caaggctgtggaaggcctggtgaccgtgatcggcgaggaaaagtttgagacagtggacatcaccggcgtgagctctgacaccaatgaggtcaaa

agcctggctaaggaactgaagaccaacgccctgggcgtgaagctgaaactctaa

Artificial

armY-Hepatitis A virus cellular receptor 1 fusion protein codon-optimized (for human)

nucleotide sequence (2,700 bp)

54) SEQ ID NO: 54

atgtaccgcatgcagcttctgtcttgtatcgccctgagcctggcgctggtcaccaacagcagcgtgaaagttggcggagaggccggtc

ctagcgtcaccctgccttgccactactctggcgctgtgaccagcatgtgctggaaccggggcagctgtagcctgttcacctgccagaatggcatcgt

gtggacaaacggtacacacgtgacatacagaaaggacacaagatacaagctgctgggcgacctgtcaagacgggatgtgtctctgaccatcgag

aacaccgctgtaccgacagcggcgtgtactgctgcagagtggagcacagaggctggttcaatgacatgaagatcaccgtgagcctggagatcgt

gcctccaaaggtgaccaccacgcctatcgtgacaaccgtacctacagtgaccaccgtgcggaccagcacaaccgtgcctaccaccaccaccgtg

cccatgaccacggtgcccaccacaaccgtgccaaccaccatgagcatccccaccacgacaacagtgctgacaaccatgaccgtactacaacaac

atcagtgcctaccacaacaagcattcccacaaccacaagcgtgcctgtcacaacaaccgtgtccacattcgtgcctcctatgcccctgcctagacag

aatcacgagcctgtggctacctctcctagctcccctcagcctgccgagacacaccctactaccctgcagggcgccatccggagagaacccaccag

cagccctctgtatagttacaccaccgacggcaatgataccgtgaccgaaagcagcgatggactgtggaacaacaaccaaacacagctgttcctgg

aacattccctgctgacagccaatacaaccaagggcatctacgccggagtgtgcatctccgtgctggtcctgctggcactgctgggagttatcatcgc

caagaagtactttttcaagaaggaagtgcagcagctgagcgtgagcttctccagcctgcagatcaaagattgcagaacgccgtggaaaaggaagt

gcaagccgaagataacatctacatcgagaactccctgtacgccaccgatggcggcggaggctccggcggcggaggaagcggcggcggcggc

tccacaaatctggtgaaccagagcgggtacgccctggtggccagcggcagaagcggaaatctgggcttcaagctgtttagcacccagagcccttc

tgccgaggtgaaactgaaaagcctgtccctcaacgacggcagctaccagagcgagattgacctgagcggcggagccaatttcagagagaagttc

cgcaacttcgctaacgagctgtctgaagcaatcacaaactcccctaagggactggatagacccgtgcctaaaaccgagatcagcggcctgatcaa

gactggagacaatttcatcacccctagctttaaggccggctactatgaccacgttgcctccgacggcagcctgctgagctactaccagtctacagag

tactttaacaacagagtgctgatgcctattctgcagacaactaacggcacactgatggccaacaatcggggctacgatgacgtgttcagacaagtgc

ccagctttagcggctggagcaacaccaaggctactaccgtgtctaccagcaacaacctgacctacgacaagtggacctacttcgccgctaagggct

ccccactgtatgacagttaccccaaccacttcttcgaggacgtaaagaccctggccattgacgccaaggatatcagcgccctgaaaaccaccatcg

acagtgagaagcccacctacctgatcatccggggcctgagcggcaacggctctcagcttaacgagctgcagctgcctgagagcgtgaaaaaggt

gagtctatacggcgactacaccggcgtgaacgtggccaaacagatcttcgccaacgtggtggagctggaattctacagcaccagcaaggccaact

ctttcggcttcaaccccctcgtgctgggctccaagacaaacgtgatctacgacctgtttgcttctaaacctttcacccacatcgacctcacccaggtga

ccctgcaaaatagcgataacagcgccatcgacgccaacaagctgaagcaggctgttggagatatctataactaccggagattcgaaagacagttcc

aaggctatttcgccggcggctacatcgacaaatacctggtgaaaaacgtgaataccaacaaggacagcgacgatgacctggtgtacagatctctga

aggagctgaacctgcacctggaagaggcctacagagaaggcgacaacacatactacagagtgaacgagaactactacccaggagcttctatcta

cgagaatgaaagagccagcagagactctgagttccagaacgagatcctgaagcgggccgagcagaacggcgtgaccttcgacgagaatatcaa

gagaatcaccgcctccggcaagtacagcgtgcagtttcagaagctggaaaacgatacagactccagcctggaacggatgacaaaggccgtgga

gggcctggtgaccgtgatcggcgaggaaaaattcgaaaccgtggacatcaccggcgtctccagcgataccaacgaggtgaagagcctggccaa

ggaactgaagaccaacgccctgggagtgaagctgaagctataa

Artificial

armY-Myelin and lymphocyte protein fusion protein codon-optimized (for human) nucleotide

sequence (2,127 bp)

55) SEQ ID NO: 55

atgtacagaatgcagctgctgagctgcatcgccctgtccctggccctggtgaccaatagcatggcccctgccgccgctaccggcggta

gcacactgcctagcggcttcagcgtgtttacaacactgcctgacctgctctttatcttcgagttcatcttcggcggcctggtgtggatcctggtggcctc

tagcctggtcccttggcccctggtgcagggctgggtcatgttcgtgtccgtgttctgcttcgtggcaacaaccacactgatcatcctgtacattatcgg

cgcccacggtggcgagacaagctgggtgacactggacgccgcttatcattgtaccgccgctctgttttacctgtcagcaagcgtgctggaagccctt

gccaccatcaccatgcaggatggctttacctacaggcactaccacgagaacatcgccgccgtggtgttctcctacatcgccacactgctgtatgtcgt

gcacgccgtgttcagcctgattagatggaagtccagcggcggcggcggatctggcggaggcggaagcggcggcggaggctctaccaacctgg

tgaaccagagcggatacgccctggtggcctctggcagaagcggaaacctgggcttcaaactgttcagcacccagtccccaagcgccgaggtgaa

actgaagagcctgagcctgaatgacggcagctaccagagcgagattgacctctctggtggagccaatttcagagagaagttccggaacttcgcca

acgaactgtctgaagccatcaccaacagcccaaaaggcctcgatagaccagtgcccaagaccgaaatcagcggactgatcaagaccggcgata

atttcattacccctagctttaaggctggctattacgaccacgtggcttctgacggcagcctgctgagctactaccagagcaccgagtactttaacaata

gagtgctgatgcctatcctgcagaccaccaacggcaccctgatggccaacaacagaggttacgacgacgtgttcagacaggtgcctagcttcagc

ggctggtccaacaccaaggcgactaccgtctccacaagcaacaacctgacctacgataagtggacctacttcgccgcaaagggctctcctctgtac

gacagctaccccaaccacttcttcgaagatgtgaagaccctggctatcgatgctaaagatatcagtgccctgaagacaacaatcgacagcgagaaa

cctacctacctgatcatcagaggcctgagcggaaatggctcgcagctgaacgagctgcagctgcctgagtccgtgaaaaaggtgtccctctacgg

cgactataccggcgtgaacgttgccaagcagatctttgctaatgtggttgagctggagttctacagcacctctaaggccaattcttttggcttcaacccc

ctggtgctgggcagcaagaccaacgtgatctacgacctgttcgccagcaagcccttcacccacatcgatctcacccaagtgacactgcaaaactcc

gacaacagcgccatcgacgccaacaagctgaagcaggccgtgggcgatatctacaactacagacggttcgagagacagttccagggatatttcg

ccggcggctacatcgacaagtacctggtcaagaacgtgaacacgaacaaggatagcgatgacgacctggtgtaccggagcctgaaggaactga

acctgcacctggaagaggcttaccgggaaggcgacaacacctactaccgcgtgaatgaaaactactaccctggcgccagcatctacgagaacga

gcgggcctcccgggacagcgaattccagaatgaaatcctgaaaagagccgagcagaacggggtgaccttcgacgagaacatcaagcggatcac

cgccagcggcaagtactccgtgcagttccaaaagctggaaaacgataccgacagcagcctggaaagaatgactaaggccgtcgagggcctggtt

acagtgatcggcgaggaaaaatttgagacagtggacatcacaggcgtcagcagcgacacaaacgaggtgaagtctctggccaaggagctgaag

accaacgcccttggagttaagctgaagttataa

Artificial

armY-Complement factor H fusion protein codon-optimized (for human) nucleotide sequence

(5,307 bp)

56) SEQ ID NO: 56

atgtacagaatgcagctgctgtcctgcatcgccctgtctctggccctggttaccaattcagaagattgcaacgagctgcctcctcggcgg

aacaccgaaatcctgaccggatcctggagcgaccagacataccccgagggcacccaggccatttacaagtgtcggcctggctacaggtcactgg

ggaacgttatcatggtgtgccggaaaggcgagtgggtggccctgaaccctctgcggaagtgccagaaacggccatgtggccaccctggcgaca

cccctttcggaaccttcaccctcacaggtggcaacgtctttgagtacggcgtgaaagccgtttacacatgcaatgagggataccagctgctcggaga

gatcaactacagagagtgtgataccgacggatggaccaacgacatccccatctgtgaagtggtgaagtgcctccctgtcacagcccctgaaaacg

gcaagatcgtgtcttctgctatggagcctgatagagaatatcactttggccaggccgtgagattcgtgtgcaactctgggtacaaaatcgagggagat

gaggaaatgcactgctctgatgacggcttctggagcaaggaaaagcctaagtgcgtggagatcagctgcaagagtcctgacgtgatcaacggctc

ccctatctcacagaagatcatttacaaggagaacgaaagattccagtacaaatgtaacatgggatacgagtactctgaaagaggtgatgccgtttgta

ctgaatccggctggcggcctctgcctagctgcgaggagaagagctgtgacaatccttacatccccaatggagattacagccctctcagaatcaagc

accgcaccggcgacgagatcacctaccagtgtcgcaacggattttaccccgctacccggggcaacaccgccaagtgtacctccacaggctggat

ccctgcccccagatgcaccctgaaaccctgcgactaccctgatatcaagcacggcggcctgtatcacgagaacatgagaagaccttacttccctgt

ggccgtgggcaagtactactcttattactgcgatgaacactttgaaacccctagcggcagctactgggatcacatccactgtacccaggatggctgg

tctccagctgtgccatgtctgcgcaagtgctacttcccctacctggaaaacggctacaaccagaactacggtagaaagttcgtgcagggcaagtcta

tcgacgtggcatgccaccccggctacgccctacctaaggctcagaccacagtgacctgtatggaaaacggttggtctcccaccccacgctgcatcc

gggtgaagacctgctccaagtcttctatcgatattgaaaacggcttcatctctgaatcccaatacacctatgctctgaaggaaaaggccaagtaccag

tgtaagctgggatacgtgaccgccgacggcgagacatctggctccatcacctgtggcaaggacggctggagcgcacagcccacatgcattaagt

cttgcgacatcccggtgttcatgaacgccagaaccaagaacgatttcacctggttcaagctgaacgacacactggattacgagtgtcacgacggata

tgaaagcaataccggcagcaccaccggcagcatagtgtgcggctacaacggctggagcgatctgcccatctgctacgaaagagaatgcgagctg

cctaagatcgatgtgcacctggtgcccgatcggaagaaggaccagtacaaggtgggcgaagtgctgaagtttagctgcaagcccggattcacaat

cgtgggaccaaattctgtgcagtgctaccacttcggcctgagccccgacctgcccatctgcaaggaacaagtgcagagctgtggacctcctcctga

gctgctgaacggaaacgtgaaagagaagacaaaggaggagtacggccattctgaggtggtcgagtactactgtaaccctagattcctgatgaagg

gccctaacaagatccaatgcgtggacggagagtggaccaccctgcccgtttgcatagtggaggaaagcacctgtggcgacatcccggaactgga

acacggctgggcccagctgagcagccctccctactactacggcgattctgtcgaatttaactgtagcgagtcattcaccatgatcggccatagaagc

attacttgcatccacggagtgtggactcagttacctcagtgcgttgccatcgacaagctgaagaagtgtaaatctagcaacctgatcattctggaaga

acacctgaagaacaagaaagaattcgaccacaattcaaacatcagatacagatgccggggcaaagagggctggatccacaccgtgtgcatcaac

ggcagatgggaccccgaggtgaactgcagcatggcccagatccagctgtgtcctcctcccccccagatcccaaacagccacaacatgaccacca

cgctgaactaccgagacggcgagaaggtgagcgtgctgtgccaggagaactacctgatccaggagggcgaagagatcacatgtaaggacggtc

gttggcagagcatccccctgtgcgttgaaaagatcccctgcagccagcctcctcaaatcgagcacggcaccatcaacagctccagatcctcccag

gagtcctacgcccacggcacaaaactgagctacacatgcgaaggcggattccggatttctgaagagaacgagaccacctgctacatgggcaagt

ggagctctccccctcaatgtgagggcctgccttgcaagagccctcctgagatcagccacggcgtggttgcccacatgtctgatagctaccaatacg

gcgaggaagtgacttataagtgcttcgaggggtttgggatcgatggtcccgccattgccaagtgcctgggagaaaaatggtctcatccaccatcatg

tatcaagaccgactgcctgagtttgcctagctttgagaatgctatccctatgggcgagaagaaggacgtatacaaagccggcgagcaggtgacata

cacatgtgccacctactacaaaatggacggcgccagcaatgtaacgtgtataaatagcagatggacaggcagacctacctgcagagatacaagct

gcgtgaatcctcccacagtccaaaatgcttatatcgtgagtcggcagatgagcaagtaccctagcggcgagagagtgagataccagtgcaggtcc

ccctacgagatgttcggcgacgaggaggtgatgtgcctaaacggcaactggacggaacctcctcagtgcaaagacagcaccggaaaatgcggc

cctcctcctcctattgacaacggcgatatcaccagattccactgagcgtgtacgctcctgcttcatctgtcgagtaccaatgccagaatctgtaccag

ctggaaggtaataagagaatcacctgcagaaacggacagtggagcgaacctcctaagtgcctgcacccttgcgtgatctccagagagatcatgga

aaactacaacatcgccctgagatggaccgccaaacagaagctgtacagccggaccggcgagagcgtcgagttcgtgtgtaagagaggttaccga

ctgtcctctagaagccataccctgcggaccacctgctgggacggcaaactagagtaccctacgtgcgccaagcggggcggaggtggctcagga

ggcggcggctctggcggcggcggctctacaaacctggtgaaccagagcggttatgccctggtggccagcggcaggtctggaaatctgggcttta

agctgttttcaacgcagagcccttccgccgaagttaagctgaaatcactgagcctgaatgacggctcctaccagagcgagatcgacctgtctggag

gagctaactttagagagaagttcaggaacttcgctaacgagctgagcgaagccatcaccaatagccctaaaggcttggacagacctgtgcccaag

actgagatcagcggcttgatcaagaccggcgacaacttcatcaccccatcttttaaggccggctactacgaccacgtggcctctgacggaagcctg

ctatcctactatcagtctactgagtacttcaacaacagagtgctgatgcctatcttgcagaccaccaatggcaccctgatggccaacaaccggggata

tgacgatgtgttcagacaggtgcctagcttcagcggatggagcaacaccaaggcgacaaccgtgagcacatccaacaacctgacatacgacaagt

ggacatattttgcggccaagggctctccactgtatgatagctaccccaatcacttcttcgaggacgtgaagaccctggccatcgacgccaaagacat

cagcgcccttaagacaacgatcgattccgagaagcctacctacctgatcattagaggcctgagcggcaacggcagccagctgaacgagctgcag

ctgccagagtccgtgaagaaagtgtccctgtatggcgactacacaggcgtcaacgtggccaagcaaatcttcgctaatgtggtggaacttgagttct

acagcacatcgaaggctaactctacggcttcaaccccctggtgctgggcagcaagaccaatgtgatttacgacctgttcgccagcaagcccttcac

acacatcgacctgacccaagtgacactgcaaaacagcgataacagcgccatcgacgccaacaagctgaagcaggctgtgggcgacatctacaa

ctaccggagattcgagagacagttccagggctacttcgccggcggctacatcgataagtacctggtgaagaacgtgaataccaacaaagactctga

tgacgacctggtgtacagaagcctgaaagagctgaacctgcatctggaagaagcctaccgggagggcgataacacctactaccgggtgaacgaa

aactactatcctggcgctagcatctacgagaacgaacgagccagcagggattctgaattccagaacgagatcctgaagcgggccgagcagaacg

gagtgacatttgatgagaacatcaaacggatcaccgccagcggcaaatactccgttcagttccaaaaactggaaaatgatacagacagcagcctgg

agagaatgaccaaggccgtggaaggcctggtgacggtgatcggcgaagagaaattcgagaccgtggacatcaccggcgtaagctctgacacca

acgaagtgaagagcctggctaaggaactgaagaccaacgccctgggggtcaagctgaagctgtaa

Artificial

armY-Hepatocyte growth factor receptor fusion protein codon-optimized (for human)

nucleotide sequence (4,392 bp)

57) SEQ ID NO: 57

atgtaccggatgcaactgctgagctgcatagccttatctctggcactggtgaccaacagcgagtgcaaggaagccctcgccaagagtg

aaatgaacgtgaatatgaaataccagctgcctaacttcaccgccgaaacccctatccagaacgtcatcctgcatgagcaccacatcttcctgggcgc

tacaaattacatctacgtgctgaatgaggaggacttgcagaaagtcgccgaatacaagaccggacccgtgctggagcacccggactgcttcccatg

tcaggattgcagttctaaggccaacctgagtggtggcgtttggaaggacaacatcaacatggccctggtggtcgacacatattacgacgatcagctg

attagctgtggcagcgtgaaccggggcacctgccagagacacgtgttccctcacaaccacactgccgacatccagagcgaagtgcactgcatctt

cagcccccagatcgaggagcctagccagtgtcctgactgcgtggtgtcagccctgggtgctaaggtactgtccagcgttaaggacagattcatcaa

ctttttcgtgggtaacacaatcaacagcagctacttccccgatcaccctctgcacagcatatccgtgcggagactcaaggaaacaaaggacggcttc

atgttcctgacagaccagagctatatcgatgtgctgcctgagttcagagattcttaccccatcaagtacgtgcacgccttcgagagcaacaattttatct

atttcctgacagtccaaagggagacactcgatgcccagaccttccacaccagaatcatccggttctgcagcattaacagtggactgcactcttatatg

gaaatgcccctggaatgtatcctcacagagaaaaggaagaaaagaagcactaagaaggaggtgttcaacattctgcaggctgcttacgtgtccaag

cctggcgctcagctggccagacagatcggcgccagcctgaacgatgacatcctgttcggcgtcttcgcccaatctaagcctgacagcgccgagcc

catggacagatctgctatgtgcgctttccccatcaagtacgtgaatgacttcttcaacaagatcgtgaacaagaacaacgtgcggtgcctgcaacact

tctacggccctaaccacgagcactgttttaatagaaccctactgcggaactcctctggttgtgaagctagaagagacgaataccggaccgagttcac

caccgccctgcagagggtggacctgttcatgggccaattcagcgaggtcctgctgacatctataagcaccttcatcaagggagatctgacaatcgc

caacctgggcaccagtgagggcagattcatgcaggtggtggtgagtagatccggccctagtacaccccatgttaacttcctgctggactcacaccc

cgtgtcccctgaggtgatcgtggaacatacactgaaccagaatggctatacactggtgatcaccggaaagaagattaccaagattcctctgaacgg

cctgggctgcagacacttccagagctgtagccagtgcctgagcgcccctccttttgtgcagtgcggctggtgccacgacaagtgcgtgcgcagcg

aggagtgcctgagcggcacctggacacagcagatctgtctgcctgccatctacaaggtctttccaaacagcgccccattggaaggcggaactcgg

ctgacaatctgcggctgggacttcggctttcggcggaacaacaagtttgacctgaagaagacccgggtgctgctgggcaacgagagctgtaccct

gaccctgagcgaaagcaccatgaacacgctgaaatgcaccgtgggcccagccatgaacaaacacttcaacatgtctatcatcatcagcaatggcc

acggcacaacccagtacagcacgttcagctacgtggaccctgtgatcaccagcatctcaccgaagtacggccctatggccggcggcacattgctg

accctgaccggaaattatctgaactcgggcaacagccgtcacatctccataggcggaaagacatgcacgctgaagtcggtgtctaacagcatcctg

gagtgctacacaccagcccagaccatctcgacagaattcgctgtaaagctgaagatcgatctcgctaatcgagagacaagcatcttttcttacagag

aggatcctatcgtgtacgagatccaccctacaaagtctttcatcagcggcggcagcaccatcacaggcgtgggaaaaaacctgaactctgtgtctgt

gccgagaatggtgatcaacgtgcacgaggctggcagaaacttcacagtggcctgccagcatagaagcaacagcgaaatcatctgctgcaccacc

ccctcgctgcagcagcttaatctgcagctgcccctgaaaacgaaggccttcttcatgctggatgggatcctgtctaagtacttcgatctcatctacgtg

cacaatcctgtgtttaagccattcgagaagcccgtcatgatctctatgggcaacgagaacgtgctcgagatcaagggcaatgatatcgaccctgagg

ccgtgaaaggcgaggtgctgaaagtgggcaacaaaagctgcgaaaacatccacctgcacagcgaagccgtgctgtgcaccgtgcctaacgactt

gctgaagctgaactccgagctgaatatcgagtggaagcaggccatcagctctaccgtcctgggcaaggtgattgtgcaacctgaccagaacttcac

cggcggtggcggtagtggaggcggcgggagcggaggcggaggaagcaccaacctggtgaaccagagcgggtacgccctggtagctagcgg

cagaagcggcaacctgggctttaagctgttttctacccagagccctagcgccgaagtgaagctgaagagcctgagcctgaacgacggcagttacc

aatccgagatcgacctgtctggcggcgccaacttcagagagaagttcagaaacttcgctaatgagctgtctgaggccatcaccaacagccctaagg

gcctggatagacctgtgccaaagaccgagatctccggcctgatcaaaaccggcgataactttatcacacctagctttaaggccggctactacgacc

acgtggcctccgacggctccctgctgtcctactaccagagcacagaatacttcaacaacagagtgctgatgcctatcctgcaaaccacaaacggca

ccctgatggccaacaacagaggctacgacgatgtgttccggcaggtgcctagcttctccggctggagcaacaccaaggccactaccgtttctacca

gtaacaacctgacctacgataagtggacctactttgccgccaagggcagccccctgtacgactcataccccaatcacttctttgaagatgtgaagac

cctggccatcgatgccaaagatatcagcgctctgaaaacaaccatcgactccgagaagcccacctaccttattatcagaggcctgtccggcaacgg

ctctcagctgaatgagctgcagctcccagaaagcgtgaagaaggtgtcgctgtacggcgactacaccggcgtcaatgtggccaaacagatatttgc

caacgtagtagaattggaattctactctacaagcaaagccaactcttttggatttaaccccttagtgctaggatctaagacaaacgtgatctacgacctg

ttcgccagcaaacctttcacccacatcgacctgacccaagtgaccctgcagaacagcgacaacagcgctatcgacgccaacaagctgaagcagg

ccgtcggcgatatatacaattaccggcggttcgagagacagttccagggctacttcgccggaggatacatcgacaagtacctggtgaagaacgtga

acactaataaggacagcgacgacgacctcgtgtacagaagcctgaaagaactgaatctgcacctggaagaagcctaccgggaaggagacaaca

cctactacagagtgaacgaaaactactaccctggcgccagcatctatgagaacgagagagccagcagagattctgaattccagaacgagattctga

aacgggccgagcagaatggcgtgaccttcgacgagaatattaagcgcatcaccgccagcggcaaatattccgtccagtttcagaagctcgagaac

gacaccgacagcagcctggaaagaatgaccaaggccgtggaaggcctggtgaccgtgatcggcgaggaaaaattcgagaccgtggatatcacc

ggcgtgagcagcgacacaaacgaagtgaagagcctggccaaggaactgaagaccaacgccctgggagtgaagctcaagctgtaa

Artificial

armY-Membrane cofactor protein (CD46) fusion protein codon-optimized (for human)

nucleotide sequence (2,595 bp)

58) SEQ ID NO: 58

atgtaccgcatgcagctgctgagctgcatcgccctgtctctggctctggtgaccaacagctgcgaggaacctccaaccttcgaggccat

ggaactgatcggcaagccaaagccctactatgagattggcgaaagagtggattacaaatgcaagaaaggctacttttacatcccccccctggccac

ccacaccatctgtgatagaaaccacacatggctgcctgtctccgacgacgcctgttaccgggagacatgcccttacatccgagaccctctcaatgga

caggccgtgcctgctaatggcacatatgagttcggataccaaatgcacttcatctgcaacgagggctactacctgatcggcgaagaaatcctgtact

gcgagctgaaaggctcggtggctatttggtccggcaaacctcctatctgtgaaaaggtgctgtgcacccctcctcctaagatcaaaaacggcaagc

acacctttagcgaggtggaagtgttcgagtacctggatgccgtgacatatagctgtgaccccgcccctggccctgatcccttcagcctgattggcga

gagcaccatctattgcggcgataactctgtgtggagccgggccgcccctgaatgcaaggtggtgaagtgcagattccctgtggtggaaaacggaa

agcagatctccggctttggcaaaaagttctactataaggctaccgtgatgttcgagtgcgacaagggattctacctggacggctctgatacaatcgtg

tgcgacagcaactctacgtgggaccctccagtgcctaagtgtctgaaagttctgcctcctagctctacaaagccccccgccctgagccacagcgtgt

ccaccagcagcacaaccaagtccccagccagcagcgccagcggacctagacccacctacaagcctcctgtgtccaactaccctggctaccccaa

gcctgaggaaggcatcctggatagcctggatggcggcggcggctccggcggtggaggatctggcggcggaggaagcacattatctggtgaatc

agagcggctacgccctggttgccagcggcagaagcggcaacctgggcttcaagctgtttagcacacagagccccagcgccgaggtgaagctga

agagcttgtcgctaaatgatggctcctaccagtctgagatcgatctgagcgggggcgccaattttagagagaagttccggaacttcgcaaacgagct

gtctgaagccatcaccaacagccctaaggggctggacagacctgtgccaaagaccgagattagcggcctcatcaagacaggcgacaatttcatca

cacctagcttcaaggccggatactatgaccacgtggcctccgacggcagcctgctgagctactaccagagcacagagtacttcaacaacagagtg

ctgatgcctatcctgcagaccaccaacggcaccctcatggccaacaatcggggctatgacgacgtgttcaggcaggtgcctagcttcagcggctg

gagcaacaccaaggccaccactgtgtctacctccaacaacctgacctacgacaagtggacctacttcgcagctaaaggctctccactgtacgatag

ctacccaaaccacttcttcgaggacgtgaagaccctggctattgacgccaaggacatctctgccctgaagaccacaatcgacagcgagaagccta

cctacctgatcatccggggcctgagcggaaacggcagccagctgaacgagctgcagctgcccgagtccgtgaaaaaagtgtccctgtacggcga

ctacaccggcgtgaacgtggccaagcagatcttcgctaatgtggtggaacttgagttctactctaccagtaaggccaactcctttggatttaaccccct

ggtgctgggcagcaagaccaacgtgatctacgacctgttcgcctctaaacctttcacccatatcgacctgacccaggttacactgcaaaacagcgat

aactctgccatcgatgccaacaagctgaagcaagccgtgggcgacatctacaactaccgcagatttgaacggcagttccagggctacttcgccgg

cggctacatcgacaagtacttggtcaagaacgtgaataccaacaaggatagcgacgatgacctggtctaccggagcctgaaggaactgaacctgc

acctggaagaagcctacagagaaggtgacaatacctactatagagtgaacgagaactactacccgggagccagtatctacgagaacgaaagagc

ctctagagatagcgagttccaaaacgagatcctgaaaagagctgaacagaacggcgtgaccttcgacgagaacatcaagagaatcaccgccagc

ggcaagtacagcgtgcagtttcagaagctggaaaacgacaccgacagctccctggaacggatgaccaaggctgttgagggcctggtcacagtga

tcggagaggaaaagttcgaaacagtggatatcacgggcgttagcagcgacaccaacgaggtcaagagcctggccaaagagctgaagacaaac

gccctgggcgtgaagctgaagctgtaa

Artificial

armY-Glycophorin-A fusion protein codon-optimized (for human) nucleotide sequence

(1,884 bp)

59) SEQ ID NO: 59

atgtaccgtatgcagctgctgtcttgcatcgccctcagcctggctctggtgaccaacagctctagcacaacaggcgttgccatgcacac

cagcaccagctctagcgtgaccaagagttacatctcttctcagaccaacgatacccacaagagagacacgtacgccgccaccccaagagcccatg

aggtgtctgaaatcagcgtgcggaccgtgtacccccccgaggaagaaaccggcgagcgggtgcagctggcccaccacttttctgagcctgaggg

aggtggaggcagcggcggcggcggcagcggcggaggcggcagcaccaacctggttaaccagtccggctatgccctggtggctagcggcaga

tccggcaacctgggctttaagctgttcagcacccagagccccagcgccgaggtgaaactgaagagtctgagcctgaatgacggctcttatcagag

cgagatcgacctgagcggcggcgccaatttcagagagaagtttcggaacttcgccaatgaactgtccgaagccatcaccaacagcccaaagggc

ctggacagacccgtgcctaaaacagaaatcagcggactgatcaagaccggcgataatttcatcacacctagcttcaaggccggctactacgacca

cgtggccagcgacggctccctcctgagctactaccaaagcacagagtacttcaacaaccgggtgctgatgcctatcctgcagaccacaaatggca

ccctcatggccaataacagaggctatgatgacgtgttccggcaggtgcccagctttagcggatggagcaacaccaaggccacaaccgtgtccaca

tccaacaacctgacctacgacaagtggacctacttcgctgctaagggcagccctctgtacgactcttaccctaaccacttcttcgaggatgtgaagac

gctggctatcgacgccaaggacatctcggccctgaagaccacaatcgacagcgagaagcctacatacctgatcatcagaggactgagcggcaac

ggcagccaactgaatgagctgcagctgcctgagagcgtgaaaaaggtgagcctgtacggcgactataccggcgtgaatgtggctaagcagatctt

cgccaacgtcgtggaactggaattctacagcaccagcaaggctaactccttcggctttaaccccctggtgctgggctccaaaacaaacgtgatctac

gacctgttcgcctccaaacctttcacccacatcgacctgacacaagtgacactgcaaaatagcgataacagcgccatcgacgccaacaagcttaag

caggccgtgggcgacatctacaactacagaagattcgagagacagtttcagggctatttcgccggaggctatattgataaatacctggtgaagaac

gtgaacaccaacaaagatagcgacgacgatctggtgtacagatctctgaaagagctgaacctgcacctggaagaggcctaccgggaaggagata

acacctactacagggtcaacgagaactactaccctggagccagcatctacgagaacgagagagcttctagagatagcgagttccagaatgaaatc

ctgaagcgggccgaacagaacggagtgacattcgacgagaacattaagcggatcaccgcctctgggaagtacagcgtgcagttccagaagctg

gagaacgacaccgattcttctctggaaagaatgaccaaggcagtcgagggcctggtgaccgtgatcggagaggaaaagttcgagacagtcgaca

tcactggcgtgagctcggacaccaacgaggtaaagagcctggccaaggaactgaagaccaacgccctgggcgtgaagctcaaactgtaa

Artificial

armY-C-type lectin domain family 4 member K (Langerin, CD207) fusion protein codon-

optimized (for human) nucleotide sequence (2,460 bp)

60) SEQ ID NO: 60

atgtatcggatgcagctgctgagctgcatcgccttatccctggctctggtgacaaactcccctagattcatgggcaccatcagcgacgtg

aaaacgaacgtgcagctgctgaagggaagagtggacaacatctctaccctggattctgagatcaaaaagaactccgatggcatggaagctgctgg

cgtgcaaatccagatggtgaatgagagcctgggctacgtgcggtcccagttcctgaagctgaagaccagcgtggaaaaggccaacgcccagatt

cagatcctgacaagaagctgggaggaagtgtctacactgaatgctcagatccccgagctgaaaagcgatctcgagaaggctagcgccctgaaca

ccaagatccgggccttgcaaggctctctggaaaacatgagcaagctgctgaagagacagaacgatatcctgcaggtcgtgtctcagggctggaag

tacttcaagggcaacttctactacctctgatccctaagacctggtactctgccgagcagttctgcgtgtccagaaacagccacctgaccagcgtta

ccagtgagagcgagcaggagttcctgtataagacagccggaggcctgatctattggatcggcctgaccaaggccggcatggagggcgattggag

ctgggtcgacgacacccctacaacaaagtgcagagcgtgcggttaggatccccggcgagcctaacaacgccggcaacaacgagcactgcggc

aatatcaaagcccctagcctgcaggcctggaacgatgccccgtgcgacaagacatactgttcatctgtaaaaggccttacgtgcccagcgaaccc

ggcggcggcggcagcggaggcggcggctctggcggaggaggaagcaccaacctggtgaaccagagcggctacgccctggtcgccagcggc

agaagcggaaatctgggcttcaagctgtttagcacacagagcccatctgcagaggtgaaactgaagagcctgagcctgaacgacggcagctacc

agtctgagatcgacctgtctggcggggccaatttccgggaaaagttccggaacttcgctaacgagctgtctgaagccatcaccaatagtccaaagg

gcctggaccggcctgtgcctaagactgagatactggccttatcaagacaggcgacaacttcatcacccctagattaaggccggctactacgacca

cgtggccagcgatgggtctctgctgagctactaccagagcacagagtacttcaacaatagagtgctgatgccaatcctgcaaacaacaaatggcac

actgatggccaacaaccggggctacgacgatgtgttcagacaggttcctagcttcagcggctggtccaacaccaaggccaccaccgtgagcaca

agcaacaacctgacatatgataagtggacctacttcgccgctaagggcagccctctgtacgacagctaccctaaccatacttcgaggacgtgaaga

cgctggccattgacgccaaagacatctcggccctgaagaccaccatcgacagcgaaaaacctacctacctgatcatcagaggcctgagcggcaa

cggatctcagctgaacgagctgcagctgcccgagagcgtgaagaaggtgagcctctacggcgactacaccggcgtgaacgtggccaagcagat

tttcgcaaacgtggtggaactggaattttacagcacctccaaggctaacagcttcggctttaaccccctggtgctgggatctaagaccaatgtgatcta

cgacctcttcgcttccaagccctttacccacatcgacctgacccaggtgaccctgcaaaattcagataatagcgccatcgacgccaacaagctgaaa

caagccgtgggcgacatctacaactacagaagattcgagcgccagttccagggctattttgctggcggttacatcgacaagtacctggtgaaaaac

gtgaacaccaacaaggacagcgacgatgacctggtgtacagatccctgaaagagctgaacctgcacctggaagaggcctacagagagggcgat

aatacctactatagagtgaatgagaactactaccctggcgccagtatctacgagaacgaaagagctagcagagacagcgagttccagaacgagat

cctgaagcgggccgagcagaatggcgtgaccttcgacgagaacatcaagcggatcacagccagcggcaagtacagcgtgcagttccagaaact

ggaaaacgacacagatagcagcctcgagagaatgaccaaggccgtggaaggactggtgaccgtcatcggcgaagaaaagttcgaaacggtgg

acatcaccggagtgtcctccgacaccaatgaggtgaagtccctggccaaggaactgaagaccaatgccctcggagtgaagctgaagctataa

Artificial

armY-Anthrax toxin receptor 1 fusion protein codon-optimized (for human) nucleotide

sequence (3,264 bp)

61) SEQ ID NO: 61

atgtacagaatgcagctgttgagctgtatcgccctgagcctggccctggtgaccaacagcgaggacggtggccctgcctgctacggc

gggtagacctgtacttcatcctggataagtccggttctgtgctgcaccactggaacgaaatctactacttcgtggaacagctggcccacaagtttatct

cccctcagctgcggatgagcttcatcgtgttctccacaagaggcaccaccctgatgaagctgaccgaggatcgcgagcagatcagacagggactg

gaagagctgcagaaagtgctgcctggcggcgatacatacatgcacgagggatttgagagagcctccgagcagatctattacgagaacagacagg

gctaccgcaccgccagcgtgatcattgccctgacagacggcgagctgcatgaagatctgttcttctacagcgagcgcgaggccaacagaagccg

ggacctgggcgccatcgtgtactgtgtgggcgtgaaggacttcaacgaaacccagctggccagaatcgccgatagcaaggatcacgtgttccctg

tgaacgacggattccaggccctgcagggcatcatccacagcattctaaagaagtcctgcatcgagatcctggctgctgaacccagcaccatctgcg

ccggcgagagcttccaggtggtggtgcggggcaacggcttccggcacgccagaaacgtggacagagttctgtgcagctttaagatcaatgatagc

gtgacacttaacgagaagcccttcagcgtggaagatacctacctgctgtgtcctgctccaatcttaaaagaggtgggaatgaaagccgccctgcaa

gtgtccatgaacgatggcctctcttttatcagttccagcgtgatcatcaccacaacccactgttctgatggtagcatcctggccatcgccctgctcatcc

tgtttctgctgctggccttggccctgctgtggtggttctggcctctgtgctgcaccgtgatcatcaaagaagtgcctcctcctcccgctgaagagagcg

aagaggaggacgacgacggcctgcctaagaaaaagtggcccacagtcgatgcttcttactacggcggcagaggcgttggcgggatcaagcgga

tggaagtgcggtggggagaaaagggcagtaccgaggaaggagctaagctggaaaaggccaagaatgccagagtgaagatgcctgagcagga

gtacgagttccccgagcctcggaacctgaacaacaacatgagacggccctcctctccaagaaagtggtacagccctatcaagggcaagctggac

gccctctgggtcctgctgagaaagggctacgacagagtgagcgtgatgcggccccagcctggcgacactggcagatgcatcaactttaccaggg

tgaagaacaaccagcctgccaagtaccccctgaacaacgcctaccacacaagctctcctcctcccgctcccatctacactccgccccccccagcc

ccacactgccctcccccaccaccctctgcccctacccctcccatccccagccccccttcaaccctgcctccccctccgcaagcccctccaccaaac

agagcacctccacctagcagaccccctcctagaccttctgtgggcggcggcggcagcggcggaggcggcagcggcggaggcgggagcacca

acctggtgaaccagagcggctacgccctggtggcctccggcagaagcggcaacctgggcttcaagctgttctcgacccagagcccttctgccga

ggtgaagctgaaaagcctgtcactgaatgacggctcttaccagagcgagatcgacctgagcggcggagctaacttcagagaaaagttccggaact

tcgccaacgagctgtctgaggccatcaccaacagccctaagggcctggacagacccgtacccaagaccgagatcagcggactgattaagacgg

gcgacaacttcatcacaccttccttcaaggctggatactacgatcatgtggccagcgacggcagcctgctgagctactaccagtccacagagtactt

caacaacagagtcctgatgcctatcctccagaccaccaatggcaccctgatggccaacaatagaggctacgacgacgtgttcaggcaggttccttc

tttctccggctggagcaacacaaaggccaccacagtgagcacaagcaataacctcacctacgacaaatggacctacttcgctgccaagggcagcc

ccctctacgactcttatcctaaccactttttcgaggatgtgaaaacactggctatcgatgccaaggacatcagcgcccttaaaacaacaatcgactccg

agaaacctacctacctgatcatcagaggcctgtccggcaatggcagccagctgaacgagctgcaactgcctgaaagcgtgaaaaaagtgagcct

gtatggggactacaccggcgtgaacgtggccaagcagatcttcgccaatgtggtggaactggagttctacagcactagcaaggccaattctttcgg

ctttaaccccctggtgctgggcagcaagacaaacgtgatctacgatctgttcgccagcaagcctttcacccacatcgacctgacacaggtgacgctg

cagaacagcgacaacagcgccatcgacgccaacaagctgaagcaggccgtgggcgacatttacaactaccggagattcgagagacaatttcag

ggctatttcgccggcggatacatcgacaagtatctggtcaaaaatgtgaataccaacaaggatagcgacgacgacctggtataccggtccctgaaa

gaactgaacctgcacttggaggaagcctacagagagggcgacaatacctactatagagtcaacgagaactactaccctggcgcctccatctacga

aaatgaacgggcctctagagactctgagttccaaaacgagatcctgaaaagagcagagcagaatggcgtcaccttcgacgagaacatcaagcgc

attaccgccagcggaaagtactccgtgcagttccagaagctggagaacgataccgacagctctctggaacggatgaccaaggccgtggagggac

tggtcaccgtgatcggcgaagagaagttcgaaaccgtggacatcaccggcgtgtcttctgacacaaacgaagtgaaaagcctggctaaagagctg

aagacaaacgccctgggagtgaagctgaagctgtaa

Artificial

armY-Anthrax toxin receptor 2 fusion protein codon-optimized (for human) nucleotide

sequence (2,523 bp)

62) SEQ ID NO: 62

atgtacagaatgcagctgctctcttgcattgccctgagcctggccctggtgaccaatagccaggagcaacctagctgcagaagagcctt

cgacctctacttcgtgctggataagtccggcagcgtcgccaacaattggatcgagatctacaacttcgtacagcagctggccgaacgcttcgtgagc

cccgagatgagactgagcttcatcgtgttctcttcccaggccaccatcatcctgcctctgaccggcgacagaggcaaaatctcaaagggcctggaa

gatctgaaaagagtgtcccccgtcggcgagacatacatccacgagggcctgaagctggccaatgaacagatccagaaggccggcggactgaag

accagcagcatcatcattgccctgaccgacggcaaactggacggcctggtccctagctacgccgagaaggaagccaagatcagccggagcctg

ggcgcttctgtgtactgcgtgggagtgctggacttcgagcaggctcaactggagaggatcgctgatagcaaggagcaggttttcccagtgaaagg

cggctttcaagccctgaaaggcatcatcaacagcatcctggcccagagctgtacagagatcctggaactccagcctagcagcgtgtgcgtcggcg

aagagttccagatcgtgttaagcggcagaggcttcatgctgggcagcagaaacggcagcgtgctgtgcacatacaccgtcaatgagacctacaca

acaagcgtgaagcccgtgtccgtgcagctgaatagcatgctgtgtcctgcccctatcctcaacaaggccggcgaaaccctggacgtgtccgtgtctt

tcaatggcggcaagagcgtaatctccggctctctgatcgtgacagccaccgagtgcagcaacggaggcggaggcggatctggtggcggaggat

cgggcggtggcggtagcaccaacctggtgaaccagtcaggctacgcccttgtggccagcggaagatccggcaacctgggctttaagctgttttcta

cacagagcccatctgctgaagtgaagctgaagtctctcagcctgaacgacggctcttatcagtccgagatcgatctgagcggaggagccaatttcc

gggagaagttcagaaactttgctaatgagctgagcgaagccatcacaaacagccctaagggcctggatagacctgtgcccaagaccgagatcag

cggactgatcaagacaggcgacaacttcatcaccccaagcttcaaggctggctactatgaccacgtggcctctgatggatccctgctgtcttattacc

agagcacagaatacttcaacaacagagtgctgatgcctatcctgcaaaccaccaatggaacgctgatggccaacaaccggggctacgatgacgtg

ttcagacaggtgcctagcttcagcggatggagcaacaccaaggccacaacagtcagcacctctaacaacctgacctacgacaagtggacctacttt

gccgctaagggctctccactgtacgatagctaccccaaccacttctttgaggacgtgaagacactggccatcgatgccaaagacatatctgcgctga

agaccaccatcgacagcgagaagcctacatatctgatcatcagaggcttgagcggcaacgggtctcagctgaacgagcttcagctgcctgagagc

gtgaaaaaggtgagcctgtacggcgactacaccggcgtgaacgtggccaagcagatcttcgctaacgtggtggaattagagttctacagcaccag

caaggccaacagcttcggcttcaaccccctggtgctgggctctaagacaaacgtgatctacgatctgttcgccagcaaacccttcacccacatcgat

ctgacccaggtgaccctgcagaactccgacaacagcgccatcgacgccaacaagctgaaacaggccgtgggcgacatctacaattaccggagat

tcgagcggcaattccagggctactttgcgggcggctacatcgacaagtacctggtgaagaacgtgaacacgaacaaggacagcgacgacgacct

ggtgtaccggagccttaaggagctgaacctgcatctggaagaagcctaccgggagggcgataacacatattaccgggtgaatgagaactactacc

ctggcgccagcatctacgagaacgagagagccagcagagatagcgaattccaaaacgaaatcctgaagcgggccgagcagaacggcgtgactt

tcgacgagaatattaagagaatcaccgcctccggaaagtacagcgtgcagtttcagaaactggaaaacgatacagactcaagcttggagcgcatg

accaaggccgtggaaggcctggtgaccgtaatcggcgaggaaaaattcgaaaccgtggacattaccggcgtgtcttctgacaccaacgaggtga

agagcctggctaaagagctgaagaccaacgccctgggcgtcaagctgaagctgtaa

Artificial

Protein M with radiolabel peptide tag (KGRPLVY) protein sequence (555 amino acids).

Including the human IL-2 signal sequence, radiolabel tag, linker, Mycoplasma genitalium

protein M

63) SEQ ID NO: 63

MYRMQLLSCIALSLALVTNSKGRPLVYGGSGGGGSTNLVNQSGYALVASGRSGNLG

FKLFSTQSPSAEVKLKSLSLNDGSYQSEIDLSGGANFREKFRNFANELSEAITNSPKGLDRPVP

KTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYYQSTEYFNNRVLMPILQTTNGTLMANN

RGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKWTYFAAKGSPLYDSYPNHFFEDVKTLAI

DAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLPESVKKVSLYGDYTGVNVAKQIFANV

VELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFTHIDLTQVTLQNSDNSAIDANKLKQA

VGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKDSDDDLVYRSLKELNLHLEEAYREGD

NTYYRVNENYYPGASIYENERASRDSEFQNEILKRAEQNGVTFDENIKRITASGKYSVQFQKL

ENDTDSSLERMTKAVEGLVTVIGEEKFETVDITGVSSDTNEVKSLAKELKTNALGVKLKL

Substitution #1: Alanine mutagenesis (underlined “A”) of a) 494-507 amino acid (highlighted

in green) and b) 527-540 amino acid (highlighted in green) predicted to be immunogenic in

Protein M (469-556 amino acid). See SEQ ID NO: 1 for the original sequence (37-556 amino acids).

64) SEQ ID NO: 64

TNLVNQSGYALVASGRSGNLGFKLFSTQSPSAEVKLKSLSLNDGSYQSEIDLSGGANF

REKFRNFANELSEAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYY

QSTEYFNNRVLMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKW

TYFAAKGSPLYDSYPNHFFEDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLP

ESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFT

HIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKD

SDDDLVYRSLKELNLHLEEAYREGDNTYYRVNENYYPGASIYENERASRDSEFQNEILKRAE

QNGVTFDENIKRITASGKYSVQFQALANATASALAAMTKAVEGLVTVIGEEKFETVAIAGVA

SATNAVASLAKELKTNALGVKLKL

Substitution #2: Alanine mutagenesis (underlined “A”) of a) 494-507 amino acid (highlighted

in green) and b) 527-540 amino acid (highlighted in green) predicted to be immunogenic in

Protein M (469-556 amino acid). See SEQ ID NO: 1 for the original sequence (37-556 amino acids).

65) SEQ ID NO: 65

TNLVNQSGYALVASGRSGNLGFKLFSTQSPSAEVKLKSLSLNDGSYQSEIDLSGGANF

REKFRNFANELSEAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYY

QSTEYFNNRVLMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKW

TYFAAKGSPLYDSYPNHFFEDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLP

ESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFT

HIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKD

SDDDLVYRSLKELNLHLEEAYREGDNTYYRVNENYYPGASIYENERASRDSEFQNEILKRAE

QNGVTFDENIKRITASGKYSVQFAKLANATASALARMTKAVEGLVTVIGEEKFETVAIAGVA

SATNAVKSLAKELKTNALGVKLKL

Substitution #3: Alanine mutagenesis (underlined “A”) of a) 494-507 amino acid (highlighted

in green) and b) 527-540 amino acid (highlighted in green) predicted to be immunogenic in

Protein M (469-556 amino acid). See SEQ ID NO: 1 for the original sequence (37-556 amino acids).

66) SEQ ID NO: 66

TNLVNQSGYALVASGRSGNLGFKLFSTQSPSAEVKLKSLSLNDGSYQSEIDLSGGANF

REKFRNFANELSEAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYY

QSTEYFNNRVLMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKW

TYFAAKGSPLYDSYPNHFFEDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLP

ESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFT

HIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKD

SDDDLVYRSLKELNLHLEEAYREGDNTYYRVNENYYPGASIYENERASRDSEFQNEILKRAE

QNGVTFDENIKRITASGKYSVQFQALEADADSALEAMTKAVEGLVTVIGEEKFETVDIAGVS

ADTAEVASLAKELKTNALGVKLKL

Substitution #4: Alanine mutagenesis (underlined “A”) of a) 494-507 amino acid (highlighted

in green) and b) 527-540 amino acid (highlighted in green) predicted to be immunogenic in

Protein M (469-556 amino acid). See SEQ ID NO: 1 for the original sequence (37-556 amino acids).

67) SEQ ID NO: 67

TNLVNQSGYALVASGRSGNLGFKLFSTQSPSAEVKLKSLSLNDGSYQSEIDLSGGANF

REKFRNFANELSEAITNSPKGLDRPVPKTEISGLIKTGDNFITPSFKAGYYDHVASDGSLLSYY

QSTEYFNNRVLMPILQTTNGTLMANNRGYDDVFRQVPSFSGWSNTKATTVSTSNNLTYDKW

TYFAAKGSPLYDSYPNHFFEDVKTLAIDAKDISALKTTIDSEKPTYLIIRGLSGNGSQLNELQLP

ESVKKVSLYGDYTGVNVAKQIFANVVELEFYSTSKANSFGFNPLVLGSKTNVIYDLFASKPFT

HIDLTQVTLQNSDNSAIDANKLKQAVGDIYNYRRFERQFQGYFAGGYIDKYLVKNVNTNKD

SDDDLVYRSLKELNLHLEEAYREGDNTYYRVNENYYPGASIYENERASRDSEFQNEILKRAE

QNGVTFDENIKRITASGKYSVQFAKLANDTASSAERATKAVEGLVTVIGEEKFETVAITGASS

ATNAVKALAKELKTNALGVKLKL

Armoracia rusticana

Horseradish peroxidase mature protein sequence (31-338 amino acids).

68) SEQ ID NO: 68

QLTPTFYDNSCPNVSNIVRDTIVNELRSDPRIAASILRLHFHDCFVNGCDASILLDNTTS

FRTEKDAFGNANSARGFPVIDRMKAAVESACPRTVSCADLLTIAAQQSVTLAGGPSWRVPLG

RRDSLQAFLDLANANLPAPFFTLPQLKDSFRNVGLNRSSDLVALSGGHTFGKNQCRFIMDRL

YNFSNTGLPDPTLNTTYLQTLRGLCPLNGNLSALVDFDLRTPTIFDNKYYVNLEEQKGLIQSD

QELFSSPNATDTIPLVRSFANSTQTFFNAFVEAMDRMGNITPLTGTQGQIRLNCRVVNSNS

Escherichia coli

Alkaline phosphatase mature protein sequence (22-471 amino acids).

69) SEQ ID NO: 69

RTPEMPVLENRAAQGDITAPGGARRLTGDQTAALRDSLSDKPAKNIILLIGDGMGDS

EITAARNYAEGAGGFFKGIDALPLTGQYTHYALNKKTGKPDYVTDSAASATAWSTGVKTYN

GALGVDIHEKDHPTILEMAKAAGLATGNVSTAELQDATPAALVAHVTSRKCYGPSATSEKCP

GNALEKGGKGSITEQLLNARADVTLGGGAKTFAETATAGEWQGKTLREQAQARGYQLVSD

AASLNSVTEANQQKPLLGLFADGNMPVRWLGPKATYHGNIDKPAVTCTPNPQRNDSVPTLA

QMTDKAIELLSKNEKGFFLQVEGASIDKQDHAANPCGQIGETVDLDEAVQRALEFAKKEGNT

LVIVTADHAHASQIVAPDTKAPGLTQALNTKDGAVMVMSYGNSEEDSQEHTGSQLRIAAYG

PHAANVVGLTDQTDLFYTMKAALGLK

Photinus pyralis

Luciferase protein sequence (1-550 amino acid).

70) SEQ ID NO: 70

MEDAKNIKKGPAPFYPLEDGTAGEQLHKAMKRYALVPGTIAFTDAHIEVNITYAEYF

EMSVRLAEAMKRYGLNTNHRIVVCSENSLQFFMPVLGALFIGVAVAPANDIYNERELLNSM

NISQPTVVFVSKKGLQKILNVQKKLPIIQKIIIMDSKTDYQGFQSMYTFVTSHLPPGFNEYDFV

PESFDRDKTIALIMNSSGSTGLPKGVALPHRTACVRFSHARDPIFGNQIIPDTAILSVVPFHEIGF

GMFTTLGYLICGFRVVLMYRFEEELFLRSLQDYKIQSALLVPTLFSFFAKSTLIDKYDLSNLHE

IASGGAPLSKEVGEAVAKRFHLPGIRQGYGLTETTSAILITPEGDDKPGAVGKVVPFFEAKVV

DLDTGKTLGVNQRGELCVRGPMIMSGYVNNPEATNALIDKDGWLHSGDIAYWDEDEHFFIV

DRLKSLIKYKGYQVAPAELESILLQHPNIFDAGVAGLPDDDAGELPAAVVVLEHGKTMTEKE

IVDYVASQVTTAKKLRGGVVFVDEVPKGLTGKLDARKIREILIKAKKGGKSKL

Artificial

Xpress tag, a peptide recognized by an antibody

71) SEQ ID NO: 71

DLYDDDDK

Artificial

E-tag, a peptide recognized by an antibody (13 amino acid)

72) SEQ ID NO: 72

GAPVPYPDPLEPR

Artificial

FLAG-tag, a peptide recognized by an antibody (8 amino acid)

73) SEQ ID NO: 73

DYKDDDDK

Artificial

HA-tag, a peptide recognized by an antibody (9 amino acid)

74) SEQ ID NO: 74

YPYDVPDYA

Artificial

HA-tag, a peptide recognized by an antibody (9 amino acid)

75) SEQ ID NO: 75

YPYDVPDYA

Artificial

His-tag, 5-10 histidines bound by a nickel or cobalt chelate or antibody (6 amino acid)

76) SEQ ID NO: 76

HHHHHH

Artificial

Myc-tag, a short peptide recognized by an antibody (14 amino acid)

77) SEQ ID NO: 77

EQKLISEEDLLRKR

Artificial

S-tag, a short peptide recognized by an antibody (15 amino acid)

78) SEQ ID NO: 78

KETAAAKFERQHMDS

Artificial

Softag 1, for mammalian expression, a short peptide recognized by an antibody (13 amino

acid)

79) SEQ ID NO: 79

SLAELLNAGLGGS

Artificial

VSV-tag, a peptide recognized by an antibody (11 amino acid)

80) SEQ ID NO: 80

YTDIEMNRLGK

Artificial

Softag 3, for prokaryotic expression, a short peptide recognized by an antibody (8 amino acid)

81) SEQ ID NO: 81

TQDPSRVG

Artificial

VS tag, a peptide recognized by an antibody (14 amino acid)

82) SEQ ID NO: 82

GKPIPNPLLGLDST

Artificial

Avi-Tag, a peptide allowing biotinylation by the enzyme BirA and so the protein can be

isolated by streptavidin and/or avidin (20 amino acid)

83) SEQ ID NO: 83

MAGGLNDIFEAQKIEWHEGG

Artificial

SBP-tag, a peptide which binds to streptavidin (38 amino acid)

84) SEQ ID NO: 84

MDEKTTGWRGGHVVEGLAGELEQLRARLEHHPQGQREP

Artificial

Strep-tag (Strep-tag II), a peptide which binds to streptavidin or the modified streptavidin

called streptactin (8 amino acid)

85) SEQ ID NO: 85

WSHPQFEK

Escherichia coli

BCCP (Biotin Carboxyl Carrier Protein), a protein domain biotinylated by BirA enabling

recognition by streptavidin (73-156 amino acids)

86) SEQ ID NO: 86

PAAAEISGHIVRSPMVGTFYRTPSPDAKAFIEVGQKVNVGDTLCIVEAMKMMNQIEA

DKSGTVKAILVESGQPVEFDEPLVVIE

Artificial

TC tag, a tetracysteine tag that is recognized by FlAsH and ReAsH biarsenical compounds (6

amino acid)

87) SEQ ID NO: 87

CCPGCC

Artificial

Calmodulin-tag, a peptide bound by the protein calmodulin (26 amino acid)

88) SEQ ID NO: 88

KRRWKKNFIAVSAANRFKKISSSGAL

Artificial

Polyglutamate tag, a peptide binding efficiently to anion-exchange resin such as Mono-Q (6

amino acids)

89) SEQ ID NO: 89

EEEEEE

Rhodococcus sp./Artificial

Halo-tag, a mutated hydrolase that covalently attaches to the HaloLin Resin (297 amino acid)

90) SEQ ID NO: 90

MAEIGTGFPFDPHYVEVLGERMHYVDVGPRDGTPVLFLHGNPTSSYVWRNIIPHVAP

THRCIAPDLIGMGKSDKPDLGYFFDDHVRFMDAFIEALGLEEVVLVIHDWGSALGFHWAKR

NPERVKGIAFMEFIRPIPTWDEWPEFARETFQAFRTTDVGRKLIIDQNVFIEGTLPMGVVRPLT

EVEMDHYREPFLNPVDREPLWRFPNELPIAGEPANIVALVEEYMDWLHQSPVPKLLFWGTPG

VLIPPAEAARLAKSLPNCKAVDIGPGLNLLQEDNPDLIGSEIARWLSTLEISG

Escherichia coli

Maltose binding protein-tag, a protein which binds to amylose agarose (27-396 amino acid)

91) SEQ ID NO: 91

KIEEGKLVIWINGDKGYNGLAEVGKKFEKDTGIKVTVEHPDKLEEKFPQVAATGDGP

DIIFWAHDRFGGYAQSGLLAEITPDKAFQDKLYPFTWDAVRYNGKLIAYPIAVEALSLIYNKD

LLPNPPKTWEEIPALDKELKAKGKSALMFNLQEPYFTWPLIAADGGYAFKYENGKYDIKDVG

VDNAGAKAGLTFLVDLIKNKHMNADTDYSIAEAAFNKGETAMTINGPWAWSNIDTSKVNY

GVTVLPTFKGQPSKPFVGVLSAGINAASPNKELAKEFLENYLLTDEGLEAVNKDKPLGAVAL

KSYEEELAKDPRIAATMENAQKGEIMPNIPQMSAFWYAVRTAVINAASGRQTVDEALKDAQ

TRITK

Escherichia coli

Nus-tag, recognized by an antibody (1-495)

92) SEQ ID NO: 92

MNKEILAVVEAVSNEKALPREKIFEALESALATATKKKYEQEIDVRVQIDRKSGDFDT

FRRWLVVDEVTQPTKEITLEAARYEDESLNLGDYVEDQIESVTFDRITTQTAKQVIVQKVREA

ERAMVVDQFREHEGEIITGVVKKVNRDNISLDLGNNAEAVILREDMLPRENFRPGDRVRGVL

YSVRPEARGAQLFVTRSKPEMLIELFRIEVPEIGEEVIEIKAAARDPGSRAKIAVKTNDKRIDPV

GACVGMRGARVQAVSTELGGERIDIVLWDDNPAQFVINAMAPADVASIVVDEDKHTMDIAV

EAGNLAQAIGRNGQNVRLASQLSGWELNVMTVDDLQAKHQAEAHAAIDTFTKYLDIDEDF

ATVLVEEGFSTLEELAYVPMKELLEIEGLDEPTVEALRERAKNALATIAQAQEESLGDNKPAD

DLLNLEGVDRDLAFKLAARGVCTLEDLAEQGIDDLADIEGLTDEKAGALIMAARNICWFGDE

A

Escherichia coli

Thioredoxin-tag is commonly used in expression and purification of recombinant proteins. It

improves the solubility of that protein of interest. Recognized by an antibody (2-109 amino acid)

93) SEQ ID NO: 93

SDKIIHLTDDSFDTDVLKADGAILVDFWAEWCGPCKMIAPILDEIADEYQGKLTVAKL

NIDQNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLA

Artificial

Isopeptag, a peptide which binds covalently to pilin-C protein (16 amino acid)

94) SEQ ID NO: 94

TDKDMTITFTNKKDAE

Artificial

SpyTag, a peptide which binds covalently to SpyCatcher protein (13 amino acids)

95) SEQ ID NO: 95

AHIVMVDAYKPTK

Aequorea victoria

Green fluorescent protein-tag, a protein which is spontaneously fluorescent and can be bound

by antibodies (1-238 amino acid)

96) SEQ ID NO: 96

MSKGEELFTGVVPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPW

PTLVTTFSYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTL

VNRIELKGIDFKEDGNILGHKLEYNYNSHNVYIMADKQKNGIKVNFKIRHNIEDGSVQLADH

YQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITHGMDELYK

Artificial

Allows for cleavage by TEV protease between the Gln and Ser residues (7 amino acid)

97) SEQ ID NO: 97

ENLYFQS

Artificial

Allows for cleavage by Thrombin protease between Arg and Gly residues (6 amino acid)

98) SEQ ID NO: 98

LVPRGS

Artificial

Allows for cleavage by PreScission protease between the Gln and Gly residues (8 amino acid)

99) SEQ ID NO: 99

LEVLFQGP

Human

C1q A-chain mature amino acid sequence (23-245amino acid)

100) SEQ ID NO: 100

EDLCRAPDGKKGEAGRPGRRGRPGLKGEQGEPGAPGIRTGIQGLKGDQGEPGPSGNP

GKVGYPGPSGPLGARGIPGIKGTKGSPGNIKDQPRPAFSAIRRNPPMGGNVVIFDTVITNQEEP

YQNHSGRFVCTVPGYYYFTFQVLSQWEICLSIVSSSRGQVRRSLGFCDTTNKGLFQVVSGGM

VLQLQQGDQVWVEKDPKKGHIYQGSEADSVFSGFLIFPSA

Human

C1q B-chain mature amino acid sequence (28-253 amino acid)

101) SEQ ID NO: 101

QLSCTGPPAIPGIPGIPGTPGPDGQPGTPGIKGEKGLPGLAGDHGEFGEKGDPGIPGNP

GKVGPKGPMGPKGGPGAPGAPGPKGESGDYKATQKIAFSATRTINVPLRRDQTIRFDHVITN

MNNNYEPRSGKFTCKVPGLYYFTYHASSRGNLCVNLMRGRERAQKVVTFCDYAYNTFQVT

TGGMVLKLEQGENVFLQATDKNSLLGMEGANSIFSGFLLFPDMEA

Human

C1q C-chain mature amino acid sequence (29-245 amino acid)

NTGCYGIPGMPGLPGAPGKDGYDGLPGPKGEPGIPAIPGIRGPKGQKGEPGLPGHPGK

NGPMGPPGMPGVPGPMGIPGEPGEEGRYKQKFQSVFTVTRQTHQPPAPNSLIRFNAVLTNPQ

GDYDTSTGKFTCKVPGLYYFVYHASHTANLCVLLYRSGVKVVTFCGHTSKTNQVNSGGVLL

RLQVGEEVWLAVNDYYDMVGIQGSDSVFSGFLLFPD

Protein M fusion proteins and uses

Information

Patent Number

Date Filed

Date Issued

Inventors

Examiners

CPC

Field of Search

US

International Classifications

Abstract

Description

Claims

CROSS-REFERENCE TO RELATED APPLICATIONS

Non-Patent Literature Citations (4)

Related Publications (1)

Provisional Applications (1)

Entry
Arfi et al. (2021) mBio 12(6)e0197421: 1-13.
Blötz et al. (2020) Front. Microbiol. 11(685): 1-12.
Grover et al. (2014) Science 343(6171): 656-661.
Rentero et al. (2011) Chimia 65: 843-845.