IMMUNOGENIC COMPOSITIONS AND VACCINES FOR PREVENTING OR TREATING FILARIAL DISEASE

Information

  • Patent Application
  • 20180064791
  • Publication Number
    20180064791
  • Date Filed
    March 17, 2016
    8 years ago
  • Date Published
    March 08, 2018
    6 years ago
Abstract
The present disclosure is directed to an immunogenic composition including: at least one or at least two isolated polypeptides or immunogenic fragments thereof, and optionally a pharmaceutically acceptable carrier, wherein each polypeptide is expressed on a luminal surface of an intestine of a filarial worm, wherein each polypeptide is expressed at a level at least two-fold higher in the intestine in comparison to the level of expression of each polypeptide in a reproductive tract or a body wall of the filarial worm, wherein each isolated polypeptide has at least one transmembrane domain, and wherein each polypeptide is a non-mitochondrial polypeptide. Also provided herein is a method for preventing or treating a filarial disease.
Description
SEQUENCE LISTING

The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Mar. 9, 2016, is named HMJ-150-PCT_ST25.txt and is 120,000 bytes in size.


BACKGROUND

The filariae are thread-like parasitic nematodes (roundworms) that are transmitted by arthropod vectors. The adult worms inhabit specific tissues where they mate and produce microfilariae, the characteristic tiny, thread-like larvae.


The microfilariae infect vector arthropods, in which they mature to infective larvae. Diseases caused by filariae are a major health problem in many tropical and subtropical areas. Wuchereria bancrofti and Brugia malayi are filarial parasites that are the major causative agents of lymphatic filariasis. Currently, it is estimated that over 129 million people are infected with either of these organisms and over one billion live in at-risk areas. Since 2000, there has been an ongoing effort through the Global Program to Eliminate Lymphatic Filariasis to eradicate these infections. While this program is having a substantive impact on the prevalence of infection, its efficacy is limited by the need to repeatedly treat entire endemic populations for 6-10 years. The advent of new tools, such as vaccines or more effective anthelmintics, would be of great benefit toward these eradication efforts.


One of the principle obstacles in designing vaccines against such parasitic worms, however, is that previously exposed individuals frequently have IgE antibodies to surface and secreted worm antigens, putting them at risk for allergic reactions when re-exposed to these antigens. Since intestinal antigens of helminths may be “hidden” from the immune response during natural infection, yet accessible by antibodies after antigen administration, homogenates of such antigens have been proposed for use in vaccines. However, while the genomes of Wuchereria bancrofti and Brugia malayi, as well as the filarial genomes of the causative agents of loiasis and river blindness have been completed, the anatomic localization of proteins in these filarial worms is unknown. Moreover, the use of homogenates from helminth intestines has resulted in variable efficacy.


Other diseases known in the art, which are caused by filarial parasites, include heartworm disease. This disease is caused by the parasite Dirofilaria immitis. The physical presence of the heartworm parasite in the pulmonary artery and right ventricle of the canine heart, for example, and the resulting destruction of tissue, causes respiratory and circulatory problems which can be fatal under conditions of stress or vigorous exercise. The heartworm parasite has also been shown to be the cause of focal lung, liver, eye and cutaneous lesions in man. Currently, heartworm disease is treated by administering anti-parasitic agents to infected animals. Unfortunately, heartworm disease that has not been diagnosed in its early stages may be quite refractile to treatment.


Accordingly, there is a need in the art for immunogenic compositions, such as vaccines, which may be used to prevent or treat diseases caused by filarial worms including lymphatic filariasis, loiasis, river blindness and heartworm.


BRIEF SUMMARY

Further areas of applicability of the present disclosure will become apparent from the detailed description provided hereinafter. It should be understood that the detailed description and specific examples, while indicating embodiments of the disclosure, are intended for purposes of illustration only and are not intended to limit the scope of the disclosure.


The present disclosure is directed to an immunogenic composition including: at least one or at least two isolated polypeptides or immunogenic fragments thereof, and optionally a pharmaceutically acceptable carrier, wherein each polypeptide is expressed on a luminal surface of an intestine of a filarial worm, wherein each polypeptide is expressed at a level at least two-fold higher in the intestine in comparison to the level of expression of each polypeptide in a reproductive tract or a body wall of the filarial worm, wherein each isolated polypeptide has at least one transmembrane domain, and wherein each polypeptide is a non-mitochondrial polypeptide.


Also provided herein is a method for preventing or treating a filarial disease including administering an effective amount of a vaccine composition including at least one or at least two isolated polypeptides or immunogenic fragments thereof to a subject in need thereof, wherein each polypeptide is expressed on a luminal surface of an intestine of a filarial worm, wherein each polypeptide is expressed at a level at least two-fold higher in the intestine in comparison to the level of expression of each polypeptide in a reproductive tract or a body wall of the filarial worm, wherein each isolated polypeptide has at least one transmembrane domain, and wherein each polypeptide is a non-mitochondrial polypeptide.





BRIEF DESCRIPTION OF THE DRAWINGS

The present disclosure will become more fully understood from the detailed description and the accompanying drawings, wherein:



FIG. 1 depicts the anatomy of adult female B. malayi. The tissues and structures dissected for the proteomic analysis as described in the Examples include the body wall, reproductive tract and intestine.



FIGS. 2A and 2B depict the dissection process of adult female B. malayi. FIG. 2A shows a break in the body wall and extrusion of the gut and reproductive tract as described in the Examples. Magnification: top left: 40×, bottom left: 100×. FIG. 2B shows the body wall in the process of being slid away from the gut and reproductive tract. Magnification: top right: 30×, bottom right: 20×.



FIG. 3A depicts a Venn diagram of proteins identified within each anatomic fraction of adult female Brugia malayi based on a 2 peptide minimum for identification as described in the Examples.



FIG. 3B depicts a heat map of hierarchical clustered proteins found within the different anatomic fractions of adult female B. malayi.



FIG. 4 depicts the association of transporter proteins with the intestine as measured by gene set enrichment analysis (GSEA) P-value=0.005, as described in the Examples.



FIG. 5A depicts the association of immunological (P-value=0.003) proteins with the body wall of the adult female B. malayi as measured by GSEA, as described in the Examples.



FIG. 5B depicts the association of cytoskeletal proteins (P-value=0 0.009) with the body wall of the adult female B. malayi as measured by GSEA, as described in the Examples.



FIG. 6A depicts the association of transcription (P<0.001with the reproductive tract of adult female B. malayi as described in the Examples.



FIG. 6B depicts the association of nuclear regulation proteins (P=0.013) with the reproductive tract of adult female B. malayi as described in the Examples.





DETAILED DESCRIPTION

The following description of the embodiments is merely exemplary in nature and is in no way intended to limit the disclosure, its application, or uses.


As used throughout, ranges are used as shorthand for describing each and every value that is within the range. Any value within the range can be selected as the terminus of the range. In addition, all references cited herein are hereby incorporated by reference in their entireties. In the event of a conflict in a definition in the present disclosure and that of a cited reference, the present disclosure controls.


Unless otherwise specified, all percentages and amounts expressed herein and elsewhere in the specification should be understood to refer to percentages by weight. The amounts given are based on the active weight of the material.


Immunogenic Composition


In some embodiments, the present disclosure encompasses an immunogenic composition comprising at least one isolated polypeptide as described herein or an immunogenic fragment thereof and optionally a pharmaceutically acceptable carrier.


The term “polypeptide” as used herein refers to a polymer of amino acid residues. This term is used interchangeably with the term “protein.”


In some embodiments, the present polypeptides are obtained or derived from a filarial worm. A “filarial worm” as used herein refers to parasitic nematodes of the Metazoa kingdom including the superfamily filarioidea, family Filariidae. Filarial worms include, but are not limited to, species belonging to the genera Brugia, such Brugia malyai, Wuchereria, such as Wuchereria bancrofti, Onchocerca, such as Onchocerca volvulus, Loa, such as Loa loa and Dirofilaria, such as Dirofilaria immitis. The phrase “derived from” encompasses actually or theoretically “originating from,” “obtained from,” or “isolated from” a parent polypeptide.


In some embodiments, the polypeptides of the present immunogenic are expressed on a luminal surface of a filarial worm intestine. Without being bound by theory, it is believed that administering such intestinal luminal surface polypeptides to a subject may mitigate the possibility of an allergic reaction in the subject. Because intestinal antigens may be “hidden” from the immune response during natural infection, yet accessible by antibodies induced by administration, intestinal antigens are likely to have a low potential for eliciting allergic responses when administering, such as vaccinating, previously infected individuals. Further, it is believed that administering, such as vaccinating, a subject with particular intestinal antigens, such as the polypeptides or immunogenic fragments thereof described herein, may be more effective than vaccinating with crude homogenates of antigens since, for example, sufficient amounts of effective antigen may not be present in such mixtures. Accordingly, in some embodiments, the present immunogenic composition consists essentially of the isolated polypeptides disclosed herein, such as one or more isolated polypeptides selected from SEQ ID NOS: 1-27 or immunogenic fragments thereof and a pharmaceutically acceptable carrier and does not encompass crude homogenates of antigen, such as a crude homogenate of intestinal proteins.


In various embodiments, the present polypeptides are enriched in the intestine of a filarial worm in comparison to another anatomic fraction of a filarial worm. As used herein, “enriched” or “abundant” refers to a polypeptide that is naturally found to be expressed at a higher level in the intestine, for example, in comparison to other anatomic fractions, such as the reproductive tract or the body wall. See FIG. 1, which depicts the anatomy of a filarial worm. In some embodiments, the expression level of the polypeptide is at least two fold higher, such as at least three fold higher, such as at least five fold higher, such as at least 10 fold higher, such as at least 50 fold higher in the intestine in comparison to the expression level of the polypeptide in the reproductive tract or the body wall of the filarial worm.


Enrichment or abundance of polypeptides can be assessed by any method known in the art. For example, protein separation and comparison by two-dimensional polyacrylamide gel electrophoresis (2D-PAGE), followed by mass spectrometry (MS) or tandem mass spectrometry (MS/MS) identification may be used for quantitative analysis of protein mixtures, see, for example, H. J. Issaq and T. D. Veenstra, BioTechniques, vol. 44, no. 5, pp. 697-700, 2008, which is herein incorporated by reference. In this method, the intensity of the protein stain is used to make a determination regarding the quantity of a particular protein.


Protein quantitation may also be assessed using non-gel-based “shotgun” proteomic techniques such as Multidimensional Protein Identification (MudPIT). See e.g., A. Motoyama and J. R. Yates III, Analytical Chemistry, vol. 80, no. 19, pp. 7187-7193, 2008 and B. Domon and R. Aebersold, Science, vol. 312, no. 5771, pp. 212-217, 2006, which are each herein incorporated by reference.


Non-gel based proteomic methods may include (i) sample preparation including protein extraction, reduction, alkylation, and digestion; (ii) sample separation by liquid chromatography (LC or LC/LC) and analysis by MS/MS; (iii) data analysis including peptide/protein identification, quantification, and statistical analysis. For example, each sample may be separately prepared, then subjected to individual LC-MS/MS or LC/LC-MS/MS runs. Protein abundance may be assessed, for example, using spectral counting of identified proteins after MS/MS analysis. Spectral count may be measured for individual LC-MS/MS or LC/LC-MS/MS runs and changes in protein abundance may be calculated via a direct comparison between different analyses.


Typically, normalization and statistical analysis of spectral counting datasets are used to detect changes in protein abundance in complex mixtures. See, for example, McIlwain et al., 2012, BMC Bioinformatics, 13, 308, Paoletti et al., 2006, Proc. Natl. Acad. Sci. U.S.A., 103, 18928-18933, regarding determining the relative abundance of a single protein between samples and Liu et al., 2004, Anal. Chem.,76, 4193-4201, for example, regarding estimating relative abundance between different proteins in one sample. These references are herein incorporated in their entireties.


The relative abundance of the present polypeptides may be assessed using normalization and statistical analysis of spectral counting datasets as described in the present Examples. For example, Normalized Spectral Abundance Factor (NSAF) and NSAF enrichment values may be determined (see Examples). In some embodiments, the polypeptides of the present disclosure exhibit an NSAF enrichment value of at least 2, such as at least 3, such as at least 4, such as at least 20 in the intestine and less than 2, such as less than 1 or less than 0.5 in other atomic fractions, such as the body wall or the reproductive tract.


In some embodiments, the polypeptides of the present immunogenic composition have at least one transmembrane domain, such as 1-2 transmembrane domains, to facilitate recombinant expression and a non-cytoplasmic portion to increase the likelihood of interaction with antibodies ingested by filariae. The non-cytoplasmic portion may be for example, 50 amino acids in length or more, such as 100 amino acids in length or more, such as 500 amino acids in length or more. Such features may be determined using methods known in the art or predicted using art known software, e.g. Interpro, Zdobnov E M, Apweiler R. 2001. InterProScan-an integration platform for the signature-recognition methods in InterPro. Bioinformatics 17: 847-848, which is herein incorporated by reference.


In some embodiments, the polypeptide of the present disclosure is specific to the intestine. “Specific” in reference to a particular anatomical fraction of the worm means that the polypeptide was only identified within the specific fraction, e.g., the intestine.


In some embodiments, the present polypeptides are non-mitochondrial peptides, i.e. not present, targeted to or expressed in the mitochondria.


As noted above, the present composition comprising at least one isolated polypeptide is an immunogenic composition. As used herein, the term “immunogen” or “immunogenic” refers to any substrate that elicits an immune response in a host, e.g., at least an antibody response. An “immunogenic composition” includes at least one isolated polypeptide with or without a pharmaceutically acceptable carrier, such as an adjuvant. The immunogenic compositions disclosed herein may or may not be immunoprotective or therapeutic. Accordingly, the term “immunogenic” is not intended to be limited to vaccines.


In some embodiments, the immunogenic composition of the present disclosure is a vaccine. As used herein, a vaccine encompasses an immunogenic composition that prevents, ameliorates, palliates, or eliminates disease from a host, such as the diseases described herein.


In other embodiments, the immunogenic composition described herein may be used to obtain an antibody composition, which may then be administered to a subject to provide temporary immunity, i.e., artificially acquired passive immunity. Methods for preparing and administering such antibody compositions are known in the art and are described, for example, in U.S. Pat. No. 4,748,018, which is herein incorporated by reference in its entirety.


As described herein, a polypeptide of the present immunogenic composition is an isolated polypeptide. The terms “isolated” or “purified” or “biologically pure” refer to material that is substantially or essentially free from components that normally accompany it as found in its native state. Purity and homogeneity are typically determined using analytical chemistry techniques such as polyacrylamide gel electrophoresis or high performance liquid chromatography. A protein that is the predominant species present in a preparation is considered to be substantially purified. The term “purified” denotes that a protein gives rise to essentially one band in an electrophoretic gel. For example, it means that the protein is at least 85% pure, such as at least 95% pure or at least 99% pure.


In some embodiments, the polypeptide of the present immunogenic composition is a polypeptide selected from SEQ ID NOS: 1-27. In some embodiments, the isolated polypeptide is a cell adhesion protein, such as a polypeptide of SEQ ID NOS: 1 or 2. In some embodiments, the isolated polypeptide is a cell signaling protein, such as a polypeptide of SEQ ID NOS: 3 or 4. In some embodiments, the polypeptide is a chaperone/HSP protein, such as a polypeptide of SEQ ID NOS: 5 or 6. In some embodiments, the polypeptide is involved in xenobiotic metabolism using glucuronidation, for example, a UDP-glucuronosyl or a UDP glucosyl transferase, such as a polypeptide of SEQ ID NOS: 7 or 8, respectively. In some embodiments, the isolated polypeptide is a protease such as SEQ ID NOS: 13, 14 or 15. In some embodiments, the isolated polypeptide is a protease inhibitor, such as SEQ ID NO: 17. A description of SEQ ID NOS: 1-27 is found in Table 7 and Tables C and D.


In some embodiments, the polypeptide of the present immunogenic composition is an immunogenic fragment, such as an immunogenic fragment of SEQ ID NOS: 1-27. The phrase “immunogenic fragment” or “immunogenic portion” refers to a fragment or truncated form of an amino acid sequence, for example, an amino acid sequence selected from the group consisting of SEQ ID NOS. 1-27 that elicits an immunological response.


In some embodiments, the polypeptide of the present immunogenic composition is an immunogenic variant, such as an immunogenic variant of SEQ ID NOS: 1-27. The phrase “immunogenic variant” refers to a substituted form of an amino acid sequence, for example, an amino acid sequence selected from the group consisting of SEQ ID NOS. 1-27 that elicits an immunological response. Insertions and/or deletions may also be incorporated into an immunogenic variant.


In some embodiments, the immunogenic variant of the present disclosure comprises a substitution, such as a conservative substitution for example, a substituted form of an amino acid sequence selected from the group consisting of SEQ ID NOS: 1-27 containing one or more conservative amino acid substitutions. A “conservative amino acid substitution” is one in which the amino acid residue is replaced with an amino acid residue having a side chain with a similar charge. Families of amino acid residues having side chains with similar charges have been defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine).


In general, the immunogenic fragments and variants described herein, such as fragments or variants of the polypeptides of SEQ ID NOS: 1-27 comprise at least one epitope and include at least six contiguous amino acids from the full-length protein, e.g., at least six contiguous amino acids from the cell adhesion protein set forth in SEQ ID NO: 1. More typically, the present variants or fragments will have at least 10, even more typically at least 15, and still more typically at least 19, and yet even more typically 30 contiguous amino acids from the full-length protein, e.g., the cell adhesion protein set forth in SEQ ID NO: 1.


The term “epitope” means a segment or fragment of a composition of matter, e.g., a polypeptide, which is recognized by the immune system, specifically by antibodies, B cells, or T cells. In some embodiments, the epitope is generally a fragment or fragments of a polypeptide set forth in SEQ ID NOS: 1-27.


Such fragments can be identified using any number of epitope mapping techniques, well known in the art. See, e.g., Epitope Mapping Protocols in Methods in Molecular Biology, Vol. 66 (Glenn E. Morris, Ed., 1996) Humana Press, Totowa, N.J), which is herein incorporated by reference. For example, linear epitopes may be determined by concurrently synthesizing large numbers of peptides on solid supports, the peptides corresponding to portions of a protein molecule, and reacting the peptides with antibodies while the peptides are still attached to the supports. Such techniques are known and described in the art, see e.g., U.S. Pat. No. 4,708,871; Geysen et al., 1984, Proc. Natl. Acad. Sci. U.S.A. 81,3998-4002; and Geysen et al., 1986, Molec. Immunol. 23,709-715, which are herein incorporated by reference in their entireties. Similarly, conformational epitopes are readily identified by determining spatial conformation of amino acids, such as by, e.g., x-ray crystallography and two-dimensional nuclear magnetic resonance. See Epitope Mapping Protocols, supra.


In some embodiments, the polypeptides of the present disclosure encompass polypeptides that are substantially homologous to the polypeptides set forth in SEQ ID NOS: 1-27. The substantially homologous polypeptides may be from or derived from any filarial species or genera including but not limited to Brugia, such as Brugia malyai, Wuchereria, such as Wuchereria bancrofti, Onchocerca, such as Onchocerca volvulus, Loa, such as Loa loa and Dirofilaria, such as Dirofilaria immitis. In some embodiments, the polypeptides of the present disclosure, which are substantially homologous to the polypeptides set forth in SEQ ID NOS: 1-27 are not substantially homologous to a human polypeptide.


In some embodiments, the immunogenic composition includes at least one isolated polypeptide comprising an amino acid sequence selected from SEQ ID NOS: 1-7, SEQ ID NO: 11, SEQ ID NO: 14, SEQ ID NOS: 18-20, SEQ ID NO: 22, SEQ ID NO: 23 and SEQ ID NO: 26.


In specific embodiments, the immunogenic composition includes at least one, for example, polypeptide obtained Dirofilaria immitis and displaying substantial homology to a polypeptide selected from SEQ ID NOS: 1-7, SEQ ID NO: 11, SEQ ID NO: 14, SEQ ID NOS: 18-20, SEQ ID NO: 22, SEQ ID NO: 23 and SEQ ID NO: 26.


As used herein, “homology” refers to the percent sequence identity between two polypeptide moieties. Two polypeptide sequences “display substantial homology” or are “substantially homologous” to each other when the sequences exhibit at least about 41%, such as at least about 75%, more typically at least about 80%-85%, even more typically at least about 90%, and most typically at least about 95%, 96%, 97%, 98%, 99% or more sequence identity over a defined length of the molecules. As used herein, “substantially homologous” also refers to sequences showing complete (100%) sequence identity to the polypeptide sequences.


In some embodiments, a sequence is not substantially homologous when it exhibits a sequence identity of 40% or less sequence identity.


“Sequence identity” as used herein refers to a relationship between two or more polypeptide sequences, namely a reference polypeptide sequence and a given polypeptide sequence to be compared with the reference polypeptide sequence. Sequence identity is determined by comparing the given polypeptide sequence to the reference polypeptide sequence after the polypeptide sequences have been optimally aligned to produce the highest degree of sequence similarity, as determined by the match between strings of such sequences, with gaps introduced if necessary. Upon such alignment, sequence identity is ascertained on a position-by-position basis, e.g., the sequences are “identical” at a particular position if at that position, the amino acid residues are identical. The total number of such position identities is then divided by the total number of residues in the reference sequence to give % sequence identity.


Sequence identity can be readily calculated by known methods, including but not limited to, those described in Computational Molecular Biology, Lesk, A. N., ed., Oxford University Press, New York (1988), Biocomputing: Informatics and Genome Projects, Smith, D. W., ed., Academic Press, New York (1993); Computer Analysis of Sequence Data, Part I, Griffin, A. M., and Griffin, H. G., eds., Humana Press, New Jersey (1994); Sequence Analysis in Molecular Biology, von Heinge, G., Academic Press (1987); Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M. Stockton Press, New York (1991); and Carillo, H., and Lipman, D., SIAM J. Applied Math., 48: 1073 (1988); the teachings of which are incorporated herein by reference.


Methods to determine sequence identity are codified in publicly available computer programs which determine sequence identity between given sequences. Examples of such programs include, but are not limited to, the GCG program package (Devereux, J., et al., Nucleic Acids Research, 12(1):387 (1984)), BLASTP, BLASTN and BLASTX (Altschul, S. F. et al., J. Molec. Biol., 215:403-410 (1990). The BLAST programs are publicly available from NCBI and other sources (BLAST Manual, Altschul, S. et al., NCVI NLM NIH Bethesda, Md. 20894, Altschul, S. F. et al., J. Molec. Biol., 215:403-410 (1990), the teachings of which are incorporated herein by reference). These programs optimally align sequences using default gap weights in order to produce the highest level of sequence identity between the given and reference sequences.


In specific embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to a polypeptide selected from SEQ ID NOS: 1-27.


In other specific embodiments, the immunogenic composition of the instant disclosure includes no more than one isolated polypeptide, such as no more than two, such as no more than three, such as no more than four, such as no more than five, such as no more than six, such as no more than seven, such as no more than ten, such as no more than fifteen, such as no more than twenty, such as no more than twenty-five, such as no more than twenty-six isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-27.


In further specific embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 1 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 2-27 or immunogenic fragments thereof.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 2 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1, 3 and 4-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 3 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1, 2 and 4-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 4 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-3 and 5-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 5 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-4 and 6-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 6 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-5 and 7-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 7 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-6 and 8-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 8 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-7 and 9-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 9 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-8 and 10-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 10 and optionally least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-9 and 11-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 11 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-10 and 12-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 12 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-11 and 13-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 13 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-12 and 14-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 14 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-13 and 15-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 15 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-14 and 16-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 16 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-15 and 17-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 17 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-16 and 18-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 18 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-17 and 19-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 19 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-18 and 20-27.


In some embodiments, the vaccine composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 20 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-19 and 21-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 21 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-20 and 22-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 22 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-21 and 23-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 23 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-22 and 24-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 24 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-23 and 25-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 25 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-24 and 26-27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 26 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-25 and SEQ ID NO: 27.


In some embodiments, the immunogenic composition of the instant disclosure includes at least one isolated polypeptide comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to SEQ ID NO: 27 and optionally at least one additional isolated polypeptide, such as at least two, such as at least three, such as at least four, such as at least five, such as at least six, such as at least seven, such as at least ten, such as at least fifteen, such as at least twenty, such as at least twenty-five, such as at least twenty-six additional isolated polypeptides comprising an amino acid sequence having a sequence identity of at least 75%, such as at least 80%, such as at least 85%, such as at least 90%, such as at least 95%, such as at least 97%, such as at least 98%, such as at least 99%, such as at least 100% to an amino acid sequence selected from SEQ ID NOS: 1-26.


In some embodiments, the polypeptides of the present disclosure are recombinant polypeptides. The term “recombinant polypeptide” refers to a protein produced by recombinant expression methods, such as, for example, in prokaryotic or eukaryotic host cells, or in cell-free in vitro expression systems.


The polypeptides of the present disclosure are typically expressed using an expression vector and purified. Expression vectors may be either self-replicating extrachromosomal vectors or vectors that integrate into a host genome. Generally, expression vectors include transcriptional and translational regulatory nucleic acid sequences, operably linked to the nucleic acid encoding the target protein.


In some embodiments, control sequences may be used for the expression of an operably linked coding sequence in a particular host organism. The control sequences that are suitable for prokaryotes, for example, include a promoter, optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are known to use promoters, polyadenylation signals, and enhancers. A nucleic acid sequence is “operably linked” when it is placed into a functional relationship with another nucleic acid sequence. For example, DNA for a pre-sequence or secretory leader is operably linked to DNA if it is expressed as a pre-protein that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site is operably linked to a coding sequence if it is positioned so as to facilitate translation. Operably linked DNA sequences may be contiguous or non-contiguous. Methods for linking DNA sequences are well-known in the art and include use of the polymerase chain reaction and ligation. The transcriptional and translational regulatory nucleic acid will generally be appropriate to the host cell used to express the target protein; for example, transcriptional and translational regulatory nucleic acid sequences from E. coli are typically used to express the target protein in E. coli.


Numerous types of appropriate expression vectors, and suitable regulatory sequences are known in the art for a variety of host cells. Methods for expressing polypeptides are well known in the art (e.g., Sambrook et al. (1989) Molecular Cloning, A Laboratory Manual, 2nd ed., vol. 1-3, Cold Spring Harbor Laboratory; Berger and Kimmel (1987) Guide to Molecular Cloning Techniques, Methods in Enzymology, vol. 152, Academic Press, Inc., San Diego, Calif.; Ausubel et al. (1995) Current Protocols in Molecular Biology, John Wiley & Sons, Inc., NY) herein incorporated by reference in their entireties.


The polypeptides of the present disclosure may be produced by culturing a host cell transformed with an expression vector containing nucleic acid encoding polypeptides of the present disclosure, such as encoding one or more of SEQ ID NOS: 1-27 or immunogenic fragments or variants thereof, under the appropriate conditions to induce or cause expression of the polypeptides of the present disclosure. The conditions appropriate for protein expression will vary with the choice of the expression vector and the host cell, and may be easily determined by one skilled in the art using routine experimentation.


Appropriate host cells include yeast, bacteria, archaebacteria, fungi, insect cells, and animal cells, including mammalian cells (such as human cells and cells lines). Thus, host cells include, but are not limited to, Drosophila melanogaster cells, Tetrahymena, Saccharomyces cerevisiae and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, C129 cells, 293 cells, Neurospora, BHK, CHO, COS, HeLa cells, Hep G2 cells, THP1 cell line (a macrophage cell line), and human embryonic kidney cell lines (e.g., HEK293).


Nucleic acid molecules encoding the polypeptides of the present disclosure may be cloned using standard molecular biological methods, including DNA amplification methods, such as the polymerase chain method (PCR) (see e.g., Sambrook et al. (1989)Molecular Cloning: A Laboratory Manual, 2nd ed., Cold Spring Harbour, N.Y.; Berger & Kimmel (1987) Methods in Enzymology. Vol. 152: Guide to Molecular Cloning Techniques, Academic Press, Inc., San Diego, Calif.; Co et al. (1992) J. Immunol. 148:1149), which are each herein incorporated by reference. Thus, for example, a nucleic acid molecule encoding a polypeptide of the present disclosure may be PCR-amplified using a sense primer containing one restriction site and an antisense primer containing another restriction site. This will produce a nucleic acid encoding the desired sequence or subsequence having terminal restriction sites. This nucleic acid can then readily be ligated into a vector having appropriate corresponding restriction sites. Suitable PCR primers may be chosen by one of skill in the art based on the sequence to be expressed. Appropriate restriction sites can also be added by site-directed mutagenesis as is well known in the art.


The methods of introducing exogenous nucleic acids into host cells are also well known in the art, and will vary with the host cell used. Suitable techniques include, but are not limited to, dextran-mediated transfection, calcium phosphate precipitation, polybrene mediated transfection, protoplast fusion, electroporation, viral infection, encapsulation of the nucleic acids in liposomes, and direct microinjection of nucleic acids into nuclei.


The polypeptides of the present disclosure may be isolated or purified in a variety of ways known to those skilled in the art depending on what other components are present in the sample. Standard purification methods include electrophoretic, molecular, immunological, and chromatographic techniques, including ion exchange, hydrophobic, affinity, and reverse-phase HPLC chromatography.


Some embodiments of the disclosure provide synthetic polypeptides of the present disclosure. Polypeptides having up to about 100-150 amino acid residues may be prepared by in vitro synthesis using established techniques. Synthetic polypeptides may be prepared by chemical synthesis (such as solid phase peptide synthesis) using methods known in the art.


In some embodiments, the immunogenic composition of the present disclosure includes a pharmaceutically acceptable carrier. The carrier must be “acceptable” in the sense that it is compatible with the active ingredient of the composition, and typically, capable of stabilizing the active ingredient and not deleterious to the subject to be treated. In some embodiments, the pharmaceutically acceptable carrier is a non-naturally occurring pharmaceutically acceptable carrier.


The pharmaceutically acceptable carriers (vehicles) may be conventional, but are not limited to conventional carriers (vehicle). For example, E. W. Martin, Remington's Pharmaceutical Sciences, Mack Publishing Co., Easton, Pa., 15th Edition (1975) and D. B. Troy, ed. Remington: The Science and Practice of Pharmacy, Lippincott Williams & Wilkins, Baltimore Md. and Philadelphia, Pa., 21st Edition (2006) describe compositions and formulations suitable for pharmaceutical delivery of one or more molecules and additional pharmaceutical agents.


The present immunogenic compositions may comprise buffers (e.g., sodium phosphate, histidine, potassium phosphate, sodium citrate, potassium citrate, maleic acid, ammonium acetate, tris-(hydroxymethyl)-aminomethane (tris), acetate, diethanolamine, etc.), amino acids (e.g., argenine, cysteine, histidine, glycine, serine, lysine, alanine, glutamic acid, proline), sodium chloride, potassium chloride, sodium citrate, sucrose, glucose, mannitol, lactose, glycerol, xylitol, sorbitol, maltose, inositol, trehalose, bovine serum albumin (BSA), albumin (e.g., human serum albumin, recombinant albumin), dextran, PVA, hydroxypropyl methylcellulose (HPMC), polyethyleneimine, gelatin, polyvinylpyrrolidone (PVP), hydroxyethylcellulose (HEC), polyethylene glycol (PEG), ethylene glycol, dimethylsulfoxide (DMSO), dimethylformamide (DMF), hydrochloride, sacrosine, gamma-aminobutyric acid, Tween-20, Tween-80, sodium dodecyl sulfate (SDS), polysorbate, polyoxyethylene copolymer, sodium acetate, ammonium sulfate, magnesium sulfate, sodium sulfate, trimethylamine N-oxide, betaine, zinc ions, copper ions, calcium ions, manganese ions, magnesium ions, CHAPS, sucrose monolaurate, 2-O-beta-mannoglycerate, the like, or a combination thereof.


In some embodiments, the present immunogenic compositions may comprise propellants (e.g., hydrofluoroalkane (HFA)) for aerosol delivery. In some embodiments, the immunogenic compositions of the present disclosure may be formulated as described in U.S. Pat. No. 5,192,743 that form a gel when reconstituted and which can improve stability of a protein of interest (e.g., for storage).


Immunogenic compositions of the present disclosure may be appropriately constructed for some or all routes of administration, for example topical administration (including inhalation and nasal administration), oral or enteral administration, intravenous or parenteral administration, transdermal administration, epidural administration or the like. For example, parenteral formulations usually comprise injectable fluids that include pharmaceutically and physiologically acceptable fluids such as water, physiological saline, balanced salt solutions, aqueous dextrose, glycerol or the like as a vehicle. For solid compositions (for example, powder, pill, tablet, or capsule forms), conventional non-toxic solid carriers can include, for example, pharmaceutical grades of mannitol, lactose, starch, or magnesium stearate. In addition to biologically-neutral carriers, immunogenic compositions to be administered can contain minor amounts of non-toxic auxiliary substances, such as wetting or emulsifying agents, preservatives, and pH buffering agents and the like, for example sodium acetate or sorbitan monolaurate.


In some embodiments, a parenteral formulation may comprise injectable fluids that include pharmaceutically and physiologically acceptable fluids such as water, physiological saline, balanced salt solutions, aqueous dextrose, glycerol or the like as a vehicle.


For solid compositions (for example, powder, pill, tablet, or capsule forms), conventional non-toxic solid carriers can include, for example, pharmaceutical grades of mannitol, lactose, starch, or magnesium stearate. In addition to biologically-neutral carriers, pharmaceutical compositions to be administered can contain minor amounts of non-toxic auxiliary substances, such as wetting or emulsifying agents, preservatives, and pH buffering agents and the like, for example sodium acetate or sorbitan monolaurate.


The aforementioned immunogenic compositions and protein modifications to increase protein stability can be applied as described in U.S. Patent Application 2009/032692, which is herein incorporated by reference in its entirety.


In some embodiments the pharmaceutically acceptable carrier of the present disclosure includes an adjuvant, such as a non-naturally occurring adjuvant. As used herein, an “adjuvant” is understood as a pharmacological or immunological agent that modifies the effect of other agents (e.g., immunogen or target antigen in an immunogenic composition) while having few if any direct effects when given by itself. In some embodiments, an adjuvant will enhance the recipient's immune response to the polypeptides in the present immunogenic composition while keeping the injected foreign material at a minimum.


Suitable adjuvants are well known in the art (see, for example, Vaccine Design-The Subunit and Adjuvant Approach (1995) Pharmaceutical Biotechnology, Volume 6 (eds. Powell, M. F., & Newman, M. J.) Plenum Press, New York and London, ISBN 0-306-44867-X), which is incorporated herein by reference in its entirety. Examples of adjuvants include, but are not limited to, alum-precipitate, Freund's complete adjuvant, Freund's incomplete adjuvant, monophosphoryl-lipid A/trehalose dicorynomycolate adjuvant, water in oil emulsion containing Corynebacterium parvum and tRNA, and other substances that accomplish the task of increasing immune response by mimicking specific sets of evolutionarily conserved molecules including liposomes, lipopolysaccharide (LPS), molecular cages for antigen, components of bacterial cell walls, and endocytosed nucleic acids such as double-stranded RNA, single-stranded DNA, and unmethylated CpG dinucleotide-containing DNA. Other examples include cholera toxin, E. coli heat-labile enterotoxin, liposome, immune-stimulating complex (ISCOM), immunostimulatory sequences oligodeoxynucleotide, and aluminum hydroxide.


Other exemplary adjuvants include the adjuvants described in Lanar et al., U.S. Pat. No. 7,029,685 and U.S. Patent Publication No. 2006/0073171, herein incorporated by reference in their entireties. Alternatively, the polypeptides of the immunogenic composition described herein can be used without any adjuvant.


Methods


The present disclosure is further directed to methods for preventing or treating filarial diseases. As used herein, “filarial diseases” refer to diseases caused by thread-like nematodes (filariae) that belong to the roundworm superfamily filarioidea family Filariidae. Such diseases include, but are not limited to lymphatic filariasis, river blindness, loiasis or heartworm. As used herein “preventing” refers to the administration of a therapeutically effective amount of a polypeptide, immunogenic composition, such as a vaccine, of the present disclosure to an animal in order to protect the animal from the development of, for example, lymphatic filariasis, river blindness, loiasis, heartworm or the symptoms thereof. In some embodiments, the immunogenic composition of the disclosure is administered to a subject that is at risk for developing a lymphatic filariasis, river blindness, loiasis or heartworm.


By “treating” a disease associated with filarial infection such as lymphatic filariasis, river blindness, loiasis or heartworm is intended administration of a therapeutically effective amount of a polypeptide, immunogenic composition, such as a vaccine of the present disclosure to an animal that has, for example, lymphatic filariasis, river blindness, loiasis or heartworm or that has been exposed to a filarial infection, such as a filarial worm selected from Brugia malyai, Wuchereria bancrofti, Onchocerca volvulus, Loa loa, or Dirofilaria immitis, where the purpose is to cure, heal, alleviate, relieve, alter, remedy, ameliorate, improve, or affect the condition or the symptoms of the lymphatic filariasis, river blindness, loiasis or heartworm.


The immunogenic compositions of the present disclosure may be administered to a subject. In some embodiments, the subject is a mammal, e.g., a non-human primate such as a baboon or macaque. In some embodiments, the subject mammal is a human. The subject mammal can also include, but is not limited to, pet or companion animals (e.g. cats, dogs, house rabbits, ferrets, rodents, including gerbils, hamsters, chinchillas, rats, mice, guinea pigs, etc); working animals, such as guide animals (e.g., monkeys; herding animals, etc.), draught animals (e.g., draught horses, oxen, camels, elephants, oxen, camels, donkeys) and sport animals (e.g., racing or show-jumping horses); livestock (e.g., alpaca, banteng, bison, camel, cattle, deer, donkey, gayal, goat, llama, mule, pig, reindeer, sheep, water buffalo, yak, etc.); laboratory animals (e.g., mice, rabbits, rats, non-human primates); and undomesticated animals held in captivity, e.g. in zoological parks and the like. In other embodiments, particularly when the disease being prevented or treated is heartworm, the subject is a dog or a cat, typically a dog.


In various embodiments, a therapeutically effective amount of the immunogenic composition described herein is administered to the subject. As used herein “a therapeutically effective amount” is an amount that provides a therapeutic effect for a given condition and administration regimen. In particular aspects of the disclosure, a “therapeutically effective amount” refers to an amount of a polypeptide, immunogenic composition, or vaccine of the present disclosure that when administered to an animal brings about a positive therapeutic response with respect to the prevention or treatment of a subject for diseases or conditions associated with filarial worms such as Brugia malyai, Wuchereria bancrofti, Onchocerca volvulus, Loa loa, or Dirofilaria immitis. For example, a positive therapeutic response in regard to treating diseases or conditions associated with filarial worms includes curing or ameliorating the symptoms of the disease.


A positive therapeutic response with respect to preventing a condition associated with a filarial infection includes, for example, the production of filarial antibodies by the subject in a quantity sufficient to protect against development of the disease. The production of antibodies elicited by a treatment is readily ascertained by obtaining a plasma or serum sample from the subject to which an immunogenic composition is administered, and assaying the antibodies therein for their ability to bind to the polypeptide(s) used to elicit the immune response to filarial worms. Exemplary methods include, but are not limited to, ELISA assays, immunofluorescence assays (IFA), or other immunoassays such as a Western blots, as is well known in the art.


The exact dosage is chosen by the individual physician in view of the patient to be treated. Dosage and administration are adjusted to provide sufficient levels of the active moiety or to maintain the desired effect. Additional factors that may be taken into account include the prevalence of filarial worms in the geographical vicinity of the patient, the severity of the disease state of the patient, age, and weight of the patient, diet, time and frequency of administration, drug combination(s), reaction sensitivities, and tolerance/response to therapy. An appropriate effective amount may be readily determined using only routine experimentation. Several doses may be needed per individual in order to achieve a sufficient response to effect treatment. Suitable regimes for initial administration and follow-up administration (e.g., booster shots) are also variable, but are typified by an initial administration followed in intervals (weeks or months) by a subsequent administration. In some embodiments, typical dosages may range from about 0.01 to about 20 mg/kg, and more particularly from about 0.1 to about 10 mg/kg.


EXAMPLES
Example 1
Materials and Methods

Dissections


Adult Brugia malayi worms were received in multiple shipments from TRS Labs (Athens, Ga.) and frozen at −80° C. until processing. For separation of anatomic structures, worms were thawed at room temperature and then dissected using a stereomicroscope and fine tipped forceps. One set of forceps was used to grip and steady the center of the parasite after thawing and placement into a petri dish filled with phosphate buffered saline (PBS). Another set of forceps was used to grasp and gently twist the parasite close to the first set of forceps, resulting in a tear of the body wall. The cephalic tip of the body wall was then grasped and gently peeled away from the rest of the organs. The caudal portion of the body wall was then peeled away from the intestines and uterine tubes (FIG. 2). Reproductive organs were identified by their anterior junction and then separated from the intestine. Each anatomic fraction (intestine, reproductive tract, and body wall) was placed in a microcentrifuge tube filled with PBS. These were stored at −20° C. until protein extraction.


Protein Extraction


The samples were thawed and then centrifuged in 1.5 ml eppendorf tubes. The pelleted tissues were frozen and thawed 4 times by cycling through placement on dry ice for 10 minutes followed by placement in a 37° C. water bath. Using a mini disposable micropestle, the samples were homogenized with 50 μl of UPX extraction buffer (Expedeon). The micropestle was washed with 50 μl of UPX extraction buffer and processed as per the manufacturer's instructions. In brief, samples were placed in a 100° C. water bath for 5 minutes, removed and cooled at 4° C. for one hour. Samples were then centrifuged at 15,000×g for 10 minutes and supernatant was collected.


Protein Concentrations were measured by BCA assay. 400 μg proteins of intestine, body wall and reproductive tract each were reduced, alkylated and trypsin digested overnight following filter-aided digestion procedure using a FASP digestion kit (Protein Discovery, San Diego, Calif.) according to vendor protocol. Tryptic peptides were further desalted, lyophilized and reconstituted in 25% acetonitrile with 0.1% formic acid and further fractionated using strong cation exchange (SCX) chromatography. The SCX fractions of the three samples were pooled into 16 to 18 fractions each, lyophilized and reconstituted in 0.1% trifluoroacetic acid to be analyzed by liquid chromatography-mass spectrometry (LC-MS).


Nanobore Reversed-Phase Liquid Chromatography Tandem MS (nanoRPLC-MSMS)


Nanobore RPLC-MSMS was performed using an Agilent 1200 nanoflow LC system coupled online with a LTQ Orbitrap XL mass spectrometer. The RPLC column (75 μm i.d.×10 cm) were slurry-packed in-house with 5 μm, 300 Å pore size C-18 stationary phase into fused silica capillaries with a flame pulled tip. After sample injection, the column was washed for 20 minutes with 98% mobile phase A (0.1% formic acid in water) at 0.5 μl/min. Peptides were eluted using a linear gradient of 2% mobile phase B (0.1% formic acid in acetonitrile) to 35% B in 100 minutes, then to 80% B over an additional 40 minutes. The column flow-rate was maintained at 0.25 μl/min throughout the separation gradient. The mass spectrometer was operated in a data-dependent mode in which each full MS scan was followed by seven MS/MS scans wherein the seven most abundant molecular ions were dynamically selected for collision-induced dissociation (CID) using a normalized collision energy of 35%.


Protein Identification


The LC-MS/MS data were searched using SEQUEST through Bioworks interface against a combined database of Brugia malayi database downloaded from The Institute for Genomic Research (TIGR) and the Wolbachia database from New England Biolabs (Beverly, Mass.). SEQUEST was searched with a fragment ion mass tolerance of 0.50 Da and a parent ion tolerance of 25 PPM. Carbamidomethyl of cysteine was specified in SEQUEST as a fixed modification. Oxidation of methionine was specified in SEQUEST as a variable modification. Scaffold (version Scaffold 3.5.2, Proteome Software Inc., Portland, Oreg.) was used to validate MS/MS based peptide and protein identifications. Peptide identifications were accepted if they could be established at greater than 95.0% probability by the Peptide Prophet algorithm (Keller et al., 2002, Anal. Chem., 74, 5383-5392, herein incorporated by reference). Protein identifications were accepted if they could be established at greater than 99.0% probability and contained at least 2 identified peptides. Protein probabilities were assigned by the Protein Prophet algorithm (Nesvizhskii et al., 2003, Anal. Chem., 75, 4646-4658, herein incorporated by reference). Proteins that contained similar peptides and could not be differentiated based on MS/MS analysis alone were grouped to satisfy the principles of parsimony. TIGR accession numbers were matched to PUB_loci from the proteome published by Bennuru et. Al., 2011, Proc. Natl. Acad. Sci. U.S.A., 108, 9649-9654, herein incorporated by reference.


Heat Map Analysis


A heat map of the relative abundance of each protein, defined as the number of unique matching peptides within each anatomic fraction, was made with JMP software.


Quantitative Analysis


Protein quantitation was determined by normalized spectral abundance. This approach provides a theoretical quantitative value useful for determining relative abundance of a single protein between samples (McIlwain et al., 2012, BMC Bioinformatics, 13, 308, Paoletti et al., 2006, Proc. Natl. Acad. Sci. U.S.A., 103, 18928-18933) and an estimation of relative abundance between different proteins in one sample (Liu et al., 2004, Anal. Chem., 76, 4193-4201). Exclusive spectral counts, spectra that match to only 1 protein, were first divided by the length of the protein to account for the differences in numbers of possible spectra. This calculation provides the spectral abundance factor. This was then normalized to obtain the normalized spectral abundance factor (NSAF) by dividing by the sum of the total spectral abundance factors found within that anatomic fraction.






NSAF
=




(

Spectra
Length

)


p





p
=
1

n




(

Spectra
length

)


p



.





NSAF enrichment was then calculated by dividing the NSAF of a given protein in the target fraction divided by the sum of the NSAF of the other two fractions to determine whether a protein was more abundant or “enriched” in one fraction compared to the others.







NSAF





enrichment

=



NSAF






(

target





fraction

)



NSAF





of





other





two





fractions






(
added
)



.





Proteins were considered enriched when they had an NSAF enrichment value of 2 or greater.


Functional Categories for Gene Set Enrichment Analysis (GSEA)


The proteome of B. malayi had previously been functionally characterized by Bennuru and colleagues (Bennuru, et al., 2011). For proteins previously annotated for function, no further analysis of function was carried out. The 665 newly identified proteins were annotated based loosely on the KOG and PFAM functions. Categories of function were used as previously described (Bennuru, et al., 2011), including cytoskeletal, extracellular matrix, immunological, metabolism, nuclear regulation, protein export, protein modification, protein synthesis, signal transduction, transcription, transporters, and uncharacterized. Functions of anatomic fractions were analyzed based on GSEA, which analyzes the data for bias in a condition (or anatomic fraction) (Subramanian et al., 2005, Proc. Natl. Acad. Sci. U.S.A., 102, 15545-15550, herein incorporated by reference). Proteins were ranked according to abundance using spectral counts. A priori defined sets of proteins, based on functional annotation, were then analyzed using GSEA for bias within each anatomic fraction.


BLASTp


BLASTp was performed on proteins of interest from B. malayi to identify similarity among W. bancrofti, O. volvulus, D. immitis, L. boa, and H. sapiens. BLAST query was conducted with blast+ 2.2.29 downloaded from NCBI. Protein databases for W. bancrofti, O. volvulus, H. sapiens, L. loa were downloaded from uniprotKB. Protein database for D. immitis was downloaded from nematodes.org. A FASTA file containing the B. malayi proteins of interest were blasted against each other genome individually. Percent identity and query coverage were recorded for the top scoring sequence for each protein. Score is determined by an algorithm that takes into account similarity of AA sequence, gaps in homologous regions, and length of homology. Percent identity is defined as the percentage of amino acids that match perfectly over the sequence region with greatest homology.


Example 2
Results

Distinct Anatomic Fractions Exhibit Markedly Different Expression of Proteins


Based on a match of 2 unique peptides to a protein, we identified a total of 5023 proteins. Of these, 204 were Wolbachia proteins, and 34 could be matched to more than 1 specific protein, leaving 4,785 specifically identified B. malayi proteins. While 1,895 of the proteins were identified by two peptides in all three anatomical fractions of the parasite, 396 proteins were identified solely within the intestine, 114 solely within the body wall, and 1011 solely within the uterine tubes (data not shown). Additionally, although the majority of proteins were present in all three anatomic fractions, we found that there was a differential expression of each protein among the anatomic fractions (FIG. 3).


Proteomic Profiling of the B. malayi Intestine is Consistent with Functional Absorption and Digestion


Like all nematodes, filarial parasites have a fully formed intestine. However, the functionality of this tract is not completely clear (Munn, E. A. and Munn, P. D., 2002, Feeding and Digestion. In: Lee, D. L., (Ed., The Biology of Nematodes. CRC Press Taylor and Francis Group, Boca Raton, Fla., pp. 211-233, herein incorporated by reference). We performed several analyses to further elucidate the possible function of the intestine in B. malayi. First, gene set enrichment analysis (GSEA) was performed which showed a bias for proteins with transporter function to be present within the intestine (FIG. 4). Next, we rank ordered the proteins that were enriched within the intestine based on their NSAF value, a measure which takes into account the number of spectra uniquely matching to a protein and the length of the protein in amino acids. Spectral counting has previously been shown to be useful to determine relative abundance of a single protein in different samples (McIlwain, et al., 2012, Paoletti, et al., 2006) and provide a reasonable approximation of protein abundance within a sample compared to other proteins in the same sample (Liu, et al., 2004). Of the 20 most abundant, enriched, and named intestine proteins, 3 are proteolytic enzymes (Bm1_00205, Bm1_18805, Bm1_34740), 2 are transporters (Bm1_42930, and Bm1_24840), and 1 is associated with phagocytosis (Bm1_02265). The abundance of such proteins suggests the intestine is involved in both digestion and active absorption of nutrients. Of the remaining 20 most abundant named proteins in the intestine, 3 are muscle associated proteins (Bm1_28910, Bm1_45035, Bm1_00655) and the rest are involved in various functions including translation, cell trafficking, RNA binding, cell adhesion, hydrolysis, lipid metabolism, catabolism, and cellular structure.


Table 1, below, depicts the twenty most abundant proteins, with proper names, enriched in the intestine of adult female B. malayi based on normalized spectral abundance factor (NSAF). The NSAF enrichment values are also depicted. “Specific” means that the protein was only found within the intestine.













TABLE 1








Abundance
NSAF





NSAF
enrichment


Protein Type
Accession
Name
Intestine
Intestine



















Translational
Bm1_41515
40S ribosomal protein S21, putative
5.4E−03
2.17


Muscle
Bm1_28910
Calsequestrin, skeletal muscle
2.5E−03
2.90


Associated

isoform precursor, putative


Cell trafficking
Bm1_14235
SNARE domain containing protein
2.4E−03
2.0


Muscle
Bm1_45035
Probable myosin regulatory light
2.2E−03
2.5


Associated

chain, putative


Protease
Bm1_34740
aspartic protease BmAsp-1,
1.1E−03
16.0




identical


carrier protein
Bm1_21135
Acyl CoA binding protein
9.0E−04
7.68


Muscle
Bm1_00655
myosin heavy chain, nonmuscle
7.8E−04
2.0


Associated

type 1, putative


Phagocytosis
Bm1_02265
MGC69076 protein-related
7.3E−04
3.77


associated


Xenobiotic
Bm1_13480
UDP-glucoronosyl and UDP-
7.0E−04
28.16


metabolism

glucosyl transferase family protein


RNA binding
Bm1_20295
Glycine-rich RNA-binding protein.-
6.9E−04
8.96




related


Miscellaneous
Bm1_25280
Prion-like--related
6.4E−04
2.37


Cell Adhesion
Bm1_10500
AMOP domain containing protein
6.1E−04
5.99


Hydrolase
Bm1_24820
Histidine acid phosphatase family
6.1E−04
6.32




protein


Cytoskeleton
Bm1_30265
Tubulin alpha chain, putative
5.9E−04
2.95


Transporter
Bm1_42930
Excitatory amino acid transporter,
5.7E−04
2.75




putative


Lipid
Bm1_08150
NAD-dependent malic enzyme,
5.5E−04
7.10


Metabolism

mitochondrial precursor, putative


Catabolism
Bm1_48185
putative amidase
5.1E−04
3.74


Transporter
Bm1_24840
Major Facilitator Superfamily
4.7E−04
19.88




protein


Protease
Bm1_18805
Papain family cysteine protease
4.4E−04
Specific




containing protein


Protease
Bm1_00205
ShTK domain containing protein
4.2E−04
3.42









Many Predominant Body Wall Enriched Proteins Provide Muscular Structure or are Involved in Muscular Contraction


The body wall of B. malayi includes, from superficial to deep, the epicuticle, cuticle, epidermis, musculature (divided into a superficial fibrous portion and a deeper metabolically active portion) and the lateral cords (FIG. 4). The musculature is separated into quadrants by the lateral, ventral and dorsal cords with up to 9 myocytes per quadrant (Vincent et al., 1975, J. Parasitol., 61, 499-512.). The lateral cords contain the cell bodies of the epidermis, which produces and maintains the cuticle. Also associated with the lateral cords is a secretory gland which is connected to the secretory pore by the secretory canal (Landmann et al., 2010, PLoS Negl. Trop. Dis., 4, e758) herein incorporated by reference. The ventral and dorsal cords are associated with nerves that innervate the musculature.


GSEA of the body wall showed a bias for cytoskeletal proteins and proteins of immunological interest to be present within the body wall (FIG. 5). Further, analysis of the 20 most abundant named proteins that were enriched within the body wall by NSAF yielded 12 proteins associated with muscle structure or regulation of muscular contraction (Table 2). These included an actin (Bm1_21705), 4 myosins (Bm1_40715, Bm1_50805, Bm1_00935, Bm1_14060), 2 paramyosins (Bm1_04450, Bm1 02615), 1 tropomyosin (Bm1_02060), and a disorganized muscle protein (Bm1_40320). In addition to the muscular proteins, there were 3 cuticular proteins, a glutathione peroxidase, which provides protection from oxidative damage, a cytoskeletal protein, a heat shock protein, and a glutamine synthetase.


Table 2, below, depicts the twenty most abundant proteins enriched in the body wall of adult female Brugia malayi.













TABLE 2








Abundance
NSAF





(NSAF)
enrichment


Protein type
Accession
Name
Body Wall
Body Wall



















Muscle
Bm1_21705
actin 1, putative
6.2E−02
3.86


associated


Cytoskeletal
Bm1_45215
intermediate filament protein, putative
2.0E−02
2.94


Muscle
Bm1_40320
Disorganized muscle protein 1,
1.6E−02
5.15


associated

putative


HSP
Bm1_19805
small heat shock protein, putative
1.4E−02
7.97


Muscle
Bm1_04450
Paramyosin, putative
1.1E−02
5.87


associated


Muscle
Bm1_02615
Paramyosin, identical
1.0E−02
6.33


associated


Calcium
Bm1_48810
EF hand family protein
6.5E−03
9.81


Binding


Cuticle
Bm1_13015
Nematode cuticle collagen N-terminal
6.1E−03
3.05




domain containing protein


Muscle
Bm1_01235
Tropomyosin-related
6.0E−03
5.24


associated


Muscle
Bm1_49075
Calponin homolog OV9M, putative
5.9E−03
3.31


Associated


Muscle
Bm1_40715
myosin heavy chain, putative
5.8E−03
3.11


associated


Cuticle
Bm1_54705
Nematode cuticle collagen N-terminal
5.5E−03
6.57




domain containing protein


Muscle
Bm1_50805
Myosin tail family protein
4.7E−03
4.00


associated


Antioxidant
Bm1_40465
Cuticular glutathione peroxidase
4.5E−03
2.59




precursor, putative


Muscle
Bm1_00935
myosin heavy chain B (MHC B),
4.4E−03
3.36


associated

putative


Carbohydrate
Bm1_16060
carbohydrate phosphorylase, putative
4.2E−03
2.67


metabolism


Muscle
Bm1_14060
myosin heavy chain B (MHC B),
4.1E−03
2.23


associated

putative


Cuticle
Bm1_17485
Nematode cuticle collagen N-terminal
3.2E−03
2.40




domain containing protein


Muscle
Bm1_02060
Tropomyosin family protein
3.2E−03
3.19


associated


Amino Acid
Bm1_53470
glutamine synthetase, putative
3.0E−03
3.16


Synthesis









Nuclear Regulatory Proteins, Including Those Involved in Chromatin Organization are Enriched and Highly Abundant in the Reproductive Tract


The nematode female reproductive tract consists of two ovaries where gamete production takes place, two seminal receptacles (aka spermatheca) which store sperm obtained from males, and 2 uterine tubes that allow for embryo and subsequent in utero microfilaria development (FIG. 1). The two uterine tubes merge into the vulva, which is on the ventral surface of the worm in the cephalic region (Fischer et al., 2011, PLoS Negl. Trop. Dis., 5, e1174. Jiang et al., 2012, Int. J. Parasitol., 42, 841-850, Landmann, et al., 2010, Li et al., 2012, BMC Genomics, 13, 184, which are herein incorporated by reference in their entireties). GSEA showed a bias for transcription and nuclear regulation proteins to be present within the female reproductive tract (FIG. 6). Similarly the 20 most abundant named proteins that were enriched in the reproductive tract as assessed by NSAF contained many proteins involved in nuclear regulation. 12 proteins contained domains associated with nucleotide binding or splicing, with 8 of these 12 being histones or histone linkers (Bm1_02505, Bm1_02515, Bm1_20280, Bm1_02495, Bm1_, 20285, Bm1_, 38685, Bm1_02800, Bm1_04110). Three microfilarial sheath proteins were also abundant and enriched within the reproductive tract, which is consistent with presence of developing microfilariae within the uterine tubes. The remaining 7 proteins are involved in trafficking, protection from oxidation, xenobiotic metabolism, proteolysis and cell adhesion.


Table 3, below, depicts the most abundant enriched named proteins in the reproductive tract of adult female Brugia malayi. Specific means that the protein was only found within the reproductive tract.













TABLE 3








Abundance
NSAF





NSAF
Enrichment





Reproductive
Reproductive


Protein Type


tract
tract



















Chromatin
Bm1_02505
histone H2A, putative
3.4E−02
8.1


organization


Chromatin
Bm1_02515
histone H4, putative
3.1E−02
2.7


organization


Chromatin
Bm1_20280
Probable histone H2B 3,
1.1E−02
4.3


organization

putative


Chromatin
Bm1_02495
histone H3, putative
7.8E−03
25.9


organization


Sheath
Bm1_19100
Major microfilarial sheath
6.2E−03
2.6




protein precursor.-related


Chromatin
Bm1_20285
histone H2A, putative
5.9E−03
17.2


organization


Chromatin
Bm1_38685
Histone H2A variant, putative
3.4E−03
2.7


organization


Sheath
Bm1_05185
sheath protein 5, identical
2.8E−03
2.9


Trafficking
Bm1_07925
peroxisomal membrane
2.2E−03
2.6




anchor protein, putative


Antioxidant
Bm1_44840
Glutathione S-transferase, N-
2.0E−03
2.3




terminal domain containing




protein


DNA
Bm1_25620
high mobility group protein,
1.7E−03
10.2


binding

putative


Sheath
Bm1_00650
microfilarial sheath protein,
1.2E−03
2.4




identical


RNA
Bm1_49560
NOP5/NOP58, putative
1.2E−03
2.2


splicing


RNA
Bm1_49460
small nuclear
1.1E−03
2.3


modulation

ribonucleoprotein-associated




protein homolog F9F13.90 -





Arabidopsis thaliana, putative



Chromatin
Bm1_57630
retinoblastoma-binding
1.1E−03
2.6


organization

protein., putative


Chromatin
Bm1_04110
linker histone H1 and H5
9.6E−04
4.7


organization

family protein


Xenobiotic
Bm1_32235
Flavin-binding
9.4E−04
2.6


metabolism

monooxygenase-like family




protein


Chromatin
Bm1_02800
Histone H2B 2, putative
9.0E−04
Specific


organization


Protease
Bm1_45620
Trypsin family protein
8.9E−04
36.5


Cell
Bm1_17270
Fasciclin domain containing
8.1E−04
2.8


Adhesion

protein









Identification of Potential Intestine Vaccine Candidates


To identify gastrointestinal proteins that could potentially be used as vaccine candidates, we analyzed the proteomics set for proteins that were enriched in the intestine, had at least one predicted transmembrane domain, and were not predicted to be in the mitochondria. We sought luminal surface proteins because these proteins may be accessible to host antibodies after vaccination. 106 proteins were identified with these criteria (Table A and Table B, below, provided after the Examples) and were categorized based on likely metabolic function within the cell. The amino acid sequences identified by Genbank Accession numbers in Table. A and Table B are herein incorporated by reference.


In order to evaluate the potential of the 106 surface proteins as pan-filarial vaccine candidates, we performed a blast search for each protein against databases for Homo sapiens, Wuchereria bancrofti, which causes lymphatic filariasis, Onchocerca volvulus, which causes river blindness, L. loa, which causes loiasis, and Dirofilaria immitis, which causes heartworm. Proteins were selected that contained a percent identity of >75% to W. bancrofti or O. volvulus and <40% homology to humans. 72 proteins matched these criteria.


We then selected those proteins that had 1-2 transmembrane domains for ease of recombinant protein production. These were evaluated with Interpro software for the presence of non-cytoplasmic domains that could be bound by host antibodies. 27 proteins matched all of these criteria (Tables C and D, below, provided after the Examples), with 12 displaying substantial homology between all of the filarial species. Of these 27 proteins, 10 are hypothetical proteins, 3-4 are proteases, 2 are involved in xenobiotic metabolism using glucuronidation, 2 participate in cell adhesion, 2 function in cell signaling, and 2 are chaperones.


Certain Excretory/Secretory (ES) Products are Associated with Specific Anatomic Fractions


227 proteins were found within the ES product of adult female B. malayi in a previous study by Bennuru and colleagues (Bennuru et al., 2009, PLoS Negl. Trop. Dis., 3, e410, herein incorporated by reference). To better define the origin of these proteins, we analyzed all adult female ES proteins for enrichment within any of the three worm fractions from this study. Four (1.7%) of these proteins were either enriched or specific to the intestine (Table 4). The most notable of these was the papain family cysteine protease (Bm1_18805). Eight (3.5%) female ES products were enriched within the body wall (Table 5), including two proteins that protect against oxidative damage, cuticular glutathione peroxidase (Bm1_40465) and peptide methionine sulfoxide reductase (Bm1_10795) (Weissbach, et al., 2005). Other ES products enriched within the body wall included a cuticle collagen (Bm_13015), and muscular proteins.


There were 30 adult female ES products (13%) enriched within the female reproductive tract (Table 6). Some of these antigens include Juv-p120 (Bm1_18010), which has been implicated in being critical for MF survival, Von willebrand factor type A domain containing protein (Bm1_27495), which likely binds to collagen, a trypsin inhibitor (Bm1_03520), and an aspartyl amino peptidase (Bm1_16690).


Tables 4, 5, and 6 below depict the intestine-enriched, body-wall enriched, and reproductive tract-enriched ES products, respectively. “Specific” means that the protein was only was only identified within the specified anatomical fraction.












TABLE 4







Abundance
NSAF




(NSAF)
Enrichment


ID
Name
Body wall
Body wall







Bm1_13015
Nematode cuticle collagen
6.1E−03
3.05



N-terminal domain containing



protein


Bm1_50805
Myosin tail family protein
4.7E−03
4.00


Bm1_40465
Cuticular glutathione
4.5E−03
2.59



peroxidase precursor, putative


Bm1_39425
protein unc-22, putative
8.8E−04
7.42


Bm1_26690
Prion-like-, putative
8.0E−04
4.73


Bm1_12515
Immunoglobulin I-set
4.5E−04
7.68



domain containing protein


Bm1_45145
Ryanodine Receptor
1.4E−04
3.67



TM 4-6 family protein


Bm1_10795
Peptide methionine
9.6E−05
Specific



sulfoxide reductase family



protein



















TABLE 5







Abundance
NSAF




(NSAF)
Enrichment


ID
Name
Body wall
Body wall







Bm1_13015
Nematode cuticle collagen N-
6.1E−03
3.05



terminal domain containing



protein


Bm1_50805
Myosin tail family protein
4.7E−03
4.00


Bm1_40465
Cuticular glutathione
4.5E−03
2.59



peroxidase precursor, putative


Bm1_39425
protein unc-22, putative
8.8E−04
7.42


Bm1_26690
Prion-like-, putative
8.0E−04
4.73


Bm1_12515
Immunoglobulin I-set domain
4.5E−04
7.68



containing protein


Bm1_45145
Ryanodine Receptor
1.4E−04
3.67



TM 4-6 family protein


Bm1_10795
Peptide methionine sulfoxide
9.6E−05
Specific



reductase family protein



















TABLE 6







Abundance
NSAF




(NSAF)
Enrichment




Reproductive
Reproductive


ID
Name
Tract
Tract







Bm1_29260
60S ribosomal protein L34, putative
4.6E−04
2.4


Bm1_20440
Biotin/lipoate A/B protein ligase family
4.5E−04
2.8



protein


Bm1_43080
hypothetical protein
3.7E−04
2.1


Bm1_02485
Potential global transcription activator
2.5E−04
2.6



SNF2L, putative


Bm1_16970
hypothetical protein
1.8E−04
2.7


Bm1_16690
Aspartyl aminopeptidase, putative
1.6E−04
3.4


Bm1_18010
excretory/secretory protein Juv-p120
1.4E−04
3.4



precursor-related


Bm1_21390
RNA binding protein, putative
1.3E−04
2.0


Bm1_48000
RAS FAMILY PROTEIN
1.1E−04
2.7


Bm1_12225
G-patch domain containing protein
1.0E−04
3.9


Bm1_03520
Kunitz/Bovine pancreatic trypsin inhibitor
9.9E−05
2.8



domain containing protein


Bm1_02770
Galactosyltransferase family protein
8.5E−05
2.6


Bm1_48025
Alpha-catulin, putative
7.5E−05
3.5


Bm1_41495
Gex interacting protein protein 4, isoform c-
7.2E−05
7.0



related


Bm1_18480
HYPOTHETICAL PROTEIN
7.2E−05
Specific


Bm1_46460
26S PROTEASOME REGULATORY
6.4E−05
5.4



CHAIN 4, PUTATIVE


Bm1_49790
tRNA modification GTPase TrmE family
5.1E−05
2.0



protein


Bm1_27495
von Willebrand factor type A domain
4.9E−05
2.1



containing protein


Bm1_46930
HYPOTHETICAL PROTEIN
4.4E−05
2.1


Bm1_21025
hypothetical protein, conserved
4.3E−05
3.4


Bm1_25670
hypothetical protein
4.0E−05
Specific


Bm1_41650
hypothetical protein
3.2E−05
4.1


Bm1_54890
RNA recognition motif.
3.0E−05
2.0


Bm1_16685
hypothetical protein
2.7E−05
2.7


Bm1_40395
Phosphatidylinositol 3- and 4-kinase family
2.6E−05
4.3



protein


Bm1_00750
RhoGEF domain containing protein
1.9E−05
3.4


Bm1_25450
Formin Homology 2 Domain containing
1.8E−05
Specific



protein


Bm1_05305
HYPOTHETICAL PROTEIN,
1.7E−05
Specific



CONSERVED


Bm1_42420
Eye-specific diacylglycerol kinase, putative
1.6E−05
Specific


Bm1_17115
conserved hypothetical protein
9.8E−06
Specific









Summary of the Results


In conclusion, the results detail the proteins found within the major anatomic fractions of B. malayi, including the intestine, body wall, and reproductive tract. The results suggest that the intestine of adult filarial worms likely plays an important role in digestion and absorption, and may have other physiologic functions that have not yet been characterized. Further, we have identified vaccine candidates from the B. malayi intestine that that could be protective against all major filarial pathogens of humans, and which may provide protective efficacy as vaccines against the causative agents of lymphatic filariasis, river blindness, loiasis as well as heartworm.


Sequences


The sequence identifiers described herein and the sequences set forth in the following sequence listing correspond to the Accession numbers and descriptions described in Table 7, below.












TABLE 7





SEQ ID

Gene Symbol and



NO:

Protein Type
Description




















Cell Adhesion



1
XP_001899381
Bm1_39630
Immunoglobulin I-set





domain containing





protein


2
XP_001892066
Bm1_02820
EGF-like domain





containing protein




Cell Signaling


3
XP_001897556
Bm1_30585
Tyrosine-protein kinase





abl-1.-related


4
XP_001895334
Bm1_19395
Protein kinase domain





containing protein


5
XP_001899110
Bm1_38285
Ser/Thr protein





phosphatase family





protein




Chaperone/HSP


6
XP_001894589
Bm1_15660
DnaJ domain containing





protein


7
XP_001895946
Bm1_22450
hemimethylated DNA





binding domain





containing protein




Glycosylation/




glucuronidation


8
XP_001900394
Bm1_44655
Fukutin.-related


9
XP_001894161
Bm1_13480
UDP-glucoronosyl and





UDP-glucosyl transferase





family protein




Miscellaneous


10
XP_001901384
Bm1_49590
CG3054-PA-related


11
XP_001893572
Bm1_10500
AMOP domain





containing protein


12
XP_001901064
Bm1_48010
EGF-like domain





containing protein




Proteases


13
XP_001899113
Bm1_38300
Peptidase family M1





containing protein


14
XP_001893672
Bm1_11005
MGC84665 protein-





related


15
XP_001902078
Bm1_53050
Reprolysin




Possible




Proteases


16
XP_001891567
Bm1_00205
ShTK domain containing





protein




Protease




Inhibitors


17
XP_001893428.1
Bm1_09775
“serpin, putative”




Hypothetical




Proteins


18
XP_001901910
Bm1_52210
hypothetical protein


19
XP_001902925
Bm1_57335
“Conserved hypothetical





protein, putative”


20
XP_001895546
Bm1_20460
hypothetical protein


21
XP_001900482
Bm1_45100
hypothetical protein


22
XP_001893045
Bm1_07875
CONSERVED





HYPOTHETICAL





PROTEIN


23
XP_001894967
Bm1_17550
hypothetical protein


24
XP_001893039
Bm1_07845
hypothetical protein


25
XP_001894908
Bm1_17255
hypothetical protein


26
XP_001895519
Bm1_20325
Hypothetical protein-





conserved


27
XP_001900708
Bm1_46230
hypothetical protein
















TABLE A







BLAST-P OF INTESTINE ENRICHED NON-MITOCHONDRIAL PROTEINS WITH TRANSMEMBRANE


DOMAINS AGAINST W. BANCROFTI, O. VOLVULUS AND H. SAPIENS











B. malayi


H. sapiens


W. bancrofti


O. volvulus















Protein Type Gene Symbol/


Query

Query

Query


Accession No./GI No.
Description
% Ident*
cov.**
% Ident*
Cov**
% Ident*
Cov**
















Acyltransferase




















Bm1_43465/
Temporarily assigned gene name protein 40,
xx
xx
93
290-882 
79
50-878 


XP_001900154.1/
putative


GI: 170590790













Carbohydrate Metabolism




















Bm1_36055/
hexokinase, putative
47
16-437
65
22-438 
82
1-439


XP_001898658.1/


GI: 170587792


Bm1_52335/
UDP-N-acetylglucosamine-dolichyl-
49
 2-404
94
1-406
89
1-406


XP_001901935.1/
phosphate N-


GI: 170594367
acetylglucosaminephosphotransferase,



putative


Catabolism


Bm1_48180/
Amidase family protein
41
45-303
96
90-324 
75
1-373


XP_001901098.1/


GI: 170592691


Cell Adhesion


Bm1_39630/
Immunoglobulin I-set domain containing
28
 45-1170
97
628-1171 
87
26-1171


XP_001899381.1/
protein


GI:170589239


Bm1_02820/
EGF-like domain containing protein
35
53-206
96
1-269
82
3-269


XP_001892066.1/


GI:170572325


Cell Signaling


Bm1_30585/
Tyrosine-protein kinase abl-1.-related
24
212-281 
95
10-281 
79
10-281 


XP_001897556.1/


GI: 170585572


Bm1_19395/
Protein kinase domain containing protein
34
 15-1280
98
1-681
92
 1-1280


XP_001895334.1/


GI:170580602


Bm1_38285/
Ser/Thr protein phosphatase family
40
22-287
95
9-293
79
2-293


XP_001899110.1/
protein


GI: 170588697


Bm1_51260/
Innexin family protein
29
327-399 
83
1-523
64
1-533


XP_001901720.1/


GI: 170593937


Bm1_31730/
phosphatidate cytidylyltransferase-
27
56-127
97
1-128
84
1-118


XP_001897792.1/
related


GI:170586046


Bm1_36290/
Putative phosphatidate
50
41-480
94
1-374
85
5-487


XP_001898705.1/
cytidylyltransferase, putative


GI:170587887


Bm1_43990/
Latrophilin receptor protein 2, putative
42
354-715 
81
59-873 
68
32-873 


XP_001900260.1/


GI:170591002


Bm1_05960/
Patched family protein
20
229-896 
96
66-549 
73
1-933


XP_001892664.1/


GI:170574085


Bm1_23705/
protein C24B5.3, putative
21
 8-957
32
12-955 
83
2-746


XP_001896198.1/


GI:170582592


Bm1_39815/
Ly-6-related protein HOT-2-related
29
18-55 
95
97-199 
89
60-199 


XP_001899418.1/


GI:170589313


Bm1_55745/
sulfakinin receptor protein, putative
28
 5-333
86
1-299
71
1-398


XP_001902606.1/


GI:170596018


Bm1_54240/
Endonuclease/Exonuclease/phosphatase
38
 3-298
98
1-205
85
1-391


XP_001902307.1/
family protein


GI:170595251


Bm1_52975/
ER lumen protein retaining
69
 1-207
99
1-213
96
1-213


XP_001902063.1/
receptor, putative


GI:170594623


Bm1_48590/
Low-density lipoprotein
47
4-46
91
1-154
66
1-154


XP_001901181.1/
receptor domain class A


GI:170592857
containing protein


Cellular Trafficking


Bm1_14235/
SNARE domain containing
37
 9-230
98
1-248
86
1-248


XP_001894314.1/
protein


GI:170578208


Chaperone/HSP


Bm1_15660/
DnaJ domain containing
27
25-835
91
1-839
77
3-839


XP_001894589.1/
protein


GI:170578901


Bm1_22450/
hemimethylated DNA
25
34-112
97
29-119 
95
29-119 


XP_001895946.1/
binding domain containing


GI:170582031
protein


DNA/RNA Binding


Bm1_41070/
Zinc finger DHHC domain
53
11-262
97
1-445
86
1-444


XP_001899675.1/
containing protein 5,


GI:170589828
putative


Glycosylation/glucuronidation


Bm1_34610/
glycosyl transferase, group 2 family
56
80-554
96
1-447
88
1-582


XP_001898369.1/
protein


GI:170587206


Bm1_44655/
Fukutin.-related
31
138-364 
96
1-362
73
1-364


XP_001900394.1/


GI:170591272


Bm1_13480/
UDP-glucoronosyl and UDP-glucosyl
27
35-509
95
1-425
29
214-293 


XP_001894161.1/
transferase family protein


GI:170577851


Immunological


Bm1_50985/
Complement component C6 precursor.-
32
49-136
98
1-171
72
3-161


XP_001901665.1/
related


GI:170593827


Miscellaneous


Bm1_15480/
Acyltransferase family protein
28
295-352 
94
44-351 
80
17-649 


XP_001894556.1/


GI:170578816


Bm1_23850/
cDNA sequence BC017158-related
37
38-384
93
163-353 
81
1-389


XP_001896227.1/


GI:170582654


Bm1_49590/
CG3054-PA-related
28
97-260
81
1-242
69
1-260


XP_001901384.1/


GI:170593263


Bm1_15855/
D4Ertd196e protein, putative
56
14-174
94
1-177
81
1-150


XP_001894628.1/


GI:170578992


Bm1_10500/
AMOP domain containing protein
26
652-932 
98
679-1377 
92
 1-1513


XP_001894628.1/


GI:170578992


Bm1_06760/
zgc: 100814 protein-related
39
 9-355
95
1-136
75
1-356


XP_001892822.1/


GI:170574457


Bm1_41280/
uncharacterized hypothalamus protein
56
50-295
96
1-296
82
4-296


XP_001899713.1/
HTMP, putative


GI:170589904


Bm1_17180/
MiaB-like tRNA modifying enzyme,
56
 1-402
97
17-256 
90
1-424


XP_001894893.1/
archaeal-type family protein


GI:170579583


Bm1_48010/
EGF-like domain containing protein
36
10-395
91
19-338 
66
6-560


XP_001901064.1/


GI:170592623


Proteases


Bm1_18805/
Papain family cysteine protease containing
34
65-309
68
73-267 
52
35-309 


XP_001895218.1/
protein


GI:170580338


Bm1_38300/
Peptidase family M1 containing protein
28
136-586 
90
90-819 
67
1-819


XP_001899113.1/


GI:170588703


Bm1_26370/
Rhomboid family protein
36
77-347
96
1-377
83
1-377


XP_001896728.1/


GI:170583765


Bm1_11005/
MGC84665 protein-related
38
2-98
95
1-96 
80
1-98 


XP_001896728.1/


GI:170583765


Bm1_53050/
Reprolysin
32
153-753 
91
96-845 
77
1-839


XP_001902078.1/


GI:170594653


Possible Protease


Bm1_00205/
ShTK domain containing protein
27
54-161
80
142-229 
52
70-227 


XP_001891567.1/


GI:170571016













Protease inhibitor




















Bm1_09775/
serpin, putative
29
26-388
75
84-375 
52
29-390 


XP_001893428.1/


GI:170575897


Sterol Metabolism


Bm1_37660/
Oxysterol-binding protein
45
37-747
93
312-748 
87
57-724 


XP_001898982.1/


GI:170588441


Structural


Bm1_53475/
Nematode cuticle collagen N-terminal
40
102-274 
85
48-288 
61
1-291


XP_001902163.1/
domain containing protein


GI:170594883


Bm1_04695/
Cuticle collagen F09G8.6.-related
40
80-251
96
43-270 
89
1-266


XP_001892417.1/


GI:170573305


Bm1_19730/
Autophagy protein Apg9 containing
43
50-655
91
204-814 
82
1-814


XP_001895401.1/
protein


GI:170580769


Transporters


Bm1_08720/
ABC transporter N-terminus family protein
46
65-153
94
1-150
78
1-153


XP_001893214.1/


GI:170575377


Bm1_06830/
ABC transporter transmembrane region
40
80-246
92
82-183 
60
1-248


XP_001892835.1/
family protein


GI:170574488


Bm1_08185/
Cation transporter family protein
37
35-439
96
1-444
84
3-444


XP_001893108.1/


GI:170575121


Bm1_34425/
Ctr copper transporter family protein
28
63-255
89
1-276
69
10-276 


XP_001898332.1/


GI:170587131


Bm1_00795/
E1-E2 ATPase family protein
49
 6-523
96
1-449
87
1-530


XP_001891680.1/


GI:170571314


Bm1_42365/
ZIP Zinc transporter family protein
25
17-386
92
1-387
76
1-387


XP_001899937.1/


GI:170590354


Bm1_40010/
NRAMP-like transporter K11G12.4,
59
44-457
100
228-457 
85
34-457 


XP_001899457.1/
putative


GI:170589391


Bm1_38955/
Twik family of potassium channels protein
24
101-536 
93
1-561
96
49-561 


XP_001899244.1/
28, putative


GI:170588965


Bm1_44770/
TWiK family of potassium channels
26
136-438 
98
80-330 
87
1-525


XP_001900417.1/
protein 7, putative


GI:170591318


Bm1_24840/
Major Facilitator Superfamily protein
27
 7-477
95
102-493 
85
1-493


XP_001896428.1/


GI:170583097


Bm1_37140/
Major Facilitator Superfamily protein
34
70-221
90
100-739 
69
1-739


XP_001898875.1/


GI:170588227


Bm1_24985/
cation efflux family protein
26
181-300 
97
54-483 
80
15-483 


XP_001896457.1/


GI:170583158


Bm1_46360/
Transmembrane amino acid transporter
27
29-447
97
32-443 
93
1-401


XP_001900734.1/
protein


GI:170591953


Bm1_25200/
Mitochondrial carrier C16C10.1, putative
47
12-332
97
1-332
84
1-332


XP_001896499.1/


GI:170583257


Bm1_01695/
RE11181p-related
28
14-258
89
1-360
22
1-258


XP_001891855.1/


GI:170571765


Bm1_42075/
zgc: 92765, putative
49
11-351
81
1-397
80
1-396


XP_001899877.1/


GI:170590234


Bm1_43555/
ABC TRANSPORTER
40
 19-1536
91
78-1087
71
549-1536 


XP_001900172.1/
TRANSMEMBRANE REGION FAMILY


GI:170590826
PROTEIN


Bm1_37475/
Major Facilitator Superfamily protein
33
100-320 
93
67-806 
76
1-791


XP_001898945.1/


GI:170588367


Bm1_38360/
major facilitator superfamily protein
30
41-553
98
31-565 
92
31-564 


XP_001899125.1/


GI:170588727


Bm1_15490/
ABC transporter family protein
45
67-706
98
35-543 
81
4-712


XP_001894558.1/


GI:170578824


Bm1_31305/
vesicular acetylcholine transporter unc-17,
50
17-440
94
20-528 
91
1-528


XP_001897706.1/
putative


GI:170585872


Bm1_42930/
Excitatory amino acid transporter, putative
59
 7-469
98
1-320
92
245-498 


XP_001900048.1/


GI:170590576


Bm1_02560/
Sodium/calcium exchanger protein
35
29-341
86
156-342 
78
1-342


XP_001892014.1/


GI:170572178


Bm1_31865/
Probable calcium-binding mitochondrial
45
46-504
93
5-508
86
4-506


XP_001897818.1/
carrier F55A11.4, putative


GI:170586099













Hypothetical Proteins




















Bm1_04935/
hypothetical protein
47
4-37
28
1-54 
65
1-68 


XP_001892465.1/


GI:170573424


Bm1_33605/
hypothetical protein
41
42-70 
81
1-216
49
58-216 


XP_001898167.1/


GI:170586800


Bm1_34095/
hypothetical protein
29
15-88 
42
55-87 
68
1-131


XP_001898267.1/


GI:170587000


Bm1_22820/
hypothetical protein
35
30-69 
100
1-68 
94
1-81 


XP_001896020.1/


GI:170582194/


Bm1_26845/
Hypothetical protein
33
77-164
92
1-162
79
58-245 


XP_001896819.1/


GI:170584006


Bm1_45625/
hypothetical protein
38
127-171 
90
1-338
76
25-334 


XP_001900588.1/


GI:170591660


Bm1_52210/
hypothetical protein
29
238-350 
97
53-433 
85
1-431


XP_001901910.1/


GI:170594317


Bm1_02860
hypothetical protein
xx
xx
33
11-31 
38
11-31 


Bm1_57335/
Conserved hypothetical protein, putative
31
114-236 
96
1-246
89
1-246


XP_001902925.1/


GI:170596866/


Bm1_25895/
hypothetical protein
28
115-179 
96
122-325 
68
33-325 


XP_001896638.1/


GI:170583561


Bm1_26820/
hypothetical protein
38
318-378 
88
1-317
59
1-191


XP_001896814.1/


GI:170583994


Bm1_46300/
hypothetical protein, conserved
38
129-154 
91
195-387 
52
30-387 


XP_001900722.1/


GI:170591929


Bm1_20460/
hypothetical protein
30
103-147 
24
94-147 
82
1-191


XP_001895546.1/


GI:170581122


Bm1_27875/
hypothetical protein
28
186-237 
91
61-298 
76
12-202 


XP_001897026.1/


GI:170584478


Bm1_30935/
hypothetical protein
39
37-67 
82
4-90 
27
30-89 


XP_001897629.1/


GI:170585718


Bm1_53700/
hypothetical protein
37
56-108
92
178-216 
56
1-220


XP_001902205.1/


GI:170594994


Bm1_57235/
hypothetical protein
40
39-81 
50
45-62 
65
1-67 


XP_001902905.1/


GI:170596815


Bm1_45100/
hypothetical protein
29
330-414 
85
118-727 
60
75-727 


XP_001900482.1/


GI:170591448


Bm1_07875/
CONSERVED HYPOTHETICAL
31
84-225
99
142-231 
85
53-231 


XP_001893045.1/
PROTEIN


GI:170574981


Bm1_30410/
conserved hypothetical protein
38
152-201 
99
112-234 
89
105-234 


XP_001897520.1/


GI:170585498


Bm1_17550/
hypothetical protein
33
63-121
87
77-129 
82
82-125 


XP_001894967.1/


GI:170579749


Bm1_00920/
Hypothetical 30.5 kDa protein ZK1321.3 in
25
 5-168
93
1-176
72
1-176


XP_001891702.1/
chromosome II.-related


GI:170571368


Bm1_04875/
hypothetical protein
xx
xx
xx
xx
xx
xx


XP_001892453.1/


GI:170573392


Bm1_29435/
Hypothetical protein
30
23-163
96
23-204 
83
23-204 


XP_001897329.1/


GI:170585108


Bm1_42465/
hypothetical protein
28
85-134
84
1-394
70
1-394


XP_001899956.1/


GI:170590392


Bm1_48705/
conserved hypothetical protein
23
57-267
97
96-267 
74
1-268


XP_001901204.1/


GI:170592903


Bm1_32415/
Hypothetical 21.5 kDa protein in SEC15-
52
25-168
93
1-176
90
1-176


XP_001897928.1/
SAP4 intergenic region.-related


GI:170586322


Bm1_07845/
hypothetical protein
28
24-108
89
1-210
64
1-210


XP_001893039.1/


GI:170574965


Bm1_15300/
Hypothetical protein
xx
xx
xx
xx
xx
xx


XP_001894520.1/


GI:170578730


Bm1_17255/
hypothetical protein
21
93-246
87
17-251 
78
28-251 


XP_001894908.1/


GI:170579615


Bm1_18965/
Hypothetical protein
45
46-235
91
1-206
78
1-236


XP_001895250.1/


GI:170580404


Bm1_20325/
Hypothetical protein-conserved
37
 2-215
96
1-487
89
1-487


XP_001895519.1/


GI:170581054


Bm1_44010/
hypothetical protein
42
3-33
92
91-185 
26
74-156 


XP_001900264.1/


GI:170591010


Bm1_46230/
hypothetical protein
29
184-256 
91
24-278 
62
1-278


XP_001900708.1/


GI:170591901





*% Ident” is the percentage of amino acids within the query coverage identical to query sequence.


**Query cov.” span of amino acids in the query sequence that aligns with the target sequence producing significant alignment













TABLE B







BLAST P OF THE TABLE A INTESTINE PROTEINS AGAINST L. LOA AND D. IMMITIS.










B. malayi





Protein Type


Gene Symbol/

L. loa


D. immitis













Accession No./GI


Query

Query


No.
Description
% Ident*
Cov**
% Ident*
Cov**





Acyltransferase







Bm1_43465/
Temporarily assigned gene name protein
88
50-664 
84
64-882 


XP_001900154.1/
40, putative


GI: 170590790


Carbohydrate


Metabolism


Bm1_36055/
hexokinase, putative
63
22-438 
84
1-439


XP_001898658.1/


GI: 170587792


Bm1_52335/
UDP-N-acetylglucosamine-dolichyl-
91
1-406
87
1-388


XP_001901935.1/
phosphate N-


GI: 170594367
acetylglucosaminephosphotransferase,



putative


Catabolism


Bm1_48180/
Amidase family protein
78
1-373
74
90-373 


XP_001901098.1/


GI: 170592691


Cell Adhesion


Bm1_39630/
Immunoglobulin I-set domain containing
89
 1-1171
82
 1-1171


XP_001899381.1/
protein


GI: 170589239


Bm1_02820/
EGF-like domain containing protein
89
62-269 
79
2-269


XP_001892066.1/


GI: 170572325


Cell Signaling


Bm1_30585/
Tyrosine-protein kinase abl-1.-related
83
10-281 
84
10-281 


XP_001897556.1/


GI: 170585572


Bm1_19395/
Protein kinase domain containing protein
95
 1-1280
93
 1-1280


XP_001895334.1/


GI: 170580602


Bm1_38285/
Ser/Thr protein phosphatase family protein
88
9-293
79
 1-290


XP_001899110.1/


GI: 170588697


Bm1_51260/
Innexin family protein
69
1-523
64
1-532


XP_001901720.1/


GI: 170593937


Bm1_31730/
phosphatidate cytidylyltransferase-related
93
1-125
87
1-125


XP_001897792.1/


GI: 170586046


Bm1_36290/
Putative phosphatidate cytidylyltransferase,
89
5-487
78
1-487


XP_001898705.1/
putative


GI: 170587887


Bm1_43990/
Latrophilin receptor protein 2, putative
76
19-841 
69
27-873 


XP_001900260.1/


GI: 170591002


Bm1_05960/
Patched family protein
87
1-934
75
1-933


XP_001892664.1/


GI: 170574085


Bm1_23705/
protein C24B5.3, putative
90
1-934
93
648-959 


XP_001896198.1/


GI: 170582592


Bm1_39815/
Ly-6-related protein HOT-2-related
89
69-199 
89
6-189


XP_001899418.1/


GI: 170589313


Bm1_55745/
sulfakinin receptor protein, putative
36
4-333
64
139-392 


XP_001902606.1/


GI: 170596018


Bm1_54240/
Endonuclease/Exonuclease/phosphatase
88
1-256
83
1-391


XP_001902307.1/
family protein


GI: 170595251


Bm1_52975/
ER lumen protein retaining receptor,
96
1-213
96
1-209


XP_001902063.1/
putative


GI: 170594623


Bm1_48590/
Low-density lipoprotein receptor domain
74
4-154
67
9-154


XP_001901181.1/
class A containing protein


GI: 170592857


Cellular Trafficking


Bm1_14235/
SNARE domain containing protein
94
1-248
83
1-235


XP_001894314.1/


GI: 170578208


Chaperone/HSP


Bm1_15660/
DnaJ domain containing protein
85
11-839 
80
1-839


XP_001894589.1/


GI: 170578901


Bm1_22450/
hemimethylated DNA binding domain
97
29-119 
91
28-119 


XP_001895946.1/
containing protein


GI: 170582031


DNA/RNA Binding


Bm1_41070/
Zinc finger DHHC domain containing
90
28-445 
88
1-445


XP_001899675.1/
protein 5, putative


GI: 170589828


Glycosylation/


glucuronidation


Bm1_34610/
glycosyl transferase, group 2 family protein
89
1-435
88
1-582


XP_001898369.1/


GI: 170587206


Bm1_44655/
Fukutin.-related
85
1-364
75
1-364


XP_001900394.1/


GI: 170591272


Bm1_13480/
UDP-glucoronosyl and UDP-glucosyl
81
1-423
72
155-502 


XP_001894161.1/
transferase family protein


GI: 170577851


Immunological


Bm1_50985/
Complement component C6 precursor.-
61
2-166
75
3-171


XP_001901665.1/
related


GI: 170593827


Miscellaneous


Bm1_15480/
Acyltransferase family protein
86
266-467 
76
49-638 


XP_001894556.1/


GI: 170578816


Bm1_23850/
cDNA sequence BC017158-related
82
1-389
72
21-389 


XP_001896227.1/


GI: 170582654


Bm1_49590/
CG3054-PA-related
63
1-262
67
1-254


XP_001901384.1/


GI: 170593263


Bm1_15855/
D4Ertd196e protein, putative
86
1-177
81
1-151


XP_001894628.1/


GI: 170578992


Bm1_10500/
AMOP domain containing protein
94
 1-1513
90
609-1513 


XP_001894628.1/


GI: 170578992


Bm1_06760/
zgc: 100814 protein-related
84
1-357
71
1-356


XP_001892822.1/


GI: 170574457


Bm1_41280/
uncharacterized hypothalamus protein
81
1-296
80
1-296


XP_001899713.1/
HTMP, putative


GI: 170589904


Bm1_17180/
MiaB-like tRNA modifying enzyme,
94
1-390
95
1-265


XP_001894893.1/
archaeal-type family protein


GI: 170579583


Bm1_48010/
EGF-like domain containing protein
74
6-556
64
6-546


XP_001901064.1/


GI: 170592623


Proteases


Bm1_18805/
Papain family cysteine protease
56
134-313 
41
1-314


XP_001895218.1/
containing protein


GI: 170580338


Bm1_38300/
Peptidase family M1 containing protein
74
1-819
70
1-819


XP_001899113.1/


GI: 170588703


Bm1_26370/
Rhomboid family protein
83
1-377
85
17-377 


XP_001896728.1/


GI: 170583765


Bm1_11005/
MGC84665 protein-related
91
1-98 
90
1-40 


XP_001896728.1/


GI: 170583765


Bm1_53050/
Reprolysin
79
1-839
77
1-843


XP_001902078.1/


GI: 170594653


Possible Protease


Bm1_00205/
ShTK domain containing protein
46
126-264 
55
110-264 


XP_001891567.1/


GI: 170571016


Protease inhibitor


Bm1_09775/
serpin, putative
54
1-391
52
1-391


XP_001893428.1/


GI: 170575897







Sterol


Sterol Metabolism

Metabolism





Bm1_37660/
Oxysterol-binding protein
82
57-753 
86
57-753 


XP_001898982.1/


GI: 170588441





Structural
Structural





BBm1_53475/
Nematode cuticle collagen N-terminal
64
1-281
55
46-291 


XP_001902163.1/
domain containing protein


GI: 170594883


Bm1_04695/
Cuticle collagen F09G8.6.-related
89
1-270
86
1-269


XP_001892417.1/


GI: 170573305


Bm1_19730/
Autophagy protein Apg9 containing protein
86
1-814
81
1-814


XP_001895401.1/


GI: 170580769


Transporters


Bm1_08720/
ABC transporter N-terminus family protein
84
1-153
72
1-153


XP_001893214.1/


GI: 170575377


Bm1_06830/
ABC transporter transmembrane region
81
82-248 
62
1-244


XP_001892835.1/
family protein


GI: 170574488


Bm1_08185/
Cation transporter family protein
57
32-345 
59
67-345 


XP_001893108.1/


GI: 170575121


Bm1_34425/
Ctr copper transporter family protein
74
2-276
73
1-276


XP_001898332.1/


GI: 170587131


Bm1_00795/
E1-E2 ATPase family protein
91
45-531 
85
1-529


XP_001891680.1/


GI: 170571314


Bm1_42365/
ZIP Zinc transporter family protein
81
28-387 
43
17-386 


XP_001899937.1/


GI: 170590354


Bm1_40010/
NRAMP-like transporter K11G12.4, putative
91
1-456
83
1-457


XP_001899457.1/


GI: 170589391


Bm1_38955/
Twik family of potassium channels protein
92
1-561
91
1-561


XP_001899244.1/
28, putative


GI: 170588965


Bm1_44770/
TWiK family of potassium channels protein


XP_001900417.1/
7, putative
94
1-525
91
1-525


GI: 170591318


Bm1_24840/
Major Facilitator Superfamily protein
87
1-418
86
1-493


XP_001896428.1/


GI: 170583097


Bm1_37140/
Major Facilitator Superfamily protein
75
1-738
71
1-739


XP_001898875.1/


GI: 170588227


Bm1_24985/
cation efflux family protein
85
1-483
81
1-483


XP_001896457.1/


GI: 170583158


Bm1_46360/
Transmembrane amino acid transporter
86
1-443
91
1-439


XP_001900734.1/
protein


GI: 170591953


Bm1_25200/
Mitochondrial carrier C16C10.1, putative
89
1-332
83
1-332


XP_001896499.1/


GI: 170583257


Bm1_01695/
RE11181p-related
26
4-105
23
4-258


XP_001891855.1/


GI: 170571765


Bm1_42075/
zgc: 92765, putative
84
1-396
74
1-396


XP_001899877.1/


GI: 170590234


Bm1_43555/
ABC TRANSPORTER
71
12-996 
71
14-1423


XP_001900172.1/
TRANSMEMBRANE REGION


GI: 170590826
FAMILY PROTEIN


Bm1_37475/
Major Facilitator Superfamily protein
82
1-792
78
113-795 


XP_001898945.1/


GI: 170588367


Bm1_38360/
major facilitator superfamily protein
93
31-565 
91
21-564 


XP_001899125.1/


GI: 170588727


Bm1_15490/
ABC transporter family protein
89
4-713
77
7-638


XP_001894558.1/


GI: 170578824


Bm1_31305/
vesicular acetylcholine transporter unc-
92
1-528
87
1-528


XP_001897706.1/
17, putative


GI: 170585872


Bm1_42930/
Excitatory amino acid transporter,
92
1-498
93
1-499


XP_001900048.1/
putative


GI: 170590576


Bm1_02560/
Sodium/calcium exchanger protein
79
1-342
71
1-314


XP_001892014.1/


GI: 170572178


Bm1_31865/
Probable calcium-binding
82
8-501
86
4-508


XP_001897818.1/
mitochondrial carrier F55A11.4,


GI: 170586099
putative


Hypothetical


Proteins


Bm1_04935/
hypothetical protein
72
1-111
66
1-112


XP_001892465.1/


GI: 170573424


Bm1_33605/
hypothetical protein
58
111-216 
22
19-188 


XP_001898167.1/


GI: 170586800


Bm1_34095/
hypothetical protein
81
1-131
71
1-129


XP_001898267.1/


GI: 170587000


Bm1_22820/
hypothetical protein
98
1-81 
96
1-81 


XP_001896020.1/


GI: 170582194/


Bm1_26845/
Hypothetical protein
81
63-162 
72
1-245


XP_001896819.1/


GI: 170584006


Bm1_45625/
hypothetical protein
82
1-332
76
1-332


XP_001900588.1/


GI: 170591660


Bm1_52210/
hypothetical protein
87
15-432 
84
1-430


XP_001901910.1/


GI: 170594317


Bm1_02860
hypothetical protein
33
11-31 
33
11-31 


Bm1_57335/
Conserved hypothetical protein, putative
91
1-246
87
1-246


XP_001902925.1/


GI: 170596866/


Bm1_25895/
hypothetical protein
74
31-325 
67
24-325 


XP_001896638.1/


GI: 170583561


Bm1_26820/
hypothetical protein
77
1-317
72
99-317 


XP_001896814.1/


GI: 170583994


Bm1_46300/
hypothetical protein, conserved
75
1-191
56
18-380 


XP_001900722.1/


GI: 170591929


Bm1_20460/
hypothetical protein
86
1-191
85
1-191


XP_001895546.1/


GI: 170581122


Bm1_27875/
hypothetical protein
79
12-202 
77
185-298 


XP_001897026.1/


GI: 170584478


Bm1_30935/
hypothetical protein
64
9-90 
28
17-70 


XP_001897629.1/


GI: 170585718


Bm1_53700/
hypothetical protein
57
53-236 
62
2-103


XP_001902205.1/


GI: 170594994


Bm1_57235/
hypothetical protein
85
10-67 
74
1-68 


XP_001902905.1/


GI: 170596815


Bm1_45100/
hypothetical protein
67
81-696 
49
271-727 


XP_001900482.1/


GI: 170591448


Bm1_07875/
CONSERVED HYPOTHETICAL
90
42-231 
90
41-231 


XP_001893045.1/
PROTEIN


GI: 170574981


Bm1_30410/
conserved hypothetical protein
90
1-234
94
154-234 


XP_001897520.1/


GI: 170585498


Bm1_17550/
hypothetical protein
77
77-129 
86
82-125 


XP_001894967.1/


GI: 170579749


Bm1_00920/
Hypothetical 30.5 kDa protein ZK1321.3 in
75
1-174
75
1-176


XP_001891702.1/
chromosome II.-related


GI: 170571368


Bm1_04875/
hypothetical protein
xx
xx
xx
xx


XP_001892453.1/


GI: 170573392


Bm1_29435/
Hypothetical protein
89
23-204 
81
23-204 


XP_001897329.1/


GI: 170585108


Bm1_42465/
hypothetical protein
25
14-387 
25
14-385 


XP_001899956.1/


GI: 170590392


Bm1_48705/
conserved hypothetical protein
81
1-268
72
1-269


XP_001901204.1/


GI: 170592903


Bm1_32415/
Hypothetical 21.5 kDa protein in SEC15-
93
1-176
91
1-176


XP_001897928.1/
SAP4 intergenic region.-related


GI: 170586322


Bm1_07845/
hypothetical protein
75
1-210
72
60-210


XP_001893039.1/


GI: 170574965


Bm1_15300/
Hypothetical protein
xx
xx
xx
xx


XP_001894520.1/


GI: 170578730


Bm1_17255/
hypothetical protein
73
13-251 
75
28-251 


XP_001894908.1/


GI: 170579615


Bm1_18965/
Hypothetical protein
85
1-236
77
1-239


XP_001895250.1/


GI: 170580404


Bm1_20325/
Hypothetical protein-conserved
91
1-487
89
1-485


XP_001895519.1/


GI: 170581054


Bm1_44010/
hypothetical protein
72
9-185
32
61-129 


XP_001900264.1/


GI: 170591010


Bm1_46230/
hypothetical protein
65
18-278 
31
161-281 


XP_001900708.1/


GI: 170591901





*% Ident” is the percentage of amino acids within the query coverage identical to query sequence.


**Query cov.” span of amino acids in the query sequence that aligns with the target sequence producing significant alignment













TABLE C







SELECTED PROTEINS FROM TABLES A AND B CONTAINING 1-2 TRANSMEMBRANE DOMAINS, A SIGNIFICANT NON-CYTOPLASMIC


PORTION, >75% HOMOLOGY TO EITHER W. BANCROFTI OR O. VOLVULUS AND <40% HOMOLOGY TO HUMANS.














H. sapiens


W. bancrofti


O. volvulus


L. loa


D. immitis


















SEQ ID NO:/

Query

Query

Query

Query

Query


Gene Symbol
% Ident*
cov**.
% Ident*
Cov**
% Ident*
Cov**
% Ident*
Cov**
% Ident*
Cov**




















1/Bm1_39630/
28
 45-1170
97
628-1171
87
26-1171
89
 1-1171
82
 1-1171


XP_001899381.1/


GI:170589239


2/Bm1_02820/
35
53-206
96
 1-269
82
3-269
89
62-269 
79
2-269


XP_001892066.1/


GI:170572325


3/Bm1_30585/
24
212-281 
95
10-281
79
10-281 
83
10-281 
84
10-281 


XP_001897556.1/


GI:170585572


4/Bm1_19395/
34
 15-1280
98
 1-681
92
 1-1280
95
 1-1280
93
 1-1280


XP_001895334.1/


GI:170580602


5/Bm1_38285/
40
22-287
95
 9-293
79
2-293
88
9-293
79
1-290


XP_001899110.1/


GI:170588697


6/Bm1_15660/
27
25-835
91
 1-839
77
3-839
85
11-839 
80
1-839


XP_001894589.1/


GI:170578901


7/Bm1_22450/
25
34-112
97
29-119
95
29-119 
97
29-119 
91
28-119 


XP_001895946.1/


GI:170582031


8/Bm1_44655/
31
138-364 
96
 1-362
73
1-364
85
1-364
75
1-364


XP_001900394.1/


GI:170591272


9/Bm1_13480/
27
35-509
95
 1-425
29
214-293 
81
1-423
72
155-502 


XP_001894161.1/


GI:170577851


10/Bm1_49590/
28
97-260
81
 1-242
69
1-260
63
1-262
67
1-254


XP_001901384.1/


GI:170593263


11/Bm1_10500/
26
652-932 
98
679-1377
92
 1-1513
94
 1-1513
90
609-1513 


XP_001893572.1/


GI:170576299/


12/Bm1_48010/
36
10-395
91
19-338
66
6-560
74
6-556
64
6-546


XP_001901064.1


GI:170592623/


13/Bm1_38300/
28
136-586 
90
90-819
67
1-819
74
1-819
70
1-819


XP_001899113.1/


GI:170588703


14/Bm1_11005/
38
2-98
95
1-96
80
1-98 
91
1-98 
90
1-40 


XP_001893672.1/


GI:170576544


15/Bm1_53050/
32
153-753 
91
96-845
77
1-839
79
1-839
77
1-843


XP_001902078.1/


GI:170594653


16/Bm1_00205/
27
54-161
80
142-229 
52
70-227 
46
126-264 
55
110-264 


XP_001891567.1/


GI:170571016


17/Bm1_09775/
29
26-388
75
84-375
52
29-390 
54
1-391
52
1-391


XP_001893428.1/


GI:170575897


18/Bm1_52210/
29
238-350 
97
53-433
85
1-431
87
15-432 
84
1-430


XP_001901910.1/


GI:170594317


19/Bm1_57335/
31
114-236 
96
 1-246
89
1-246
91
1-246
87
1-246


XP_001902925.1/


GI:170596866


20/Bm1_20460/
30
103-147 
24
94-147
82
1-191
86
1-191
85
1-191


XP_001895546.1/


GI:170581122


21/Bm1_45100/
29
330-414 
85
118-727 
60
75-727 
67
81-696 
49
271-727 


XP_001900482.1/


GI:170591448


22/Bm1_07875/
31
84-225
99
142-231 
85
53-231 
90
42-231 
90
41-231 


XP_001893045.1/


GI:170574981


23/Bm1_17550/
33
63-121
87
77-129
82
82-125 
77
77-129 
86
82-125 


XP_001894967.1/


GI:170579749


24/Bm1_07845/
28
24-108
89
 1-210
64
1-210
75
1-210
72
60-210 


XP_001893039.1/


GI:170574965


25/Bm1_17255/
21
93-246
87
17-251
78
28-251 
73
13-251 
75
28-251 


XP_001894908.1/


GI:170579615


26/Bm1_20325/
37
 2-215
96
 1-487
89
1-487
91
1-487
89
1-485


XP_001895519.1/


GI:170581054


27/Bm1_46230/
29
184-256 
91
24-278
62
1-278
65
18-278 
31
161-281 


XP_001900708.1/


GI:170591901





*% Ident” is the percentage of amino acids within the query coverage identical to query sequence.


**Query cov.” span of amino acids in the query sequence that aligns with the target sequence producing significant alignment













TABLE D







NORMALIZED SPECTRAL ABUNDANCE FACTOR (NSAF) AND NSAF ENRICHMENT











NSAF enrichment*
NSAF

















Reproductive
Abundance**
Non-cytoplasmic
Transmembrane


SEQ ID NO.
Intestine
Body Wall
Tract
Intestine
domaina
(TMHMM)b
















1/Bm1_39630/
3.84
0.26
0
2.20E−05
19-1120
1


XP_001899381.1/


GI:170589239


2/Bm1_02820/
9999
0
0
1.30E−04
1-225
1


XP_001892066.1/


GI:170572325


3/Bm1_30585/
4.42
0
0.23
8.40E−05
19-135 
1


XP_001897556.1/


GI:170585572


4/Bm1_19395/
3.42
0.14
0.12
3.30E−05
1-942
1


XP_001895334.1/


GI:170580602


5/Bm1_38285/
7.68
0.13
0
1.80E−04
41-293 
1


XP_001899110.1/


GI:170588697


6/Bm1_15660/
10.31
0
0.1
7.20E−05
18-220 
1


XP_001894589.1/


GI:170578901


7/Bm1_22450/
2.74
0.17
0.14
2.70E−04
1-125
1


XP_001895946.1/


GI:170582031


8/Bm1_44655/
2.95
0
0.34
4.70E−05
28-364 
1


XP_001900394.1/


GI:170591272


9/Bm1_13480/
28.16
0.04
0
7.40E−04
1-486
1


XP_001894161.1/


GI:170577851


10/Bm1_49590/
2.95
0
0.34
6.50E−05
1-51, 101-265
2


XP_001901384.1/


GI:170593263


11/Bm1_10500/
5.99
0.01
0.15
6.10E−04
23-1322
1


XP_001893572.1/


GI:170576299/


12/Bm1_48010/
2.03
0.38
0.06
2.50E−04
1-430
1


XP_001901064.1


GI:170592623/


13/Bm1_38300/
9999
0
0
1.60E−05
81-1061
1


XP_001899113.1/


GI:170588703


14/Bm1_11005/
2.74
0.17
0.14
2.00E−04
1-16, 76-169 
1


XP_001893672.1/


GI:170576544


15/Bm1_53050/
2.95
0
0.34
6.10E−04
1-607
1


XP_001902078.1/


GI:170594653


16/Bm1_00205/
3.42
0.23
0.04
4.200+00
26-264 
1


XP_001891567.1/


GI:170571016


17/Bm1_09775/
2.56
0.39
0
4.70E−05
17-391 
1


XP_001893428.1/


GI:170575897


18/Bm1_52210/
9999
0
0
4.00E−05
1-369
1


XP_001901910.1/


GI:170594317


19/Bm1_57335/
9999
0
0
7.00E−05
29-211 
2


XP_001902925.1/


GI:170596866


20/Bm1_20460/
2.95
0
0.34
8.00E−05
143-147 
2


XP_001895546.1/


GI:170581122


21/Bm1_45100/
5.12
0.2
0
4.70E−05
285-727 
1


XP_001900482.1/


GI:170591448


22/Bm1_07875/
4.42
0
0.23
1.10E−04
1-56. 
1


XP_001893045.1/


GI:170574981


23/Bm1_17550/
2.56
0.39
0
1.30E−04
1-61. 
1


XP_001894967.1/


GI:170579749


24/Bm1_07845/
2.09
0.15
0.24
4.90E−04
53-210 
1


XP_001893039.1/


GI:170574965


25/Bm1_17255/
3.64
0.2
0.05
3.80E−04
36-251 
1


XP_001894908.1/


GI:170579615


26/Bm1_20325/
3.36
0.12
0.14
3.00E−04
1-194
1


XP_001895519.1/


GI:170581054


27/Bm1_46230/
2.21
0
0.45
1.80E−04
121-289 
1


XP_001900708.1/


GI:170591901





*Calculated as described in the Examples


**Calculated as described in the Examples



aNon-cytoplasmic domain refers to the span of amino acids predicted to be non-cytoplasmic as described in the Examples




bTransmembrane refers to number of transmembrane domains as predicted by a membrane protein topology prediction method TMHMM as described in the Examples.






Claims
  • 1. An immunogenic composition comprising: at least two isolated polypeptides or immunogenic fragments thereof, and optionally a pharmaceutically acceptable carrier,wherein each polypeptide is expressed on a luminal surface of an intestine of a filarial worm,wherein each polypeptide is expressed at a level at least two-fold higher in the intestine in comparison to the level of expression of each polypeptide in a reproductive tract or a body wall of the filarial worm,wherein each isolated polypeptide has at least one transmembrane domain, andwherein each polypeptide is a non-mitochondrial polypeptide.
  • 2. The immunogenic composition of claim 1, wherein the filarial worm is selected from the group consisting of Brugia malayi, Wuchereria bancrofti, Onchocerca volvulus, L. loa and Dirofilaria immitis.
  • 3. The immunogenic composition of claim 1, wherein each isolated polypeptide is no more than 40% identical to a human polypeptide sequence.
  • 4. The immunogenic composition of claim 1, wherein each isolated polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOS: 1-27 or an immunogenic fragment thereof.
  • 5. The immunogenic composition as in claim 1, wherein each isolated polypeptide comprises an amino acid sequence selected from the group consisting of SEQ ID NOS: 1-7, 11, 14, 15, 18, 19, 22, 23 and 26.
  • 6. The immunogenic composition as in claim 1, wherein the immunogenic composition consists essentially of five isolated polypeptides and the pharmaceutically acceptable carrier.
  • 7. The immunogenic composition as in claim 1, wherein the immunogenic composition is a vaccine.
  • 8. The immunogenic composition as in claim 1, wherein the at least two isolated polypeptides are selected from the group consisting of a cell adhesion protein, a cell signaling protein, a chaperone protein, a protease inhibitor, a protease, a UDP-glucuronosyl and a UDP glucosyl transferase.
  • 9. The immunogenic composition as in claim 1, wherein the pharmaceutically acceptable carrier is an adjuvant.
  • 10. The immunogenic composition as in claim 1, wherein the adjuvant is Freund's complete adjuvant.
  • 11. A method for preventing or treating a filarial disease comprising administering an effective amount of a vaccine comprising the immunogenic composition of claim 1.
  • 12. The method of claim 11, wherein the filarial disease is selected from the group consisting of lymphatic filariasis, river blindness, loiasis and heartworm.
  • 13. The method of claim 11, wherein the subject is a human.
  • 14. The method of claim 11, wherein the filarial disease is heartworm.
  • 15. The method of claim 11, wherein the subject is a dog.
  • 16. The method of claim 11, wherein the vaccine is subcutaneously, intradermally, orally, or nasally administered.
CROSS REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of, and relies on the filing date of, U.S. provisional patent application No. 62/135,115, filed 18 Mar. 2015, the entire disclosure of which is incorporated herein by reference.

STATEMENT OF GOVERNMENT INTEREST

The present invention arose in part from research funded by grant R073UE from the Uniformed Services University of the Health Sciences. The U.S. Government may have certain rights in this invention.

PCT Information
Filing Document Filing Date Country Kind
PCT/US16/22787 3/17/2016 WO 00
Provisional Applications (1)
Number Date Country
62135115 Mar 2015 US