The recent explosion in genomic sequencing has deposited a wealth of information in the hands of researchers. However, there is not yet a means to efficiently analyze such data to identify which antigens among many thousands are appropriate targets for vaccine development.
More than 5000 proteins are expressed during the life cycle of the Plasmodium spp. parasite. Subunit vaccines currently in development are based on a single or few antigens and may therefore, elicit too narrow a breadth of response, providing neither optimal protection nor protection on genetically diverse backgrounds. By contrast, to duplicate the protection induced by whole organism vaccination (Good, M. F. & Doolan, D. L. Immune effector mechanisms in malaria. Curr. Opin. Immunol. 11, 412-419 (1999)), a malaria vaccine targeting an unprecedented number of parasite-derived proteins through inclusion of their minimal CD8+ and CD4+ T cell epitopes in a multiepitope construct appears to be required. However, the antigens mediating whole organism induced protection are largely unknown.
Because of various factors, principally related to antigen abundance and immunodominance, not all possible antigens are recognized by natural immunity (Yewdell J W, Bennink J R. Immunodominance in major histocompatibility complex class I-restricted T lymphocyte responses. Annu. Rev. Immunol. 17, 51-88. (1999)). Various approaches have been proposed for antigen identification, including expression cloning (Kawakami, Y. & Rosenberg, S. A. Immunobiology of human melanoma antigens MART-1 and gp100 and their use for immuno-gene therapy. Int. Rev. Immunol. 14, 173-192 (1997)), elution and mass spectrometry sequencing of naturally processed MHC-bound peptides (Rotzschke, O. et al. Isolation and analysis of naturally processed viral peptides as recognized by cytotoxic T cells. Nature 348, 252-254 (1990); van Bleek, G. M. & Nathenson, S. G. Isolation of an endogenously processed immunodominant viral peptide from the class I H-2 Kb molecule. Nature 348, 213-216 (1990); Hunt, D. F. et al. Peptides presented to the immune system by the murine class II major histocompatibility complex molecule 1-Ad. Science 256, 1817-1820 (1992); Cox, A. L. et al. Identification of a peptide recognized by five melanoma-specific human cytotoxic T cell lines. Science 264, 716-719 (1994)), in vitro testing of pools of overlapping peptides (Kern, F. et al. Cytomegalovirus (CMV) Phosphoprotein 65 Makes a Large Contribution to Shaping the T Cell Repertoire in CMV-Exposed Individuals. J. Infect. Dis. 185, 1709-1716 (2002)), and reverse immunogenetics (Davenport, M. P. & Hill, A. V. Reverse immunogenetics: from HLA-disease associations to vaccine candidates. Mol. Med. Today 2, 38-45 (1996); Aidoo, M. et al. Identification of conserved antigenic components for a cytotoxic T lymphocyte-inducing vaccine against malaria. Lancet 345, 1003-1007 (1995)). However, these methods suffer from potential problems such as the repeated identification of the same (frequent/dominant) epitope, biases at the level of expansion of T cell populations, and use of clonal/oligoclonal T cells. They also tend to underestimate the complexity of responses, and are not able to analyze a large number of potential targets in the context of multiple HLA types. Finally, none of these approaches easily lends itself towards the daunting task of efficiently analyzing large amounts of genomic sequence data.
The subject invention also provides novel Plasmodium falciparum antigens that are useful in therapeutic and diagnostic applications. In various aspects, the subject invention provides embodiments such as:
g) a polynucleotide sequence encoding a polypeptide fragment of a polypeptide selected from the group consisting of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, and 27, wherein the fragment has substantially the same serologic reactivity as the native polypeptide and/or substantially the same T-cell reactivity as the native polypeptide or fragment;
Table 1 presents a summary of immune reactivities of a panel of 27 novel antigens and four known antigens.
Tables 2-6 provide peptide epitopes of P. falciparum.
Sequence ID NOs: 1-27 are amino acid sequences of novel malaria antigens.
The subject invention provides isolated and/or purified novel P. falciparum polynucleotides and fragments of these novel polynucleotides. Thus, the present invention provides isolated and/or purified polynucleotide sequences comprising:
“Nucleotide sequence”, “polynucleotide” or “nucleic acid” can be used interchangeably and are understood to mean, according to the present invention, either a double-stranded DNA, a single-stranded DNA or products of transcription of the said DNAs (e.g., RNA molecules). It should also be understood that the present invention does not relate to genomic polynucleotide sequences of P. falciparum in their natural environment or natural state. The nucleic acid, polynucleotide, or nucleotide sequences of the invention have been isolated, purified (or partially purified), by separation methods including, but not limited to, ion-exchange chromatography, molecular size exclusion chromatography, affinity chromatography, or by genetic engineering methods such as amplification, cloning, subcloning or chemical synthesis.
A homologous polynucleotide or polypeptide sequence, for the purposes of the present invention, encompasses a sequence having a percentage identity with the polynucleotide or polypeptide sequences, set forth herein, of between at least (or at least about) 20.00% to 99.99% (inclusive). The aforementioned range of percent identity is to he taken as including, and providing written description and support for, any fractional percentage, in intervals of 0.01%, between 20.00% and, up to, including 99.99%. These percentages are purely statistical and differences between two nucleic acid sequences can be distributed randomly and over the entire sequence length.
In various embodiments, homologous sequences can exhibit a percent identity of 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent with the sequences of the instant invention. Typically, the percent identity is calculated with reference to the full length, native, and/or naturally occurring polypeptide or polynucleotide (e.g., those polypeptides set forth in SEQ ID NOs: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27 or those set forth in SEQ ID NOs:28-81)). The terms “identical” or percent “identity”, in the context of two or more polynucleotide or polypeptide sequences, refer to two or more sequences or subsequences that are the same or have a specified percentage of amino acid residues that are the same, when compared and aligned for maximum correspondence over a comparison window, as measured using a sequence comparison algorithm or by manual alignment and visual inspection. Preferably, such a substitution is made in accordance with analoging principles set forth, e.g., in co-pending U.S. Ser. No. 09/260,714 filed Mar. 1, 1999 and Ser. No. 09/226,775, filed Jan. 6, 1999 and PCT application number PCT/US00/19774 each of which is hereby incorporated by reference in its entirety.
Both protein and nucleic acid sequence homologies may be evaluated using any of the variety of sequence comparison algorithms and programs known in the art. Such algorithms and programs include, but are by no means limited to, TBLASTN, BLASTP, FASTA, TFASTA, and CLUSTALW (Pearson and Lipman, 1988, Proc. Natl. Acad. Sci. USA 85(8):2444-2448; Altschul et al., 1990, J. Mol. Biol. 215(3):403-410; Thompson et al., 1994, Nucleic Acids Res. 22(2):4673-4680; Higgins et al., 1996, Methods Enzymol. 266:383-402; Altschul et al., 1990, J. Mol. Biol. 215(3):403-410; Altschul et al., 1993, Nature Genetics 3:266-272). Sequence comparisons are, typically, conducted using default parameters provided by the vendor or using those parameters set forth in the above-identified references, which are hereby incorporated by reference in their entireties. A “complementary” polynucleotide sequence, as used herein, generally refers to a sequence arising from the hydrogen bonding between a particular purine and a particular pyrimidine in double-stranded nucleic acid molecules (DNA-DNA, DNA-RNA, or RNA-RNA). The major specific pairings are guanine with cytosine and adenine with thymine or uracil. A “complementary” polynucleotide sequence may also be referred to as an “antisense” polynucleotide sequence or an “antisense” sequence.
Sequence homology and sequence identity can also be determined by hybridization studies under high stringency, intermediate stringency, and/or low stringency. Various degrees of stringency of hybridization can be employed. The more severe the conditions, the greater the complementarity that is required for duplex formation. Severity of conditions can be controlled by temperature, probe concentration, probe length, ionic strength, time, and the like. Preferably, hybridization is conducted under low, intermediate, or high stringency conditions by techniques well known in the art, as described, for example, in Keller, G. H., M. M. Manak [1987] DNA Probes, Stockton Press, New York, N.Y., pp. 169-170.
For example, hybridization of immobilized. DNA on Southern blots with 32P-labeled gene-specific probes can be performed by standard methods (Maniatis et al. [1982] Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, New York). In general, hybridization and subsequent washes can be carried out under intermediate to high stringency conditions that allow for detection of target sequences with homology to the exemplified polynucleotide sequence. For double-stranded DNA gene probes, hybridization can be carried out overnight at 20-25° C. below the melting temperature (Tm) of the DNA hybrid in 6×SSPE, 5×Denhardt's solution, 0.1% SDS, 0.1 mg/ml denatured DNA. The melting temperature is described by the following formula (Beltz et al. [1983] Methods of Enzymology, R. Wu, L. Grossman and K. Moldave [eds.] Academic Press, New York 100:266-285).
Tm=81.5° C.+16.6 Log [Na+]0.41(% G+C)−0.61(% formamide)−600/length of duplex in base pairs.
Washes are typically carried out as follows:
For oligonucleotide probes, hybridization can be carried out overnight at 10-20° C. below the melting temperature (Tm) of the hybrid in 6×SSPE, 5×Denhardt's solution, 0.1% SDS, 0.1 mg/ml denatured DNA. Tm for oligonucleotide probes can be determined by the following formula:
T
m (° C.)=2(number T/A base pairs)+4(number G/C base pairs) (Suggs et al. [1981] ICN-UCLA Symp. Dev. Biol. Using Purified Genes, D. D. Brown [ed.], Academic Press, New York, 23:683-693).
Washes can be carried out as follows:
In general, salt and/or temperature can be altered to change stringency. With a labeled DNA fragment >70 or so bases in length, the following conditions can be used:
Low: 1 or 2×SSPE, room temperature
Low: 1 or 2×SSPE, 42° C.
Intermediate: 0.2× or 1×SSPE, 65° C.
High: 0.1×SSPE, 65° C.
By way of another non-limiting example, procedures using conditions of high stringency can also be performed as follows: Pre-hybridization of filters containing DNA is carried out for 8 h to overnight at 65° C. in buffer composed of 6×SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, and 500 μg/ml denatured salmon sperm DNA. Filters are hybridized for 48 h at 65° C., the preferred hybridization temperature, in pre-hybridization mixture containing 100 μg/ml denatured salmon sperm DNA and 5-20×106 cpm of 32P-labeled probe. Alternatively, the hybridization step can be performed at 65° C. in the presence of SSC buffer, 1×SSC corresponding to 0.15M NaCl and 0.05 M Na citrate. Subsequently, filter washes can be done at 37° C. for 1 h in a solution containing 2×SSC, 0.01% PVP, 0.01% Ficoll, and 0.01% BSA, followed by a wash in 0.1×SSC at 50° C. for 45 min. Alternatively, filter washes can be performed in a solution containing 2×SSC and 0.1% SDS, or 0.5×SSC and 0.1% SDS, or 0.1×SSC and 0.1% SDS at 68° C. for 15 minute intervals. Following the wash steps, the hybridized probes are detectable by autoradiography. Other conditions of high stringency which may be used are well known in the art and as cited in Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, Second Edition, Cold Spring Harbor Press, N.Y., pp. 9.47-9.57; and Ausubel et al., 1989, Current Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, N.Y. are incorporated herein in their entirety.
Another non-limiting example of procedures using conditions of intermediate stringency are as follows: Filters containing DNA are pre-hybridized, and then hybridized at a temperature of 60° C. in the presence of a 5×SSC buffer and labeled probe. Subsequently, filters washes are performed in a solution containing 2×SSC at 50° C. and the hybridized probes are detectable by autoradiography. Other conditions of intermediate stringency which may he used are well known in the art and as cited in Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, Second Edition, Cold Spring Harbor Press, N.Y., pp. 9.47-9.57; and Ausubel et al., 1989, Current Protocols in Molecular Biology, Green Publishing Associates and Wiley Interscience, N.Y. are incorporated herein in their entirety.
Duplex formation and stability depend on substantial complementarity between the two strands of a hybrid and, as noted above, a certain degree of mismatch can be tolerated. Therefore, the probe sequences of the subject invention include mutations (both single and multiple), deletions, insertions of the described sequences, and combinations thereof, wherein said mutations, insertions and deletions permit formation of stable hybrids with the target polynucleotide of interest. Mutations, insertions and deletions can be produced in a given polynucleotide sequence in many ways, and these methods are known to an ordinarily skilled artisan. Other methods may become known in the future. It is also well known in the art that restriction enzymes can be used to obtain functional fragments of the subject DNA sequences. For example, Bal31 exonuclease can be conveniently used for time-controlled limited digestion of DNA (commonly referred to as “erase-a-base” procedures). See, for example, Maniatis et al. [1982] Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, New York; Wei et al. [1983] J. Biol. Chem. 258:13006-13512.
The present invention further comprises fragments of the polynucleotide sequences of the instant invention. Representative fragments of the polynucleotide sequences according to the invention will be understood to mean any nucleotide fragment having at least 8 successive nucleotides, preferably at least 12 successive nucleotides, and still more preferably at least 15 or at least 20 successive nucleotides of the sequence from which it is derived. The upper limit for such fragments is the total number of polynucleotides found in the full length sequence (or, in certain embodiments, of the full length open reading frame (ORF) identified herein).
In some embodiments, the subject invention includes those fragments capable of hybridizing under various conditions of stringency conditions (e.g., high or intermediate or low stringency) with a nucleotide sequence according to the invention; fragments that hybridize with a nucleotide sequence of the subject invention can be, optionally, labeled as set forth below.
Other embodiments provide for nucleic acid fragments corresponding to nucleotide sequences comprising full, or partial, open reading frames (ORF sequences). Also within the scope of the invention are those polynucleotide fragments encoding polypeptides reactive with antibodies found in the serum of individuals infected with P. falciparum. Fragments according to the subject invention can be obtained, for example, by specific amplification (e.g., PCR amplification), digestion with restriction enzymes, of nucleotide sequences according to the invention. Such methodologies are well-known in the art and are taught, for example, by Sambrook et al., 1989. Nucleic acid fragments according to the invention can also be obtained by chemical synthesis according to methods well known to persons skilled in the art.
The subject invention also provides nucleic acid based methods for the identification of the presence of an organism in a sample. In these varied embodiments, the invention provides for the detection of nucleic acids in a sample comprising contacting a sample with a nucleic acid (polynucleotide) of the subject invention (such as an RNA, mRNA, DNA, cDNA, or other nucleic acid). In a preferred embodiment, the polynucleotide is a probe that is, optionally, labeled and used in the detection system. Many methods for detection of nucleic acids exist and any suitable method for detection is encompassed by the instant invention. Typical assay formats utilizing nucleic acid hybridization includes, and are not limited to, 1) nuclear run-on assay, 2) slot blot assay, 3) northern blot assay (Alwine, et al. Proc. Natl. Acad. Sci. 74:5350), 4) magnetic particle separation, 5) nucleic Acid or DNA chips, 6) reverse Northern blot assay, 7) dot blot assay, 8) in situ hybridization, 9) RNase protection assay (Melton, et al. Nuc. Acids Res. 12:7035 and as described in the 1998 catalog of Ambion, Inc., Austin, Tex.), 10) ligase chain reaction, 11) polymerase chain reaction (PCR), 12) reverse transcriptase (RT)-PCR (Berchtold, et al. Nuc. Acids. Res. 17:453), 13) differential display RT-PCR (DDRT-PCR) or other suitable combinations of techniques and assays. Labels suitable for use in these detection methodologies include, and are not limited to 1) radioactive labels, 2) enzyme labels, 3) chemiluminescent labels, 4) fluorescent labels, 5) magnetic labels, or other suitable labels, including those set forth below. These methodologies and labels are well known in the art and widely available to the skilled artisan. Likewise, methods of incorporating labels into the nucleic acids are also well known to the skilled artisan.
Thus, the subject invention also provides detection probes (e.g., fragments of the disclosed polynucleotide sequences) for hybridization with a target sequence or the amplicon generated from the target sequence. Such a detection probe will advantageously have as sequence a sequence of at least 8, 9, 10, 11, 12, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, or 100 nucleotides. Labeled probes or primers are labeled with a radioactive compound or with another type of label as set forth above. Alternatively, non-labeled nucleotide sequences may be used directly as probes or primers; however, the sequences are generally labeled with a radioactive element (32P, 35S, 3H, 125I) or with a molecule such as biotin, acetylaminofluorene, digoxigenin, 5-bromo-deoxyuridine, or fluorescein to provide probes that can be used in numerous applications.
The polynucleotide sequences according to the invention may also he used in analytical systems, such as DNA chips. DNA chips and their uses are well known in the art and (see for example, U.S. Pat. Nos. 5,561,071; 5,753,439; 6,214,545; Schena et al., BioEssays, 1996, 18:427-431; Bianchi et al., Clin. Diagn. Virol., 1997, 8:199-208; each of which is hereby incorporated by reference in their entireties) and/or are provided by commercial vendors such as Affymetrix, Inc. (Santa Clara, Calif.). In addition, the nucleic acid sequences of the subject invention can be used as molecular weight markers in nucleic acid analysis procedures.
The subject invention also provides for modified nucleotide sequences. Modified nucleic acid sequences will be understood to mean any nucleotide sequence that has been modified, according to techniques well known to persons skilled in the art, and exhibiting modifications in relation to the native, naturally occurring nucleotide sequences. One non-limiting example of a “modified” nucleotide sequences includes mutations in regulatory and/or promoter sequences of a polynucleotide sequence that result in a modification of the level of expression of the polypeptide. A “modified” nucleotide sequence will also be understood to mean any nucleotide sequence encoding a “modified” polypeptide as defined below.
Another aspect of the invention provides vectors for the cloning and/or the expression of a polynucleotide sequence taught herein. Vectors of this invention, including vaccine vectors, can also comprise elements necessary to allow the expression and/or the secretion of the said nucleotide sequences in a given host cell. The vector can contain a promoter, signals for initiation and for termination of translation, as well as appropriate regions for regulation of transcription. In certain embodiments, the vectors can be stably maintained in the host cell and can, optionally, contain signal sequences directing the secretion of translated protein. These different elements are chosen according to the host cell used. Vectors can integrate into the host genome or, optionally, be autonomously-replicating vectors.
The subject invention also provides for the expression of a polypeptide, peptide, derivative, or variant encoded by a polynucleotide sequence disclosed herein comprising the culture of an organism transformed with a polynucleotide of the subject invention under conditions that allow for the expression of the polypeptide, peptide, derivative, or analog and, optionally, recovering the expressed polypeptide, peptide, derivative, or analog.
The disclosed polynucleotide sequences can also be regulated by a second nucleic acid sequence so that the protein or peptide is expressed in a host transformed with the recombinant DNA molecule. For example, expression of a protein or peptide may be controlled by any promoter/enhancer element known in the art. Promoters which may be used to control expression include, but are not limited to, the CMV-IE promoter, the SV40 early promoter region (Bernoist and Chambon, 1981, Nature 290:304-310), the promoter contained in the 3′ long terminal repeat of Rous sarcoma virus (Yamamoto, et al., 1980, Cell 22:787-797), the herpes simplex thymidine kinase promoter (Wagner et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the regulatory sequences of the metallothionein gene (Brinster et al., 1982, Nature 296:39-42); prokaryotic vectors containing promoters such as the β-lactamase promoter (Villa-Kamaroff, et al., 1978, Proc. Natl. Acad. Sci. U.S.A. 75:3727-3731), or the tac promoter (DeBoer, et al., 1983, Proc. Natl. Acad. Sci. U.S.A. 80:21-25); see also “Useful proteins from recombinant bacteria” in Scientific American, 1980, 242:74-94; plant expression vectors comprising the nopaline synthetase promoter region (Herrera-Estrella et al., 1983, Nature 303:209-213) or the cauliflower mosaic virus 35S RNA promoter (Gardner, et al., 1981, Nucl. Acids Res. 9:2871), and the promoter of the photosynthetic enzyme ribulose biphosphate carboxylase (Herrera-Estrella et al., 1984, Nature 310:115-120); promoter elements from yeast or fungi such as the Gal 4 promoter, the ADC (alcohol dehydrogenase) promoter, PGK (phosphoglycerol kinase) promoter, and/or the alkaline phosphatase promoter.
The vectors according to the invention are, for example, vectors of plasmid or viral origin. In a specific embodiment, a vector is used that comprises a promoter operably linked to a protein or peptide-encoding nucleic acid sequence contained within the disclosed polynucleotide sequences, one or more origins of replication, and, optionally, one or more selectable markers (e.g., an antibiotic resistance gene). Expression vectors comprise regulatory sequences that control gene expression, including gene expression in a desired host cell. Exemplary vectors for the expression of the polypeptides of the invention include the pET-type plasmid vectors (Promega) or pBAD plasmid vectors (Invitrogen) or those provided in the examples below. Furthermore, the vectors according to the invention are useful for transforming host cells so as to clone or express the polynucleotide sequences of the invention.
The invention also encompasses the host cells transformed by a vector according to the invention. These cells may be obtained by introducing into host cells a nucleotide sequence inserted into a vector as defined above, and then culturing the said cells under conditions allowing the replication and/or the expression of the polynucleotide sequences of the subject invention.
The host cell may be chosen from eukaryotic or prokaryotic systems, such as for example bacterial cells, (Gram negative or Gram positive), yeast cells (for example, Saccharomyces cereviseae or Pichia pastoris), animal cells (such as Chinese hamster ovary (CHO) cells), plant cells, and/or insect cells using baculovirus vectors. In some embodiments, the host cells for expression of the polypeptides include, and are not limited to, those taught in U.S. Pat. Nos. 6,319,691, 6,277,375, 5,643,570, or 5,565,335, each of which is incorporated by reference in its entirety, including all references cited within each respective patent.
Furthermore, a host cell strain may be chosen which modulates the expression of the inserted sequences, or modifies and processes the gene product in the specific fashion desired. Expression from certain promoters can be elevated in the presence of certain inducers; thus, expression of the genetically engineered polypeptide may be controlled. Furthermore, different host cells have characteristic and specific mechanisms for the translational and post-translational processing and modification (e.g., glycosylation, phosphorylation) of proteins. Appropriate cell lines or host systems can be chosen to ensure the desired modification and processing of the foreign protein expressed. For example, expression in a bacterial system can be used to produce an unglycosylated core protein product. Expression in yeast will produce a glycosylated product. Expression in mammalian cells can be used to ensure “native” glycosylation of a heterologous protein. Furthermore, different vector/host expression systems may effect processing reactions to different extents.
The subject invention also concerns novel compositions that can be employed to elicit an immune response or a protective immune response. In this aspect of the invention, an amount of a composition comprising recombinant DNA or mRNA encoding an polynucleotide of the subject invention sufficient to elicit an immune response or protective immune response is administered to an individual. Signal sequences may be deleted from the nucleic acid encoding an antigen of interest and the individual may be monitored for the induction of an immune response according to methods known in the art. A “protective immune response” or “therapeutic immune response” refers to a CTL (or CD8+ T cell) and/or an HTL (or CD4+ T cell) response to an antigen that, in some way, prevents or at least partially arrests disease symptoms, side effects or progression. The immune response may also include an antibody response that has been facilitated by the stimulation of helper T cells.
In another embodiment, the subject invention further comprises the administration of polynucleotide vaccines in conjunction with a polypeptide antigen, or composition thereof, of the invention. In a preferred embodiment, the antigen is the polypeptide that is encoded by the polynucleotide administered as the polynucleotide vaccine. As a particularly preferred embodiment, the polypeptide antigen is administered as a booster subsequent to the initial administration of the polynucleotide vaccine.
A further embodiment of the subject invention provides for the induction of an immune response to the novel Plasmodium falciparum antigens disclosed herein (see, for example, the antigens and peptides set forth in the Tables and Sequence Listing attached hereto) using a “prime-boost” vaccination regimen known to those skilled in the art. In this aspect of the invention, a DNA vaccine is administered to an individual in an amount sufficient to “prime” the immune response of the individual, provided that the DNA vaccine comprises nucleic acids encoding the antigens, multi-epitope constructs, and/or peptide antigens set forth herein. The immune response of the individual is then “boosted” via the administration of: 1) one or a combination of: a peptide, polypeptide, and/or full length polypeptide antigen (e.g., SEQ ID NOs: 1-27) of the subject invention (optionally in conjunction with a immunostimulatory molecule and/or an adjuvant); or 2) a viral vector that contains nucleic acid encoding one, or more, of the same or, optionally, different, antigens, multi-epitope constructs, and/or peptide antigens set forth in the Tables or Sequence Listing of the subject application. In some alternative embodiments of the invention, a gene encoding an immunostimulatory molecule may be incorporated into the viral vector used to “boost the immune response of the individual. Exemplary immunostimulatory molecules include, and are not limited to, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, 1L-9, IL-10, IL-11, IL-15, 11-16, 11-18, IL-23, IL-24, erythropoietin, G-CSF, M-CSF, platelet derived growth factor (PDGF), MSF, FLT-3 ligand, EGF, fibroblast growth factor (FGF; e.g., aFGF (FGF-1), bFGF (FGF-2), FGF-3, FGF-4, FGF-5, FGF-6, or FGF-7), insulin-like growth factors (e.g., IGF-1, IGF-2); vascular endothelial growth factor (VEGF); interferons (e.g., IFN-γ, IFN-α, IFN-β); leukemia inhibitory factor (LIF); ciliary neurotrophic factor (CNTF); oncostatin M; stem cell factor (SCF); transforming growth factors (e.g., TGF-a, TGF-β1, TGF-β1, TGF-β1), or chemokines (such as, but not limited to, BCA-1/BLC-1, BRAK/Kec, CXCL16, CXCR3, ENA-78/LIX, Eotaxin-1, Eotaxin-2/MPIF-2, Exodus-2/SLC, Fractalkine/Neurotactin, GROalpha/MGSA, HCC-1, I-TAC, Lymphotactin/ATAC/SCM, MCP-1/MCAF, MCP-3, MCP-4, MDC/STCP-1, ABCD-1, MIP-1α, MIP-1β, MIP-2α/GROβ, MIP-3α/Exodus/LARC, MIP-3β/Exodus-3/ELC, MIP-4/PARC/DC-CK1, PF-4, RANTES, SDF1α, TARC, or TECK). Genes encoding these immunostimulatory molecules are known to those skilled in the art and coding sequences may be obtained from a variety of sources, including various patents databases, publicly available databases (such as the nucleic acid and protein databases found at the National Library of Medicine or the European Molecular Biology Laboratory), the scientific literature, or scientific literature cited in catalogs produced by companies such as Genzyme, Inc., R&D Systems, Inc, or InvivoGen, Inc. [see, for example, the 1995 Cytokine Research Products catalog, Genzyme Diagnostics, Genzyme Corporation, Cambridge Mass.; 2002 or 1995 Catalog of R&D Systems, Inc (Minneapolis, Minn.); or 2002 Catalog of InvivoGen, Inc (San Diego, Calif.) each of which is incorporated by reference in its entirety, including all references cited therein].
Methods of introducing DNA vaccines into individuals are well-known to the skilled artisan. For example, DNA can be injected into skeletal muscle or other somatic tissues (e.g., intramuscular injection). Cationic liposomes or biolistic devices, such as a gene gun, can be used to deliver DNA vaccines. Alternatively, iontophoresis and other means for transdermal transmission can be used for the introduction of DNA vaccines into an individual.
Viral vectors for use in the subject invention can have a portion of the viral genome deleted to introduce new genes without destroying infectivity of the virus. The viral vector of the present invention is, typically, a non-pathogenic virus. At the option of the practitioner, the viral vector can be selected so as to infect a specific cell type, such as professional antigen presenting cells (e.g., macrophage or dendritic cells). Alternatively, a viral vector can be selected that is able to infect any cell in the individual. Exemplary viral vectors suitable for use in the present invention include, but are not limited to poxvirus such as vaccinia virus, avipox virus, fowlpox virus, a highly attenuated vaccinia virus (such as Ankara or MVA [Modified Vaccinia Ankara]), retrovirus, adenovirus, baculovirus and the like. In a preferred embodiment, the viral vector is Ankara or MVA. General strategies for construction of vaccinia virus expression vectors are known in the art (see, for example, Smith and Moss Bio Techniques November/December, 306-312, 1984; U.S. Pat. No. 4,738,846 (hereby incorporated by reference in its entirety). Sutter and Moss (Proc. Nat'l. Acad. Sci U.S.A. 89:10847-10851, 1992) and Sutter et al. (Vaccine, 12(11):1032-40, 1994) disclose the construction and use as a vector, a non-replicating recombinant Ankara virus (MVA) which can be used as a viral vector in the present invention. Other versions of the Modified Vaccinia Ankara strain can also be used in the practice of the subject invention (such as the MVA-BN strain produced by Bavarian Nordic S/A (Copenhagen, Denmark).
Compositions comprising the subject polynucleotides can include appropriate nucleic acid vaccine vectors (plasmids), which are commercially available (e.g., Vical, San Diego, Calif.) or other nucleic acid vectors (plasmids), which are also commercially available (e.g., Valenti, Burlingame, Calif.). Alternatively, compositions comprising viral vectors and polynucleotides according to the subject invention are provided by the subject invention. In addition, the compositions can include a pharmaceutically acceptable carrier, e.g., saline. The pharmaceutically acceptable carriers are well known in the art and also are commercially available. For example, such acceptable carriers are described in E. W. Martin's Remington's Pharmaceutical Science, Mack Publishing Company, Easton, Pa.
The subject invention also provides one or more isolated polypeptides comprising:
The term “peptide” may be used interchangeably with “oligopeptide” or “polypeptide” or “epitope” in the present specification to designate a series of residues, typically L-amino acids, connected one to the other, typically by peptide bonds between the α-amino and carboxyl groups of adjacent amino acids. The preferred CTL (or CD8+ T cell)-inducing peptides of the invention are 13 residues or less in length and usually consist of between about 8 and about 11 residues (e.g., 8, 9, 10 or 11 residues), preferably 9 or 10 residues. The preferred HTL (or CD4+ T cell)-inducing peptides are less than about 50 residues in length and usually consist of between about 6 and about 30 residues, more usually between about 12 and 25 (e.g., 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or 25), and often between about 15 and 20 residues (e.g., 15, 16, 17, 18, 19 or 20).
According to the subject invention, a “fragment” is a polypeptide of at least 3 consecutive, preferably 4 consecutive, and even more preferably 5 consecutive amino acids. In some embodiments, the polypeptide fragments are reactive with antibodies found in the serum of an individual. In other embodiments, a fragment is an “epitope” as described supra. In the context of the instant invention, the terms polypeptide, peptide and protein can be used interchangeably; however, it should be understood that the invention does not relate to the polypeptides in natural form, that is to say that they are not in their natural environment but that the polypeptides may have been isolated or obtained by purification from natural sources, obtained from host cells prepared by genetic manipulation (e.g., the polypeptides, or fragments thereof, are recombinantly produced by host cells, or by chemical synthesis). Polypeptides according to the instant invention may also contain non-natural amino acids, as will be described below.
A “variant” or “modified” polypeptide (or polypeptide variant) is to be understood to designate polypeptides exhibiting, in relation to the natural polypeptide, certain modifications. These modifications can include a deletion, addition, or substitution of at least one amino acid, a truncation, an extension, a chimeric fusion, a mutation, or polypeptides exhibiting post-translational modifications. Among the homologous polypeptides, those whose amino acid sequences exhibit between at least (or at least about) 20.00% to 99.99% (inclusive) identity to the full length, native, or naturally occurring polypeptide are another aspect of the invention. The aforementioned range of percent identity is to be taken as including, and providing written description and support for, any fractional percentage, in intervals of 0.01%, between 20.00% and, up to, including 99.99%. These percentages are purely statistical and differences between two polypeptide sequences can be distributed randomly and over the entire sequence length.
Variant peptides (epitopes) can also be created by altering the presence or absence of particular residues in these primary anchor positions. Such analogs are used to modulate the binding affinity of a peptide comprising a particular motif or supermotif. The term “motif' refers to the pattern of residues in a peptide of defined length, usually a peptide of from about 8 to about 13 amino acids for a class I HLA motif (e.g., 8, 9, 10, 11, 12 or 13 aa) and from about 6 to about 25 amino acids for a class II HLA motif (e.g., 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24 or 25 amino acids), which is recognized by a particular HLA molecule. Peptide motifs are typically different for each protein encoded by each human HLA allele and differ in the pattern of the primary and secondary anchor residues. Optionally, variant peptides or polypeptides can also comprise one or more heterologous polypeptide sequences (e.g., tags that facilitate purification of the polypeptides of the invention (see, for example, U.S. Pat. No. 6,342,362, hereby incorporated by reference in its entirety; Altendorf et al. [1999-WWW, 2000] “Structure and Function of the Fo Complex of the ATP Synthase from Escherichia Coli,” J. of Experimental Biology 203:19-28, The Co. of Biologists, Ltd., G. B.; Baneyx [1999] “Recombinant Protein Expression in Escherichia coli,” Biotechnology 10:411-21, Elsevier Science Ltd.; Eihauer et al. [2001] “The FLAG™ Peptide, a Versatile Fusion Tag for the Purification of Recombinant Proteins,” J. Biochem Biophys Methods 49:455-65; Jones et al. [1995] J. Chromatography 707:3-22; Jones et al. [1995] “Current Trends in Molecular Recognition and Bioseparation,” J. of Chromatography A. 707:3-22, Elsevier Science B. V.; Margolin [2000] “Green Fluorescent Protein as a Reporter for Macromolecular Localization in Bacterial Cells,” Methods 20:62-72, Academic Press; Puig et al. [2001] “The Tandem Affinity Purification (TAP) Method: A General Procedure of Protein Complex Purification,” Methods 24:218-29, Academic Press; Sassenfeld [1990] “Engineering Proteins for Purification,” TibTech 8:88-93; Sheibani [1999] “Prokaryotic Gene Fusion Expression Systems and Their Use in Structural and Functional Studies of Proteins,” Prep. Biochem. & Biotechnol. 29(1):77-90, Marcel Dekker, Inc.; Skerra et al. [1999] “Applications of a Peptide Ligand for Streptavidin: the Strep-tag”, Biomolecular Engineering 16:79-86, Elsevier Science, B. V.; Smith [1998] “Cookbook for Eukaryotic Protein Expression: Yeast, Insect, and Plant Expression Systems,” The Scientist 12(22):20; Smyth et al. [200] “Eukaryotic Expression and Purification of Recombinant Extracellular Matrix Proteins Carrying the Strep II Tag”, Methods in Molecular Biology, 139:49-57; Unger [1997] “Show Me the Money: Prokaryotic Expression Vectors and Purification Systems,” The Scientist 11(17):20, each of which is hereby incorporated by reference in their entireties), or commercially available tags from vendors such as such as STRATAGENE (La Jolla, Calif.), NOVAGEN (Madison, Wis.), QIAGEN, Inc., (Valencia, Calif.), or InVitrogen (San Diego, Calif.).
Variant polypeptides can, alternatively, have 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 percent identity with the polypeptide sequences of the instant invention. In a preferred embodiment, a variant or modified polypeptide exhibits approximately 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identity to a natural polypeptic of the invention. Typically, the percent identity is calculated with reference to the full length, native, and/or naturally occurring polypeptide (e.g., those polypeptides set forth in SEQ ID NOs: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, or 27).
The nomenclature used to describe peptide compounds follows the conventional practice wherein the amino group is presented to the left (the N-terminus) and the carboxyl group to the right (the C-terminus) of each amino acid residue. When amino acid residue positions are referred to in an epitope, they are numbered in an amino to carboxyl direction with position one being the position closest to the amino terminal end of the epitope, or the peptide or protein of which it may be a part. In the formulae representing selected specific embodiments of the present invention, the amino- and carboxyl-tenninal groups, although not specifically shown, are in the form they would assume at physiologic pH values, unless otherwise specified. In the amino acid structure formulae, each residue is generally represented by standard three-letter or single-letter designations (e.g., as set forth infra). By way of example, amino acid substitutions can be carried out without resulting in a substantial modification of the biological activity of the corresponding modified polypeptides; for example, the replacement of leucine with valine or isoleucine, of aspartic acid with glutamic acid, of glutamine with asparagine, of arginine with lysine, and the like, the reverse substitutions can be performed without substantial modification of the biological activity of the polypeptides.
The L-form of an amino acid residue is represented by a capital single letter or a capital first letter of a three-letter symbol, and the D-form, for those amino acids having D-forms, is represented by a lower case single letter or a lower case three letter symbol. Glycine has no asymmetric carbon atom and is simply referred to as “Gly” or G. Symbols for the amino acids are as follows: (Single Letter Symbol; Three Letter Symbol Amino Acid) A; Ala; Alanine: C; Cys; Cysteine: D; Asp; Aspartic Acid: E; Glu; Glutamic Acid: F; Phe; Phenylalanine: G; Gly; Glycine: H; His; Histidine: I; Ile; Isoleucine: K; Lys; Lysine: L; Leu; Leucine: M; Met; Methionine: N; Asn; Asparagine: P; Pro; Proline: Q; Gln; Glutamine: R; Arg; Arginine: S; Ser; Serine: T; Thr; Threonine: V; Val; Valine: W; Trp; Tryptophan: Y; Tyr; Tyrosine.
Amino acid “chemical characteristics” are defined as: Aromatic (F, W, Y); Aliphatic-hydrophobic (L, I, V, M); Small polar (S, T, C); Large polar (Q, N); Acidic (D, E); Basic (R, H, K); Non-polar: Proline; Alanine; and Glycine.
In order to extend the life of the polypeptides according to the invention, it may be advantageous to use non-natural amino acids, for example in the D-form, or alternatively amino acid analogs, for example sulfur-containing forms of amino acids in the production of “variant polypeptides”. Alternative means for increasing the life of polypeptides can also he used in the practice of the instant invention. For example, polypeptides of the invention, and fragments thereof, can be recombinantly modified to include elements that increase the plasma, or serum half-life of the polypeptides of the invention. These elements include, and are not limited to, antibody constant regions (see for example, U.S. Pat. No. 5,565,335, hereby incorporated by reference in its entirety, including all references cited therein), or other elements such as those disclosed in U.S. Pat. Nos. 6,319,691, 6,277,375, or 5,643,570, each of which is incorporated by reference in its entirety, including all references cited within each respective patent. Alternatively, the polynucleotides and genes of the instant invention can be recombinantly fused to elements, well known to the skilled artisan, that are useful in the preparation of immunogenic constructs for the purposes of vaccine formulation.
The subject invention also provides biologically active fragments (epitopes) of a polypeptide according to the invention and includes those peptides capable of eliciting an immune response directed against P. falciparum, said immune response providing components (B-cells, antibodies, and/or or components of the cellular immune response (e.g., helper, cytotoxic, and/or suppressor T-cells)) reactive with the biologically active fragment of a polypeptide; the intact, full length, unmodified polypeptide disclosed herein; or both the biologically active fragment of a polypeptide and the intact, full length, unmodified polypeptides disclosed herein.
Fragments, as described herein, can be obtained by cleaving the polypeptides of the invention with a proteolytic enzyme (such as trypsin, chymotrypsin, or collagenase) or with a chemical reagent, such as cyanogen bromide (CNBr). Alternatively, polypeptide fragments can be generated in a highly acidic environment, for example at pH 2.5. Such polypeptide fragments may be equally well prepared by chemical synthesis or using hosts transformed with an expression vector according to the invention. The transformed host cells contain a nucleic acid, allowing the expression of these fragments, under the control of appropriate elements for regulation and/or expression of the polypeptide fragments.
In one embodiment, the subject invention provides methods for eliciting an immune response in an individual comprising the administration of compositions comprising polypeptides according to the subject invention to an individual in amounts sufficient to induce an immune response in the individual. In some embodiments, a “protective” or “therapeutic immune response” is induced in the individual. A “protective immune response” or “therapeutic immune response” refers to a CTL (or CD8+ T cell) and/or an HTL (or CD4+ T cell), and/or an antibody response to an antigen derived from an infectious agent or a tumor antigen, which in some way prevents or at least partially arrests disease symptoms, side effects or progression. The protective immune response may also include an antibody response that has been facilitated by the stimulation of helper T cells (or CD4+ T cells). Additional methods of inducing an immune response in an individual are taught in U.S. Pat. No. 6,419,931, hereby incorporated by reference in its entirety. The term CTL can be used interchangeably with CD8+ T-cell(s) and the term HTL can be used interchangeably with CD4+ T-cell(s) throughout the subject application.
The term “individual” includes mammals which include, and are not limited to, apes, chimpanzees, orangutans, humans, monkeys or domesticated animals (pets) such as dogs, cats, guinea pigs, hamsters, Vietnamese pot-bellied pigs, rabbits, ferrets, cows, horses, goats and sheep. In a preferred embodiment, the methods of inducing an immune response contemplated herein are practiced on humans.
Another embodiment of the subject invention provides methods of inducing an immune response in an individual comprising the administration of a composition comprising polypeptides encoded by the polynucleotides of the subject invention in amounts sufficient to induce an immune response. In some embodiments of the invention, the immune response provides protective immunity. The composition administered to the individual may, optionally, contain an adjuvant and may be delivered in any manner known in the art for the delivery of immunogen to a subject. Compositions may also be formulated in any carriers, including for example, pharmaceutically acceptable carriers such as those described in E. W. Martin's Remington's Pharmaceutical Science, Mack Publishing Company, Easton, Pa. In a preferred embodiment, compositions may be formulated in incomplete Freund's adjuvant. In various embodiments, the subject invention provides for diagnostic assays based upon Western blot formats or standard immunoassays known to the skilled artisan. For example, antibody-based assays such as enzyme linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), lateral flow assays, immunochromatographic strip assays, automated flow assays, and assays utilizing antibody-containing biosensors may be employed for the detection of the polypeptides, and fragments thereof, provided by the subject invention. The assays and methods for conducting the assays are well-known in the art and the methods may test biological samples qualitatively (presence or absence of polypeptide) or quantitatively (comparison of a sample against a standard curve prepared using a polypeptide of the subject invention) for the presence of one or more polypeptide of the subject invention. Thus, the subject invention provides a method of detecting a P. falciparum polypeptide, or fragment thereof, comprising contacting a sample with an antibody that specifically binds to a polypeptide, or fragment thereof, comprising SEQ ID NOs: 1-26, or 27 and detecting the presence of an antibody-antigen complex.
The antibody-based assays can be considered to be of four types: direct binding assays, sandwich assays, competition assays, and displacement assays. In a direct binding assay, either the antibody or antigen is labeled, and there is a means of measuring the number of complexes formed. In a sandwich assay, the formation of a complex of at least three components (e.g., antibody-antigen-antibody) is measured. In a competition assay, labeled antigen and unlabelled antigen compete for binding to the antibody, and either the bound or the free component is measured. In a displacement assay, the labeled antigen is pre-bound to the antibody, and a change in signal is measured as the unlabelled antigen displaces the bound, labeled antigen from the receptor.
Lateral flow assays can be conducted according to the teachings of U.S. Pat. No. 5,712,170 and the references cited therein. U.S. Pat. No. 5,712,170 and the references cited therein are hereby incorporated by reference in their entireties. Displacement assays and flow immunosensors useful for carrying out displacement assays are described in: (1) Kusterbeck et al., “Antibody-Based Biosensor for Continuous Monitoring”, in Biosensor Technology, R. P. Buck et al., eds., Marcel Dekker, N.Y. pp. 345-350 (1990); Kusterbeck et al., “A Continuous Flow Immunoassay for Rapid and Sensitive Detection of Small Molecules”, Journal of Immunological Methods, vol. 135, pp. 191-197 (1990); Ligler et al., “Drug Detection Using the Flow Immunosensor”, in Biosensor Design and Application, J. Findley et al., eds., American Chemical Society Press, pp. 73-80 (1992); and Ogert et al., “Detection of Cocaine Using the Flow Immunosensor”, Analytical Letters, vol. 25, pp. 1999-2019 (1992), all of which are incorporated herein by reference in their entireties. Displacement assays and flow immunosensors are also described in U.S. Pat. No. 5,183,740, which is also incorporated herein by reference in its entirety. The displacement immunoassay, unlike most of the competitive immunoassays used to detect small molecules, can generate a positive signal with increasing antigen concentration. One aspect of the invention allows for the exclusion of Western blots as a diagnostic assay, particularly where the Western blot is a screen of whole cell lysates of P. falciparum, or related organisms, against immune serum of infected individuals. In another aspect of the invention, peptide, or polypeptide, based diagnostic assays utilize P. falciparum peptides or polypeptides that have been produce either by chemical peptide synthesis or by recombinant methodologies that utilize non-plasmodium host cells for the production of peptides or polypeptides.
Another aspect of the invention provides for the use of peptides, polypeptides, and multi-epitope constructs in assays such as those taught in U.S. Pat. No. 5,635,363, which is hereby incorporated by reference in its entirety. Briefly, peptides, polypeptides, and multi-epitope constructs of the subject invention can be used to form stable multimeric complexes that comprise prepared major histocompatibility complex (MHC) protein subunits having a substantially homogeneous bound peptide population. The multimeric MHC-antigen complex forms a stable structure with T cells recognizing the complex through their antigen receptor, thereby allowing for the labeling, identification and separation of specific T cells. The multimeric binding complex has the formula (α-β-P)n, where n≧2, usually n≧4, and usually n≧10; α is an α chain of a class I or class II MHC protein. β is a β chain, (the β chain of a class II MHC protein or β2microglobulin for a MHC class I protein; and P is a peptide antigen. The multimeric complex stably binds through non-covalent interactions to a T cell receptor having the appropriate antigenic specificity. The MHC proteins may be from any individual. Of particular interest are the human HLA proteins. Included in the HLA proteins are the class II subunits HLA-DPα, HLA-DPβ, HLA-DQα, HLA-DQβ, HLA-DRα and HLA-DRβ, and the class I proteins HLA-A, HLA-B, HLA-C, and β2-microglobulin. In a preferred embodiment, the MHC protein subunits are a soluble foim of the normally membrane-bound protein. The soluble form is derived from the native form by deletion of the transmembrane domain. Conveniently, the protein is truncated, removing both the cytoplasmic and transmembrane domains. The protein may be truncated by proteolytic cleavage, or by expressing a genetically engineered truncated form. For class I proteins, the soluble form will include the α1, α2 and α3 domain. Not more than about 10, usually not more than about 5, preferably none of the amino acids of the transmembrane domain will be included. The deletion may extend as much as about 10 amino acids into the α3 domain, preferably none of the amino acids of the α3 domain will be deleted. The deletion will be such that it does not interfere with the ability of the α3 domain to fold into a disulfide bonded structure. The class I β chain, β2-microglobulin, lacks a transmembrane domain in its native form, and need not be truncated. Generally, no Class II subunits will be used in conjunction with Class I subunits. Soluble class II subunits will include the α1 and α2 domains for the α subunit, and the β1 and β2 domains for the β subunit. Not more than about 10, usually not more than about 5, preferably none of the amino acids of the transmembrane domain will be included. The deletion may extend as much as about 10 amino acids into the α2 or β2 domain, preferably none of the amino acids of the β2 or β2 domain will be deleted. The deletion will be such that it does not interfere with the ability of the α2 or β2 domain to fold into a disulfide bonded structure. The monomeric complex (α-β-P) (monomer) is multimerized. The resulting multimer will be stable over long periods of time. Usually not more than about 10% of the multimer will be dissociated after storage at 4° C. for about one day, more usually after about one week. Preferably, the multimer will be formed by binding the monomers to a multivalent entity through specific attachment sites on the α or β subunit, as described below in detail. The multimer may also be formed by chemical cross-linking of the monomers. A number of reagents capable of cross-linking proteins are known in the art, illustrative entities include: azidobenzoyl hydrazide, N-[4-(p-azidosalicylamino)butyl]-3′-[2′-pyridyldithio]propionamide), bis-sulfosuccinimidyl suberate, dimethyladipimidate, disuccinimidyltartrate, N-.gamma.-maleimidobutyryloxysuccinimide ester, N-hydroxy sulfosuccinimidyl-4-azidobenzoate, N-succinimidyl [4-azidophenyl]-1,3′-dithiopropionate, N-succinimidyl [4-iodoacetyl]aminobenzoate, glutaraldehyde, formaldehyde and succinimidyl 4-[N-maleimidomethyl]cyclohexane-1 -carboxylate.
The attachment site for binding to a multivalent entity may be naturally occurring, or may be introduced through genetic engineering. The site will be a specific binding pair member or one that is modified to provide a specific binding pair member, where the complementary pair has a multiplicity of specific binding sites. Binding to the complementary binding member can be a chemical reaction, epitope-receptor binding or hapten-receptor binding where a hapten is linked to the subunit chain. In a preferred embodiment, one of the subunits is fused to an amino acid sequence providing a recognition site for a modifying enzyme. The recognition sequence will usually be fused proximal to the carboxy terminus of one of the subunit to avoid potential hindrance at the antigenic peptide binding site. Conveniently, an expression cassette will include the sequence encoding the recognition site.
Modifying enzymes of interest include BirA, various glycosylases, farnesyl protein transferase, protein kinases and the like. The subunit may be reacted with the modifying enzyme at any convenient time, usually after formation of the monomer. The group introduced by the modifying enzyme, e.g. biotin, sugar, phosphate, farnesyl, etc. provides a complementary binding pair member, or a unique site for further modification, such as chemical cross-linking, biotinylation, etc. that will provide a complementary binding pair member. An alternative strategy is to introduce an unpaired cysteine residue to the subunit, thereby introducing a unique and chemically reactive site for binding. The attachment site may also be a naturally occurring or introduced epitope, where the multivalent binding partner will be an antibody, e.g. IgG, IgM, etc. Any modification will be at a site, e.g. C-terminal proximal, that will not interfere with binding.
Exemplary of multimer formation is the introduction of the recognition sequence for the enzyme BirA, which catalyzes biotinylation of the protein substrate. The monomer with a biotinylated subunit is then bound to a multivalent binding partner, e.g. streptavidin or avidin, to which biotin binds with extremely high affinity. Streptavidin has a valency of 4, providing a multimer of (α-β-P)4.
The multivalent binding partner may be free in solution, or may be attached to an insoluble support. Examples of suitable insoluble supports include beads, e.g. magnetic beads, membranes and microtiter plates. These are typically made of glass, plastic (e.g. polystyrene), polysaccharides, nylon or nitrocellulose. Attachment to an insoluble support is useful when the binding complex is to be used for separation of T cells.
Frequently, the multimeric complex will be labeled, so as to be directly detectable, or will be used in conjunction with secondary labeled immunoreagents which will specifically bind the complex. In general the label will have a light detectable characteristic. Preferred labels are fluorophors, such as fluorescein isothiocyanate (FITC), rhodamine, Texas Red, phycoerythrin and allophycocyanin. Other labels of interest may include dyes, enzymes, chemiluminescers, particles, radioisotopes, or other directly or indirectly detectable agent. Conveniently, the multivalent binding partner will have the labeling group. Alternatively, a second stage label may be used, e.g. labeled antibody directed to one of the peptide constituents, and the like.
The binding complex will be used to detect and/or separate antigen specific T cells. The T cells may be from any source, usually having the same species of origin as the MHC heterodimer. The T cells may be from an in vitro culture, or a physiologic sample. For the most part, the physiologic samples employed will be blood or lymph, but samples may also involve other sources oft cells, particularly where T cells may be invasive. Thus other sites of interest are tissues, or associated fluids, as in the brain, lymph node, neoplasms, spleen, liver, kidney, pancreas, tonsil, thymus, joints, synovia, and the like. The sample may be used as obtained or may be subject to modification, as in the case of dilution, concentration, or the like. Prior treatments may involve removal of cells by various techniques, including centrifugation, using Ficoll-Hypaque, panning, affinity separation, using antibodies specific for one or more markers present as surface membrane proteins on the surface of cells, or any other technique that provides enrichment of the set or subset of cells of interest.
The binding complex is added to a suspension comprising T cells of interest, and incubated at about 4° C. for a period of time sufficient to bind the available cell surface receptor. The incubation will usually be at least about 5 minutes and usually less than about 30 minutes. It is desirable to have a sufficient concentration of labeling reagent in the reaction mixture, so that labeling reaction is not limited by lack of labeling reagent. The appropriate concentration is determined by titration. The medium in which the cells are labeled will be any suitable medium as known in the art. If live cells are desired a medium will be chosen that maintains the viability of the cells. A preferred medium is phosphate buffered saline containing from 0.1 to 0.5% BSA. Various media are commercially available and may be used according to the nature of the cells, including Dulbecco's Modified Eagle Medium (dMEM), Hank's Basic Salt Solution (HBSS), Dulbecco's phosphate buffered saline (dPBS), RPMI, Iscove's medium, PBS with 5 mM EDTA, etc., frequently supplemented with fetal calf serum, BSA, HSA, etc.
Where a second stage labeling reagent is used, the cell suspension may be washed and resuspended in medium as described above prior to incubation with the second stage reagent. Alternatively, the second stage reagent may be added directly into the reaction mix.
A number of methods for detection and quantitation of labeled cells are known in the art. Flow cytometry is a convenient means of enumerating cells that are a small percent of the total population. Fluorescent microscopy may also be used. Various immunoassays, e.g. ELISA, RIA, etc. may used to quantitate the number of cells present after binding to an insoluble support.
Flow cyometry may also be used for the separation of a labeled subset of T cells from a complex mixture of cells. The cells may be collected in any appropriate medium which maintains the viability of the cells, usually having a cushion of serum at the bottom of the collection tube. Various media are commercially available as described above. The cells may then be used as appropriate.
Alternative means of separation utilize the binding complex bound directly or indirectly to an insoluble support, e.g. column, microtiter plate, magnetic beads, etc. The cell sample is added to the binding complex. The complex may be bound to the support by any convenient means. After incubation, the insoluble support is washed to remove non-bound components. From one to six washes may be employed, with sufficient volume to thoroughly wash non-specifically bound cells present in the sample. The desired cells are then eluted from the binding complex. In particular the use of magnetic particles to separate cell subsets from complex mixtures is described in Miltenyi et al. (1990) Cytometry 11:231-238.
Detecting and/or quantitating specific T cells in a sample or fraction thereof may be accomplished by a variety of specific assays. In general, the assay will measure the binding between a patient sample, usually blood derived, generally in the fol in of plasma or serum and the subject multimeric binding complexes. The patient sample may be used directly, or diluted as appropriate, usually about 1:10 and usually not more than about 1:10,000. Assays may be perfoinied in any physiological buffer, e.g. PBS, normal saline, HBSS, dPBS, etc.
A sandwich assay is perfoli zied by first attaching the multimeric binding complex to an insoluble surface or support. The multimeric binding complex may be bound to the surface by any convenient means, depending upon the nature of the surface, either directly or through specific antibodies. The particular manner of binding is not crucial so long as it is compatible with the reagents and overall methods of the invention. They may be bound to the plates covalently or non-covalently, preferably non-covalently.
The insoluble supports may be any compositions to which the multimeric binding complex can be bound, which is readily separated from soluble material, and which is otherwise compatible with the overall method of measuring T cells. The surface of such supports may be solid or porous and of any convenient shape. Examples of suitable insoluble supports to which the receptor is bound include beads, e.g. magnetic beads, membranes and microtiter plates. These are typically made of glass, plastic (e.g. polystyrene), polysaccharides, nylon or nitrocellulose. Microtiter plates are especially convenient because a large number of assays can be carried out simultaneously, using small amounts of reagents and samples.
Before adding patient samples or fractions thereof, the non-specific binding sites on the insoluble support i.e. those not occupied by the multimeric binding complex, are generally blocked. Preferred blocking agents include non-interfering proteins such as bovine serum albumin, casein, gelatin, and the like. Samples, fractions or aliquots thereof are then added to separately assayable supports (for example, separate wells of a microtiter plate) containing support-hound multimeric binding complex.
Generally from about 0.001 to 1 ml of sample, diluted or otherwise, is sufficient, usually about 0.01 ml sufficing. Preferably, each sample and standard will be added to multiple wells so that mean values can be obtained for each. The incubation time should be sufficient for T cells to bind the insoluble binding complex. Generally, from about 0.1 to 3 hr is sufficient, usually 1 hr sufficing.
After incubation, the insoluble support is generally washed of non-bound components. Generally, a dilute physiologic buffer at an appropriate pH, generally 7-8, is used as a wash medium. From one to six washes may be employed, with sufficient volume to thoroughly wash non-specifically bound T cells present in the sample. After washing, a solution containing specific second receptor is applied. The receptor may be any compound that binds patient T cells with sufficient specificity such that they can be distinguished from other components present. In a preferred embodiment, second receptors are antibodies specific for common T cell antigens, either monoclonal or polyclonal sera, e.g. anti-thy-1, anti-CD45, etc.
T cell specific antibodies may be labeled to facilitate direct or indirect quantification of binding. Examples of labels that permit direct measurement include radiolabels, such as 3H or 125I, fluorescers, dyes, beads, chemilumninescers, colloidal particles, and the like. Examples of labels which permit indirect measurement of binding include enzymes where the substrate may provide for a colored or fluorescent product. Examples of suitable enzymes for use in conjugates include horseradish peroxidase, alkaline phosphatase, malate dehydrogenase and the like. Where not commercially available, such antibody-enzyme conjugates are readily produced by techniques known to those skilled in the art.
Alternatively, the second receptor may be unlabeled. In this case, a labeled second receptor-specific compound is employed which hinds to the bound second receptor. Such a second receptor-specific compound can be labelled in any of the above manners. It is possible to select such compounds such that multiple compounds bind each molecule of bound second receptor. Examples of second receptor/second receptor-specific molecule pairs include antibody/anti-antibody and avidin (or streptavidin)/biotin. Since the resultant signal is thus amplified, this technique may be advantageous where only a small number oft cells are present. An example is the use of a labeled antibody specific to the second receptor. More specifically, where the second receptor is a rabbit anti-allotypic antibody, an antibody directed against the constant region of rabbit antibodies provides a suitable second receptor specific molecule. The anti-immunoglobulin will usually come from any source other than human, such as ovine, rodentia, particularly mouse, or bovine. The volume, composition and concentration of T cell specific receptor solution provides for measurable binding to the T cells already bound to the insoluble substrate. Generally, the same volume as that of the sample is used: from about 0.001 to 1 ml is sufficient, usually about 0.1 ml sufficing. When antibody ligands are used, the concentration generally will be about 0.1 to 50 μg/ml, preferably about 1 μg/ml. The solution containing the second receptor is generally buffered in the range of about pH 6.5-9.5. The solution may also contain an innocuous protein as previously described. The incubation time should be sufficient for the labeled ligand to bind available molecules. Generally, from about 0.1 to 3 hr is sufficient, usually 1 hr sufficing.
After the second receptor or second receptor-conjugate has bound, the insoluble support is generally again washed free of non-specifically bound second receptor, essentially as described for prior washes. After non-specifically bound material has been cleared, the signal produced by the bound conjugate is detected by conventional means. Where an enzyme conjugate is used, an appropriate enzyme substrate is provided so a detectable product is formed. More specifically, where a peroxidase is the selected enzyme conjugate, a preferred substrate combination is H2O2 and O-phenylenediamine which yields a colored product under appropriate reaction conditions. Appropriate substrates for other enzyme conjugates such as those disclosed above are known to those skilled in the art. Suitable reaction conditions as well as means for detecting the various useful conjugates or their products are also known to those skilled in the art. For the product of the substrate O-phenylenediamine for example, light absorbance at 490-495 nm is conveniently measured with a spectrophotometer.
Generally the number of bound T cells detected will be compared to control samples from samples having a different MHC context, e.g. T cells from an animal that does not express the MHC molecule used to make the binding complex.
An alternative protocol is to provide anti-T cell reagent, e.g. anti-thy-1, anti-CD45, etc. bound to the insoluble surface. After adding the sample and washing away non-specifically bound T cells, one or a combination of the subject binding complexes are added, where the binding complexes are labeled so as not to interfere with the binding to T cells.
It is particularly convenient in a clinical setting to perform the assays in a self-contained apparatus. A number of such methods are known in the art. The apparatus will generally employ a continuous flow-path of a suitable filter or membrane, having at least three regions, a fluid transport region, a sample region, and a measuring region. The sample region is prevented from fluid transfer contact with the other portions of the flow path prior to receiving the sample. After the sample region receives the sample, it is brought into fluid transfer relationship with the other regions, and the fluid transfer region contacted with fluid to permit a reagent solution to pass through the sample region and into the measuring region. The measuring region may have bound to it the multimeric binding complex, with a conjugate of an enzyme with T cell specific antibody employed as a reagent, generally added to the sample before application. Alternatively, the binding complex may be conjugated to an enzyme, with T cell specific antibody bound to the measurement region.
Detection of T cells is of interest in connection with a variety of conditions associated with T cell activation. Such conditions include autoimmune diseases, e.g. multiple sclerosis, myasthenia gravis, rheumatoid arthritis, type 1 diabetes, graft vs. host disease, Grave's disease, etc.; various forms of cancer, e.g. carcinomas, melanomas, sarcomas, lymphomas and leukemias. Various infectious diseases such as those caused by viruses, e.g. HIV-1, hepatitis, herpesviruses, enteric viruses, respiratory viruses, rhabdovirus, rubeola, poxvirus, paramyxovirus, morbillivirus, etc. are of interest. Infectious agents of interest also include bacteria, such as Pneumococcus, Staphylococcus, Bacillus. Streptococcus, Meningococcus, Gonococcus, Eschericia, Klebsiella, Proteus, Pseudomonas, Salmonella, Shigella, Hemophilus, Yersinia, Listeria, Corynebacterium, Vibrio, Clostridia, Chlamydia, Mycobacterium, Helicobacter and Treponema; protozoan pathogens, and the like. T cell associated allergic responses may also be monitored, e.g. delayed type hypersensitivity or contact hypersensitivity involving T cells.
Of particular interest are conditions having an association with a specific peptide or MHC haplotype, where the subject binding complexes may be used to track the T cell response with respect to the haplotype and antigen. A large number of associations have been made in disease states that suggest that specific MHC haplotypes, or specific protein antigens are responsible for disease states.
Polypeptide fragments, including immunogenic fragments, for each of SEQ ID NOs: 1-27 can be any length from at least 5 consecutive amino acids to 1 amino acid less than a full length polypeptide of any given SEQ ID NO:. Thus, for SEQ ID NO: 1 (used here as a non-limiting example) the polypeptide fragment can contain any number of consecutive amino acids from 5 to 1903 (for example, 5, 6, 7, . . . , 1901, 1902, 1903). For the sake of brevity, the individual integers between 5 and 1903 have not been reproduced herein but are, in fact, specifically contemplated. In one embodiment, the immunogenic fragments of the invention induce immunity or protective immunity from disease.
The present invention also provides for the exclusion of any individual fragment (of any given SEQ ID NO:) specified by N-terminal to C-terminal positions, actual sequence, or of any fragment specified by size (in amino acid residues) as described above. In addition, any number of fragments specified by N-terminal and C-terminal positions, actual sequence, or by size (in amino acid residues) as described above may be excluded as individual species. Further, any number of fragments specified by N-terminal and C-terminal positions or by size (in amino acid residues) as described above may be combined to provide a polypeptide fragment. These types of fragments may, optionally, include polypeptide sequences such as linkers, described below.
Where a claim recites “a polypeptide comprising SEQ ID NO: X, or fragments or immunogenic fragments or epitopes of SEQ ID NO:X”, the language “fragments or immunogenic fragments or epitopes of SEQ ID NO:X” specifically excludes identical sub-sequences found within other longer naturally occurring prior art polypeptide or protein sequences that are not identical to sequence from which the claimed sequence was derived. This does not include instances where such sub-sequences are a part of a larger molecule specifically modified by the hand of man to enhance the immunogenicity of the fragments of the subject invention. Thus, fragments or immunogenic fragments or epitopes of SEQ ID NO:X specifically exclude, and are not to be considered anticipated, where the fragment is a sub-sequence of another naturally occurring non-malarial peptide, polypeptide, or protein isolated from a bacterial, viral, reptilian, insect, avian, or mammalian source and is identified in a search of protein sequence databases.
Fragments or immunogenic fragments or epitopes of the invention may further contain linkers that facilitate the attachment of the fragments to a carrier molecule for the stimulation of an immune response or diagnostic purposes. The linkers can also be used to attach fragments according to the invention to solid support matrices for use in affinity purification protocols. In this aspect of the invention, the linkers specifically exclude, and are not to be considered anticipated, where the fragment is a subsequence of another peptide, polypeptide, or protein as identified in a search of protein sequence databases as indicated in the preceding paragraph. In other words, the non-identical portions of the other peptide, polypeptide, of protein are not considered to be a “linker” in this aspect of the invention. Non-limiting examples of “linkers” suitable for the practice of the invention include chemical linkers (such as those sold by Pierce, Rockford, Ill.) and peptides that allow for the connection of the immunogenic fragment to a carrier molecule (see, for example, linkers disclosed in U.S. Pat. Nos. 6,121,424, 5,843,464, 5,750,352, and 5,990,275, hereby incorporated by reference in their entirety). In various embodiments, the linkers can be up to 50 amino acids in length, up to 40 amino acids in length, up to 30 amino acids in length, up to 20 amino acids in length, up to 10 amino acids in length, or up to 5 amino acids in length. Of course, the linker may be any pre-selected number of amino acids (up to 50 amino acids) in length.
In various embodiments, polypeptides suitable for use in various disclosed methods of the subject invention can be selected from the group consisting of: a) a polypeptide comprising a polypeptide sequence selected from the group consisting of SEQ ID NOs: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 ,17, 18, 19, 20, 21 ,22, 23, 24, 25, 26, and 27; b) a variant polypeptide having at least about 20% to 99.99% identity to a polypeptide selected from the group consisting of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16 ,17, 18, 19, 20, 21, 22, 23, 24, 25, 26, and 27; c) a fragment of a polypeptide or a variant polypeptide of SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, or 27, wherein said fragment or variant has substantially the same serologic reactivity or substantially the same T-cell reactivity as the native polypeptide; d) a multi-epitope construct; and e) combinations thereof.
As indicated supra, the subject invention provides for “multi-epitope constructs”. A “multi-epitope construct” comprises: 1) nucleic acids that encode multiple polypeptide epitopes (of any length) that can bind to one or more molecules functioning in the immune system; or 2) polypeptides comprising multiple polypeptide epitopes that can bind to one or more molecules functioning in the immune system. “Multi-epitope constructs” can, optionally, contain “flanking” or “spacing” residues between each epitope. Some embodiments provide for “multi-epitope constructs” that comprise a series of the same epitope (termed “homopolymers”). Other embodiments provide for “multi-epitope constructs” that comprise a combination or series of different epitopes, optionally connected by “flanking” or “spacing” residues (termed “heteropolymers”). In some embodiments, “multi-epitope constructs” may exclude full-length polypeptides from which the epitopes are obtained (e.g., the polypeptides of SEQ ID NOs: 1-27). In certain preferred embodiments, the epitopes used in the formation of the multi-epitope construct are selected from those set forth in Table 2, Table 3, Table 4, Table 5, and/or Table 6 and any epitope set forth in these Tables 2-6 can be mixed and/or matched any other epitope set forth in any of the aforementioned Tables 2-6.
Multi-epitope constructs may be of “high affinity” or “intermediate affinity”. As used herein, “high affinity” with respect to HLA class I molecules is defined as binding with an IC50, or KD value, of 50 nM or less; “intermediate affinity” with respect to HLA class I molecules is defined as binding with an IC50 or KD value of between about 50 and about 500 nM. “High affinity” with respect to binding to HLA class II molecules is defined as binding with an IC50 or KD value of 100 nM or less; “intermediate affinity” with respect to binding to HLA class II molecules is defined as binding with an IC50 or KD value of between about 100 and about 1000 nM.
The multi-epitope constructs described herein preferably include five or more, ten or more, fifteen or more, twenty or more, or twenty-five or more epitopes. Other embodiments provide multi-epitope constructs that comprise at least 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99 epitopes. All of the epitopes in a multi-epitope construct may be from one organism (e.g., the epitopes are obtained from P. falciparum), or the multi-epitope construct may include epitopes present in two or more different organisms (e.g., some epitopes from P. falciparum and some epitopes from another organism). Additionally, the same epitope may be present in a multi-epitope construct at more than one location in the construct. In some embodiments, novel epitopes of the subject invention may be linked to known epitopes of an organism (e.g., P. falciparum or another organism).
A “multi-epitope vaccine,” is a vaccine comprising multiple epitopes. A multi-epitope vaccine can induce an immune response and is administered to an individual in an amount sufficient to induce an immune response in the individual. In sonic embodiments, the immune response induced by the multi-epitope vaccine is a protective immune response against a given organism, pathogen, or pathologic condition (e.g., P. falciparum).
In certain embodiments, the epitopes of a multi-epitope construct or the polypeptides disclosed herein interact with an antigen binding site of an antibody molecule, a class I HLA, a T-cell receptor, and/or a class II HLA molecule. In certain preferred embodiments, the epitopes interact with an FILA molecule (e.g., class I or class II) or a T-cell receptor. In an even more preferred embodiment, the epitope interacts with both an HLA molecule (e.g., class I or class II) and a T-cell receptor. In various embodiments, all of the nucleic acids in a multi-epitope construct can encode class I HLA epitopes or class II HLA epitopes. Multi-epitope constructs comprising epitopes that interact exclusively with class I HLA molecules may be referred to as “CTL multi-epitope constructs” (or “CD8+ T cell multi-epitope constructs”). Multi-epitope constructs comprising epitopes that interact exclusively with class II HLA molecules may be referred to as “HTL multi-epitope constructs” (or “CD4+ T cell multi epitope constructs”). Some multi-epitope constructs (designated “TL multi-epitope constructs”) can have a subset of the multi-epitope nucleic acids encoding class I HLA epitopes and another subset of the multi-epitope nucleic acids encoding class II HLA epitopes (e.g., the constructs stimulate both CTL (i.e., CD8+ T cell) and HTL (i.e., CD4+ T cell) of the immune system). Other multi-epitope constructs can provide epitopes that interact exclusively with B-cells or immunoglobulin molecules and are designated “BL multi-epitope constructs”. Multi-epitope constructs that provide epitopes that interact with B-cells (and/or immunoglobulin molecules) and further provide class I HLA epitopes and class II HLA epitopes are designated “immune system (IMS) multi-epitope constructs”. In certain embodiments, multi-epitope constructs can provide class I or class II epitopes (e.g., CTL (i.e., CD8+ T cell) epitopes or HTL (i.e., CD4+ T cell) epitopes) and BL epitopes. “Human Leukocyte Antigen” or “HLA” is a human class I or class II Major Histocompatibility Complex (MHC) protein (see, e.g., Stites, et al., IMMUNOLOGY, 8TH ED., Lange Publishing, Los Altos, Calif. (1994)).
CTL epitope (class I epitope) (i.e., CD8+ T cell epitope) encoding nucleic acids preferably provide an epitope peptide of about eight to about thirteen amino acids in length (e.g., 8, 9, 10, 11, 12 or 13), more preferably about eight to about eleven amino acids in length, and most preferably about nine amino acids in length. HTL (CD4+ T-cell) epitope nucleic acids can provide an epitope peptide of about seven to about twenty three (e.g., 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22 or 23) preferably about seven to about seventeen (e.g., 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, or 17, more preferably about eleven to about fifteen (e.g., 11, 12, 13, 14 or 15), and most preferably about thirteen amino acids in length.
“Degenerate binding” indicates that a peptide is bound by more than one HLA molecule; a synonym is “cross reactive binding.” “Cross reactive binding” may also be used to define the interaction of an antigen with multiple populations of antibodies. In certain preferred embodiments, epitopes disclosed herein do not exhibit cross reactive or degenerate binding. Other embodiments encompass degenerate or cross reactive binding of antigens or epitopes.
With regard to a particular amino acid sequence, an “epitope” is a set of amino acid residues that is involved in recognition by a particular immunoglobulin, or in the context of T cells, those residues necessary for recognition by T cell receptor proteins and/or Major Histocompatibility Complex (MHC) receptors. In an immune system setting, in vitro or in vivo, an epitope is the collective features of a molecule, such as primary, secondary and tertiary peptide structure, and charge, that together form a site recognized by an immunoglobulin, T cell receptor or HLA molecule. Throughout this disclosure epitope and peptide are often used interchangeably. It is to be appreciated, however, that isolated or purified protein or peptide molecules larger than and comprising an epitope of the invention are still within the bounds of the invention.
A “flanking” or “linking” residue is a residue that is positioned next to an epitope. A flanking residue can be introduced or inserted at a position adjacent to the N-terminus or the C-terminus of an epitope. Flanking residues suitable for use in the subject invention are disclosed, for example, in U.S. Pat. No. 6,419,931, which is hereby incorporated by reference in its entirety, including all sequences, figures, references, and tables.
An “immunogenic peptide” or “peptide epitope” is a peptide that comprises an allele-specific motif or supermotif such that the peptide will bind an HLA molecule and induce a CTL (or CD8+ T cell) and/or HTL (or CD4+ T cell) response. An “immunogenic peptide” or “peptide epitope” can also be a peptide that comprises a motif that binds to antibody molecules or B-cells found in the immune system of an individual. Thus, immunogenic peptides of the invention are capable of binding to an antibody molecule, a B-cell, or appropriate HLA molecule and thereafter inducing an immune response (e.g., the induction of antibody production, a cytotoxic T cell response, or a helper T cell response) to the antigen from which the immunogenic peptide is derived.
The term “residue” refers to an amino acid or amino acid mimetic incorporated into a peptide or protein by an amide bond or amide bond mimetic.
A “spacer” or “linker” refers to a sequence that is inserted between two epitopes in a multi-epitope construct to prevent the occurrence of junctional epitopes and/or to increase the efficiency of processing. A multi-epitope construct may have one or more spacer nucleic acids. A spacer nucleic acid may flank each epitope nucleic acid in a construct, or the spacer nucleic acid to epitope nucleic acid ratio may be about 2 to 10, about 5 to 10, about 6 to 10, about 7 to 10, about 8 to 10, or about 9 to 10, where a ratio of about 8 to 10 has been determined to yield favorable results for some constructs. The spacer nucleic acid may encode one or more amino acids. A spacer nucleic acid flanking a class I HLA epitope in a multi-epitope construct is preferably between one and about eight amino acids in length. A spacer nucleic acid flanking a class II HLA epitope in a multi-epitope construct is preferably greater than five, six, seven, or more amino acids in length, and more preferably five or six amino acids in length. The number of spacers in a construct, the number of amino acids in a spacer, and the amino acid composition of a spacer can be selected to optimize epitope processing and/or minimize junctional epitopes. It is preferred that spacers are selected by concomitantly optimizing epitope processing and junctional motifs. Suitable amino acids for optimizing epitope processing are described herein. Also, suitable amino acid spacing for minimizing the number of junctional epitopes in a construct are described herein for class I and class II HLAs. For example, spacers flanking class II HLA epitopes preferably include G, P, and/or N residues as these are not generally known to be primary anchor residues (see, e.g., PCT/US00/19774). A particularly preferred spacer for flanking a class II HLA epitope includes alternating G and P residues, for example, (GP)n, (PG)n, (GP)nG, or (PG)nP, and so forth, where n is an integer between one and ten, preferably two or about two, and where a specific example of such a spacer is GPGPG.
In some multi-epitope constructs, it is sufficient that each spacer nucleic acid encodes the same amino acid sequence. In multi-epitope constructs having two spacer nucleic acids encoding the same amino acid sequence, the spacer nucleic acids encoding those spacers may have the same or different nucleotide sequences, where different nucleotide sequences may be preferred to decrease the likelihood of unintended recombination events when the multi-epitope construct is inserted into cells.
In other multi-epitope constructs, one or more of the spacer nucleic acids may encode different amino acid sequences. While many of the spacer nucleic acids may encode the same amino acid sequence in a multi-epitope construct, one, two, three, four, five or more spacer nucleic acids may encode different amino acid sequences, and it is possible that all of the spacer nucleic acids in a multi-epitope construct encode different amino acid sequences. Spacer nucleic acids may be optimized with respect to the epitope nucleic acids they flank by determining whether a spacer sequence will maximize epitope processing and/or minimize junctional epitopes, as described herein.
Multi-epitope constructs may be distinguished from one another according to whether the spacers in one construct optimize epitope processing or minimize junctional epitopes over another construct, and preferably, constructs may be distinguished where one construct is concomitantly optimized for epitope processing and junctional epitopes over the other. Computer assisted methods and in vitro and in vivo laboratory methods for determining whether a construct is optimized for epitope processing and junctional motifs are described herein.
“Multi-epitope constructs of the invention may also be “optimized”. The term “optimized” or “optimizing” refers to increasing the immunogenicity or antigenicity of a multi-epitope construct having at least one epitope pair by sorting epitopes to minimize the occurrence of junctional epitopes, inserting flanking residues that flank the C-terminus or N-terminus of an epitope, and inserting spacer residue to further prevent the occurrence of junctional epitopes or to provide a flanking residue. An increase in immunogenicity or antigenicity of an optimized multi-epitope construct is measured relative to a multi-epitope construct that has not been constructed based on the optimization parameters and is using assays known to those of skill in the art, e.g., assessment of immunogenicity in HLA transgenic mice, ELISPOT, interferon-gamma release assays, tetramer staining, chromium release assays, and presentation on dendritic cells.
The subject invention also concerns antibodies that bind to polypeptides of the invention. Antibodies that are immunospecific for the malarial polypeptides set forth herein are specifically contemplated. In various embodiments, antibodies which do not cross react with other proteins or malarial proteins are also specifically contemplated. The antibodies of the subject invention can be prepared using standard materials and methods known in the art (see, for example, Monoclonal Antibodies: Principles and Practice, 1983; Monoclonal Hybridoma Antibodies: Techniques and Applications, 1982; Selected Methods in Cellular Immunology, 1980; Immunological Methods, Vol. II, 1981; Practical Immunology, and Kohler et al. [1975] Nature 256:495).
The term “antibody” is used in the broadest sense and specifically covers monoclonal antibodies (including full length monoclonal antibodies), polyclonal antibodies, multispecific antibodies (e.g., bispecific antibodies), and antibody fragments so long as they exhibit the desired biological activity, particularly neutralizing activity. “Antibody fragments” comprise a portion of a full length antibody, generally the antigen binding or variable region thereof. Examples of antibody fragments include Fab, Fab′, F(ab′)2, and Fv fragments; diabodies; linear antibodies; single-chain antibody molecules; and multi-specific antibodies formed from antibody fragments.
The term “monoclonal antibody” as used herein refers to an antibody obtained from a population of substantially homogeneous antibodies, i.e., the individual antibodies comprising the population are identical except for possible naturally occurring mutations that may be present in minor amounts. Monoclonal antibodies are highly specific, being directed against a single antigenic site. Furthermore, in contrast to conventional (polyclonal) antibody preparations that typically include different antibodies directed against different determinants (epitopes), each monoclonal antibody is directed against a single determinant on the antigen. The modifier “monoclonal” indicates the character of the antibody as being obtained from a substantially homogeneous population of antibodies, and is not to be construed as requiring production of the antibody by any particular method. For example, the monoclonal antibodies to be used in accordance with the present invention may be made by the hybridoma method first described by Kohler et al. [1975] Nature 256: 495, or may be made by recombinant DNA methods (see, e.g., U.S. Pat. No. 4,816,567). The “monoclonal antibodies” may also be isolated from phage antibody libraries using the techniques described in Clackson et al. [1991] Nature 352: 624-628 and Marks et al. [1991]J. Mol. Biol. 222: 581-597, for example.
The monoclonal antibodies described herein specifically include “chimeric” antibodies (immunoglobulins) in which a portion of the heavy and/or light chain is identical with or homologous to corresponding sequences in antibodies derived from a particular species or belonging to a particular antibody class or subclass, while the remainder of the chain(s) is identical with or homologous to corresponding sequences in antibodies derived from another species or belonging to another antibody class or subclass, as well as fragments of such antibodies, so long as they exhibit the desired biological activity (U.S. Pat. No. 4,816,567; and Morrison et al. [1984] Proc. Natl. Acad Sci. USA 81: 6851-6855). Also included are humanized antibodies, such as those taught in U.S. Pat. Nos. 6,407,213 or 6,417,337 which are hereby incorporated by reference in their entirety.
“Single-chain Fv” or “sFv” antibody fragments comprise the VH and VL domains of an antibody, wherein these domains are present in a single polypeptide chain. Generally, the Fv polypeptide further comprises a polypeptide linker between the VH and VL domains which enables the sFv to form the desired structure for antigen binding. For a review of sFv see Pluckthun in The Pharmacology of Monoclonal Antibodies [1994] Vol. 113:269-315, Rosenburg and Moore eds. Springer-Verlag, New York.
The term “diabodies” refers to small antibody fragments with two antigen-binding sites, which fragments comprise a heavy chain variable domain (VH) connected to a light chain variable domain (VL) in the same polypeptide chain (VH-VL). Diabodies are described more fully in, for example, EP 404,097; WO 93/11161; and Hollinger et al. [1993] Proc. Natl. Acad. Sci. USA 90: 6444-6448. The term “linear antibodies” refers to the antibodies described in Zapata et al. [1995] Protein Eng. 8(10):1057-1062.
An “isolated” antibody is one which has been identified and separated and/or recovered from a component of its natural environment. Contaminant components of its natural environment are materials which would interfere with diagnostic or therapeutic uses for the antibody, and may include enzymes, hormones, and other proteinaceous or nonproteinaceous solutes. In preferred embodiments, the antibody will be purified (1) to greater than 95% by weight of antibody as determined by the Lowry method, and most preferably more than 99% by weight, (2) to a degree sufficient to obtain at least 15 residues of N-terminal or internal amino acid sequence by use of a spinning cup sequenator, or (3) to homogeneity by SDS-PAGE under reducing or nonreducing conditions using Coomassie blue or, preferably, silver stain. Isolated antibody includes the antibody in situ within recombinant cells since at least one component of the antibody's natural environment will not be present. Ordinarily, however, isolated antibody will be prepared by at least one purification step.
The terms “comprising”, “consisting of” and “consisting essentially of” are defined according to their standard meaning. The terms may be substituted for one another throughout the instant application in order to attach the specific meaning associated with each term. The phrases “isolated” or “biologically pure” refer to material that is substantially or essentially free from components which normally accompany the material as it is found in its native state. Thus, isolated peptides in accordance with the invention preferably do not contain materials normally associated with the peptides in their in situ environment. “Link” or “join” refers to any method known in the art for functionally connecting peptides, including, without limitation, recombinant fusion, covalent bonding, disulfide bonding, ionic bonding, hydrogen bonding, and electrostatic bonding.
Following are examples which illustrate procedures for practicing the invention. These examples should not be construed as limiting. All percentages are by weight and all solvent mixture proportions are by volume unless otherwise noted.
In this disclosure, “binding data” results are often expressed in terms of “IC50's.” IC50 is the concentration of peptide in a binding assay at which 50% inhibition of binding of a reference peptide is observed. Given the conditions in which the assays are run (i.e., limiting HLA proteins and labeled peptide concentrations), these values approximate KD values. Assays for deteimining binding are described in detail, e.g., in PCT publications WO 94/20127 and WO 94/03205 (each of which is hereby incorporated by reference in its entirety). It should be noted that IC50 values can change, often dramatically, if the assay conditions are varied, and depending on the particular reagents used (e.g., HLA preparation, etc.). For example, excessive concentrations of HLA molecules will increase the apparent measured IC50 of a given ligand. Alternatively, binding is expressed relative to a reference peptide. Although as a particular assay becomes more, or less, sensitive, the IC50's of the peptides tested may change somewhat, the binding relative to the reference peptide will not significantly change. For example, in an assay run under conditions such that the IC50 of the reference peptide increases 10-fold, the IC50 values of the test peptides will also shift approximately 10-fold. Therefore, to avoid ambiguities, the assessment of whether a peptide is a good, intermediate, weak, or negative binder is generally based on its IC50, relative to the IC50 of a standard peptide. Binding may also be determined using other assay systems including those using: live cells (e.g., Ceppellini et al., Nature 339:392, 1989; Christnick et al., Nature 352:67, 1991; Busch et al., Int. Immunol. 2:443, 19990; Hill et al., J. Immunol. 147:189,1991; del Guercio et al., J. Immunol. 154:685, 1995), cell free systems using detergent lysates (e.g., Cerundolo et al., J Immunol. 21:2069, 1991), immobilized purified MHC (e.g., Hill et al., J. Immunol. 152, 2890, 1994; Marshall et al., J Immunol. 152:4946, 1994), ELISA systems (e.g., Reay et al., EMBO J. 11:2829, 1992), surface plasmon resonance (e.g., Khilko et al., J. Biol. Chem. 268:15425, 1993); high flux soluble phase assays (Hammer et al., J. Exp. Med. 180:2353, 1994), and measurement of class I MHC stabilization or assembly (e.g., Ljunggren et al., Nature 346:476, 1990; Schumacher et al, Cell 62:563, 1990; Townsend et al., Cell 62:285, 1990; Parker et al., J. lmmunol. 149:1896, 1992). Predicted IC50 values may be referred to as PIC values and measured IC50 values may be referred to a MIC values.
Starting with 27 open reading frames defined by Multidimensional Protein Identification Technology, 9 highly antigenic proteins were identified. These highly antigenic proteins were recognized by volunteers immunized with irradiated sporozoites; mock immunized individuals (controls) failed to recognize these proteins. Several of these nine proteins were more antigenic than previously well-characterized proteins.
To identify and prioritize a set of ORFs representing antigens potentially expressed in the sporozoite and intrahepatic stage of the parasite life cycle, MS/MS spectra of peptide sequences generated by Multidimensional Protein Identification Technology (MudPIT) (Washburn, M. P., Wolters, D., & Yates, J. R. 3rd. Large-scale analysis of the yeast proteome by multidimensional protein identification technology. Nat. Biotechnol. 19, 242-247 (2001)) of P. falciparum sporozoite preparations were scanned against the P. falciparum genomic sequence database using SEQUEST™ software (Florens, L. et al. A proteomic view of the Plasmodium falciparum life cycle. Submitted). A panel of 27 ORF's (10 expressed only in sporozoites, and 17 common to other stages of the parasite life cycle) were selected. Their size ranged between 96-4544 amino acids (mean 1252), the percentage of the protein covered by identified peptides ranged between 0.5-49.5%, and the frequency of recognition in the P. falciparum proteome dataset ranged between 16 peptide hits from 6 different sporozoite runs (antigen 2) to single peptide hits (antigens 1, 11, 14, 16, 19 and 25. When searched against the final P. falciparum database using refined gene model predictions, and taking into consideration genomic sequence information from the Anopheles (vector) and human (host) databases, 19 of the 27 antigens could be identified using stringent selection criteria and six others could be identified only with relaxed criteria.
Amino acid sequences from the 27 ORFs were scanned with HLA-A1, A2, A3/A11, A24 and DR supertype PIC algorithms; a total of 3241 peptides were identified (range=14-435; mean=120 sequences per antigen). A set of 1142 sequences was synthesized (range=13-50; mean=42), selecting the top 10 scorers per supertype per antigen for larger ORFs. Control sets of peptides were synthesized from 4 known antigens (PfCSP, PfSSP2, PfLSA1 and PfEXP1). Next, predicted epitopes were tested for their capacity to induce recall IFN-γ immune responses using PBMC from volunteers immunized with irradiated P. falciparum sporozoites and either protected (n=4) or not protected (n=4) against challenge with infectious sporozoites, or control volunteers mock immunized in parallel (n=4) (see Table 1). Peptides were tested as pools, at 1 μg/ml each peptide with each antigen represented by a separate pool, by IFN-γ ELIspot (Washburn, M. P., Wolters, D., & Yates, J. R. 3rd. Large-scale analysis of the yeast proteome by multidimensional protein identification technology. Nat. Biotechnol. 19, 242-247 (2001)). Positive and negative control epitopes from well characterized antigens (CMV, Influenza, EBV, HIV) were also included.
Considering a stimulation index (ratio test response/control) >2.0 as positive, 19 of the 27 unknown antigens were recognized by at least 1 of 8 irradiated sporozoite immunized volunteers, but not by any of the 4 mock immunized controls (Table 1). Nine of the 27 antigens (#2, 5, 3, 18, 22, 21, 13, 11, 20) were recognized by at least 50% of irradiated sporozoite volunteers in at least 25% of assays, 3 antigens (#1, 12, 17) were recognized by at least 25% of volunteers in at least 15% of assays, and 7 antigens (#6, 7, 9, 14, 15, 16, 19) were recognized by at least 10% volunteers in at least 5% of assays. Eight of the 27 unknown antigens (#4, 8, 10, 23, 24, 25, 26, 27) failed to induce IFN-γ responses of sufficient magnitude to meet our criteria of positivity. Pools of predicted epitopes from the known antigens, PfCSP, PfSSP2, PfLSA1 and PfEXP1, were also recognized by irradiated sporozoite volunteers although the frequency of response to those pools was somewhat lower than that to pools of peptides representing previously validated epitopes derived from the same antigens (Doolan, D. L. et al. Degenerate cytotoxic T cell epitopes from P. falciparum restricted by multiple HLA-A and HLA-B supertype alleles. Immunity. 7, 97-112 (1997); Doolan, D. L. et al. HLA-DR-promiscuous T cell epitopes from Plasmodium flaciparum pre-erthrocytic-stage antigens restricted by multiple HLA class II alleles. J Immunol. 165:1123-1137 (2000); Wang, R., et al. Induction of CD4(+) T cell-dependent CD8(+) type 1 responses in humans by a malaria DNA vaccine. Proc. Natl. Acad. Sci. U.S.A. 98, 10817-10822 (2001)) (Table 1). Particularly noteworthy, the reactivity against several of the newly identified antigens greatly exceeded the reactivities observed against all 4 known antigens For example, both antigens 2 and 5 were recognized by 7/8 irradiated sporozoite volunteers in 9/16 assays, and antigens 3 and 18 were recognized by 6/8 irradiated sporozoite volunteers in 6/16 assays.
Results show that HLA-A2 peptide pools from antigens 2, 5 and 13, and HLA-A1 and HLA-DR peptide pools from antigens 2 and 5, are recognized by irradiated sporozoite volunteers who express the respective HLA alleles, but not by mock immunized controls. Deconvolution at the level of individual epitopes is in progress. Additionally, a comprehensive analysis of HLA binding against the A1, A2, A3/11, A24, and DR1 supertypes has been completed for selected antigens. Several degenerate binders have been identified for each supertype/antigen combination, and 50 to 70% of the predicted peptides have been identified as degenerate HLA binders.Further analysis also revealed that the antigenicity results correlate to a large degree with the proteomic data. For example, of 9 antigens associated with high immune reactivity, 7 were identified by multiple peptide hits in multiple MudPIT runs
All patents, patent applications, provisional applications, polynucleotide sequences, amino acid sequences, tables and publications referred to or cited herein are incorporated by reference in their entirety, including all figures, to the extent they are not inconsistent with the explicit teachings of this specification. It should be understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application.
This application is a continuation of U.S. application Ser. No. 10/537,642, filed Dec. 27, 2005, which is the national stage of PCT Application No. PCT/US2003/038966, filed Dec. 8, 2003, which claims the benefit of U.S. Provisional Application Ser. No. 60/431,494, filed Dec. 6, 2002, the disclosure of each of which is incorporated herein by reference in its entirety, including all figures, tables, nucleic acid sequences, amino acid sequences, and drawings. The Sequence Listing for this application is labeled “Seq-List.txt” which was created on Feb. 24, 2004 and is 566 KB. The entire contents of the sequence listing is incorporated herein by reference in its entirety.
This invention was made with government support under Grant No. 1 R43AI49051-01 awarded by NIAID. The government has certain rights in the invention.
Number | Date | Country | |
---|---|---|---|
60431494 | Dec 2002 | US |
Number | Date | Country | |
---|---|---|---|
Parent | 10537642 | Dec 2005 | US |
Child | 13189765 | US |