Table 1 shows the structural characteristics of the different amino acids and the amino-acid replacements designed to reduce the antigenicity of protein epitopes.
This invention relates to the design of hypoallergenic molecules that could be used in the desensitization of allergic individuals with lessened chance of anaphylaxis. The hypoallergenic molecules may also be used as vaccines against allergy.
When we are exposed to a foreign substance (an antigen), our immune system reacts by producing molecules and cells that are specific for the substance. Antibodies are molecules produced by our immune system and these bind the antigen, neutralizing or immobilizing it, and, thereby, rendering it more susceptible to elimination by normal processes. Various types of antibodies are produced by the immune system and the various antibody types have different structures, functions, and distribution in our body. For example, the major type of antibody that we produce is IgG (Immunoglobulin G). A type of antibody that is produced in much smaller amounts is IgE; IgE is the antibody type that is responsible for allergy. It is not known why some antigens elicit an IgE response and not an IgG response. It is also not known why some individuals, when exposed to a particular antigen, develop an allergic reaction to it, while others don't. Antigens that elicit an IgE response are called allergens.
A number of cell types have receptors for IgE on their surface. For example, mast cells, that lie under our skin and in the lining of our blood vessels, and basophils, that circulate in our blood, bind IgE through high affinity receptors. When allergen binds to IgE on mast cells, or basophils, the cells release histamine and other vasoactive compounds from pre-formed granules in their cytoplasm. The release of those molecules results in the usual allergic symptoms: sneezing, coughing, rashes, local edema, etc. Severe allergic reactions, like edema that closes the breathing passages, or systemic anaphylaxis, could result in death.
An attempt to rid an individual of allergy to a particular allergen is made by exposing the individual to ever increasing amounts of allergen over time—a process called desensitization. The objective of desensitization is to elicit an IgG response that would compete with IgE for the allergen. Not surprisingly, there is danger that desensitization could cause a severe allergic reaction.
Various attempts have been made to produce allergens with reduced allergenicity (the antigenicity of an allergen; here, antigenicity, the ability to elicit an antibody response, and allergenicity, the ability to elicit an IgE response, are used interchangeably) (see, for example, Ferreira et al., 1996; Vrtala et al., 1997; Ferreira et al., 1998; Schramm et al., 1999). Such hypoallergenic molecules would permit safer desensitization. If the regions in an allergen, to which the IgE molecules bind (the dominant IgE epitopes) are known, the residues in those regions could be replaced by amino acids that would cause less binding to IgE.
The method described here is a purely computational procedure designed to locate the putative dominant IgE epitopes (putative because it is impossible to identify and delineate all the dominant IgE epitopes of any allergen) and to identify the residues which contribute to the antigenicity of those epitopes. The method, called “de-Antigenization”, also describes a procedure to decrease the antigenicity of the dominant IgE epitopes by the judicious replacement of the contributing residues with amino acids that by virtue of their physicochemical properties are expected to contribute less to antigenicity.
The de-Antigenization of the putative dominant IgE epitopes is achieved through the following steps:
(Step 1) Identify a protein molecule that has been identified as a major allergen.
(Step 2) Calculate the antigenicity of the various regions of the allergen, using three-dimensional structural information about the molecule and the known physicochemical properties of the amino-acid residues. Locate the regions with high antigenicities, i.e. the putative dominant IgE epitopes.
(Step 3) Identify the amino-acid residues comprising the putative dominant IgE epitopes, in particular those residues which, by virtue of their physicochemical properties and their accessibility, can contribute significantly to tight binding by IgE. Replace those residues with amino acids that would be expected to contribute less to the binding by IgE, while ensuring that the replacements will not significantly alter the structure of the allergen. At least one T-cell epitope should be preserved.
(Step 4) Using the new structure (the structure with the replacements), repeat Steps 2 and 3 as needed until the putative dominant IgE epitopes have significantly lower antigenicities.
(Step 5) The amino acid sequences, which result in significantly lower antigenicities for the putative dominant IgE epitopes, and polynucleotides derived from those sequences, provide the basis for hypoallergenic molecules that could be used in the desensitization of allergic individuals with lessened chance of anaphylaxis, or as vaccines against the allergy.
Information about the three-dimensional structure of a particular allergen is often available from the Protein Data Bank (Berman et al., 2000) (http://www.rcsb.org/pdb). In the absence of experimentally-determined three-dimensional information, a model of the allergen could be built based on structural information from closely related molecules. Various techniques are available for modeling purposes and those techniques are known to those skilled in the art.
On the basis of the three-dimensional structure of the allergen, the solvent accessibilities of the individual amino acid residues are computed using standard methods (see, for example, Padlan, 1990; Padlan 1994). Solvent accessibilities could also be obtained using the program DSSP (Kabsch et. al., 1983) (implemented in http://bioweb.pasteur.fr/seqanal/interfaces/dssp-simple.html). The solvent accessibilities are used as weighting factors in the calculation of the antigenicities. The use of solvent accessibilities as weighting factors de-emphasizes the contribution of residues that are not too accessible and that probably do not contribute much to the interaction with IgE.
A method had been proposed earlier for quantifying the antigenicity of a given region in a protein molecule using the physicochemical attributes of the amino acid residues in the region (Padlan, 1985). That method is particularly suitable for locating the putative dominant IgE epitopes and is followed here. Structural parameters describing the physicochemical attributes of the various amino acids have been computed by various authors (for example, Sneath, 1966; Grantham, 1974; Sandberg et al., 1998) and those can be used in the calculation of antigenicities. The antigenicity of a region in the molecule is computed by taking the sum of the structural parameters, weighted or unweighted, corresponding to all the residues within that region. Structural parameters have been shown to provide a good measure of the ability of a given region to participate in antibody-antigen and other protein-protein interactions (see, for example, Padlan, 1990; Novotny, 1991; be Genst et al., 2002; David et al., 2007). Thus, antigenicity computed in this manner is directly correlated with the ability of a particular region to engage in tight binding to IgE. The regions displaying highest antigenicities are identified as the putative dominant IgE epitopes.
The de-Antigenization of the putative dominant IgE epitopes is achieved by the judicious replacement of the residues in those epitopes with amino acids that would contribute less to the total antigenicity values, while preserving the structure of the molecule. By taking into account the physicochemical properties of the amino acids and their propensity to participate in a particular secondary structure (presented in Table 1), replacement rules could be proposed. The replacement rules used in the examples below are included in Table 1. Other replacement rules could be proposed and used provided that they result in reduced antigenicity while preserving structure.
The concept can be implemented by those skilled in the art using the following, or similar, algorithm:
(A.1.0)—Generate a set of amino-acid replacement rules based on structural criteria, e.g., the replacement rules in Table 1. The recommended structural criteria are (1) the replacing amino acid should contribute less to the binding interaction with an antibody and (2) the replacement should not result in a significant change in the structure of the molecule.
(A.2.0)—Identify a protein molecule that is a major allergen in a particular allergy. Locate on the sequence the known T-cell epitopes of the molecule; if T-cell epitopes had not been experimentally determined, obtain possible T-cell epitopes using predictors, e.g. SYFPEITHI (Rammensee et al., 1999) (http://www.syfpeithi.de). If an experimentally-determined three-dimensional structure is available for the allergen, proceed to (A.3.0);
(A.2.1)—If a model structure for the allergen is available, proceed to (A.3.0);
(A.2.2)—Identify a homologous molecule for which an experimentally-determined three-dimensional structure or a model structure is available; if there is none, STOP
(A.2.3)—Generate a model for the allergen from its amino acid sequence.
(A.3.0)—Generate atomic coordinates for the biological, i.e. natural, aggregation state of the molecule (dimer, trimer, etc.) using appropriate symmetry operations. For experimentally-determined structures, atomic coordinates for the biological aggregation state may already be available from the Protein Data Bank. All subsequent computations will be on the biological aggregation state of the molecule.
(A.4.0)—Choose and isolate the positions at which the antigenicities will be computed, e.g., the alpha-carbon positions.
(A.5.0)—Compute the solvent accessibilities of the individual amino acid residues by using standard procedures (as described in Padlan, 1990 and references cited therein), or by using program DSSP (Kabsch et al., 1983) (implemented, for example, in http://bioweb.pasteur.fr/seqanal/interfaces/dssp-simple.html).
(A.6.0)—Choose a set of structural parameters (physicochemical attributes) for use in the computation of the antigenicities. The structural parameters compiled by Sandberg et al. (1998), or by Grantham (1974), are particularly suitable for the computation of antigenicities.
(A.7.0)—Compute the antigenicities at the positions chosen in (A.4.0). A measure of antigenicity ascribed to a given position would be the total contribution of the amino acids within a defined region around that position. The contribution of each amino acid may be the sum, appropriately weighted or unweighted, of the structural parameters chosen in (A.6.0). The solvent accessibility of the amino acid, computed in (A.5.0), is recommended as an appropriate overall weight for the contribution of that amino acid to the antigenicity.
(A.8.0)—Identify the possible location of the putative dominant IgE epitopes. The positions with antigenicity values significantly higher than the rest are most probably part of the putative dominant IgE epitopes. A basis for the identification of the putative dominant IgE epitopes, could be the root-mean-square (r.m.s.) deviation from the mean of the antigenicity values of all epitopes.
(A.9.0)—Replace the residues comprising the putative dominant IgE epitopes according to the replacement rules generated in (A.1.0). The residues would be the ones located within a certain radius of the epitope centers chosen in (A.4.0). A suitable value for the radius could be determined by examining known antibody-antigen complexes (see, for example, Padlan, 1996). It is recommended that the residues to be replaced be chosen on the basis of their solvent accessibility and their relative contribution to the overall antigenicity of the epitope. Preserve those residues which are probably critical to the structure (secondary, tertiary, quaternary) of the antigen, including residues whose posttranslational modification, e.g. glycosylation, is probably required for preservation of structure. Preserve at least one of the T-cell epitopes located in (A.2.0), as well as segments for which high antigenicity values might elicit useful antibody responses, e.g. inhibition of particular reactions. The suggested replacement should not be made if it will result in a peptide segment (of sufficient length to be presented by T cells) that is identical to a segment present in a human protein; this is to obviate autoimmune reactions.
(A.10.0)—Repeat (A.2.3) to (A.9.0) until it is deemed that the decrease in antigenicity of the putative dominant IgE epitopes is sufficient, or until no further amino-acid replacements are warranted.
(A.11.0)—The amino acid sequences resulting from (A.10.0), or the polynucleotides derived from those sequences, provide the basis for hypoallergenic molecules that could be used in the desensitization of allergic individuals, with lessened chance of anaphylaxis, or as vaccines against the allergy.
The present invention will now be described with reference to the following specific, non-limiting examples.
Design of possible hypoallergenic molecules for use in the desensitization with lessened chance of anaphylaxis, or as vaccines, against Der p 1, the major allergen of the European house dust mite, Dermatophagoides pteronyssinus:
Three-dimensional structural information for the mature form of Der p 1 has been provided by X-ray crystallography (de Halleux et al., 2006) (Protein Data Bank entry 2AS8). The sequence of the mature form of Der p 1, for which an X-ray structure is available, is presented as SEQ ID NO: 1. Hereinafter, the fragment represented by that structure will be referred to simply as 2AS8. Using SYFPEITHI, three putative T-cell epitopes were predicted: residues 22-36, 34-48, and 37-51. During de-Antigenization, residues 22-51 were preserved.
The solvent accessibilities of the individual residues of 2AS8 were obtained using the program DSSP (Kabsch et al., 1983) (http://bioweb.pasteur.fr/seqanal/interfaces/dssp-simple.html). Fractional accessibility for each amino acid was estimated by dividing the accessibility obtained from DSSP by the total surface area of the amino acid (obtained from http://prowl.rockefeller.edu/aainfo/volume.htm).
The structural parameters provided by Sandberg et al. (1998) (reproduced in Table 1) were used in the calculation of antigenicities. The antigenicity of a region centered at each alpha-carbon position was computed by taking the sum of the zz1, zz2 and zz3 structural parameters of Sandberg et al. (1988) corresponding to all the residues within 14 Angstroms of the alpha-carbon. In this example, the radius of 14 Angstroms was chosen on the basis of the results of calculations on the known epitopes of the allergen, hen egg white lysozyme (Padlan, 1996). The solvent accessibilities obtained above for 2AS8 were used as weighting factors in the calculation of the antigenicities.
Only those epitopes whose antigenicity values are greater than 2 r.m.s. deviations above the mean were considered. De-Antigenization was achieved after two rounds of antigenicity calculation followed by amino-acid replacements. No further replacements were suggested after the two rounds. The replacement rules proposed in Table 1 were applied. Only those residues, whose contribution to the antigenicity of the putative dominant IgE epitope is at least 3% of the total and whose fractional solvent accessibility is at least 40%, were replaced.
Prior to de-Antigenization, the average antigenicity of the molecule represented by SEQ ID NO: 1 was 25.5 (r.m.s. deviation=12.6) (arbitrary units). A total of 27 amino acid replacements were made, yielding SEQ ID NO: 2. This resulted in an average antigenicity value of 2.3 (r.m.s. deviation=8.4); 2 more changes were suggested, yielding SEQ ID NO: 3. This resulted in an average antigenicity value of 1.6 (r.m.s. deviation=7.9); no more changes were suggested. The plots of antigenicities computed for 2AS8, before and after two rounds of de-Antigenization, are presented in
Possible Hypoallergenic Molecule for Use in the Desensitization to Der p 1, with Lessened Chance of Anaphylaxis, or as Vaccine Against Allergy to the European House Dust Mite:
Since every round of de-Antigenization resulted in a significant decrease in the antigenicity of the dominant IgE epitopes, any of the derivative amino-acid sequences (SEQ ID NO: 2 or 3), or a polynucleotide derived from it, could be the basis of a possible hypoallergenic molecule useful in the desensitization of individuals allergic to Der p 1 with lessened chance of anaphylaxis, or as possible vaccine against European house dust mite allergy. The best candidate is probably the one represented by the sequence after the two rounds of de-Antigenization (SEQ ID NO: 3).
Design of possible hypoallergenic molecules for use in the desensitization with lessened chance of anaphylaxis, or as vaccines, against Jun a 1, the major pollen allergen from the cedar, Juniperus ashei:
A crystallographically-determined structure of Jun a 1 (Czerwinski et al., 2005) is available from the Protein Data Bank (Entry 1PXZ), hereinafter referred to simply as 1PXZ. The sequence of the mature form of Jun a 1, for which an X-ray structure is available, is presented as SEQ ID NO: 4. Several peptides were predicted by SYFPEITHI as possible T-cell epitopes; two of these (residues 131-145 and 142-156) were chosen to be preserved during de-Antigenization.
Solvent accessibilities for 1PXZ were computed as in EXAMPLE 1. The surface areas accessible to solvent were computed using DSSP and the fractional accessibility of each residue was estimated by dividing the solvent accessible area of the residue by the surface area of the particular amino acid.
The antigenicity of regions around the alpha-carbon positions of 1PXZ were computed as in EXAMPLE 1. The zz1, zz2 and zz3 structural parameters of Sandberg et al. (1998) were used. A radius of 14 Angstroms was used to define the region around each alpha-carbon position. The initial average antigenicity value was 22.5 (arbitrary units) with a root-mean-square (r.m.s.) deviation from the mean of 12.2. The regions with antigenicity values greater than two r.m.s. deviations above the mean were identified as the putative dominant IgE epitopes.
The residues in the putative dominant IgE epitopes, which each contribute at least 3% of the total antigenicity of the epitope and whose fractional accessibilities are greater than 40%, were replaced according to the rules proposed in Table 1. Seventeen residues were replaced, yielding SEQ ID NO: 5. The antigenicities were recalculated and this resulted in an average antigenicity of 12.1 (r.m.s. deviation=11.6). Fourteen more residues were replaced, yielding SEQ ID NO: 6. This resulted in an average antigenicity of 4.2 (r.m.s. deviation=7.8). Eleven more residues were replaced, yielding SEQ ID NO: 7. A third round of de-Antigenization resulted in an average antigenicity of −0.3 (r.m.s. deviation=8.3). After replacing six more residues, yielding SEQ ID NO: 8, a fourth round of de-Antigenization resulted in an average antigenicity of −2.1 (r.m.s. deviation=8.3). No additional residues were found to need replacement after this fourth round of de-Antigenization. The antigenicities before and after the four rounds of de-Antigenization of 1PXZ are plotted in
Possible Hypoallergenic Molecules for Use in the Desensitization, with Lessened Chance of Anaphylaxis, or as Vaccines Against Jun a 1, the Major Pollen Allergen from the Cedar, Juniperus ashei:
Since every round of de-Antigenization resulted in a significant decrease in the antigenicity of the dominant IgE epitopes, any of the derivative amino-acid sequences (SEQ ID NO: 5 through 8), or a polynucleotide derived from it, could be the basis of a possible hypoallergenic molecule useful in the desensitization of individuals allergic to Jun a 1 with lessened chance of anaphylaxis, or as possible vaccine against pollen from Juniperus ashei. The best candidate is probably the one represented by the sequence after the four rounds of de-Antigenization (SEQ ID NO: 8).
Design of possible hypoallergenic molecules for use in the desensitization with lessened chance of anaphylaxis, or as vaccines, against Ves v 5, the major venom allergen from yellow jackets, Vespula vulgaris:
A crystallographically-determined structure of Ves v 5 (Henriksen et al., 2001) is available from the Protein Data Bank (Entry 1QNX), hereinafter referred to simply as 1QNX. The sequence of the mature form of Ves v 5, for which an X-ray structure is available, is presented as SEQ ID NO: 9. Several peptides have been shown to be T-cell epitopes (Bohle et al., 2005); two of those (residues 78-87 and 181-192) were chosen to be preserved during de-Antigenization.
Solvent accessibilities for 1QNX were computed as in EXAMPLE 1. The surface areas accessible to solvent were computed using DSSP and the fractional accessibility of each residue was estimated by dividing the solvent accessible area of the residue by the surface area of the particular amino acid.
The antigenicity of regions around the alpha-carbon positions of 1QNX were computed as in EXAMPLE 1. The zz1, zz2 and zz3 structural parameters of Sandberg et al. (1998) were used. A radius of 14 Angstroms was used to define the region around each alpha-carbon position. The initial average antigenicity value was 12.1 (arbitrary units) with a root-mean-square (r.m.s.) deviation from the mean of 11.2. The regions with antigenicity values greater than two r.m.s. deviations above the mean were identified as the putative dominant IgE epitopes.
The residues in the putative dominant IgE epitopes, which each contribute at least 3% of the total antigenicity of the epitope and whose fractional accessibilities are greater than 40%, were replaced according to the rules proposed in Table 1. Twelve residues were replaced, yielding SEQ ID NO: 10. The antigenicities were recalculated and this resulted in an average antigenicity of 2.9 (r.m.s. deviation=10.5). Seven more residues were replaced, yielding SEQ ID NO: 11. This resulted in an average antigenicity of −2.7 (r.m.s. deviation=10.6). Eleven more residues were replaced, yielding SEQ ID NO: 12. A third round of de-Antigenization resulted in an average antigenicity of −6.1 (r.m.s. deviation=9.3). After replacing two more residues, yielding SEQ ID NO: 13, a fourth round of de-Antigenization resulted in an average antigenicity of −8.1 (r.m.s. deviation=8.2). No additional residues were found to need replacement after this fourth round of de-Antigenization. The antigenicities before and after the four rounds of de-Antigenization of 1QNX are plotted in
Possible Hypoallergenic Molecules for Use in the Desensitization, with Lessened Chance of Anaphylaxis, or as Vaccines Against Ves v 5, the Major Venom Allergen from the Cedar, Vespula vulgaris:
Since every round of de-Antigenization resulted in a significant decrease in the antigenicity of the dominant IgE epitopes, any of the derivative amino-acid sequences (SEQ ID NO: 9 through 13), or a polynucleotide derived from it, could be the basis of a possible hypoallergenic molecule useful in the desensitization of individuals allergic to Ves v 5 with lessened chance of anaphylaxis, or as possible vaccine against pollen from Vespula vulgaris. The best candidate is probably the one represented by the sequence after the four rounds of de-Antigenization (SEQ ID NO: 13).