Methods for treating sensitivity to protein allergen using peptides which include a T cell epitope recognized by a T cell receptor specific for the protein allergen

Information

  • Patent Grant
  • 6335019
  • Patent Number
    6,335,019
  • Date Filed
    Monday, June 5, 1995
    29 years ago
  • Date Issued
    Tuesday, January 1, 2002
    22 years ago
Abstract
Antigen E or Amb a I of ragweed pollen has been shown to be a family or families of proteins. cDNAs encoding Amb a I, the major human allergen of ragweed and Amb a II, peptides derived from Amb a I or Amb a II, antibodies against the peptides; and methods of treating individuals for sensitivity to ragweed are disclosed.
Description




DESCRIPTION




FUNDING




Work described herein was supported by the National Institutes of Health (Grant No. AI14908).




BACKGROUND OF THE INVENTION




Genetically predisposed individuals, who make up about 10% of the population, become hypersensitized (allergic) to antigens from a variety of environmental sources to which they are exposed. Those antigens that can induce immediate and/or delayed types of hypersensitivity in people are called allergen. King, T. P.,


Adv. Immun.


, 23:77-105 (1976). Anaphylaxis or atopy, which includes the symptoms of hay fever, asthma and hives, is one form of immediate allergy. It can be caused by a variety of atopic allergens, such as products of grasses, trees, weeds, animal dander, insects, and food, drugs and chemicals.




The antibodies involved in atopic allergy belong primarily to the IgE class of immunoglobulins. IgE binds to mast cells and basophils. Upon combination of a specific allergen with IgE bound to mast cells, the IgE is cross-linked on the cell surface, resulting in the physiological effects of IgE-antigen interaction. Degranulation results in release of, among other substances, histamine, heparin, a chemotactic factor for eosinophilic leukocytes and the leukotrienes, C4, D4 and E4, which cause prolonged constriction of bronchial smooth muscle cells. Hood, L. E. et al.,


Immunology


, (2nd ed.), pp460-462, The Benjamin/Cumming Publishing Co., Inc. (1984). These released substances are the mediators which result in allergic symptoms caused by combination of IgE with a specific allergen. Through them, the effects of an allergen are manifested. Such effects may be systemic or local in nature, depending on the route by which the antigen entered the body and the pattern of deposition of IgE and mast cells. Local manifestations generally occur on epithelial surfaces at the location at which the allergen entered the body. Systemic effects can include anaphylaxis (anaphylactic shock), which is the result of an IgE-basophil response to circulating (intravascular) antigen.




One allergen of particular concern for many people is Antigen or


Amb a


I, a poorly-defined constituent (or group of constituents) which is the major allergenic component(s) of short ragweed (


Ambrosia artemisiifolia I


. or


Ambrosia elatior


) pollen and the major cause of late summer hayfever in North America and Canada. Smith, J. J., et al.,


Mol. Immun


, 25:355-364 (1988); King, T. P., et al.,


Biochem.


, 3:458-468 (1964); King, T. P.,


Adv. Immun.


, 23:77-105 (1976). It has been estimated that, on average, as much as 13% of the total serum IgE in ragweed-sensitive individuals is specific for


Amb a


I. Zeiss, C. R., et al.,


J. Immun.


, 110:414-421 (1973).


Amb a


I has been claimed to be an acidic, 38,000 molecular weight, non-glycosylated protein which is cleaved during extraction and chromatographic isolation into two non-covalently associated chains: an alpha chain of 26,000 molecular weight and a beta chain of 12,000 molecular weight. Knox, R. B., et al.,


Nature


, 255:1066-1068 (1970); Knox, R. B., and Heslop-Harrison, J.,


J. Cell Sci.


, 6:1-27 (1970); King, T. P.,


Adv. Immun.


, 23:77-105 (1976); King, T. P., et al.,


Archs Biochem. Biophys.


, 212:127-135 (1981). The two-chain and the single chain forms of


Amb a


I, which are both highly reactive with IgE, are allergenically and antigenically related. King, T. P., et al.,


Biochemistry


, 3:458-468 (1964). It has been shown, however, that several physical and chemical modifications of


Amb a


I cause a marked loss of antigen and allergenic activity. King, T. P., et al.,


Archs Biochem. Biophys.


, 212:127-135 (1981); King, T. P., et al.,


Immunochemistry


, 11: 83-92 (1974).




Because ragweed pollen is the chief causative agent of late-summer hay fever in the eastern United States and Canada, it has been the subject of more studies by different laboratories than any other pollen allergen. King, T. P.,


Adv. Immun.


, 23:77-105 (1976). Despite extensive study, the immunochemical definition of


Amb a


I is still far from complete. Smith and co-workers have begun characterization of the epitope structure of


Amb a


I, using a series of murine monoclonal antibodies raised against purified, native


Amb a


I. Three non-overlapping, non-repeating antigenic sites were defined (sites A, B, and C) and monoclonal antibodies directed to sites A and B together resulted in inhibition of 80% of human IgE binding to


Amb a


I. The reactivity of the monoclonal antibodies used was greatly diminished when


Amb a


I was physically or chemically modified. Olsen, Ph.D. thesis, University of North Carolina, Chapel Hill (1986); Olson, J. R., and Klapper, D. G.,


J. Immun.


, 136:2109-2115 (1986). They indicated that the two sites (A and B) are conformationally dependent epitopes. That is, they are either single structures which lose their conformation during modification or composite structures made up of two or more discontinuous peptides which are proximal in the native allergen but separate once the allergen has been modified. Smith, J. J., et al.,


Mol. Immun.


, 25:355-364 (1988).




Despite the considerable attention ragweed allergens have received, definition or characterization of the structure(s) or component(s) of the allergen responsible for its adverse effects on people is far from complete and current desentization therapy involves treatment with a complex, ill-defined extract of ragweed pollen.




SUMMARY OF THE INVENTION




The present invention relates to allergenic proteins or peptides from ragweed, DNAs encoding all or a portion of such allergenic proteins or peptides; to compositions containing such an allergen(s) or portions of the allergen(s); and to methods of administering the allergen(s) or a portion thereof or a composition which includes the allergen(s) or portions thereof to reduce or prevent the adverse effects that exposure to the allergen normally has on ragweed-sensitive individuals (i.e., to desensitize individuals to the allergen or block the effects of the allergen). The present invention further relates to methods of diagnosing sensitivity to ragweed pollen.




It has now been shown that Antigen E or


Amb a


I is not a single protein but, rather, a family or families of proteins to which ragweed-sensitive individuals react. In particular, the present invention relates to DNA encoding an amino acid sequence or peptide present in allergenic proteins from ragweed pollen. It relates to DNA encoding all or a portion of the ragweed allergen


Amb a


I or Antigen E preparation which has been isolated. Such ragweed allergen preparations are heterogeneous in nature and may include, in additions to what is currently referred to as


Amb a


I or Antigen E, other ragweed components which are allergenic (i.e., cause the typical adverse effects observed in a ragweed-sensitive individual upon exposure to ragweed pollen). These may include, for example, what is referred to in the literature as Antigen K and referred to herein as


Amb a


II. The present invention also relates to DNAs encoding similar amino acid sequences (i.e., DNA encoding amino acid sequences of allergens) in types of ragweed other than short ragweed, such as giant ragweed and western ragweed.











BRIEF DESCRIPTION OF THE DRAWINGS





FIG. 1A

is a schematic representation of screening of an


Amb a


I or Antigen E preparation, using monoclonal antibodies and oligoprobes.





FIG. 1B

is a schematic representation of screening of a ragweed flowerhead λgt10 library. It also illustrates the use of cross-hybridization and polymerase chain reaction (PCR) methods to obtain full-length cDNA clones encoding


Amb a


I and


Amb a


II.





FIG. 2

is the nucleotide sequence of the DNA insert of UNC Clone 1 (referred to as


Amb a


IA) (SEQ ID No:56), which was isolated from a λgt11 library by screening with monoclonal antibodies specific for components of an


Amb a


I preparation.





FIG. 3

is the nucleotide sequence of the DNA insert of UNC Clone 6 (referred to as


Amb a


IB) (SEQ ID NO:58), which was isolated from a λgt11 library by screening with monoclonal antibodies specific for components of and


Amb a


I preparation.





FIG. 4

is the nucleotide sequence of the DNA insert of UNC Clone 15 (referred to as


Amb a


IC) (SEQ ID NO:60 ), which was isolated from a λgt11 library by screening with monoclonal antibodies specific for components of an


Amb a


I preparation.





FIG. 5

is the nucleotide sequence of the cDNA insert of IPC Clone 1(SEQ ID NO:62), which was isolated from a λgt10 cDNA library using an oligonucleotide probe whose sequence was deduced form an amino acid sequence known to be present in the ragweed allergen preparation


Amb a


I. The location of the sequence from which the sequence of the oligonucleotide probe was deduced is underlined.





FIG. 6

is the nucleotide sequence of the cDNA insert of IPC Clone 5(SEQ ID NO:62), which was isolated from a λgt10 cDNA library using an oligonucleotide probe whose sequence was deduced from an amino acid sequence known to be present in the ragweed allergen preparation


Amb a


I. The location of the sequence from which the sequence of the oligonucleotide probe was deduced is underlined.





FIG. 7

is the nucleotide sequence of the cDNA insert of IPC Clone 6 (SEQ ID NO:64), which was isolated from a λgt10 cDNA library using an oligonucleotide probe whose sequence was deduced from an amino acid sequence known to be present in the ragweed allergen preparation


Amb a


I. The location of the sequence from which the sequence of the oligonucleotide probe was deduced is underlined.





FIG. 8

is a schematic representation of open reading frame analysis of the DNA insert of IPC Clone 1.





FIG. 9

is a schematic representation of open reading frame analysis of the DNA insert of IPC Clone 5.





FIG. 10

is a schematic representation of open reading frame analysis of the DNA insert of IPC Clone 6.





FIG. 11

is the nucleotide sequence (SEQ ID NO:71) and deduced amino acid sequence of a full length


Amb a


IA clone (related to UNC clone 1)(SEQ ID NO:72).





FIG. 12

is the nucleotide sequence (SEQ ID NO:73 ) and deduced amino acid sequence of a full length


Amb a


IB clone (related to UNC clone 6)(SEQ ID NO:74).





FIG. 13

is the nucleotide sequence (SEQ ID NO:75) and deduced amino acid sequence of a full length


Amb a


IC clone (related to UNC clone 15)(SEQ ID NO:76).





FIG. 14

is the nucleotide sequence (SEQ ID NO:77) and deduced amino acid sequence of a full length


Amb a


ID clone (SEQ ID NO:78).





FIG. 15

is the nucleotide sequence (SEQ ID NO:79 ) and deduced amino acid sequence of a full length


Amb a


II clone (SEQ ID NO:80).





FIG. 16

is the composite amino acid sequences of the


Amb a


I and


Amb a


II multigene family showing regions of similarity as well as regions of disagreements.





FIG. 17

is a photograph of a Western blot of affinity purified


Amb a


I treated with rabbit anti-


Amb a


I polyclonal antibody, JB1E3-4 anti-


Amb a


I monoclonal antibody or ragweed allergic patient sera.





FIG. 18

is a photograph of a two dimensional gel of an aqueous extract of short ragweed pollen, separated on the basis of size and charge and stained with T. P. King's antibody, which recognizes


Amb a


I (goat polyclonal anti-


Amb a


I).





FIG. 19

is a photograph of a Western blot of several


E. coli


-expressed recombinant


Amb a


I cDNAs treated with goat anti-


Amb a


I antibody.





FIG. 20

is a photograph of a Western blot of several


E. coli


expressed recombinant


Amb a


I cDNAs treated with human allergic sera stained with anti-human IgE.





FIG. 21

is a graphic representation of T cell proliferation responses of ragweed allergic patient PBMC toward an aqueous extract of short ragweed pollen, affinity purified


Amb a


I (B7) chromatographically purified


Amb a


I and


E. coli


lysate containing expressed recombinant


Amb a


I proteins.











DETAILED DESCRIPTION OF THE INVENTION




The present invention is based on an investigation of ragweed pollen allergens, particularly the preparation known as


Amb a


I (or Antigen E) from short ragweed, using several inner-related approaches, each described below. The terms


Amb a


I and Antigen E are used interchangeably. Such a preparation, obtained from ragweed pollen, is likely to contain other ragweed allergens, such as Antigen K or


Amb a


II. The possibility that such a preparation does contain other such allergens has been assessed and results demonstrate that this is the case.




Results of work described herein show that


Amb a


I is not a single protein or peptide but is, in fact, heterogeneous in nature. That is, what is presently referred to as Antigen E (or


Amb a


I) appears to be a family or families of proteins or to be polymorphic in nature. The work described herein has resulted in identification and isolation of DNAs encoding peptides or amino acid sequences present in a ragweed allergen. As described, full-length cDNAs encoding


Amb a


IA,


Amb a


IB,


Amb a


IC and


Amb a


ID and


Amb a


II have been isolated and sequenced. It has also resulted in isolation and purification from an


Amb a


I preparation of a protein shown to bind human ragweed IgE and to bind rabbit


Amb a


I antisera produced using a purified


Amb a


I preparation. Interrelationships among DNAs and proteins or peptides identified and isolated using the approaches described in the following section have been demonstrated. For ease of presentation, the several approaches used are represented schematically in

FIGS. 1A and 1B

, to which reference is made in the following discussion.




As a result of the work described herein, DNAs encoding proteins or peptides present in ragweed allergens have been identified and isolated and the amino acid sequence of the encoded product has been deduced. In addition, through the use of monoclonal antibodies specific for


Amb a


I or Antigen E, a protein has been obtained from an


Amb a


I preparation. This protein, referred to as affinity purified


Amb a


I, has been shown to have biological activity (human IgE binding ability and ability to bind rabbit


Amb a


I antisera) and, thus, is highly likely to be an allergen. It has also been shown to be encoded by a region of the nucleotide sequences present in two of the isolated DNAs.




The following is a description of several approaches which have been used to identify and isolate DNAs encoding proteins or peptides from


Amb a


I or Antigen E preparations, as well as to isolate from an


Amb a


I preparation a protein shown to have


Amb a


I activity. As represented in

FIG. 1A

, an


Amb a


I or Antigen E preparation, which was prepared from pollen extract by a method based on the method of T. P. King and co-workers, was produced. King, T. P. et al,


Arch. Bioch. and Biophys.


, 212:127-135 (1981). A panel of monoclonal antibodies produced by Klapper and co-workers was used to identify proteins in the preparation.




Smith, J. J. et al.,


Mol. Immun.


, 25:355-365 (1988). Sequences of several peptides from an Antigen E preparation were determined by conventional techniques.




The following sections describe: 1) use of a pool of these monoclonal antibodies (i.e., a pool of monoclonal antibodies reactive with


Amb a


I) to identify clones containing DNA inserts encoding the reactive product and 2) use of an oligonucleotide probe, constructed from an amino acid sequence present in the


Amb a


I preparation to identify clones containing DNA inserts encoding the amino acid sequence. Each approach resulted in identification of three clones containing DNA encoding an amino acid sequence present in the


Amb a


I or antigen E preparation. The two sets of clones isolated as described below have been shown to be different from each other.




Use of Monoclonal Antibodies to Identify Clones Containing DNA Inserts Encoding Ragweed Protein




A pool of seven monoclonal antibodies specifically reactive with components of the


Amb a


I preparation was used to screen a ragweed pollen λgt11 library, using a known method. Young, R. A. and R. W. Davis,


Proceedings of the National Academy of Sciences, USA


, 80:1194-1198 (1983 ). This resulted in identification of three clones, initially designated UNC Clones 1, 6 and 15 and referred to herein as


Amb a


IA, IB and IC, respectively, which expressed a product recognized by at least one of the monoclonal antibodies in the panel. The nomenclature of cDNAs encoding the allergens


Amb a


I and


Amb a


II have been named according to the recommendations of the International Union of Immunological Societies Sub-Committee for Allergen Nomenclature (Marsh et al.,


Annals of Allergy


, 60:499-504 (1988)).




DNA isolated from the three reactive clones was sequenced, using the method of Sanger, F. et al. Sanger, F. et al.,


Proc. Natl. Acad. Sci., USA


, 74:5463 (1977). The nucleotide sequences of the three clones are presented in

FIGS. 2-4

.




Using the partial cDNA sequences presented in

FIGS. 2-4

, cross-hybridization (as described in Example 2) and PCR methods (as described in Example 3) were used to isolate full-length cDNAs encoding


Amb a


IA (FIG.


11


),


Amb a


IB (FIG.


12


),


Amb a


IC (

FIG. 13

) and


Amb a


ID (FIG.


14


).




In the course of DNA sequencing of cross-hybridizing cDNAs from a separately constructed λgt10 ragweed flowerhead library, a new cDNA was derived that shared sequence with


Amb a


II peptide sequence (FIG.


15


and FIG.


16


). Construction of this library and isolation of the new cDNA are described in Example 2. The composite amino acid sequences of the


Amb a


I and


Amb a


II multigene family are shown in

FIG. 16

, with the regions of similarity and of disagreement represented. In

FIG. 16

, the sequence of


Amb a


I is given in standard one-letter code. Sequences for the other


Amb a


I family members are given relative to that of


Amb a


I, with only differences being shown. A dash indicates identity between the two sequences. An asterisk indicates a break in the sequence introduced to maintain maximal alignment. Amino acid numbering is based on the


Amb a


IB sequence. Wherever sequence polymorphism has been observed in a given family member, the dominant sequence is given in superscript and the minor sequence is given in subscript. Polymorphisms in a given family member occur as independent events, except for amino acids 183-189 of


Amb a


ID, in which the polymorphism occurs as a block.




Use of an Oligonucleotide Probe to Identify Clones Containing DNA Inserts Encoding Ragweed Protein




As also represented in

FIG. 1A

, an amino acid sequence (SEQ ID NO:1) (WENFK) in the


Amb a


I preparation, which was identified and sequenced by conventional techniques, was used to deduce the sequence of an oligonucleotide probe (oligoprobe) encoding the amino acid sequence. The amino acid sequence used to deduce the oligonucleotide sequence was VWVKPWENFK (SEQ ID NO:2). A portion of that amino acid sequence (WENFK) was used to deduce the sequence of the oligoprobe, designated AGE#1. AGE#1 was used, as described in Example 1, to screen a cDNA library constructed in λgt10 using polyA


+


enriched RNA from pooled short ragweed flower heads. Screening with this oligoprobe resulted in identification of ten duplicated signals. These duplicated signals (clones) were subjected to a secondary screening with the same AGE#1 oligonucleotide probe. Three of the positives (referred to as secondary positives) were clearly detected in duplicate. The clones (designated IPC Clone 1, IPC Clone 5 and IPC Clone 6) identified in this manner were grown under appropriate conditions and verified as positive, by Southern blot analysis.




The cDNA insert from each of the three clones was isolated and cloned into M13mp18 and sequenced (FIGS.


5


-


7


). The amino acid sequence was also deduced (FIGS.


8


-


10


). Open reading frames in the sequenced cDNAs were examined (

FIGS. 8-10

) and the sequence (from which the sequence of the oligonucleotide probe had been deduced) was identified. That the cDNA inserts encode a portion of translated protein was supported by the fact that the surrounding amino acid sequence deduced from the DNA sequence (VWVKP) agreed with the amino acid sequence initially used to deduce the sequence of the oligoprobe (FIGS.


8


-


10


). T cells from allergic patients could be stimulated by a synthetic peptide RAE4 (Table 5). The RAE4 sequence was deduced from IPC Clone 5 (FIG.


8


).




As is evident from a comparison of the two “sets” of nucleotide sequences (i.e., set 1, which are the DNAs isolated through use of monoclonal antibodies, and set 2, which are the DNAs isolated through use of the oligoprobe), there is homology among sequences within a set (i.e., within

FIGS. 2-4

and within

FIGS. 5-7

) but little similarity in sequences between sets.




Thus, it is apparent that the


Amb a


I or Antigen E preparation is heterogenous in nature and represents a family (or families) of proteins or that there is considerable polymorphism in


Amb a


I-encoding DNA. This is in contrast to present literature descriptions of


Amb a


I or Antigen E, which refer to Antigen E as a protein, rather than as a group or groups of allergenic proteins, present in ragweed pollen, to which ragweed-sensitive individuals respond.




Additional Demonstration of Isolation of Antigenic Peptides and DNAs of


Amb a


I




Additional results further demonstrate that antigenic peptides of


Amb a


I and DNAs encoding them have been identified and isolated. As represented in

FIG. 1A

, a selected monoclonal antibody (designated 4B5/B7) which recognizes an


Amb a


I preparation unsubjected to denaturing conditions was used to affinity purify from pollen extract a single protein, which is referred to as affinity purified


Amb a


I. This was carried out, using known techniques, by producing the desired monoclonal antibody, isolating it in large quantities from ascites and immobilizing it on Sepharose (Pharmacia). Aqueous pollen extract was passed over the monoclonal antibody-containing column and a protein species was eluted. Antigen E isolated in this manner was shown, using both Western blot (

FIG. 17

) and ELISA techniques, to bind human IgE, thus demonstrating biological activity expected of an


Amb a


I protein or peptide.




Peptide sequence analysis was carried out as follows: Two peptides were isolated from partial tryptic digestion of cyanogen bromide (CNBr) cleavage of affinity purified


Amb a


I, respectively, and then subjected to peptide sequencing. Because the N-terminal of


Amb a


I is blocked, no amino acid sequence can be obtained from direct N-terminal protein sequence analysis. The result of the sequence analysis of the tryptic peptide demonstrated that the major portion of its amino acid sequence agreed with peptide sequence 45 to 77 encoded by the


Amb a


IA cDNA (Table 1). Table 1 is a comparison of the amino acid sequences of


Amb a


I protein, determined by protein sequence analysis, with the amino acid sequence deduced from


Amb a


I cDNA. The CNBr cleavage peptide sequencing demonstrated that the CNBr cleavage peptide was similar to the peptide sequence 171 to 184 encoded by the


Amb a


IA cDNA (Table 1).




Further peptide sequence analysis was performed from the protein cleavage mixture without isolating individual peptides. The techniques employed involved specific hydrolysis (with 70% formic acid or CNBr) of the putative Asp-Pro and Met-Pro bonds deduced from the cDNA sequences of


Amb a


I. Any primary amino groups were then blocked by reaction with o-phthalaldehyde prior to conventional sequencing from any available N-terminal proline residue.












TABLE 1









Amb a I PROTEIN SEQUENCES


a


COMPARED TO PROTEIN SEQUENCE DEDUCED






FROM Amb a IA cDNA (SEQ ID NO:4) SEQUENCES























PARTIAL TRYPTIC DIGEST


b

















45        50        55        60        65        70        75        80








Amb a


IA




cDNA




 T S G A Y N I I D G C W R G K A D W A E N R K A L A D C A Q G F G K G T V G G








  85        90








 K D G D I T T V T















Amb a


I


c






MAJOR


d






(T)S(G)A T N I I D G C(W)R G K A D(W)A E N(R K)A L A D C A Q G F(G)







(SEQ ID NO:5)







MINOR


e






(D)                                       (S R)







(SEQ ID NO:6)













AgE


f






(SEQ ID NO:7)




(T)S G A T N I I D G C W R G K A D W A E N R K A L A D C A Q G F G K G T V G G








  K D G D I T(T)V(T)


















CNBr CLEAVAGE















       175       180       185








Amb 


Ia




cDNA




 H D V K V N P G G L I K G N D G







(SEQ ID NO:8)








Amb a






MAJOR




 F D L K V N I G Q L I K(G)N






I


g,h






(SEQ ID NO:9)







MINOR




  (A)P   N Y(I)P L    (N)







(SEQ ID NO:10)













AgE


1






(SEQ ID NO:11)




(H D V K V  )P G G L I K( )N( )G








     280       285       290       295       300       305       310       315








Amb a






cDNA




 P R C R H G F F Q V V N N N Y D K W G S Y A I G G S A S P T I L S Q G N R F C






IA




(SEQ ID NO:12)




      320








 A P D R E S








Amb a


 I


i






MAJOR




 P R C R H G F F Q V V N N N Y D R W G(S)Y A I G G S(A  )P T I L S Q G M( )F(C)







(SEQ ID NO:13)




 A F(D G  Y)







MINOR #1




         F         I       P         D(H)             (N)                    V







(SEQ ID NO:14)







MINOR #2




 P V L(T)P E(Q)S A(G M)







(SEQ ID NO:15)







MINOR #3




 T S G A Y N I I D G C W R G(K)A(D W)A







(SEQ ID NO:16)






AgE




MAJOR




 P R( )R H G F F Q V V N N N Y D(E W)G S Y A I G G S A S P T I







(SEQ ID NO:17)







MINOR #1


m






 A(W)N(W)R(T E K)D L







(SEQ ID NO:18)







MINOR #2


n






 V(I)N L(D Q)E I(F V)







(SEQ ID NO:19)











70% FORMIC ACID HYDROLYSIS OF ASP-PRO PEPTIDE BOND


1

















       365       370       375       380       385       390       395








Amb a


  IA




cDNA




 P V L T P E Q S A G M I P A E P G K S A L S L T S S A G V L S C Q P G A P







(SEQ ID NO:20)








Amb a


  I




MAJOR




 P V L(N P)E( )N A G M I Q A E(P G)E A







(SEQ ID NO:21)







MINOR




     I







(SEQ ID NO:22)






AgE




(SEQ ID NO:23)




 P V L T P E Q S A G M I P A E P G E S A L S L T S(S)A G V L(  C)Q P(G A)P






35kD


p,q






(SEQ ID NO:24)




 P V L T P V Q S A G M I P A E P G E A A I(K)L T S S













a


the amino acids are presented in single letter code; uncertain residues are included in paranthesis












b


the peptides were separated by EDS-PAGE then Western blotted on PVDF membrane for sequence analysis












c


IFC'S affinity purified


Amb a


 I preparation












d


major sequence determined in protein sequence analysis












e


minor sequence determined in protein sequence analysis












f


T. P. King's


Amb a


 I preparation












g


the cleavage mixture was separated by SDS-PAGE then Western blotted on PVDF membrane












h


the sequence is most similar to


a


 IA cDNA sequence among all the cloned cDNA sequence












i


the primary amine of the cleavage mixture was blocked by o-phthalaldehyde on the 7th step of sequence analysis












j


similar to the


a


 IIA cDNA sequence 277-315












k


similar to the


a


 IA cDNA sequence 361-371












l


similar to the


a


 IA cDNA sequence 45-63












m


similar to the


a


 IC cDNA sequence 338-347












n


similar to the


a


 IC cDNA sequence 126-135












o


matches to


a


 IC cDNA sequence 363












p


IPC's


Amb a


 I preparation with molecular weight of 35,000 dalton












q


matches to


a


 IC cDNA sequence 361-386













Results of these assessment (shown in Table 1) demonstrated that two peptide sequences determined from the affinity purified


Amb a


I preparation agreed with that encoded by two portions of


Amb a


IA DNA sequence (277-321 or 361-397). The minor sequences detected in the peptide sequence analysis also corresponded to a portion of peptide sequence encoded by cDNA's of


Amb a


I or


Amb a


II. The above peptide sequence analyses provided strong support that


Amb a


I or Antigen E-encoding DNA had been isolated.




An Antigen E preparation obtained from Dr. T. P. King was also subjected to peptide sequencing. The same peptide sequencing techniques were employed. Four peptides sequences were identified which agreed with the same four segments of peptide sequence encoded by


Amb a


IA DNA (45-92, 171-186, 277-321 and 361-397 in Table 1). This provided additional proof that


Amb a


I or Antigen E-encoding DNA had been isolated.




The same techniques were used with purified Antigen K (


Amb a


II) from Dr. T. P. King. Results demonstrated that two peptide sequences agreed with two portions of peptide sequence encoded by DNA of


Amb a


II (Table 2, see Example 2; FIG.


15


). Table 2 is a comparison of the amino acid sequences of


Amb a


II protein, determined by protein sequence analysis, with the amino acid sequence deduced from


Amb a


II cDNA. This finding provided support that ragweed pollen allergen encoding DNA had been isolated.












TABLE 2









Amb a II PROTEIN SEQUENCES


a


COMPARED TO PROTEIN SEQUENCE DEDUCED FROM






Amb a IIA c DNA SEQUENCE























CNBr CLEAVAGE


b

















    280       285       290       295       300       305       310








Amb a






cDNA




 P R C R F G F F Q I V N N F Y D R W D K Y A I G G S S N P T I L S Q G N






IIA




(SEQ ID NO:25)




  315       320








 K F V A P D F I Y













AgK


c






MAJOR


d






 P R( )R F G F F Q I V N N F Y D R W D(H)T A I G G S S H P T I L S Q G N(R)F







(SEQ ID NO:26)




(R)P V A P(D  )I(Y)







MINOR


e,f






 P V L T P E Q N A G M







(SEQ ID NO:27)















Amb a






(SEQ ID NO:28)




 P(R   R)F G F F Q I V N N F Y D






II


g




















705 FORMIC ACID HYDROLYSIS OF ASP-PRO PEPTIDE BOND


b

















       365       370       375       380       385       390       395








Amb a






cDNA




 P V L T A E Q N A G M M Q A E P G D M V P Q L T M N A G V L T C S P G A P






IIA




(SEQ ID NO:29)






AgK




SEQ ID NO:30




 P V L T A E Q N A G M M Q A E P G D M V P Q L T M N A(G)V(L     S)P G A P








Amb a


II




MAJOR




 P V L T A E Q N A G M M Q A E P G D M V P Q L T M N A G V L T( )S P G A P







(SEQ ID NO:31)







MINOR


h






         P     S       I P         E S A L S     S          (S)







(SEQ ID NO:32)













a


the amino acids are presented in single letter code; uncertain residues are included in paranthesis












b


o-phthalaldehyde is reacted with peptide mixture prior to conventional peptide sequence analysis












c


T. P. King's


Amb a


 II preparation












d


major sequence determined in protein sequence analysis












e


minor sequence determined in protein sequence analysis












f


matches the a IIA cDNA sequence 361-371












g


matches the a IIA cDNA sequence 361-371












h


matches to a IA cDNA sequence 361-397













It has been previously reported that


Amb a


I and


Amb a


II share some antigenic determinants using rabbit and human antisera (King, T. P.,


Adv. Immun.


, 23:77-105 (1976)). However, the exact relationship between the two antigens, until the present invention, has remained unclear. King and colleagues have also reported that different isoforms of antigen E and K (


Amb a


I and


Amb a


II) can be isolated by ion-exchange chromatography (King, T. P. et al.,


Ach. Biochem. Biophys.


, 212:127-135 (1981)). The different isoforms described, designated A, B, C and D, were interpreted to be produced by limited proteolysis of the intact


Amb a


I and


Amb a


II species. It should be noted that these isoforms, designated A, B, C, etc., have no direct relationship with the nomenclature outlined in this invention (i.e.,


Amb a


IA,


Amb a


IB, etc.).




A 35,000 dalton species coprecipitates from ragweed pollen extract with


Amb a


II in 45% saturation of ammonium sulfate. Most of these proteins are shown to be aggregated by gel filtration chromatography. Some monomeric forms of these proteins were separated from


Amb a


II by ion exchange chromatography. The sequencing technique, which involved 70% formic acid hydrolysis of putative Asp-Pro bound and o-phthalaldehyde blocking of primary amino groups, demonstrated that the predominant protein corresponds to that encoded by the DNA sequence of


Amb a


IC. This peptide sequence is referred to as 35 kD in Table 1. This result provided additional support that


Amb a


I proteins are heterogeneous in nature and are encoded by closely related DNA's .




As is also represented in

FIG. 1A

, rabbit polyclonal antibodies were produced using the King Antigen E preparation. These antibodies were shown to identify a 38 kd protein species on a Western blot of pollen extracts (FIG.


17


). A two-dimensional gel of ragweed pollen extract, electrophoresed in one dimension on the basis of charge and in the other dimension on the basis of size and treated with goat anti-Amb antibodies is shown in

FIG. 18

Results demonstrate binding to several proteins present in ragweed pollen extract with a relative molecular weight of 38 kD, corresponding to differently charged forms of what was formerly referred to as


Amb a


I protein. These antibodies were also shown, using a similar technique, to bind to the affinity purified


Amb a


I described previously (FIG.


17


).




It is clear from the antibody reactivity that the 4B5/B7 affinity purified


Amb a


I has a recognition pattern similar to that of the


Amb a


I of pollen and skin test reagent with both rabbit polyclonal anti-


Amb a


I and JB1E3-4 anti-


Amb a


I monoclonal antibody (FIG.


17


). It also has readily detectable IgE reactivity on a Western blot (

FIG. 17

; patient number 155). It is also clear that chromatographically purified


Amb a


II (Antigen K) has cross-reactive B-cell epitopes with the affinity purified


Amb a


I (FIG.


17


: anti-


Amb a


I polyclonal).




As a result of the work described herein, cDNAs encoding allergenic peptides of proteins from a preparation of


Amb a


I, the major human allergen of ragweed and a preparation of


Amb a


II, have been cloned, isolated and sequenced; the encoded amino acid sequences (of the allergen(s)) have been deduced and peptides derived from


Amb a


I and


Amb a


II have been identified and isolated.




Furthermore, full-length and truncated cDNAs encoding several members of the


Amb a


I multigene family, as well as


Amb a


II, were cloned in-frame into the expression vector pTrc99 (Amann et al.


Gene


, 69:301-315, (1988)) and transformed into the JM109 host. Expression of recombinant


Amb a


I and


Amb a


II protein was induced by 1 mM isopropyl-β-D-thiogalactopyranoside, cells were harvested, lysozyme treated, sonicated and insoluble inclusion bodies recovered by a low speed centrifugation. Recombinant


Amb a


I and


Amb a


II protein present in the recovered pellet was solubilized in buffer containing 8M urea, 50 mM Tris HC1 pH8.0, 50 mM NaCl, 1 mM EDTA, 1 mM dithiothreitol, 1 mM phenylmethylsulfonyl fluoride. After solubilization, the crude urea lysate was dialyzed at 4° C. against PBS. The expressed recombinant


Amb a


I and


Amb a


II proteins were Western blotted and results are shown in

FIGS. 19 and 20

. Results demonstrate (

FIG. 19

) that goat anti-


Amb a


I antibody binds specifically to several forms of


Amb a


I (A, B and C), as well as to


Amb a


II (Antigen K). This antigenic cross-reactivity is consistent with the observed sequence homology of the cDNAs (see FIG.


16


). They further demonstrate (

FIG. 20

) that allergic human IgE binds specifically to some members of the Amb a I multigene family. In the case of patient #295, Amb a IA (full-length) and Amb a IC are bound specifically by IgE to a far greater extent than Amb a IB or Amb a II. A high level of variability in the patterns of IgE binding is seen in (Table 3 and data not shown), suggesting that different patients respond to the different Amb a I proteins to different extents.












TABLE 3











SUMMARIZED WESTERN BLOT DATA*



















Antigen










Patient




Pollen




IA(t)




IA




IB




IC




IIA









151




+









+









+




+






222




+−




+−




+−




+−




+−




+






291




+++




+




+++









+++




+−






295




+++




+




+++




+




+++











296




++





++





++
















− no signal over background










+− barely discernable over background










+ clearly positive










++ strongly positive










+++ highly positive










*selected from the total of ten patients screened to date.













An analysis of SDS-PAGE Western blot of IgE binding to several recombinant forms of Amb a I and Amb a II has demonstrated that there is considerable variation in the pattern observed with different patients. Of the ten ragweed allergic patients examined, all possess serum IgE that binds to at least one recombinant Amb a I or Amb a II, with some patient's IgE binding several different recombinant species (summarized in Table 3). Comparison of human IgE binding to recombinant Amb a I and Amb a II proteins with anti-peptide and monoclonal anti-Amb a I antibodies have provided data consistent with the conclusion that the N-terminal portion (historically referred to as the β-region) of Amb a IA includes the major IgE eptiope(s). This data (Table 3) is based on the observation that Amb a IA(t) (truncated Amb a IA; amino acid 70-398) binds ragweed allergic patient IgE less well than the full-length Amb a IA (amino acid 10-398). It is expected that the other Amb a I and Amb a II forms possess the same IgE binding properties (see

FIG. 20

, for example).




T cells from patients allergic to ragweed, previously stimulated with a mixed ragweed pollen extract, can recognize and proliferate in response to pollen extract, ragweed skin test reagent (RWST), affinity purified Amb a I protein and crude bacterial lysates containing recombinant Amb a I gene products IA, IB and IC (Table 4). T cells from these patients do not proliferate in the presence of an equivalent amount of control bacterial lysate, JM109. These results demonstrate that each gene product can stimulate some T cell reactivity. The use of crude bacterial lysates as antigens precludes a firm conclusion form the negative responses, since the relative levels of recombinant proteins in lysate have not been determined.












TABLE 4











STIMULATORY RESPONSE


a


OF THE HUMAN T CELL TO RECOMBINANT RAGWEED PROTEINS























Amb a IB




Amb a IC




Amb a IA (t)




Amb a IA




JM109






PATIENT #




POLLEN




RWST


b






Amb a I


c






LYSATE




LYSATE




LYSATE




LYSATE




LYSATE









151 2°




+++




+




(+)




+




+




+




(+)











222 2°




+++




+++




+++









++




++




++











274 2°




++




++




++




+




++













295 2°




+++






+




+




+












296 2°




+++




+++




+++




+++




+++




+++












314 2°




+++




+++




+++




+++




+++




+++












316 2°




+++




+++




+++




++




+++




+++












319 2°




+++




+++




++




(+)




++

















320 2°




++




++




+++




+









++












321 2°




+++




+++




+




++




+




++



















a


proliferation responses as compared to medium control:










(+) 2 fold










+ 2-4 fold










++ 4-10 fold










+++ >10 fold












b


ragweed skin test reagent from Hollister-Stier












c


affinity purified Amb a I













Uses of the Subject Allergenic Proteins/Peptides and DNA Encoding Same




The materials resulting from the work described herein, as well as compositions containing these materials, can be used in methods of diagnosing, treating and preventing ragweed allergy. In addition, the cDNA (or the mRNA from which it was transcribed) can be used to identify similar sequences in any variety or type of ragweed and, thus, to identify or “pull out” sequences which have sufficient homology to hybridize to, for example, DNA from short ragweed pollen. This can be carried out, for example, under conditions of low stringency; those sequences which have sufficient homology (generally greater than 40%) can be selected for further assessment using the method described herein. Alternatively, high stringency conditions can be used.




In this manner, DNA of the present invention can be used to identify, in other types of ragweed (such as giant ragweed or Western ragweed) sequences encoding peptides having amino acid sequences similar to that of Amb a I and, thus, to identify allergens in such other types of ragweed. Thus, the present invention includes not only Amb a I and other ragweed allergens (e.g., Amb a II or Antigen K) encoded by the present DNA sequences, but also other ragweed allergens encoded by DNA which hybridizes to DNA of the present invention.




Proteins or peptides encoded by the cDNA of the present invention can be used, for example, as “purified” allergens. Such purified allergens are useful in the standardization of allergen extracts which are key reagents for the diagnosis and treatment of ragweed allergy. Furthermore, by using peptides based on the sequences listed in

FIGS. 2 through 16

, anti-peptide antisera or monoclonal antibodies can be made using standard methods. Such reagents can be specifically directed against individual isoforms of Amb a I or Amb a II (i.e., directed against divergent regions/epitopes of the molecule) or can be specific for all forms Amb a I or Amb a II (i.e, directed against common sequences/epitopes). These sera or monoclonal antibodies, directed against Amb a I or Amb a II, can be used to standardize allergen extracts. One such monospecific anti-peptide antisera has already been successfully produced. This rabbit antisera, directed against an Amb a II sequence (amino acid 326-338; designated RAE 50. K with the sequence: CLRTGAQEPEWMT SEQ ID NO:33) binds specifically on Western blots to recombinant Amb a II but not Amb a IA, B or C (data not shown).




Through use of the peptides of the present invention, allergen preparations of consistent, well-defined composition and biological activity can be made and administered for therapeutic purposes (e.g., to modify the allergic response of a ragweed-sensitive individual to a ragweed pollen). Such peptides or proteins (or modified versions thereof, such as are described below) may, for example, modify B-cell response to a ragweed allergen, T-cell response to a ragweed allergen or both responses. Purified allergens can also be used to study the mechanism of immunotherapy of ragweed allergy and to design modified derivatives or analogous which are more useful in immunotherapy than are the unmodified (“naturally-occurring”) peptides.




Work by others has shown that high doses of allergens generally produce the best results (i.e., best symptom relief). However, many people are unable to tolerate large doses of allergens because of allergic reactions to the allergens. Modification of a naturally-occurring allergens can be designed in such a manner that modified peptides or modified allergens which have the same or enhanced therapeutic properties as the corresponding naturally-occurring allergen but have reduced side effects (especially anaphylactic reactions) can be produced. These can be, for example, a peptide of the present invention (e.g., one having all or a portion of the amino acid sequence of a peptide derived from the DNA insert of Clone Amb a IA, Clone Amb a IB, Clone Amb a IC, Amb a II, IPC Clone


1


. IPC Clone


5


or IPC Clone


6


, or their full-length cDNAs) or a modified peptide or peptide analogue (e.g., a peptide in which the amino acid sequence has been altered to modify immunogenicity and/or reduce allergenicity or to which a component has been added for the same purpose). For example, Amb a I peptides can be modified using the polyethylene glycol method of A. Sehon and co-workers.




Administration of the peptides of the present invention to an individual to de desensitized can be carried out using known techniques. A peptide or combination of different peptides can be administered to an individual in a composition which includes, for example, an appropriate buffer, a carrier and/or an adjuvant. Such compositions will generally be administered by injection, oral administration, inhalation, transdermal application or rectal administration. Using the structural information now available, it is possible to design a ragweed pollen peptide which, when administered to a ragweed-sensitive individual in sufficient quantities, will modify the individual's allergic response to a ragweed allergen. This can be done, for example, by examining the structures of the ragweed proteins, producing peptides to be examined for their ability to influence B-cell and/or T-cell responses in ragweed-sensitive individuals and selecting appropriate epitopes recognized by the cells. Synthetic amino acid sequences which mimic those of the epitopes and which are capable of down regulating allergic response to ragweed allergen can also be used. Proteins, peptides or antibodies of the present invention can also be used for detecting and diagnosing ragweed allergy. For example, by combining blood or blood products obtained from an individual to be assessed for sensitivity to ragweed allergen with an isolated allergenic peptide of ragweed pollen, under conditions appropriate for binding of components (e.g., antibodies, T cells, B cells) in the blood with the peptide and determining the extent to which such binding occurs.




It is now also possible to design an agent or a drug capable of blocking or inhibiting the ability of ragweed allergens to induce an allergic reaction in ragweed-sensitive individuals. Such agents could be designed, for example, in such a manner that they would bind to relevant anti-ragweed IgEs, thus preventing IgE-allergen binding and subsequent mast cell degranulation. Alternatively, such agents could bind to cellular components of the immune system, resulting in suppression or desensitization of the allergic response to ragweed allergens. A non-restrictive example of this is the use of appropriate B- and T-cell epitope peptides, or modifications thereof, based on the cDNA/protein structures of the present invention to suppress the allergic response to ragweed allergens. This can be carried out by defining the structures of B- and T-cell epitope peptides which affect B- and T-cell function in in vitro studies with blood cells from ragweed-sensitive individuals.




The cDNA encoding an allergenic protein or peptide from ragweed can be used to produce additional peptides, using known techniques such as gene cloning. A method of producing a protein or a peptide of the present invention can include, for example, culturing a host cell containing an expression vector which, in turn, contains DNA encoding all or a portion of a selected allergenic protein or peptide (e.g., Amb a I protein or peptide). Cells are cultured under conditions appropriate for expression of the DNA insert (production of the encoded protein or peptide). The expressed product is then recovered, using known techniques. Alternatively, the Amb I allergen or portion thereof can be synthesized using known mechanical or chemical techniques. As used herein, the term protein or peptide referes to proteins or peptides made by any of these techniques. The resulting peptide can, in turn, be used as described previously.




DNA to be used in any embodiment of this invention can be cDNA obtained as described herein or, alternatively, can be any oligodeoxynucleotide sequence having all or a portion of a sequence represented herein (See

FIGS. 2-16

), or their functional equivalents. Such oligodeoxynucleotide sequences can be produced chemically or mechanically, using known techniques. A functional equivalent of an oligonucleotide sequence is one which is capable of hybridizing to a complementary oligonucleotide sequence to which the sequence (or corresponding sequence portions) of

FIGS. 2-16

hybridizes and/or which encodes a product (e.g., a polypeptide or peptide) having the same functional characteristics of the product encoded by the sequence (or corresponding sequence portion) of

FIGS. 2-16

. Whether a functional equivalent must meet one or both criteria will depend on its use (e.g., if it is to be used only as oligoprobe, it need meet only the first criterion and if it is to be used to produce an Amb a I allergen, it need only meet the second criterion).




Antibodies against Amb a I peptides can be used to isolate additional components of ragweed allergens which can be used for further definition of the characteristics of the Amb a I family. Furthermore, anti-peptide sera or monoclonal antibodies directed against Amb a I and/or Amb II can be used to standardize and define the content of ragweed skin test reagents (RWST). This use would include RWST other than those derived from


Ambrosia artemisiifolia I


. (e.g., Western, Desert, Giant ragweeds, etc.).




The structural information now available (e.g., DNA, protein/peptide sequences) can also be used to identify or define T cell epitope peptides and/or B cell epitope peptides which are of importance in ragweed allergic reactions and to elucidate the mediators or mechanisms (e.g., interleukin-2, interleukin-4, gamma interferon) by which these reactions occur. This knowledge should make it possible to design peptide-based ragweed therapeutic agents or drugs which can be used to modulate these responses.




The present invention will now be further illustrated by the following Examples, which are not intended to be limiting in any way.




Example 1




Screening of a λgt10 cDNA Library Using an Oligonucleotide Probe




Poly+ enriched RNA extracted from pooled short ragweed flower heads was used to construct a cDNA library in the vector λgt10. The cDNA library was constructed using the Gubler and Hoffman method and the kit supplied by Amersham. Gubler, U. and B. J. Hoffman,


Gene


, 25:263 (1983). A library of 1.4×10


5


plaques constituting approximately 7×10


4


recombinants was constructed. This library was plated out and screened according to the method of Benton and Davis. Benton, W. D. and R. W. Davis,


Science


, 196:180 (1977).




Screening of the cDNA Library




An amino acid sequence (VWVKPWENFKK SEQ ID NO:34), thought to be derived from antigen E was used to deduce an oligonucleotide probe with which to screen the cDNA library.



















Amino Acid




W




E




N




F




K




K




(SEQ ID NO:35)






Sequence













Deduced




TGG




GAA




AAT




TTC




AAA




AAA




(SEQ ID NO:36)






Nucleotide





  G




  C




  T




  G




  G




(SEQ ID NO:37)






Sequence











AGE #1 OLIGOPROBE




The AGE #1 oligoprobe was end-labeled with


32


P and used to probe 70,000 recombinants using the hybridization conditions listed below:





















 6 × SSC








 1 × Denhardt's




at 30 degrees C for







50 μg/ml


E. coli


tRNA




22 hours















Ten duplicated signals were detected and these clones were subjected to a secondary screening with the same AGE #1 oligoprobe. It was subsequently discovered that the correct amino acid sequence is WENFKE (SEQ ID NO:33).




A summary of the cloning procedure is listed below:





















Primary Screen:




70,000 plaques










32


P - AGE #1 oligoprobe















Numerous spots with 10 signals were clearly seen on duplicated filters.




Secondary Screen




Plaques from the 10 duplicate signals were picked and plated out at low density and rescreened using methods outlined in Clontech's catalog.




Tertiary Screening




Three secondary positives number #1, #5, and #6 were clearly detected in duplicate. Each clone was grown up and verified as positive by Southern Blot analysis.




Sequencing of Positive Clones




cDNA inserts from each of the clones were isolated then cloned into M13mp18. Each clone was sequenced using the Sanger dideoxy method and the deduced amino acid sequence was determined. Sanger, F. et al.,


Proc. Natl. Acad. Sci., USA


, 74:5463 (1977).




Identification of WENFKK and Surrounding Sequence




The DNA sequences of the cDNA clones are presented in

FIGS. 5

,


6


, and


7


. The cDNA clones are not full-length and are less than 500 mucletides in length. The AGE#1 oligoprobe nucleotide sequence is underlined in

FIGS. 5

,


6


and


7


. Open reading frames in the sequenced cDNAs were examined and are presented in

FIGS. 8

,


9


and


10


. The translated amino acid sequence (WENFK) used to deduce AGE 1 oligoprobe sequence is underlined as well as the N-terminal surrounding sequence (VWVKPWENFK (SEQ ID NO:2); see

FIGS. 8

,


9


and


10


). IPC clones 1 and 5 disagree with the amino acid sequence at only one out of ten residues (i.e., L instead of P). The presence of the correct surrounding sequence (VWVKP (SEQ ID NO:39)) verifies that the cDNAs encode protein in pollen. Furthermore, a synthetic peptide based on the cDNA sequence designated RAE 4, which has the sequence EFPILGGITEVKDNDNSVDFC (SEQ ID NO:40), stimulates ragweed allergic patient T cells, in in vitro proliferation assays (see Table 5 and sequences in FIGS


8


,


9


and


10


).




Example 2




Cross-hybridization Methods Used to Obtain Full-Length cDNAs




Antigen E is reported to be a protein of approximately 38,000 molecular weight and consequently a full-length cDNA encoding this protein must be at least 1.1. kb in length (King, T. P et al.,


Arch Biochem. Biophys


., 212:127 (1981)). Consequently, IPC clones 1, 5 and 6 as well as UNC clones


1


,


6


and


15


(designated Amb a IA, IB and IC, respectively) are not full-length.




In order to isolate full-length clones, nick-translated


32


p-labelled Amb a I cDNA probes were used to screen the ragweed flowerhead λgt10 (see Example 1) and the ragweed pollen λgt11 library using standard methods (Maniatis et al.,


Molecular Cloning


, Cold Spring Harbor Laboratory, (1982)). Full-length or near full-length cDNAs encoding Amb a IB (

FIGS. 12 and 16

) and Amb a IC (

FIGS. 13 and 16

) were isolated using this method (FIG.


1


B). One cross-hybridizing cDNA clone (called K6-5), which has an open reading frame of approximately 145 amino acids (amino acids 253-398; FIG.


15


), was found to be significantly divergent from the previously isolated Amb a IA, Amb a IB, Amb a IC and Amb a ID and showed complete agreement (Table 2) with a peptide sequence derived from conventionally purified antigen K (a gift from T. P. King, New York). Consequently, this partial cDNA was designated as Amb a II (see FIG.


15


and below).




Example 3




Polymerase Chain Reaction (PCR) Methods Used to Obtain Full-length cDNAs




PCR methods can be successfully used to isolate both rare message cDNA as well as genomic clones of known sequence (Mullis et al.,


Cold Spring Harbor Symposium Quant. Biol


., 263-273 (1986)). 5′ and 3′ oligonucleotide primers were synthesized and used in a PCR experiment with ragweed pollen cDNA serving as template. The 5′primers were deduced from N-terminal conserved regions of Amb a IB (

FIG. 12

) and Amb a IC (FIG.


13


). The 3′ primers wre deduced from Amb a IA specific (UNC clone 1, designated Amb a IA,

FIG. 2

) and Amb a II specific (clone K6-5, partial 3′ sequence of

FIG. 15

) non-coding strand sequences at the 3′ end of the cDNA. A third 3′ primer used to PCR clone Amb a ID was derived from a conserved region of the C-terminal end of Amb a IA, B and C (amino-acids 395-398 corresponding to GAPC.stop). The oligonucleotide primers used to amplify and clone Amb a IA, Amb a ID and Amb a II cDNAs are listed below:




N-terminal primers used to produce full-length Amb a IA and Amb a II (amino acids 10-15)





















  ECORI




 L




 Y




 F




 T




 L




 A




(SEQ ID NO:41)













IG38














GGGAATTC




TTG




TAT




TTT




ACC




TTA




GC




(SEQ ID NO:42)






  5′











3′











N-terminal primer used to produce truncated Amb a IA and Amb a II (amino acids 70-75)





















  ECORI




 D




 C




 A




 Q




 G




 F




(SEQ ID NO:43)













IG33







GGGAATTC




GAC




TGT




GCC




CAA




GGT




TTT




G




(SEQ ID NO:44)











C-terminal primer used to produce full-length and truncated Amb a IA (12-29 nucleotides of the noncoding strand 3′ of the TAA stop codon; see FIG.


2


).

















   Pst I
















IG32














GGGCTGCAG




TCATTATAAGTGCTTAGT





(SEQ ID NO:






  5′






3′




45)











C-terminal primer used to produce full-length Amb a ID (corresponding to the C-terminal conserved GAPC encoding region). The primer is of the non-coding strand and includes the stop codon and an artificially introduced Pst I cloning site (see FIG.


15


).















Pst I














IG49














GGGCTGCAG TGC TTA GCA AGG TGC TCC




(SEQ ID






  5′





3′  NO:46)











C-terminal primer used to produce full-length and truncated Amb a II (44-76 nucleotides of the noncoding strand 3′ of the TAA stop codon; see FIG.


15


).

















Pst I
















AgK2














GGGCTGCAG CGT GTC CAA ATC TAA TCA AAT GAA CAC TTA TGG





(SEQ ID NO:47)






  5′





3′











First strand cDNA was synthesized form 1 μg RNA with the cDNA synthesis system plus kit (Amersham) using poly dT as primer. This single stranded cDNA was amplified using sets of primers (IG38 plus IG32; IG33 plus IG32; IG38 plus IG49; IG38 plus AgK2; IG33 plus AgK2) according to methods recommended in the GeneAmp kit (US Biochemicals, Cleveland, Ohio). The samples were amplified with a programmable thermal controller; the first five rounds of amplification consisted of denaturation at 94° C. for 30 sec., annealing of primers to the template at 45° C. for 1 min. 30 sec., and chain elongation at 70° C. for 4 min. The final 20 rounds of amplification consisted of denaturation as above, annealing at 55° C. for 1 min. 30 sec. and elongation as above. The PCR generated bands corresponding to the predicted size on an analytical gel and DNA sequencing confirmed that the cDNAs corresponded to full-length and truncated Amb a IA and Amb a Ii (

FIGS. 11 and 15

, respectively) and full-length Amb a ID (FIG.


14


).




The near full-length cDNAs presented in

FIGS. 11 through 15

, have their nucleotide sequences numbered such that the first nucleotide is designated number


1


. Although some cDNAs start at what is probably the N-terminal methionine (Amb a IB,

FIG. 12

; Amb a IC, FIG.


13


), some do not (Amb a IA,

FIG. 11

; Amb a ID, FIG


14


; Amb a II, FIG


15


). Consequently, since the cDNAs are of different lengths, their nucleotide numbers do not necessarily correspond from one sequence to another. The universal genetic code is used to deduce the amino acid sequences from the cDNA sequences and the complete amino acid sequence comparisons between the clones are presented in FI.


16


. In

FIG. 16

, the amino acids are numbered sequentially from the probably N-terminal methionine (designated number 1) of the Amb a IB sequence.




Example 4




T Cell Responses to Ragweed Proteins and Peptides




Peripheral blood mononuclear cells (PBMC) were purified from 60 ml of heparinized blood from ragweed-allergic patients. PBMC were subsequently treated as described below, although in individual cases, the length of time of cultivation with IL-2 and IL-4 and the specific ragweed proteins and peptides used for stimulation varied. As an example, ten ml of patient 222 PBMC at 10


6


/ml were cultured at 37° C. for 7 days in the presence of 20 micrograms aqueous ragweed pollen extract/ml RPMI-1640 supplemented with 5% pooled human AB serum. Viable cells were purified by Ficoll-Hypaque centrifugation and cultured for three weeks at 5 units recombinant human IL-2/ml and 5 units recombinant human IL-4/ml. The resting T cells were then restimulated (secondary) with 20 micrograms aqueous ragweed pollen extract/ml at a density of 2×10


5


cells/ml in the presence of X-irradiated (3500 RADS) autologous PBMC (5×10


5


/ml) for three days, purified by Ficoll-Hypaque centrifugation and grown in 5 units IL-2/ml and 5 units IL-4/ml for two weeks. For assay, 2×10


4


resting secondary T cells were restimulated (tertiary) in the presence of 5×10


4


X-irradiated (3500 RADS) autologous PBMC or 2×10


4


autologous Epstein-Barr virus-transformed B cells (20,000 RADS) with various concentrations of allergen or their fragments in a volume of 200 microliters in 96-well round bottom assay plates for 3 days. Each well then received 1 microCurie tritiated (methyl) thymidine for 16 hours. The counts incorporated were collected onto glass fiber filters and processed for liquid scintillation counting.

FIG. 21

shows the results of a representative assay, demonstrating the reactivity and specificity of the T cell culture to ragweed pollen proteins. Antigens used: IPC aqueous pollen extract (pollen), Hollister-Stier ragweed skin test extract (RWST), ALK cat epithelium skin test extract (CST), affinity 4B5/B7 antibody purified (dialyzed) Amb a I (B7), and chromatographically purified Amb a I and (Amb a I). Medium only control is shown as a line with no symbol. Alternatively, PBMC were sometimes carried only into a secondary assay (as outlined above for a tertiary assay) with 20 micrograms aqueous pollen extract 7 days, followed by culture in 5 units IL-2/ml and 5 units IL-4/ml for 2-3 weeks. One ragweed allergic patient's T cells in secondary assay responded to pollen extract, RWST, B7 or Amb a I, but did not respond to CST or medium only (FIG.


21


). Secondary and tertiary assays of a panel of ragweed allergic patients were performed using synthetic peptides derived from the sequences of various ragweed pollen proteins. The results of several experiments are shown in Table 5. Three peptides (RAE16.6, RAE45.15, RAE24.E) which are derived from the sequence of three different Amb a I cDNA's could not stimulate any of the patients' T cells. Another four peptides (RAE15.6, RAE3.D, RAE28.1, RAE26.15) which are also derived from the sequence of the same three cDNA's could stimulate 35 to 58% of the patients' T cells. One peptide (RAE4) which is derived from the cDNA of IPC Clone 5 could also stimulate 25% of the patients' T cells. These results are consistent with the above cDNA's encoding ragweed pollen proteins. They further demonstrate the opportunity offered by knowledge of the protein structures of the Amb I/II family/ies to identify peptidic fragments which stimulate a response in T cells from ragweed allergic patients and others which do not. By this method it is possible to identify novel therapeutic and diagnostics entities for use in the treatment and the diagnosis of ragweed allergy.












TABLE 5











Human Ragweed-Allergic T Cell Responses to Ragweed Peptides

















NO.








PEPTIDE


b






SEQUENCE




PATIENTS




NUMBER




POSITIVE






NAME




BASED ON




TESTED




POSITIVE




%


















RAE 16.6




Amb a IB




7




0




0






RAE 45.15




Amb a IC




2




0




0






RAE 24.E




Amb a IA




9




0




0






RAE 4




Clone #5




28




7




25






RAE 15.6




Amb a IB




20




7




35






RAE 3.D




Amb a IA




35




13




37






RAE 28.1




Amb a IA




33




17




52






RAE 26.15




Amb a IC




24




14




58













a


Responses were scored as positive when the T cell proliferative response of ragweed pollen-specific T cells was greater than 2-fold above the culture medium control.












b


Sequence of named peptide is as follows:










RAE 16.6 (SEQ ID NO:48) RTDKDLLENGAIC










RAE 45.15 (SEQ ID NO:49) LNQELVVNSDKTIDGRGVK










RAE 24.E (SEQ ID NO:50) ETRRSLKTSGAYNIIDGCWRGKAD










RAE 4 (SEQ ID NO:51) FFPILGGITEVKDNDNSVDFC










RAE 15.6 (SEQ ID NO:52) YTVTSDKDDDVANC










RAE 3.D (SEQ ID NO:53) GKADWAENRC










RAE 28.1 (SEQ ID NO:54) LENGAIFVASGVDPVLTPEQ










RAE 26.15 (SEQ ID NO:55) GFFQVVNNNYDRWGTYA













Example 5




Antibody Binding to Recombinant Affinity Purified Amb a I, and Pollen Extract Derived Amb a I and Amb a II




Affinity purified Amb a I was electrophoresed, Western transferred (Towbin et al.,


Proc. Natl. Acad. Sci. USA


, 76:4350 (1979)) and probed with a variety of antibodies, including IgE from an allergic patient (FIG.


17


). In pollen extract Amb a I is not only present as an intact 38-KD species, but also characterized by its component 26KD alpha chains and 12-KD beta chains which are formed by enzymatic cleavage. The intact 38-KD species and the alpha chain are clearly detected using rabbit anti-Amb a I, polyclonal affinity purified anti-RAE 16 and monoclonal anti-Amb a I JBIE3-4 (FIG.


17


); RAE 16 peptide has the sequence RTDKDLLENGAIC derived from amino-acids 343-353 of Amb a IB, FIG.


16


). Affinity purified Amb a I (partial sequence presented in Table 1) as well as chromatographically purified Amb a II (partial sequence presented in Table 2) are bound on Western blots by these antibodies as well as by patient IgE (FIG.


17


). The goat anti-Amb a I polyclonal antibody also binds multiple Amb a I and Amb a II species on a two dimensional Western blot of pollen extract (FIG.


18


). The Western blot was performed as outlined below.




Isoelectric focusing was done on a Hoeffer gel apparatus with 15 μg of crude soluble pollen protein. The gel consisted of 7.5% acrylamide with 3.5% Pharmalytes pH 4.5-5.3 (Pharmacia) and 3.5% Ampholines pH 3.5-10 (LKB), run at 13W for 3.5 hours until a constant voltage was reached. The gel section was placed on a slab of 10% acrylamide SDS-PAGE and electrophoresed for 3.5 hours at 40 mA according to the protocol cited. The proteins were transferred overnight in phosphate buffer to 0.1 micron nitrocellulose (Schleicher and Schuell) at 0.2A. The blot was rinsed in blot solution (25 mM Tris-CHl pH 7.5, 0.171 M NaCl, 0.05% Tween-20; Sigma). The first antibody incubation was overnight at room temperature with a 1:000 dilution of goat anti-Amb a I IgC (obtained from Dr. David Marsh) in blot solution. The excess first antibody was removed with three 15 minute rinses with blot solution. The second antibody was a 1:2,500 dilution of biotinylated swine anti-goat IgG (Boehringer-Manneheim) in blot solution for two hours. The blot was then rinsed with blot solution three times for 15 minutes and incubated for 1 hr in blot solution with 2 μCi I


125


streptavidin (Amersham). The blots were rinsed with blot solution until the waste wash was down to background. The blot was then exposed to film at —80° C. overnight. In the case of one-dimensional SDS-PAGE Western blots (

FIGS. 17

,


19


and


20


) the isoelectric focusing step was omitted. When human sera was used to probe the Western blots (FIGS.


17


and


20


), 10% human plasma in 1% milk in blot solution was incubated overnight with the blot prior to using as second antibody biotinylated goat anti-human IgE.




Equivalents




Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described specifically herein. Such equivalents are intended to be encompassed in the scope of the following claims.




Equivalents




Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described specifically herein. Such equivalents are intended to be encompassed in the scope of the following claims.







93





5 amino acids


amino acid


linear




peptide



internal


1
Trp Glu Asn Phe Lys
1 5






10 amino acids


amino acid


linear




peptide



internal


2
Val Trp Val Lys Pro Trp Glu Asn Phe Lys
1 5 10






5 amino acids


amino acid


linear




peptide



internal


3
Val Trp Val Lys Pro
1 5






48 amino acids


amino acid


linear




peptide



internal


4
Thr Ser Gly Ala Tyr Asn Ile Ile Asp Gly Cys Trp Arg Gly Lys Ala
1 5 10 15
Asp Trp Ala Glu Asn Arg Lys Ala Leu Ala Asp Cys Ala Gln Gly Phe
20 25 30
Gly Lys Gly Thr Val Gly Gly Lys Asp Gly Asp Ile Tyr Thr Val Thr
35 40 45






33 amino acids


amino acid


linear




peptide



internal


5
Thr Ser Gly Ala Tyr Asn Ile Ile Asp Gly Cys Trp Arg Gly Lys Ala
1 5 10 15
Asp Trp Ala Glu Asn Arg Lys Ala Leu Ala Asp Cys Ala Gln Gly Phe
20 25 30
Gly






33 amino acids


amino acid


linear




peptide



internal


6
Asp Ser Gly Ala Tyr Asn Ile Ile Asp Gly Cys Trp Arg Gly Lys Ala
1 5 10 15
Asp Trp Ala Glu Asn Ser Arg Ala Leu Ala Asp Cys Ala Gln Gly Phe
20 25 30
Gly






48 amino acids


amino acid


linear




peptide



internal


7
Thr Ser Gly Ala Tyr Asn Ile Ile Asp Gly Cys Trp Arg Gly Lys Ala
1 5 10 15
Asp Trp Ala Glu Asn Arg Lys Ala Leu Ala Asp Cys Ala Gln Gly Phe
20 25 30
Gly Lys Gly Thr Val Gly Gly Lys Asp Gly Asp Ile Tyr Thr Val Thr
35 40 45






16 amino acids


amino acid


linear




peptide



internal


8
His Asp Val Lys Val Asn Pro Gly Gly Leu Ile Lys Ser Asn Asp Gly
1 5 10 15






14 amino acids


amino acid


linear




peptide



internal


9
Phe Asp Leu Lys Val Asn Ile Gly Gln Leu Ile Lys Ser Asn
1 5 10






14 amino acids


amino acid


linear




peptide



internal


10
Phe Ala Phe Lys Asn Tyr Ile Pro Leu Leu Ile Asn Ser Asn
1 5 10






13 amino acids


amino acid


linear




peptide



internal


11
His Asp Val Lys Val Pro Gly Gly Leu Ile Lys Asn Gly
1 5 10






45 amino acids


amino acid


linear




peptide



internal


12
Pro Arg Cys Arg His Gly Phe Phe Gln Val Val Asn Asn Asn Tyr Asp
1 5 10 15
Lys Trp Gly Ser Tyr Ala Ile Gly Gly Ser Ala Ser Pro Thr Ile Leu
20 25 30
Ser Gln Gly Met Arg Phe Cys Ala Pro Asp Glu Arg Ser
35 40 45






42 amino acids


amino acid


linear




peptide



internal


13
Pro Arg Cys Arg His Gly Phe Phe Gln Val Val Asn Asn Asn Tyr Asp
1 5 10 15
Arg Trp Gly Ser Tyr Ala Ile Gly Gly Ser Ala Pro Thr Ile Leu Ser
20 25 30
Gln Gly Asn Phe Cys Ala Pro Asp Gly Tyr
35 40






43 amino acids


amino acid


linear




peptide



internal


14
Pro Arg Cys Arg Phe Gly Phe Phe Gln Ile Val Asn Asn Phe Tyr Asp
1 5 10 15
Arg Trp Asp His Tyr Ala Ile Gly Gly Ser Ala Asn Pro Thr Ile Leu
20 25 30
Ser Gln Gly Asn Phe Val Ala Pro Asp Gly Tyr
35 40






11 amino acids


amino acid


linear




peptide



internal


15
Pro Val Leu Thr Pro Glu Gln Ser Ala Gly Met
1 5 10






19 amino acids


amino acid


linear




peptide



internal


16
Thr Ser Gly Ala Tyr Asn Ile Ile Asp Gly Cys Trp Arg Gly Lys Ala
1 5 10 15
Asp Trp Ala






30 amino acids


amino acid


linear




peptide



internal


17
Pro Arg Arg His Gly Phe Phe Gln Val Val Asn Asn Asn Tyr Asp Glu
1 5 10 15
Trp Gly Ser Tyr Ala Ile Gly Gly Ser Ala Ser Pro Thr Ile
20 25 30






10 amino acids


amino acid


linear




peptide



internal


18
Ala Trp Asn Trp Arg Thr Glu Lys Asp Leu
1 5 10






10 amino acids


amino acid


linear




peptide



internal


19
Val Ile Asn Leu Asp Gln Glu Ile Phe Val
1 5 10






37 amino acids


amino acid


linear




peptide



internal


20
Pro Val Leu Thr Pro Glu Gln Ser Ala Gly Met Ile Pro Ala Glu Pro
1 5 10 15
Gly Glu Ser Ala Leu Ser Leu Thr Ser Ser Ala Gly Val Leu Ser Cys
20 25 30
Gln Pro Gly Ala Pro
35






18 amino acids


amino acid


linear




peptide



internal


21
Pro Val Leu Asn Pro Glu Asn Ala Gly Met Ile Gln Ala Glu Pro Gly
1 5 10 15
Glu Ala






18 amino acids


amino acid


linear




peptide



internal


22
Pro Val Ile Asn Pro Glu Asn Ala Gly Met Ile Gln Ala Glu Pro Gly
1 5 10 15
Glu Ala






36 amino acids


amino acid


linear




peptide



internal


23
Pro Val Leu Thr Pro Glu Gln Ser Ala Gly Met Ile Pro Ala Glu Pro
1 5 10 15
Gly Glu Ser Ala Leu Ser Leu Thr Ser Ser Ala Gly Val Leu Cys Gln
20 25 30
Pro Gly Ala Pro
35






26 amino acids


amino acid


linear




peptide



internal


24
Pro Val Leu Thr Pro Val Gln Ser Ala Gly Met Ile Pro Ala Glu Pro
1 5 10 15
Gly Glu Ala Ala Ile Lys Leu Thr Ser Ser
20 25






45 amino acids


amino acid


linear




peptide



internal


25
Pro Arg Cys Arg Phe Gly Phe Phe Gln Ile Val Asn Asn Phe Tyr Asp
1 5 10 15
Arg Trp Asp Lys Tyr Ala Ile Gly Gly Ser Ser Asn Pro Thr Ile Leu
20 25 30
Ser Gln Gly Asn Lys Phe Val Ala Pro Asp Phe Ile Tyr
35 40 45






43 amino acids


amino acid


linear




peptide



internal


26
Pro Arg Arg Phe Gly Phe Phe Gln Ile Val Asn Asn Phe Tyr Asp Arg
1 5 10 15
Trp Asp His Tyr Ala Ile Gly Gly Ser Ser Asn Pro Thr Ile Leu Ser
20 25 30
Gln Gly Asn Arg Phe Val Ala Pro Asp Ile Tyr
35 40






11 amino acids


amino acid


linear




peptide



internal


27
Pro Val Leu Thr Pro Glu Gln Asn Ala Gly Met
1 5 10






15 amino acids


amino acid


linear




peptide



internal


28
Pro Arg Arg Phe Gly Phe Phe Gln Ile Val Asn Asn Phe Tyr Asp
1 5 10 15






37 amino acids


amino acid


linear




peptide



internal


29
Pro Val Leu Thr Ala Glu Gln Asn Ala Gly Met Met Gln Ala Glu Pro
1 5 10 15
Gly Asp Met Val Pro Gln Leu Thr Met Asn Ala Gly Val Leu Thr Cys
20 25 30
Ser Pro Gly Ala Pro
35






35 amino acids


amino acid


linear




peptide



internal


30
Pro Val Leu Thr Ala Glu Gln Asn Ala Gly Met Met Gln Ala Glu Pro
1 5 10 15
Gly Asp Met Val Pro Gln Leu Thr Met Asn Ala Gly Val Leu Ser Pro
20 25 30
Gly Ala Pro
35






36 amino acids


amino acid


linear




peptide



internal


31
Pro Val Leu Thr Ala Glu Gln Asn Ala Gly Met Met Gln Ala Glu Pro
1 5 10 15
Gly Asp Met Val Pro Gln Leu Thr Met Asn Ala Gly Val Leu Thr Ser
20 25 30
Pro Gly Ala Pro
35






36 amino acids


amino acid


linear




peptide



internal


32
Pro Val Leu Thr Pro Glu Gln Ser Ala Gly Met Ile Pro Ala Glu Pro
1 5 10 15
Gly Glu Ser Ala Leu Ser Leu Thr Ser Asn Ala Gly Val Leu Ser Ser
20 25 30
Pro Gly Ala Pro
35






13 amino acids


amino acid


linear




peptide



internal


33
Cys Leu Arg Thr Gly Ala Gln Glu Pro Glu Trp Met Thr
1 5 10






11 amino acids


amino acid


linear




peptide



internal


34
Val Trp Val Lys Pro Trp Glu Asn Phe Lys Lys
1 5 10






6 amino acids


amino acid


linear




peptide



internal


35
Trp Glu Asn Phe Lys Lys
1 5






18 base pairs


nucleic acid


single


linear




cDNA



36
TGGGAAAATT TCAAAAAA 18






18 base pairs


nucleic acid


single


linear




cDNA



37
TGGGAGAACT TTAAGAAG 18






6 amino acids


amino acid


linear




peptide



internal


38
Trp Glu Asn Phe Lys Glu
1 5






5 amino acids


amino acid


linear




peptide



internal


39
Val Trp Val Lys Pro
1 5






21 amino acids


amino acid


linear




peptide



internal


40
Glu Phe Pro Ile Leu Gly Gly Ile Thr Glu Val Lys Asp Asn Asp Asn
1 5 10 15
Ser Val Asp Phe Cys
20






6 amino acids


amino acid


linear




peptide



internal


41
Leu Tyr Phe Thr Leu Ala
1 5






25 base pairs


nucleic acid


single


linear




cDNA



42
GGGAATTCTT GTATTTTACC TTAGC 25






6 amino acids


amino acid


linear




peptide



internal


43
Asp Cys Ala Gln Gly Phe
1 5






27 base pairs


nucleic acid


single


linear




cDNA



44
GGGAATTCGA CTGTGCCCAA GGTTTTG 27






27 base pairs


nucleic acid


single


linear




cDNA



45
GGGCTGCAGT CATTATAAGT GCTTAGT 27






27 base pairs


nucleic acid


single


linear




cDNA



46
GGGCTGCAGT GCTTAGCAAG GTGCTCC 27






42 base pairs


nucleic acid


single


linear




cDNA



47
GGGCTGCAGC GTGTCCAAAT CTAATCAAAT GAACACTTAT GG 42






13 amino acids


amino acid


linear




peptide



internal


48
Arg Thr Asp Lys Asp Leu Leu Glu Asn Gly Ala Ile Cys
1 5 10






19 amino acids


amino acid


linear




peptide



internal


49
Leu Asn Gln Glu Leu Val Val Asn Ser Asp Lys Thr Ile Asp Gly Arg
1 5 10 15
Gly Val Lys






24 amino acids


amino acid


linear




peptide



internal


50
Glu Thr Arg Arg Ser Leu Lys Thr Ser Gly Ala Tyr Asn Ile Ile Asp
1 5 10 15
Gly Cys Trp Arg Gly Lys Ala Asp
20






21 amino acids


amino acid


linear




peptide



internal


51
Glu Phe Pro Ile Leu Gly Gly Ile Thr Glu Val Lys Asp Asn Asp Asn
1 5 10 15
Ser Val Asp Phe Cys
20






14 amino acids


amino acid


linear




peptide



internal


52
Tyr Thr Val Thr Ser Asp Lys Asp Asp Asp Val Ala Asn Cys
1 5 10






10 amino acids


amino acid


linear




peptide



internal


53
Gly Lys Ala Asp Trp Ala Glu Asn Arg Cys
1 5 10






20 amino acids


amino acid


linear




peptide



internal


54
Leu Glu Asn Gly Ala Ile Phe Val Ala Ser Gly Val Asp Pro Val Leu
1 5 10 15
Thr Pro Glu Gln
20






17 amino acids


amino acid


linear




peptide



internal


55
Gly Phe Phe Gln Val Val Asn Asn Asn Tyr Asp Arg Trp Gly Thr Tyr
1 5 10 15
Ala






323 base pairs


nucleic acid


single


linear




cDNA




CDS


1..321




56
GAA TTC GGC TGG AGA ACG AAT AAA GAC GTG CTT GAA AAT GGT GCT ATT 48
Glu Phe Gly Trp Arg Thr Asn Lys Asp Val Leu Glu Asn Gly Ala Ile
1 5 10 15
TTT GTT GCA TCC GGG GTC GAT CCA GTG CTA ACC CCT GAG CAA AGC GCA 96
Phe Val Ala Ser Gly Val Asp Pro Val Leu Thr Pro Glu Gln Ser Ala
20 25 30
GGG ATG ATT CCA GCC GAA CCA GGA GAG TCC GCT CTA AGC CTC ACT AGT 144
Gly Met Ile Pro Ala Glu Pro Gly Glu Ser Ala Leu Ser Leu Thr Ser
35 40 45
AGT GCT GGT GTA CTC TCA TGC CAA CCC GGA GCA CCT TGC TAA GCA CCC 192
Ser Ala Gly Val Leu Ser Cys Gln Pro Gly Ala Pro Cys * Ala Pro
50 55 60
GAC CAA TTA CTA AGC ACT TAT AAT GAT CAT TAA TAC TTT TTT TTA TTT 240
Asp Gln Leu Leu Ser Thr Tyr Asn Asp His * Tyr Phe Phe Leu Phe
65 70 75 80
TAT TTT TGA TAT TTT ATA TGT ACT AAG GTA ATG GAA ATG AAC CTT TAC 288
Tyr Phe * Tyr Phe Ile Cys Thr Lys Val Met Glu Met Asn Leu Tyr
85 90 95
CTT CTA GTA CTC TAA AAA AAA AAA AAA CCG AAT TC 323
Leu Leu Val Leu * Lys Lys Lys Lys Pro Asn
100 105






61 amino acids


amino acid


linear




protein



57
Glu Phe Gly Trp Arg Thr Asn Lys Asp Val Leu Glu Asn Gly Ala Ile
1 5 10 15
Phe Val Ala Ser Gly Val Asp Pro Val Leu Thr Pro Glu Gln Ser Ala
20 25 30
Gly Met Ile Pro Ala Glu Pro Gly Glu Ser Ala Leu Ser Leu Thr Ser
35 40 45
Ser Ala Gly Val Leu Ser Cys Gln Pro Gly Ala Pro Cys
50 55 60






1328 base pairs


nucleic acid


single


linear




cDNA




CDS


1..1328




58
TAC ATC TTG TAT TTT ACC TTA GCC CTT GTC ACT TTG CTG CAA CCT GTT 48
Tyr Ile Leu Tyr Phe Thr Leu Ala Leu Val Thr Leu Leu Gln Pro Val
1 5 10 15
CGT TCT GCA GAA GAT GTT GAA GAA TTC TTA CCT TCA GCT AAC GAA ACA 96
Arg Ser Ala Glu Asp Val Glu Glu Phe Leu Pro Ser Ala Asn Glu Thr
20 25 30
AGG AGG AGC CTG AAA GCA TGT GAA GCA CAC AAC ATT ATA GAC AAG TGC 144
Arg Arg Ser Leu Lys Ala Cys Glu Ala His Asn Ile Ile Asp Lys Cys
35 40 45
TGG AGG TGC AAA GCC GAT TGG GCG AAT AAC CGA CAA GCG TTA GCC GAT 192
Trp Arg Cys Lys Ala Asp Trp Ala Asn Asn Arg Gln Ala Leu Ala Asp
50 55 60
TGT GCC CAA GGT TTT GCA AAG GGA ACC TAC GGT GGA AAA CAT GGT GAT 240
Cys Ala Gln Gly Phe Ala Lys Gly Thr Tyr Gly Gly Lys His Gly Asp
65 70 75 80
GTC TAC ACG GTC ACC AGT GAT AAA GAT GAT GAT GTT GCA AAT CCA AAA 288
Val Tyr Thr Val Thr Ser Asp Lys Asp Asp Asp Val Ala Asn Pro Lys
85 90 95
GAA GGC ACA CTC CGG TTT GCT GCT GCC CAA AAC AGG CCC TTG TGG ATC 336
Glu Gly Thr Leu Arg Phe Ala Ala Ala Gln Asn Arg Pro Leu Trp Ile
100 105 110
ATT TTT AAA AGA AAT ATG GTG ATT CAT TTG AAT CAA GAG CTT GTC GTA 384
Ile Phe Lys Arg Asn Met Val Ile His Leu Asn Gln Glu Leu Val Val
115 120 125
AAC AGC GAC AAG ACC ATC GAT GGC CGA GGG GTG AAA GTT AAC ATC GTT 432
Asn Ser Asp Lys Thr Ile Asp Gly Arg Gly Val Lys Val Asn Ile Val
130 135 140
AAC GCC GGT CTC ACC CTC ATG AAT GTC AAG AAT ATA ATC ATT CAT AAC 480
Asn Ala Gly Leu Thr Leu Met Asn Val Lys Asn Ile Ile Ile His Asn
145 150 155 160
ATA AAT ATC CAT GAT ATT AAA GTT TGT CCA GGA GGC ATG ATT AAG TCC 528
Ile Asn Ile His Asp Ile Lys Val Cys Pro Gly Gly Met Ile Lys Ser
165 170 175
AAC GAT GGT CCA CCA ATT TTA AGA CAA CAA AGT GAT GGT GAT GCT ATA 576
Asn Asp Gly Pro Pro Ile Leu Arg Gln Gln Ser Asp Gly Asp Ala Ile
180 185 190
AAT GTT GCT GGT AGT TCA CAA ATA TGG ATC GAC CAT TGC TCG CTC AGT 624
Asn Val Ala Gly Ser Ser Gln Ile Trp Ile Asp His Cys Ser Leu Ser
195 200 205
AAG GCT TCC GAT GGG CTG CTC GAT ATC ACC CTC GGC AGC TCA CAC GTG 672
Lys Ala Ser Asp Gly Leu Leu Asp Ile Thr Leu Gly Ser Ser His Val
210 215 220
ACC GTT TCC AAC TGC AAA TTC ACC CAA CAC CAA TTT GTA TTA TTG CTC 720
Thr Val Ser Asn Cys Lys Phe Thr Gln His Gln Phe Val Leu Leu Leu
225 230 235 240
GGG GCT GAT GAC ACC CAT TAT CAA GAT AAA GGC ATG CTA GCA ACG GTA 768
Gly Ala Asp Asp Thr His Tyr Gln Asp Lys Gly Met Leu Ala Thr Val
245 250 255
GCA TTC AAC ATG TTC ACC GAT CAC GTT GAC CAA AGA ATG CCT AGA TGT 816
Ala Phe Asn Met Phe Thr Asp His Val Asp Gln Arg Met Pro Arg Cys
260 265 270
AGA TTT GGG TTT TTC CAA GTC GTT AAC AAC AAC TAC GAC AGA TGG GGA 864
Arg Phe Gly Phe Phe Gln Val Val Asn Asn Asn Tyr Asp Arg Trp Gly
275 280 285
ACG TAC GCC ATC GGT GGT AGC TCG GCC CCA ACT ATA CTC AGC CAA GGG 912
Thr Tyr Ala Ile Gly Gly Ser Ser Ala Pro Thr Ile Leu Ser Gln Gly
290 295 300
AAC AGA TTC TTC GCC CCC GAT GAT ATC ATC AAG GAA AAT GTC TTA GCG 960
Asn Arg Phe Phe Ala Pro Asp Asp Ile Ile Lys Glu Asn Val Leu Ala
305 310 315 320
AGG ACT GGT ACT GGC AAC GCA GAG TCG ATG TCG TGG AAC TGG AGA ACA 1008
Arg Thr Gly Thr Gly Asn Ala Glu Ser Met Ser Trp Asn Trp Arg Thr
325 330 335
GAT AAA GAC TTG CTT GAA AAT GGT GCT ATT TTT CTC CCA TCC GGG TCT 1056
Asp Lys Asp Leu Leu Glu Asn Gly Ala Ile Phe Leu Pro Ser Gly Ser
340 345 350
GAT CCA GTG CTA ACC CCT GAG CAA AAA GCA GGG ATG ATT CCA GCT GAA 1104
Asp Pro Val Leu Thr Pro Glu Gln Lys Ala Gly Met Ile Pro Ala Glu
355 360 365
CCA GGA GAA GCC GTT CTA AGA CTC ACT AGT AGT GCT GGT GTA CTC TCA 1152
Pro Gly Glu Ala Val Leu Arg Leu Thr Ser Ser Ala Gly Val Leu Ser
370 375 380
TGC CAT CAA GGA GCA CCT TGC TAA GCA CCT GGC CAA TTC CTA AGC TTT 1200
Cys His Gln Gly Ala Pro Cys * Ala Pro Gly Gln Phe Leu Ser Phe
385 390 395 400
TAT AAT AAT CAT AAA TAC TTA TTT TAT TTT ATT TTT GAT ATT TTA TAT 1248
Tyr Asn Asn His Lys Tyr Leu Phe Tyr Phe Ile Phe Asp Ile Leu Tyr
405 410 415
GAA CCA TTA CGT TCA AGT ACT CTA TTA ACA TGT TTT AAA TTC ATA AGA 1296
Glu Pro Leu Arg Ser Ser Thr Leu Leu Thr Cys Phe Lys Phe Ile Arg
420 425 430
GTT TAT TGA TAA AAA AAA AAA AAA CCG AAT TC 1328
Val Tyr * * Lys Lys Lys Lys Pro Asn
435 440






391 amino acids


amino acid


linear




protein



59
Tyr Ile Leu Tyr Phe Thr Leu Ala Leu Val Thr Leu Leu Gln Pro Val
1 5 10 15
Arg Ser Ala Glu Asp Val Glu Glu Phe Leu Pro Ser Ala Asn Glu Thr
20 25 30
Arg Arg Ser Leu Lys Ala Cys Glu Ala His Asn Ile Ile Asp Lys Cys
35 40 45
Trp Arg Cys Lys Ala Asp Trp Ala Asn Asn Arg Gln Ala Leu Ala Asp
50 55 60
Cys Ala Gln Gly Phe Ala Lys Gly Thr Tyr Gly Gly Lys His Gly Asp
65 70 75 80
Val Tyr Thr Val Thr Ser Asp Lys Asp Asp Asp Val Ala Asn Pro Lys
85 90 95
Glu Gly Thr Leu Arg Phe Ala Ala Ala Gln Asn Arg Pro Leu Trp Ile
100 105 110
Ile Phe Lys Arg Asn Met Val Ile His Leu Asn Gln Glu Leu Val Val
115 120 125
Asn Ser Asp Lys Thr Ile Asp Gly Arg Gly Val Lys Val Asn Ile Val
130 135 140
Asn Ala Gly Leu Thr Leu Met Asn Val Lys Asn Ile Ile Ile His Asn
145 150 155 160
Ile Asn Ile His Asp Ile Lys Val Cys Pro Gly Gly Met Ile Lys Ser
165 170 175
Asn Asp Gly Pro Pro Ile Leu Arg Gln Gln Ser Asp Gly Asp Ala Ile
180 185 190
Asn Val Ala Gly Ser Ser Gln Ile Trp Ile Asp His Cys Ser Leu Ser
195 200 205
Lys Ala Ser Asp Gly Leu Leu Asp Ile Thr Leu Gly Ser Ser His Val
210 215 220
Thr Val Ser Asn Cys Lys Phe Thr Gln His Gln Phe Val Leu Leu Leu
225 230 235 240
Gly Ala Asp Asp Thr His Tyr Gln Asp Lys Gly Met Leu Ala Thr Val
245 250 255
Ala Phe Asn Met Phe Thr Asp His Val Asp Gln Arg Met Pro Arg Cys
260 265 270
Arg Phe Gly Phe Phe Gln Val Val Asn Asn Asn Tyr Asp Arg Trp Gly
275 280 285
Thr Tyr Ala Ile Gly Gly Ser Ser Ala Pro Thr Ile Leu Ser Gln Gly
290 295 300
Asn Arg Phe Phe Ala Pro Asp Asp Ile Ile Lys Glu Asn Val Leu Ala
305 310 315 320
Arg Thr Gly Thr Gly Asn Ala Glu Ser Met Ser Trp Asn Trp Arg Thr
325 330 335
Asp Lys Asp Leu Leu Glu Asn Gly Ala Ile Phe Leu Pro Ser Gly Ser
340 345 350
Asp Pro Val Leu Thr Pro Glu Gln Lys Ala Gly Met Ile Pro Ala Glu
355 360 365
Pro Gly Glu Ala Val Leu Arg Leu Thr Ser Ser Ala Gly Val Leu Ser
370 375 380
Cys His Gln Gly Ala Pro Cys
385 390






324 base pairs


nucleic acid


single


linear




cDNA




CDS


1..300




60
GAA TTC GGC TGG AGA ACG AAT AAA GAC GTG CTT GAA AAT GGT GCT ATT 48
Glu Phe Gly Trp Arg Thr Asn Lys Asp Val Leu Glu Asn Gly Ala Ile
1 5 10 15
TTT GTT GCA TCC GGG GTC GAT CCA GTG CTA ACC CCT GAG CAA AGC GCA 96
Phe Val Ala Ser Gly Val Asp Pro Val Leu Thr Pro Glu Gln Ser Ala
20 25 30
GGG ATG ATT CCA GCC GAA CCA GGA GAG TCC GCT CTA AGC CTC ACT AGT 144
Gly Met Ile Pro Ala Glu Pro Gly Glu Ser Ala Leu Ser Leu Thr Ser
35 40 45
AGT GCT GGT GTA CTC TCA TGC CAA CCC GGA GCA CCT TGC TAA GCA CCC 192
Ser Ala Gly Val Leu Ser Cys Gln Pro Gly Ala Pro Cys * Ala Pro
50 55 60
GAC CAA TTA CTA AGC ACT TAT AAT GAT CAT TAA TAC TTT TTT TTA TTT 240
Asp Gln Leu Leu Ser Thr Tyr Asn Asp His * Tyr Phe Phe Leu Phe
65 70 75 80
TAT TTT TGA TAT TTT ATA TGT ACT AAG GTA ATG GAA ATG AAC CTT TAC 288
Tyr Phe * Tyr Phe Ile Cys Thr Lys Val Met Glu Met Asn Leu Tyr
85 90 95
CTT CTT AGT ACT CTAAAAAAAA AAAAAACCGA ATTC 324
Leu Leu Ser Thr
100






61 amino acids


amino acid


linear




protein



61
Glu Phe Gly Trp Arg Thr Asn Lys Asp Val Leu Glu Asn Gly Ala Ile
1 5 10 15
Phe Val Ala Ser Gly Val Asp Pro Val Leu Thr Pro Glu Gln Ser Ala
20 25 30
Gly Met Ile Pro Ala Glu Pro Gly Glu Ser Ala Leu Ser Leu Thr Ser
35 40 45
Ser Ala Gly Val Leu Ser Cys Gln Pro Gly Ala Pro Cys
50 55 60






452 base pairs


nucleic acid


single


linear




cDNA



62
GAATTCCGAT TCTTGGAGGA ATTACCGAAG TTAAAGACAA TGATAACAGC GTCGATTTCG 60
ACGAGCTTGC TAAATTCGCC ATCGCTGAAC ACAACAAGAA GGAGAATGCT GCTCTGGAGT 120
TTGGAAAAGT AATAGAAAAA AAGCAGCAGG CGGTACAGGG CACCATGTAT TATATAAAAG 180
TGGAAGCAAA TGATGGTGGT GAGAAGAAAA CTTATGAAGC CAAGGTGTGG GTTAAGCTAT 240
GGGAAAATTT CAAGGAATTG CAGGAACTCA AACTTGTTTG ATGGACGGGT GTGTGCTATG 300
ACAAAATAGC TCGAGCAGGT GAAGCATGAA TGTATAAATA TTCTTTTTAA GTTTAATAAT 360
AAACATTTCT TGTAATATGG TACAGGTTTA TGTACTTTGG TATGTATAAC AGAAAACATA 420
TCATAAATTC AAACTTAGAA TTTTGGGAAT TC 452






452 base pairs


nucleic acid


single


linear




cDNA



63
GAATTCCCAA AATTCTAAGT TTGAATTTAT GATATGTTTT CTGTTATACA TACCAAAGTA 60
CATAAACCTG TACCATATTA CAAGAAATGT TTATTATTAA ACTTAAAAAG AATATTTATA 120
CATTCATGCT TCACCTGCTC GAGCTATTTT GTCATAGCAC ACACCCGTCC ATCAAACAAG 180
TTTGAGTTCC TGCAATTCCT TGAAATTTTC CCATAGCTTA ACCCACACCT TGGCTTCATA 240
AGTTTTCTTC TCACCACCAT CATTTGCTTC CACTTTTATA TAATACATGG TGCCCTGTAC 300
CGCCTGCTGC TTTTTTTCTA TTACTTTTCC AAACTCCAGA GCAGCATTCT CCTTCTTGTT 360
GTGTTCAGCG ATGGCGAATT TAGCAAGCTC GTCGAAATCG ACGCTGTTAT CATTGTCTTT 420
AACTTCGGTA ATTCCTCCAA GAATCGGAAT TC 452






488 base pairs


nucleic acid


single


linear




cDNA



64
GAATTCCCGA TTCTTGGAGG AATTACCGAA GTTAAAGACA ATGATAACAG CGTCGATTTC 60
GACGAGCTTG CTAAATTCGC CATCACTGAA CACAACAAGA AGGAGAATGC TGCTCTGGAG 120
TTTGGAAAAG TAATAGAAAA AAAGCAGCAG GCGGTACAGG GCACCATGTA TTATATAAAA 180
GCGGAAGCAA ATGATGGTGG TGAGAAGAAA ACTTATGAAG CCAAGGTGTG GGTTAAGCTA 240
TGGGAAAATT TCAAGGAATT TGCAAGGAAC TCAAACCTTG TTTGATGATG CCACCTCACC 300
TTAACTCCAT ATGGACGGTG TGCTATGACA AAATAGCTCA AGGAGGTGAA GCATAAATGT 360
ATAAATATTC TTTTTAAGTT TAATAATAAA CATTTCTTGT AATATAGTAC AAGTTTATGT 420
ACTTTGGTAT GTATAACAGA AAACATATCA TAAATTCAAA CTTAATGTTT TTTTTTCTCG 480
CGGAATTC 488






488 base pairs


nucleic acid


single


linear




cDNA



65
GAATTCCGCG AGAAAAAAAA ACATTAAGTT TGAATTTATG ATATGTTTTC TGTTATACAT 60
ACCAAAGTAC ATAAACTTGT ACTATATTAC AAGAAATGTT TATTATTAAA CTTAAAAAGA 120
ATATTTATAC ATTTATGCTT CACCTCCTTG AGCTATTTTG TCATAGCACA CCGTCCATAT 180
GGAGTTAAGG TGAGGTGGCA TCATCAAACA AGGTTTGAGT TCCTTGCAAA TTCCTTGAAA 240
TTTTCCCATA GCTTAACCCA CACCTTGGCT TCATAAGTTT TCTTCTCACC ACCATCATTT 300
GCTTCCGCTT TTATATAATA CATGGTGCCC TGTACCGCCT GCTGCTTTTT TTCTATTACT 360
TTTCCAAACT CCAGAGCAGC ATTCTCCTTC TTGTTGTGTT CAGTGATGGC GAATTTAGCA 420
AGCTCGTCGA AATCGACGCT GTTATCATTG TCTTTAACTT CGGTAATTCC TCCAAGAATC 480
GGGAATTC 488






190 base pairs


nucleic acid


single


linear




cDNA



66
TCGATTCGCT GTCGATGAAC ACAACAAGAA GCAGAATACC CTGCTGGAAT TTAAGAAGGT 60
ACTGAATACA AAGGAGCAGG TAGTAGCTGG TATAATGTAT TATATCACAC TTGAAGCAAC 120
TGATGGTGGT GAGAAAAAGA CTTATGAAGC CAAGGTTTGG GTTAAGCCAT GGGAAAACTT 180
CAAAGAATTC 190






190 base pairs


nucleic acid


single


linear




cDNA



67
GAATTCTTTG AAGTTTTCCC ATGGCTTAAC CCAAACCTTG GCTTCATAAG TCTTTTTCTC 60
ACCACCATCA GTTGCTTCAA GTGTGATATA ATACATTATA CCAGCTACTA CCTGCTCCTT 120
TGTATTCAGT ACCTTCTTAA ATTCCAGCAG GGTATTCTGC TTCTTGTTGT GTTCATCGAC 180
AGCGAATCGA 190






92 amino acids


amino acid


linear




peptide



internal


68
Ile Pro Ile Leu Gly Gly Ile Thr Glu Val Lys Asp Asn Asp Asn Ser
1 5 10 15
Val Asp Phe Asp Glu Leu Ala Lys Phe Ala Ile Ala Glu His Asn Lys
20 25 30
Lys Glu Asn Ala Ala Leu Glu Phe Gly Lys Val Ile Glu Lys Lys Gln
35 40 45
Gln Ala Val Gln Gly Thr Met Tyr Tyr Ile Lys Val Glu Ala Asn Asp
50 55 60
Gly Gly Glu Lys Lys Thr Tyr Glu Ala Lys Val Trp Val Lys Leu Trp
65 70 75 80
Glu Asn Phe Lys Glu Leu Gln Glu Leu Lys Leu Val
85 90






94 amino acids


amino acid


linear




peptide



internal


69
Glu Phe Pro Ile Leu Gly Gly Ile Thr Glu Val Lys Asp Asn Asp Asn
1 5 10 15
Ser Val Asp Phe Asp Glu Leu Ala Lys Phe Ala Ile Thr Glu His Asn
20 25 30
Lys Lys Glu Asn Ala Ala Leu Glu Phe Gly Lys Val Ile Glu Lys Lys
35 40 45
Gln Gln Ala Val Gln Gly Thr Met Tyr Tyr Ile Lys Ala Glu Ala Asn
50 55 60
Asp Gly Gly Glu Lys Lys Thr Tyr Glu Ala Lys Val Trp Val Lys Leu
65 70 75 80
Trp Glu Asn Phe Lys Glu Phe Ala Arg Asn Ser Asn Leu Val
85 90






60 amino acids


amino acid


linear




peptide



internal


70
Val Asp Glu His Asn Lys Lys Gln Asn Thr Leu Leu Glu Phe Lys Lys
1 5 10 15
Val Leu Asn Thr Lys Glu Gln Val Val Ala Gly Ile Met Tyr Tyr Ile
20 25 30
Thr Leu Glu Ala Thr Asp Gly Gly Glu Lys Lys Thr Tyr Glu Ala Lys
35 40 45
Val Trp Val Lys Pro Trp Glu Asn Phe Lys Glu Phe
50 55 60






1196 base pairs


nucleic acid


single


linear




cDNA




CDS


1..1161




71
TTG TAT TTT ACC TTA GCC CTT GTC ACT TTG CTG CAA CCT GTT CGT TCT 48
Leu Tyr Phe Thr Leu Ala Leu Val Thr Leu Leu Gln Pro Val Arg Ser
1 5 10 15
GCC GAA GAT CTC CAG GAA ATC TTA CCA GTT AAC GAA ACA AGG AGG CTG 96
Ala Glu Asp Leu Gln Glu Ile Leu Pro Val Asn Glu Thr Arg Arg Leu
20 25 30
ACA ACA AGT GGA GCA TAC AAC ATT ATA GAC GGG TGC TGG AGG GGC AAA 144
Thr Thr Ser Gly Ala Tyr Asn Ile Ile Asp Gly Cys Trp Arg Gly Lys
35 40 45
GCC GAT TGG GCG GAA AAC CGA AAA GCG TTA GCC GAT TGT GCC CAA GGT 192
Ala Asp Trp Ala Glu Asn Arg Lys Ala Leu Ala Asp Cys Ala Gln Gly
50 55 60
TTT GGG AAG GGA ACA GTG GGC GGA AAA GAT GGT GAT ATA TAC ACG GTC 240
Phe Gly Lys Gly Thr Val Gly Gly Lys Asp Gly Asp Ile Tyr Thr Val
65 70 75 80
ACC AGT GAG CTA GAT GAT GAT GTT GCA AAT CCA AAA GAA GGC ACA CTC 288
Thr Ser Glu Leu Asp Asp Asp Val Ala Asn Pro Lys Glu Gly Thr Leu
85 90 95
CGG TTT GGT GCC GCC CAA AAC AGG CCC TTG TGG ATC ATT TTT GAA AGA 336
Arg Phe Gly Ala Ala Gln Asn Arg Pro Leu Trp Ile Ile Phe Glu Arg
100 105 110
GAT ATG GTG ATT CGT TTG GAT AAA GAG ATG GTG GTA AAC AGT GAC AAG 384
Asp Met Val Ile Arg Leu Asp Lys Glu Met Val Val Asn Ser Asp Lys
115 120 125
ACC ATC GAT GGC CGA GGG GCG AAA GTT GAA ATC ATT AAC GCT GGT TTC 432
Thr Ile Asp Gly Arg Gly Ala Lys Val Glu Ile Ile Asn Ala Gly Phe
130 135 140
ACC CTT AAT GGT GTC AAG AAT GTA ATC ATT CAT AAC ATA AAT ATG CAT 480
Thr Leu Asn Gly Val Lys Asn Val Ile Ile His Asn Ile Asn Met His
145 150 155 160
GAT GTT AAA GTG AAT CCA GGA GGC CTG ATT AAG TCC AAC GAT GGT CCA 528
Asp Val Lys Val Asn Pro Gly Gly Leu Ile Lys Ser Asn Asp Gly Pro
165 170 175
GCA GCT CCA AGA GCT GGT AGT GAT GGT GAT GCT ATA AGT ATT TCT GGT 576
Ala Ala Pro Arg Ala Gly Ser Asp Gly Asp Ala Ile Ser Ile Ser Gly
180 185 190
AGT TCA CAA ATA TGG ATC GAC CAT TGT TCG CTC AGT AAG TCT GTT GAT 624
Ser Ser Gln Ile Trp Ile Asp His Cys Ser Leu Ser Lys Ser Val Asp
195 200 205
GGG CTG GTA GAT GCC AAG CTC GGC ACC ACA CGC TTA ACC GTT TCC AAC 672
Gly Leu Val Asp Ala Lys Leu Gly Thr Thr Arg Leu Thr Val Ser Asn
210 215 220
AGC TTA TTC ACC CAA CAC CAG TTT GTA CTA TTA TTC GGG GCT GGT GAC 720
Ser Leu Phe Thr Gln His Gln Phe Val Leu Leu Phe Gly Ala Gly Asp
225 230 235 240
GAA AAT ATT GAA GAT AGA GGC ATG CTA GCA ACG GTC GCT TTC AAC ACG 768
Glu Asn Ile Glu Asp Arg Gly Met Leu Ala Thr Val Ala Phe Asn Thr
245 250 255
TTC ACT GAT AAC GTT GAC CAA AGA ATG CCT AGA TGT CGA CAT GGG TTT 816
Phe Thr Asp Asn Val Asp Gln Arg Met Pro Arg Cys Arg His Gly Phe
260 265 270
TTC CAA GTC GTT AAC AAC AAC TAT GAT AAA TGG GGA TCG TAT GCC ATC 864
Phe Gln Val Val Asn Asn Asn Tyr Asp Lys Trp Gly Ser Tyr Ala Ile
275 280 285
GGT GGT AGC GCG TCC CCA ACC ATA CTC AGC CAA GGG AAC AGA TTC TGC 912
Gly Gly Ser Ala Ser Pro Thr Ile Leu Ser Gln Gly Asn Arg Phe Cys
290 295 300
GCC CCC GAT GAA CGC AGC AAG AAA AAT GTC CTA GGA AGG CAT GGT GAA 960
Ala Pro Asp Glu Arg Ser Lys Lys Asn Val Leu Gly Arg His Gly Glu
305 310 315 320
GCC GCC GCA GAG TCG ATG AAG TGG AAC TGG AGA ACG AAT AAA GAC GTG 1008
Ala Ala Ala Glu Ser Met Lys Trp Asn Trp Arg Thr Asn Lys Asp Val
325 330 335
CTT GAA AAT GGT GCT ATT TTT GTT GCA TCC GGG GTC GAT CCA GTG CTA 1056
Leu Glu Asn Gly Ala Ile Phe Val Ala Ser Gly Val Asp Pro Val Leu
340 345 350
ACC CCT GAG CAA AGC GCA GGG ATG ATT CCA GCC GAA CCA GGA GAG TCC 1104
Thr Pro Glu Gln Ser Ala Gly Met Ile Pro Ala Glu Pro Gly Glu Ser
355 360 365
GCT CTA AGC CTC ACT AGT AGT GCT GGT GTA CTC TCA TGC CAA CCC GGA 1152
Ala Leu Ser Leu Thr Ser Ser Ala Gly Val Leu Ser Cys Gln Pro Gly
370 375 380
GCA CCT TGC TAA GCA CCC GAC CAA TTA CTA AGC ACT TAT AAT 1194
Ala Pro Cys *
385
GA 1196






387 amino acids


amino acid


linear




protein



72
Leu Tyr Phe Thr Leu Ala Leu Val Thr Leu Leu Gln Pro Val Arg Ser
1 5 10 15
Ala Glu Asp Leu Gln Glu Ile Leu Pro Val Asn Glu Thr Arg Arg Leu
20 25 30
Thr Thr Ser Gly Ala Tyr Asn Ile Ile Asp Gly Cys Trp Arg Gly Lys
35 40 45
Ala Asp Trp Ala Glu Asn Arg Lys Ala Leu Ala Asp Cys Ala Gln Gly
50 55 60
Phe Gly Lys Gly Thr Val Gly Gly Lys Asp Gly Asp Ile Tyr Thr Val
65 70 75 80
Thr Ser Glu Leu Asp Asp Asp Val Ala Asn Pro Lys Glu Gly Thr Leu
85 90 95
Arg Phe Gly Ala Ala Gln Asn Arg Pro Leu Trp Ile Ile Phe Glu Arg
100 105 110
Asp Met Val Ile Arg Leu Asp Lys Glu Met Val Val Asn Ser Asp Lys
115 120 125
Thr Ile Asp Gly Arg Gly Ala Lys Val Glu Ile Ile Asn Ala Gly Phe
130 135 140
Thr Leu Asn Gly Val Lys Asn Val Ile Ile His Asn Ile Asn Met His
145 150 155 160
Asp Val Lys Val Asn Pro Gly Gly Leu Ile Lys Ser Asn Asp Gly Pro
165 170 175
Ala Ala Pro Arg Ala Gly Ser Asp Gly Asp Ala Ile Ser Ile Ser Gly
180 185 190
Ser Ser Gln Ile Trp Ile Asp His Cys Ser Leu Ser Lys Ser Val Asp
195 200 205
Gly Leu Val Asp Ala Lys Leu Gly Thr Thr Arg Leu Thr Val Ser Asn
210 215 220
Ser Leu Phe Thr Gln His Gln Phe Val Leu Leu Phe Gly Ala Gly Asp
225 230 235 240
Glu Asn Ile Glu Asp Arg Gly Met Leu Ala Thr Val Ala Phe Asn Thr
245 250 255
Phe Thr Asp Asn Val Asp Gln Arg Met Pro Arg Cys Arg His Gly Phe
260 265 270
Phe Gln Val Val Asn Asn Asn Tyr Asp Lys Trp Gly Ser Tyr Ala Ile
275 280 285
Gly Gly Ser Ala Ser Pro Thr Ile Leu Ser Gln Gly Asn Arg Phe Cys
290 295 300
Ala Pro Asp Glu Arg Ser Lys Lys Asn Val Leu Gly Arg His Gly Glu
305 310 315 320
Ala Ala Ala Glu Ser Met Lys Trp Asn Trp Arg Thr Asn Lys Asp Val
325 330 335
Leu Glu Asn Gly Ala Ile Phe Val Ala Ser Gly Val Asp Pro Val Leu
340 345 350
Thr Pro Glu Gln Ser Ala Gly Met Ile Pro Ala Glu Pro Gly Glu Ser
355 360 365
Ala Leu Ser Leu Thr Ser Ser Ala Gly Val Leu Ser Cys Gln Pro Gly
370 375 380
Ala Pro Cys
385






1349 base pairs


nucleic acid


single


linear




cDNA




CDS


1..1323




73
ATG GGG ATC AAA CAC TGT TGT TAC ATC TTG TAT TTT ACC TTA GCC CTT 48
Met Gly Ile Lys His Cys Cys Tyr Ile Leu Tyr Phe Thr Leu Ala Leu
1 5 10 15
GTC ACT TTG CTG CAA CCT GTT CGT TCT GCA GAA GAT GTT GAA GAA TTC 96
Val Thr Leu Leu Gln Pro Val Arg Ser Ala Glu Asp Val Glu Glu Phe
20 25 30
TTA CCT TCA GCT AAC GAA ACA AGG AGG AGC CTG AAA GCA TGT GAA GCA 144
Leu Pro Ser Ala Asn Glu Thr Arg Arg Ser Leu Lys Ala Cys Glu Ala
35 40 45
CAC AAC ATT ATA GAC AAG TGC TGG AGG TGC AAA GCC GAT TGG GCG AAT 192
His Asn Ile Ile Asp Lys Cys Trp Arg Cys Lys Ala Asp Trp Ala Asn
50 55 60
AAC CGA CAA GCG TTA GCC GAT TGT GCC CAA GGT TTT GCA AAG GGA ACC 240
Asn Arg Gln Ala Leu Ala Asp Cys Ala Gln Gly Phe Ala Lys Gly Thr
65 70 75 80
TAC GGT GGA AAA CAT GGT GAT GTC TAC ACG GTC ACC AGT GAT AAA GAT 288
Tyr Gly Gly Lys His Gly Asp Val Tyr Thr Val Thr Ser Asp Lys Asp
85 90 95
GAT GAT GTT GCA AAT CCA AAA GAA GGC ACA CTC CGG TTT GCT GCT GCC 336
Asp Asp Val Ala Asn Pro Lys Glu Gly Thr Leu Arg Phe Ala Ala Ala
100 105 110
CAA AAC AGG CCC TTG TGG ATC ATT TTT AAA AGA AAT ATG GTG ATT CAT 384
Gln Asn Arg Pro Leu Trp Ile Ile Phe Lys Arg Asn Met Val Ile His
115 120 125
TTG AAT CAA GAG CTT GTC GTA AAC AGC GAC AAG ACC ATC GAT GGC CGA 432
Leu Asn Gln Glu Leu Val Val Asn Ser Asp Lys Thr Ile Asp Gly Arg
130 135 140
GGG GTG AAA GTT AAC ATC GTT AAC GCC GGT CTC ACC CTC ATG AAT GTC 480
Gly Val Lys Val Asn Ile Val Asn Ala Gly Leu Thr Leu Met Asn Val
145 150 155 160
AAG AAT ATA ATC ATT CAT AAC ATA AAT ATC CAT GAT ATT AAA GTT TGT 528
Lys Asn Ile Ile Ile His Asn Ile Asn Ile His Asp Ile Lys Val Cys
165 170 175
CCA GGA GGC ATG ATT AAG TCC AAC GAT GGT CCA CCA ATT TTA AGA CAA 576
Pro Gly Gly Met Ile Lys Ser Asn Asp Gly Pro Pro Ile Leu Arg Gln
180 185 190
CAA AGT GAT GGT GAT GCT ATA AAT GTT GCT GGT AGT TCA CAA ATA TGG 624
Gln Ser Asp Gly Asp Ala Ile Asn Val Ala Gly Ser Ser Gln Ile Trp
195 200 205
ATC GAC CAT TGC TCG CTC AGT AAG GCT TCC GAT GGG CTG CTC GAT ATC 672
Ile Asp His Cys Ser Leu Ser Lys Ala Ser Asp Gly Leu Leu Asp Ile
210 215 220
ACC CTC GGC AGC TCA CAC GTG ACC GTT TCC AAC TGC AAA TTC ACC CAA 720
Thr Leu Gly Ser Ser His Val Thr Val Ser Asn Cys Lys Phe Thr Gln
225 230 235 240
CAC CAA TTT GTA TTA TTG CTC GGG GCT GAT GAC ACC CAT TAT CAA GAT 768
His Gln Phe Val Leu Leu Leu Gly Ala Asp Asp Thr His Tyr Gln Asp
245 250 255
AAA GGC ATG CTA GCA ACG GTA GCA TTC AAC ATG TTC ACC GAT CAC GTT 816
Lys Gly Met Leu Ala Thr Val Ala Phe Asn Met Phe Thr Asp His Val
260 265 270
GAC CAA AGA ATG CCT AGA TGT AGA TTT GGG TTT TTC CAA GTC GTT AAC 864
Asp Gln Arg Met Pro Arg Cys Arg Phe Gly Phe Phe Gln Val Val Asn
275 280 285
AAC AAC TAC GAC AGA TGG GGA ACG TAC GCC ATC GGT GGT AGC TCG GCC 912
Asn Asn Tyr Asp Arg Trp Gly Thr Tyr Ala Ile Gly Gly Ser Ser Ala
290 295 300
CCA ACT ATA CTC AGC CAA GGG AAC AGA TTC TTC GCC CCC GAT GAT ATC 960
Pro Thr Ile Leu Ser Gln Gly Asn Arg Phe Phe Ala Pro Asp Asp Ile
305 310 315 320
ATC AAG AAA AAT GTC TTA GCG AGG ACT GGT ACT GGC AAC GCA GAG TCG 1008
Ile Lys Lys Asn Val Leu Ala Arg Thr Gly Thr Gly Asn Ala Glu Ser
325 330 335
ATG TCG TGG AAC TGG AGA ACA GAT AGA GAC TTG CTT GAA AAT GGT GCT 1056
Met Ser Trp Asn Trp Arg Thr Asp Arg Asp Leu Leu Glu Asn Gly Ala
340 345 350
ATT TTT CTC CCA TCC GGG TCT GAT CCA GTG CTA ACC CCT GAG CAA AAA 1104
Ile Phe Leu Pro Ser Gly Ser Asp Pro Val Leu Thr Pro Glu Gln Lys
355 360 365
GCA GGG ATG ATT CCA GCT GAA CCA GGA GAA GCC GTT CTA AGA CTC ACT 1152
Ala Gly Met Ile Pro Ala Glu Pro Gly Glu Ala Val Leu Arg Leu Thr
370 375 380
AGT AGT GCT GGT GTA CTC TCA TGC CAT CAA GGA GCA CCT TGC TAA GCA 1200
Ser Ser Ala Gly Val Leu Ser Cys His Gln Gly Ala Pro Cys * Ala
385 390 395 400
CCT GGC CAA TTC CTA AGC TTT TAT AAT AAT CAT AAA TAC TTA TTT TAT 1248
Pro Gly Gln Phe Leu Ser Phe Tyr Asn Asn His Lys Tyr Leu Phe Tyr
405 410 415
TTT ATT TTT GAT ATT TTA TAT GAA CCA TTA CGT TCA AGT ACT CTA TTA 1296
Phe Ile Phe Asp Ile Leu Tyr Glu Pro Leu Arg Ser Ser Thr Leu Leu
420 425 430
ACA TGT TTT AAA TTC ATA AGA GTT TAT TGA TAA AAA AAA AAA AAA CCG 1344
Thr Cys Phe Lys Phe Ile Arg Val Tyr * *
435 440
AAT TC 1349






398 amino acids


amino acid


linear




protein



74
Met Gly Ile Lys His Cys Cys Tyr Ile Leu Tyr Phe Thr Leu Ala Leu
1 5 10 15
Val Thr Leu Leu Gln Pro Val Arg Ser Ala Glu Asp Val Glu Glu Phe
20 25 30
Leu Pro Ser Ala Asn Glu Thr Arg Arg Ser Leu Lys Ala Cys Glu Ala
35 40 45
His Asn Ile Ile Asp Lys Cys Trp Arg Cys Lys Ala Asp Trp Ala Asn
50 55 60
Asn Arg Gln Ala Leu Ala Asp Cys Ala Gln Gly Phe Ala Lys Gly Thr
65 70 75 80
Tyr Gly Gly Lys His Gly Asp Val Tyr Thr Val Thr Ser Asp Lys Asp
85 90 95
Asp Asp Val Ala Asn Pro Lys Glu Gly Thr Leu Arg Phe Ala Ala Ala
100 105 110
Gln Asn Arg Pro Leu Trp Ile Ile Phe Lys Arg Asn Met Val Ile His
115 120 125
Leu Asn Gln Glu Leu Val Val Asn Ser Asp Lys Thr Ile Asp Gly Arg
130 135 140
Gly Val Lys Val Asn Ile Val Asn Ala Gly Leu Thr Leu Met Asn Val
145 150 155 160
Lys Asn Ile Ile Ile His Asn Ile Asn Ile His Asp Ile Lys Val Cys
165 170 175
Pro Gly Gly Met Ile Lys Ser Asn Asp Gly Pro Pro Ile Leu Arg Gln
180 185 190
Gln Ser Asp Gly Asp Ala Ile Asn Val Ala Gly Ser Ser Gln Ile Trp
195 200 205
Ile Asp His Cys Ser Leu Ser Lys Ala Ser Asp Gly Leu Leu Asp Ile
210 215 220
Thr Leu Gly Ser Ser His Val Thr Val Ser Asn Cys Lys Phe Thr Gln
225 230 235 240
His Gln Phe Val Leu Leu Leu Gly Ala Asp Asp Thr His Tyr Gln Asp
245 250 255
Lys Gly Met Leu Ala Thr Val Ala Phe Asn Met Phe Thr Asp His Val
260 265 270
Asp Gln Arg Met Pro Arg Cys Arg Phe Gly Phe Phe Gln Val Val Asn
275 280 285
Asn Asn Tyr Asp Arg Trp Gly Thr Tyr Ala Ile Gly Gly Ser Ser Ala
290 295 300
Pro Thr Ile Leu Ser Gln Gly Asn Arg Phe Phe Ala Pro Asp Asp Ile
305 310 315 320
Ile Lys Lys Asn Val Leu Ala Arg Thr Gly Thr Gly Asn Ala Glu Ser
325 330 335
Met Ser Trp Asn Trp Arg Thr Asp Arg Asp Leu Leu Glu Asn Gly Ala
340 345 350
Ile Phe Leu Pro Ser Gly Ser Asp Pro Val Leu Thr Pro Glu Gln Lys
355 360 365
Ala Gly Met Ile Pro Ala Glu Pro Gly Glu Ala Val Leu Arg Leu Thr
370 375 380
Ser Ser Ala Gly Val Leu Ser Cys His Gln Gly Ala Pro Cys
385 390 395






1320 base pairs


nucleic acid


single


linear




cDNA




CDS


1..1302




75
ATG GGG ATC AAA CAA TGT TGT TAC ATC TTG TAT TTT ACC TTA GCA CTT 48
Met Gly Ile Lys Gln Cys Cys Tyr Ile Leu Tyr Phe Thr Leu Ala Leu
1 5 10 15
GTC GCT TTG CTG CAA CCT GTT CGT TCT GCC GAA GGT GTC GGG GAA ATC 96
Val Ala Leu Leu Gln Pro Val Arg Ser Ala Glu Gly Val Gly Glu Ile
20 25 30
TTA CCT TCA GTT AAC GAA ACG AGG AGC CTG CAA GCA TGT GAA GCA CTC 144
Leu Pro Ser Val Asn Glu Thr Arg Ser Leu Gln Ala Cys Glu Ala Leu
35 40 45
AAC ATT ATA GAC AAG TGC TGG AGG GGC AAA GCC GAT TGG GAG AAC AAC 192
Asn Ile Ile Asp Lys Cys Trp Arg Gly Lys Ala Asp Trp Glu Asn Asn
50 55 60
CGA CAA GCG TTA GCC GAC TGT GCC CAA GGT TTT GCA AAG GGA ACC TAC 240
Arg Gln Ala Leu Ala Asp Cys Ala Gln Gly Phe Ala Lys Gly Thr Tyr
65 70 75 80
GGC GGA AAA TGG GGT GAT GTC TAC ACG GTC ACC AGC AAT CTA GAT GAT 288
Gly Gly Lys Trp Gly Asp Val Tyr Thr Val Thr Ser Asn Leu Asp Asp
85 90 95
GAT GTT GCA AAT CCA AAA GAA GGC ACA CTC CGG TTT GCT GCC GCC CAA 336
Asp Val Ala Asn Pro Lys Glu Gly Thr Leu Arg Phe Ala Ala Ala Gln
100 105 110
AAC AGG CCC TTG TGG ATC ATT TTT AAA AAT GAT ATG GTG ATT AAT TTG 384
Asn Arg Pro Leu Trp Ile Ile Phe Lys Asn Asp Met Val Ile Asn Leu
115 120 125
AAT CAA GAG CTT GTC GTA AAC AGC GAC AAG ACC ATC GAT GGC CGA GGG 432
Asn Gln Glu Leu Val Val Asn Ser Asp Lys Thr Ile Asp Gly Arg Gly
130 135 140
GTG AAA GTT GAA ATC ATT AAC GGA GGT CTC ACC CTC ATG AAT GTC AAG 480
Val Lys Val Glu Ile Ile Asn Gly Gly Leu Thr Leu Met Asn Val Lys
145 150 155 160
AAT ATA ATC ATT CAT AAC ATA AAT ATC CAT GAT GTT AAA GTG CTT CCA 528
Asn Ile Ile Ile His Asn Ile Asn Ile His Asp Val Lys Val Leu Pro
165 170 175
GGA GGC ATG ATT AAG TCC AAC GAT GGT CCA CCA ATT TTA AGA CAA GCA 576
Gly Gly Met Ile Lys Ser Asn Asp Gly Pro Pro Ile Leu Arg Gln Ala
180 185 190
AGT GAT GGG GAT ACT ATA AAT GTT GCT GGT AGT TCC CAA ATA TGG ATA 624
Ser Asp Gly Asp Thr Ile Asn Val Ala Gly Ser Ser Gln Ile Trp Ile
195 200 205
GAC CAT TGC TCA CTC AGC AAG TCT TTC GAT GGG CTG GTC GAT GTC ACC 672
Asp His Cys Ser Leu Ser Lys Ser Phe Asp Gly Leu Val Asp Val Thr
210 215 220
CTC GGT AGC ACA CAC GTG ACC ATT TCC AAC TGC AAA TTC ACC CAA CAG 720
Leu Gly Ser Thr His Val Thr Ile Ser Asn Cys Lys Phe Thr Gln Gln
225 230 235 240
TCA AAA GCA ATA TTG TTG GGA GCA GAT GAC ACC CAT GTT CAA GAT AAA 768
Ser Lys Ala Ile Leu Leu Gly Ala Asp Asp Thr His Val Gln Asp Lys
245 250 255
GGA ATG CTA GCA ACG GTC GCT TTC AAC ATG TTC ACC GAT AAC GTT GAC 816
Gly Met Leu Ala Thr Val Ala Phe Asn Met Phe Thr Asp Asn Val Asp
260 265 270
CAA AGA ATG CCT AGA TGT CGA TTT GGG TTT TTC CAA GTT GTT AAC AAC 864
Gln Arg Met Pro Arg Cys Arg Phe Gly Phe Phe Gln Val Val Asn Asn
275 280 285
AAC TAC GAC AGA TGG GGA ACG TAC GCC ATA GGT GGT AGC TCG GCC CCA 912
Asn Tyr Asp Arg Trp Gly Thr Tyr Ala Ile Gly Gly Ser Ser Ala Pro
290 295 300
ACT ATA CTC TGC CAA GGG AAC AGA TTC TTG GCC CCT GAT GAT CAG ATC 960
Thr Ile Leu Cys Gln Gly Asn Arg Phe Leu Ala Pro Asp Asp Gln Ile
305 310 315 320
AAG AAA AAT GTC CTA GCG AGG ACT GGT ACA GGC GCT GCT GAG TCG ATG 1008
Lys Lys Asn Val Leu Ala Arg Thr Gly Thr Gly Ala Ala Glu Ser Met
325 330 335
GCG TGG AAC TGG AGA TCT GAT AAA GAC TTG CTT GAA AAT GGT GCT ATT 1056
Ala Trp Asn Trp Arg Ser Asp Lys Asp Leu Leu Glu Asn Gly Ala Ile
340 345 350
TTT GTT ACA TCT GGG TCT GAT CCA GTG CTA ACC CCT GTT CAA AGC GCA 1104
Phe Val Thr Ser Gly Ser Asp Pro Val Leu Thr Pro Val Gln Ser Ala
355 360 365
GGG ATG ATT CCA GCT GAA CCA GGA GAA GCC GCT ATA AAA CTC ACT AGT 1152
Gly Met Ile Pro Ala Glu Pro Gly Glu Ala Ala Ile Lys Leu Thr Ser
370 375 380
AGT GCT GGT GTA TTC TCA TGC CGT CCT GGA GCA CCT TGC TAA GCA CCC 1200
Ser Ala Gly Val Phe Ser Cys Arg Pro Gly Ala Pro Cys * Ala Pro
385 390 395 400
TGC CAA TTC TCC TAA GCT TTT GCA ATG ATC AAA AAT ACT TTT TTA TTT 1248
Cys Gln Phe Ser * Ala Phe Ala Met Ile Lys Asn Thr Phe Leu Phe
405 410 415
TAT TTT TAA TAT TTT ATA TGT ACT GGA AAT GAA CCA TTA CCT TCT AGT 1296
Tyr Phe * Tyr Phe Ile Cys Thr Gly Asn Glu Pro Leu Pro Ser Ser
420 425 430
ACT CTA TAA CAT GTT TTG CAT TTA 1320
Thr Leu *
435






397 amino acids


amino acid


linear




protein



76
Met Gly Ile Lys Gln Cys Cys Tyr Ile Leu Tyr Phe Thr Leu Ala Leu
1 5 10 15
Val Ala Leu Leu Gln Pro Val Arg Ser Ala Glu Gly Val Gly Glu Ile
20 25 30
Leu Pro Ser Val Asn Glu Thr Arg Ser Leu Gln Ala Cys Glu Ala Leu
35 40 45
Asn Ile Ile Asp Lys Cys Trp Arg Gly Lys Ala Asp Trp Glu Asn Asn
50 55 60
Arg Gln Ala Leu Ala Asp Cys Ala Gln Gly Phe Ala Lys Gly Thr Tyr
65 70 75 80
Gly Gly Lys Trp Gly Asp Val Tyr Thr Val Thr Ser Asn Leu Asp Asp
85 90 95
Asp Val Ala Asn Pro Lys Glu Gly Thr Leu Arg Phe Ala Ala Ala Gln
100 105 110
Asn Arg Pro Leu Trp Ile Ile Phe Lys Asn Asp Met Val Ile Asn Leu
115 120 125
Asn Gln Glu Leu Val Val Asn Ser Asp Lys Thr Ile Asp Gly Arg Gly
130 135 140
Val Lys Val Glu Ile Ile Asn Gly Gly Leu Thr Leu Met Asn Val Lys
145 150 155 160
Asn Ile Ile Ile His Asn Ile Asn Ile His Asp Val Lys Val Leu Pro
165 170 175
Gly Gly Met Ile Lys Ser Asn Asp Gly Pro Pro Ile Leu Arg Gln Ala
180 185 190
Ser Asp Gly Asp Thr Ile Asn Val Ala Gly Ser Ser Gln Ile Trp Ile
195 200 205
Asp His Cys Ser Leu Ser Lys Ser Phe Asp Gly Leu Val Asp Val Thr
210 215 220
Leu Gly Ser Thr His Val Thr Ile Ser Asn Cys Lys Phe Thr Gln Gln
225 230 235 240
Ser Lys Ala Ile Leu Leu Gly Ala Asp Asp Thr His Val Gln Asp Lys
245 250 255
Gly Met Leu Ala Thr Val Ala Phe Asn Met Phe Thr Asp Asn Val Asp
260 265 270
Gln Arg Met Pro Arg Cys Arg Phe Gly Phe Phe Gln Val Val Asn Asn
275 280 285
Asn Tyr Asp Arg Trp Gly Thr Tyr Ala Ile Gly Gly Ser Ser Ala Pro
290 295 300
Thr Ile Leu Cys Gln Gly Asn Arg Phe Leu Ala Pro Asp Asp Gln Ile
305 310 315 320
Lys Lys Asn Val Leu Ala Arg Thr Gly Thr Gly Ala Ala Glu Ser Met
325 330 335
Ala Trp Asn Trp Arg Ser Asp Lys Asp Leu Leu Glu Asn Gly Ala Ile
340 345 350
Phe Val Thr Ser Gly Ser Asp Pro Val Leu Thr Pro Val Gln Ser Ala
355 360 365
Gly Met Ile Pro Ala Glu Pro Gly Glu Ala Ala Ile Lys Leu Thr Ser
370 375 380
Ser Ala Gly Val Phe Ser Cys Arg Pro Gly Ala Pro Cys
385 390 395






1160 base pairs


nucleic acid


single


linear




cDNA




CDS


1..1148




77
TTG TAT TTT ACC TTA GCC CTT GTC ACT TTG CTG CAA CCT GTT CGT TCT 48
Leu Tyr Phe Thr Leu Ala Leu Val Thr Leu Leu Gln Pro Val Arg Ser
1 5 10 15
GCC GAA GAT CTC CAG GAA ATC TTA CCT TCA GCT AAC GAA ACA AGG AGC 96
Ala Glu Asp Leu Gln Glu Ile Leu Pro Ser Ala Asn Glu Thr Arg Ser
20 25 30
CTG ACA ACA TGT GGA ACA TAC AAC ATT ATA GAC GGG TGC TGG AGG GGC 144
Leu Thr Thr Cys Gly Thr Tyr Asn Ile Ile Asp Gly Cys Trp Arg Gly
35 40 45
AAA GCC GAT TGG GCG GAA AAC CGA AAA GCG TTA GCC GAT TGT GCC CAA 192
Lys Ala Asp Trp Ala Glu Asn Arg Lys Ala Leu Ala Asp Cys Ala Gln
50 55 60
GGT TTT GCA AAG GGA ACA ATC GGC GGA AAA GAT GGT GAT ATA TAC ACG 240
Gly Phe Ala Lys Gly Thr Ile Gly Gly Lys Asp Gly Asp Ile Tyr Thr
65 70 75 80
GTC ACC AGT GAG CTA GAT GAT GAT GTT GCA AAT CCA AAA GAA GGC ACA 288
Val Thr Ser Glu Leu Asp Asp Asp Val Ala Asn Pro Lys Glu Gly Thr
85 90 95
CTC CGG TTT GGT GCC GCC CAA AAC AGG CCC TTG TGG ATT ATT TTT GAA 336
Leu Arg Phe Gly Ala Ala Gln Asn Arg Pro Leu Trp Ile Ile Phe Glu
100 105 110
AGA GAT ATG GTG ATT CGT TTG GAT AGA GAG TTG GCT ATA AAC AAC GAC 384
Arg Asp Met Val Ile Arg Leu Asp Arg Glu Leu Ala Ile Asn Asn Asp
115 120 125
AAG ACC ATC GAT GGC CGA GGG GCG AAA GTT GAA ATC ATT AAC GCT GGT 432
Lys Thr Ile Asp Gly Arg Gly Ala Lys Val Glu Ile Ile Asn Ala Gly
130 135 140
TTC GCC ATC TAT AAT GTC AAG AAT ATA ATC ATT CAT AAC ATA ATT ATG 480
Phe Ala Ile Tyr Asn Val Lys Asn Ile Ile Ile His Asn Ile Ile Met
145 150 155 160
CAT GAT ATT GTA GTG AAT CCA GGA GGC CTG ATT AAG TCC CAC GAT GGT 528
His Asp Ile Val Val Asn Pro Gly Gly Leu Ile Lys Ser His Asp Gly
165 170 175
CCA CCA GTT CCA AGA AAG GGT AGT GAT GGT GAT GCT ATA GGT ATT TCT 576
Pro Pro Val Pro Arg Lys Gly Ser Asp Gly Asp Ala Ile Gly Ile Ser
180 185 190
GGT GGT TCA CAA ATA TGG ATC GAC CAT TGC TCC CTC AGT AAG GCT GTT 624
Gly Gly Ser Gln Ile Trp Ile Asp His Cys Ser Leu Ser Lys Ala Val
195 200 205
GAT GGG CTA ATC GAT GCT AAA CAC GGC AGC ACA CAC TTC ACC GTT TCT 672
Asp Gly Leu Ile Asp Ala Lys His Gly Ser Thr His Phe Thr Val Ser
210 215 220
AAC TGC TTA TTC ACC CAA CAC CAA TAT TTA TTA TTG TTC TGG GAT TTT 720
Asn Cys Leu Phe Thr Gln His Gln Tyr Leu Leu Leu Phe Trp Asp Phe
225 230 235 240
GAC GAG CGA GGC ATG CTA TGT ACG GTC GCA TTC AAC AAG TTC ACT GAT 768
Asp Glu Arg Gly Met Leu Cys Thr Val Ala Phe Asn Lys Phe Thr Asp
245 250 255
AAC GTT GAC CAA AGA ATG CCT AAC TTA CGA CAT GGG TTT GTC CAA GTC 816
Asn Val Asp Gln Arg Met Pro Asn Leu Arg His Gly Phe Val Gln Val
260 265 270
GTT AAC AAC AAC TAC GAA AGA TGG GGA TCG TAC GCC CTC GGT GGT AGC 864
Val Asn Asn Asn Tyr Glu Arg Trp Gly Ser Tyr Ala Leu Gly Gly Ser
275 280 285
GCA GGC CCA ACC ATA CTT AGC CAA GGG AAC AGA TTC TTA GCC TCC GAT 912
Ala Gly Pro Thr Ile Leu Ser Gln Gly Asn Arg Phe Leu Ala Ser Asp
290 295 300
ATC AAG AAA GAG GTC GTA GGG AGG TAT GGT GAA TCC GCC ATG TCA GAG 960
Ile Lys Lys Glu Val Val Gly Arg Tyr Gly Glu Ser Ala Met Ser Glu
305 310 315 320
TCG ATT AAT TGG AAC TGG AGA TCG TAT ATG GAC GTA TTT GAA AAT GGT 1008
Ser Ile Asn Trp Asn Trp Arg Ser Tyr Met Asp Val Phe Glu Asn Gly
325 330 335
GCT ATT TTT GTT CCA TCC GGG GTT GAT CCA GTG CTA ACC CCT GAG CAA 1056
Ala Ile Phe Val Pro Ser Gly Val Asp Pro Val Leu Thr Pro Glu Gln
340 345 350
AAC GCA GGG ATG ATT CCA GCC GAA CCA GGA GAA GCC GTT CTA AGA CTC 1104
Asn Ala Gly Met Ile Pro Ala Glu Pro Gly Glu Ala Val Leu Arg Leu
355 360 365
ACT AGT AGT GCT GGT GTC CTC TCA TGC CAA CCT GGA GCA CCT TGC TAA 1152
Thr Ser Ser Ala Gly Val Leu Ser Cys Gln Pro Gly Ala Pro Cys *
370 375 380
GCA CTG CA 1160






383 amino acids


amino acid


linear




protein



78
Leu Tyr Phe Thr Leu Ala Leu Val Thr Leu Leu Gln Pro Val Arg Ser
1 5 10 15
Ala Glu Asp Leu Gln Glu Ile Leu Pro Ser Ala Asn Glu Thr Arg Ser
20 25 30
Leu Thr Thr Cys Gly Thr Tyr Asn Ile Ile Asp Gly Cys Trp Arg Gly
35 40 45
Lys Ala Asp Trp Ala Glu Asn Arg Lys Ala Leu Ala Asp Cys Ala Gln
50 55 60
Gly Phe Ala Lys Gly Thr Ile Gly Gly Lys Asp Gly Asp Ile Tyr Thr
65 70 75 80
Val Thr Ser Glu Leu Asp Asp Asp Val Ala Asn Pro Lys Glu Gly Thr
85 90 95
Leu Arg Phe Gly Ala Ala Gln Asn Arg Pro Leu Trp Ile Ile Phe Glu
100 105 110
Arg Asp Met Val Ile Arg Leu Asp Arg Glu Leu Ala Ile Asn Asn Asp
115 120 125
Lys Thr Ile Asp Gly Arg Gly Ala Lys Val Glu Ile Ile Asn Ala Gly
130 135 140
Phe Ala Ile Tyr Asn Val Lys Asn Ile Ile Ile His Asn Ile Ile Met
145 150 155 160
His Asp Ile Val Val Asn Pro Gly Gly Leu Ile Lys Ser His Asp Gly
165 170 175
Pro Pro Val Pro Arg Lys Gly Ser Asp Gly Asp Ala Ile Gly Ile Ser
180 185 190
Gly Gly Ser Gln Ile Trp Ile Asp His Cys Ser Leu Ser Lys Ala Val
195 200 205
Asp Gly Leu Ile Asp Ala Lys His Gly Ser Thr His Phe Thr Val Ser
210 215 220
Asn Cys Leu Phe Thr Gln His Gln Tyr Leu Leu Leu Phe Trp Asp Phe
225 230 235 240
Asp Glu Arg Gly Met Leu Cys Thr Val Ala Phe Asn Lys Phe Thr Asp
245 250 255
Asn Val Asp Gln Arg Met Pro Asn Leu Arg His Gly Phe Val Gln Val
260 265 270
Val Asn Asn Asn Tyr Glu Arg Trp Gly Ser Tyr Ala Leu Gly Gly Ser
275 280 285
Ala Gly Pro Thr Ile Leu Ser Gln Gly Asn Arg Phe Leu Ala Ser Asp
290 295 300
Ile Lys Lys Glu Val Val Gly Arg Tyr Gly Glu Ser Ala Met Ser Glu
305 310 315 320
Ser Ile Asn Trp Asn Trp Arg Ser Tyr Met Asp Val Phe Glu Asn Gly
325 330 335
Ala Ile Phe Val Pro Ser Gly Val Asp Pro Val Leu Thr Pro Glu Gln
340 345 350
Asn Ala Gly Met Ile Pro Ala Glu Pro Gly Glu Ala Val Leu Arg Leu
355 360 365
Thr Ser Ser Ala Gly Val Leu Ser Cys Gln Pro Gly Ala Pro Cys
370 375 380






1368 base pairs


nucleic acid


single


linear




cDNA




CDS


1..1366




79
TTG TAT TTT ACC TTA GCA CTT GTC ACT TTG GTG CAA GCT GGA CGT CTT 48
Leu Tyr Phe Thr Leu Ala Leu Val Thr Leu Val Gln Ala Gly Arg Leu
1 5 10 15
GGC GAA GAG GTC GAC ATC TTA CCT TCA CCT AAC GAT ACA AGG AGG AGC 96
Gly Glu Glu Val Asp Ile Leu Pro Ser Pro Asn Asp Thr Arg Arg Ser
20 25 30
CTG CAA GGA TGT GAA GCA CAC AAC ATT ATA GAC AAG TGT TGG AGG TGC 144
Leu Gln Gly Cys Glu Ala His Asn Ile Ile Asp Lys Cys Trp Arg Cys
35 40 45
AAA CCC GAT TGG GCG GAG AAC CGA CAA GCG TTA GGC GAT TGT GCG CAA 192
Lys Pro Asp Trp Ala Glu Asn Arg Gln Ala Leu Gly Asp Cys Ala Gln
50 55 60
GGT TTT GGA AAG GCA ACT CAC GGC GGA AAA TGG GGT GAT ATC TAC ATG 240
Gly Phe Gly Lys Ala Thr His Gly Gly Lys Trp Gly Asp Ile Tyr Met
65 70 75 80
GTC ACA AGT GAT CAG GAT GAT GAT GTT GTA AAT CCA AAA GAA GGC ACA 288
Val Thr Ser Asp Gln Asp Asp Asp Val Val Asn Pro Lys Glu Gly Thr
85 90 95
CTC CGG TTC GGT GCT ACC CAG GAC AGG CCC TTG TGG ATC ATT TTT CAA 336
Leu Arg Phe Gly Ala Thr Gln Asp Arg Pro Leu Trp Ile Ile Phe Gln
100 105 110
AGA GAT ATG ATT ATT TAT TTG CAA CAA GAG ATG GTC GTA ACC AGC GAC 384
Arg Asp Met Ile Ile Tyr Leu Gln Gln Glu Met Val Val Thr Ser Asp
115 120 125
ACG ACC ATT GAT GGT CGA GGG GCG AAA GTT GAG CTC GTT TAT GGA GGT 432
Thr Thr Ile Asp Gly Arg Gly Ala Lys Val Glu Leu Val Tyr Gly Gly
130 135 140
ATC ACC CTC ATG AAT GTC AAG AAT GTA ATC ATT CAC AAC ATA GAT ATC 480
Ile Thr Leu Met Asn Val Lys Asn Val Ile Ile His Asn Ile Asp Ile
145 150 155 160
CAT GAT GTT AGA GTG CTT CCA GGA GGT AGG ATT AAG TCC AAT GGT GGT 528
His Asp Val Arg Val Leu Pro Gly Gly Arg Ile Lys Ser Asn Gly Gly
165 170 175
CCA GCC ATA CCA AGA CAT CAG AGT GAT GGT GAT GCT ATC CAT GTT ACG 576
Pro Ala Ile Pro Arg His Gln Ser Asp Gly Asp Ala Ile His Val Thr
180 185 190
GGT AGT TCA GAC ATA TGG ATC GAC CAT TGC ACG CTC AGT AAG TCA TTT 624
Gly Ser Ser Asp Ile Trp Ile Asp His Cys Thr Leu Ser Lys Ser Phe
195 200 205
GAT GGG CTC GTC GAT GTC AAC TGG GGC AGC ACA GGA GTA ACC ATT TCC 672
Asp Gly Leu Val Asp Val Asn Trp Gly Ser Thr Gly Val Thr Ile Ser
210 215 220
AAC TGC AAA TTC ACC CAC CAC GAA AAA GCT GTT TTG CTC GGG GCT AGT 720
Asn Cys Lys Phe Thr His His Glu Lys Ala Val Leu Leu Gly Ala Ser
225 230 235 240
GAC ACG CAT TTT CAA GAT CTG AAA ATG CAT GTA ACG CTT GCA TAC AAC 768
Asp Thr His Phe Gln Asp Leu Lys Met His Val Thr Leu Ala Tyr Asn
245 250 255
ATC TTC ACC AAT ACC GTT CAC GAA AGA ATG CCC AGA TGC CGA TTT GGG 816
Ile Phe Thr Asn Thr Val His Glu Arg Met Pro Arg Cys Arg Phe Gly
260 265 270
TTT TTC CAA ATC GTT AAC AAC TTC TAC GAC AGA TGG GAT AAG TAC GCC 864
Phe Phe Gln Ile Val Asn Asn Phe Tyr Asp Arg Trp Asp Lys Tyr Ala
275 280 285
ATC GGT GGT AGC TCG AAC CCT ACT ATT CTC AGC CAA GGG AAC AAA TTC 912
Ile Gly Gly Ser Ser Asn Pro Thr Ile Leu Ser Gln Gly Asn Lys Phe
290 295 300
GTG GCC CCC GAT TTC ATT TAC AAG AAA AAC GTC TGT CTA AGG ACT GGT 960
Val Ala Pro Asp Phe Ile Tyr Lys Lys Asn Val Cys Leu Arg Thr Gly
305 310 315 320
GCA CAG GAG CCA GAA TGG ATG ACT TGG AAC TGG AGA ACA CAA AAC GAC 1008
Ala Gln Glu Pro Glu Trp Met Thr Trp Asn Trp Arg Thr Gln Asn Asp
325 330 335
GTG CTT GAA AAT GGT GCT ATC TTT GTG GCA TCT GGG TCT GAT CCA GTG 1056
Val Leu Glu Asn Gly Ala Ile Phe Val Ala Ser Gly Ser Asp Pro Val
340 345 350
CTA ACC GCT GAA CAA AAT GCA GGC ATG ATG CAA GCT GAA CCG GGA GAT 1104
Leu Thr Ala Glu Gln Asn Ala Gly Met Met Gln Ala Glu Pro Gly Asp
355 360 365
ATG GTT CCA CAA CTC ACC ATG AAT GCA GGT GTA CTC ACA TGC TCG CCT 1152
Met Val Pro Gln Leu Thr Met Asn Ala Gly Val Leu Thr Cys Ser Pro
370 375 380
GGA GCA CCT TGC TAA GCA CCT GGC CAA TTC CTA TGC AAC GAT CAT AAA 1200
Gly Ala Pro Cys * Ala Pro Gly Gln Phe Leu Cys Asn Asp His Lys
385 390 395 400
TAC TTG CTC ACC ATA AGT GTT CAT TTG ATT AGA TTT GGA CAC GAA TGA 1248
Tyr Leu Leu Thr Ile Ser Val His Leu Ile Arg Phe Gly His Glu *
405 410 415
TGT AAC CGA TTC GTC TGA ATT ATG ATT TGT TTT GAT TCT CAG TTT CAT 1296
Cys Asn Arg Phe Val * Ile Met Ile Cys Phe Asp Ser Gln Phe His
420 425 430
AAT ATG GCT TCT TGA GAG CAA AAT TAG AGA AGA GTG TCT TTG ATC AAC 1344
Asn Met Ala Ser * Glu Gln Asn * Arg Arg Val Ser Leu Ile Asn
435 440 445
TAC ATT TTA TGG TTT TTA TAT T AA 1368
Tyr Ile Leu Trp Phe Leu Tyr
450 455






388 amino acids


amino acid


linear




protein



80
Leu Tyr Phe Thr Leu Ala Leu Val Thr Leu Val Gln Ala Gly Arg Leu
1 5 10 15
Gly Glu Glu Val Asp Ile Leu Pro Ser Pro Asn Asp Thr Arg Arg Ser
20 25 30
Leu Gln Gly Cys Glu Ala His Asn Ile Ile Asp Lys Cys Trp Arg Cys
35 40 45
Lys Pro Asp Trp Ala Glu Asn Arg Gln Ala Leu Gly Asp Cys Ala Gln
50 55 60
Gly Phe Gly Lys Ala Thr His Gly Gly Lys Trp Gly Asp Ile Tyr Met
65 70 75 80
Val Thr Ser Asp Gln Asp Asp Asp Val Val Asn Pro Lys Glu Gly Thr
85 90 95
Leu Arg Phe Gly Ala Thr Gln Asp Arg Pro Leu Trp Ile Ile Phe Gln
100 105 110
Arg Asp Met Ile Ile Tyr Leu Gln Gln Glu Met Val Val Thr Ser Asp
115 120 125
Thr Thr Ile Asp Gly Arg Gly Ala Lys Val Glu Leu Val Tyr Gly Gly
130 135 140
Ile Thr Leu Met Asn Val Lys Asn Val Ile Ile His Asn Ile Asp Ile
145 150 155 160
His Asp Val Arg Val Leu Pro Gly Gly Arg Ile Lys Ser Asn Gly Gly
165 170 175
Pro Ala Ile Pro Arg His Gln Ser Asp Gly Asp Ala Ile His Val Thr
180 185 190
Gly Ser Ser Asp Ile Trp Ile Asp His Cys Thr Leu Ser Lys Ser Phe
195 200 205
Asp Gly Leu Val Asp Val Asn Trp Gly Ser Thr Gly Val Thr Ile Ser
210 215 220
Asn Cys Lys Phe Thr His His Glu Lys Ala Val Leu Leu Gly Ala Ser
225 230 235 240
Asp Thr His Phe Gln Asp Leu Lys Met His Val Thr Leu Ala Tyr Asn
245 250 255
Ile Phe Thr Asn Thr Val His Glu Arg Met Pro Arg Cys Arg Phe Gly
260 265 270
Phe Phe Gln Ile Val Asn Asn Phe Tyr Asp Arg Trp Asp Lys Tyr Ala
275 280 285
Ile Gly Gly Ser Ser Asn Pro Thr Ile Leu Ser Gln Gly Asn Lys Phe
290 295 300
Val Ala Pro Asp Phe Ile Tyr Lys Lys Asn Val Cys Leu Arg Thr Gly
305 310 315 320
Ala Gln Glu Pro Glu Trp Met Thr Trp Asn Trp Arg Thr Gln Asn Asp
325 330 335
Val Leu Glu Asn Gly Ala Ile Phe Val Ala Ser Gly Ser Asp Pro Val
340 345 350
Leu Thr Ala Glu Gln Asn Ala Gly Met Met Gln Ala Glu Pro Gly Asp
355 360 365
Met Val Pro Gln Leu Thr Met Asn Ala Gly Val Leu Thr Cys Ser Pro
370 375 380
Gly Ala Pro Cys
385






12 amino acids


amino acid


linear




protein



81
Ala Pro Asp Gln Leu Leu Ser Thr Tyr Asn Asp His
1 5 10






7 amino acids


amino acid


linear




protein



82
Tyr Phe Phe Leu Phe Tyr Phe
1 5






17 amino acids


amino acid


linear




protein



83
Tyr Phe Ile Cys Thr Lys Val Met Glu Met Asn Leu Tyr
1 5 10
Leu Leu Val Leu
15






6 amino acids


amino acid


linear




protein



84
Lys Lys Lys Lys Pro Asn
1 5






42 amino acids


amino acid


linear




protein



85
Ala Pro Gly Gln Phe Leu Ser Phe Tyr Asn Asn His Lys Tyr Leu Phe Tyr
1 5 10 15
Phe Ile Phe Asp Ile Leu Tyr Glu Pro Leu Arg Ser Ser Thr Leu Leu
20 25 30
Thr Cys Phe Lys Phe Ile Arg Val Tyr
35 40






17 amino acids


amino acid


linear




protein



86
Tyr Phe Ile Cys Thr Lys Val Met Glu Met Asn Leu Tyr
1 5 10
Leu Leu Ser Thr
15






6 amino acids


amino acid


linear




protein



87
Ala Pro Cys Gln Phe Ser
1 5






13 amino acids


amino acid


linear




protein



88
Ala Phe Ala Met Ile Lys Asn Thr Phe Leu Phe Tyr Phe
1 5 10






15 amino acids


amino acid


linear




protein



89
Tyr Phe Ile Cys Thr Gly Asn Glu Pro Leu Pro Ser Ser
1 5 10
Thr Leu
15






26 amino acids


amino acid


linear




protein



90
Ala Pro Gly Gln Phe Leu Cys Asn Asp His Lys Tyr Leu Leu Thr Ile Ser Val
1 5 10 15
Leu Ile Arg Phe Gly His Glu
20 25






5 amino acids


amino acid


linear




protein



91
Cys Asn Arg Phe Val
1 5






14 amino acids


amino acid


linear




protein



92
Ile Met Ile Cys Phe Asp Ser Gln Phe His Asn Met Ala Ser
1 5 10






14 amino acids


amino acid


linear




protein



93
Arg Arg Val Ser Leu Ile Asn Tyr Ile Leu Trp Phe Leu Tyr
1 5 10







Claims
  • 1. A method of treating sensitivity to a protein allergen in an individual sensitive to the allergen, comprising administering to the individual an effective amount of at least one isolated peptide having an amino acid sequence derived from the amino acid sequence of a ragweed protein allergen, said peptide comprising at least one T cell epitope recognized by a T cell receptor specific for the ragweed protein allergen, wherein the ragweed protein allergen comprises an amino acid sequence selected from the group consisting of SEQ ID NOS:72, 74, 76, and 78, and treating sensitivity to the ragweed protein allergen in an individual sensitive to the allergen.
  • 2. The method of claim 1 wherein the peptide is approximately 24 amino acid residues or less in length.
  • 3. The method of claim 2 wherein the peptide is approximately 33 amino acid residues in length.
  • 4. The method of claim 1 wherein the peptide is approximately 27 amino acid residues in length.
  • 5. The method of claim 1 wherein the peptide is approximately 14 amino acid residues in length.
  • 6. The method of claim 1 wherein the peptide is administered by injection.
  • 7. The method of claim 1 wherein the peptide is administered orally.
  • 8. The method of claim 1 wherein the peptide is administered by inhalation.
  • 9. The method of claim 1, further comprising administering to the individual an effective amount of at least two peptides, wherein each peptide has an amino acid sequence derived from the amino acid sequence of a ragweed protein allergen, each peptide comprising at least one T cell epitope recognized by a T cell receptor specific for the ragweed protein allergen, wherein the ragweed protein allergen comprises an amino acid sequence selected from the group consisting of SEQ ID NOS:72, 74, 76, and 78.
  • 10. The method of claim 1 wherein the peptide is approximately 10 amino acid residues in length.
Parent Case Info

This application is a divisional application of Ser. No. 08/290,448 filed on Aug. 15, 1994, now U.S. Pat. No. 5,698,204 which in turn is a continuation application of Ser. No. 07/529,951 filed on May 29, 1990, abandoned, which in turn is a continuation-in-part application of Ser. No. 07/325,365 filed on Mar. 17, 1989, abandoned. The contends of all of the aforementioned application(s) are hereby incorporated by reference.

US Referenced Citations (1)
Number Name Date Kind
4338297 Michael et al. Jul 1982 A
Foreign Referenced Citations (1)
Number Date Country
9011293 Oct 1990 WO
Non-Patent Literature Citations (28)
Entry
Hurtenbach et al J Exp Medicine vol. 177 1499-1504, 1993.*
Ashton-Rickardt et al Cell vol. 76 651-663, Feb. 1994.*
Livingstone el at Ann Rev immunol vol. 5 477-501, 1987.*
Goodfriend, L. et al. (1985) “RA-5G A Homologue of RA-5 in Giant Ragweed Ambrosia-trifida pollen isolation Hla-dr-associated activity and amino-acid sequence” Molecular Immunology 22(8):899-906.
Ishizeka, K. et al. (1975) J. of Immunology 114(1):110-115.
King, T.P. and P.S. Norman (1962) Biochemistry 1(4):709-720.
King, T.P. et al. (1964) Isolation and characterization of allergens from ragweed pollen. II. Biochemistry 3(3):458-468.
King, T.P. et al. (1981) “Limited Proteolysis of Antigens E and K from Ragweed Pollen” Arch. Biochem. Biophy. 212(1):127-135.
King, T.P. et al. (1974) “Chemical modification of the major allergen of ragweed pollen, antigen E. Immunochemistry” Immunochemistry 11:83-92.
King et al. (1981) “Limited Proteolysis of Antigens E and K from Ragweed Pollen” Archives of Biochemistry and Biphysics, 212(1):127-135.
King T. P. (1972) “Separation of Proteins by Ammonium Sulfate Gradient Solubilization” Biochemistry 11(3):367-371.
Lamb et al. (1983) “Induction of Tolerance in Influenza Virus-Immune T Lymphocyte Clones with Synthetic Peptides of Influenza Hemagglutinin” J. Exp. med. 157:1434-1447.
Lerner, R.A. (1982) “Tapping the immunological repertoire to produce antibodies of predetermined specificity” Nature 299:592-596.
Litwin, A. et al. (1988) International Archives of Applied Immunology, 87:361-366.
Litwin et al. (1991) “Regulation of the Human Immune Response to Ragweed Pollen by Immunotherapy. A Controlled Trial Comparing the Effect of Immunosuppressive Peptic Fragments of Short Rageweed with Strandard Treatment” Clinical and Experimental Allergy 21:457-465.
Lowenstein, H. et al. (1981) “Antigens of ambrosia-elatior short ragweed pollen 2. immunochemical identification of known antigens by quantitative immuno electrochemical identification of known antigens by quantitative immunoelectrophoresis” Immunology 127(2):637-642.
Marsh, D.G. et al. (1987) “Immune Responsiveness to ambrosia-Artemisiifolia short ragweed pollen allergen AMB-A-VI RA6 is associated with HLA-DR5 in allergic Humans” Immunogenetics 26:230-236.
Marsh et al. (1986) “Allergen Nonmenclature” Terminology, Terminologie 64(5):767-770.
Marsh et al. (1988) “Allergen nomenclature” Allergy 43:161-168.
Michael et al. (1990) “Modulation of the Immune Response to Ragweed Allergens by Peptic Fragments” Clinical Experimental Allergy 20:669-674.
Muckerheide, A. et al. (1980) Cellular Immunology 50:340-347.
Olson, J.R. and D.G. Klapper (1986) “Two Major Human Allrgenic Sites on Ragweed Pollen Allergen Antigen E Identified By Using Monoclonal Antibodies” J. of Immunology 136(6):2109-2115.
Olson et al. (1986) “Two Major Human Allergenic sites on ragweed Pollen Allergen Antigen E Identified By Using Monoclonal Antibodies” J. of Immunol. 136(6):2109-2115.
Paull et al. (1979) “Structure and activity of ragweed antigen E” The Journal of Allergy and Clinical Immunology 64(6)(1):539-545.
Scherer, M.T., et al. (1989) Cold Spring Harbor Symposia on Quantitative Biology vol. LIV, Cold Spring Harbor Laboratory Press, pp. 497-504.
Smith, J.J. et al. (1988) Molecular Immunology 25(4):355-365.
Takatsu et al. (1975) “Immunogenic Properties of Modified Antigen E” Journal of Immunology 115(6):1469-1476.
Young et al. (1983) “Efficient isolation of genes by using antibody probes” Proc. Natl. Acad. Sci. USA 80:1194-1198.
Continuations (1)
Number Date Country
Parent 07/529951 May 1990 US
Child 08/290448 US
Continuation in Parts (1)
Number Date Country
Parent 07/325365 Mar 1989 US
Child 07/529951 US