Randomly generated glycopeptide combinatorial libraries

Information

  • Patent Application
  • 20020193563
  • Publication Number
    20020193563
  • Date Filed
    April 27, 2001
    23 years ago
  • Date Published
    December 19, 2002
    21 years ago
Abstract
Randomly generated glycopeptide combinatorial libraries are generated by randomly glycosylating a peptide having at least one glycosylation site with at least one glycosyl donor, optionally blocking unreacted glycosylation sites on the glycopeptides and optionally selectively removing one or more protecting groups on the carbohydrate groups introduced at the first level; whereby a first level library of glycopeptides is created; and then optionally randomly glycosylating said first level library of glycopeptides, or a combination of first level libraries of glycopeptides, with at least one glycosyl donor, and optionally selectively removing one or more designated protecting groups on the carbohydrate groups introduced at the second level; whereby a second level library of glycopeptides is created. Further iterations of the process result in higher level libraries of increased diversity. The glycopeptide libraries including, e.g., carcinoma-associated mucins such as MUC1, are screened for drug-like, competitive inhibitory, immunostimulatory, antibody-like, and other biological activities.
Description


BACKGROUND OF THE INVENTION

[0001] The present invention relates to a method for generating a combinatorial library of glycopeptides, and to glycopeptide libraries produced by the method. More particularly, it relates to a method for generating a combinatorial library of a cancer-associated mucin.


[0002] Glycopeptides are a broad class of organic compounds that are important in diverse biochemical processes, including cell growth regulation, binding of pathogens to cells, intercellular communication, and metastasis. Biosynthetically, the carbohydrates of glycoproteins are attached (co- or post-translationally) by glycosyltransferase enzymes, each enzyme being specific for a particular monosaccharide unit and linkage type. Mono- and oligo-saccharides are transferred to proteins in the endoplasmic reticulum and linked to the NH2 group on the side chain of an asparagine residue of the protein, to form N-linked oligosaccharides. Mono- and oligosaccharides also may be linked to the OH group on the side chain of a serine, threonine, or hydroxylysine residue, to form O-linked oligosaccharides.


[0003] Different glycoforms of the same protein are regularly found, which differ in the carbohydrate structures that are attached. The glycoforms vary in properties such as protease stability, affinity to receptors, and pharmacokinetic profile. To study the influence of glycosylation patterns on the properties of a glycopeptide, and to identify compounds useful in diagnosis and/or therapy, it is necessary to have access to several different glycoforms. The most generally applied approach for obtaining defined glycopeptide fragments is chemical synthesis using glycosylated amino acids, although enzymatic glycosylation using glycosyltransferases has also been used.


[0004] Chemical synthesis of serine and threonine with large O-linked carbohydrate structures as building blocks for glycopeptide synthesis is much more difficult than synthesis of either oligopeptides or oligosaccharides. For example, a number of mono-, di- and trisaccharides have been identified on the core of the carcinoma-associated MUC1. Chemical synthesis of the α-O-linked N-acetylgalactosamine-based mucin-type glycopeptides depends on the accessibility to large amounts of O-glycosylated amino acids. Sequential glycosylations of serine and threonine for Fmoc-based glycopeptide synthesis involves a complex manipulation of the selectivities of base-sensitive protecting groups. Fmoc-protected serine and threonine are among the more highly sensitive and hindered aglycons used in glycosylation reactions. It is virtually-impossible to synthesize chemically all possible combinations of glycopeptides and test them against a natural anti-mucin antibody or polyclonal serum.


[0005] Combinatorial methods have gained great interest as a method of finding desirable compounds. These methods involve creating libraries of related compounds. Interaction of an antigen with the library is then measured, in order to assess whether one or more compounds in the library recognizes the antigen. The use, of combinatorial chemistry to synthesize large numbers of molecules, either as individual compounds or as mixtures, is considered to be one of the frontiers of organic chemistry applied to drug discovery.


[0006] To date, combinatorial chemistry has been applied to the generation of peptide and oligonucleotide libraries, where the well established chemistry of amide bond and phosphodiester bond formation led to rapid progress. Combinatorial chemistry has not been applied, however, to the generation of glycopeptide libraries. Glycosylation of the amino group on the side chain of an asparagine residue of the protein, to form N-linked oligosaccharides, may occur at any one of the three or four hydroxyl groups on the saccharide. For both N-linked and O-linked oligosaccharides, a new stereocenter is formed on glycosylation. This results in either an α- or β-glycosidic linkage, which are usually axial and equatorial, respectively.


[0007] A need therefore exists for a method of generating a combinatorial library of glycopeptides. Such a library could be used to screen for biological activity of different glycoforms within the library.



SUMMARY OF THE INVENTION

[0008] It is therefore an object of the present invention to provide a combinatorial method for the generation of glycopeptide libraries.


[0009] It is a further object of the invention to provide a random library of glycopeptides.


[0010] It is another object of the invention to provide a method of identifying glycopeptides that have a defined biological activity by screening a random library of glycopeptides for the biological activity.


[0011] It is yet another object of the invention to provide a method of generating a combinatorial library of mucins.


[0012] It is another object of the invention to provide a method of generating a combinatorial library of MUC1 glycopeptides.


[0013] These and other objects of the invention are achieved by a method of generating a glycopeptide library, comprising (a) randomly glycosylating a platform having at least one glycosylation site with at least one glycosyl donor, optionally blocking unreacted glycosylation sites on the glycosylated platforms and optionally selectively removing one or more protecting groups on the carbohydrate groups introduced at the first level; whereby a first level library of glycosylated platforms is created; and then (b) optionally randomly glycosylating said first level library of glycosylated platforms, or a combination of first level libraries of glycosylated platforms, with at least one glycosyl donor, and optionally selectively removing one or more designated protecting groups on the carbohydrate groups introduced at the second level; whereby a second level library of glycosylated platforms is created. The method may further comprise randomly glycosylating the second level library of glycosylated platforms, or a combination of second level or first and second level libraries of glycosylated platforms, with at least one glycosyl donor, and optionally selectively removing one or more designated protecting groups on the carbohydrate groups introduced at the third level; whereby a third level library of glycosylated platforms is created; and optionally repeating the foregoing step to produce fourth and higher level libraries of increased diversity.


[0014] In a preferred embodiment, the randomly-generated glycopeptide library comprises carcinoma-associated mucins or structures associated with adhesion ligands for bacterial receptors that are expressed on human cell surface antigens. Components of the library can be screened for drug-like, competitive inhibitory, immunostimulatory or antibody-like activity.


[0015] Other objects, features and advantages of the present invention will become apparent from the following detailed description. It should be understood, however, that the detailed description and the specific, examples, while indicating preferred embodiments of the invention, are given by way of illustration only, since various changes and modifications within the spirit and scope of the invention will become apparent to those skilled in the art from this detailed description.







BRIEF DESCRIPTION OF THE DRAWINGS

[0016]
FIG. 1 shows a linear platform for combinatorial library synthesis according to the present invention.


[0017]
FIG. 2 is an example of a cyclic platform for combinatorial library synthesis according to the present invention.


[0018]
FIG. 3 shows a simple cyclic peptide and solubilized version which contains a lipid chain.


[0019]
FIG. 4 shows a platform with unnatural glycosylation sites.


[0020]
FIG. 5 is an example of a hybrid platform which does not include peptide linkages.


[0021]
FIG. 6 is an example of a cyclic peptide for random glycosylations, in which solubility is enhanced by introduction of hydrophobic groups.


[0022]
FIG. 7 shows carbohydrate structures found on cancer mucins.


[0023]
FIG. 8 is a bar graph of the results of screening of a GSTA library.







DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

[0024] In accordance with the present invention, it is possible to generate large glycopeptide libraries by random glycosylation of a selected peptide or peptide-like structure that has at least one glycosylation site. Random glycosylation of a peptide or peptide-like structure with more than one glycosylation site yields a large library of all possible combinations of glycosylation since each site. of glycosylation is unique for a given sequence. The random libraries provide a fast and efficient way of screening for drug-like, competitive inhibitory, immunostimulatory, antibody-like and other biological activities. The size of the random library of glycopeptides depends on the extent of glycosylation in terms of number of sites and the variety of carbohydrate structures that are added.


[0025] The “platform” or core used as the basis for the glycopeptide library is any peptide or peptide-like structure, including unnatural, synthetic structures, and may contain tandem repeats. The platform includes one or more glycosylation sites, which may be natural or unnatural. A platform with unnatural glycosylation sites is shown in FIG. 4. Preferably there are from 2 to 5 glycosylation sites on the peptide or tandem repeat, and more preferably 2 or 3 glycosylation sites. In a preferred embodiment, each glycosylation site on a platform is unique and distinguishable from other sites due to distinct structural features in the vicinity of the site.


[0026] Where the platform is a peptide-like structure, it does not necessarily contain peptide linkages, but may comprise any structure with glycosylation sites to which carbohydrate structures may be attached. For example, a platform may be a “hybrid” platform comprising a nonpeptide polymer or even a chain of carbon atoms to which natural amino acid side chains with natural glycosylation sites are attached. An example of a hybrid platform in shown in FIG. 5.


[0027] The glycosylation sites provide hydroxy functions for O-glycosylation and/or natural carboxy or carboxamido functional groups for N-glycosylation. Preferred glycosylation sites include one or more of serine, threonine and hydroxylysine, the hydroxyl group of which provides O-glycosylation sites, or asparagine, the amino group of which provides an N-glycosylation site.


[0028] Glycosylation sites my be of either d- or 1-optical configuration, although it is preferable that the glycosylation sites consist entirely of d-optical configuration. It is more particularly preferred that the entire platform be constructed of d-amino acids.


[0029] The platform may be linear (FIG. 1) or cyclic (FIG. 2), and may carry UV-active or fluorescent labels to aid in detection during a process of screening a glycopeptide library produced using the platform. Hydrophobic amino acids preferably are incorporated in the platform, as shown in FIG. 6, in order to increase the solubility of the platform in the organic solvents used to promote glycosylation reactions. In a preferred embodiment, glycosylation sites are spaced, singly or in clusters, between sequences that include hydrophobic amino acids such as alanine, phenylalanine, valine, leucine and isoleucine or unnatural hydrophobic amino acids. Lipid chains also can be incorporated into the platform to aid in the coating of microtiter plates, as shown in FIG. 3.


[0030] The peptide of the platform may be isolated or synthesized by any known method. The peptide may have free groups at the glycosylation binding sites, or it may be partially blocked, so that only certain glycosylation sites are available. The blocking groups may be introduced selectively during the synthesis of the platform, or may be introduced at any time during a series of glycosylation reactions on the platform. The blocking sites also may be selectively removed during any step of the process.


[0031] Carbohydrate structures are randomly introduced or arranged on the platform to generate a diverse glycopeptide library. Carbohydrate structures may include or be derived from glycoproteins, proteoglycans and glycolipids found on both normal and malignant cells. Carbohydrate structures may be pre-designed and restricted in number for more efficient screening of the resulting combinatorial library for specific purposes. Carbohydrate structures may be unnatural, i.e., a random combination of monosaccharides linked together with a mix of α- and β-linkages.


[0032] Glycosyl (carbohydrate) donors are selected based on knowledge of the carbohydrate moieties contained in a glycopeptide of interest. One preferred group of carbohydrate structures includes those structures known to be adhesion ligands for bacterial receptors that are expressed on human cell surface antigens. Another preferred group of carbohydrate structures include those known to be associated with malignant cell antigens.


[0033] For example, N-acetylgalactosamine O-linked to serine or threonine is known as carcinoma-associated Tn antigen, while the disaccharide beta βGal(1-3) αGalNAc O-linked to serine or threonine in mucins is commonly known as carcinoma associated Thomsen-Friedenreich (TF) antigen. The disaccharide is also found on the cell surface of chronic and acute myelogenous leukemic leukocytes. Sialylated versions of TF, particularly α2-6-TF, have been found on the mucins expressed by human breast cancer cell lines. The structure and synthesis of TF family glycosyl donors is described by Qiu et al., Tetrahedron Letters, 37:595-585 (1996). (The contents of this document, and all other documents specifically cited herein, are incorporated in this disclosure in their entirety by reference.) Examples of carbohydrate structures found on cancer mucins are found in FIG. 7.


[0034] Thus, when the desired glycopeptides are carcinoma-associated mucins, glycosyl donors used to glycosylate the core peptide include galactosamine, N-acetylgalactosamine, and sialyl. When generating a combinatorial library for glycopeptides other than Tn and TF glycopeptides, other glycosyl donors are employed.


[0035] The number of glycosyl donors at each level preferably is at least 2, and more preferably at least 3. Typically, more donors are used at levels beyond first level than are used in generating the first level library. It may be preferred in some instances to limit the size of a library by limiting the number of donors at each level to less than 5. Synthesis of glycosyl donors is well known in the art.


[0036] Glycosyl donors can be designed, in accordance with the present invention, to yield only particular carbohydrate structures of interest. In this regard, a glycosyl donor may be designed with protecting groups, such as 4,6-benzylidene, to favor formation of a particular carbohydrate structure. See, e.g., Qiu et al., supra, and Yule et al., Tetrahedron Letters, 36:6839-6842 (1995), and Broddefalk et al., Tetrahedron Letters, 17:3011-3014 (1996). The glycosyl donors, as well as glycosylation sequence, may be selected based on a predetermined assessment of the nature and number of established carbohydrate structures for a selected glycopeptide.


[0037] Glycosyl donors are reacted with a core peptide according to methods well known in the art. See, e.g., Qiu et al. and Yule et al., supra, as well as Kunz, Angew. Chem. Int. Ed. Engl. 26:294-308 (1987) and Garg et al., Advances in Carbohydrate Chemistry and Biochemistry, 50:277-310 (1994). A first level library of a desired glycopeptide is created by primary glycosylation of the peptide with a single glycosyl donor or a mixture of donors. Reaction of a core peptide with a glycosyl donor, or mixture of donors, results in a library of randomly glycosylated glycopeptides. A first level library can be used in a screening process, as described below, or can form the basis for generating higher level libraries.


[0038] A second level library is created by reacting one or more first level libraries with one or more further glycosyl donors. Prior to further reaction, unreacted glycosylation sites on the peptides may be blocked, e.g., by acetylation, in order to prevent these glycoforms from being eliminated from the library by being converted into different glycoforms. Following purification, the protecting groups of the carbohydrate structures on the glycoforms are selectively removed to create additional glycosylation sites on the existing carbohydrate structures. Random glycosylation with these additional donors further extends existing carbohydrate structures, thereby to create more complex glycopeptide structures. Higher level libraries are similarly created by reacting one or more second level or higher libraries with one or more further glycosyl donors.


[0039] The total number of possible glycoforms contained in a combinatorial library of glycopeptides according to the invention can be calculated by the following formula:


# of glycoforms=(x+1)n


[0040] where x is the number of glycosyl donors used and n is the number of glycosylation sites. For example, the following table shows the number of glycoforms that are obtained when up to five glycosyl donors and up to five glycosylation sites are used:
1TABLE 1# ofcarbohydrate structures (x)sites12345123456249162536382764125216416812566251296532243102431257776


[0041] It is possible to selectively control the size and complexity of the glycopeptide libraries in a number of ways. The size and complexity of a random library of glycopeptides depend on the extent of glycosylation, which in turn depends on the number of glycosylation sites on the core peptide and on the number of glycosyl units that are added. Libraries with various combinations of glycoforms can be achieved by mixing lower level libraries before glycosylation, by mixing donors at each level, and by using mixed lower level libraries in combination with mixed donors. Sites of further glycosylation can be controlled by protecting unreacted gylcosylation sites on the peptide, thereby preserving those structures in the library.


[0042] The glycopeptide libraries can be used to screen for biologically active compounds. By screening a number of libraries, each with different combinations of carbohydrate structures on the core peptide, it is possible to identify which structures have drug-like, competitive inhibitory, immunostimulatory, antibody-like, and many other biological activities. A library according to the present invention can be screened for compounds that have anti-bacterial activity, including compounds that competitively inhibit bacterial adhesion to a host cell. The library also may be screened for compounds that have anti-viral activity. In a preferred embodiment libraries are generated and screened to develop highly sensitive diagnostic antibodies and antigens for the detection and immunotherapy of cancers.


[0043] Methods of screening for biological activities using combinatorial libraries are known in the art. U.S. Pat. Nos. 5,510,240 and 5,541,061 provide examples of methods of screening combinatorial libraries, but any suitable method of screening for biological activity using combinatorial libraries may be used in accordance with the present invention. For example, an ELISA of various antibodies binding either Ovine Submaxillary Mucin (OSM) or CA27.29 as the solid phase may be used in an inhibition format with components from the libraries as inhibitors.


[0044] In a preferred embodiment, the peptides derived from a cancer-associated mucin, and is in particular a MUC1 core protein. The MUC1 tandem repeat derived sequence GVTSAPDTRPAPGSTA, contains five O-glycosylation sites, two serines and three threonines, and is an example of a peptide that can be glycosylated according to the present invention to create a glycopeptide library. If all possible glycosylation sites in a tandem repeat are used only once in primary glycosylation with N-acetylgalactosamine (Tn antigen), five different monoglycosylated tandem repeats result, but if glycosylation is randomized between 0 and 5 sites, there are 32 different combinations of glycosylated tandem repeats. If 0 to 5 sialic acids are then randomly added at the 6-position of the existing N-acetylgalactosamines, the possible number of glycoforms increases to 243. These will carry only combinations and varied numbers of Tn and STn. If another donor is added at each glycosylation, e.g., TF along with the first and GlcNAc along with the second, a total of 16807 glycosylation variants of MUC1 tandem repeat will be produced. This library will constitute more than 90% of all truncated versions (core structures) that may be associated with cancerous MUC1 mucin. These are useful as vaccine components.


[0045] A simpler tetrapeptide, GSTA,has only two O-glycosylation sites and is useful in demonstrating the method according to the invention, as shown in the following examples. The protecting groups on the glycosyl donors. and the sequence of glycosylation reactions are designed to yield only the established carbohydrate structures possible in mucin biosynthesis. The “rules” observed by the glycosyltransferases in creating carbohydrate structures on mucins are shown in Table 2, wherein GalNAc is N-acetylgalactosamine, GlcNAc is N-acetylglucosamine, and Gal is galactose. Sialic acid is another name for N-acetylneuraminic acid. It is notable that the N-acetylgalactosamine-based glycosyl donors form only alpha linkages with serine and threonine hydroxyls.


[0046] The library of GSTA glycopeptides modelled on naturally-existing mucins, is small enough that the components can be characterized by mass spectrometry. It is therefore very useful in gaining a precise understanding of glycosylation patterns of the MUC1 core protein, which is necessary in order to design effective therapeutic vaccines and diagnostic tools.
2TABLE 2DonorLinkageSiteExampleGalNacalphaserine, threonineTn, Core 6and 3-0 GalNacGlcNAcbeta6-0 GalNac, 3-0 GalF1 alphaSialic acidalpha6-0 GalNac, 3-0 GalSTn, STFGalbeta3-0 GalNac, 4-0 GlcNAcTF, F1 alpha


[0047] The following examples illustrate generation of a library of GSTA glycopeptides according to the present invention, but do not limit the scope of the invention in any way. Further aspects and variations of the invention, based on the disclosure above and the following examples, will be apparent to the person of ordinary skill in the art.



EXAMPLES


Example 1


Synthesis of Protected Peptides of GSTA

[0048] GSTA is a four amino acid residue of MUC1, which has two unique sites for glycosylation, the serine residue (S) and the threonine residue (T). It is manually synthesized in solution with N-terminal Fmoc and C-terminal benzyl, with serine and threonine hydroxyls free.


[0049] Glycosyl donors N-acetylgalactosamine (Tn antigen) and βGal(1-3) αGalNAc (carcinoma associated Thomsen-Friedenreich, or TF antigen) are protected with 4,6-benzylidenyl protecting groups. Synthesis of protected peptides of GSTA is shown in Reaction Scheme I.


[0050] Preparation of Compound 1


[0051] A mixture of L-alanine, 20 g, 0.22 mol and toluene-4-sulphonic acid, 53 g, 0.28 mol in 100 mL of benzyl alcohol and 150 ml of benzene were refluxed overnight using a Dean stark apparatus. Benzene was removed in vacuo and 250 ml of ether was added to give compound 1, a solid, 75 g, 78%. [α]D−5.2 (c0.25, MeOH); 1H-NMR (300 MHz, CDCl3), 8=8.25 (m, 3H, TSOH, 2 NH2), 7.00-7.70 (m, 9H, Ar—H), 5.08, 4.98 (2 d, 2H, J=12.5 Hz, CH2), 4.05 (dd, 1H, J=7.5, 15.0 Hz, Ala-(x-H), 2.30 (s, 3H, PhCH3), 1.43 (d, 3H, J=7.0 Hz, CH3).


[0052] Preparation of Compound 3


[0053] To a solution of N-Boc-L-threonine 2, 10 g, 45.6 mmol in 80 mL of dry THF was added 3.5 mL of N-methyl morpholine and 4.5 mL of i-butyl chloroformate at −20° C. under nitrogen. After stirring at −20° C. for 5 min., a solution of compound 1, 15 g, 57 mmol and 3.5 ml of N-methyl morpholine in 40 ml of dry THF and 5 ml of dry DMF was added. After stirring at −20° C. for 30 min., 10 mL of methanol was added and the solvent was removed in vacuo. The residue was purified by silica gel column using ethyl acetate/hexane (1:1) to give compound 3, a white powder after freeze drying from dioxane, 12.5 g, 71%. [α]D−39.2 (c0.25, MeOH); 1H-NMR (300 MHz, CDCl3), δ=7.30-7.50 (m, 5H, Ar—H), 7.10 (d, 1H, J=7.0 Hz, NH), 5.56 (d, 1H, J=8.0 Hz, NH), 5.20, 5.14 (2 d, 2H, J=12.0 Hz, CH2Ph), 4.60 (m, 1H, Ala-α-H), 4.30 (m, 1H, Thr-β-H), 4.14 (dd, 1H, J=2.0, 7.5 Hz, Thr-(x-H), 3.46 (m, 1H, OH), 1.45 (s, 9H, 3 CH3), 1.41 (d, 3H, J=7.5 Hz, CH3), 1.18 (d, 3H, J=6.5 Hz, CH3)


[0054] Preparation of Compound 4


[0055] A solution of compound 3, 8.7 g, in 50 mL of formic acid was stirred at room temperature for 5 hours. Formic acid was removed in vacuo and ethyl acetate was added to the residue to give compound 4, a solid, 5.3 g, 71%. [α]D−66.8 (c0.25, MeOH); 1H-NMR (300 MHz, CDCl3+CD3OD), δ=8.42 (s, 1H, HCOOH), 7.30-7.50 (m, 5H, Ar—H), 7.10 (d, 1H, J=7.0 Hz, NH), 5.56 (d, 1H, J=8.0 Hz, NH), 5.17, 5.09 (2 d, 2H, J=12.0 Hz, CH2Ph), 4.50 (dd, 1H, J=7.5, 15.0 Hz, Ala-(α-H), 3.89 (m, 1H, Thr-α-H), 3.48 (d, 1H, J=7.5 Hz, Thr-(X-H), 1.41 (d, 3H, J=7.5 Hz, CH3), 1.23 (d, 3H, J=6.5 Hz, CH3); ES-MS. Calc. for C14H20O4N2, 280.3; Found, 279.4.


[0056] Preparation of Compound 5


[0057] A solution of N-Fmoc-L-glycine, 100 g, 0.336 mol and DCC, 139 g, 0.67 mol in 1000 mL of dry ethyl acetate was stirred at 5° C. for 15 min. A solution of N-hydroxysuccinimide, 138.8 g, 0.67 mol in 1000 mL of dry ethyl acetate was added. After stirring at 5° C. for 2 hours, the solid was filtered and washed with ethyl acetate (3×50 mL). The solvent was removed in vacuo and the residue was purified by crystallization from hexane and ethyl acetate to give compound 5, a solid, 83.2 g, 63%. [α]D+6.0(c0.25, MeOH); 1H-NMR (300 MHz, CDCl3), δ=7.30-7.80 (m, 8H, Ar—H), 5.42 (t, 1H, J=5.5 Hz, NH), 4.43(d, 2H, J=7.0 Hz, CH2 on Fmoc), 4.37 (d, 2H, J=6.0 Hz, Ala-(α-H), 4.23(t, 1H, J=7.0 Hz, CH on Fmoc), 2.84 (s, 4H, 2 CH2)


[0058] Preparation of Compound 7


[0059] To a solution of L-serine, 20 g, 0.18 mol and sodium bicarbonate, 16 g in 150 mL of water was dropped compound 5, 56 g, 0.142 mol in 600 ml of DME. The mixture was stirred at room temperature overnight. DME was removed in vacuo and the 5% of aqueous HCl was added to adjust pH=2. The water solution was extracted with ethyl acetate (3×200 mL). The organic layer was washed with water and dried over Na2SO4. The solvent was removed in vacuo and the residue was purified by silica gel column using ethyl acetate/acetic acid (9:1) to give compound 7, a white powder after freeze drying from dioxane, 35 g, 62%. [α]D+9.6 (c0.50, MeOH); 1H-NMR (300 MHz, CDCl3+CD3OD), δ=7.30-7.80 (m, 8H, Ar—H), 4.61 (t, 1H, J=4.0 Hz, ser-α-H), 4.45 (d, 2H, J=7.0 Hz, CH2 on Fmoc), 4.30 (t, 1H, J=7.0 Hz, CH on Fmoc), 4.03 (dd, 1H, J=4.0, 11.5 Hz, ser-α-H), 3.90-4.08 (m, 3H). ES-MS: Calc. for C20H20O7N2, 384.3; Found, 384.3.


[0060] Preparation of Compound 8


[0061] To a solution of compound 7, 1.7 g, 4.25 mmol in 20 mL of dry THF was added 0.44 mL of N-methylmopholine and 0.52 mL of i-butylchloroformate at −20° C. under nitrogen. After stirring at −20° C. for 5 min., a solution of compound 3, 1.12 g, 4 mmol and 0.44 mL of N-methylmorpholine in 10 mL of dry THF and 5 mL of dry DMF was dropped in 5 min. After stirring at −20° C. for 30 min., 5 mL of methanol was added and the solvent was removed in vacuo. The residue was purified by silica gel column using hexane/ethyl acetate/methanol (10:10:1) to give compound 8, a white powder after freeze drying from dioxane, 1.7 g, 66%. [α]D−36.5 (c0.2, MeOH:CHCl3=4:1); 1H-NMR (300 MHz, CDCl3÷+CD3OD), δ=7.30-7.80 (m, 13H, Ar—H), 4.99, 5.05 (2d, 2H, J=12.5 Hz, CH2Ph), 4.34-4.45 (m, 2H), 4.24-4.32 (m, 3H), 4.08-4.20 (m, 2H), 3.78-3.85 (m, 2H), 3.75 (d, 2H, 7.0 Hz, CH2), 3.53-3.62 (m, 1H), 1.28 (d, 3H, J=7.0 Hz, CH3), 1.06 (d, 3H, J=6.0 Hz, CH3). ES-MS; Calc. for C44H37O9N4; 645.67. Found; 646.0 (M+H), 669.3 (M+Na).



Example 2


Synthesis of Donors for First Level Library

[0062] Synthesis of donors for the first level of the library is summarized in Reaction Scheme II.


[0063] Preparation of Compound 9


[0064] A mixture of N-acetyl-D-galactosamine, 20 g, 90.4 mmol, 20 mL of benzaldehyde dimethyl acetal and 200 mg of p-toluenesulfonic acid in 500 mL of dry acetonitrile was stirred at 60° C. for 5 hours. After cooling to room temperature, the solid was filtered, washed with CH2Cl2 (3×20 mL) and dried in vacuo to give compound 9, a solid, 22 g, 88%. [α]+133.0 (c1.0, H2O). 1H-NMR (DMSO-d6): δ=7.35-7.71 (m, Ar—H), 5.62 (s, 1H, CHPH), 5.12 (d, 1H, J=3.0 Hz, H-1), 4.21 (bd, 1H, J=3.5 Hz, H-4), 4.12 (dd, 1H, J=3.5, 11.0 Hz, H-2), 4.06 (m, 2H, H-6), 3.92 (m, 1H, H-3), 3.85 (m, 1H, H-5), 1.90 (s, 3H, NAc). 13C-NMR: δ=99.7 (CHPh), 91.4 (C-1).


[0065] Preparation of Compound 10


[0066] To a solution of compound 9, 20 g, 64.7 mmol in 150 mL of dry pyridine was dropped 8.9 mL of benzoyl chloride in 40 mL of CH2C12 at −25° C. under argon in 30 min. and stirred for 2 hours. After adding 5 mL of ethanol the solvent was removed in vacuo. Aqueous workup (CH2Cl2)and recrystallization from ethyl acetate gave compound 10, a solid, 16 g, 62%. [α]D+182.0 (cl, MeOH); 1H-NMR: 87.30-8.10 (m, Ar—H), 6.05 (s, 1H, NH), 4.00-5.40 (m), 1.88 (s, 3H, NAc).


[0067] Preparation of Compound 11


[0068] To a solution of compound 10, 10 g, 24.9 mmol in 100 mL of dry CH2Cl2 was added 7 mL of trichloroacetonitrile and 0.5 mL of DBU. The solution was stirred for 2 hours at room temperature and solvent was removed in vacuo. The residue was purified by silica gel column using hexane/ethyl acetate(1:1) to give compound 11, a white powder after freeze drying from benzene, 11.5 g, 85%. [60 ]D+151.0 (c1.0, CHCl3). 1H NMR: δ=8.85 (s, 1H, NH), 7.408.10 (m, Ar—H), 6.75 (d, 1H, J=3.5 Hz, H-1), 5.65 (d, 1H, J=9.0 Hz, NH), 5.60 (s, 1H, CHPH). 5.50 (dd, 1H, J=3.5 Hz, H-3) , 4.00-5.15 (m), 1.90 (s, 3H, NAc)


[0069] Preparation of Compound 12


[0070] To a solution of N-acetyl galactosamine, 50 g, 0.226 trial in 100 ml of dry allyl alcohol was dropped 10 N HCl in 50 ml of THF. After stirring at 50° C. overnight the solvent was removed in vacuo and the residue was purified by recrystallization from ethanol to give compound 12, a solid, 35 g, 59% −[α]D+165.00 (c1.0, H2O). 1H NMR (D20): δ=6.00 (m, 1H, CH=), 5.30 (m, 2H, =CH2), 4.95 (d, 1H, J=3.5 Hz, H-1), 3.70-4.30 (m), 2.01 (s, 3H, NAc).


[0071] Preparation of Compound 13


[0072] A mixture of compound 12, 35 g, 0.134 mol, 62 mL of benzaldehyde dimethyl acetal and 350 mg of p-toluenesulfonic acid in 600 mL of dry acetonitrile was stirred at 60° C. for 4 hours. The solvent was removed in vacuo and the residue was purified by recrystallization from ethanol to give compound 13, a solid, 29 g, 62%. [α]D+155.0 (c1.0, MeOH). 1H NMR: 8=7.30-7.60 (m, Ar—H), 5.80 (m, 1H, CH═), 5.57 (s, 1H, CHPh), 5.26 (m, 2H ═CH2), 5.00 (d, 1H, J=3.5 Hz, H-1), 3.70-4.55 (m), 2.90 (d, 1H, J=10.0 Hz, OH), 2.04 (s, 3H, NAc).


[0073] Preparation of Compound 14


[0074] A solution of 13, 20 g, 57.3 mmol, 40 g of tetra-O-acetyl-bromo-(α-D-galactopyranoside and Hg(CN)2, 24 g in 50 mL of dry benzene/50 mL of dry nitromethane were stirred at 50° C. overnight under argon. Tetra-O-acetyl-bromo-(α-D-galactopyranoside (25 g) and Hg(CN)2 (15 g) were added and the stirring was continued for 4 hours. The solvent was removed in vacuo and the residue was dissolved in 1000 mL of CH2C12, washed with sat'd Na2CO3, 30% KBr and water before drying over Na2SO4. The solvent was removed in vacuo and the residue was purified by silica gel column (1:1 to 1:4 hexane/ethyl acetate) to give compound 14, a white powder after freeze drying from dioxane, 25 g, 64%. [α]D+96.00 (c2.0, MeOH). 1H-NMR (300 MHz, CDCl3): 8=7.40-7.70 (m, Ar—H), 5.88 (m, 1H, CH═), 5.65 (d, 1H, J=9.5 Hz, NH), 5.55 (s, 1H, CHPH), 5.35 (m, 1H, H-4b), 5.25 (m, 3H), 5.03 (d, 1H, J=3.5 Hz, H-1a), 4.96 (dd, 1H, J=3.5, 10.0 Hz, H3b), 4.75 (d, 1H, J=7.5 Hz, H-1b), 3.6-4.70 (m), 2.14, 2.04, 1.98, 1.96 (4 s, 15H, 5 Ac).


[0075] Preparation of Compound 15


[0076] To a solution of compound 14, 23 g, 33 mmol and [bis (methyldiphenylpilosphine)1(1,5-cyclooctadiene)-iridium( 1) hexafluorophosphate, 500 mg in 500 mL of dry THF was bubbled argon for 15 min. and hydrogen for 5 min. The solution was stirred at room temperature for 4 hours. 15 g of I2, 800 mg of dimethylaminopyridine, 10 mL of pyridine and 80 mL of water were added and the solution was stirred at room temperature overnight. Sodium sulfate (5%, 200 mL) was added and THF was removed in vacuo. Aqueous workup (CH2Cl2) and silica gel purification (5:10:2 hexane/ethyl acetate/methanol) gave compound 15, a white powder after freeze drying from dioxane, 12.5 g, 58%. [α]D+55.6° (c1.0, MeOH). 1H-NMR(300 MHz, CDCl3): δ=7.35-7.60 (m, Ar—H), 6.50 (d, 1H, J=9.0 Hz, NH), 5.45 (s, 1H, CHPh), 5.33 (d, 1H, J=3.5 Hz, H-1a), 5.15 (m, 2H, H-2b & H-4b), 4.96 (2d, 1H, J=3.5, 10.5 Hz, H3b), 4.68 (d, 1H, J=8.0 Hz, H-1b), 3.60-4.50 (m), 2.11, 2.02, 2.00, 1.96, 1.94 (5 s, 15H, 5 Ac).


[0077] Preparation of Compound 16


[0078] To a solution of compound 15, 7.5 g, 11.5 mmol in 100 mL of dry CH2Cl2 was added 2.6 mL of trichloroacetonitrile and 0.5 mL of DBU. The solution was stirred for 2 hours at room temperature and solvent was removed in vacuo. The residue was purified by silica gel column using hexane/ethyl acetate(1:2) to give compound 16, a white powder after freeze drying from benzene, 5.6 g, 61%. [α]D+81.2 (c1.0, CH2Cl2). 1H-NMR (300 MHz, CDCl3): δ=8.75 (s, 1H, NH), 7.35-7.60 Cm, Ar—H), 6.62 (d, 1H, J=3.5 Hz, H-1a), 5.80 ( d, 1H, J=8.0 Hz, NH), 5.52 (s, 1H, CHPh), 5.40 (2d, 1H, J=1.0, 3.5 Hz, H-4b), 5.25 (dd, 1H, J=8.0, 10.0 Hz, H-2b), 5.01 (dd, 1H, J=3.5, 10.5 Hz, H-2a), 4.90 (d, 1H, J=8.0 Hz, H-1b), 3.80-4.80 (m), 2.15, 2.04, 2.02, 2.00, 1.96 (5 s, 15H, 5 Ac)



Example 3


Synthesis of Donors for Second Level Library

[0079] Synthesis of donors for the second level of the library is summarized in Reaction Scheme III.


[0080] Preparation of Compound 17


[0081] K. Fufase, et al., Tetrahedron Lett., 36: 7455-7458. To a solution of D-glucosamine hydrochloride, 34 g, in 1 L of water was added trichloroethyl chloroformate, 33 mL dropwise over 2-3 hours at 0° C. After stirring at room temperature overnight, the solid was filtered out and washed with water and then ether. The solid was recrystallized from ethanol to give N-troc-D-glucosamine, 59.5 g, 98%. A solution of N-troc-D-glucosamine, 13.5 g, 38.2 mmol in 100 mL of allyl alcohol with 5% HCl was stirred at 100° C. for 30 min. After cooling to room temperature, allyl alcohol was remove in vacuo and the residue was dissolved in 250 mL of acetonitrile. 12 ml of PhCH(OMe)2 and 100 mg of TSOH were added. The mixture was stirred at room temperature over night under argon. 10 g of Na2CO3 was added and the mixture was stirred for 10 min. Solid was removed and washed with acetone (5×5mL). The solvent was removed and the solid was purified by crystallized from ethanol to give compound 17, a solid, 12 g, 65%. 1H-NMR (300 MHz, CDCl3). δ=7.30-7.60 (m, 5H, Ar—H), 5.80-6.00 (m, 1H, CH=), 5.55(,s, 1H, CHPh), 5.20-5.40 (m, 2H, =CH2), 4.92 (d, 1H, J=3.0 Hz, H-1), 4.82, 4.69 (2 d, 2H, J=12.0 Hz, CH2), 3.70-4.30 (m), 3.59 (t, 1H, J=8.0 Hz, H-4), 2.67 (d, 1H, J=2.0 Hz, OH).


[0082] Preparation of Compound 18


[0083] A solution of compound 17, 8 g, 16.6 mmol in 20 mL of pyridine and 10 mL of acetic anhydride was stirred at room temperature overnight. The solvent was removed and the residue was purified by crystallization from ethanol to give compound 18, a solid, 4.7 g, 54%. [α]D+58.5 (cl, ethyl acetate); 1H-NMR (300 MHz, CDCl3): δ=7.30-7.50 (m, 5H, Ar—H), 5.806.00 (m, 1H, CH═), 5.53 (s, 1H, CHPH), 5.38 (t, 1H, J=10 Hz, H-3), 5.20-5.35 (m, 2H, ═CH2), 4.93 (d, 1H, J=3.5 Hz, H-1), 4.79, 4.68 (2 d, 2H, J=12.0 Hz, CH2), 3.65-4.35(m), 2.10 (s, 3H, CH3).


[0084] Preparation of Compound 19


[0085] To a solution of compound 18, 9.1 g, 17.4 mmol in 300 mL of THF was added [bis(methyldiphenylphosphine)] (1,5-cyclooctadiene)-iridium(1) hexafluorophosphate, 350 mg. Argon was passed to the solution for 10 min. following by hydrogen, 20 min. The solution was stopped and stirred at room temperature for 4 hours. 500 rug of DMAP, 6.25 mL of pyridine, 50 mL of water and 9.35 g of 12 were added. The mixture was stirred at room temperature overnight. THF was removed and the mixture was dissolved in 1 L of CH2Cl2. The solution was washed with sat. sodium carbonate, 1N HCl and water, dried over Na2SO4. The solvent was removed and the residue was purified by silica gel column using hexane/ethylacetate=7:3 to give compound 19, a white powder after freeze drying from dioxane, 5.5 g, 65%. [α]D+60.0 (cl, ethyl acetate); 1H-NMR (300 MHz, CDCl3): δ=7.30-7.50 (m, 5H, Ar—H), 5.53 (s, 1H, CHPh), 5.52 (d, 1H, J=10.0 Hz, NH), 5.43 (t, 1H, J=10 Hz, H-3), 5.33 (d, 1H, J=3.5 Hz, H-1), 4.81, 4.66 (2 d, 2H, J=12.0 Hz, CH2), 3.65-4.35 (m), 3.17 (dd, 1H, J=1.0, 3.5 Hz, OH), 2.06 (s, 3H, CH3).


[0086] Preparation of Compound 20


[0087] A solution of compound 19, 5.5 g, 11 mmol, 2.42 mL of tricholoroacetonitrile and 6 drops of DBU in 70 mL of CH2Cl2 was stirred at room temperature for 2 hours under argon. The solvent was removed in vacuo and the residue was purified by silica gel column using hexane/ethyl acetate (7:3) to give compound 20, a white powder after freeze drying from benzene, 5.59 g, 81%. [α]D+59.0 (cl, ethyl acetate); 1H-NMR (300 MHz, CDCl3): 8=8.80 (s, 1H, NH), 7.307.50 (m, 5H, Ar—H), 6.40 (d, 1H, J=3.5 Hz, H-1), 5.56 (s, 1H, CHPh), 5.46 (t, 1H, J=10 Hz, H-3), 5.36 (d, 1H, J=9.0 Hz, NH), 4.71 (dd, 2H, J=12.0 Hz, CH2), 4.35 (dd, 1H, J=5.0, 10.0 Hz, H-4), 4.27 (m, 1H, H-2), 3.75-4.00 (m), 2.11 (s, 3H, CH3).


[0088] Preparation of Compound 21


[0089] G. Grundler and R. R. Schmidt, Libigs Ann. Chem., 1984, 1826-1847.


[0090] To a solution of 2.0 mL of HClO4 (70%) in 300 mL of acetic anhydride was added lactose, 100 g, 0.29 mol by portion to keep the temperature between 30 to 35° C. After adding of 15 g of red phosphorous, the mixture was cooled in ice-salt bath and 90 g (29 mL) of Br2 was dropped to keep the temperature below 20° C. 15 mL of water was added in 15 min. The solid was filtered out and washed with acetic acid. The mixture was added to a solution of 100 g of zinc and 11 g of CuSO4 in 290 mL of water and 200 mL of acetic acid in 2 hour at −20° C. The mixture was stirred at −20° C. for 2 hours and the solid was filtered out and washed with acetic acid. The solvent was removed and the residue was purified by silica gel column using hexane/ethyl acetate (1:1) to give compound 21, a white powder after freeze drying from benzene, 100 g, 62%. 1H-NMR (300 MHz, CDCl3): 8=6.32 (dd, 1H, J=2.0, 6.5 Hz, ═CH), 5.32 (m, 1H, ═CH), 5.27 (dd, 1H, 1=1.0, 3.5 Hz, H-4′), 5.09 (dd, 1H, J=8.0, 11.0 Hz, H-2′), 4.92 (dd, 1H, J=3.5, 10.5 Hz, H-3′), 4.74 (dd, 1H, J=3.5, 6.0 Hz, H-3), 4.60 (d, 1H, J=8.0 Hz, H1′), 3.80-4.40 (m), 2.07, 2.03, 2.00, 1.97, 1.96, 1.89 (6 s, 18H, 6 Ac).


[0091] Preparation of Compound 22


[0092] G. Grundler and R. R. Schmidt, Libigs Ann. Chem., 1984, 1826-1847.


[0093] To a solution of compound 21, 100 g, 0.179 mol and 300 g of ceric ammonium nitrite (CAN) was added 20 g of sodium azide at −20° C. under nitrogen. After stirring at −20° C. for 6 hours, 1 L of water was added and the mixture was extracted with ether (5×500 mL). Ether was washed with water and evaporated. The residue was dissolved in 1 L of dioxane and 100 g of Na2NO2 in 30 mL of water was added. The mixture was stirred at 80° C. for 6 hours. 500 mL of water was added and the mixture was extracted with CH2Cl2 (5×400 mL). The solution was washed with water and dried over Na2SO4. The solvent was removed in vacuo and the residue was purified by silica gel column using hexane/ethyl acetate (1:1 to 1:2) as eluant to give compound 22, a whiter powder after freeze drying from benzene, as a mixture of (α, β-isomers.


[0094] Preparation of Compound 23


[0095] G. Grundler and R. R. Schmidt, Libigs Ann. Chem., 1984, 1826-1847.


[0096] To a solution of compound 22, 20 g, 32.4 mmol in 300 mL of CH2Cl2 was added 8 mL of Cl3CCN and 1.7 mL of DBU. The mixture was stirred at room temperature for 3 hours under argon. The solvent was removed in vacuo and the residue was purified by silica gel column using hexane/ethyl acetate from 1:1 to 2:3 to give compound 23, a white powder after freeze drying from benzene, 11.5 g, 47%. 1H-NMR (300 MHz, CDCl3): δ=8.80 (s, 1H, NH), 6.44(d, 1H, J=3.5 Hz, H-1), 5.51 (dd, 1H, J=9.0, 10.5 Hz, H-3), 5.36 (dd, 1H, J=1.0, 3.0 Hz, H-4′), 5.13 (dd, 1H, J=8.0, 10.0 Hz, H-2′), 4.96 (dd, 1H, J=3.5, 10.5 Hz, H-3′), 4.51 (d, 1H, J=7.5 Hz, H-1′), 3.70-4.50 (m), 3.62 (dd, 1H, J=3.5, 10.5 Hz, H-2), 1.90-2.20 (6 s, 18H, 6 Ac).


[0097] Preparation of Compound 24


[0098] T. J. Martin and R. R. Schmidt, Tetrahedron Lett., 36: 7455-7458.


[0099] A solution of sialic acid, 10 g and IR120+resin, 4 g in 1000 mL of dry methanol was stirred at room temperature overnight under argon. The resin was filtered out and washed with methanol. The solvent was removed to give compound 24, a solid, 10.65 g, 100%.


[0100] Preparation of Compound 25


[0101] T. J. Martin and R. R. Schmidt, Tetrahedron Lett., 36: 7455-7458.


[0102] To a solution of 20 mL of acetic anhydride was added 0.1 mL of HClO4. Compound 24, 10.65 g, was added to the mixture by portion to keep the temperature between 30 to 35° C. After finishing adding the mixture was stirred at room temperature for 3 hours. The solution was poured onto ice water and kept at room temperature overnight. The solution was extracted with chloroform (3×100 mL) and washed with sat. NaHCO3 and water, dried over Na2SO4. The solvent was removed in vacuo and the residue was purified by silica gel column using hexane/ethyl acetate/methanol (5:20:1) to give compound 25, a white powder after freeze drying from dioxane, 6 g, 76% as a mixture of α,p-isomers 1H-NMR of major isomer (300 MHz, CDCl3): δ=6.15 (d, 1H, J=10.0 Hz, NH), 5.39 (dd, 1H, 3=2.0, 4.5 Hz, H-7), 5.23 (m, 1H, H-8), 5.16 (t, 1H, J=9.0 Hz, H-4), 4.97 (s, 1H, OH), 4.60 (dd, 1H, J=2.0, 10.0 Hz, H-9a), 4.18 (t, 1H, J=10.0 Hz, H-5), 4.03 (dd; 1H, J=8.0, 11.5 Hz, H-9b), 3.85 (s, 3H, CH3), 2.22 (d, 1H, J=7.0 Hz, H-3e), 2.15, 2.07, 2.02, 2.01, 1.90, (5 s, 15H, 5 Ac). 2.02 (m, 1H, H-3a).


[0103] Preparation of Compound 26


[0104] T J. Martin and R. R. Schmidt, Tetrahedron Lett., 36: 7455-7458.


[0105] To a solution of compound 25, 7.2 g, 14.4 mmol in 150 mL of dry acetonitrile was added 6 mL of EtNi-Pr2 and 4.5 mL of phospholoride diethyl ester. The solution was stirred at room temperature for 10 min. and the solvent was removed in vacuo. The residue was purified by silica gel column using hexane/acetone 1:1 to give compound 26, a syrup, 8.5 g, 97% as a mixture of (α, β-isomers. 1H-NMR (300 MHz, CDCl3): δ=2.80 (dd, 0.33H, H-2eβ). 2.50 (dd, 0.67H, H-2e α).



Example 4


Generation of First Level Library

[0106] GSTA is glycosylated separately with the protected Tn and TF donors to produce first level libraries with four glycoforms each:


[0107] 1. Tn: 00, Tn0, 0 Tn, TnTn


[0108] 2. TF: 00, TF0, 0TF, TFTF


[0109] where “0” means no glycosylation.


[0110] GSTA also is reacted with Tn and TF as a mixture. The result is a first level library that contains nine glycoforms:


[0111] 3. Tn+TF: 00, Tn0, 0Tn, TnTn, TF0, 0TF, TFTF, TnTF, TFTn


[0112] As can be seen, the first level libraries 1, 2 and 3 above are different in composition and/or size. Library 3 formed with mixed Tn and TF glycosyl donors contains all the glycoforms that occur in libraries 1 and 2, formed when Tn and TF are used separately as glycosyl donors, in addition to other components that arise from the use of mixed donors. A library formed with a mixture of two donors is therefore more comprehensive than a library formed by combining two libraries produced when the two donors are reacted with the core peptide separately. Generation of a first level library is summarized in Reaction Scheme IV.


[0113] Preparation of Compound 27


[0114] A solution of compound 8, 0.6 g, 0.925 mmol, compound 11, 2 g, 3.73 mmol, compound 16, 2.45 g, 3.088 mmol and 3.5 g of 3 A molecular sieves in 20 mL of dry THF was stirred at room temperature for 10 min. under argon and cooled to −20° C. Ten mL of 0.1 mol of BF3EtO2 in dry THF was added in 5 min. After stirring at −20° C. for 1 hour, the reaction mixture was then warmed to room temperature. The molecular sieves were filtered out and washed with ethyl acetate (3×20 mL). The solvent was removed in vacuo and the residue was purified by silica gel column using hexane/ethyl acetate/methanol (10:10:1.5) to give 2.3 g of a mixture as a white powder after freeze drying from dioxane.


[0115] Preparation of Compound 28


[0116] A solution of compound 27, 2.3 g in 40 mL of dry pyridine and 5 mL of dry acetic anhydride was stirred at room temperature overnight under argon. The solvent was removed in vacuo and the residue was purified by silica gel column using hexane/ethyl acetate/methanol (10:10:1.5) to give 2.34 g of a white powder after freeze drying from dioxane.


[0117] Preparation of Compound 29


[0118] A solution of compound 28, 1.9 g in 30 mL of 80% aqueous acetic acid was stirred at 80° C. for 2 hours. The solvent was removed in vacuo and the residue was purified by silica gel column using hexane/ethyl acetate/methanol (5:10:3) to give 1.61 g of a white powder after freeze drying from dioxane.


[0119] Preparation of Compound 30


[0120] A mixture of compound 29, 150 mg and 5 mL of 0.1 N NaOH in 5 mL of methanol was stirred at room temperature for 24 hours. Then 5 mL of 0.1 N NaOH and 5 mL of methanol was added and the stirring was continued for .24 hours. IR120+resin was added to adjust pH=4.5. The resin was filtered out and washed with water (5×5 mL). The water was removed in vacuo and the residue was purified by P-2 column using water as eluant to give a solid, 13.1 mg.
3ES-MS30-1:Calc. for C20H35O12N5, 537.50. Found, 538.55 (M + H)30-2:Calc. for C28H48O17N6, 740.31. Found, 741.41 (M + H)30-3:Calc. for C26H45O17N5, 699.64. Found, 700.43 (M + H)30-4:Calc. for C40H68O27N6, 1064.41. Found, 1065.67 (M + H)30-5:Calc. for C34H58O22N6, 902.36. Found, 903.51 (M + H)30-6:Calc. for C34H58O22N6, 902.36. Found, 903.51 (M + H)



Example 5


Generation of Second Level Libraries

[0121] Unglycosylated serine and threonine residues in the first level libraries are temporarily blocked by acetylation to prevent the next set of glycosyl donors from reacting with unreacted sites. Following purification the 4,6-benzylidenyl protecting groups of all glycoforms are selectively removed by acid cleavage to create additional glycosylation sites on all the existing carbohydrate structures. GIcNAc, sialic acid (S)and N-acetyllactosamine (L) are reacted with the existing carbohydrate structures of the glycopeptides of each first level library, to produce second level libraries with new structures. Sialic acid and N-acetyllactosamine react at the 6-0- position of N-acetylgalactosamine common to the glycosylated Tn and TF components of the first level libraries.


[0122] For example, the following second level libraries are created when libraries 1, 2 and 3 of Example 1 are reacted with sialic acid:


[0123] 4. 1+S: 00, TnO, STnO, OTn, OSTn, TnTn, TnSTn, STnTn, STnSTn


[0124] 5. 2+S: 00, TFO, STFO, OTF, OSTF, TFTF, TFSTF, STFTF, STFSTF


[0125] 6. 3+S: 00, TnO, STnO, OTn, OSTn, TnTn, STnTn, TnSTn, STnSTn, TFO, STFO, OTF, OSTF, TFTF, STFTF, TFSTF, STFSTF, TnTF, STnTF, TnSTF, STnSTF, TFTn, STFTn, TFSTn, STFSTn


[0126] Second level library 6 is more comprehensive than 4 and 5 combined.


[0127] The number of components may be increased further by using mixed donors, as in library 7. When library 3 from the first level is reacted with a mixture of sialic acid and N-acetyllactosamine, the following library is produced:


[0128] 7. 3+SL: 00, TnO, STnO, OTn, OSTn, TnTn, STnTn, TnSTn, STnSTn, TFO, STFO, OTF, OSTF, TFTF, STFTF, TFSTF, STFSTF, TnTF, STnTF, TnSTF, STnSTF, TFTn, STFTn, TFSTn, STFSTn, LTnO, OLTn, LTnTn, TnLTn, LTnLTn, LTFO, OLTF, TFLTF, LTFTF, LTFLTF, STnLTn, LTnSTn, LTFSTF, STFLTF


[0129] Additional second level libraries can be formed by mixing each of libraries 1, 2 and 3 with N-acetyllactosamine, and by mixing each of libraries 1 and 2 with a mixture of sialic acid and N-acetyllactosamine.


[0130] The ultimate size and definition of the library is controlled by the number and identity of donors, preblocking of defined sites on the peptide, and use of split-mix-split type of synthesis, as described in Plunkett et al., Scientific American, April 1997, 68-73, the contents of which are incorporated herein by reference. Generation of a second level libraries is summarized in Reaction Schemes V, VI, VII and VII.


[0131] Synthesis of Second Level Library with GIcNAc as Donor


[0132] Preparation of Compound 31


[0133] A solution of compounds 29, 0.4 g, compound 20, 1.2 g and 2 g of 3 A molecular sieves in 15 mL of dry acetonitrile was stirred at room temperature for 10 min. under argon. The mixture was cooled to −20° C. and 3 mL of 0. 1 mol of TMS-OTF in dry acetonitrile was added in 5 min. The solution was stirred at −20° C. for 1 hour and warmed to room temperature. The molecular sieves were filtered out and washed with ethyl acetate (5×5 mL). The solvent was removed in vacuo and the residue was purified by silica gel column using hexane/ethyl acetate/methanol (5:10:2) to give 0.64 g of a white powder after freeze drying from dioxane.


[0134] Preparation of Compound 32


[0135] A solution of compound 31, 0.63 g and 4 g of activated zinc in 50 mL of 80% acetic acid in ethyl acetate was stirred at room temperature for 2 hours. Zinc was filtered out and washed with ethyl acetate (5×20 mL). The solvent was removed in vacuo and the residue was dissolved in 15 mL of dry pyridine and 5 mL of dry acetic anhydride. The solution was stirred at room temperature for 12 hours. The solvent was removed in vacuo and the residue was purified by silica gel column using hexane/ethyl acetate/methanol (5:10:2) to give 0.60 g of a white powder after freeze drying from dioxane. A solution of above solid, 200 mg, in 200 mL of 80% of aqueous acetic acid was stirred at 80° C. for 2 hours and the solvent was removed in vacuo. The residue was dissolved in 5 mL of methanol and 20 mL of 0.1 N NAOH was added and the mixture was stirred at room temperature for 24 hours. IR120+resin was added to adjust pH=4.5 and the resin was filtered out and washed with water (5×5 mL). Water was removed in vacuo and the residue was purified by P-2 column to give a solid, 15 mg.
4ES-MS32-1:Calc. for C28H48O17N6, 740.31. Found, 741.30 (M + H) +32-2:Calc. for C36H61O22N7, 943.39. Found, 944.46 (M + H) +32-3:Calc. for C36H61O22N7, 943.39. Found, 944.46 (M + H) +32-4:Calc. for C44H74O27N8, 1146.46. Found, 574.50 (M + 2H)2 +32-5:Calc. for C34H58O22N6, 902.36. Found, 903.41 (M + H) +32-6:Calc. for C42H71O27N7, 1105.44. Found, 1106.72 (M + H) +32-7:Calc. for C42H71O27N7, 1105.44. Found, 1106.72 (M + H) +32-8:Calc. for C56H94O37N8, 1470.57. Found, 737.00 (M + 2H) +32-9:Calc. for C50H84O32N8, 1308.52. Found, 1309.57 (M + H) +32-10:Calc. for C50H84O32N8, 1308.52. Found, 1309.57 (M + H) +32-11:Calc. for C42H71O27N7, 1105.44. Found, 1106.72 (M + H) +32-12:Calc. for C42H71O27N7, 1105.44. Found, 1106.72 (M + H) +32-13:Calc. for C48H81O32N7, 1268.16. Found, 635.0 (M + 2H)2 +32-14:Calc. for C48H81O32N7, 1268.16. Found, 635.0 (M + 2H)2 +


[0136] Synthesis of Second Level Library with N-acetyl Lactosamine as Donor


[0137] Preparation of Compound 33


[0138] A solution of compound 29, 0.4 g, compound 23, 1.5 g and 2 g of 3A molecular sieves in 15 mL of dry CH2Cl2 was stirred at room temperature for 10 min under argon. Three mL of 0.1 mol of BF3OEt2 in dry CH2Cl2 was added in 10 min. The solution was stirred at room temperature for 1 hour. The molecular sieves were filtered out and washed with ethyl acetate (5×10 mL) and the solvent was removed in vacuo. The residue was purified by silica gel column using hexane/ethyl acetate/methanol (5:10:2) to give 0.43 g of a white powder after freeze drying from dioxane.


[0139] Preparation of Compound 34


[0140] A solution of compound 33, 0.42 g in 30 mL of pyridine and 2 mL of water was passed H2S gas for 30 min. The solution was stopped and stirred at room temperature for 48 hours. The solvent was removed in vacuo and the residue was dissolved in 15 mL of dry pyridine and 5 mL of dry acetic anhydride. The mixture was stirred at room temperature for 12 hours. The solvent was removed in vacuo and the residue was purified by silica gel column using hexane/ethyl acetate/methanol (5:10:2) to give 0.44 g of a white powder after freeze drying from dioxane. A solution of the above solid; 200 mg, was dissolved in 5 mL of methanol and 20 mL of 0.1 N NaOH and the mixture was stirred at room temperature for 24 hours. IR120+resin was added to adjust pH=4.5 and the resin was filtered out and washed with water (5×5 mL). Water was removed in vacuo and the residue was purified by P-2 column to give a solid, 12.7 mg.
5ES-MS34-1:Calc. for C34H58O22N6, 902.36. Found, 903.46 (M + H) +34-2:Calc. for C42H71O27N7, 1105.44. Found, 1106.54 (M + H) +34-3:Calc. for C42H71O27N7, 1105.44. Found, 1106.54 (M + H) +34-4:Calc. for C56H94O37N8, 1470.57. Found, 736.60 (M + 2H) +34-5:Calc. for C40H68O27N6, 1064.41. Found, 1065.54 (M + H) +34-6:Calc. for C48H81O32N7, 1267.49. Found, 1268.52 (M + H) +34-7:Calc. for C48H81O32N7, 1267.49. Found, 1268.52 (M + H) +34-8:Calc. for C68H114O47N8, 1794.68. Found, 898.70 (M + 2H)2 +34-9:Calc. for C62H104O42N8, 1632.62. Found, 817.64 (M + 2H) +34-10:Calc. for C62H104O42N8, 1632.62. Found, 817.64 (M + 2H) +34-11:Calc. for C48H81O32N7, 1267.49. Found, 1268.52 (M + H) +34-12:Calc. for C48H81O32N7, 1267.49. Found, 1268.52 (M + H) +34-13:Calc. for C54H91O37N7, 1430.30. Found, 1430.71 (M + H) +34-14:Calc. for C54H91O37N7, 1430.30. Found, 1430.71 (M + H) +


[0141] Synthesis of Second Level Library with Sialic Acid as Donor


[0142] Preparation of Compound 35


[0143] A solution of compound 29, 0.4 g, compound 26, 0.8 g and 2 g of 3A molecular sieves in 15 mL of dry THF was stirred at room temperature for 10 min. under argon and cooled to −20° C. Two mL of 0. 1 mol of TMS-OTF in dry THF was added in 5 min. The solution was stirred at −20° C. for 1 hour. The molecular sieves was filtered out and washed with ethyl acetate (5×10 mL) and the solvent was removed in vacuo. The residue was purified by silica gel column using hexane/ethyl acetate/methanol (5:10:3) to give 0.55 g of a white powder after freeze drying from dioxane.


[0144] Preparation of Compound 36


[0145] A solution of compound 35, 0.2 g in 10 mL of 0.1 N NaOH and 5 mL of methanol was stirred at room temperature for 24 hours. IR120+resin was added to adjust pH =4.5 and the resin was filtered out and washed with water (5×5 mL). Water was removed in vacuo and the residue was purified by P-2 column to give a solid, 18.3 mg.
6ES-MS36-1:Calc. for C31H52O20N6, 828.32. Found, 829.60 (M + H) +36-2:Calc. for C39H65O25N7, 1031.40. Found, 1032.80 (M + H) +36-3:Calc. for C39H65O25N7, 1031.40. Found, 1032.80 (M + H) +36-4:Calc. for C50H82O33N8, 1322.50. Found, 662.00 (M + 2H)2 +36-5:Calc. for C37H62O25N6, 990.37. Found, 991.70 (M + H) +36-6:Calc. for C45H75O30N7, 1193.45. Found, 1194.80 (M + H) +36-7:Calc. for C45H75O30N7, 1193.45. Found, 1194.80 (M + H) +36-8:Calc. for C62H102O43N8, 1646.60. Found, 824.20 (M + 2H)2 +36-9:Calc. for C56H92O38N8, 1484.55. Found, 743.60 (M + 2H)2 +36-10:Calc. for C56H92O38N8, 1484.55. Found, 743.60 (M + 2H)2 +36-11:Calc. for C45H75O30N7, 1193.45. Found, 1194.80 (M + H) +36-12:Calc. for C45H75O30N7, 1193.45. Found, 1194.80 (M + H) +36-13:Calc. for C51H85O35N7, 1356.22. Found, 1356.60 (M + H) +36-14:Calc. for C51H85O35N7, 1356.22. Found, 1356.60 (M + H) +


[0146] Synthesis of Second Level Library with GlcNAc and Sialic Acid as Donors


[0147] Preparation of Compound 37


[0148] A solution of compound 29, 0.2 g, compound 26, 0.22 g and compound 20, 0.22 g with 2 g of 3A molecular sieves in 15 mL of dry THF was stirred at room temperature for 10 min. under argon and cooled to −20° C. One mL of 0.1 mol of TMS-OTF in dry THF was added in 5 min. The solution was stirred at −20° C. for 1 hour. The molecular sieves was filtered out and washed with ethyl acetate (5×10 mL) and the solvent was removed in vacuo. The residue was purified by silica gel column using hexane/ethyl acetate/methanol (5:10:1) to give 0.27 g of a white powder after freeze drying from dioxane.


[0149] Preparation of Compound 38


[0150] A solution of compound 37, 0.27 g and 1 g of activated zinc in 20 mL of 80% acetic acid in ethyl acetate was stirred at room temperature for 2 hours. Zinc was filtered out and washed with ethyl acetate (5×20 mL). The solvent was removed in vacuo and the residue was dissolved in 4 mL of dry pyridine and 2 mL of dry acetic anhydride. The solution was stirred at room temperature for 12 hours. The solvent was removed in vacuo and the residue was purified by silica gel column using hexane/ethyl acetate/methanol (5:10:1) to give 0.25 g of a white powder after freeze drying from dioxane. A solution of above solid, 0.25 g, in 10 mL of 80% of aqueous acetic acid was stirred at 80° C. for 2 hours and the solvent was removed in vacuo. The residue was dissolved in 3 mL of methanol and 7 mL of 0.1 N NaOH was added and the mixture was stirred at room temperature for 24 hours. IR120+resin was added to adjust pH=4.5 and the resin was filtered out and washed with water (5×5 mL). Water was removed in vacuo and the residue was purified by P-2 column to give a solid, 16.1 mg.



Example 6


Screening of GSTA Libraries

[0151] An ELISA assay with antibodies that bind to a solid phase of either Ovine Submaxillary Mucin (OSM) or CA27.29 were used in an inhibition format to screen the GSTA libraries, with library components being used as inhibitors. Wells were coated with 110 μL of antigen diluted in PBS and allowed to stand overnight at 4° C. One μg/mL OSM or 10 U/mL CA27.29 were used in the assay. On the day of the assay, the wells were aspirated and washed twice with PBS, then 200 μL of 2% BSA blocking solution was added to each well. The wells were incubated 2 hours at room temperature.


[0152] The monoclonal antibodies and inhibitors were diluted in 1% FBS/PBS to various concentrations using glass culture tubes. Monoclonal antibody and inhibitor were combined in pre-incubation tubes. The zero inhibition tube received monoclonal antibody and 1% FBS/PBS diluent. The preincubation for each plate was completed exactly one hour prior to the end of the blocking incubation. When the blocking incubation was complete, the wells were aspirated and washed 4 times with TPBS (PBS+0.05% Tween 20), and 100 μL of the pre-incubation mixture was added to duplicate wells. Blank and substrate negative control wells received 100 μL of 1% FES/PBS diluent only.


[0153] Wells were incubated for 90 minutes at room temperature. Just prior to the end of incubation, goat anti-mouse IgG, H+L, HRP labelled was diluted with 1% FBS/PBS. The wells were aspirated and washed 4 times with TPBS, and 90 μL of diluted goat anti-mouse HRP was added to all wells except the substrate negative control wells, which received 90 μL 1% FBS/PBS.


[0154] The wells were incubated for 90 minutes at room temperature, and then aspirated and washed 4 times with TPBS. Equal volumes of ABTS and peroxide solution B substrate were mixed, and 100 μL was added to each well. The plate was immediately put into a plate reader and read on kinetic mode at wavelength 405-490 nm, 10 minute read, 20 second interval. Data were expressed in mOD/min, and results are shown in Table 3 and FIG. 8.
7TABLE 3Inhibition of MAbs with 5 Glycopeptide Libraries June 23, 1997 970623.XLSCC49B72.3B195.3R11MAbOSMOSMOSMSolid PhaseFinal Conc.VmaxVmaxVmaxInhibitor(μg/mL)(mOD/min.)% Inhibition(mOD/min.)% Inhibition(mOD/min.)% InhibitionNo Inhibitor Control121.60.0127.70.0130.50.0#9, Tn, TF 60121.60.0124.92.2127.02.7120123.5−1.6128.1−0.3131.5−0.8#10, Tn, TF, STn, STF150122.4−0.748.462.1119.28.7300124.8−2.619.584.7100.023.4#11, Tn, TF, GTn, GTF150130.3−7.2127.6−0.1139.1−6.6300118.22.8127.60.7131.1−0.5#12, Tn, TF, STn, STF300119.51.7116.58.8121.07.3GTn, GTF600120.70.791.728.2111.314.7#13, Tn, TF, ?150124.6−2.5128.6−0.7132.3−1.4300124.4−2.3138.4−8.4139.9−7.2Final Concentrations of MAbs used:CC4920 ng/mLB72.310 ng/mLB195.3H1175 ng/mLB239.9R84B27.29MAbOSMCa 27.29Solid PhaseVmaxVmaxInhibitor(mOD/min.)% Inhibition(mOD/min.)% InhibitionNo Inhibitor Control113.10.0120.20.0#9, Tn, TF114.7−1.4120.7−0.4115.7−2.3124.1−3.2#10, Tn, TF, STn, STF96.714.5124.1−3.270.337.9121.6−1.2#11, Tn, TF, GTn, GTF119.8−5.9121.7−1.2112.20.6117.72.1#12, Tn, TF, STn, STF106.06.3115.83.7GTn, GTF108.34.2121.0−0.7#13, Tn, TF, ?116.9−3.4121.9−1.4114.6−1.3118.51.4B239.9R8425 ng/mLB27.2920 ng/mL


[0155]

1





2





3





4





5





6





7





8






Claims
  • 1. A method of generating a glycopeptide library, comprising the steps of: (a) randomly glycosylating a platform having at least one glycosylation site with at least one glycosyl donor, optionally blocking unreacted glycosylation sites on the glycosylated platforms and optionally selectively removing one or more protecting groups on the carbohydrate groups introduced at the first level; whereby a first level library of glycosylated platforms is created; and then (b) optionally randomly glycosylating said first level library of glycosylated platforms, or a combination of first level libraries of glycosylated platforms, with at least one glycosyl donor, and optionally selectively removing one or more designated protecting groups on the carbohydrate groups introduced at the second level; whereby a second level library of glycosylated platforms is created.
  • 2. A method according to claim 1, which further comprises further randomly glycosylating said second level library of glycosylated platforms, or a combination of second level or first and second level libraries of glycosylated platforms, with at least one glycosyl donor, and optionally selectively removing one or more designated protecting groups on the carbohydrate groups introduced at the third level; whereby a third level library of glycosylated platforms is created; and optionally repeating the foregoing step to produce fourth and higher level libraries of increased diversity.
  • 3. A method according to claim 2, wherein said peptide has an amino acid sequence GVTSAPDTRPAPGSTA.
  • 4. A method according to claim 2, wherein said peptide has an amino acid sequence GSTA.
  • 5. A method according to claim 2, wherein said unreacted glycosylation sites are blocked.
  • 6. A method according to claim 5, wherein said sited are blocked by acetylation.
  • 7. A method according to claim 3, wherein said glycosyl donors are selected from the group consisting of GalNAc, βGal(1-3)αGalNAc and sialyl.
  • 8. A method according to claim 4, wherein said glycosyl donors are selected from the group consisting of GalNAc, βGal(1-3)αGalNAc and sialyl.
  • 9. A method according to claim 1, wherein hydroxyl groups on said glycosyl donors are protected prior to reaction of said glycosyl donors with said platforms or said glycosylated platforms.
  • 10. A method according to claim 9, wherein said hydroxyl groups are deprotected after reaction with said platforms or said glycosylated platforms.
  • 11. A method according to claim 10, wherein some of said hydroxyl groups are removed during said deprotection step.
  • 12. A method according to claim 1, wherein said platform is a peptide.
  • 13. A method according to claim 1, wherein said platform does not contain peptide linkages.
  • 14. A method according to claim 1, wherein said platform comprises natural glycosylation sites.
  • 15. A method according to claim 1, wherein said platform comprises unnatural glycosylation sites.
  • 16. A method according to claim 1, wherein said platform comprises tandem repeats.
  • 17. A method according to claim 1, wherein each glycosylation site on said platform is unique and distinguishable from other sites due to distinct structural features in the vicinity of the site.
  • 18. A method according to claim 1, wherein said platform is a hybrid platform comprising a non-peptide polymer to which natural amino acid side chains with natural glycosylation sites are attached.
  • 19. A method according to claim 1, wherein said glycosylation sites provide hydroxy functions for O-glycosylation or carboxy or carboxamido functional groups for N-glycosylation.
  • 20. A method according to claim 1, wherein said glycosylation sites include one or more of serine, threonine, hydroxylysine and asparagine.
  • 21. A method according to claim 1, wherein said glycosylation sites consist entirely of d-optical configuration.
  • 22. A method according to claim 1, wherein said platform is constructed entirely of d-amino acids.
  • 23. A method according to claim 1, wherein said platform is linear.
  • 24. A method according to claim 1, wherein said platform is cyclic.
  • 25. A method according to claim 1, wherein said platform comprises a UV-active or fluorescent label.
  • 26. A method according to claim 1, wherein said platform comprises hydrophobic amino acids which increase the solubility of the platform in organic solvents.
  • 27. A method according to claim 1, wherein said glycosylation sites are spaced, singly or in clusters, between sequences that include hydrophobic amino acids.
  • 28. A method according to claim 1, wherein lipid chains are incorporated into said platform.
  • 29. A method according to claim 1, wherein said glycosyl donors are unnatural.
  • 30. A method according to claim 1, wherein said glycosyl donors comprise structures associated with adhesion ligands for bacterial receptors that are expressed on human cell surface antigens.
  • 31. A method according to claim 1, wherein said glycosyl donors comprise structures associated with malignant cell antigens.
  • 32. A randomly-generated glycopeptide library.
  • 33. A randomly-generated glycopeptide library according to claim 32, comprising carcinoma-associated mucins.
  • 34. A library of glycosylated platforms produced by the method of claim 1.
  • 35. A library of glycosylated platforms produced by the method of claim 2.
  • 36. A library of glycosylated platforms produced by the method of claim 30.
  • 37. A library of glycosylated platforms produced by the method of claim 31.
  • 38. A method of identifying a biologically-active compound, comprising: generating a library of glycosylated platforms according to claim 34; and screening components of said library for drug-like, competitive inhibitory, immunostimulatory or antibody-like activity.
  • 39. A method of identifying an anti-viral compound, comprising: generating a library of glycosylated platforms according to claim 34; and screening components of said library for anti-viral activity.
  • 40. A method of identifying an anti-bacterial compound, comprising: generating a library of glycosylated platforms according to claim 30; and screening components of said library for the ability competitively to inhibit bacterial adhesion to a host cell.
  • 41. A method of identifying compounds for detection or treatment of cancer, comprising: generating a library of glycosylated platforms according to claim 31; and screening components of said library for anti-cancer activity.
Provisional Applications (1)
Number Date Country
60056240 Aug 1997 US
Divisions (1)
Number Date Country
Parent 09143379 Aug 1998 US
Child 09842873 Apr 2001 US