OLIGOSACCHARIDE ANALYTICAL STANDARDS

FIELD OF THE INVENTION

The invention is directed to synthesis of complex N-linked oligosaccharide including methods of preparing key intermediates that can be readily elaborated into more complicated compounds such as glycoconjugates, and derivatives thereof for use as native or isotopically enriched analytical standards with application in mass spectrometry, high-pressure liquid chromatography, capillary electrophoresis, and nuclear magnetic resonance; for antibody, glycoprotein, and therapeutic development; and for construction of analytical or high-throughput platforms including microarray and biofluid analysis.

BACKGROUND

N-glycosylation of proteins is one of the most complex and diverse post-translational modifications that can influence a multitude of biological processes such as signal transduction, embryogenesis, neuronal development, fertilization, hormone activity, immune regulation and the proliferation of cells and their organization into specific tissues. It has been implicated in the etiology of many human diseases such as pathogen recognition, inflammation, immune responses, the development of autoimmune diseases, and cancer. Although it is widely accepted that N-glycans contain high information content, the limited accessibility of well-defined structures makes it difficult to uncover the molecular basis by which they regulate biological and disease processes. Consequently, diverse collections of well-defined N-glycans are needed as standards for glycan structure determination of heterogeneous biological samples, as ligands to study interactions with glycan-binding proteins, as probes to examine the molecular basis of glycoconjugate biosynthesis and as starting materials for glycoprotein synthesis.

There remains a need for improved methods of synthesizing oligosaccharides, including those found in biologically relevant systems, including glycans. There remains a need for analytical standards permitting the rapid identification and quantification of N-glycans in a biomolecule of interest.

FIG. 1 depicts Structure of N-glycans and a bio-inspired strategy for their preparation. FIG. 1a, MGAT enzymes responsible for installing GlcNAc at different branching points. FIG. 1b. Enzyme classes involved in the biosynthesis of complex N-glycans. FIG. 1c. Structure of unnatural UDP-GlcNTFA (4). FIG. 1d, Bio-inspired strategy for the synthesis of asymmetric N-glycans. Symmetrical bi-antennary glycan 1, which can easily be obtained from a glycopeptide isolated from egg yolk, can be further branched by recombinant MGAT4 and MGAT5. The use of unnatural UDP-GlcNTFA makes it possible to prepare 2 bearing GleNAc, GleN₃and GlcNH₂branching moieties. Compound 2 is the key intermediate for preparing complex targets such as 3. FIG. 1e, Transformation of GleNTFA, installed by MGAT4 and MGAT5, into GleNH₂or GlcN₃‘stops’ further enzymatic extension of these moieties until they are converted into natural GlcNAc (‘go’), which can then be elaborated by glycosyltransferases into complex appendages. FIG. 2 depicts the synthesis of asymmetric branched tri-antennary glycosyl asparagines using MGAt5 and uDP-GlcNtFA. MGAT5 readily accepts UDP-GlcNTFA to give a tri-antennary glycan that, following base treatment, provides a compound with a GleNH₂at the β6 arm. The latter residue is not a substrate for the galactosyl transferase B4GalT1 and therefore it is possible to selectively elaborate the MGAT1 and MGAT2 arms by exploiting the inherent branch selectivities of glycosidases and glycosyltransferases. Once the MGAT1 and MGAT2 arms were capped with Neu5Ac, preventing these positions from further elongation, the GleNH₂could be acetylated to give natural GleNAc capable of being extended by a series of glycosyltransferases. FIG. 3 depicts the synthesis of asymmetric branched tetra-antennary N-glycans using MGAt4 and MGAt5 in combination with uDP-GlcNtFA and subsequent conversion of the transferred GlcNtFA into GlcN₃or GlcNH₂. The latter moieties are temporarily disabled from modification by glycosyltransferases, making it possible to selectively elaborate the MGATI and MGAT2 arms. At an appropriate point in the synthesis, the unnatural GlcN₃or GlcNH₂moieties can be converted into natural GleNAc, allowing each arm to be uniquely extended.

FIG. 4 depicts additional compounds of the invention. The symbols used to depict sugars is presented in FIG. 1.

FIG. 5 depicts additional compounds of the invention. The symbols used to depict sugars is presented in FIG. 1.

FIG. 6 depicts additional compounds of the invention. The symbols used to depict sugars is presented in FIG. 1.

DETAILED DESCRIPTION

Before the present methods and systems are disclosed and described, it is to be understood that the methods and systems are not limited to specific synthetic methods, specific components, or to particular compositions. It is also to be understood that the terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting.

As used in the specification and the appended claims, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise. Ranges may be expressed herein as from “about” one particular value, and/or to “about” another particular value. When such a range is expressed, another embodiment includes—from the one particular value and/or to the other particular value. Similarly, when values are expressed as approximations, by use of the antecedent “about.” it will be understood that the particular value forms another embodiment. It will be further understood that the endpoints of each of the ranges are significant both in relation to the other endpoint, and independently of the other endpoint.

“Optional” or “optionally” means that the subsequently described event or circumstance may or may not occur, and that the description includes instances where said event or circumstance occurs and instances where it does not.

Throughout the description and claims of this specification, the word “comprise” and variations of the word, such as “comprising” and “comprises,” means “including but not limited to.” and is not intended to exclude, for example, other additives, components, integers or steps.

“Exemplary” means “an example of” and is not intended to convey an indication of a preferred or ideal embodiment. “Such as” is not used in a restrictive sense, but for explanatory purposes.

Disclosed are components that can be used to perform the disclosed methods and systems. These and other components are disclosed herein, and it is understood that when combinations, subsets, interactions, groups, etc. of these components are disclosed that while specific reference of each various individual and collective combinations and permutation of these may not be explicitly disclosed, each is specifically contemplated and described herein, for all methods and systems. This applies to all aspects of this application including, but not limited to, steps in disclosed methods. Thus, if there are a variety of additional steps that can be performed it is understood that each of these additional steps can be performed with any specific embodiment or combination of embodiments of the disclosed methods.

The term “alkyl” as used herein is a branched or unbranched hydrocarbon group such as methyl. ethyl, n-propyl, isopropyl, n-butyl, isobutyl, t-butyl, pentyl, hexyl, heptyl, octyl, nonyl, decyl, dodecyl, and the like. The alkyl group can also be substituted or unsubstituted. Unless stated otherwise, the term “alkyl” contemplates both substituted and unsubstituted alkyl groups. The alkyl group can be substituted with one or more groups including, but not limited to, alkoxy, alkenyl, alkynyl, cycloalkyl, heterocycloalkyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, nitro, silyl, sulfo-oxo, or thiol. An alkyl group which contains no double or triple carbon-carbon bonds is designated a saturated alkyl group, whereas an alkyl group having one or more such bonds is designated an unsaturated alkyl group. Unsaturated alkyl groups having a double bond can be designated alkenyl groups, and unsaturated alkyl groups having a triple bond can be designated alkynyl groups. Unless specified to the contrary, the term alkyl embraces both saturated and unsaturated groups.

The term “cycloalkyl” as used herein is a non-aromatic carbon-based ring composed of at least three carbon atoms. Examples of cycloalkyl groups include, but are not limited to cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, etc. The term “heterocycloalkyl” is a cycloalkyl group as defined above where at least one of the carbon atoms of the ring is replaced with a heteroatom such as, but not limited to, nitrogen, oxygen, sulfur, selenium or phosphorus. The cycloalkyl group and heterocycloalkyl group can be substituted or unsubstituted. Unless stated otherwise, the terms “cycloalkyl” and “heterocycloalkyl” contemplate both substituted and unsubstituted cyloalkyl and heterocycloalkyl groups. The cycloalkyl group and heterocycloalkyl group can be substituted with one or more groups including, but not limited to, alkyl, alkoxy, alkenyl, alkynyl, cycloalkyl, heterocycloalkyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, nitro, silyl, sulfo-oxo, or thiol. A cycloalkyl group which contains no double or triple carbon-carbon bonds is designated a saturated cycloalkyl group, whereas a cycloalkyl group having one or more such bonds (yet is still not aromatic) is designated an unsaturated cycloalkyl group. Unless specified to the contrary, the term cycloalkyl embraces both saturated and unsaturated, non-aromatic, ring systems.

The term “aryl” as used herein is an aromatic ring composed of carbon atoms. Examples of aryl groups include, but are not limited to, phenyl and naphthyl, etc. The term “heteroaryl” is an aryl group as defined above where at least one of the carbon atoms of the ring is replaced with a heteroatom such as, but not limited to, nitrogen, oxygen, sulfur, selenium or phosphorus. The aryl group and heteroaryl group can be substituted or unsubstituted. Unless stated otherwise, the terms “aryl” and “heteroaryl” contemplate both substituted and unsubstituted aryl and heteroaryl groups. The aryl group and heteroaryl group can be substituted with one or more groups including, but not limited to, alkyl, alkoxy, alkenyl, alkynyl, cycloalkyl, heterocycloalkyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, nitro, silyl, sulfo-oxo, or thiol.

Exemplary heteroaryl and heterocyclyl rings include: benzimidazolyl, benzofuranyl, benzothiofuranyl, benzothiophenyl, benzoxazolyl, benzoxazolinyl, benzthiazolyl, benztriazolyl, benztetrazolyl, benzisoxazolyl, benzisothiazolyl, benzimidazolinyl, carbazolyl, 4aHl carbazolyl, carbolinyl, chromanyl, chromenyL cirrnolinyl, decahydroquinolinyl, 2H,6H˜1,5,2-dithiazinyl, dihydrofuro[2,3b]tetrahydrofuran, furanyl, furazanyl, imidazolidinyl, imidazolinyl, imidazolyl, 1H-indazolyl, indolenyl, indolinyl, indolizinyl, indolyl, 3H-indolyl, isatinoyl, isobenzofuranyl, isochromanyl, isoindazolyl, isoindolinyl, isoindolyl, isoquinolinyl, isothiazolyl, isoxazolyl, methylenedioxyphenyl, morpholinyl, naphthyridinyl, octahydroisoquinolinyl, oxadiazolyl, 1,2,3-oxadiazolyl, 1,2,4-oxadiazolyl, 1,2,5-oxadiazolyl, 1,3,4-oxadiazolyl, oxazolidinyl, oxazolyl, oxindolyl, pyrimidinyl, phenanthridinyl, phenanthrolinyl, phenazinyl, phenothiazinyl, phenoxathinyl, phenoxazinyl, phthalazinyl, piperazinyl, piperidinyl, piperidonyl, 4-piperidonyl, piperonyl, pteridinyl, purinyl, pyranyl, pyrazinyl, pyrazolidinyl, pyrazolinyl, pyrazolyl, pyridazinyl, pyridooxazole, pyridoimidazole, pyridothiazole, pyridinyl, pyridyl, pyrimidinyl, pyrrolidinyl, pyrrolinyl, 2H-pyrrolyl, pyrrolyl, quinazolinyl, quinolinyl, 4H-quinolizinyl, quinoxalinyl, quinuclidinyl, tetrahydrofuranyl, tetrahydroisoquinolinyl, tetrahydroquinolinyl, tetrazolyl, 6H-1,2,5-thiadiazinyl, 1,2,3-thiadiazolyl, 1,2,4-thiadiazolyl, 1,2,5-thiadiazolyl, 1,3,4-thiadiazolyl, thianthrenyl, thiazolyl, thienyl, thienothiazolyl, thienooxazolyl, thienoimidazolyl, thiophenyl, and xanthenyl.

The terms “alkoxy.” “cycloalkoxy,” “heterocycloalkoxy,” “cycloalkoxy,” “aryloxy,” and “heteroaryloxy” have the aforementioned meanings for alkyl, cycloalkyl, heterocycloalkyl, aryl and heteroaryl, further providing said group is connected via an oxygen atom.

As used herein, the term “substituted” is contemplated to include all permissible substituents of organic compounds. In a broad aspect, the permissible substituents include acyclic and cyclic, branched and unbranched, carbocyclic and heterocyclic, and aromatic and nonaromatic substituents of organic compounds. Illustrative substituents include, for example, those described below. The permissible substituents can be one or more and the same or different for appropriate organic compounds. For purposes of this disclosure, the heteroatoms, such as nitrogen, can have hydrogen substituents and/or any permissible substituents of organic compounds described herein which satisfy the valencies of the heteroatoms. This disclosure is not intended to be limited in any manner by the permissible substituents of organic compounds. Also, the terms “substitution” or “substituted with” include the implicit proviso that such substitution is in accordance with permitted valence of the substituted atom and the substituent. and that the substitution results in a stable compound, e.g., a compound that does not spontaneously undergo transformation such as by rearrangement, cyclization, elimination, etc. Unless specifically stated, a substituent that is said to be “substituted” is meant that the substituent can be substituted with one or more of the following: alkyl, alkoxy, alkenyl, alkynyl, cycloalkyl, heterocycloalkyl, aryl, heteroaryl, aldehyde, amino, carboxylic acid, ester, ether, halide, hydroxy, ketone, nitro, silyl, sulfo-oxo, or thiol. In a specific example, groups that are said to be substituted are substituted with a protic group, which is a group that can be protonated or deprotonated, depending on the pH.

The skilled person will understand that when the disclosed compounds bear ionizable functional groups (e.g., amines, carboxylic acids, sulfonic acids, etc) the compounds may be protonated or deprotonated depending on pH. Unless specifically stated otherwise, the structure of a compound embraces all ionized forms of the compound as well. Acceptable salts of the disclosed compounds may be formed under conventional conditions. Examples of such salts are acid addition salts formed with inorganic acids, for example, hydrochloric, hydrobromic, sulfuric, phosphoric, and nitric acids and the like; salts formed with organic acids such as acetic. oxalic, tartaric, succinic, maleic, fumaric, gluconic, citric, malic, methanesulfonic, p-toluenesulfonic, napthalenesulfonic, and polygalacturonic acids, and the like; salts formed from elemental anions such as chloride, bromide, and iodide; salts formed from metal hydroxides, for example, sodium hydroxide, potassium hydroxide, calcium hydroxide, lithium hydroxide, and magnesium hydroxide; salts formed from metal carbonates, for example, sodium carbonate, potassium carbonate, calcium carbonate, and magnesium carbonate; salts formed from metal bicarbonates, for example, sodium bicarbonate and potassium bicarbonate; salts formed from metal sulfates, for example, sodium sulfate and potassium sulfate; and salts formed from metal nitrates, for example, sodium nitrate and potassium nitrate.

Disclosed herein are a family of oligosaccharides and intermediates useful as analytical standards, and for applications such as glycopeptide synthesis, microarray development, and others. The oligosaccharides can have the formula:

embedded image

- wherein R¹and R⁴are as defined herein, R^MG2, R^MG3, R^MG4, and R^MG5, are cach independently chosen from H, GlcNAc, or modified GleNAc. As used herein, a modified GlcNAc refers to a 2-deoxygluco residue bearing substituted or unsubstituted nitrogen atom at the 2-position.

In some embodiments, the invention relates to a modified GlcNAc glycosyl donor having the formula:

embedded image

- wherein PN represents a phosphonucleotide and PG represents a protective group that is tolerated by glycosyltransferase enzymes. After glycosylation, the protective group can be removed to yield the corresponding 2-deoxyglucosamine, which can be further derivatized using appropriate chemistries. Because the 2-deoxyglucosamine and derivatives thereof are not substrates for galactosyltransferase, the present invention provides methods of selectively preparing a vast number of different oligosaccharide compounds.

Disclosed herein are oligosaccharide compounds having the formula:

embedded image

- and salts thereof, wherein
- R1 can be OH, or a residue having the formula:

embedded image

- wherein R^ozcan be H, C_1-4alkyl (preferably CH₃), aryl, CF₃, CCl₃,
- n can be 1 or 0;
- R^facan be H or a fucose residue having the structure:

embedded image

- R²can be OR^c, NR²ⁿOR^c; NR²ⁿR^cand R³can be OR^cor R^c;
  - wherein:
  - R²ⁿcan be H and C_1-4alkyl;
  - R^ccan be X^pH, X^pC_1-8alkyl, X^pC_1-8alkylaryl, X^paryl, X^pfluorescent marker, or an amino acid residue having the formula:

embedded image

- wherein R^aa1and R^aa2can be H or additional amino acid residues, for instance as found in a protein, polypeptide, monoclonal antibody, etc. In certain embodiments, R^aa1can be Cbz (carboxybenzyl ether), and R^aa2can be H.
- R³can be X^pC₁-galkyl, X^pC_1-8alkylaryl, or X^paryl;
- wherein X^Pcan be null or a polymer, preferably a polyethylene glycol —(CH₂CH₂O)₂—, wherein z is 1-500.
- R⁴can be N₃, NR^4aR^4b,
- wherein:
- R^4aand R^4bare the same or different, and can be H, Z-X⁴wherein Z can be null, C═O, SO₂, and
- X⁴can be C_1-4alkyl, O—C_1-4alkyl, C_1-4haloalkyl, O—C_1-4haloalkyl, C_1-4alkylaryl, O—C_1-4alkylaryl, aryl, O-aryl; or R^4aand R^4bcan together form a ring; or R⁴can be a radical having the formula:

embedded image

- and one or both of R^4cor R^4dconstitute a conjugated payload.

In certain embodiments, R^4acan be H and R^4bcan be methoxymethyl, methylthiomethyl, p-methoxybenzyloxymethyl, p-nitrobenzyloxymethyl, t-butoxymethyl, 2-methoxyethoxymethyl, 1-ethoxyethyl, allyl, p-methoxybenzyloxycarbonyl (Moz), p-nitrobenzyloxycarbonyl (PNZ), trimethylsilyl, diethylisopropylsilyl, triphenylsilyl, formyl, chloroacetyl, methanesulfonyl, tosyl, benzylsulfonyl, methoxymethylcarbonyl, benzyloxycarbonyl, carboxybenzyl (Cbz), t-butyloxycarbonyl (BOC), 9-fluorenylmethylcarbonyl, N-phenylcarbamoyl, [2-(trimethylsilyl)ethoxy]methyl, or 4,4′-dimethoxytrityl.

While R², when present, can be in either the α or β configuration, or as a mixture of anomers, it is be preferred that R²is in the β configuration:

embedded image

In some instances, n can be zero, e.g., R¹is a residue having the formula:

embedded image

In other embodiments, n is one, e.g., R¹is a residue having the formula:

embedded image

In some instances, R^ccan be C^1-8alkyl, C_1-8alkylaryl, a polymer such as polyethylene glycol, or aryl bearing functional group enabling covalent or affinity-based immobilization for a microarray slide. For instance, R^ccan have the formula —(CH₂)_acR^im, wherein nc is an integer from 1-8, and R^imcan be —NH₂, —SH, —OH, —COOH, —N₃, C≡CH, a Michael acceptor such as vinyl sulfone or maleimide, SO₃, OSO₃, or a biotin residue having the formula:

embedded image

- wherein X^btis selected from null, O, NH, or S.

In other instances, R^ccan be a group suitable for UV-VIS or fluorescent detection. Suitable aryl groups typically include polyaromatic systems such as coumarins, rhodamines, fluoresceins, cyanines, eosins, erythrosins, and the like.

In other instances, R^ccan be a C_1-8alkylaryl or aryl group suitable for solid phase extractions. Exemplary aryl groups include phenyls and naphthyls bearing two or more sulfonate residues:

embedded image

- wherein np is from 0-50, ns is from 2-7, and ns′ is from 2-5, with two sulfonate groups in an ortho configuration being particularly preferred:

embedded image

In some embodiments, R^ccan be an aryl group useful as a tag is LC-MS, microarray, or capillary electrophoresis analysis. For example R^ccan be an aryl (e.g., phenyl, naphthyl, anthracene, phenanthrene, phenalene, tetracene, chrysene, triphenylene, pyrene and the like) residue substituted one or more times by carboxylic acids, carboxamides (especially primary carboxamides), sulfonates, reporter groups such as quaternary ammonium salts and the like. By way of example, suitable tags include residues having the formula:

embedded image

- wherein ne is from 2-8.

Also disclosed herein are oligosaccharides having the formula:

embedded image

- wherein R¹and R⁴are as defined above, and R⁵can be N₃, NR^5aR^5b, wherein R^5aand R^5bare the same or different, and can be H, Z-X⁵wherein Z can be null, C═O, SO₂, and X⁴can be C_1-4alkyl, O—C_1-4alkyl, C_1-4haloalkyl, O—C_1-4haloalkyl, C_1-4alkylaryl, O—C_1-4alkylaryl, aryl, O-aryl; or R^5aand R^5bcan together form a ring; or R⁵can be a radical having the formula:

embedded image

- wherein one or both of R^5cor R^5dconstitute a conjugated payload.

In certain embodiments, R^5acan be H and R^5bcan be methoxymethyl, methylthiomethyl, p-methoxybenzyloxymethyl, p-nitrobenzyloxymethyl, t-butoxymethyl, 2-methoxyethoxymethyl, 1-ethoxyethyl, allyl, p-methoxybenzyloxycarbonyl (Moz), p-nitrobenzyloxycarbonyl (PNZ), trimethylsilyl, diethylisopropylsilyl, triphenylsilyl, formyl, chloroacetyl, methanesulfonyl, tosyl. benzylsulfonyl, methoxymethylcarbonyl, benzyloxycarbonyl, carboxybenzyl (Cbz), t-butyloxycarbonyl (BOC), 9-fluorenylmethylcarbonyl, N-phenylcarbamoyl. [2-(trimethylsilyl)ethoxy]methyl, or 4,4′-dimethoxytrityl.

In certain embodiments, the R⁵bearing residue can be an isotopically enriched GlcNAc or modified GlcNAc, for instance in which one or more of the carbon atoms are enriched with ¹³C above naturally occurring levels. In some embodiments, R⁵bearing residue is a ¹³C₆enriched residue, which herein means that each of the ring carbons and the C-6 carbon are enriched with ¹³C above naturally occurring levels. When R⁵is NHCOCH₃, one or both of those carbon atoms may also be ¹³C enriched. Such residues are termed ¹³C₇and ¹³C₈enriched residues, respectively. The isotopic enrichment can be at least 90%, at least 95%, at least 98%, or at least 99%.

Also disclosed herein are oligosaccharides having the formula:

embedded image

- wherein R¹, R⁴, and R⁵are as defined above;
- R⁶can be hydrogen or a residue having the formula:

embedded image

- wherein R⁹can be N₃, NR^9aR^9b, wherein R^9aand R^9bare the same or different, and can be H, Z-X⁹wherein Z can be null, C═O, SO₂, and X⁹can be C_1-4alkyl, O—C_1-4alkyl, C_1-4haloalkyl, O—C_1-4haloalkyl, C_1-4alkylaryl, O—C_1-4alkylaryl, aryl, O-aryl; or R^9aand R^9bcan together form a ring; or
- R⁹can be a radical having the formula:

embedded image

- wherein one or both of R^9cor R^9dconstitute a conjugated payload.

In certain embodiments, R^9acan be H and R^9bcan be methoxymethyl, methylthiomethyl, p-methoxybenzyloxymethyl, p-nitrobenzyloxymethyl, t-butoxymethyl, 2-methoxyethoxymethyl, 1-ethoxyethyl, allyl, p-methoxybenzyloxycarbonyl (Moz), p-nitrobenzyloxycarbonyl (PNZ), trimethylsilyl, diethylisopropylsilyl, triphenylsilyl, formyl, chloroacetyl, methanesulfonyl, tosyl, benzylsulfonyl, methoxymethylcarbonyl, benzyloxycarbonyl, carboxybenzyl (Cbz), t-butyloxycarbonyl (BOC), 9-fluorenylmethylcarbonyl, N-phenylcarbamoyl, [2-(trimethylsilyl)ethoxy]methyl, or 4,4′-dimethoxytrityl.

- R⁷can be hydrogen or a residue having the formula:

embedded image

- wherein R¹⁰can be N₃, NR^10aR^10b, wherein R^10aand R^10bare the same or different, and can be H, Z-X¹⁰wherein Z can be null, C═O, SO₂, and X⁴can be C_1-4alkyl, O—C_1-4alkyl. C_1-4haloalkyl, O—C_1-4haloalkyl, C_1-4alkylaryl, O—C_1-4alkylaryl, aryl, O-aryl; or R^10aand R^10bcan together form a ring; or R¹⁰can be a radical having the formula:

embedded image

- wherein one or both of R^10cor R^10dconstitute a conjugated payload.

In certain embodiments, R^10acan be H and R^10bcan be methoxymethyl. methylthiomethyl, p-methoxybenzyloxymethyl, p-nitrobenzyloxymethyl, t-butoxymethyl, 2-methoxyethoxymethyl. 1-ethoxyethyl, allyl, p-methoxybenzyloxycarbonyl (Moz), p-nitrobenzyloxycarbonyl (PNZ), trimethylsilyl, diethylisopropylsilyl, triphenylsilyl, formyl, chloroacetyl, methanesulfonyl, tosyl, benzylsulfonyl, methoxymethylcarbonyl, benzyloxycarbonyl, carboxybenzyl (Cbz), t-butyloxycarbonyl (BOC), 9-fluorenylmethylcarbonyl, N-phenylcarbamoyl. [2-(trimethylsilyl)ethoxy]methyl, or 4,4′-dimethoxytrityl.

- R¹²can be hydrogen or a residue having the formula:

embedded image

- wherein R¹³can be hydrogen or a residue having the formula:

embedded image

- wherein R^13bis selected from H and a conjugated cargo moiety, and R^13ais selected from OH and a conjugated cargo moiety;
- R¹⁴can be hydrogen or a residue having the formula:

embedded image

- wherein R^14bis selected from H and a conjugated cargo moiety, and R^14ais selected from OH and a conjugated cargo moiety;
- R⁸can be hydrogen or a residue having the formula:

embedded image

- wherein R¹¹can be N₃, NR^11aR^11b, wherein R^11aand R^11bare the same or different, and can be H, Z-X¹¹wherein Z can be null, C═O, SO₂, and X¹¹can be C_1-4alkyl, O—C_1-4alkyl, C_1-4haloalkyl, O—C_1-4haloalkyl, C_1-4alkylaryl, O—C_1-4alkylaryl, aryl, O-aryl; or R^11aand R^11bcan together form a ring; or R¹¹can be a radical having the formula:

embedded image

- wherein one or both of R^11cor R^11dconstitute a conjugated payload.

In certain embodiments, R^11acan be H and R^11bcan be methoxymethyl, methylthiomethyl, p-methoxybenzyloxymethyl, p-nitrobenzyloxymethyl, t-butoxymethyl, 2-methoxyethoxymethyl, 1-ethoxyethyl, allyl, p-methoxybenzyloxycarbonyl (Moz), p-nitrobenzyloxycarbonyl (PNZ), trimethylsilyl, diethylisopropylsilyl, triphenylsilyl, formyl, chloroacetyl, methanesulfonyl, tosyl, benzylsulfonyl, methoxymethylcarbonyl, benzyloxycarbonyl, carboxybenzyl (Cbz), t-butyloxycarbonyl (BOC), 9-fluorenylmethylcarbonyl, N-phenylcarbamoyl, [2-(trimethylsilyl)ethoxy]methyl, or 4,4′-dimethoxytrityl.

Disclosed herein are oligosaccharides having the formula:

embedded image

- wherein R¹, R⁴, R⁵, R⁶, R⁷, and R⁸have the meanings given above;
- R¹⁵can hydrogen or a residue having the formula:

embedded image

- wherein R¹⁷can be hydrogen or a residue having the formula:

embedded image

- wherein R^17bcan be H or a conjugated cargo moiety, and R^17acan be OH or a conjugated cargo moiety;
- R¹⁸can be hydrogen or a residue having the formula:

embedded image

- wherein R^18bis can be H or a conjugated cargo moiety, and R^18acan be OH or a conjugated cargo moiety;
- R¹⁶can be hydrogen or a residue having the formula:

embedded image

- wherein R¹⁹can be hydrogen or a residue having the formula:

embedded image

- wherein R^18bcan be H or a conjugated cargo moiety, and R^18acan be OH or a conjugated cargo moiety;
- R²⁰can be hydrogen or a residue having the formula:

embedded image

- wherein R^20bcan be H or a conjugated cargo moiety, and R^20acan be OH and a conjugated cargo moiety.

As used herein, a conjugated payload in the context of the triazole residues described above can be formed using click chemistry cycloadditions between an azide and alkyl or alkene:

embedded image

- wherein Y is a cytotoxic drug or tracer compound, and L is a linker. In some instances the conjugated payload/triazole can have the formula:

embedded image

- wherein R⁰is in each case independently selected from hydrogen, halogen, C_1-8alkyl, C_1-8alkoxy, aryl, C_1-8heteroaryl, C_3-8cycloalkyl, or C_1-8heterocyclyl; wherein any two or more R⁰groups can together form a ring;
- E¹can be:

embedded image

- wherein L^C1can be null, cleavable linker, and non-cleavable linker. In some embodiments, the conjugated payload/triazole can have the formula:

embedded image

- wherein R^eis selected from hydrogen, or either *—OSO₃X¹or *—OPO₃X₁, wherein X₁is selected from H, C_1-8alkyl, or a pharmaceutically acceptable cation. In other embodiments, each of R⁰is hydrogen.

Compounds disclosed herein can be prepared from by selectively glycosylating an appropriate acceptor with a GlcNHC(O)R^adonor as shown below:

embedded image

- wherein R¹has the meaning given above, and R^ais CX₃, CHX₂, CH₂X, CH₃, OBn, wherein X is in each case independently selected from F, Cl, Br, and I, and UDP is a residue having the formula:

embedded image

The above glycosylation may be carried out using a suitable glycosyltransferase, for instance α-1,3-mannosyl-glycoprotein 2-β-N-acetylglucosaminyltransferase, referred to herein as MGAT1.

In certain embodiments, when R^ais CX₃, CHX₂, CH₂X, or OBn, the haloacetamide or Cbz group can be removed to furnish the free amine, which can be further elaborated to R⁴as defined herein:

embedded image

In some embodiments, R⁴can be azide. After glycosylation with MGAT1, the resulting product (either with or without conversion to the free amine or the R⁴group) can be selectively glycosylated with a further GlcNHC(O)R^adonor:

embedded image

The above glycosylation may be carried out using a suitable glycosyltransferase, for instance α-1,6-mannosyl-glycoprotein 2-β-N-acetylglucosaminyltransferase, referred to herein as MGAT2. In preferred embodiments, R⁴and GlcNHC(O)R^aare not the same. The resulting oligosaccharide can be further elaborated as described above:

embedded image

The disclosed compounds are useful intermediates for a variety of additional selective transformations: Use of any of MGAT3, MGAT4, and/or MGAT5 permits selective installation of GlcNHC(O)R^aresidues on the oligosaccharide, which can be converted to R⁶, R⁷, and R⁸residues as defined above.

embedded image

As R⁴and R⁵can be selected to deactivate those GlcNAc derivatives to galactosyltransferases (e.g., when neither R⁴nor R⁵are NHC(O)R^a, selective elaboration of the oligosaccharide at other arms of the glycan can be achieved:

embedded image

Also disclosed herein are oligosaccharides having the formula:

embedded image

- wherein R¹, R⁴, and R⁵are as defined above;
- R⁶can be hydrogen or a residue having the formula:

embedded image

- wherein R⁹can be N₃, NR^9aR^9b, wherein R^9aand R^9bare the same or different, and can be H, Z-X⁹wherein Z can be null, C═O, SO₂, and X⁹can be C_1-4alkyl, O—C_1-4alkyl, C_1-4haloalkyl, O—C_1-4haloalkyl, C_1-4alkylaryl, O—C_1-4alkylaryl, aryl, O-aryl; or R^9aand R^9bcan together form a ring; or
- R⁹can be a radical having the formula:

embedded image

- wherein one or both of R^9cor R^9dconstitute a conjugated payload.

Disclosed herein are compounds listed in the following table:

embedded image

The compounds may be characterized by a natural isotopic abundance for the indicated GlcNAc residue, or that residue may be ¹³C-enriched as defined herein.

Compound 1

R^1a, R^1b, R^1c, R², R³, R⁴, R⁵
H

Compound 2

R^1b, R^1c, R², R³, R⁴, R⁵
H

R^1a

embedded image

Compound 3

R^1b, R^1c, R², R³, R⁵
H

R^1a

embedded image

R⁴

R⁸and R⁹= H

Compound 4

R^1a, R^1b, R^1c, R², R⁵
H

R³

embedded image

R⁶and R⁷= H

R⁴

embedded image

R⁸=

R⁹= H

Compound 5

R^1b, R^1c, R², R⁵
H

R^1a

embedded image

R³

R⁶and R⁷= H

R⁴

embedded image

R⁸and R⁹= H

Compound 6

R^1a, R^1b, R^1c, R², R⁵
H

R³

embedded image

R⁶and R⁷= H

R⁴

embedded image

R⁸and R⁹= H

Compound 7

R^1a, R^1b, R^1c, R², R⁵
H

R³

embedded image

and R⁶= H

R⁴

embedded image

R⁸and R⁹= H

Compound 8

R^1a, R^1b, R^1c, R², R⁵
H

R³

embedded image

and R⁶= H

R⁴

embedded image

R⁹=

and R⁸= H

Compound 9

R^1a, R^1b, R^1c, R², R⁵
H

R³

embedded image

R⁷=

and R⁶= H

R⁴

embedded image

R⁸=

and R⁹= H

Compound 10

R^1a, R^1b, R^1c, R², R⁵
H

R³

embedded image

R⁶=

and R⁷= H

R⁴

embedded image

R⁸=

and R⁹= H

Compound 11

R^1a, R^1b, R^1c, R², R⁵
H

R³

embedded image

R⁶=

and R⁷= H

R⁴

embedded image

R⁹=

and R⁸= H

Compound 12

R^1b, R^1c, R², R⁵
H

R^1a

embedded image

R³

R⁷=

and R⁶= H

R⁴

embedded image

R⁸and R⁹= H

Compound 13

R^1b, R^1c, R², R⁵
H

R^1a

embedded image

R³

R⁷=

and R⁶= H

R⁴

embedded image

R⁹=

and R⁸= H

Compound 14

R^1b, R^1c, R⁵
H

R^1a

embedded image

R²

R³

R⁶=

and R⁷= H

R⁴

embedded image

R⁹=

and R⁸= H

Compound 15

R^1b, R^1c, R⁵
H

R^1a

embedded image

R²

R³

R⁶=

and R⁷= H

R⁴

embedded image

R⁹and R⁸= H

Compound 16

R^1b, R^1c, R⁵
H

R^1a

embedded image

R²

R³

R⁶and R⁷= H

R⁴

embedded image

R⁹and R⁸= H

Compound 17

R^1a, R^1b, R^1c, R²
H

R³

embedded image

R⁶and R⁷= H

R⁴

embedded image

R⁸and R⁹= H

R⁵

embedded image

R^1dand R¹⁰= H

Compound 18

R^1a, R^1b, R^1c, R²
H

R³

embedded image

R⁶and R⁷= H

R⁴

embedded image

R⁸and R⁹= H

R⁵

embedded image

R¹⁰=

R^1d, R¹¹, R¹⁰= H

Compound 19

R^1a, R^1b, R^1c, R²
H

R³

embedded image

R⁷=

R⁶= H

R⁴

embedded image

R⁸and R⁹= H

R⁵

embedded image

R¹⁰=

R^1d, R¹¹, R¹⁰= H

Compound 20

R^1a, R^1b, R^1c, R²
H

R³

embedded image

R⁷and R⁶= H

R⁴

embedded image

R⁹=

R⁸= H

R⁵

embedded image

R¹⁰=

R^1d, R¹¹, R¹⁰= H

Compound 21

R^1a, R^1b, R^1c, R²
H

R³

embedded image

R⁷and R⁶= H

R⁴

embedded image

R⁹and R⁸= H

R⁵

embedded image

R¹⁰=

R^1dand R¹⁰= H

R¹¹= embedded image

Compound 22

R^1a, R^1b, R^1c, R²
H

R³

embedded image

R⁷=

R⁶= H

R⁴

embedded image

R⁹=

R⁸= H

R⁵

embedded image

R¹⁰=

R^1d, R¹¹, R¹⁰= H

Compound 23

R^1a, R^1b, R^1c, R²
H

R³

embedded image

R⁷and R⁶= H

R⁴

embedded image

R⁹=

R⁸= H

R⁵

embedded image

R¹⁰=

R^1dand R¹⁰= H;

R¹¹= embedded image

Compound 24

R^1a, R^1b, R^1c, R²
H

R³

embedded image

R⁷=

R⁶= H

R⁴

embedded image

R⁸and R⁹= H

R⁵

embedded image

R¹⁰=

R^1dand R¹⁰= H

R¹¹= embedded image

The disclosed compounds are useful as analytical standard for the characterization and quantification of N-glycans. In some embodiments are provided kits containing at least one, isolated compound in a vial. Preferably, the compound will be present in the vial as a lyophilized mixture, optionally in combination with one or more inert bulking or stabilizing agents. The purity of the compound in the vial can be at least 90%, at least 95%, at least 98%, or at least 99%, as measured by HPLC. Some kits may contain multiple vials, each containing a single compound different from the rest. Exemplary kits may include at least 2 compounds, at least 3 compounds, at least 5 compounds, at least 8 compounds, at least 10 compounds, at least 12 compounds, at least 15 compounds, or at least 20 compounds disclosed herein.

Also disclosed are oligosaccharides having the formula:

embedded image

- and salts thereof, wherein n is 1 or 0;
- R^ais N₃, or NR^4aR^4b.
- wherein R^4aand R^4bare independently selected from H or Z-X⁴,
- wherein Z is null, C═O, or SO₂, and X₄is C_1-4alkyl, O—C_1-4alkyl, C_1-4haloalkyl, O—C_1-4haloalkyl, C_1-4alkylaryl, O—C_1-4alkylaryl, aryl, or O-aryl; or
- R^4aand R^4bcan together form a ring; or
- R^ais a radical having the formula:

embedded image

- wherein one or both of R^4cor R^4dconstitute a conjugated payload;
- R^bis N₃, or NR^4aR^4b,
- wherein R^4aand R^4bare independently selected from H or Z-X⁴,
- wherein Z is null, C═O, or SO₂, and X⁴is C_1-4alkyl, O—C_1-4alkyl, C_1-4haloalkyl, O—C_1-4haloalkyl, C_1-4alkylaryl, O—C_1-4alkylaryl, aryl, or O-aryl; or
- R^4aand R^4bcan together form a ring; or R^bis a radical having the formula:

embedded image

- wherein one or both of R^4cor R^4dconstitute a conjugated payload;
- provided that R^aand R^bare not both NHC(O)CH₃;
- R^1ais hydrogen or α-(L)-fucose, and
- R³, R², and R^g1are independently selected from hydrogen or further carbohydrate, for instance a monosaccharide or oligosaccharide. Exemplary R³, R², and R^g1groups include N-acetylglucosamine (GlcNAc), galactose (Gal), sialic acid (Neu5Ac), and oligosaccharide comprising the same. Exemplary sequences are depicted in Figure

In some instances R³can be a moiety having the formula:

embedded image

- wherein R⁴is selected from hydrogen or further carbohydrate, for instance a monosaccharide or oligosaccharide. In some instances R⁴can be a moiety having the formula:

embedded image

- wherein R⁵is selected from hydrogen or further carbohydrate, for instance a monosaccharide or oligosaccharide. In some instances R⁵can be a moiety having the formula:

embedded image

- wherein R⁶is selected from hydrogen or further carbohydrate, for instance a monosaccharide or oligosaccharide. In some instances R⁶can be a moiety having the formula:

embedded image

- wherein R⁷is selected from hydrogen or further carbohydrate, for instance a monosaccharide or oligosaccharide. In some instances R⁷can be a moiety having the formula:

embedded image

- wherein R^c3is selected from hydrogen or conjugated payload, and R^c4is selected from OH or conjugated payload.

In some instances R²can be a moiety having the formula:

embedded image

- wherein R⁴is hydrogen or further carbohydrate, for instance a monosaccharide or oligosaccharide. In some instances R⁴can be a moiety having the formula:

embedded image

- wherein R⁵is hydrogen or further carbohydrate, for instance a monosaccharide or oligosaccharide. In some instances R⁵can be a moiety having the formula:

embedded image

- wherein R⁶is hydrogen or further carbohydrate, for instance a monosaccharide or oligosaccharide. In some instances R⁶can be a moiety having the formula:

embedded image

- wherein R⁷is hydrogen or further carbohydrate, for instance a monosaccharide or oligosaccharide. In some instances R⁷can be a moiety having the formula:

embedded image

- wherein R⁸is hydrogen or a carbohydrate moiety having the formula:

embedded image

- wherein R^c3is selected from hydrogen or conjugated payload, and R^c4is selected from OH or conjugated payload.
- R^g1is hydrogen or a carbohydrate moiety having the formula:

embedded image

- R²is hydrogen or a carbohydrate moiety having the formula:

embedded image

- wherein R⁴is hydrogen or a carbohydrate moiety having the formula:

embedded image

- wherein R⁵is hydrogen or a carbohydrate moiety having the formula:

embedded image

- wherein R⁶is hydrogen or a carbohydrate moiety having the formula:

embedded image

- wherein R⁷is hydrogen or a carbohydrate moiety having the formula:

embedded image

- wherein R⁸is hydrogen or a carbohydrate moiety having the formula:

embedded image

- wherein R^c3is selected from hydrogen or conjugated payload, and R^c4is selected from OH or conjugated payload.

EXAMPLES

The following examples are for the purpose of illustration of the invention only and are not intended to limit the scope of the present invention in any manner whatsoever.

¹H spectra were recorded on a 600 MHZ Varian Inova or an Agilent 900 MHZ DD2 spectrometer with a triple resonance (HCN) cryogenically cooled probe spectrometer. Chemical shifts are reported in parts per million (ppm) relative to H1 and C1 of reducing N-acetylglucosamine which were set to δ 5.08 and 78.02 repectivelly as the internal standard. NMR data is represented as follows: Chemical shift, multiplicity (s=singlet, d=doublet, t=triplet, dd=doublet of doublets, m=multiplet and/or multiple resonances, br.=broad signal), J coupling, integration, and peak identity. NMR signals were assigned based on ¹H NMR, gCOSY, gHSQC, zTOCSY, and NOESY experiments. Enzymatic reactions were monitored by mass spectrometry recorded on an Applied Biosystems SCIEX MALDI TOF/TOF 5800 using 2.5-dihydroxybenzoic acid (DHB) as a matrix or a Shimadzu 20AD UFLC LCMS-IT-TOF. Reagents were purchased from Sigma-Aldrich (unless otherwise noted) and used without further purification. HILIC-HPLC purification of compounds was performed on a Shimadzu 20AD UFLC LCMS-IT-TOF with a Waters XBridge BEH, Amide column, 5 μm, 10×250 mm. HPLC grade acetonitrile and water were purchased from Fischer. Uridine 5′-diphosphogalactose diphosphate galactose (UDP-Gal) and cytidine-5′monophospho-N-acetylneuraminc acid (CMP-Neu5Ac) were both purchased from Roche, uridine 5′-diphospho-N-acetylglucosamine (UDP-GlcNAc) was purchased from Sigma-Aldrich, and guanosine 5′-diphospho-β-L-fucose (GDP-Fuc) was purchased from Carbosynth.

2b. Extraction, Isolation and Trimming of SGP
Sialyl Glycopeptide (SGP, 5) Extraction

SGP (5) was extracted according to our previously reported procedure¹. In short, commercially available egg yolk powder (Natural Foods, Inc., 2.27 Kg) was suspended twice in 95% ethanol (4 L) and mechanically stirred for 2 h at room temperature to remove lipids and other organic soluble components. The filtrate was discarded and the insoluble powder was suspended twice in aqueous ethanol (40% w/v ethanol, 3 L) solution. The insoluble material was discarded and the filtrate was concentrated under reduced pressure at 40° C. The resulting translucent liquid was purified using an active carbon/celite column (500 g of active carbon and 500 g celite). Impurities were removed by flushing the column with 3 L of water (0.1% v/v TFA), 3 L of 5% acetonitrile in water (0.1% v/v TFA), and 3 L 10% acetonitrile in water (0.1% v/v TFA). The desired glycopeptide was released from the column using a solution of 25% acetonitrile in water (0.1% v/v TFA), and fractions containing the product were pooled and dried under reduced pressure. The resulting white powder was subjected to size-exclusion chromatography (Bio-Rad® P-2, fine particle size 45-90 μm, column dimensions 5.0 cm×80 cm, 250 mL fractions) eluting with 0.1 M ammonium bicarbonate to yield SGP (5) as a fluffy, white powder (1.82 g, or 0.8 mg SGP/g egg yolk powder).

Trimming and Modification of SGP to Prepare Glycosyl Asparagine-CBz 1¹

Isolated SGP 5 (319 mg) was dissolved in 5 mL of Tris buffer (100 mM, pH 8.0) containing 5 mM CaCl₂. Pronase from Streptomyces griseus (Sigma-Aldrich #P5147-1G, 150 mg) was added, and the reaction was incubated for 5 days at 37° C. with shaking. The reaction was monitored by ESI-MS and once complete the mixture was heated at 80° C. for 20 min followed by Pronase removal using an Amicon Ultra-10 (MWCO-10k) centrifugal filter. The filtrate was lyophilized and purified by size-exclusion chromatography (Bio-Rad P-2 BioGel, fine particle size 45-90 μm, 2×80 cm), eluting with a 0.1 M ammonium bicarbonate solution. The fractions containing the glycosylated asparagine were pooled, lyophilized, and dissolved in 5 mL of water. To this mixture was added K₂CO₃(1.1 g), and CBzCl (0.54 g, 3.2 mmol) drop-wise. The heterogeneous mixture was stirred vigorously at room temperature until ESI-MS indicated complete installation of the CBz-protecting group (6). The reaction was diluted with water (50 mL) and extracted with ethyl acetate (2×50 mL). The organic phase was discarded, and the aqueous phase was lyophilized and purified by size-exclusion chromatography using P-2 BioGel eluting with a 0.1 M ammonium bicarbonate solution. The fractions containing 6 were pooled, lyophilized, and re-dissolved in 5 mL of sodium acetate buffer (50 mM, pH 5.5) containing 5 mM CaCl₂. To this mixture was added neuraminidase from Clostridium perfringens (New England Biolabs #P0720L. 40 μL, 2000 units) and the reaction was incubated overnight at 37° C. with shaking at which time, ESI-MS indicated all the sialic acid residues had been removed. The pH of the reaction mixture was adjusted to 4.5 with acetic acid after which. BSA (5 mg) and β-galactosidase (200 μL, 800 units:) from Aspergillus niger (Megazyme #E-BGLAN) were added. The reaction was incubated at 37° C. with shaking overnight, after which another 150 μL of β-galactosidase were added. The reaction was monitored by ESI-MS and once complete galactose removal was observed the enzymes were removed using an Amicon Ultra-10 (MWCO-10k) centrifugal filter. The filtrate was lyophilized and purified by size-exclusion chromatography using P-2 BioGel eluting with a 0.1 M ammonium bicarbonate solution. The fractions containing the trimmed glycosyl asparagine-CBz were pooled, lyophilized, and dissolved in 10 mL of MES buffer (100 mM, pH 7.3). To this mixture BSA (1 mg), calf intestine alkaline phosphatase (CIAP. 100 μL, 2 kU/mL), GDP-Fucose (75 mg), and FUT8 (200 μL, 1 mg/mL) were added and the reaction was incubated overnight at 37° C. with shaking. The reaction was lyophilized and purified by size-exclusion chromatography using P-2 BioGel eluting with a 0.1 M ammonium bicarbonate solution. The fractions containing 1 were pooled, lyophilized, and subjected to HILIC-HPLC (see section 2f) for final purification to give the compound 1 (74 mg, 39%).

2c. Expression and Purification of Enzymes
Recombinant Expression and Purification of PmGlmU

The gene sequence of Pasteurella multocida N-acetylglucosamine-1-phosphate uridylyltransferase (PmGlmU) from Pasteurella multocida strain P-1059 (ATCC 15742) with a C-terminal His₆-tag²were synthesized, ligated into a pET15b plasmid using Ndel and Rhol restriction sites, and transformed into E. coli BL21 (DE3) cells by Genscript. E. coli BL21 cells harboring the pET15b-PmGlmU plasmid were cultured in LB medium containing ampicillin (100 μg/mL) at 37° C. until an OD_600umof 0.8-1.0 was reached. Protein expression was induced by the addition of isopropyl-1-thio-β-D-galactopyranoside (IPTG, final concentration 100 M) and cultures where incubated at 20° C. with rigorous shaking for 18 h. The cells were harvested by centrifugation (4,000×g) at 4° C. for 20 min and the resulting pellet was resuspended in lysis buffer (100 mM Tris-HCl, pH=8, containing 0.1% Triton X-100, lysozyme (100 μg/mL) and DNAse (5 μg/mL)). The cells were lysed by passing the suspension twice through a French Press at 10000 PSI and 4° C. and the lysate was clarified by centrifugation (10,000×g) at 4° C. for 45 min. Purification was performed by loading the supernatant onto a Ni-NTA superflow column pre-equilibrated with binding buffer (10 mM imidazole, 0.5 M NaCl, 50 mM Tris-HCl, pH=7.5). The column was washed with washing buffer (40 mM imidazole, 0.5 M NaCl, 50 mM Tris-HCl, pH=7.5) and the PmGlmU enzyme was eluted with elution buffer (200 mM imidazole, 0.5 M NaCl, 50 mM Tris-HCl, pH=7.5). Fractions containing purified PmGlmU enzyme were combined and 10% glycerol was added for storage at 4° C. From 1 L of culture medium 120-150 mg of PmGlmU was obtained.

Human Glycosyl Transferase Expression and Purification

The catalytic domains of human glycosyl transferases (as shown in the table below) were expressed as soluble, secreted fusion proteins by transient transfection of HEK293 suspension cultures^3,4. The coding regions were amplified from Mammalian Gene Collection clones, human tissue cDNAs, or generated by gene synthesis by a process that appended a tobacco etch virus (TEV) protease cleavage site⁵to the NH₂-terminal end of the coding region and attL1 and attL2 Gateway adaptor sites were extended on the 5′ and 3′ terminal ends of the coding region during transfer to pDONR²²¹vector backbone⁴. The pDONR²²¹clones were then recombined via LR clonase reaction into a custom Gateway adapted version of the pGEn2 mammalian expression vector4 to assemble a recombinant coding region comprised of a 25 amino acid NH2-terminal signal sequence from the T. cruzi lysosomal α-mannosidase⁶followed by an 8×His tag, 17 amino acid AviTag,⁷“superfolder” GFP⁸, the nine amino acid sequence encoded by attB1 recombination site, followed by the TEV protease cleavage site and the respective glycosyltransferase catalytic domain coding region.

Suspension culture HEK293 cells (Freestyle 293-F cells, Life Technologies, Grand Island, NY) were transfected as previously described^3,4and the culture supernatant was subjected to Ni²⁺-NTA superflow chromatography (Qiagen, Valencia, CA). Enzyme preparations were eluted with 300 mM imidazole, concentrated by ultrafiltration, and subjected to gel filtration on a Superdex 75 column (GE Healthcare) preconditioned with a buffer containing 20 mM HEPES, pH 7.0, 100 mM NaCl, 10% glycerol, 0.05% Na azide. Peak fractions were pooled and concentrated to ˜1 mg/mL using an ultrafiltration pressure cell membrane (Millipore, Billerica, MA) with a 10 kDa molecular weight cutoff.

2d. UDP-GleNTFA Preparation
Procedure for the One-Pot Three-Enzyme Preparation of UDP-GlcNTFA (4)

embedded image

GlcNTFA® (162 mg, 589 μmol), ATP (390 mg, 707 μmol) and UTP (390 mg, 707 μmol) were dissolved in 59 mL of 100 mM Tris-HCl buffer (pH=8.0) containing 10 mM MgCl₂. To this solution was added Bifidobacterium longum N-acetylhexosamine 1-kinase (NahK, 14 μg/μmol substrate), Pasteurella multocida N-acetylglucosamine-1-phosphate uridylyltransferase (PmGlmU. 17 μg/μmol substrate) and Pasteurella multocida inorganic pyrophosphatase (PmPpA, 7 μg/μmol substrate), and the reaction mixture was incubated overnight at 37° C. with gentle shaking. Reaction progress was monitored by ESI-TOF MS, and once complete 59 mL of cold ethanol was added and the mixture was incubated at 4° C. for 1 h. The reaction mixture was centrifuged and the supernatant was removed, concentrated, and purified by a P2 BioGel column using 0.1 M NH₄HCO₃as eluent, followed by silica gel column chromatography (4:2:1 EtOAc/MeOH/H₂O) afforded UDP-GleNTFA 4 (273 mg, 70%) as a white solid². ¹H NMR (500 MHz, D₂O): δ 7.97 (d, J=8.1 Hz, 1H, H6-Uridine), 5.99 (d, J=4.5 Hz, 1H, H1-Ribose), 5.98 (d, J=8.0 Hz, 1H, H5-Uridine), 5.63 (dd, J=7.0, 3.3 Hz, 1H, H1-GlcNTFA), 4.42-4.34 (m, 2H, H2-Ribose, H3-Ribose), 4.33-4.28 (m, 1H, H4-Ribose), 4.24 (dd, J=4.5, 2.7 Hz, 1H, H5-Ribose). 4.21 (dd, J=5.6, 3.0 Hz, 1H, H5′-Ribose), 4.12 (dt, J=10.8, 2.9 Hz, 1H, H2-GlcNTFA), 4.00-3.95 (m, 2H. H3-GlcNTFA. H4-GlcNTFA), 3.89 (dd, J=12.5, 2.3 Hz, 1H, H6-GlcNTFA), 3.83 (dd, J=12.6, 4.3 Hz, 1H, H6′-GlcNTFA), 3.62-3.59 (m, 1H, H5-GlcNTFA. ¹³C NMR (76 MHz, D₂O): δ 141.6 (C6-Uridine), 102.5 (C5-Uridine), 93.7 (C1-GlcNTFA), 88.5 (C1-Ribose), 83.0 (C4-Ribose), 73.7 (C2-Ribose), 73.0 (C3-GlcNTFA), 70.1 (C4-GlcNTFA), 69.5 (C3-Ribose), 69.4 (C5-GlcNTFA), 64.9 (C5-Ribose), 60.1 (C6-GlcNTFA), 54.3 (C2-GlcNTFA). ESI-MS m/z caled for C₁₇H₂₃F₃N₃O₁₇P₂. [M−H]⁻: 660.0460, found 660.0417.

2e. General Protocols for Enzymatic Reactions and Glycosyl Asparagine Modification
General procedure for the installation of core a 1,6 Fuc using FUT8

Glycosyl asparagine acceptor A2-Asn-Cbz (79 mg, 55 μmol) and GDP-Fuc (75 mg, 111.8 μmol) were dissolved at a final acceptor concentration of 10 mM in a MES buffered solution (100 mM, pH 7.5) containing BSA (1% total volume, stock solution=10 mg mL⁻¹). Calf intestine alkaline phosphatase (CIAP, 1% total volume, stock solution=1kU mL⁻¹) and FUT8 (40 μg/μmol acceptor) were added, and the reaction mixture was incubated overnight at 37° C. with gentle shaking. Reaction progress was monitored by ESI-TOF MS and if starting material remained after 18 h another portion of FUT8 was added until no starting material could be detected. The reaction mixture was centrifuged over a Nanosep® Omega ultrafiltration device (10 kDa MWCO) to remove reaction proteins and the filtrate was lyophilized. Purification by HPLC using a HILIC column (supporting information 2f) provided desired product as a white fluffy solid (74 mg, 85%).

General Procedure for the Installation of β1,3 GlcNAc Using B3GNT2

Glycosyl asparagine acceptor (1 eq) and UDP-GlcNAc (1.5 eq) were dissolved to provide a final acceptor concentration of 2-5 mM in a HEPES buffered solution (50 mM, pH 7.3) containing KCI (25 mM), MgCl₂(2 mM) and DTT (1 mM). Calf intestine alkaline phosphotase (CIAP, 1% total volume. 1 kU mL⁻¹) and B3GNT2 (1% wt/wt relative to acceptor substrate) were added, and the reaction mixture was incubated overnight at 37° C. with gentle shaking. Reaction progress was monitored by MALDI-TOF MS or ESI-TOF MS, and if starting material remained after 18 h another portion of B3GNT2 was added until no starting material could be detected. The reaction mixture was centrifuged over a Nanosep® Omega ultrafiltration device (10 kDa MWCO) to remove reaction proteins and the filtrate was lyophilized. Purification by HILIC HPLC (see section 2f) or P2 size-exclusion column chromatography provided the desired product.

General Procedure for the Installation of β1,2-GlcNTFA Using MGAT1

Glycosyl asparagine acceptor Man3-Asn-Cbz (5.0 mg, 4.3 μmol) and UDP-GlcNTFA (5.7 mg, 8.6 μmol) were dissolved at a final acceptor concentration of 10 mM in a MES buffered solution (100 mM, pH 6.5) containing MnCl₂(10 mM) and BSA (1% total volume, stock solution=10 mg mL⁻¹). Calf intestine alkaline phosphatase (CIAP, 1% total volume) and MGAT1 (40 μg/μmol acceptor) were added, and the reaction mixture was incubated overnight at 37° C. with gentle shaking. Reaction progress was monitored by ESI-TOF MS and if starting material remained after 18 h another portion of MGAT1 was added until no starting material could be detected. The reaction mixture was centrifuged over a Nanosep® Omega ultrafiltration device (10 kDa MWCO) to remove reaction proteins and the filtrate was lyophilized. Purification by HPLC using a HILIC column provided desired product as a white fluffy solid (4.9 mg, 81%).

General Procedure for the Installation of β1,2-GIcNTFA using MGAT2

Glycosyl asparagine acceptor Man3A1-Asn-Cbz (3.0 mg, 2.2 μmol) and UDP-GleNTFA (3 mg, 4.5 μmol) were dissolved at a final acceptor concentration of 5 mM in a MES buffered solution (100 mM, pH 7.5) containing BSA (1% total volume). CIAP (1% total volume) and MGAT2 (400 μg/μmol acceptor) were added, and the reaction mixture was incubated overnight at 37° C. with gentle shaking. Reaction progress was monitored by ESI-TOF MS, and if starting material remained after 18 h another portion of MGAT2 was added until no starting material could be detected. The reaction mixture was centrifuged over a Nanosep® Omega ultrafiltration device (10 kDa MWCO) to remove reaction proteins, and the filtrate was lyophilized. Purification by HPLC using a HILIC column provided the desired product Man3A2-Asn-Cbz as a white fluffy solid (2.7 mg, 76%).

General Procedure for the Installation of B1,6-GleNTFA Using MGAT5

Glycosyl asparagine acceptor 1 (17.6 mg, 10.2 μmol) and UDP-GlcNTFA (13.5 mg, 20.4 μmol) were dissolved at a final acceptor concentration of 10 mM in a sodium cacodylate buffered solution (100 mM, pH 6.5) containing MnCl₂(10 mM) and BSA (1% total volume, stock solution=10 mg mL⁻¹). Calf intestine alkaline phosphatase (CIAP, 1% total volume, stock solution=1kU mL⁻¹) and MGAT5 (40 μg/μmol acceptor) were added, and the reaction mixture was incubated overnight at 37° C. with gentle shaking. Reaction progress was monitored by MALDI-TOF MS and if starting material remained after 18 h another portion of MGAT5 was added until no starting material could be detected. The reaction mixture was centrifuged over a Nanosep® Omega ultrafiltration device (10 kDa MWCO) to remove reaction proteins and the filtrate was lyophilized. Purification by HPLC using a HILIC column (supporting information 2f) provided desired product 14 as a white fluffy solid (18.6 mg, 92%).

General Procedure for the Installation of β1,4-GleNTFA Using MGAT4B

Glycosyl asparagine acceptor 2 (4.0 mg, 2.1 μmol) and UDP-GIcNTFA (2.75 mg, 4.2 μmol) were dissolved at a final acceptor concentration of 5 mM in a Tris buffered solution (100 mM, pH 7.5) containing MnCl₂(5 mM) and BSA (1% total volume). CIAP (1% total volume) and MGAT4B (400 μg/μmol acceptor) were added, and the reaction mixture was incubated overnight at 37° C. with gentle shaking. Reaction progress was monitored by ESI-TOF MS, and if starting material remained after 18 h another portion of MGAT4B was added until no starting material could be detected. The reaction mixture was centrifuged over a Nanosep® Omega ultrafiltration device (10 kDa MWCO) to remove reaction proteins, and the filtrate was lyophilized. Purification by HPLC using a HILIC column (supporting information 2f) provided the desired product S6 as a white fluffy solid (3.8 mg, 85%).

General Procedure for the Installation of β1,4 Gal Using B4GALT1

Glycosyl asparagine acceptor (1 eq) and UDP-Gal (1.5 eq per Gal to be added) were dissolved to a provide an acceptor concentration of 2-5 mM in a Tris buffered solution (100 mM, pH 7.5) containing MnCl₂(10 mM) and BSA (1% total volume). CIAP (1% volume total) and B4GALT1 (1% wt/wt relative to acceptor substrate) were added, and the reaction mixture was incubated overnight at 37° C. with gentle shaking. Reaction progress was monitored by MALDI-TOF MS or ESI-TOF MS, and if starting material remained after 18 h another portion of B4GALTI was added until no starting material could be detected. The reaction mixture was centrifuged over a Nanosep® Omega ultrafiltration device (10 kDa MWCO) to remove reaction proteins and the filtrate was lyophilized. Purification by HILIC HPLC (see section 2f) or P2 size-exclusion column chromatography provided the desired product.

General Procedure for the Installation of α1,3 Fuc Using FUT5

Glycosyl asparagine acceptor (1 eq) and GDP-Fuc (1.5 eq per Fuc to be added) were dissolved at a final acceptor concentration of 2-5 mM in a Tris buffered solution (50 mM, pH 7.3) containing MnCl₂(10 mM). CIAP (1% total volume) and FUT5 (1% wt/wt) were added, and the reaction mixture was incubated overnight at 37° C. with gentle shaking. Reaction progress was monitored by MALDI-TOF MS or ESI-TOF MS, and if starting material remained after 18 h another portion of FUT5 was added until no starting material could be detected. The reaction mixture was centrifuged over a Nanosep® Omega ultrafiltration device (10 kDa MWCO) to remove reaction proteins and the filtrate was lyophilized. Purification by HILIC HPLC (sec section 2f) or P2 size-exclusion column chromatography provided the desired product.

General Procedure for the Installation of a2,3 Neu5Ac Using ST3GAL4

Glycosyl asparagine acceptor (1 eq) and CMP-Neu5Ac (1.5 eq) were dissolved at a final acceptor concentration of 2-5 mM in a sodium cacodylate buffered solution (50 mM, pH 7.2) containing BSA (1% total volume). CIAP (1% volume total) and ST3GAL4 (1% wt/wt relative to acceptor substrate) were added, and the reaction mixture was incubated overnight at 37° C. with gentle shaking. Reaction progress was monitored by ESI-TOF MS, and if starting material remained after 18 h another portion of ST3GAL4 was added until no starting material could be detected. The reaction mixture was centrifuged over a Nanosep® Omega ultrafiltration device (10 kDa MWCO) to remove reaction proteins, and the filtrate was lyophilized. Purification by HILIC HPLC (see section 2f) or P2 size-exclusion column chromatography provided the desired product.

General Procedure for the Selective Installation of Terminal α2,6 Neu5Ac Using ST6GALI

Glycosyl asparagine (1 cq) and CMP-Neu5Ac (1.1 eq) were dissolved at a final acceptor concentration of 2-5 mM in a sodium cacodylate buffered solution (100 mM, pH 6.5) containing BSA (1% volume total). CIAP (1% volume total) and ST6GAL1 (1% wt/wt relative to acceptor substrate) were added, and the reaction mixture was incubated overnight at 37° C. with gentle shaking. The reaction mixture was centrifuged over a Nanosep® Omega ultrafiltration device (10 kDa MWCO) to remove reaction proteins, and the filtrate was lyophilized. Purification by HILIC HPLC (see section 2f) or P2 size-exclusion column chromatography provided the desired product.

General Procedure for the Selective Cleavage of Galactose Using E. coli β-Galactosidase¹⁰

Glycosyl asparagine was dissolved at a concentration of 5 mM in a Tris buffered solution (50 mM, pH 7.3) containing 5 mM MgCl₂. To this solution was added 50 U/μmol glycosyl asparagine of E. coli β-galactosidase (Sigma-Aldrich #, G5635) and the mixture was incubated overnight at 37° C. The reaction mixture was centrifuged using a Nanosep® Omega ultrafiltration device (10 kDa MWCO) to remove the enzyme and the filtrate was lyophilized Purification by HILIC HPLC (see section 2f) or P2 size-exclusion column chromatography provided the desired product.

General Procedure for Removal of TFA Protecting Group of an N-Glycan

The GlcNTFA moiety of S6 was converted to GleNH₂by dissolving the substrate (3.8 mg, 1.8 μM) in H₂O to a final concentration of 10 mM. The pH of the solution was adjusted to 10 using μL aliquots 1 M NaOH. The reaction mixture was incubated overnight at 37° C. with gentle shaking. Progress of the reaction was monitored by MALDI-TOF MS and once complete the solvent was removed by lyophilization. The reaction was neutralized by uL aliquots of 1 M acetic acid and purified by P2 size-exclusion chromatography eluting with 50 mM ammonium bicarbonate to yield the desired target 2 as a white fluffy solid (3.5 mg, 92%).

General Procedure for the Conversion of GleNH₂to GleN₃

Substrate 15 (9.3 mg, 5 μmol, 1 eq) was dissolved in water (1.6 mL) and to this solution was added imidazole-1-sulfonyl azide hydrogen sulfate (13.4 mg, 50 μmol), K₂CO₃(6.8 mg, 50 μmol) and catalytic CuSO₄.5H₂O. The reaction mixture was incubated overnight at 37° C. with gentle shaking. Reaction progress was monitored by MALDI-TOF MS and if starting material remained, an additional ½ portion of the imidazole-1-sulfonyl azide hydrogen sulfate, K₂CO₃, and CuSO₄was added until no starting material could be observed. The reaction solvent was removed by lyophilization and the salts were removed by P2 size-exclusion chromatography eluting with 50 mM ammonium bicarbonate to yield 23 as a white fluffy solid (7.2 mg. 76%).

General procedure for reduction of GleN₃

Intermediate 27 (2.3 mg, 0.66 μmol, 1 eq) was dissolved in a solution of 9:1 pyridine/triethylamine to give a final concentration of 5 mM. The mixture was vortexed until all solids dissolved and 10 eq. 1,3-dithiolpropane (0.7 mg, 6.6 μmol, 10 eq) were added in one portion. The reaction mixture was kept at 37° C. was until no azide could be detected by ESI-TOF-MS. Reaction was carried forward to acetylate the amine without further purification.

General Procedure for Amine Acetylation

18 (1.3 mg, 0.5 μmol, 1 eq) was dissolved in water to a final concentration of 2 mM. The pH was adjusted to 8 using μL aliquots of 1M NaOH. To this solution was added solid AcOSu (0.7 mg, 5 μmol, 10 eq) in one portion. The reaction mixture was vortexed vigorously until all solids were dissolved. The reaction was kept at 37° C. until full acetylation was observed by ESI-TOF-MS. In the event starting amine was detected, additional AcOSu (5 eq) was added until complete conversion was observed. The reaction was lyophilized and purified by HPLC using a HILIC column (supporting information 2f) to afford 19 as a white fluffy solid (0.9 mg, 67%).

2f. General Protocols for HILIC-HPLC Purification
HILIC-HPLC Purification Conditions for Glycosyl Asparagine Targets

Semi-preparative HILIC-HPLC was performed on a Shimadzu LC-ESI-IT-TOF with a Waters XBridge BEH, Amide column, 5 μm, 10×250 mm at a flow rate of 2.3 mL/min, injection volume of 100 μL (10-20 mg/mL), with 1% of the flow is diverted to the ESI-MS detector using a splitter. Mobile phase A was 10 mM ammonium formate in water, adjusted to pH 4.5 with formic acid; mobile phase B was 90% aceteonitrile with 10% 10 mM ammonium formate in water (pH=4.5). The general condition using a linear gradient is as follows:

Time (min)
A (%)
B (%)

0
20
80

40
55
45

45
80
20

55
20
80

60
20
80

1 was purified using a linear gradient with the following conditions:

Time (min)
A (%)
B (%)

0
20
80

60
40
60

70
50
50

71
80
20

80
80
20

85
20
80

90
20
80

The compositions and methods of the appended claims are not limited in scope by the specific compositions and methods described herein, which are intended as illustrations of a few aspects of the claims and any compositions and methods that are functionally equivalent are intended to fall within the scope of the claims. Various modifications of the compositions and methods in addition to those shown and described herein are intended to fall within the scope of the appended claims. Further, while only certain representative compositions and method steps disclosed herein are specifically described, other combinations of the compositions and method steps also are intended to fall within the scope of the appended claims, even if not specifically recited. Thus, a combination of steps, elements, components, or constituents may be explicitly mentioned herein or less, however, other combinations of steps, elements, components, and constituents are included, even though not explicitly stated. The term “comprising” and variations thereof as used herein is used synonymously with the term “including” and variations thereof and are open, non-limiting terms. Although the terms “comprising” and “including” have been used herein to describe various embodiments, the terms “consisting essentially of” and “consisting of” can be used in place of “comprising” and “including” to provide for more specific embodiments of the invention and are also disclosed. Other than in the examples, or where otherwise noted, all numbers expressing quantities of ingredients, reaction conditions, and so forth used in the specification and claims are to be understood at the very least, and not as an attempt to limit the application of the doctrine of equivalents to the scope of the claims, to be construed in light of the number of significant digits and ordinary rounding approaches.

	Number	Date	Country
Parent	17116706	Dec 2020	US
Child	18240621		US

OLIGOSACCHARIDE ANALYTICAL STANDARDS

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

International Classifications

Abstract

Description

Claims

CROSS-REFERENCE TO RELATED APPLICATION

STATEMENT OF GOVERNMENT SUPPORT

Provisional Applications (1)

Continuations (1)